Description

The present invention relates to the field of molecular biology. In particular, it relates to, among other things,
nucleotide sequences of Staphylococcus aureus, contigs, ORFs, fragments, probes, primers and related polynucleotides
thereof, peptides and polypeptides encoded by the sequences, and uses of the polynucleotides and sequences thereof,
such as in fermentation, polypeptide production, assays and pharmaceutical development, among others.

The genus Staphylococcus includes at least 20 distinct species. (For a review see Novick, R. P., The Staphylococcus
as a Molecular Genetic System, Chapter 1, pgs. 1-37 in MOLECULAR BIOLOGY OF THE STAPHYLOCOCCI,
R. Novick, Ed., VCH Publishers, New York (1990)). Species differ from one another by 80% or more, by hybridization
kinetics, whereas strains within a species are at least 90% identical by the same measure.

The species Staphylococcus aureus, a gram-positive, facultatively aerobic, clump-forming coccus, is among the
most important etiological agents of bacterial infection in humans, as discussed briefly below.

Human Health and S. aureus

Staphylococcus aureus is a ubiquitous pathogen. (See, for instance, Mims et al., MEDICAL MICROBIOLOGY,
Mosby-Year Book Europe Limited, London, UK (1993)). It is an etiological agent of a variety of conditions, ranging in
severity from mild to fatal. A few of the more common conditions caused by S. aureus infection are burns, cellulitis,
eyelid infections, food poisoning, joint infections, neonatal conjunctivitis, osteomyelitis, skin infections, surgical wound
infection, scalded skin syndrome and toxic shock syndrome, some of which are described further below.

Burns 

Burn wounds generally are sterile initially. However, they generally compromise physical and immune barriers to
infection, cause loss of fluid and electrolytes and result in local or general physiological dysfunction. After cooling,
contact with viable bacteria results in mixed colonization at the injury site. Infection may be restricted to the non-viable
debris on the burn surface ("eschar"), it may progress into full skin infection and invade viable tissue below the eschar,
or it may reach below the skin, enter the lymphatic and blood circulation and develop into septicaemia. S. aureus is
among the most important pathogens typically found in burn wound infections. It can destroy granulation tissue and
produce severe septicaemia.

Cellulitis 

Cellulitis, an acute infection of the skin that expands from a typically superficial origin to spread below the cutaneous
layer, most commonly is caused by S. aureus in conjunction with S. pyogenes. Cellulitis can lead to systemic infection.
In fact, cellulitis can be one aspect of synergistic bacterial gangrene. This condition typically is caused by a mixture of
S. aureus and microaerophilic streptococci. It causes necrosis and treatment is limited to excision of the necrotic tissue.
The condition often is fatal.

Eyelid infections

S. aureus is the cause of styes and of "sticky eye" in neonates, among other eye infections. Typically such infections
are limited to the surface of the eye, but may occasionally penetrate the surface with more severe consequences.

Food poisoning

Some strains of S. aureus produce one or more of five serologically distinct, heat- and acid-stable enterotoxins that
are not destroyed by the digestive processes of the stomach and small intestine (enterotoxins A-E). Ingestion of the toxin,
in sufficient quantities, typically results in severe vomiting, but not diarrhoea. The effect does not require viable bacteria.
Although the toxins are known, their mechanism of action is not understood.

Joint infections 

S. aureus infects bone joints, causing diseases such as osteomyelitis.

Osteomyelitis 

S. aureus is the most common causative agent of haematogenous osteomyelitis. The disease tends to occur in
children and adolescents more than adults and it is associated with non-penetrating injuries to bones. Infection typically
occurs at the ends of long, growing bones, hence its occurrence in physically immature populations. Most often, infection
is localized in the vicinity of sprouting capillary loops adjacent to epiphysial growth plates in the end of long, growing
bones.

Skin infections 

S. aureus is the most common pathogen of such minor skin infections as abscesses and boils. Such infections
often are resolved by normal host response mechanisms, but they also can develop into severe internal infections.
Recurrent infections of the nasal passages plague nasal carriers of S. aureus.

Surgical Wound Infections 

Surgical wounds often penetrate far into the body. Infection of such wounds thus poses a grave risk to the patient.
S. aureus is the most important causative agent of infections in surgical wounds. S. aureus is unusually adept at
invading surgical wounds; sutured wounds can be infected by far fewer S. aureus cells than are necessary to cause
infection in normal skin. Invasion of surgical wounds can lead to severe S. aureus septicaemia. Invasion of the blood
stream by S. aureus can lead to seeding and infection of internal organs, particularly heart valves and bone, causing
systemic diseases, such as endocarditis and osteomyelitis.

Scalded Skin Syndrome 

S. aureus is responsible for "scalded skin syndrome" (also called toxic epidermal necrosis, Ritter's disease and
Lyell's disease). This disease occurs in older children, typically in outbreaks caused by flowering of S. aureus strains
that produce exfoliatin (also called scalded skin syndrome toxin). Although the bacteria initially may infect only a minor
lesion, the toxin destroys intercellular connections, spreads epidermal layers and allows the infection to penetrate the
outer layer of the skin, producing the desquamation that typifies the disease. Shedding of the outer layer of skin
generally reveals normal skin below, but fluid lost in the process can produce severe injury in young children if it is not
treated properly.

Toxic Shock Syndrome 

Toxic shock syndrome is caused by strains of S. aureus that produce the so-called toxic shock syndrome toxin.
The disease can be caused by S. aureus infection at any site, but it is too often erroneously viewed as a
disease solely of women who use tampons. The disease involves toxaemia and septicaemia, and can be fatal.

Nosocomial Infections

In the 1984 National Nosocomial Infection Surveillance Study ("NNIS") S. aureus was the most prevalent agent
of surgical wound infections in many hospital services, including medicine, surgery, obstetrics, pediatrics and newborns.

Resistance to drugs of S. aureus strains

Prior to the introduction of penicillin the prognosis for patients seriously infected with S. aureus was unfavorable.
Following the introduction of penicillin in the early 1940s even the worst S. aureus infections generally could be treated
successfully. The emergence of penicillin-resistant strains of S. aureus did not take long, however. Most strains of S.
aureus encountered in hospital infections today do not respond to penicillin; although, fortunately, this is not the case
for S. aureus encountered in community infections.

It is well known now that penicillin-resistant strains of S. aureus produce a lactamase which converts penicillin to
penicilloic acid, and thereby destroys antibiotic activity. Furthermore, the lactamase gene often is propagated
episomally, typically on a plasmid, and often is only one of several genes on an episomal element that, together, confer
multidrug resistance.

Methicillins, introduced in the 1960s, largely overcame the problem of penicillin resistance in S. aureus. These
compounds conserve the portions of penicillin responsible for antibiotic activity and modify or alter other portions that
make penicillin a good substrate for inactivating lactamases. However, methicillin resistance has emerged in S. aureus,
along with resistance to many other antibiotics effective against this organism, including aminoglycosides, tetracycline,
chloramphenicol, macrolides and lincosamides. In fact, methicillin-resistant strains of S. aureus generally are multiply
drug resistant.

The molecular genetics of most types of drug resistance in S. aureus has been elucidated (See Lyon et al.,
Microbiology Reviews 51: 88-134 (1987)). Generally, resistance is mediated by plasmids, as noted above regarding penicillin
resistance; however, several stable forms of drug resistance have been observed that apparently involve integration
of a resistance element into the S. aureus genome itself.

Thus far each new antibiotic gives rise to resistant strains, strains emerge that are resistant to multiple drugs
and increasingly persistent forms of resistance begin to emerge. Drug resistance of S. aureus infections already poses
significant treatment difficulties, which are likely to get much worse unless new therapeutic agents are developed.

Molecular Genetics of Staphylococcus aureus

Despite its importance in, among other things, human disease, relatively little is known about the genome of this 
organism. 

Most genetic studies of S. aureus have been carried out using the strain NCTC8325, which contains prophages
φ11, φ12 and φ13, and the UV-cured derivative of this strain, 8325-4 (also referred to as RN450), which is free of
the prophages.

These studies revealed that the S. aureus genome, like that of other staphylococci, consists of one circular,
covalently closed, double-stranded DNA and a collection of so-called variable accessory genetic elements, such as
prophages, plasmids, transposons and the like.

Physical characterization of the genome has not been carried out in any detail. Pattee et al. published a low
resolution and incomplete genetic and physical map of the chromosome of S. aureus strain NCTC 8325. (Pattee et al.,
Genetic and Physical Mapping of Chromosome of Staphylococcus aureus NCTC 8325, Chapter 11, pgs. 163-169 in
MOLECULAR BIOLOGY OF THE STAPHYLOCOCCI, R.P. Novick, Ed., VCH Publishers, New York (1990)). The genetic
map largely was produced by mapping insertions of Tn551 and Tn4001, which, respectively, confer erythromycin and
gentamicin resistance, and by analysis of SmaI-digested DNA by Pulsed Field Gel Electrophoresis ("PFGE").

The map was of low resolution; even estimating the physical size of the genome was difficult, according to the
investigators. The size of the largest SmaI chromosome fragment, for instance, was too large for accurate sizing by
PFGE. To estimate its size, additional restriction sites had to be introduced into the chromosome using a transposon
containing a SmaI recognition sequence.

In sum, most physical characteristics and almost all of the genes of Staphylococcus aureus are unknown. Among
the few genes that have been identified, most have not been physically mapped or characterized in detail. Only a very
few genes of this organism have been sequenced. (See, for instance, Thornsberry, J. Antimicrobial Chemotherapy 21
Suppl C: 9-16 (1988), current versions of GENBANK and other nucleic acid databases, and references that relate to
the genome of S. aureus such as those set out elsewhere herein.)

It is clear that the etiology of diseases mediated or exacerbated by S. aureus infection involves the programmed
expression of S. aureus genes, and that characterizing the genes and their patterns of expression would add
dramatically to our understanding of the organism and its host interactions. Knowledge of S. aureus genes and genomic
organization would dramatically improve understanding of disease etiology and lead to improved and new ways of
preventing, ameliorating, arresting and reversing diseases. Moreover, characterized genes and genomic fragments of
S. aureus would provide reagents for, among other things, detecting, characterizing and controlling S. aureus infections.

There is a need therefore to characterize the genome of S. aureus and for polynucleotides and sequences of this
organism.

The present invention is based on the sequencing of fragments of the Staphylococcus aureus genome. The primary 
nucleotide sequences which were generated are provided in SEQ ID NOS: 1-5,191. 

The present invention provides the nucleotide sequence of several thousand contigs of the Staphylococcus aureus
genome, which are listed in tables below and set out in the Sequence Listing submitted herewith, and representative
fragments thereof, in a form which can be readily used, analyzed, and interpreted by a skilled artisan. In one
embodiment, the present invention is provided as contiguous strings of primary sequence information corresponding to the
nucleotide sequences depicted in SEQ ID NOS:1-5,191.

The present invention further provides nucleotide sequences which are at least 95%, preferably 99% and most
preferably 99.9%, identical to the nucleotide sequences of SEQ ID NOS:1-5,191.

The nucleotide sequence of SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide sequence
which is at least 95%, preferably 99% and most preferably 99.9%, identical to the nucleotide sequence of SEQ ID
NOS:1-5,191 may be provided in a variety of mediums to facilitate its use. In one application of this embodiment, the
sequences of the present invention are recorded on computer readable media. Such media include, but are not limited
to: magnetic storage media, such as floppy discs, hard disc storage medium, and magnetic tape; optical storage media
such as CD-ROM; electrical storage media such as RAM and ROM; and hybrids of these categories such as magnetic/
optical storage media.

The present invention further provides systems, particularly computer-based systems, which contain the sequence
information herein described stored in a data storage means. Such systems are designed to identify commercially
important fragments of the Staphylococcus aureus genome.

Another embodiment of the present invention is directed to fragments, preferably isolated fragments, of the
Staphylococcus aureus genome having particular structural or functional attributes. Such fragments of the Staphylococcus
aureus genome of the present invention include, but are not limited to, fragments which encode peptides, hereinafter
referred to as open reading frames or "ORFs," fragments which modulate the expression of an operably linked ORF,
hereinafter referred to as expression modulating fragments or "EMFs," and fragments which can be used to diagnose
the presence of Staphylococcus aureus in a sample, hereinafter referred to as diagnostic fragments or "DFs."

Each of the ORFs in fragments of the Staphylococcus aureus genome disclosed in Tables 1-3, and the EMFs
found 5' to the ORFs, can be used in numerous ways as polynucleotide reagents. For instance, the sequences can be
used as diagnostic probes or amplification primers for detecting or determining the presence of a specific microbe in
a sample, to selectively control gene expression in a host and in the production of polypeptides, such as polypeptides
encoded by ORFs of the present invention, particularly those polypeptides that have a pharmacological activity.

The present invention further includes recombinant constructs comprising one or more fragments of the
Staphylococcus aureus genome of the present invention. The recombinant constructs of the present invention comprise
vectors, such as a plasmid or viral vector, into which a fragment of the Staphylococcus aureus genome has been inserted.

The present invention further provides host cells containing any of the isolated fragments of the Staphylococcus
aureus genome of the present invention. The host cells can be a higher eukaryotic host cell, such as a mammalian
cell, a lower eukaryotic cell, such as a yeast cell, or a procaryotic cell such as a bacterial cell.

The present invention is further directed to polypeptides and proteins, preferably isolated polypeptides and
proteins, encoded by ORFs of the present invention. A variety of methods, well known to those of skill in the art, routinely
may be utilized to obtain any of the polypeptides and proteins of the present invention. For instance, polypeptides and
proteins of the present invention having relatively short, simple amino acid sequences readily can be synthesized using
commercially available automated peptide synthesizers. Polypeptides and proteins of the present invention also may
be purified from bacterial cells which naturally produce the protein. Yet another alternative is to purify polypeptides and
proteins of the present invention from cells which have been altered to express them.

The invention further provides polypeptides, preferably isolated polypeptides, comprising Staphylococcus aureus
epitopes and vaccine compositions comprising such polypeptides. Also provided are methods for vaccinating an
individual against Staphylococcus aureus infection.

The invention further provides methods of obtaining homologs of the fragments of the Staphylococcus aureus
genome of the present invention and homologs of the proteins encoded by the ORFs of the present invention.
Specifically, by using the nucleotide and amino acid sequences disclosed herein as a probe or as primers, and techniques
such as PCR cloning and colony/plaque hybridization, one skilled in the art can obtain homologs.
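
As a rough illustration of using a disclosed sequence as a source of primers, the following Python sketch derives a pair of flanking primers from a target region: the forward primer is the first bases of the region and the reverse primer is the reverse complement of its last bases. The function names, the default primer length and the toy target sequence are assumptions made for illustration only; practical primer design would also weigh melting temperature, secondary structure and specificity, none of which is modeled here.

```python
# Hypothetical helper names; the toy sequence below is not taken from the Sequence Listing.
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an unambiguous DNA sequence."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def flanking_primers(target, length=20):
    """Return (forward, reverse) primers taken from the two ends of `target`.

    The forward primer copies the 5' end of the top strand; the reverse
    primer is the reverse complement of the 3' end, so that both primers
    read 5' to 3' and point toward each other.
    """
    target = target.upper()
    if len(target) < 2 * length:
        raise ValueError("target is too short for two non-overlapping primers")
    forward = target[:length]
    reverse = reverse_complement(target[-length:])
    return forward, reverse


if __name__ == "__main__":
    fwd, rev = flanking_primers("ATGCCCGGGTTTAAACCCGGGTAA", length=6)
    print(fwd, rev)
```

Taking the primers from the extreme ends keeps the sketch simple; a homolog search would more likely place primers in conserved internal regions.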

The invention further provides antibodies which selectively bind polypeptides and proteins of the present invention.
Such antibodies include both monoclonal and polyclonal antibodies.

The invention further provides hybridomas which produce the above-described antibodies. A hybridoma is an
immortalized cell line which is capable of secreting a specific monoclonal antibody.

The present invention further provides methods of identifying test samples derived from cells which express one
of the ORFs of the present invention, or a homolog thereof. Such methods comprise incubating a test sample with one
or more of the antibodies of the present invention, or one or more of the DFs or antigens of the present invention, under
conditions which allow a skilled artisan to determine if the sample contains the ORF or product produced therefrom.

In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry 
out the above-described assays. 

Specifically, the invention provides a compartmentalized kit to receive, in close confinement, one or more containers
which comprises: (a) a first container comprising one of the antibodies, antigens, or one of the DFs of the present
invention; and (b) one or more other containers comprising one or more of the following: wash reagents, reagents
capable of detecting the presence of bound antibodies, antigens or hybridized DFs.

Using the isolated proteins of the present invention, the present invention further provides methods of obtaining
and identifying agents capable of binding to a polypeptide or protein encoded by one of the ORFs of the present
invention. Specifically, such agents include, as further described below, antibodies, peptides, carbohydrates,
pharmaceutical agents and the like. Such methods comprise steps of: (a) contacting an agent with an isolated protein encoded
by one of the ORFs of the present invention; and (b) determining whether the agent binds to said protein.

The present genomic sequences of Staphylococcus aureus will be of great value to all laboratories working with
this organism and for a variety of commercial purposes. Many fragments of the Staphylococcus aureus genome will
be immediately identified by similarity searches against GenBank or protein databases and will be of immediate value
to Staphylococcus aureus researchers and of immediate commercial value for the production of proteins or to control
gene expression.

The methodology and technology for elucidating extensive genomic sequences of bacterial and other genomes
has enhanced and will greatly enhance the ability to analyze and understand chromosomal organization. In particular, sequenced
contigs and genomes will provide the models for developing tools for the analysis of chromosome structure and function,
including the ability to identify genes within large segments of genomic DNA, the structure, position, and spacing of
regulatory elements, the identification of genes with potential industrial applications, and the ability to do comparative
genomics and molecular phylogeny.

FIGURE 1 is a block diagram of a computer system (102) that can be used to implement computer-based systems
of the present invention.

FIGURE 2 is a schematic diagram depicting the data flow and computer programs used to collect, assemble, edit
and annotate the contigs of the Staphylococcus aureus genome of the present invention. Both Macintosh and Unix
platforms are used to handle the AB 373 and 377 sequence data files, largely as described in Kerlavage et al.,
Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System Sciences, 585, IEEE Computer
Society Press, Washington D.C. (1993). Factura (AB) is a Macintosh program designed for automatic vector sequence
removal and end-trimming of sequence files. The program Loadis runs on a Macintosh platform and parses the feature
data extracted from the sequence files by Factura to the Unix based Staphylococcus aureus relational database.
Assembly of contigs (and whole genome sequences) is accomplished by retrieving a specific set of sequence files and
their associated features using extrseq, a Unix utility for retrieving sequences from an SQL database. The resulting
sequence file is processed by seq_filter to trim portions of the sequences with more than 2% ambiguous nucleotides.
The sequence files were assembled using TIGR Assembler, an assembly engine designed at The Institute for Genomic
Research ("TIGR") for rapid and accurate assembly of thousands of sequence fragments. The collection of contigs
generated by the assembly step is loaded into the database with the lassie program. Identification of open reading
frames (ORFs) is accomplished by processing contigs with zorf. The ORFs are searched against S. aureus sequences
from Genbank and against all protein sequences using the BLASTN and BLASTP programs, described in Altschul et
al., J. Mol. Biol. 215: 403-410 (1990). Results of the ORF determination and similarity searching steps were loaded
into the database. As described below, some results of the determination and the searches are set out in Tables 1-3.
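
Two of the steps in this data flow lend themselves to a small sketch in Python: rejecting sequence with more than 2% ambiguous nucleotides, and scanning a contig for open reading frames. The code below is a minimal sketch under stated assumptions and is not the actual trimming or ORF-finding program named above, whose algorithms are not described here. In particular, it applies the 2% test to a whole read rather than trimming portions of it, it treats ATG as the only start codon, and all function names and the toy contig are hypothetical.

```python
# Assumed simplifications: whole-read ambiguity test, ATG-only starts,
# standard stop codons. Function names are hypothetical.
AMBIGUITY_CODES = set("RYSWKMBDHVN")  # IUPAC codes other than A, C, G, T
STOP_CODONS = {"TAA", "TAG", "TGA"}
_COMPLEMENT = str.maketrans("ACGT", "TGCA")


def ambiguous_fraction(read):
    """Fraction of positions in `read` that are IUPAC ambiguity codes."""
    read = read.upper()
    if not read:
        return 0.0
    return sum(base in AMBIGUITY_CODES for base in read) / len(read)


def passes_ambiguity_filter(read, max_fraction=0.02):
    """True if no more than `max_fraction` of the read is ambiguous."""
    return ambiguous_fraction(read) <= max_fraction


def reverse_complement(seq):
    """Reverse complement; ambiguity codes are left unchanged."""
    return seq.upper().translate(_COMPLEMENT)[::-1]


def find_orfs(contig, min_codons=3):
    """Scan all six reading frames for ATG...stop stretches.

    Returns (strand, frame, start, end) tuples. Coordinates are 0-based,
    end-exclusive, include the stop codon, and are relative to the strand
    that was scanned. `min_codons` counts codons before the stop.
    """
    orfs = []
    for strand, seq in (("+", contig.upper()), ("-", reverse_complement(contig))):
        for frame in range(3):
            start = None
            for i in range(frame, len(seq) - 2, 3):
                codon = seq[i:i + 3]
                if start is None and codon == "ATG":
                    start = i
                elif start is not None and codon in STOP_CODONS:
                    if (i - start) // 3 >= min_codons:
                        orfs.append((strand, frame, start, i + 3))
                    start = None
    return orfs


if __name__ == "__main__":
    toy_contig = "CCATGAAATTTTAGCC"
    if passes_ambiguity_filter(toy_contig):
        print(find_orfs(toy_contig))
```

In a fuller pipeline the ORFs found this way would then be passed to the similarity searches; that step is omitted here because it depends on external programs and databases.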

The present invention is based on the sequencing of fragments of the Staphylococcus aureus genome and analysis
of the sequences. The primary nucleotide sequences generated by sequencing the fragments are provided in SEQ ID
NOS:1-5,191. (As used herein, the "primary sequence" refers to the nucleotide sequence represented by the IUPAC
nomenclature system.)
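
For readers unfamiliar with the IUPAC nomenclature, the following Python sketch lists the standard single-letter DNA codes and checks that a string uses only those codes. The table reflects the standard IUPAC nucleotide alphabet; the function names are hypothetical, and the sketch assumes DNA (no U) and no gap characters.

```python
# Standard IUPAC single-letter nucleotide codes for DNA and the bases each
# one may stand for. Function names are illustrative, not from the source.
IUPAC_DNA = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "AG", "Y": "CT", "S": "CG", "W": "AT", "K": "GT", "M": "AC",
    "B": "CGT", "D": "AGT", "H": "ACT", "V": "ACG",
    "N": "ACGT",
}


def is_valid_primary_sequence(seq):
    """True if `seq` is non-empty and every symbol is an IUPAC DNA code."""
    return bool(seq) and all(symbol in IUPAC_DNA for symbol in seq.upper())


def possible_bases(symbol):
    """Return the set of bases that an IUPAC symbol can represent."""
    return set(IUPAC_DNA[symbol.upper()])


if __name__ == "__main__":
    print(is_valid_primary_sequence("ACGTNRY"))
    print(sorted(possible_bases("R")))
```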

In addition to the aforementioned Staphylococcus aureus polynucleotide and polynucleotide sequences, the
present invention provides the nucleotide sequences of SEQ ID NOS:1-5,191, or representative fragments thereof, in
a form which can be readily used, analyzed, and interpreted by a skilled artisan.

As used herein, a "representative fragment of the nucleotide sequence depicted in SEQ ID NOS:1-5,191" refers
to any portion of the SEQ ID NOS:1-5,191 which is not presently represented within a publicly available database.
Preferred representative fragments of the present invention are Staphylococcus aureus open reading frames ("ORFs"),
expression modulating fragments ("EMFs") and fragments which can be used to diagnose the presence of
Staphylococcus aureus in a sample ("DFs"). A non-limiting identification of preferred representative fragments is provided in Tables
1-3.

As discussed in detail below, the information provided in SEQ ID NOS:1-5,191 and in Tables 1-3 together with
routine cloning, synthesis, sequencing and assay methods will enable those skilled in the art to clone and sequence
all "representative fragments" of interest, including open reading frames encoding a large variety of Staphylococcus
aureus proteins.

While the presently disclosed sequences of SEQ ID NOS:1-5,191 are highly accurate, sequencing techniques are
not perfect and, in relatively rare instances, further investigation of a fragment or sequence of the invention may reveal
a nucleotide sequence error present in a nucleotide sequence disclosed in SEQ ID NOS:1-5,191. However, once the
present invention is made available (i.e., once the information in SEQ ID NOS:1-5,191 and Tables 1-3 has been made
available), resolving a rare sequencing error in SEQ ID NOS:1-5,191 will be well within the skill of the art. The present
disclosure makes available sufficient sequence information to allow any of the described contigs or portions thereof to
be obtained readily by straightforward application of routine techniques. Further sequencing of such polynucleotides
may proceed in like manner using manual and automated sequencing methods which are employed ubiquitously in the
art. Nucleotide sequence editing software is publicly available. For example, Applied Biosystems' (AB) AutoAssembler
can be used as an aid during visual inspection of nucleotide sequences. By employing such routine techniques potential
errors readily may be identified and the correct sequence then may be ascertained by targeting further sequencing
effort, also of a routine nature, to the region containing the potential error.

Even if all of the very rare sequencing errors in SEQ ID NOS:1-5,191 were corrected, the resulting nucleotide
sequences would still be at least 95% identical, nearly all would be at least 99% identical, and the great majority would
be at least 99.9% identical to the nucleotide sequences of SEQ ID NOS:1-5,191.

As discussed elsewhere herein, polynucleotides of the present invention readily may be obtained by routine
application of well known and standard procedures for cloning and sequencing DNA. Detailed methods for obtaining
libraries and for sequencing are provided below, for instance. A wide variety of Staphylococcus aureus strains that can
be used to prepare S. aureus genomic DNA for cloning and for obtaining polynucleotides of the present invention are
available to the public from recognized depository institutions, such as the American Type Culture Collection (ATCC®).

The nucleotide sequences of the genomes from different strains of Staphylococcus aureus differ somewhat.
However, the nucleotide sequences of the genomes of all Staphylococcus aureus strains will be at least 95% identical, in
corresponding part, to the nucleotide sequences provided in SEQ ID NOS:1-5,191. Nearly all will be at least 99%
identical and the great majority will be 99.9% identical.

Thus, the present invention further provides nucleotide sequences which are at least 95%, preferably 99% and
most preferably 99.9% identical to the nucleotide sequences of SEQ ID NOS:1-5,191, in a form which can be readily
used, analyzed and interpreted by the skilled artisan.

Methods for determining whether a nucleotide sequence is at least 95%, at least 99% or at least 99.9% identical
to the nucleotide sequences of SEQ ID NOS:1-5,191 are routine and readily available to the skilled artisan. For example,
the well known FASTA algorithm described in Pearson and Lipman, Proc. Natl. Acad. Sci. USA 85: 2444 (1988) can be
used to generate the percent identity of nucleotide sequences. The BLASTN program also can be used to generate
an identity score of polynucleotides compared to one another.
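
The arithmetic behind a percent identity figure can be sketched in a few lines of Python. The sketch below assumes the two sequences have already been aligned (for instance by one of the programs named above) and are supplied as equal-length strings with "-" marking gaps; it does not perform the alignment itself and is not the FASTA or BLASTN scoring method. Counting every aligned column, including columns with a gap in one sequence, in the denominator is one common convention among several, and the function names are hypothetical.

```python
# Assumes pre-aligned, equal-length inputs with "-" as the gap symbol.
def percent_identity(aligned_a, aligned_b):
    """Percent of aligned columns in which both sequences carry the same base."""
    if len(aligned_a) != len(aligned_b):
        raise ValueError("aligned sequences must have the same length")
    matches = 0
    columns = 0
    for a, b in zip(aligned_a.upper(), aligned_b.upper()):
        if a == "-" and b == "-":
            continue  # a column that is a gap in both sequences carries no information
        columns += 1
        if a == b:
            matches += 1
    if columns == 0:
        raise ValueError("no aligned columns to compare")
    return 100.0 * matches / columns


def meets_threshold(aligned_a, aligned_b, threshold=95.0):
    """True if the pair is at least `threshold` percent identical."""
    return percent_identity(aligned_a, aligned_b) >= threshold


if __name__ == "__main__":
    reference = "ACGTACGTACGTACGTACGT"
    candidate = "ACGTACGTACGTACGTACGA"  # one mismatch in twenty columns
    print(percent_identity(reference, candidate))
    print(meets_threshold(reference, candidate, 95.0))
```

The same function can be called with thresholds of 99.0 or 99.9 to test the narrower ranges recited above.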

COMPUTER RELATED EMBODIMENTS 

The nucleotide sequences provided in SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide
sequence at least 95%, preferably at least 99% and most preferably at least 99.9% identical to a polynucleotide
sequence of SEQ ID NOS:1-5,191 may be "provided" in a variety of mediums to facilitate use thereof. As used herein,
"provided" refers to a manufacture, other than an isolated nucleic acid molecule, which contains a nucleotide sequence
of the present invention; i.e., a nucleotide sequence provided in SEQ ID NOS:1-5,191, a representative fragment
thereof, or a nucleotide sequence at least 95%, preferably at least 99% and most preferably at least 99.9% identical
to a polynucleotide of SEQ ID NOS:1-5,191. Such a manufacture provides a large portion of the Staphylococcus aureus
genome and parts thereof (e.g., a Staphylococcus aureus open reading frame (ORF)) in a form which allows a skilled
artisan to examine the manufacture using means not directly applicable to examining the Staphylococcus aureus
genome or a subset thereof as it exists in nature or in purified form.

In one application of this embodiment, a nucleotide sequence of the present invention can be recorded on computer
readable media. As used herein, "computer readable media" refers to any medium which can be read and accessed
directly by a computer. Such media include, but are not limited to: magnetic storage media, such as floppy discs, hard
disc storage medium, and magnetic tape; optical storage media such as CD-ROM; electrical storage media such as
RAM and ROM; and hybrids of these categories, such as magnetic/optical storage media. A skilled artisan can readily
appreciate how any of the presently known computer readable mediums can be used to create a manufacture
comprising computer readable medium having recorded thereon a nucleotide sequence of the present invention. Likewise,
it will be clear to those of skill how additional computer readable media that may be developed also can be used to
create analogous manufactures having recorded thereon a nucleotide sequence of the present invention.

As used herein, "recorded" refers to a process for storing information on computer readable medium. A skilled
artisan can readily adopt any of the presently known methods for recording information on computer readable medium
to generate manufactures comprising the nucleotide sequence information of the present invention.

A variety of data storage structures are available to a skilled artisan for creating a computer readable medium
having recorded thereon a nucleotide sequence of the present invention. The choice of the data storage structure will
generally be based on the means chosen to access the stored information. In addition, a variety of data processor
programs and formats can be used to store the nucleotide sequence information of the present invention on computer
readable medium. The sequence information can be represented in a word processing text file, formatted in
commercially-available software such as WordPerfect and Microsoft Word, or represented in the form of an ASCII file, stored
in a database application, such as DB2, Sybase, Oracle, or the like. A skilled artisan can readily adapt any number of
data-processor structuring formats (e.g., text file or database) in order to obtain computer readable medium having
recorded thereon the nucleotide sequence information of the present invention.
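
The two storage styles mentioned above, a plain ASCII text file and a database application, can be illustrated with the Python sketch below. It is a sketch under stated assumptions: the ASCII layout used is the widely used FASTA convention (a ">" header line followed by sequence lines), which is a choice made here and not a format prescribed by the text; SQLite from the Python standard library stands in for a database such as DB2, Sybase or Oracle; and the table name, column names, record names and toy sequences are all hypothetical.

```python
import sqlite3


def to_fasta(records, width=60):
    """Render (name, sequence) pairs as FASTA-style ASCII text."""
    lines = []
    for name, seq in records:
        lines.append(f">{name}")
        lines.extend(seq[i:i + width] for i in range(0, len(seq), width))
    return "\n".join(lines) + "\n"


def parse_fasta(text):
    """Parse FASTA-style text back into a list of (name, sequence) pairs."""
    records = []
    name, chunks = None, []
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if name is not None:
                records.append((name, "".join(chunks)))
            name, chunks = line[1:], []
        else:
            chunks.append(line)
    if name is not None:
        records.append((name, "".join(chunks)))
    return records


def store_records(conn, records):
    """Store (name, sequence) pairs in a hypothetical `contigs` table."""
    conn.execute(
        "CREATE TABLE IF NOT EXISTS contigs "
        "(name TEXT PRIMARY KEY, sequence TEXT NOT NULL)"
    )
    conn.executemany("INSERT INTO contigs (name, sequence) VALUES (?, ?)", records)
    conn.commit()


def fetch_sequence(conn, name):
    """Return the stored sequence for `name`, or None if it is absent."""
    row = conn.execute(
        "SELECT sequence FROM contigs WHERE name = ?", (name,)
    ).fetchone()
    return row[0] if row else None


if __name__ == "__main__":
    toy_records = [("contig_1", "ACGT" * 20), ("contig_2", "GGCC")]
    print(to_fasta(toy_records, width=30))
    connection = sqlite3.connect(":memory:")
    store_records(connection, toy_records)
    print(fetch_sequence(connection, "contig_2"))
```

A text file suits sequential reading by search programs, while a database table suits retrieval of individual sequences by name, which mirrors the remark that the choice of storage structure follows the means of access.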

Computer software is publicly available which allows a skilled artisan to access sequence information provided in
a computer readable medium. Thus, by providing in computer readable form the nucleotide sequences of SEQ ID
NOS:1-5,191, a representative fragment thereof, or a nucleotide sequence at least 95%, preferably at least 99% and
most preferably at least 99.9% identical to a sequence of SEQ ID NOS:1-5,191, the present invention enables the
skilled artisan routinely to access the provided sequence information for a wide variety of purposes.

The examples which follow demonstrate how software which implements the BLAST (Altschul et al., J. Mol. Biol.
215:403-410 (1990)) and BLAZE (Brutlag et al., Comp. Chem. 17:203-207 (1993)) search algorithms on a Sybase
system was used to identify open reading frames (ORFs) within the Staphylococcus aureus genome which contain
homology to ORFs or proteins from both Staphylococcus aureus and from other organisms. Among the ORFs discussed
herein are protein encoding fragments of the Staphylococcus aureus genome useful in producing commercially important
proteins, such as enzymes used in fermentation reactions and in the production of commercially useful metabolites.

The present invention further provides systems, particularly computer-based systems, which contain the sequence 
information described herein. Such systems are designed to identify, among other things, commercially important
fragments of the Staphylococcus aureus genome.

As used herein, "a computer-based system" refers to the hardware means, software means, and data storage 
means used to analyze the nucleotide sequence information of the present invention. The minimum hardware means 
of the computer-based systems of the present invention comprises a central processing unit (CPU), input means, 
output means, and data storage means. A skilled artisan can readily appreciate that any one of the currently available
computer-based systems is suitable for use in the present invention.

As stated above, the computer-based systems of the present invention comprise a data storage means having 
stored therein a nucleotide sequence of the present invention and the necessary hardware means and software means 
for supporting and implementing a search means. 

As used herein, "data storage means" refers to memory which can store nucleotide sequence information of the
present invention, or a memory access means which can access manufactures having recorded thereon the nucleotide
sequence information of the present invention. 

As used herein, "search means" refers to one or more programs which are implemented on the computer-based
system to compare a target sequence or target structural motif with the sequence information stored within the data
storage means. Search means are used to identify fragments or regions of the present genomic sequences which
match a particular target sequence or target motif. A variety of known algorithms are disclosed publicly and a variety
of commercially available software packages for conducting search means exist and can be used in the computer-based systems
of the present invention. Examples of such software include, but are not limited to, MacPattern (EMBL), BLASTN and
BLASTX (NCBI). A skilled artisan can readily recognize that any one of the available algorithms or implementing
software packages for conducting homology searches can be adapted for use in the present computer-based systems.

As used herein, a "target sequence" can be any DNA or amino acid sequence of six or more nucleotides or two
or more amino acids. A skilled artisan can readily recognize that the longer a target sequence is, the less likely a target
sequence will be present as a random occurrence in the database. The most preferred sequence length of a target
sequence is from about 10 to 100 amino acids or from about 30 to 300 nucleotide residues. However, it is well recognized
that searches for commercially important fragments, such as sequence fragments involved in gene expression and
protein processing, may be of shorter length.
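
The relationship between target length and chance occurrence can be made concrete with a short calculation. The sketch below assumes independent, equally frequent residues, which is a simplification not stated in the text; the function name and the database length in the usage lines are illustrative only.

```python
def expected_random_hits(target_length, database_length, alphabet_size=4):
    """Expected number of exact chance matches of one target in a random database.

    Assumes independent, equally frequent residues (a simplifying assumption).
    Use alphabet_size=4 for nucleotides and alphabet_size=20 for amino acids.
    """
    positions = max(database_length - target_length + 1, 0)
    return positions * (1.0 / alphabet_size) ** target_length


if __name__ == "__main__":
    # Illustrative database length only; not a figure taken from the text.
    for length in (6, 10, 30):
        print(length, expected_random_hits(length, 2_800_000))
```

Under this model each added nucleotide reduces the expected number of chance matches roughly fourfold, which is the sense in which longer targets are less likely to occur at random.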

As used herein, "a target structural motif," or "target motif," refers to any rationally selected sequence or combi- 
nation of sequences in which the sequence(s) are chosen based on a three-dimensional configuration which is formed 
upon the folding of the target motif. There are a variety of target motifs known in the art. Protein target motifs include, 
but are not limited to, enzymic active sites and signal sequences. Nucleic acid target motifs include, but are not limited
to, promoter sequences, hairpin structures and inducible expression elements (protein binding sequences).

A variety of structural formats for the input and output means can be used to input and output the information in 
the computer-based systems of the present invention. A preferred format for an output means ranks fragments of the 
Staphylococcus aureus genomic sequences possessing varying degrees of homology to the target sequence or target 
motif. Such presentation provides a skilled artisan with a ranking of sequences which contain various amounts of the
target sequence or target motif and identifies the degree of homology contained in the identified fragment.
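
A ranked output of this kind can be sketched as follows. This is a deliberately naive, ungapped sliding-window comparison and is not the BLAST or BLAZE algorithm; the function names and the example contigs are hypothetical.

```python
def percent_identity(a, b):
    """Ungapped percent identity between two equal-length sequences."""
    if len(a) != len(b) or not a:
        raise ValueError("sequences must be non-empty and of equal length")
    matches = sum(x == y for x, y in zip(a, b))
    return 100.0 * matches / len(a)


def rank_fragments(target, sequences, top=5):
    """Slide the target along each stored sequence and rank windows by identity.

    Returns (percent identity, sequence name, 1-based start) tuples, best first.
    """
    hits = []
    width = len(target)
    for name, seq in sequences.items():
        for start in range(len(seq) - width + 1):
            window = seq[start:start + width]
            hits.append((percent_identity(target, window), name, start + 1))
    # Highest identity first; ties broken by name and then by position.
    hits.sort(key=lambda hit: (-hit[0], hit[1], hit[2]))
    return hits[:top]


if __name__ == "__main__":
    stored = {"contig_a": "AAACGTACGTAAA", "contig_b": "TTTCGTTCGTTTT"}
    for identity, name, start in rank_fragments("CGTACGT", stored, top=3):
        print(f"{identity:6.1f}%  {name}  start={start}")
```

Each reported line pairs a fragment location with its degree of identity to the target, which is the information the preferred output format is described as presenting.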

A variety of comparing means can be used to compare a target sequence or target motif with the data storage 
means to identify sequence fragments of the Staphylococcus aureus genome. In the present examples, implementing
software which implements the BLAST and BLAZE algorithms, described in Altschul et al., J. Mol. Biol. 215:403-410
(1990), was used to identify open reading frames within the Staphylococcus aureus genome. A skilled artisan can
readily recognize that any one of the publicly available homology search programs can be used as the search means
for the computer-based systems of the present invention. Of course, suitable proprietary systems that may be known 
to those of skill also may be employed in this regard. 

Figure 1 provides a block diagram of a computer system illustrative of embodiments of this aspect of the present
invention. The computer system 102 includes a processor 106 connected to a bus 104. Also connected to the bus 104
are a main memory 108 (preferably implemented as random access memory, RAM) and a variety of secondary storage
devices 110, such as a hard drive 112 and a removable medium storage device 114. The removable medium storage 
device 114 may represent, for example, a floppy disk drive, a CD-ROM drive, a magnetic tape drive, etc. A removable 
storage medium 116 (such as a floppy disk, a compact disk, a magnetic tape, etc.) containing control logic and/or data 
recorded therein may be inserted into the removable medium storage device 114. The computer system 102 includes 
appropriate software for reading the control logic and/or the data from the removable medium storage device 114, once
it is inserted into the removable medium storage device 114. 

A nucleotide sequence of the present invention may be stored in a well known manner in the main memory 108, 
any of the secondary storage devices 110, and/or a removable storage medium 116. During execution, software for 
accessing and processing the genomic sequence (such as search tools, comparing tools, etc.) resides in main memory
108, in accordance with the requirements and operating parameters of the operating system, the hardware system 
and the software program or programs. 

BIOCHEMICAL EMBODIMENTS

Other embodiments of the present invention are directed to fragments of the Staphylococcus aureus genome, 
preferably to isolated fragments. The fragments of the Staphylococcus aureus genome of the present invention include,
but are not limited to, fragments which encode peptides, hereinafter open reading frames (ORFs), fragments which
modulate the expression of an operably linked ORF, hereinafter expression modulating fragments (EMFs), and fragments
which can be used to diagnose the presence of Staphylococcus aureus in a sample, hereinafter diagnostic
fragments (DFs). 

As used herein, an "isolated nucleic acid molecule" or an "isolated fragment of the Staphylococcus aureus genome"
refers to a nucleic acid molecule possessing a specific nucleotide sequence which has been subjected to purification
means to reduce, from the composition, the number of compounds which are normally associated with the composition.
Particularly, the term refers to the nucleic acid molecules having the sequences set out in SEQ ID NOS:1-5,191, to
representative fragments thereof as described above, to polynucleotides at least 95%, preferably at least 99% and 
especially preferably at least 99.9% identical in sequence thereto, also as set out above. 

A variety of purification means can be used to generate the isolated fragments of the present invention. These
include, but are not limited to, methods which separate constituents of a solution based on charge, solubility, or size.

In one embodiment, Staphylococcus aureus DNA can be mechanically sheared to produce fragments of 15-20 kb
in length. These fragments can then be used to generate a Staphylococcus aureus library by inserting them into
lambda clones as described in the Examples below. Primers flanking, for example, an ORF, such as those enumerated
in Tables 1-3, can then be generated using nucleotide sequence information provided in SEQ ID NOS:1-5,191. Well
known and routine techniques of PCR cloning then can be used to isolate the ORF from the lambda DNA library of
Staphylococcus aureus genomic DNA. Thus, given the availability of SEQ ID NOS:1-5,191, the information in Tables 
1, 2 and 3, and the information that may be obtained readily by analysis of the sequences of SEQ ID NOS:1-5,191 
using methods set out above, those of skill will be enabled by the present disclosure to isolate any ORF-containing or 
other nucleic acid fragment of the present invention.

The isolated nucleic acid molecules of the present invention include, but are not limited to, single stranded and
double stranded DNA, and single stranded RNA.

As used herein, an "open reading frame," ORF, means a series of triplets coding for amino acids without any 
termination codons and is a sequence translatable into protein. 
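
The definition above can be illustrated by scanning a sequence for runs of triplets that contain no termination codon. The sketch below is an assumption-laden illustration only: it uses the three standard stop codons, examines the three frames of one strand, and is not the GeneMark method referred to elsewhere in this description. The opposite strand can be examined by passing the reverse complement.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}


def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def stop_free_stretches(seq, min_codons=1):
    """Yield (frame, start, length_nt) for runs of codons without stop codons.

    frame is 1, 2 or 3, counted from the first 5' nucleotide of seq;
    start is the 1-based position of the first nucleotide of the run.
    """
    seq = seq.upper()
    for offset in range(3):
        run_start = None
        n_codons = 0
        for pos in range(offset, len(seq) - 2, 3):
            codon = seq[pos:pos + 3]
            if codon in STOP_CODONS:
                if run_start is not None and n_codons >= min_codons:
                    yield (offset + 1, run_start + 1, 3 * n_codons)
                run_start, n_codons = None, 0
            else:
                if run_start is None:
                    run_start = pos
                n_codons += 1
        # Report a run that extends to the end of the sequence.
        if run_start is not None and n_codons >= min_codons:
            yield (offset + 1, run_start + 1, 3 * n_codons)
```

In practice a minimum length (the `min_codons` parameter, a name introduced here) would be set well above the toy values used for testing, since short stop-free runs arise by chance in every frame.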

Tables 1, 2 and 3 list ORFs in the Staphylococcus aureus genomic contigs of the present invention that were 
identified as putative coding regions by the GeneMark software using organism-specific second-order Markov probability
transition matrices. It will be appreciated that other criteria can be used, in accordance with well known analytical
methods, such as those discussed herein, to generate more inclusive, more restrictive or more selective lists. 

Table 1 sets out ORFs in the Staphylococcus aureus contigs of the present invention that are at least 80 amino
acids long and that, over a continuous region of at least 50 bases, are 95% or more identical (by BLAST analysis) to
an S. aureus nucleotide sequence available through Genbank in November 1996.

Table 2 sets out ORFs in the Staphylococcus aureus contigs of the present invention that are not in Table 1 and 
match, with a BLASTP probability score of 0.01 or less, a polypeptide sequence available through Genbank by Sep- 
tember 1996. 

Table 3 sets out ORFs in the Staphylococcus aureus contigs of the present invention that do not match significantly, 
by BLASTP analysis, a polypeptide sequence available through Genbank by September 1996.

In each table, the first and second columns identify the ORF by, respectively, contig number and ORF number 
within the contig; the third column indicates the reading frame, taking the first 5' nucleotide of the contig as the start of 
the +1 frame; the fourth column indicates the first nucleotide of the ORF, counting from the 5' end of the contig strand; 
and the fifth column indicates the length of each ORF in nucleotides.

In Tables 1 and 2, column six lists the "Reference" for the closest matching sequence available through Genbank.
These reference numbers are the database entry numbers commonly used by those of skill in the art, who will be
familiar with their denominators. Descriptions of the nomenclature are available from the National Center for Biotechnology
Information. Column seven in Tables 1 and 2 provides the "gene name" of the matching sequence; column eight
provides the "BLAST identity" score from the comparison of the ORF and the homologous gene; and column nine
indicates the length in nucleotides of the "highest scoring segment pair" identified by the BLAST identity analysis.

In Table 3, the last column, column six, indicates the length of each ORF in amino acid residues.

The concepts of percent identity and percent similarity of two polypeptide sequences are well understood in the art.
For example, two polypeptides 10 amino acids in length which differ at three amino acid positions (e.g., at positions
1, 3 and 5) are said to have a percent identity of 70%. However, the same two polypeptides would be deemed to have
a percent similarity of 80% if, for example at position 5, the amino acid moieties, although not identical, were "similar"
(i.e., possessed similar biochemical characteristics). Many programs for analysis of nucleotide or amino acid sequence
similarity, such as FASTA and BLAST, specifically list percent identity of a matching region as an output parameter. Thus,
for instance, Tables 1 and 2 herein enumerate the "percent identity" of the "highest scoring segment pair" in each ORF
and its listed relative. Further details concerning the algorithms and criteria used for homology searches are provided
below and are described in the pertinent literature highlighted by the citations provided below.
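
The arithmetic of the 10-residue example can be reproduced in a few lines. The two peptides and the grouping of "similar" amino acids below are hypothetical choices made for illustration; actual programs score similarity with substitution matrices rather than fixed groups.

```python
# Hypothetical grouping by broad biochemical character (an assumption for
# illustration; residues not listed are similar only to themselves).
SIMILARITY_GROUPS = ("ILVMA", "FWY", "KRH", "DE", "STNQ")


def are_similar(a, b):
    """True if two residues are identical or fall in the same group."""
    return a == b or any(a in group and b in group for group in SIMILARITY_GROUPS)


def percent_identity(p, q):
    """Percentage of aligned positions carrying identical residues."""
    if len(p) != len(q) or not p:
        raise ValueError("peptides must be non-empty and of equal length")
    return 100.0 * sum(a == b for a, b in zip(p, q)) / len(p)


def percent_similarity(p, q):
    """Percentage of aligned positions carrying identical or similar residues."""
    if len(p) != len(q) or not p:
        raise ValueError("peptides must be non-empty and of equal length")
    return 100.0 * sum(are_similar(a, b) for a, b in zip(p, q)) / len(p)


# Two hypothetical 10-residue peptides differing at positions 1, 3 and 5;
# only the position-5 difference (E versus D) is between similar residues.
FIRST = "MKTLEDGSAW"
SECOND = "GKPLDDGSAW"
```

With these inputs seven of ten positions are identical and eight of ten are identical or similar, matching the 70% and 80% figures in the text.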

It will be appreciated that other criteria can be used to generate more inclusive and more exclusive listings of the 
types set out in the tables. As those of skill will appreciate, narrow and broad searches both are useful. Thus, a skilled
artisan can readily identify ORFs in contigs of the Staphylococcus aureus genome other than those listed in Tables
1-3, such as ORFs which are overlapping or encoded by the opposite strand of an identified ORF in addition to those
ascertainable using the computer-based systems of the present invention.

As used herein, an "expression modulating fragment," EMF, means a series of nucleotide molecules which mod- 
ulates the expression of an operably linked ORF or EMF. 

As used herein, a sequence is said to "modulate the expression of an operably linked sequence" when the expression
of the sequence is altered by the presence of the EMF. EMFs include, but are not limited to, promoters, and
promoter modulating sequences (inducible elements). One class of EMFs are fragments which induce the expression
of an operably linked ORF in response to a specific regulatory factor or physiological event.

EMF sequences can be identified within the contigs of the Staphylococcus aureus genome by their proximity to
the ORFs provided in Tables 1-3. An intergenic segment, or a fragment of the intergenic segment, from about 10 to
200 nucleotides in length, taken from any one of the ORFs of Tables 1-3 will modulate the expression of an operably
linked ORF in a fashion similar to that found with the naturally linked ORF sequence. As used herein, an "intergenic
segment" refers to fragments of the Staphylococcus aureus genome which are between two ORF(s) herein described.
EMFs also can be identified using known EMFs as a target sequence or target motif in the computer-based systems
of the present invention. Further, the two methods can be combined and used together.
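
Locating candidate intergenic segments from ORF coordinates can be sketched as below. The function and parameter names are hypothetical, the ORFs are assumed to be given as (start, length) pairs on the forward strand of a contig, and trimming long gaps to the nucleotides nearest the following ORF is a design choice of this sketch rather than something the text prescribes.

```python
def intergenic_segments(contig, orfs, min_len=10, max_len=200):
    """Return (start, end, sequence) for gaps between consecutive ORFs.

    orfs: iterable of (start, length) pairs with 1-based starts.
    Gaps shorter than min_len are skipped; gaps longer than max_len are
    trimmed to the max_len nucleotides immediately before the next ORF.
    """
    intervals = sorted((start, start + length - 1) for start, length in orfs)
    if not intervals:
        return []
    segments = []
    furthest_end = intervals[0][1]
    for start, end in intervals[1:]:
        gap_start, gap_end = furthest_end + 1, start - 1
        gap_len = gap_end - gap_start + 1
        if gap_len >= min_len:
            if gap_len > max_len:
                gap_start = gap_end - max_len + 1
            segments.append((gap_start, gap_end, contig[gap_start - 1:gap_end]))
        # Track the furthest ORF end so overlapping ORFs do not create false gaps.
        furthest_end = max(furthest_end, end)
    return segments
```

Segments returned this way are only candidates; whether a given segment actually modulates expression would still need to be confirmed experimentally.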

The presence and activity of an EMF can be confirmed using an EMF trap vector. An EMF trap vector contains a 
cloning site linked to a marker sequence. A marker sequence encodes an identifiable phenotype, such as antibiotic 
resistance or a complementing nutrition auxotrophic factor, which can be identified or assayed when the EMF trap 
vector is placed within an appropriate host under appropriate conditions. As described above, an EMF will modulate the
expression of an operably linked marker sequence. A more detailed discussion of various marker sequences is provided
below. 

A sequence which is suspected of being an EMF is cloned in all three reading frames in one or more restriction
sites upstream from the marker sequence in the EMF trap vector. The vector is then transformed into an appropriate
host using known procedures and the phenotype of the transformed host is examined under appropriate conditions.
As described above, an EMF will modulate the expression of an operably linked marker sequence.

As used herein, a "diagnostic fragment," DF, means a series of nucleotide molecules which selectively hybridize
to Staphylococcus aureus sequences. DFs can be readily identified by identifying unique sequences within contigs of
the Staphylococcus aureus genome, such as by using well-known computer analysis software, and by generating and
testing probes or amplification primers consisting of the DF sequence in an appropriate diagnostic format which determines
amplification or hybridization selectivity.

The sequences falling within the scope of the present invention are not limited to the specific sequences herein 
described, but also include allelic and species variations thereof. Allelic and species variations can be routinely determined
by comparing the sequences provided in SEQ ID NOS:1-5,191, a representative fragment thereof, or a nucleotide
sequence at least 95%, preferably 99% and most preferably 99.9% identical to SEQ ID NOS:1-5,191, with a sequence
from another isolate of the same species.

Furthermore, to accommodate codon variability, the invention includes nucleic acid molecules coding for the same
amino acid sequences as do the nucleic acid sequences mentioned above. In other words, in the coding region of an 
ORF, substitution of one codon for another which encodes the same amino acid is expressly contemplated. 
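
Codon substitution of this kind can be illustrated with the standard genetic code. The sketch below builds the codon table from the conventional TCAG ordering and rewrites an ORF with synonymous codons; the function names and the example ORF are hypothetical, and picking the alphabetically first synonym is an arbitrary choice made only to keep the example deterministic.

```python
from itertools import product

# Standard genetic code, listed in the conventional TCAG codon order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    "".join(codon): aa for codon, aa in zip(product(BASES, repeat=3), AMINO_ACIDS)
}


def codons_of(orf):
    """Split an ORF into complete codons, ignoring any trailing partial codon."""
    usable = len(orf) - len(orf) % 3
    return [orf[i:i + 3] for i in range(0, usable, 3)]


def translate(orf):
    """Translate an ORF into one-letter amino acids ('*' marks a stop)."""
    return "".join(CODON_TABLE[codon] for codon in codons_of(orf.upper()))


def synonymous_codons(codon):
    """All codons encoding the same amino acid as the given codon, sorted."""
    amino_acid = CODON_TABLE[codon]
    return sorted(c for c, aa in CODON_TABLE.items() if aa == amino_acid)


def degenerate_variant(orf):
    """Rewrite each codon as the alphabetically first synonymous codon."""
    return "".join(synonymous_codons(codon)[0] for codon in codons_of(orf.upper()))
```

For a hypothetical ORF such as `ATGCTGAAAGGGTAA`, the variant differs in nucleotide sequence yet `translate` returns the same polypeptide for both, which is the property the paragraph above relies on.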

Any specific sequence disclosed herein can be readily screened for errors by resequencing a particular fragment, 
such as an ORF, in both directions (i.e., sequence both strands). Alternatively, error screening can be performed by
sequencing corresponding polynucleotides of Staphylococcus aureus origin isolated by using part or all of the fragments 
in question as a probe or primer. 
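
Comparing reads from the two strands can be sketched as follows. The example assumes two reads of equal length covering exactly the same region, with no insertions or deletions; real resequencing data would require alignment first. The function names are hypothetical.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def strand_discrepancies(forward_read, reverse_read):
    """Return 1-based positions where the two strand reads disagree.

    reverse_read is the sequence as read from the opposite strand; it is
    reverse-complemented before comparison with forward_read.
    """
    reverse_as_forward = reverse_complement(reverse_read)
    forward_read = forward_read.upper()
    if len(forward_read) != len(reverse_as_forward):
        raise ValueError("reads must cover the same region and have equal length")
    return [
        position
        for position, (a, b) in enumerate(zip(forward_read, reverse_as_forward), start=1)
        if a != b
    ]
```

Positions reported by such a comparison mark where the fragment would be re-examined; an empty list indicates that the two strands agree over the region compared.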

Each of the ORFs of the Staphylococcus aureus genome disclosed in Tables 1, 2 and 3, and the EMFs found 5' 
to the ORFs, can be used as polynucleotide reagents in numerous ways. For example, the sequences can be used 
as diagnostic probes or diagnostic amplification primers to detect the presence of a specific microbe in a sample,
particularly Staphylococcus aureus. Especially preferred in this regard are ORFs such as those of Table 3, which do not
match previously characterized sequences from other organisms and thus are most likely to be highly selective for
Staphylococcus aureus. Also particularly preferred are ORFs that can be used to distinguish between strains of Staphylococcus
aureus, particularly those that distinguish medically important strains, such as drug-resistant strains.

In addition, the fragments of the present invention, as broadly described, can be used to control gene expression 
through triple helix formation or antisense DNA or RNA, both of which methods are based on the binding of a polynucleotide
sequence to DNA or RNA. Triple helix formation optimally results in a shut-off of RNA transcription from DNA,
while antisense RNA hybridization blocks translation of an mRNA molecule into polypeptide. Information from the
sequences of the present invention can be used to design antisense and triple helix-forming oligonucleotides. Polynucleotides
suitable for use in these methods are usually 20 to 40 bases in length and are designed to be complementary
to a region of the gene involved in transcription, for triple-helix formation, or to the mRNA itself, for antisense inhibition.
Both techniques have been demonstrated to be effective in model systems, and the requisite techniques are well known
and involve routine procedures. Triple helix techniques are discussed in, for example, Lee et al., Nucl. Acids Res. 6:
3073 (1979); Cooney et al., Science 241:456 (1988); and Dervan et al., Science 251:1360 (1991). Antisense techniques
in general are discussed in, for instance, Okano, J. Neurochem. 56:560 (1991) and OLIGODEOXYNUCLEOTIDES
AS ANTISENSE INHIBITORS OF GENE EXPRESSION, CRC Press, Boca Raton, FL (1988).
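
Deriving an antisense oligonucleotide from an mRNA region reduces to taking a reverse complement of the chosen stretch. The sketch below enforces the 20 to 40 base range mentioned above; the function names and the example transcript are hypothetical, and no account is taken of secondary structure or target accessibility.

```python
def reverse_complement(seq):
    """Return the reverse complement of a DNA sequence (A, C, G, T only)."""
    return seq.upper().translate(str.maketrans("ACGT", "TGCA"))[::-1]


def antisense_oligo(mrna, start, length=20):
    """Return a DNA oligonucleotide complementary to a region of an mRNA.

    mrna may be written with U or T; start is 1-based; the result is written
    5' to 3'. length is restricted to the 20 to 40 base range noted in the text.
    """
    if not 20 <= length <= 40:
        raise ValueError("length should be between 20 and 40 bases")
    if start < 1:
        raise ValueError("start is 1-based and must be at least 1")
    region = mrna.upper().replace("U", "T")[start - 1:start - 1 + length]
    if len(region) < length:
        raise ValueError("region extends past the end of the mRNA")
    return reverse_complement(region)


if __name__ == "__main__":
    # Hypothetical transcript fragment used only for demonstration.
    print(antisense_oligo("AUGGCUAAAGGUUUCGAUCCGUAA", start=1, length=20))
```

The returned oligonucleotide pairs base for base with the selected mRNA region, which is the complementarity the antisense approach depends on.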

The present invention further provides recombinant constructs comprising one or more of the Staphylococcus
aureus genomic fragments and contigs of the present invention. Certain preferred recombinant constructs of
the present invention comprise a vector, such as a plasmid or viral vector, into which a fragment of the Staphylococcus
aureus genome has been inserted, in a forward or reverse orientation. In the case of a vector comprising one of the
ORFs of the present invention, the vector may further comprise regulatory sequences, including for example, a promoter,
operably linked to the ORF. For vectors comprising the EMFs of the present invention, the vector may further
comprise a marker sequence or heterologous ORF operably linked to the EMF.

Large numbers of suitable vectors and promoters are known to those of skill in the art and are commercially 
available for generating the recombinant constructs of the present invention. The following vectors are provided by 
way of example. Useful bacterial vectors include phagescript, PsiX174, pBluescript SK and KS (+ and -), pNH8a,
pNH16a, pNH18a, pNH46a (available from Stratagene); pTrc99A, pKK223-3, pKK233-3, pDR540, pRIT5 (available
from Pharmacia). Useful eukaryotic vectors include pWLneo, pSV2cat, pOG44, pXT1, pSG (available from Stratagene);
pSVK3, pBPV, pMSG, pSVL (available from Pharmacia).

Promoter regions can be selected from any desired gene using CAT (chloramphenicol acetyl transferase) vectors or other
vectors with selectable markers. Two appropriate vectors are pKK232-8 and pCM7. Particular named bacterial promoters
include lacI, lacZ, T3, T7, gpt, lambda PR, and trc. Eukaryotic promoters include CMV immediate early, HSV
thymidine kinase, early and late SV40, LTRs from retrovirus, and mouse metallothionein-I. Selection of the appropriate
vector and promoter is well within the level of ordinary skill in the art. 

The present invention further provides host cells containing any one of the isolated fragments of the Staphylococcus 
aureus genomic fragments and contigs of the present invention, wherein the fragment has been introduced into the 
host cell using known methods. The host cell can be a higher eukaryotic host cell, such as a mammalian cell, a lower
eukaryotic host cell, such as a yeast cell, or a procaryotic cell, such as a bacterial cell.

A polynucleotide of the present invention, such as a recombinant construct comprising an ORF of the present 
invention, may be introduced into the host by a variety of well established techniques that are standard in the art, such 
as calcium phosphate transfection, DEAE-dextran mediated transfection and electroporation, which are described in,
for instance, Davis, L. et al., BASIC METHODS IN MOLECULAR BIOLOGY (1986).

A host cell containing one of the Staphylococcus aureus genomic fragments and contigs of the
present invention can be used in conventional manners to produce the gene product encoded by the isolated fragment
(in the case of an ORF) or can be used to produce a heterologous protein under the control of the EMF.

The present invention further provides isolated polypeptides encoded by the nucleic acid fragments of the present 
invention or by degenerate variants of the nucleic acid fragments of the present invention. By "degenerate variant" is
intended nucleotide fragments which differ from a nucleic acid fragment of the present invention (e.g., an ORF) by
nucleotide sequence but, due to the degeneracy of the Genetic Code, encode an identical polypeptide sequence.

Preferred nucleic acid fragments of the present invention are the ORFs depicted in Tables 2 and 3 which encode 
proteins. 

A variety of methodologies known in the art can be utilized to obtain any one of the isolated polypeptides or proteins
of the present invention. At the simplest level, the amino acid sequence can be synthesized using commercially available
peptide synthesizers. This is particularly useful in producing small peptides and fragments of larger polypeptides.
Such short fragments as may be obtained most readily by synthesis are useful, for example, in generating antibodies 
against the native polypeptide, as discussed further below. 

In an alternative method, the polypeptide or protein is purified from bacterial cells which naturally produce the 
polypeptide or protein. One skilled in the art can readily employ well-known methods for isolating polypeptides and
proteins to isolate and purify polypeptides or proteins of the present invention produced naturally by a bacterial strain,
or by other methods. Methods for isolation and purification that can be employed in this regard include, but are not
limited to, immunochromatography, HPLC, size-exclusion chromatography, ion-exchange chromatography, and immuno-affinity
chromatography.

The polypeptides and proteins of the present invention also can be purified from cells which have been altered to 
express the desired polypeptide or protein. As used herein, a cell is said to be altered to express a desired polypeptide 
or protein when the cell, through genetic manipulation, is made to produce a polypeptide or protein which it normally
does not produce or which the cell normally produces at a lower level. Those skilled in the art can readily adapt procedures
for introducing and expressing either recombinant or synthetic sequences into eukaryotic or prokaryotic cells
in order to generate a cell which produces one of the polypeptides or proteins of the present invention. 

Any host/vector system can be used to express one or more of the ORFs of the present invention. These include,
but are not limited to, eukaryotic hosts such as HeLa cells, CV-1 cells, COS cells, and Sf9 cells, as well as prokaryotic
hosts such as E. coli and B. subtilis. The most preferred cells are those which do not normally express the particular
polypeptide or protein or which express the polypeptide or protein at a low natural level.

"Recombinant," as used herein, means that a polypeptide or protein is derived from recombinant (e.g., microbial 
or mammalian) expression systems. "Microbial" refers to recombinant polypeptides or proteins made in bacterial or
fungal (e.g., yeast) expression systems. As a product, "recombinant microbial" defines a polypeptide or protein essentially
free of native endogenous substances and unaccompanied by associated native glycosylation. Polypeptides or
proteins expressed in most bacterial cultures, e.g., E. coli, will be free of glycosylation modifications; polypeptides or 
proteins expressed in yeast will have a glycosylation pattern different from that expressed in mammalian cells. 

"Nucleotide sequence" refers to a heteropolymer of deoxyribonucleotides. Generally, DNA segments encoding the 
polypeptides and proteins provided by this invention are assembled from fragments of the Staphylococcus aureus
genome and short oligonucleotide linkers, or from a series of oligonucleotides, to provide a synthetic gene which is
capable of being expressed in a recombinant transcriptional unit comprising regulatory elements derived from a mi- 
crobial or viral operon. 

"Recombinant expression vehicle or vector" refers to a plasmid or phage or virus or vector, for expressing a polypep- 
tide from a DNA (RNA) sequence. The expression vehicle can comprise a transcriptional unit comprising an assembly
of (1) a genetic regulatory element or elements necessary for gene expression in the host, including elements required to initiate
and maintain transcription at a level sufficient for suitable expression of the desired polypeptide, including, for example,
promoters and, where necessary, an enhancer and a polyadenylation signal; (2) a structural or coding sequence
which is transcribed into mRNA and translated into protein; and (3) appropriate signals to initiate translation at the
beginning of the desired coding region and terminate translation at its end. Structural units intended for use in yeast
or eukaryotic expression systems preferably include a leader sequence enabling extracellular secretion of translated
protein by a host cell. Alternatively, where recombinant protein is expressed without a leader or transport sequence,
it may include an N-terminal methionine residue. This residue may or may not be subsequently cleaved from the 
expressed recombinant protein to provide a final product. 

"Recombinant expression system" means host cells which have stably integrated a recombinant transcriptional 

unit into chromosomal DNA or carry the recombinant transcriptional unit extrachromosomally. The cells can be prokaryotic
or eukaryotic. Recombinant expression systems as defined herein will express heterologous polypeptides or proteins
upon induction of the regulatory elements linked to the DNA segment or synthetic gene to be expressed.

Mature proteins can be expressed in mammalian cells, yeast, bacteria, or other cells under the control of appropriate
promoters. Cell-free translation systems can also be employed to produce such proteins using RNAs derived
from the DNA constructs of the present invention. Appropriate cloning and expression vectors for use with prokaryotic
and eukaryotic hosts are described in Sambrook et al., MOLECULAR CLONING: A LABORATORY MANUAL, 2nd Edition,
Cold Spring Harbor Laboratory Press, Cold Spring Harbor, New York (1989), the disclosure of which is hereby
incorporated by reference in its entirety. 

Generally, recombinant expression vectors will include origins of replication and selectable markers permitting
transformation of the host cell, e.g., the ampicillin resistance gene of E. coli and S. cerevisiae TRP1 gene, and a
promoter derived from a highly expressed gene to direct transcription of a downstream structural sequence. Such
promoters can be derived from operons encoding glycolytic enzymes such as 3-phosphoglycerate kinase (PGK), alpha-factor,
acid phosphatase, or heat shock proteins, among others. The heterologous structural sequence is assembled
in appropriate phase with translation initiation and termination sequences, and preferably, a leader sequence capable
of directing secretion of translated protein into the periplasmic space or extracellular medium. Optionally, the heterologous
sequence can encode a fusion protein including an N-terminal identification peptide imparting desired characteristics,
e.g., stabilization or simplified purification of expressed recombinant product.

Useful expression vectors for bacterial use are constructed by inserting a structural DNA sequence encoding a 
desired protein together with suitable translation initiation and termination signals in operable reading phase with a
functional promoter. The vector will comprise one or more phenotypic selectable markers and an origin of replication
to ensure maintenance of the vector and, when desirable, provide amplification within the host. 

Suitable prokaryotic hosts for transformation include strains of Staphylococcus aureus, E. coli, B. subtilis, Salmonella
typhimurium and various species within the genera Pseudomonas, Streptomyces, and Staphylococcus. Others
may also be employed as a matter of choice.

As a representative but non-limiting example, useful expression vectors for bacterial use can comprise a selectable
marker and bacterial origin of replication derived from commercially available plasmids comprising genetic elements
of the well known cloning vector pBR322 (ATCC 37017). Such commercial vectors include, for example, pKK223-3
(available from Pharmacia Fine Chemicals, Uppsala, Sweden) and GEM 1 (available from Promega Biotec, Madison,
WI, USA). These pBR322 "backbone" sections are combined with an appropriate promoter and the structural sequence
to be expressed.

Following transformation of a suitable host strain and growth of the host strain to an appropriate cell density, the
selected promoter, where it is inducible, is derepressed or induced by appropriate means (e.g., temperature shift or
chemical induction) and cells are cultured for an additional period to provide for expression of the induced gene product.
Thereafter cells are typically harvested, generally by centrifugation, disrupted to release expressed protein, generally
by physical or chemical means, and the resulting crude extract is retained for further purification.

Various mammalian cell culture systems can also be employed to express recombinant protein. Examples of
mammalian expression systems include the COS-7 lines of monkey kidney fibroblasts, described in Gluzman, Cell 23:175
(1981), and other cell lines capable of expressing a compatible vector, for example, the C127, 3T3, CHO, HeLa and
BHK cell lines.

Mammalian expression vectors will comprise an origin of replication, a suitable promoter and enhancer, and also
any necessary ribosome binding sites, polyadenylation site, splice donor and acceptor sites, transcriptional termination
sequences, and 5' flanking nontranscribed sequences. DNA sequences derived from the SV40 viral genome, for
example, SV40 origin, early promoter, enhancer, splice, and polyadenylation sites may be used to provide the required
nontranscribed genetic elements.

Recombinant polypeptides and proteins produced in bacterial culture are usually isolated by initial extraction from
cell pellets, followed by one or more salting-out, aqueous ion exchange or size exclusion chromatography steps.
Microbial cells employed in expression of proteins can be disrupted by any convenient method, including freeze-thaw
cycling, sonication, mechanical disruption, or use of cell lysing agents. Protein refolding steps can be used, as
necessary, in completing configuration of the mature protein. Finally, high performance liquid chromatography (HPLC) can
be employed for final purification steps.

An additional aspect of the invention includes Staphylococcus aureus polypeptides which are useful as
immunodiagnostic antigens and/or immunoprotective vaccines, collectively "immunologically useful polypeptides". Such
immunologically useful polypeptides may be selected from the ORFs disclosed herein based on techniques well known
in the art and described elsewhere herein. The inventors have used the following criteria to select several
immunologically useful polypeptides:

As is known in the art, an amino terminal type I signal sequence directs a nascent protein across the plasma and
outer membranes to the exterior of the bacterial cell. Such outermembrane polypeptides are expected to be
immunologically useful. According to Izard, J. W. et al., Mol. Microbiol. 13, 765-773 (1994), polypeptides containing type I
signal sequences have the following physical attributes: the length of the type I signal sequence is approximately
15 to 25 primarily hydrophobic amino acid residues with a net positive charge in the extreme amino terminus; the
central region of the signal sequence must adopt an alpha-helical conformation in a hydrophobic environment; and the
region surrounding the actual site of cleavage is ideally six residues long, with small side-chain amino acids in the -1
and -3 positions.
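The type I criteria above can be approximated by a simple screen. The following Python sketch is illustrative only and is not the inventors' algorithm: the function name, the choice of hydrophobic and small residue sets, the five-residue window used for the amino terminal charge, and the 50% cutoff for "primarily hydrophobic" are all assumptions. The alpha-helical requirement is not tested.

```python
# Hypothetical residue classes; the source does not enumerate them.
HYDROPHOBIC = set("AVLIMFWC")
SMALL = set("AGS")
BASIC = set("KR")
ACIDIC = set("DE")


def looks_like_type_i_signal(protein, signal_len):
    """Rough check of the first `signal_len` residues against the type I pattern."""
    # Length criterion: approximately 15 to 25 residues.
    if not 15 <= signal_len <= 25 or len(protein) <= signal_len:
        return False
    signal = protein[:signal_len].upper()
    # Net positive charge in the extreme amino terminus (assumed: first 5 residues).
    n_region = signal[:5]
    net_charge = sum(aa in BASIC for aa in n_region) - sum(aa in ACIDIC for aa in n_region)
    # "Primarily hydrophobic" is taken here as at least half of the residues.
    hydrophobic_fraction = sum(aa in HYDROPHOBIC for aa in signal) / signal_len
    # Small side-chain residues at positions -1 and -3 relative to the cleavage site.
    small_at_cleavage = signal[-1] in SMALL and signal[-3] in SMALL
    return net_charge > 0 and hydrophobic_fraction >= 0.5 and small_at_cleavage
```

A candidate cleavage position must be supplied by the caller; in practice one might try each length from 15 to 25 and keep the ORFs for which any length passes.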

Also known in the art is the type IV signal sequence which is an example of the several types of functional signal
sequences which exist in addition to the type I signal sequence detailed above. Although functionally related, the type
IV signal sequence possesses a unique set of biochemical and physical attributes (Strom, M. S. and Lory, S.,
J. Bacteriol. 174, 7345-7351 (1992)). These are typically six to eight amino acids with a net basic charge followed by an
additional sixteen to thirty primarily hydrophobic residues. The cleavage site of a type IV signal sequence is typically
after the initial six to eight amino acids at the extreme amino terminus. In addition, all type IV signal sequences contain
a phenylalanine residue at the +1 site relative to the cleavage site.

Studies of the cleavage sites of twenty-six bacterial lipoprotein precursors have allowed the definition of a consensus
amino acid sequence for lipoprotein cleavage. Nearly three-fourths of the bacterial lipoprotein precursors examined
contained the sequence L-(A,S)-(G,A)-C at positions -3 to +1, relative to the point of cleavage (Hayashi, S. and Wu,
H. C. Lipoproteins in bacteria. J. Bioenerg. Biomembr. 22, 451-471; 1990).
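The consensus L-(A,S)-(G,A)-C maps directly onto a regular expression. The sketch below is a minimal illustration; the function name and the 40-residue search window are assumptions and are not taken from the specification.

```python
import re

# L-(A,S)-(G,A)-C at positions -3 to +1 relative to the point of cleavage.
LIPOBOX = re.compile(r"L[AS][GA]C")


def find_lipoprotein_cleavage(protein, max_signal_len=40):
    """Return the 0-based index of the +1 cysteine, or None if no consensus is found.

    Only the first `max_signal_len` residues are searched (assumed window).
    The returned index equals the number of residues removed by cleavage.
    """
    match = LIPOBOX.search(protein.upper(), 0, max_signal_len)
    if match is None:
        return None
    return match.start() + 3
```

Because only about three-fourths of the precursors examined carried this exact consensus, a negative result does not exclude a lipoprotein.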

It is well known that most anchored proteins found on the surface of gram-positive bacteria possess a highly
conserved carboxy terminal sequence. More than fifty such proteins from organisms such as S. pyogenes, S. mutans, E.
faecalis, S. pneumoniae, and others, have been identified based on their extracellular location and carboxy terminal
amino acid sequence (Fischetti, V. A. Gram-positive commensal bacteria deliver antigens to elicit mucosal and systemic
immunity. ASM News 62, 405-410; 1996). The conserved region is comprised of six charged amino acids at the extreme
carboxy terminus coupled to 15-20 hydrophobic amino acids presumed to function as a transmembrane domain.
Immediately adjacent to the transmembrane domain is a six amino acid sequence conserved in nearly all proteins
examined. The amino acid sequence of this region is L-P-X-T-G-X, where X is any amino acid.
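A search for the carboxy terminal anchor can be sketched as follows. This is an assumed, simplified screen: it looks for L-P-X-T-G-X within a carboxy terminal window and requires that some of the last six residues be charged. The window size, the minimum count of charged residues, and the function name are hypothetical, and the hydrophobic transmembrane stretch is not scored.

```python
import re

ANCHOR_MOTIF = re.compile(r"LP.TG.")  # L-P-X-T-G-X, X being any amino acid
CHARGED = set("KRDE")


def find_cell_wall_anchor(protein, window=60, min_charged=3):
    """Return the 0-based start of the last L-P-X-T-G-X motif near the C-terminus.

    Returns None if no motif lies within the last `window` residues or if fewer
    than `min_charged` of the final six residues are charged.
    """
    seq = protein.upper()
    start = max(0, len(seq) - window)
    matches = list(ANCHOR_MOTIF.finditer(seq, start))
    if not matches:
        return None
    charged_tail = sum(aa in CHARGED for aa in seq[-6:])
    if charged_tail < min_charged:
        return None
    return matches[-1].start()
```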

Amino acid sequence similarity to proteins of known function, as determined by BLAST, enables the assignment of putative
functions to novel amino acid sequences and allows for the selection of proteins thought to function outside the cell
wall. Such proteins are well known in the art and include "lipoprotein", "periplasmic", or "antigen".

An algorithm for selecting antigenic and immunogenic Staphylococcus aureus polypeptides including the foregoing
criteria was developed by the present inventors. Use of the algorithm by the inventors to select immunologically useful
Staphylococcus aureus polypeptides resulted in the selection of several ORFs which are predicted to be
outermembrane-associated proteins. These proteins are identified in Table 4, below, and shown in the Sequence Listing as SEQ
ID NOS:5,192 to 5,255. Thus the amino acid sequence of each of several antigenic Staphylococcus aureus polypeptides
listed in Table 4 can be determined, for example, by locating the amino acid sequence of the ORF in the Sequence
Listing. Likewise the polynucleotide sequence encoding each ORF can be found by locating the corresponding
polynucleotide SEQ ID in Tables 1, 2, or 3, and finding the corresponding nucleotide sequence in the sequence listing.

As will be appreciated by those of ordinary skill in the art, although a polypeptide representing an entire ORF may
be the closest approximation to a protein found in vivo, it is not always technically practical to express a complete ORF
in vitro. It may be very challenging to express and purify a highly hydrophobic protein by common laboratory methods.
As a result, the immunologically useful polypeptides described herein as SEQ ID NOS:5,192-5,255 may have been
modified slightly to simplify the production of recombinant protein, and are the preferred embodiments. In general,
nucleotide sequences which encode highly hydrophobic domains, such as those found at the amino terminal signal
sequence, are excluded for enhanced in vitro expression of the polypeptides. Furthermore, any highly hydrophobic
amino acid sequences occurring at the carboxy terminus are also excluded. Such truncated polypeptides include for
example the mature forms of the polypeptides expected to exist in nature.

Those of ordinary skill in the art can identify soluble portions of the polypeptides identified in Table 4, and in the case
of truncated polypeptide sequences shown as SEQ ID NOS:5,192-5,255, may obtain the complete predicted amino
acid sequence of each polypeptide by translating the corresponding polynucleotide sequences of the corresponding
ORF listed in Tables 1, 2 and 3 and found in the sequence listing.
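Translating an ORF's polynucleotide sequence into its predicted amino acid sequence can be illustrated with the standard codon assignments, which the bacterial genetic code shares. The sketch below is a generic example, not a procedure taken from the specification; the function name is hypothetical, the input is assumed to begin in frame, and alternative start codons (for example GTG or TTG), which encode methionine at an initiation site, are not special-cased.

```python
# Standard codon table built from the conventional TCAG ordering.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate_orf(dna):
    """Translate an in-frame coding sequence, stopping at the first stop codon."""
    seq = dna.upper().replace("U", "T")
    protein = []
    for i in range(0, len(seq) - 2, 3):
        # Codons containing ambiguous bases are reported as "X".
        amino_acid = CODON_TABLE.get(seq[i:i + 3], "X")
        if amino_acid == "*":
            break
        protein.append(amino_acid)
    return "".join(protein)
```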

Accordingly, polypeptides comprising the complete amino acid sequence of an immunologically useful polypeptide selected
from the group of polypeptides encoded by the ORFs identified in Table 4, or an amino acid sequence at least 95%
identical thereto, preferably at least 97% identical thereto, and most preferably at least 99% identical thereto, form an
embodiment of the invention; in addition, polypeptides comprising an amino acid sequence selected from the group of
amino acid sequences shown in the sequence listing as SEQ ID NOS:5,191-5,255, or an amino acid sequence at least
95% identical thereto, preferably at least 97% identical thereto and most preferably at least 99% identical thereto, form
an embodiment of the invention. Polynucleotides encoding the foregoing polypeptides also form part of the present
invention.

In another aspect, the invention provides a peptide or polypeptide comprising an epitope-bearing portion of a
polypeptide of the invention, particularly those epitope-bearing portions (antigenic regions) identified in Table 4. The
epitope-bearing portion is an immunogenic or antigenic epitope of a polypeptide of the invention. An "immunogenic
epitope" is defined as a part of a protein that elicits an antibody response when the whole protein is the immunogen.
On the other hand, a region of a protein molecule to which an antibody can bind is defined as an "antigenic epitope."
The number of immunogenic epitopes of a protein generally is less than the number of antigenic epitopes. See, for
instance, Geysen et al., Proc. Natl. Acad. Sci. USA 81:3998-4002 (1983).

As to the selection of peptides or polypeptides bearing an antigenic epitope (i.e., that contain a region of a protein
molecule to which an antibody can bind), it is well known in the art that relatively short synthetic peptides that mimic
part of a protein sequence are routinely capable of eliciting an antiserum that reacts with the partially mimicked protein.
See, for instance, Sutcliffe, J. G., Shinnick, T. M., Green, N. and Lerner, R. A. (1983) "Antibodies that react with
predetermined sites on proteins", Science 219:660-666. Peptides capable of eliciting protein-reactive sera are
frequently represented in the primary sequence of a protein, can be characterized by a set of simple chemical rules, and
are confined neither to immunodominant regions of intact proteins (i.e., immunogenic epitopes) nor to the amino or
carboxyl terminals. Antigenic epitope-bearing peptides and polypeptides of the invention are therefore useful to raise
antibodies, including monoclonal antibodies, that bind specifically to a polypeptide of the invention. See, for instance,
Wilson et al., Cell 37:767-778 (1984) at 777.

Antigenic epitope-bearing peptides and polypeptides of the invention preferably contain a sequence of at least
seven, more preferably at least nine and most preferably between about 15 to about 30 amino acids contained within
the amino acid sequence of a polypeptide of the invention. Non-limiting examples of antigenic polypeptides or peptides
that can be used to generate S. aureus specific antibodies include: a polypeptide comprising peptides shown in Table
4 below. These polypeptide fragments have been determined to bear antigenic epitopes of indicated S. aureus proteins
by the analysis of the Jameson-Wolf antigenic index, a representative sample of which is shown in Figure 3.

The epitope-bearing peptides and polypeptides of the invention may be produced by any conventional means.
See, e.g., Houghten, R. A. (1985) General method for the rapid solid-phase synthesis of large numbers of peptides:
specificity of antigen-antibody interaction at the level of individual amino acids. Proc. Natl. Acad. Sci. USA 82:
5131-5135; this "Simultaneous Multiple Peptide Synthesis (SMPS)" process is further described in U.S. Patent No.
4,631,211 to Houghten et al. (1986). Epitope-bearing peptides and polypeptides of the invention are used to induce
antibodies according to methods well known in the art. See, for instance, Sutcliffe et al., supra; Wilson et al., supra;
Chow, M. et al., Proc. Natl. Acad. Sci. USA 82:910-914; and Bittle, F. J. et al., J. Gen. Virol. 66:2347-2354 (1985).

Immunogenic epitope-bearing peptides of the invention, i.e., those parts of a protein that elicit an antibody response
when the whole protein is the immunogen, are identified according to methods known in the art. See, for instance,
Geysen et al., supra. Further still, U.S. Patent No. 5,194,392 to Geysen (1990) describes a general method of detecting
or determining the sequence of monomers (amino acids or other compounds) which is a topological equivalent of the
epitope (i.e., a "mimotope") which is complementary to a particular paratope (antigen binding site) of an antibody of
interest. More generally, U.S. Patent No. 4,433,092 to Geysen (1989) describes a method of detecting or determining
a sequence of monomers which is a topographical equivalent of a ligand which is complementary to the ligand binding
site of a particular receptor of interest. Similarly U.S. Patent No. 5,480,971 to Houghten, R. A. et al. (1996) on
Peralkylated Oligopeptide Mixtures discloses linear C1-C7-alkyl peralkylated oligopeptides and sets and libraries of such
peptides, as well as methods for using such oligopeptide sets and libraries for determining the sequence of a
peralkylated oligopeptide that preferentially binds to an acceptor molecule of interest. Thus, non-peptide analogs of the
epitope-bearing peptides of the invention also can be made routinely by these methods.

Table 4 lists immunologically useful polypeptides identified by an algorithm which locates novel Staphylococcus
aureus outermembrane proteins, as is described above. Also listed are epitopes or "antigenic regions" of each of the
identified polypeptides. The antigenic regions, or epitopes, are delineated by two numbers x-y, where x is the number
of the first amino acid in the open reading frame included within the epitope and y is the number of the last amino acid
in the open reading frame included within the epitope. For example, the first epitope in ORF 168-6 is comprised of
amino acids 36 to 45 of SEQ ID NO:5,192, as is described in Table 4. The inventors have identified several epitopes
for each of the antigenic polypeptides identified in Table 4. Accordingly, forming part of the present invention are
polypeptides comprising an amino acid sequence of one or more antigenic regions identified in Table 4. The invention
further provides polynucleotides encoding such polypeptides.
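The x-y notation is one-based and inclusive at both ends, which is easy to get wrong when slicing sequences in software. The short sketch below shows the conversion; the function name and the placeholder sequence in the usage note are hypothetical and do not come from the Sequence Listing.

```python
def antigenic_region(orf_protein, region):
    """Return the residues of an 'x-y' region (1-based, inclusive) from an ORF."""
    first, last = (int(part) for part in region.split("-"))
    if not 1 <= first <= last <= len(orf_protein):
        raise ValueError(f"region {region} lies outside a {len(orf_protein)}-residue sequence")
    # Convert the 1-based inclusive range to a 0-based half-open slice.
    return orf_protein[first - 1:last]
```

For a region such as 36-45, the call returns ten residues, namely those at positions 36 through 45 of the supplied sequence.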

The present invention further includes isolated polypeptides, proteins and nucleic acid molecules which are
substantially equivalent to those herein described. As used herein, substantially equivalent can refer both to nucleic acid
and amino acid sequences, for example a mutant sequence, that varies from a reference sequence by one or more
substitutions, deletions, or additions, the net effect of which does not result in an adverse functional dissimilarity
between reference and subject sequences. For purposes of the present invention, sequences having equivalent biological
activity, and equivalent expression characteristics are considered substantially equivalent. For purposes of determining
equivalence, truncation of the mature sequence should be disregarded.

The invention further provides methods of obtaining homologs, from other strains of Staphylococcus aureus, of the
fragments of the Staphylococcus aureus genome of the present invention and homologs of the proteins encoded by
the ORFs of the present invention. As used herein, a sequence or protein of Staphylococcus aureus is defined as a
homolog of one of the Staphylococcus aureus fragments or contigs, or of a protein encoded by one of the ORFs
of the present invention, if it shares significant homology to one of the fragments of the Staphylococcus aureus genome
of the present invention or a protein encoded by one of the ORFs of the present invention. Specifically, by using the
sequence disclosed herein as a probe or as primers, and techniques such as PCR cloning and colony/plaque
hybridization, one skilled in the art can obtain homologs.

As used herein, two nucleic acid molecules or proteins are said to "share significant homology" if the two contain
regions which possess greater than 85% sequence (amino acid or nucleic acid) homology. Preferred homologs in this
regard are those with more than 90% homology. Especially preferred are those with 93% or more homology. Among
especially preferred homologs those with 95% or more homology are particularly preferred. Very particularly preferred
among these are those with 97% and even more particularly preferred among those are homologs with 99% or more
homology. The most preferred homologs among these are those with 99.9% homology or more. It will be understood
that, among measures of homology, identity is particularly preferred in this regard.
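The graded thresholds can be applied mechanically once percent identity has been computed. The sketch below assumes two sequences that have already been aligned to equal length, with "-" marking gaps, and counts identity over all alignment columns; that denominator, the function names, and the treatment of the 97% tier as inclusive are assumptions, since the text does not define them.

```python
def percent_identity(aligned_a, aligned_b):
    """Percent of alignment columns holding the same residue (gap columns never match)."""
    if len(aligned_a) != len(aligned_b) or not aligned_a:
        raise ValueError("sequences must be aligned to the same non-zero length")
    matches = sum(a == b and a != "-" for a, b in zip(aligned_a.upper(), aligned_b.upper()))
    return 100.0 * matches / len(aligned_a)


# (threshold, inclusive): "greater than 85%" and "more than 90%" are strict,
# while "93% or more" and the higher tiers are treated as inclusive.
TIERS = [(99.9, True), (99.0, True), (97.0, True), (95.0, True),
         (93.0, True), (90.0, False), (85.0, False)]


def homology_tier(pct):
    """Return the highest threshold met by `pct`, or None if homology is not significant."""
    for threshold, inclusive in TIERS:
        if pct > threshold or (inclusive and pct == threshold):
            return threshold
    return None
```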

Region specific primers or probes derived from the nucleotide sequence provided in SEQ ID NOS:1-5,191 or from
a nucleotide sequence at least 95%, particularly at least 99%, especially at least 99.5% identical to a sequence of SEQ
ID NOS:1-5,191 can be used to prime DNA synthesis and PCR amplification, as well as to identify colonies containing
cloned DNA encoding a homolog. Methods suitable to this aspect of the present invention are well known and have
been described in great detail in many publications such as, for example, Innis et al., PCR PROTOCOLS, Academic
Press, San Diego, CA (1990).
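Deriving a region specific primer pair from a known sequence can be sketched as follows. This is a minimal illustration rather than a primer design method from the specification: the function names and the default primer length of 20 nucleotides are assumptions, and melting temperature, GC content and secondary structure are not evaluated.

```python
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(dna):
    """Return the reverse complement of a DNA sequence written 5' to 3'."""
    return dna.upper().translate(COMPLEMENT)[::-1]


def region_primers(template, start, end, primer_len=20):
    """Return (forward, reverse) primers flanking template[start..end], 1-based inclusive."""
    if not 1 <= start <= end <= len(template):
        raise ValueError("region lies outside the template")
    region = template[start - 1:end].upper()
    if len(region) < primer_len:
        raise ValueError("region is shorter than the requested primer length")
    # The forward primer matches the top strand at the 5' end of the region.
    forward = region[:primer_len]
    # The reverse primer anneals to the top strand at the 3' end of the region.
    reverse = reverse_complement(region[-primer_len:])
    return forward, reverse
```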

When using primers derived from SEQ ID NOS:1-5,191 or from a nucleotide sequence having an aforementioned
identity to a sequence of SEQ ID NOS:1-5,191, one skilled in the art will recognize that by employing high stringency
conditions (e.g., annealing at 50-60°C in 6X SSPC and 50% formamide, and washing at 50-65°C in 0.5X SSPC) only
sequences which are greater than 75% homologous to the primer will be amplified. By employing lower stringency
conditions (e.g., hybridizing at 35-37°C in 5X SSPC and 40-45% formamide, and washing at 42°C in 0.5X SSPC),
sequences which are greater than 40-50% homologous to the primer will also be amplified.

When using DNA probes derived from SEQ ID NOS:1-5,191, or from a nucleotide sequence having an
aforementioned identity to a sequence of SEQ ID NOS:1-5,191, for colony/plaque hybridization, one skilled in the art will
recognize that by employing high stringency conditions (e.g., hybridizing at 50-65°C in 5X SSPC and 50% formamide, and
washing at 50-65°C in 0.5X SSPC), sequences having regions which are greater than 90% homologous to the probe
can be obtained, and that by employing lower stringency conditions (e.g., hybridizing at 35-37°C in 5X SSPC and
40-45% formamide, and washing at 42°C in 0.5X SSPC), sequences having regions which are greater than 35-45%
homologous to the probe will be obtained.

Any organism can be used as the source for homologs of the present invention so long as the organism naturally
expresses such a protein or contains genes encoding the same. The most preferred organisms for isolating homologs
are bacteria which are closely related to Staphylococcus aureus.

ILLUSTRATIVE USES OF COMPOSITIONS OF THE INVENTION 


Each ORF provided in Tables 1 and 2 is identified with a function by homology to a known gene or polypeptide.
As a result, one skilled in the art can use the polypeptides of the present invention for commercial, therapeutic and
industrial purposes consistent with the type of putative identification of the polypeptide. Such identifications permit one
skilled in the art to use the Staphylococcus aureus ORFs in a manner similar to the known type of sequences for which
the identification is made; for example, to ferment a particular sugar source or to produce a particular metabolite. A
variety of reviews illustrative of this aspect of the invention are available, including the following reviews on the industrial
use of enzymes, for example, BIOCHEMICAL ENGINEERING AND BIOTECHNOLOGY HANDBOOK, 2nd Ed.,
Macmillan Publications, Ltd. NY (1991) and BIOCATALYSTS IN ORGANIC SYNTHESES, Tramper et al., Eds., Elsevier
Science Publishers, Amsterdam, The Netherlands (1985). A variety of exemplary uses that illustrate this and similar
aspects of the present invention are discussed below.

1. Biosynthetic Enzymes 

Open reading frames encoding proteins involved in mediating the catalytic reactions involved in intermediary and
macromolecular metabolism, the biosynthesis of small molecules, cellular processes and other functions can be used
for industrial biosynthesis. These include enzymes involved in the degradation of the intermediary products of
metabolism, enzymes involved in central intermediary metabolism, enzymes involved in respiration, both aerobic and
anaerobic, enzymes involved in fermentation, enzymes involved in ATP proton motive force conversion, enzymes
involved in broad regulatory function, enzymes involved in amino acid synthesis, enzymes involved in nucleotide
synthesis, and enzymes involved in cofactor and vitamin synthesis.

The various metabolic pathways present in Staphylococcus aureus can be identified based on absolute nutritional
requirements as well as by examining the various enzymes identified in Tables 1-3 and SEQ ID NOS:1-5,191.

Of particular interest are polypeptides involved in the degradation of intermediary metabolites as well as
non-macromolecular metabolism. Such enzymes include amylases, glucose oxidases, and catalase.

Proteolytic enzymes are another class of commercially important enzymes. Proteolytic enzymes find use in a
number of industrial processes including the processing of flax and other vegetable fibers, in the extraction, clarification
and depectinization of fruit juices, in the extraction of vegetable oil and in the maceration of fruits and vegetables to
give unicellular fruits. A detailed review of the proteolytic enzymes used in the food industry is provided in Rombouts
et al., Symbiosis^: 79 (1986) and Voragen et al. in BIOCATALYSTS IN AGRICULTURAL BIOTECHNOLOGY,
Whitaker et al., Eds., American Chemical Society Symposium Series 389:93 (1989).

The metabolism of sugars is an important aspect of the primary metabolism of Staphylococcus aureus. Enzymes
involved in the degradation of sugars, such as, particularly, glucose, galactose, fructose and xylose, can be used in
industrial fermentation. Some of the important sugar transforming enzymes, from a commercial viewpoint, include
sugar isomerases such as glucose isomerase. Other metabolic enzymes have found commercial use, such as glucose
oxidases, which produce ketogulonic acid (KGA). KGA is an intermediate in the commercial production of ascorbic
acid using Reichstein's procedure, as described in Krueger et al., Biotechnology 6(A), Rhine et al., Eds., Verlag
Press, Weinheim, Germany (1984).

Glucose oxidase (GOD) is commercially available and has been used in purified form as well as in an immobilized
form for the deoxygenation of beer. See, for instance, Hartmeir et al., Biotechnology Letters V. 21 (1979). The most
important application of GOD is the industrial scale fermentation of gluconic acid. Gluconic acids are marketed for
use in the detergent, textile, leather, photographic, pharmaceutical, food, feed and concrete industries, as described,
for example, in Bigelis et al., beginning on page 357 in GENE MANIPULATIONS AND FUNGI, Benett et al., Eds.,
Academic Press, New York (1985). In addition to industrial applications, GOD has found applications in medicine for
quantitative determination of glucose in body fluids and, recently, in biotechnology for analyzing syrups from starch and
cellulose hydrolysates. This application is described in Owusu et al., Biochim. et Biophys. Acta 872:83 (1986), for
instance.

The main sweetener used in the world today is sugar which comes from sugar beets and sugar cane. In the field
of industrial enzymes, the glucose isomerase process shows the largest expansion in the market today. Initially, soluble
enzymes were used and later immobilized enzymes were developed (Krueger et al., Biotechnology, The Textbook of
Industrial Microbiology, Sinauer Associated Incorporated, Sunderland, Massachusetts (1990)). Today, the use of
glucose-produced high fructose syrups is by far the largest industrial business using immobilized enzymes. A review of
the industrial use of these enzymes is provided by Jorgensen, Starch 40:307 (1988).

Proteinases, such as alkaline serine proteinases, are used as detergent additives and thus represent one of the
largest volumes of microbial enzymes used in the industrial sector. Because of their industrial importance, there is a
large body of published and unpublished information regarding the use of these enzymes in industrial processes. (See
Faultman et al., Acid Proteases Structure Function and Biology, Tang, J., ed., Plenum Press, New York (1977) and
Godfrey et al., Industrial Enzymes, MacMillan Publishers, Surrey, UK (1983) and Hepner et al., Report Industrial
Enzymes by 1990, Hel Hepner & Associates, London (1986)).

Another class of commercially usable proteins of the present invention are the microbial lipases, described by, for
instance, Macrae et al., Philosophical Transactions of the Chiral Society of London 310:227 (1985) and Poserke,
Journal of the American Oil Chemist Society 61:1758 (1984). A major use of lipases is in the fat and oil industry for the
production of neutral glycerides using lipase catalyzed inter-esterification of readily available triglycerides. Applications
of lipases include use as a detergent additive to facilitate the removal of fats from fabrics in the course of the
washing procedures.

The use of enzymes, and in particular microbial enzymes, as catalysts for key steps in the synthesis of complex
organic molecules is gaining popularity at a great rate. One area of great interest is the preparation of chiral
intermediates. Preparation of chiral intermediates is of interest to a wide range of synthetic chemists, particularly those scientists
involved with the preparation of new pharmaceuticals, agrochemicals, fragrances and flavors. (See Davies et al.,
Recent Advances in the Generation of Chiral Intermediates Using Enzymes, CRC Press, Boca Raton, Florida (1990)).
The following reactions catalyzed by enzymes are of interest to organic chemists: hydrolysis of carboxylic acid esters,
phosphate esters, amides and nitriles, esterification reactions, trans-esterification reactions, synthesis of amides,
reduction of alkanones and oxoalkanates, oxidation of alcohols to carbonyl compounds, oxidation of sulfides to sulfoxides,
and carbon bond forming reactions such as the aldol reaction.

When considering the use of an enzyme encoded by one of the ORFs of the present invention for biotransformation
and organic synthesis it is sometimes necessary to consider the respective advantages and disadvantages of using a
microorganism as opposed to an isolated enzyme. The pros and cons of using a whole cell system on the one hand or an
isolated partially purified enzyme on the other hand have been described in detail by Bud et al., Chemistry in Britain
(1987), p. 127.

Amino transferases, enzymes involved in the biosynthesis and metabolism of amino acids, are useful in the catalytic
production of amino acids. An advantage of using microbial based enzyme systems is that the amino transferase
enzymes catalyze the stereoselective synthesis of only L-amino acids and generally possess uniformly high catalytic
rates. A description of the use of amino transferases for amino acid production is provided by Roselle-David, Methods
of Enzymology 136:479 (1987).

Another category of useful proteins encoded by the ORFs of the present invention includes enzymes involved in
nucleic acid synthesis, repair, and recombination. A variety of commercially important enzymes have previously been
isolated from members of Staphylococcus aureus. These include Sau3A and Sau96I.

2. Generation of Antibodies

As described here, the proteins of the present invention, as well as homologs thereof, can be used in a variety
of procedures and methods known in the art which are currently applied to other proteins. The proteins of the present
invention can further be used to generate an antibody which selectively binds the protein. Such antibodies can be
either monoclonal or polyclonal antibodies, as well as fragments of these antibodies, and humanized forms.

The invention further provides antibodies which selectively bind to one of the proteins of the present invention and 
hybridomas which produce these antibodies. A hybridoma is an immortalized cell line which is capable of secreting a 
specific monoclonal antibody. 

In general, techniques for preparing polyclonal and monoclonal antibodies as well as hybridomas capable of
producing the desired antibody are well known in the art (Campbell, A. M., MONOCLONAL ANTIBODY TECHNOLOGY:
LABORATORY TECHNIQUES IN BIOCHEMISTRY AND MOLECULAR BIOLOGY, Elsevier Science Publishers,
Amsterdam, The Netherlands (1984); St. Groth et al., J. Immunol. Methods 35:1-21 (1980), Kohler and Milstein, Nature
256:495-497 (1975)), the trioma technique, the human B-cell hybridoma technique (Kozbor et al., Immunology Today
4:72 (1983), pgs. 77-96 of Cole et al., in MONOCLONAL ANTIBODIES AND CANCER THERAPY, Alan R. Liss, Inc.
(1985)).

Any animal (mouse, rabbit, etc.) which is known to produce antibodies can be immunized with the pseudogene
polypeptide. Methods for immunization are well known in the art. Such methods include subcutaneous or intraperitoneal
injection of the polypeptide. One skilled in the art will recognize that the amount of the protein encoded by the ORF of
the present invention used for immunization will vary based on the animal which is immunized, the antigenicity of the
peptide and the site of injection.

The protein which is used as an immunogen may be modified or administered in an adjuvant in order to increase
the protein's antigenicity. Methods of increasing the antigenicity of a protein are well known in the art and include, but
are not limited to, coupling the antigen with a heterologous protein (such as globulin or galactosidase) or through the
inclusion of an adjuvant during immunization.

For monoclonal antibodies, spleen cells from the immunized animals are removed, fused with myeloma cells, such 
as SP2/0-Ag14 myeloma cells, and allowed to become monoclonal antibody producing hybridoma cells. 

Any one of a number of methods well known in the art can be used to identify the hybridoma cell which produces
an antibody with the desired characteristics. These include screening the hybridomas with an ELISA assay, western
blot analysis, or radioimmunoassay (Lutz et al., Exp. Cell Res. 175: 109-124 (1988)).

Hybridomas secreting the desired antibodies are cloned and the class and subclass are determined using procedures
known in the art (Campbell, A. M., Monoclonal Antibody Technology: Laboratory Techniques in Biochemistry and
Molecular Biology, Elsevier Science Publishers, Amsterdam, The Netherlands (1984)).

Techniques described for the production of single chain antibodies (U. S. Patent 4,946,778) can be adapted to
produce single chain antibodies to proteins of the present invention.

For polyclonal antibodies, antibody-containing antiserum is isolated from the immunized animal and is screened for
the presence of antibodies with the desired specificity using one of the above-described procedures.

The present invention further provides the above-described antibodies in detectably labelled form. Antibodies can
be detectably labelled through the use of radioisotopes, affinity labels (such as biotin, avidin, etc.), enzymatic labels
(such as horseradish peroxidase, alkaline phosphatase, etc.), fluorescent labels (such as FITC or rhodamine, etc.),
paramagnetic atoms, etc. Procedures for accomplishing such labelling are well known in the art (for example, see
Sternberger et al., J. Histochem. Cytochem. 18:315 (1970); Bayer, E. A. et al., Meth. Enzym. 62:308 (1979); Engval,
E. et al., Immunol. 109:129 (1972); Goding, J. W., J. Immunol. Meth. 13:215 (1976)).

The labelled antibodies of the present invention can be used for in vitro, in vivo, and in situ assays to identify cells
or tissues in which a fragment of the Staphylococcus aureus genome is expressed.

The present invention further provides the above-described antibodies immobilized on a solid support. Examples
of such solid supports include plastics such as polycarbonate, complex carbohydrates such as agarose and sepharose,
acrylic resins such as polyacrylamide, and latex beads. Techniques for coupling antibodies to such solid supports
are well known in the art (Weir, D. M. et al., "Handbook of Experimental Immunology" 4th Ed., Blackwell Scientific
Publications, Oxford, England, Chapter 10 (1986); Jacoby, W. D. et al., Meth. Enzym. 34, Academic Press, N. Y. (1974)).
The immobilized antibodies of the present invention can be used for in vitro, in vivo, and in situ assays as well as for
immunoaffinity purification of the proteins of the present invention.

3. Diagnostic Assays and Kits

The present invention further provides methods to identify the expression of one of the ORFs of the present
invention, or homolog thereof, in a test sample, using one of the DFs, antigens or antibodies of the present invention.

In detail, such methods comprise incubating a test sample with one or more of the antibodies, or one or more of
the DFs, or one or more antigens of the present invention and assaying for binding of the DFs, antigens or antibodies
to components within the test sample.

Conditions for incubating a DF, antigen or antibody with a test sample vary. Incubation conditions depend on the
format employed in the assay, the detection methods employed, and the type and nature of the DF or antibody used
in the assay. One skilled in the art will recognize that any one of the commonly available hybridization, amplification
or immunological assay formats can readily be adapted to employ the DFs, antigens or antibodies of the present
invention. Examples of such assays can be found in Chard, T., An Introduction to Radioimmunoassay and Related
Techniques, Elsevier Science Publishers, Amsterdam, The Netherlands (1986); Bullock, G. R. et al., Techniques in
Immunocytochemistry, Academic Press, Orlando, FL Vol. 1 (1982), Vol. 2 (1983), Vol. 3 (1985); Tijssen, P., Practice
and Theory of Enzyme Immunoassays: Laboratory Techniques in Biochemistry and Molecular Biology, Elsevier
Science Publishers, Amsterdam, The Netherlands (1985); and PCT publication WO95/32291, all of which are hereby
incorporated herein by reference.

The test samples of the present invention include cells, protein or membrane extracts of cells, or biological fluids
such as sputum, blood, serum, plasma, or urine. The test sample used in the above-described method will vary based
on the assay format, nature of the detection method and the tissues, cells or extracts used as the sample to be assayed.
Methods for preparing protein extracts or membrane extracts of cells are well known in the art and can readily be
adapted in order to obtain a sample which is compatible with the system utilized.

In another embodiment of the present invention, kits are provided which contain the necessary reagents to carry
out the assays of the present invention.

Specifically, the invention provides a compartmentalized kit to receive, in close confinement, one or more containers
which comprise: (a) a first container comprising one of the DFs, antigens or antibodies of the present invention; and
(b) one or more other containers comprising one or more of the following: wash reagents, reagents capable of detecting
presence of a bound DF, antigen or antibody.

In detail, a compartmentalized kit includes any kit in which reagents are contained in separate containers. Such
containers include small glass containers, plastic containers or strips of plastic or paper. Such containers allow one
to efficiently transfer reagents from one compartment to another compartment such that the samples and reagents are
not cross-contaminated, and the agents or solutions of each container can be added in a quantitative fashion from one
compartment to another. Such containers will include a container which will accept the test sample, a container which
contains the antibodies used in the assay, containers which contain wash reagents (such as phosphate buffered saline,
Tris-buffers, etc.), and containers which contain the reagents used to detect the bound antibody, antigen or DF.

Types of detection reagents include labelled nucleic acid probes, labelled secondary antibodies, or in the
alternative, if the primary antibody is labelled, the enzymatic, or antibody binding reagents which are capable of reacting with
the labelled antibody. One skilled in the art will readily recognize that the disclosed DFs, antigens and antibodies of the
present invention can be readily incorporated into one of the established kit formats which are well known in the art.

4. Screening Assay for Binding Agents

Using the isolated proteins of the present invention, the present invention further provides methods of obtaining
and identifying agents which bind to a protein encoded by one of the ORFs of the present invention or to one of the
Staphylococcus aureus fragments and contigs herein described.

In general, such methods comprise steps of:

(a) contacting an agent with an isolated protein encoded by one of the ORFs of the present invention, or an isolated
fragment of the Staphylococcus aureus genome; and

(b) determining whether the agent binds to said protein or said fragment. 

The agents screened in the above assay can be, but are not limited to, peptides, carbohydrates, vitamin derivatives,
or other pharmaceutical agents. The agents can be selected and screened at random or rationally selected or designed
using protein modeling techniques.

For random screening, agents such as peptides, carbohydrates, pharmaceutical agents and the like are selected 
at random and are assayed for their ability to bind to the protein encoded by the ORF of the present invention. 

Alternatively, agents may be rationally selected or designed. As used herein, an agent is said to be "rationally
selected or designed" when the agent is chosen based on the configuration of the particular protein. For example, one
skilled in the art can readily adapt currently available procedures to generate peptides, pharmaceutical agents and the
like capable of binding to a specific peptide sequence in order to generate rationally designed antipeptide peptides
(for example, see Hurby et al., "Application of Synthetic Peptides: Antisense Peptides," in Synthetic Peptides, A User's
Guide, W. H. Freeman, NY (1992), pp. 289-307, and Kaspczak et al., Biochemistry 28:9230-8 (1989)), or pharmaceutical
agents, or the like.

In addition to the foregoing, one class of agents of the present invention, as broadly described, can be used to
control gene expression through binding to one of the ORFs or EMFs of the present invention. As described above,
such agents can be randomly screened or rationally designed/selected. Targeting the ORF or EMF allows a skilled
artisan to design sequence specific or element specific agents, modulating the expression of either a single ORF or
multiple ORFs which rely on the same EMF for expression control.

One class of DNA binding agents are agents which contain base residues which hybridize or form a triple helix by
binding to DNA or RNA. Such agents can be based on the classic phosphodiester, ribonucleic acid backbone, or can
be a variety of sulfhydryl or polymeric derivatives which have base attachment capacity.

Agents suitable for use in these methods usually contain 20 to 40 bases and are designed to be complementary
to a region of the gene involved in transcription (triple helix - see Lee et al., Nucl. Acids Res. 6:3073 (1979); Cooney
et al., Science 241:456 (1988); and Dervan et al., Science 251:1360 (1991)) or to the mRNA itself (antisense - Okano,
J. Neurochem. 56:560 (1991); Oligodeoxynucleotides as Antisense Inhibitors of Gene Expression, CRC Press, Boca
Raton, FL (1988)). Triple helix formation optimally results in a shut-off of RNA transcription from DNA, while antisense
RNA hybridization blocks translation of an mRNA molecule into polypeptide. Both techniques have been demonstrated
to be effective in model systems. Information contained in the sequences of the present invention can be used to design
antisense and triple helix-forming oligonucleotides, and other DNA binding agents.

5. Pharmaceutical Compositions and Vaccines 

The present invention further provides pharmaceutical agents which can be used to modulate the growth or
pathogenicity of Staphylococcus aureus, or another related organism, in vivo or in vitro. As used herein, a "pharmaceutical
agent" is defined as a composition of matter which can be formulated using known techniques to provide a
pharmaceutical composition. As used herein, the "pharmaceutical agents of the present invention" refers to the pharmaceutical
agents which are derived from the proteins encoded by the ORFs of the present invention or are agents which are
identified using the herein described assays.

As used herein, a pharmaceutical agent is said to "modulate the growth or pathogenicity of Staphylococcus aureus
or a related organism, in vivo or in vitro," when the agent reduces the rate of growth, rate of division, or viability of the
organism in question. The pharmaceutical agents of the present invention can modulate the growth or pathogenicity
of an organism in many fashions, although an understanding of the underlying mechanism of action is not needed to
practice the use of the pharmaceutical agents of the present invention. Some agents will modulate the growth or
pathogenicity by binding to an important protein, thus blocking the biological activity of the protein, while other agents may
bind to a component of the outer surface of the organism, blocking attachment or rendering the organism more prone
to attack by the body's natural immune system. Alternatively, the agent may comprise a protein encoded by one of the ORFs
of the present invention and serve as a vaccine. The development and use of vaccines derived from membrane
associated polypeptides are well known in the art. The inventors have identified particularly preferred immunogenic
Staphylococcus aureus polypeptides for use as vaccines. Such immunogenic polypeptides are described above and
summarized in Table 4, below.

As used herein, a "related organism" is a broad term which refers to any organism whose growth or pathogenicity
can be modulated by one of the pharmaceutical agents of the present invention. In general, such an organism will
contain a homolog of the protein which is the target of the pharmaceutical agent or the protein used as a vaccine. As
such, related organisms do not need to be bacterial but may be fungal or viral pathogens.

The pharmaceutical agents and compositions of the present invention may be administered in a convenient
manner, such as by the oral, topical, intravenous, intraperitoneal, intramuscular, subcutaneous, intranasal or intradermal
routes. The pharmaceutical compositions are administered in an amount which is effective for treatment and/or
prophylaxis of the specific indication. In general, they are administered in an amount of at least about 1 mg/kg body weight
and in most cases they will be administered in an amount not in excess of about 1 g/kg body weight per day. In most
cases, the dosage is from about 0.1 mg/kg to about 10 g/kg body weight daily, taking into account the routes of
administration, symptoms, etc.

The agents of the present invention can be used in native form or can be modified to form a chemical derivative.
As used herein, a molecule is said to be a "chemical derivative" of another molecule when it contains additional chemical
moieties not normally a part of the molecule. Such moieties may improve the molecule's solubility, absorption, biological
half life, etc. The moieties may alternatively decrease the toxicity of the molecule, eliminate or attenuate any undesirable
side effect of the molecule, etc. Moieties capable of mediating such effects are disclosed in, among other sources,
REMINGTON'S PHARMACEUTICAL SCIENCES (1980) cited elsewhere herein.

For example, such moieties may change an immunological character of the functional derivative, such as affinity
for a given antibody. Such changes in immunomodulation activity are measured by the appropriate assay, such as a
competitive type immunoassay. Modifications of such protein properties as redox or thermal stability, biological
half-life, hydrophobicity, susceptibility to proteolytic degradation or the tendency to aggregate with carriers or into multimers
also may be effected in this way and can be assayed by methods well known to the skilled artisan.

The therapeutic effects of the agents of the present invention may be obtained by providing the agent to a patient
by any suitable means (e.g., by inhalation, intravenously, intramuscularly, subcutaneously, enterally, or parenterally). It is
preferred to administer the agent of the present invention so as to achieve an effective concentration within the blood
or tissue in which the growth of the organism is to be controlled. To achieve an effective blood concentration, the
preferred method is to administer the agent by injection. The administration may be by continuous infusion, or by single
or multiple injections.

In providing a patient with one of the agents of the present invention, the dosage of the administered agent will
vary depending upon such factors as the patient's age, weight, height, sex, general medical condition, previous medical
history, etc. In general, it is desirable to provide the recipient with a dosage of agent which is in the range of from about
1 pg/kg to 10 mg/kg (body weight of patient), although a lower or higher dosage may be administered. The
therapeutically effective dose can be lowered by using combinations of the agents of the present invention or another agent.

As used herein, two or more compounds or agents are said to be administered "in combination" with each other
when either (1) the physiological effects of each compound, or (2) the serum concentrations of each compound can
be measured at the same time. The composition of the present invention can be administered concurrently with, prior
to, or following the administration of the other agent.

The agents of the present invention are intended to be provided to recipient subjects in an amount sufficient to 
decrease the rate of growth (as defined above) of the target organism. 

The administration of the agent(s) of the invention may be for either a "prophylactic" or "therapeutic" purpose.
When provided prophylactically, the agent(s) are provided in advance of any symptoms indicative of the organism's
growth. The prophylactic administration of the agent(s) serves to prevent, attenuate, or decrease the rate of onset of
any subsequent infection. When provided therapeutically, the agent(s) are provided at (or shortly after) the onset of an
indication of infection. The therapeutic administration of the compound(s) serves to attenuate the pathological
symptoms of the infection and to increase the rate of recovery.

The agents of the present invention are administered to a subject, such as a mammal, or a patient, in a
pharmaceutically acceptable form and in a therapeutically effective concentration. A composition is said to be
"pharmacologically acceptable" if its administration can be tolerated by a recipient patient. Such an agent is said to be administered
in a "therapeutically effective amount" if the amount administered is physiologically significant. An agent is
physiologically significant if its presence results in a detectable change in the physiology of a recipient patient.

The agents of the present invention can be formulated according to known methods to prepare pharmaceutically
useful compositions, whereby these materials, or their functional derivatives, are combined in admixture with a
pharmaceutically acceptable carrier vehicle. Suitable vehicles and their formulation, inclusive of other human proteins,
e.g., human serum albumin, are described, for example, in REMINGTON'S PHARMACEUTICAL SCIENCES, 16th Ed.,
Osol, A., Ed., Mack Publishing, Easton PA (1980). In order to form a pharmaceutically acceptable composition suitable
for effective administration, such compositions will contain an effective amount of one or more of the agents of the
present invention, together with a suitable amount of carrier vehicle.

Additional pharmaceutical methods may be employed to control the duration of action. Controlled release preparations
may be achieved through the use of polymers to complex or absorb one or more of the agents of the present invention.
The controlled delivery may be effectuated by a variety of well known techniques, including formulation with
macromolecules such as, for example, polyesters, polyamino acids, polyvinylpyrrolidone, ethylenevinylacetate,
methylcellulose, carboxymethylcellulose, or protamine sulfate, adjusting the concentration of the macromolecules and the agent
in the formulation, and by appropriate use of methods of incorporation, which can be manipulated to effectuate a desired
time course of release. Another possible method to control the duration of action by controlled release preparations is
to incorporate agents of the present invention into particles of a polymeric material such as polyesters, polyamino
acids, hydrogels, poly(lactic acid) or ethylene vinylacetate copolymers. Alternatively, instead of incorporating these
agents into polymeric particles, it is possible to entrap these materials in microcapsules prepared, for example, by
coacervation techniques or by interfacial polymerization with, for example, hydroxymethylcellulose or gelatine
microcapsules and poly(methylmethacrylate) microcapsules, respectively, or in colloidal drug delivery systems, for example,
liposomes, albumin microspheres, microemulsions, nanoparticles, and nanocapsules or in macroemulsions. Such
techniques are disclosed in REMINGTON'S PHARMACEUTICAL SCIENCES (1980).

The invention further provides a pharmaceutical pack or kit comprising one or more containers filled with one or
more of the ingredients of the pharmaceutical compositions of the invention. Associated with such container(s) can be
a notice in the form prescribed by a governmental agency regulating the manufacture, use or sale of pharmaceuticals
or biological products, which notice reflects approval by the agency of manufacture, use or sale for human
administration.

In addition, the agents of the present invention may be employed in conjunction with other therapeutic compounds. 
6. Shot-Gun Approach to Megabase DNA Sequencing 

The present invention further demonstrates that a large sequence can be sequenced using a random shotgun 
approach. This procedure, described in detail in the examples that follow, has eliminated the up front cost of isolating 
and ordering overlapping or contiguous subclones prior to the start of the sequencing protocols. 

Certain aspects of the present invention are described in greater detail in the examples that follow. The examples
are provided by way of illustration. Other aspects and embodiments of the present invention are contemplated by the
inventors, as will be clear to those of skill in the art from reading the present disclosure.

ILLUSTRATIVE EXAMPLES

LIBRARIES AND SEQUENCING

1. Shotgun Sequencing Probability Analysis

The overall strategy for a shotgun approach to whole genome sequencing follows from the Lander and Waterman
(Lander and Waterman, Genomics 2: 231 (1988)) application of the equation for the Poisson distribution. According
to this treatment, the probability, $P_0$, that any given base in a sequence of size L, in nucleotides, is not sequenced after
a certain amount, n, in nucleotides, of random sequence has been determined can be calculated by the equation
$P_0 = e^{-m}$, where m is n/L, the fold coverage. For instance, for a genome of 2.8 Mb, m = 1 when 2.8 Mb of sequence has
been randomly generated (1X coverage). At that point, $P_0 = e^{-1} = 0.37$. The probability that any given base has not
been sequenced is the same as the probability that any region of the whole sequence L has not been determined and,
therefore, is equivalent to the fraction of the whole sequence that has yet to be determined. Thus, at one-fold coverage,
approximately 37% of a polynucleotide of size L, in nucleotides, has not been sequenced. When 14 Mb of sequence
has been generated, coverage is 5X for a 2.8 Mb genome and the unsequenced fraction drops to 0.0067, or 0.67%. 5X coverage
of a 2.8 Mb sequence can be attained by sequencing approximately 17,000 random clones from both insert ends with
an average sequence read length of 410 bp.
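
The coverage arithmetic above can be reproduced with a short calculation. The following is a minimal sketch in Python, not part of the original disclosure; the function names are hypothetical, and it assumes fold coverage is total sequenced nucleotides divided by genome size and that each clone contributes two reads of the stated average length.

```python
import math

def unsequenced_fraction(bases_sequenced, genome_size):
    """Poisson estimate P0 = exp(-m), with fold coverage m = n / L."""
    m = bases_sequenced / genome_size
    return math.exp(-m)

def clones_for_coverage(fold, genome_size, read_length, reads_per_clone=2):
    """Clones needed to reach a given fold coverage (assumes two reads per clone)."""
    return math.ceil(fold * genome_size / (read_length * reads_per_clone))

GENOME_SIZE = 2_800_000  # 2.8 Mb, as in the text

print(unsequenced_fraction(2_800_000, GENOME_SIZE))   # 1X coverage, about 0.37
print(unsequenced_fraction(14_000_000, GENOME_SIZE))  # 5X coverage, about 0.0067
print(clones_for_coverage(5, GENOME_SIZE, 410))       # roughly 17,000 clones
```

Under these assumptions the clone count comes out slightly above 17,000, consistent with the approximate figure given in the text.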

Similarly, the total gap length, G, is determined by the equation $G = Le^{-m}$, and the average gap size, g, follows the
equation $g = L/N$, where N is the number of random sequence reads (about 34,000 when 17,000 clones are sequenced
from both insert ends). Thus, 5X coverage leaves about 240 gaps averaging about 82 bp in size in a sequence of a
polynucleotide 2.8 Mb long.
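
The gap estimate can be checked the same way. This is a hedged sketch only: it assumes that the denominator in the average gap size formula counts sequence reads (34,000 reads from 17,000 clones sequenced at both ends), an interpretation inferred from the 82 bp figure rather than stated in the text, and the function name is hypothetical.

```python
import math

def gap_statistics(genome_size, n_reads, read_length):
    """Return (fold coverage, total gap length, average gap size, gap count)."""
    m = n_reads * read_length / genome_size   # fold coverage
    total_gap = genome_size * math.exp(-m)    # G = L * exp(-m)
    average_gap = genome_size / n_reads       # g = L / (number of reads), an assumption
    gap_count = total_gap / average_gap       # implied number of gaps
    return m, total_gap, average_gap, gap_count

m, total_gap, average_gap, gap_count = gap_statistics(2_800_000, 34_000, 410)
print(f"coverage {m:.2f}X, total gap {total_gap:.0f} bp, "
      f"average gap {average_gap:.0f} bp, about {gap_count:.0f} gaps")
```

With these inputs the average gap is about 82 bp and the implied gap count is in the low hundreds, in line with the approximate values quoted above.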

The treatment above is essentially that of Lander and Waterman, Genomics 2: 231 (1988). 

2. Random Library Construction 

In order to approximate the random model described above during actual sequencing, a nearly ideal library of 
cloned genomic fragments is required. The following library construction procedure was developed to achieve this end. 

Staphylococcus aureus DNA was prepared by phenol extraction. A mixture containing 600 ug DNA in 3.3 ml of
300 mM sodium acetate, 10 mM Tris-HCl, 1 mM Na-EDTA, 30% glycerol was sonicated for 1 min. at 0°C in a Branson
Model 450 Sonicator at the lowest energy setting using a 3 mm probe. The sonicated DNA was ethanol precipitated
and redissolved in 500 ul TE buffer.

To create blunt-ends, a 100 ul aliquot of the resuspended DNA was digested with 5 units of BAL31 nuclease (New
England BioLabs) for 10 min at 30°C in 200 ul BAL31 buffer. The digested DNA was phenol-extracted,
ethanol-precipitated, redissolved in 100 ul TE buffer, and then size-fractionated by electrophoresis through a 1.0% low melting
temperature agarose gel. The section containing DNA fragments 1.6-2.0 kb in size was excised from the gel, and the
LGT agarose was melted and the resulting solution was extracted with phenol to separate the agarose from the DNA.
DNA was ethanol precipitated and redissolved in 20 ul of TE buffer for ligation to vector.

A two-step ligation procedure was used to produce a plasmid library with 97% inserts, of which >99% were single
inserts. The first ligation mixture (50 ul) contained 2 ug of DNA fragments, 2 ug pUC18 DNA (Pharmacia) cut with SmaI
and dephosphorylated with bacterial alkaline phosphatase, and 10 units of T4 ligase (GIBCO/BRL) and was incubated
at 14°C for 4 hr. The ligation mixture then was phenol extracted and ethanol precipitated, and the precipitated DNA
was dissolved in 20 ul TE buffer and electrophoresed on a 1.0% low melting agarose gel. Discrete bands in a ladder
were visualized by ethidium bromide-staining and UV illumination and identified by size as insert (i), vector (v), v+i,
v+2i, v+3i, etc. The portion of the gel containing v+i DNA was excised and the v+i DNA was recovered and resuspended
into 20 ul TE. The v+i DNA then was blunt-ended by T4 polymerase treatment for 5 min. at 37°C in a reaction mixture
(50 ul) containing the v+i linears, 500 uM each of the 4 dNTPs, and 9 units of T4 polymerase (New England BioLabs),
under recommended buffer conditions. After phenol extraction and ethanol precipitation the repaired v+i linears were
dissolved in 20 ul TE. The final ligation to produce circles was carried out in a 50 ul reaction containing 5 ul of v+i
linears and 5 units of T4 ligase at 14°C overnight. After 10 min. at 70°C the following day, the reaction mixture was
stored at -20°C.

This two-stage procedure resulted in a molecularly random collection of single-insert plasmid recombinants with 
minimal contamination from double-insert chimeras (<1%) or free vector (<3%). 

Since deviation from randomness can arise from propagation of the DNA in the host, E. coli host cells deficient in all
recombination and restriction functions (A. Greener, Strategies 3 (1):5 (1990)) were used to prevent rearrangements,
deletions, and loss of clones by restriction. Furthermore, transformed cells were plated directly on antibiotic diffusion
plates to avoid the usual broth recovery phase which allows multiplication and selection of the most rapidly growing cells.

Plating was carried out as follows. A 100 ul aliquot of Epicurian Coli SURE II Supercompetent Cells (Stratagene
200152) was thawed on ice and transferred to a chilled Falcon 2059 tube on ice. A 1.7 ul aliquot of 1.42 M
beta-mercaptoethanol was added to the aliquot of cells to a final concentration of 25 mM. Cells were incubated on ice for
10 min. A 1 ul aliquot of the final ligation was added to the cells and incubated on ice for 30 min. The cells were heat
pulsed for 30 sec. at 42°C and placed back on ice for 2 min. The outgrowth period in liquid culture was eliminated
from this protocol in order to minimize the preferential growth of any given transformed cell. Instead the transformation
mixture was plated directly on a nutrient rich SOB plate containing a 5 ml bottom layer of SOB agar (5% SOB agar:
20 g tryptone, 5 g yeast extract, 0.5 g NaCl, 1.5% Difco Agar per liter of media). The 5 ml bottom layer is supplemented
with 0.4 ml of 50 mg/ml ampicillin per 100 ml SOB agar. The 15 ml top layer of SOB agar is supplemented with 1 ml
X-Gal (2%), 1 ml MgCl2 (1 M), and 1 ml MgSO4 per 100 ml SOB agar. The 15 ml top layer was poured just prior to plating.
Our titer was approximately 100 colonies/10 ul aliquot of transformation.

All colonies were picked for template preparation regardless of size. Thus, only clones lost due to "poison" DNA
or deleterious gene products would be deleted from the library, resulting in a slight increase in gap number over that
expected.

3. Random DNA Sequencing 

High quality double stranded DNA plasmid templates were prepared using an alkaline lysis method developed in
collaboration with 5 Prime -> 3 Prime, Inc. (Boulder, CO). Plasmid preparation was performed in a 96-well format for all
stages of DNA preparation from bacterial growth through final DNA purification. Average template concentration was
determined by running 25% of the samples on an agarose gel. DNA concentrations were not adjusted.

Templates were also prepared from a Staphylococcus aureus lambda genomic library. An unamplified library was
constructed in Lambda DASH II vector (Stratagene). Staphylococcus aureus DNA (> 100 kb) was partially digested in
a reaction mixture (200 ul) containing 50 ug DNA, 1X Sau3AI buffer, 20 units Sau3AI for 6 min. at 23°C. The digested
DNA was phenol-extracted and centrifuged over a 10-40% sucrose gradient. Fractions containing genomic DNA of
15-25 kb were recovered by precipitation. One ul of fragments was used with 1 ul of DASH II vector (Stratagene) in
the recommended ligation reaction. One ul of the ligation mixture was used per packaging reaction following the
recommended protocol with the Gigapack II XL Packaging Extract. Phage were plated directly without amplification from
the packaging mixture (after dilution with 500 ul of recommended SM buffer and chloroform treatment). Yield was about
2.5 x 10^9 pfu/ul.

An amplified library was prepared from the primary packaging mixture according to the manufacturer's protocol.
The amplified library is stored frozen in 7% dimethylsulfoxide. The phage titer is approximately 1x10 s pfu/ml.

Mini-liquid lysates (0.1 ul) are prepared from randomly selected plaques and template is prepared by long range 
PCR. Samples are PCR amplified using modified T3 and T7 primers, and Elongase Supermix (LTI). 

Sequencing reactions are carried out on plasmid templates using a combination of two workstations (BIOMEK
1000 and Hamilton Microlab 2200) and the Perkin-Elmer 9600 thermocycler with Applied Biosystems PRISM Ready
Reaction Dye Primer Cycle Sequencing Kits for the M13 forward (M13-21) and the M13 reverse (M13RP1) primers.
Dye terminator sequencing reactions are carried out on the lambda templates on a Perkin-Elmer 9600 Thermocycler
using the Applied Biosystems Ready Reaction Dye Terminator Cycle Sequencing kits. Modified T7 and T3 primers are
used to sequence the ends of the inserts from the Lambda DASH II library. Sequencing reactions are analyzed on a combination
of ABI 373 DNA Sequencers and ABI 377 DNA sequencers. All of the dye terminator sequencing reactions are analyzed
using the 2X 9 hour module on the ABI 377. Dye primer reactions are analyzed on a combination of ABI 373 and ABI
377 DNA sequencers. The overall sequencing success rate is approximately 85% for M13-21 and M13RP1
sequences and 65% for dye-terminator reactions. The average usable read length is 485 bp for M13-21 sequences,
445 bp for M13RP1 sequences, and 375 bp for dye-terminator reactions.
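
As a rough illustration of how the success rates and read lengths above combine, the following Python sketch estimates the usable bases expected per attempted reaction. It is not part of the original disclosure; the names are hypothetical, and it assumes that failed reactions contribute no usable sequence and that the stated averages apply to successful reactions only.

```python
# (success rate, average usable read length in bp), taken from the figures above
CHEMISTRIES = {
    "M13-21": (0.85, 485),
    "M13RP1": (0.85, 445),
    "dye-terminator": (0.65, 375),
}

def expected_usable_bases(success_rate, read_length):
    """Expected usable bases per attempted reaction (failed reactions yield zero)."""
    return success_rate * read_length

for name, (rate, length) in CHEMISTRIES.items():
    print(f"{name}: {expected_usable_bases(rate, length):.0f} usable bp per attempt")
```

On these assumptions the dye-primer reactions yield noticeably more usable sequence per attempt than the dye-terminator reactions.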

4. Protocol for Automated Cycle Sequencing

The sequencing was carried out using Hamilton Microstation 2200, Perkin Elmer 9600 thermocyclers, ABI 373
and ABI 377 Automated DNA Sequencers. The Hamilton combines pre-aliquoted templates and reaction mixes
consisting of deoxy- and dideoxynucleotides, the thermostable Taq DNA polymerase, fluorescently-labelled sequencing
primers, and reaction buffer. Reaction mixes and templates were combined in the wells of a 96-well thermocycling
plate and transferred to the Perkin Elmer 9600 thermocycler. Thirty consecutive cycles of linear amplification (i.e.,
one-primer synthesis) steps were performed including denaturation, annealing of primer and template, and extension (i.e.,
DNA synthesis). A heated lid with rubber gaskets on the thermocycling plate prevents evaporation without the need for
an oil overlay.

Two sequencing protocols were used: one for dye-labelled primers and a second for dye-labelled dideoxy chain
terminators. The shotgun sequencing involves use of four dye-labelled sequencing primers, one for each of the four
terminator nucleotides. Each dye-primer was labelled with a different fluorescent dye, permitting the four individual
reactions to be combined into one lane of the 373 or 377 DNA Sequencer for electrophoresis, detection, and
base-calling. ABI currently supplies premixed reaction mixes in bulk packages containing all the necessary non-template
reagents for sequencing. Sequencing can be done with both plasmid and PCR-generated templates with both
dye-primers and dye-terminators with approximately equal fidelity, although plasmid templates generally give longer usable
sequences.

Thirty-two reactions were loaded per ABI 373 Sequencer each day and 96 samples can be loaded on an ABI 377
per day. Electrophoresis was run overnight (ABI 373) or for 2 1/2 hours (ABI 377) following the manufacturer's protocols.
Following electrophoresis and fluorescence detection, the ABI 373 or ABI 377 performs automatic lane tracking and
base-calling. The lane-tracking was confirmed visually. Each sequence electropherogram (or fluorescence lane trace)
was inspected visually and assessed for quality. Trailing sequences of low quality were removed and the sequence
itself was loaded via software to a Sybase database (archived daily to 8mm tape). Leading vector polylinker sequence
was removed automatically by a software program. Average edited lengths of sequences from the standard ABI 373
or ABI 377 were around 400 bp and depend mostly on the quality of the template used for the sequencing reaction.

INFORMATICS 

1. Data Management 

A number of information management systems for a large-scale sequencing lab have been developed. (For review
see, for instance, Kerlavage et al., Proceedings of the Twenty-Sixth Annual Hawaii International Conference on System
Sciences, IEEE Computer Society Press, Washington, D.C., 585 (1993)). The system used to collect and assemble
the sequence data was developed using the Sybase relational database management system and was designed to
automate data flow wherever possible and to reduce user error. The database stores and correlates all information
collected during the entire operation from template preparation to final analysis of the genome. Because the raw output
of the ABI 373 Sequencers was based on a Macintosh platform and the data management system chosen was based
on a Unix platform, it was necessary to design and implement a variety of multi-user, client-server applications which
allow the raw data as well as analysis results to flow seamlessly into the database with a minimum of user effort.

2. Assembly 

An assembly engine (TIGR Assembler) developed for the rapid and accurate assembly of thousands of sequence
fragments was employed to generate contigs. The TIGR Assembler simultaneously clusters and assembles fragments
of the genome. In order to obtain the speed necessary to assemble more than 10^4 fragments, the algorithm builds a
hash table of 12 bp oligonucleotide subsequences to generate a list of potential sequence fragment overlaps. The
number of potential overlaps for each fragment determines which fragments are likely to fall into repetitive elements.
Beginning with a single seed sequence fragment, TIGR Assembler extends the current contig by attempting to add
the best matching fragment based on oligonucleotide content. The contig and candidate fragment are aligned using a
modified version of the Smith-Waterman algorithm, which provides for optimal gapped alignments (Waterman, M. S.,
Methods in Enzymology 164:765 (1988)). The contig is extended by the fragment only if strict criteria for the quality
of the match are met. The match criteria include the minimum length of overlap, the maximum length of an unmatched
end, and the minimum percentage match. These criteria are automatically lowered by the algorithm in regions of minimal
coverage and raised in regions with a possible repetitive element. Fragments representing the boundaries of
repetitive elements and potentially chimeric fragments are often rejected based on partial mismatches at the ends of
alignments and excluded from the current contig. TIGR Assembler is designed to take advantage of clone size information
coupled with sequencing from both ends of each template. It enforces the constraint that sequence fragments from
two ends of the same template point toward one another in the contig and are located within a certain range of base
pairs (definable for each clone based on the known clone size range for a given library).
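
The oligonucleotide hash table described above can be illustrated with a minimal sketch. This is not the TIGR Assembler code; it only assumes that fragments are plain strings, that a "potential overlap" is any pair of fragments sharing at least one 12-base subsequence, and that reverse-complement matches are ignored for brevity. The function names (`build_kmer_index`, `potential_overlaps`, `overlap_counts`) are hypothetical.

```python
from collections import defaultdict
from itertools import combinations

K = 12  # oligonucleotide length named in the text


def build_kmer_index(fragments, k=K):
    """Map every k-mer to the set of fragment ids that contain it."""
    index = defaultdict(set)
    for frag_id, seq in fragments.items():
        seq = seq.upper()
        for i in range(len(seq) - k + 1):
            index[seq[i:i + k]].add(frag_id)
    return index


def potential_overlaps(fragments, k=K, min_shared=1):
    """Return {(id_a, id_b): number of distinct shared k-mers}."""
    index = build_kmer_index(fragments, k)
    shared = defaultdict(int)
    for ids in index.values():
        # Every pair of fragments holding this k-mer is a candidate overlap.
        for a, b in combinations(sorted(ids), 2):
            shared[(a, b)] += 1
    return {pair: n for pair, n in shared.items() if n >= min_shared}


def overlap_counts(fragments, k=K):
    """Number of candidate overlap partners per fragment.

    An unusually high count is the kind of signal the text associates
    with fragments that fall into repetitive elements.
    """
    counts = {frag_id: 0 for frag_id in fragments}
    for a, b in potential_overlaps(fragments, k):
        counts[a] += 1
        counts[b] += 1
    return counts
```

In a full assembler the candidate pairs would then be passed to a gapped alignment step and filtered by the match criteria listed above; the sketch stops at candidate generation.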

3. Identifying Genes 

The predicted coding regions of the Staphylococcus aureus genome were initially defined with the program zorf,
which finds ORFs of a minimum length. The predicted coding region sequences were used in searches against a
database of all Staphylococcus aureus nucleotide sequences from GenBank (release 92.0), using the BLASTN search
method to identify overlaps of 50 or more nucleotides with at least a 95% identity. Those ORFs with nucleotide sequence
matches are shown in Table 1. The ORFs without such matches were translated to protein sequences and
compared to a non-redundant database of known proteins generated by combining the Swiss-Prot, PIR and GenPept
databases. ORFs of at least 80 amino acids that matched a database protein with BLASTP probability less than or
equal to 0.01 are shown in Table 2. The table also lists assigned functions based on the closest match in the databases.
ORFs of at least 120 amino acids that did not match protein or nucleotide sequences in the databases at these levels
are shown in Table 3.
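
The text does not describe how zorf works internally, so the following is only a sketch of a generic minimum-length ORF scan. It assumes an ORF runs from an in-frame ATG to the next in-frame stop codon and scans all six reading frames; alternative start codons are ignored. The function names and the default of 80 codons (chosen to echo the 80 amino acid threshold used for Table 2) are assumptions for illustration.

```python
STOP_CODONS = {"TAA", "TAG", "TGA"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def orfs_in_strand(seq, min_codons):
    """Yield (start, end) for ATG...stop ORFs on one strand; end is exclusive."""
    for frame in range(3):
        start = None
        for i in range(frame, len(seq) - 2, 3):
            codon = seq[i:i + 3]
            if start is None and codon == "ATG":
                start = i
            elif start is not None and codon in STOP_CODONS:
                # Count coding codons from the ATG up to, not including, the stop.
                if (i - start) // 3 >= min_codons:
                    yield start, i + 3
                start = None


def find_orfs(seq, min_codons=80):
    """Return sorted (strand, start, end) tuples in forward-strand coordinates."""
    seq = seq.upper()
    n = len(seq)
    hits = [("+", s, e) for s, e in orfs_in_strand(seq, min_codons)]
    # Map reverse-strand coordinates back onto the forward strand.
    hits += [("-", n - e, n - s)
             for s, e in orfs_in_strand(reverse_complement(seq), min_codons)]
    return sorted(hits, key=lambda h: (h[1], h[2]))
```

The ORFs returned by such a scan would then be screened against nucleotide and protein databases using the thresholds stated above.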

ILLUSTRATIVE APPLICATIONS 

1. Production of an Antibody to a Staphylococcus aureus Protein 

Substantially pure protein or polypeptide is isolated from the transfected or transformed cells using any one of the
methods known in the art. The protein can also be produced in a recombinant prokaryotic expression system, such as
E. coli, or can be chemically synthesized. Concentration of protein in the final preparation is adjusted, for example, by
concentration on an Amicon filter device, to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the
protein can then be prepared as follows.

2. Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can be prepared from
murine hybridomas according to the classical method of Kohler, G. and Milstein, C., Nature 256:495 (1975) or
modifications of the methods thereof. Briefly, a mouse is repetitively inoculated with a few micrograms of the selected protein
over a period of a few weeks. The mouse is then sacrificed, and the antibody-producing cells of the spleen are isolated.
The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, and the excess unfused cells
are destroyed by growth of the system on selective media comprising aminopterin (HAT media). The successfully fused
cells are diluted and aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is continued.
Antibody-producing clones are identified by detection of antibody in the supernatant fluid of the wells by immunoassay
procedures, such as ELISA, as originally described by Engvall, E., Meth. Enzymol. 70:419 (1980), and modified
methods thereof. Selected positive clones can be expanded and their monoclonal antibody product harvested for use.
Detailed procedures for monoclonal antibody production are described in Davis, L. et al., Basic Methods in Molecular
Biology, Elsevier, New York, Section 21-2 (1989).

3. Polyclonal Antibody Production by Immunization

Polyclonal antiserum containing antibodies to heterogenous epitopes of a single protein can be prepared by
immunizing suitable animals with the expressed protein described above, which can be unmodified or modified to enhance
immunogenicity. Effective polyclonal antibody production is affected by many factors related both to the antigen and
the host species. For example, small molecules tend to be less immunogenic than others and may require the use of
carriers and adjuvant. Also, host animals vary in response to site of inoculations and dose, with both inadequate and
excessive doses of antigen resulting in low titer antisera. Small doses (ng level) of antigen administered at multiple
intradermal sites appear to be most reliable. An effective immunization protocol for rabbits can be found in Vaitukaitis,
J. et al., J. Clin. Endocrinol. Metab. 33:988-991 (1971).

Booster injections can be given at regular intervals, and antiserum harvested when antibody titer thereof, as
determined semi-quantitatively, for example, by double immunodiffusion in agar against known concentrations of the
antigen, begins to fall. See, for example, Ouchterlony, O. et al., Chap. 19 in: Handbook of Experimental Immunology,
Weir, D., ed., Blackwell (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of serum
(about 12 µM). Affinity of the antisera for the antigen is determined by preparing competitive binding curves, as described,
for example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, second edition, Rose and Friedman, eds., Amer.
Soc. For Microbiology, Washington, D.C. (1980).

Antibody preparations prepared according to either protocol are useful in quantitative immunoassays which
determine concentrations of antigen-bearing substances in biological samples; they are also used semi-quantitatively
or qualitatively to identify the presence of antigen in a biological sample. In addition, they are useful in various animal
models of Staphylococcal disease known to those of skill in the art as a means of evaluating the protein used to make
the antibody as a potential vaccine target or as a means of evaluating the antibody as a potential immunotherapeutic
reagent.

3. Preparation of PCR Primers and Amplification of DNA 

Various fragments of the Staphylococcus aureus genome, such as those of Tables 1-3 and SEQ ID NOS: 1-5,191
can be used, in accordance with the present invention, to prepare PCR primers for a variety of uses. The PCR primers
are preferably at least 15 bases, and more preferably at least 18 bases in length. When selecting a primer sequence,
it is preferred that the primer pairs have approximately the same G/C ratio, so that melting temperatures are
approximately the same. The PCR primers and amplified DNA of this Example find use in the Examples that follow.
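
The primer guidelines above (minimum length, matched G/C ratio) can be checked mechanically. The sketch below is illustrative only: the Wallace rule used for the melting temperature is a common rule of thumb for short oligonucleotides and is not taken from this document, and the tolerance `max_gc_diff` is a hypothetical parameter.

```python
def gc_fraction(primer):
    """Fraction of G and C bases in the primer."""
    primer = primer.upper()
    return (primer.count("G") + primer.count("C")) / len(primer)


def wallace_tm(primer):
    """Rule-of-thumb melting temperature in deg C: 2*(A+T) + 4*(G+C)."""
    primer = primer.upper()
    at = primer.count("A") + primer.count("T")
    gc = primer.count("G") + primer.count("C")
    return 2 * at + 4 * gc


def check_primer_pair(forward, reverse, min_len=15, max_gc_diff=0.10):
    """Return a list of problems; an empty list means the pair passes."""
    problems = []
    for name, primer in (("forward", forward), ("reverse", reverse)):
        if len(primer) < min_len:
            problems.append(f"{name} primer shorter than {min_len} bases")
    # max_gc_diff is an assumed tolerance, not a value from the text.
    if abs(gc_fraction(forward) - gc_fraction(reverse)) > max_gc_diff:
        problems.append("G/C content differs by more than allowed")
    return problems
```

Setting `min_len=18` applies the "more preferably" length from the text instead of the 15-base minimum.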

4. Gene Expression from DNA Sequences Corresponding to ORFs

A fragment of the Staphylococcus aureus genome provided in Tables 1-3 is introduced into an expression vector
using conventional technology. Techniques to transfer cloned sequences into expression vectors that direct protein
translation in mammalian, yeast, insect or bacterial expression systems are well known in the art. Vectors and
expression systems are commercially available from a variety of suppliers including Stratagene (La Jolla, California),
Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to enhance expression and facilitate
proper protein folding, the codon context and codon pairing of the sequence may be optimized for the particular
expression organism, as explained by Hatfield et al., U.S. Patent No. 5,082,767, incorporated herein by this reference.

The following is provided as one exemplary method to generate polypeptide(s) from cloned ORFs of the
Staphylococcus aureus genome fragment. Bacterial ORFs generally lack a poly A addition signal. The addition signal sequence
can be added to the construct by, for example, splicing out the poly A addition sequence from pSG5 (Stratagene) using
BglI and SalI restriction endonuclease enzymes and incorporating it into the mammalian expression vector pXT1
(Stratagene) for use in eukaryotic expression systems. pXT1 contains the LTRs and a portion of the gag gene of Moloney
Murine Leukemia Virus. The positions of the LTRs in the construct allow efficient stable transfection. The vector includes
the Herpes Simplex thymidine kinase promoter and the selectable neomycin gene. The Staphylococcus aureus DNA
is obtained by PCR from the bacterial vector using oligonucleotide primers complementary to the Staphylococcus
aureus DNA and containing restriction endonuclease sequences for PstI incorporated into the 5' primer and BglII at
the 5' end of the corresponding Staphylococcus aureus DNA 3' primer, taking care to ensure that the Staphylococcus
aureus DNA is positioned such that it is followed by the poly A addition sequence. The purified fragment obtained from
the resulting PCR reaction is digested with PstI, blunt ended with an exonuclease, digested with BglII, purified and
ligated to pXT1, now containing a poly A addition sequence and digested with BglII.
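
As an illustration of the primer layout described above, the sketch below prepends the PstI recognition site (CTGCAG) to the 5' primer and the BglII recognition site (AGATCT) to the 5' end of the 3' primer. The annealing length of 18 bases, the function names, and the check for internal sites are assumptions added for the example; the text does not specify them.

```python
SITES = {"PstI": "CTGCAG", "BglII": "AGATCT"}
COMPLEMENT = str.maketrans("ACGT", "TGCA")


def reverse_complement(seq):
    """Return the reverse complement of an upper-case DNA string."""
    return seq.translate(COMPLEMENT)[::-1]


def internal_sites(orf):
    """Names of the cloning enzymes whose site also occurs inside the ORF.

    Both recognition sites are palindromic, so scanning one strand suffices.
    """
    orf = orf.upper()
    return [name for name, site in SITES.items() if site in orf]


def design_cloning_primers(orf, anneal_len=18):
    """Return (five_prime_primer, three_prime_primer), both written 5'->3'."""
    orf = orf.upper()
    if len(orf) < anneal_len:
        raise ValueError("ORF is shorter than the annealing length")
    five_prime = SITES["PstI"] + orf[:anneal_len]
    # The 3' primer anneals to the end of the ORF on the opposite strand.
    three_prime = SITES["BglII"] + reverse_complement(orf[-anneal_len:])
    return five_prime, three_prime
```

An ORF for which `internal_sites` returns a non-empty list would be cut internally by the same enzymes, so a different pair of sites would be needed in that case.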

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life Technologies, Inc., Grand Island,
New York) under conditions outlined in the product specification. Positive transfectants are selected after growing the
transfected cells in 600 ug/ml G418 (Sigma, St. Louis, Missouri). The protein is preferably released into the supernatant.
However, if the protein has membrane binding domains, the protein may additionally be retained within the cell or
expression may be restricted to the cell surface. Since it may be necessary to purify and locate the transfected product,
synthetic 15-mer peptides synthesized from the predicted Staphylococcus aureus DNA sequence are injected into
mice to generate antibody to the polypeptide encoded by the Staphylococcus aureus DNA.

Alternatively, if antibody production is not possible, the Staphylococcus aureus DNA sequence is additionally
incorporated into eukaryotic expression vectors and expressed as, for example, a globin fusion. Antibody to the globin
moiety then is used to purify the chimeric protein. Corresponding protease cleavage sites are engineered between the
globin moiety and the polypeptide encoded by the Staphylococcus aureus DNA so that the latter may be freed from
the former by simple protease digestion. One useful expression vector for generating globin chimerics is pSG5
(Stratagene). This vector encodes a rabbit globin. Intron II of the rabbit globin gene facilitates splicing of the expressed
transcript, and the polyadenylation signal incorporated into the construct increases the level of expression. These
techniques are well known to those skilled in the art of molecular biology. Standard methods are published in methods
texts such as Davis et al., cited elsewhere herein, and many of the methods are available from the technical assistance
representatives from Stratagene, Life Technologies, Inc., or Promega. Polypeptides of the invention also may be
produced using in vitro translation systems such as the in vitro Express™ Translation Kit (Stratagene).

While the present invention has been described in some detail for purposes of clarity and understanding, one
skilled in the art will appreciate that various changes in form and detail can be made without departing from the true
scope of the invention.

All patents, patent applications and publications referred to above are hereby incorporated by reference. 
ORF | SEQ ID NO | BLAST HOMOLOG | Antigenic Region 1 | Region 2 | Region 3 | Region 4
168.6 | 5192 | lipoprotein | 36-45 | 84-103 | 152-161 | 176-185
238.1 | 5193 | chrA | 21-39 | 48-58 | 84-95 | 232-249
51.2 | 5194 | OppB gene product (B. sub | 20-36 | 70-79 | 100-112 | 121-131
278.3 | 5195 | lipoprotein | 20-29 | 59-73 | 85-97 | 162-171
276.2 | 5196 | lipoprotein | 21-33 | 65-74 | 177-186 | 211-220
45.4 | 5197 | ProX | 28-37 | 59-69 | 85-100 | 120-129
316.8 | 5198 | hypothetical protein | 45-54 | 88-97 | 182-192 | 243-253
154.15 | 5199 | unknown | 31-40 | 48-58 | 79-88 | 95-104
228.3 | 5200 | unknown | 25-38 | 40-52 | 64-74 | 80-89
228.6 | 5201 | unknown | 29-41 | 89-101 | 128-143 | 173-184
50.1 | 5202 | unknown | 21-33 | 52-61 | 168-182 | 197-206
112.7 | 5203 | iron-binding periplasmic | 21-31 | 58-67 | 92-101 | 111-120
442.1 | 5204 | unknown | 30-39 | 91-100 | 122-137 | 182-192
66.2 | 5205 | unknown | 50-59 | 104-116 | 127-136 | 167-182
304.2 | 5206 | Q^binding periplasmic | 19-28 | 48-57 | 75-84 | 103-116
44.1 | 5207 | hypothetical protein | 27-36 | 86-95 | 129-138 | 192-201
161.4 | 5208 | SphX | 27-44 | 149-161 | 166-175 | 201-210
46.5 | 5209 | cmpC (permease) | 21-33 | 61-70 | 83-92 | 100-109
942.1 | 5210 | traH [Plasmid pSK41] | 83-92 | 109-118 | 127-142 |
5.4 | 5211 | ORF (S. aureus) | 12-22 | 87-96 | 111-120 | 151-160
20.4 | 5212 | peptidoglycan hydrolase (S | 24-34 | 129-138 | 141-150 | 161-171
328.2 | 5213 | lipoprotein (H. flu) | 81-90 | 123-133 | 290-299 |
520.2 | 5214 | fibronectin binding protein | 44-54 | 63-79 | 81-90 | 95-110
771.1 | 5215 | emml gene product (S. py | 30-39 | 65-82 | 96-106 | 112-121
999.1 | 5216 | predicted trithorax prot. (D | 7-16 | 120-129 | 157-166 |
853.1 | 5217 | ORF2136 (Marchantia polym | 43-52 | 88-97 | 102-111 |
287.1 | 5218 | psaA homolog | 13-22 | 28-44 | 72-82 | 114-124
288.2 | 5219 | cell wall enzyme | 14-23 | 89-98 | |
596.2 | 5220 | penicillin binding protein 2b | 40-49 | 59-68 | 76-87 | 106-115
217.5 | 5221 | fibronectin/fibrinogen bindi | 28-37 | 40-49 | 62-71 | 93-111
217.6 | 5222 | fibronectin/fibrinogen bp | 10-19 | 31-40 | 54-62 | 73-92
528.3 | 5223 | myosin cross reactive prot | 4-13 | 29-47 | 60-73 | 90-99
171.11 | 5224 | EF | 20-31 | 91-110 | |
63.4 | 5225 | penicillin binding protein 2b | 12-21 | 59-68 | 95-104 |
353.2 | 5226 | | 46-55 | 62-71 | |
743.1 | 5227 | 29 kDa protein in fimA regi | 23-32 | 68-79 | 94-103 | 175-184
342.4 | 5228 | Twitching motility | 10-19 | 48-60 | 83-92 | 111-121
69.3 | 5229 | arabinogalactan protein | 97-106 | 132-141 | 158-167 | 180-189
70.6 | 5230 | nodulin | 36-45 | 48-57 | 137-160 | 179-188
129.2 | 5231 | glycerol diester phosphodie | 8-17 | 41-50 | 55-74 | 97-106
58.5 | 5232 | PBP (S. aureus) | 26-35 | 70-79 | 117-126 | 152-161
188.3 | 5233 | MHC class II analog (S. aure | 72-81 | 94-103 | 115-124 | 136-145
236.6 | 5234 | histidine kinase domain (Die | 24-33 | 52-67 | 81-94 | 106-121
310.8 | 5235 | clumping factor (S. aureus) | 59-71 | 77-86 | 93-102 | 118-127
601.1 | 5236 | novel antigen/ORF2 (S. au | 45-54 | 91-104 | 108-117 | 186-195
544.3 | 5237 | ORF YJR151c (S. cerevisiae) | 76-90 | 101-111 | 131-140 | 154-164
662.1 | 5238 | MHC class II analog (S. aure | 22-32 | 71-80 | 89-98 | 114-122
87.7 | 5239 | 5' nucleotidase precursor ( | 29-45 | 62-71 | 105-114 | 125-137
120.1 | 5240 | B65G gene product (B. sub | 102-111 | | |

Table 4: ORF Antigenic Regions (cont)

ORF | Region 5 | Region 6 | Region 7 | Region 8 | Region 9 | Region 10
168.6 | 244-272 | 303-315 | | | |
238.1 | 260-269 | 291-301 | 308-317 | | |
51.2 | 140-152 | 188-208 | 211-220 | 256-266 | 273-283 |
278.3 | 198-209 | | | | |
276.2 | 255-268 | | | | |
45.4 | 177-199 | 221-230 | 234-243 | 268-279 | 284-293 | 304-313
154.15 | 148-157 | 177-187 | 202-211 | | |
228.3 | 101-119 | 139-154 | 166-181 | | |
112.7 | 136-149 | 197-211 | 218-229 | 253-273 | |
442.1 | 199-210 | 247-257 | 264-277 | 287-309 | |
304.2 | 178-187 | 250-259 | | | |
46.5 | 131-141 | 162-176 | 206-215 | 243-252 | 264-273 | 285-294
5.4 | 189-205 | 230-239 | 246-264 | 301-318 | 340-354 | 378-387
20.4 | 202-212 | 217-234 | 260-275 | 314-336 | 366-373 | 380-391
771.1 | 145-154 | | | | |
287.1 | 154-164 | | | | |
596.2 | 121-130 | | | | |
217.5 | 244-253 | 259-268 | 288-297 | 302-311 | |
217.6 | 144-158 | 174-183 | 188-197 | 207-216 | 226-242 |
743.1 | 197-207 | | | | |
69.3 | 195-211 | | | | |
70.6 | 206-215 | 263-272 | 291-301 | 331-340 | 358-371 | 390-414
129.2 | 117-127 | 141-157 | 168-183 | 202-211 | 222-231 | 261-270
58.5 | 184-203 | 260-269 | 275-299 | 330-344 | 372-381 | 424-433
236.6 | 138-147 | 163-172 | 187-198 | 244-261 | 268-278 | 308-317
310.8 | 131-140 | 144-153 | 177-186 | 190-199 | 204-213 | 216-227
601.1 | 208-218 | | | | |
544.3 | 170-179 | 184-193 | 224-235 | 274-287 | 327-336 | 352-361
Table 4: ORF Antigenic Regions (cont)

ORF | Region 11 | Region 12 | Region 13 | Region 14 | Region 15 | Region 16
46.5 | 306-315 | | | | |
5.4 | 393-407 | 416-426 | 456-465 | | |
20.4 | 396-405 | 410-419 | 461-481 | | |
70.6 | 453-471 | 505-515 | | | |
129.2 | 296-315 | | | | |
236.6 | 358-377 | 410-423 | 428-439 | 442-457 | 467-476 | 480-493
310.8 | 238-251 | 256-275 | 281-290 | 296-310 | 314-333 | 338-347
Table 4: ORF Antigenic Regions (cont)

ORF | Region 17 | Region 18 | Region 19 | Region 20 | Region 21 | Region 22
310.8 | 357-366 | 370-379 | 429-438 | 443-452 | 478-487 | 551-560
Table 4: ORF Antigenic Regions (cont)

ORF | Region 23 | Region 24 | Region 25 | Region 26 | Region 27 | Region 28
310.8 | 622-632 | 670-685 | 708-718 | 823-836 | 858-867 | 877-886
Table 4

ORF | SEQ ID NO | BLAST HOMOLOG | Antigenic Region 1 | Region 2 | Region 3 | Region 4
46.1 | 5241 | aldehyde dehydrogenase | 8-17 | 36-52 | IIjOlZj | 13.1.2J2T
63.4 | 5242 | glycerol ester hydrolase (P. | 9-26 | 57-73 | 93-107 | 123-133
174.6 | 5243 | ketopantoate hydroxymeth | 71-80 | 203-212 | 242-254 | 265-274
206.16 | 5244 | ornithine acetyltransferase | 1-10 | 34-43 | 54-63 | 194-210
267.1 | 5245 | NaH-antiporter protein (E. | 120-129 | 332-347 | 398-408 |
322.1 | 5246 | acriflavin resistance protein | 58-75 | 153-164 | 203-231 | 264-284
415.2 | 5247 | transport ATP-binding prot | 108-126 | 218-227 | 298-308 | 315-334
214.3 | 5248 | 2-nitropropane dioxygenase | 123-136 | 216-233 | 283-292 | 297-306
587.3 | 5249 | clumping factor | 5-14 | 43-54 | 59-68 | 76-95
685.1 | 5250 | signal peptidase | 59-68 | 72-81 | 86-95 | 99-108
54.3 | 5251 | fibronectin binding protein | 23-32 | 37-46 | 50-59 | 89-98
54.4 | 5252 | fibronectin binding protein | 43-52 | 66-75 | 95-104 | 147-15V
54.5 | 5253 | fibronectin binding protein | 49-60 | 81-90 | |
54.6 | 5254 | fibronectin binding protein | 55-71 | 82-97 | 139-158 | l 750*86
328.1 | 5255 | lipoprotein (H. flu) | 11-20 | 61-70 | 96-105 |
Table 4: ORF Antigenic Regions (cont)

ORF | Region 5 | Region 6 | Region 7 | Region 8 | Region 9 | Region 10
46.1 | 215-242 | 333-352 | 376-385 | 416-432 | 471-487 |
63.4 | 145-154 | 191-202 | 212-223 | 245-265 | 274-283 | 291-300
206.16 | 239-259 | 275-284 | | | |
322.1 | 298-319 | 350-359 | | | |
415.2 | 344-353 | 371-380 | 395-404 | 456-465 | 486-495 | 5A8-527
214.3 | 318-337 | 365-375 | | | |
587.3 | 106-115 | 142-151 | 156-166 | 173-182 | 186-198 | 204-213
685.1 | 113-122 | 130-145 | | | |
54.3 | 128-138 | 185-194 | 217-226 | 251-260 | 268-277 | 295-305
54.4 | 175-188 | 191-200 | 203-212 | 220-229 | |
54.6 | 220-230 | 287-304 | 317-326 | 344-353 | 364-373 | 378-387
Table 4: ORF Antigenic Regions (cont)

ORF | Region 11 | Region 12 | Region 13 | Region 14 | Region 15 | Region 17
63.4 | 306-315 | 319-328 | 366-376 | 395-420 | 453-462 | 467-476
415.2 | 539-555 | | | | |
587.3 | 217-226 | 278-287 | 318-327 | 332-342 | 351-360 | 377-386
54.3 | 316-325 | 329-345 | 355-372 | 387-396 | 416-425 | 438-448
54.6 | 396-407 | 427-436 | 514-531 | 541-550 | 569-578 | 611-622
Table 4: ORF Antigenic Regions (cont)

ORF | Region 18 | Region 19 | Region 20 | Region 21 | Region 22 | Region 23
63.4 | 485-500 | 513-525 | | | |
587.3 | 396-405 | 426-442 | 459-470 | 485-494 | 505-514 | 531-562
54.3 | 455-462 | 472-491 | 517-536 | | |
54.6 | 639-648 | 673-681 | 703-715 | 723-732 | 749-760 | 772-788
Table 4: ORF Antigenic Regions (cont)

ORF | Region 24 | Region 25 | Region 26 | Region 27 | Region 28 | Region 29
587.3 | 567-578 | 584-601 | 607-840 | 844-854 | 858-870 | 877-886
54.6 | 793-802 | 811-826 | 834-848 | 866-876 | 893-903 | 907-918

Table 4: ORF Antigenic Regions (cont)

ORF | Region 30 | Region 31
587.3 | 889-911 | 927-936
54.6 | 925-944 | 951-997
SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: 

(A) NAME: Human Genome Sciences, Inc. 

(B) STREET: 9410 Key West Avenue

(C) CITY: Rockville 

(D) STATE: Maryland 

(E) COUNTRY: US 

(F) POSTAL CODE: 20850 

(ii) TITLE OF INVENTION: Staphylococcus aureus Polynucleotides and Sequences



(iii) NUMBER OF SEQUENCES: 5255 

(v) COMPUTER READABLE FORM: 
(A) MEDIUM TYPE: Diskette, 3.50 inch, 1.4 Mb storage

(B) COMPUTER: HP Vectra 486/33 

(C) OPERATING SYSTEM: MSDOS version 6.2 

(D) SOFTWARE: ASCII Text 



(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 60/009,861 

(B) FILING DATE: 05-JAN-1996 



(2) INFORMATION FOR SEQ ID NO: 1:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5895 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1: 

TCCATTATGA AGTCACAAGT ACTATAAGCT GCGATGTTAC CAATGTTTTT TAAAATCCCA 60

GTAATAAAAT CAAAAAATAA GTTAAATAAT GTATTCATTT TAAGTCCTCC TTAATAAAGa 120

aaataGGTAA TAATGTAATA GCTTCTATTA TGATGCCTAA TTGAATGAAT TGGGCAAATG 180

GCTCTTTGAT GATAAGTGTG ATAATGAAAA GGGTTAAACT AACAATAATC GCATAATATT 240

TTTTTCGTTT AATAAGTCGC ACAGGAATGG GCTTCTTTTT AGTTGCTGCA GGAGCATATA 300

CTGAGATTAC ACCTAAAGAA ATAACTGTTA AAATAATCAT AATTAAAAAG TTAATATGAA 360

AATTTACTAT TACTAAAGGT AAAAGTATAA ATAGTATAAT ACTTTCTACA TAACACCAAA 420

AAGAAGAAGG TGCATGTGCa CCATGTGCAT GtCTTCTTAT TAAATAAAAT GTTAAATTCG 480

TAATTAACGT AAACAGAAAA ATGTTTAAAA TATAGGCAAT AGTATACATA ACAATTAATT 540

TACCTATATT TTTAGCTAAG ACCTGCATCC CTAATCGTAC TTGCAAAAAT TGAATATGAT 600

CTAAGTTATT TCTCTTTTGA AGATACGTGG CAAACTGGTC AATTTTATTA TCAAAATAAT 660

TCAATTTTAC ACCACTCTCC TCACTGTCAT TATACGATTT AGTACAATCT TTTATCATTA 720

TATTGCCTAA CTGTAGGAAA TAAATACTTA ACTGTTAAAT GTAATTTGTA TTTAATATTT 780

TAACATAAAA AAATTTACAG TTAAGAATAA AAAACGACTA GTTAAGAAAA ATTGGAAAAT 840

AAATGCTTTT AGCATGTTTT AATATAACTA GATCACAGAG ATGTGATGGA AAATAGTTGA 900

TGAGTTGTTT AATTTTAAGA ATTTTTATCT TAATTAAGGA AGGAGTGATT TCAATGGCAC 960

AAGATATCAT TTCAACAATC GGTGACTTAG TAAAATGGAT TATCGACACA GTGAACAAAT 1020

TCACTAAAAA ATAAGATGAA TAATTAATTA CTTTCATTGT AAATTTGTTA TCTTCGTATA 1080

GTACTAAAAG TATGAGTTAT TAAGCCATCC CAACTTAATA ACCATGTAAA ATTAGCAAGT 1140

GAGTAACATT TGCTAGTAGA GTTAGTTTCC TTGGACTCAG TGCTATGTAT TTTTCTTAAT 1200

TATCATTACA GATAATTATT TCTAGCATGT AAGCTATCGT AAACAACATC GATTTATCAT 1260

TATTTGATAA ATAAAATTTT TTTCATAATT AATAACATCC CCAAAAATAG ATTGAAAAAA 1320

TAACTGTAAA ACATTCCCTT AATAATAAGT ATGGTCGTGA GCCCCTCCCA AGCTCGCGGC 1380

CTTTTTTGTA ATGAAGAAGG GATGAGTTAA TCATCATTAT GAGACCCGCC GTTAAAATAT 1440

TCATTTGCAA AGGGCGAAAT GGGTTCTTAC TGAGTTATCT ATTATAAAAA AATAAACATA 1560

GACTTATGAA AAATCTCTCA TAAATCTATG TTTAGTCATG aCATGTGTTA AATATTATTT 1620

CGGGCGCTTC TTATTTATAC AAATCTAATT TAATACTTTT AAATACAGGT ATATTTTCgC 1680

GTTGCTGTTC TACTTCATTT AAGTTTAAAT CTACAGTCAA AATATCTGCG GATTCATTTA 1740

ATTCTCCAAC TAAATCTCCA TTTGGGTTTA TAACTATCGA ATGACCAGCA TATTCTGTGT 1800

TACGATCGAA TCCAGTGCTA TTAGTTCCAA TGACAAACAT ATTATTTTCA ATTGCACGTG 1860

CCTTTAGTAA TGAATGCCAA TGTTGAAGAC GTGACATAGG CCATTGCGCC ACATAAAATG 1920

CAATTTTAGC ACCACTACGA GCAGGATATC TTAATAATTC TGGAAAACGT AAATCATAAC 1980

AGATAAGTTG GGTCACATAA GTACCGTCAG ACAATTGAAA GGGTTCAGCT ACGTATTCGC 2040

CAGCGGTTAA AAATTCATGC TCTCTTAACA TAGGAACTAA ATGAACTTTG TCGTATTCaT 2100

TAATCAGCTG GCCACTTTTA TTCACACTAA AAGCTGTATT AAATATTTGA TTGTTTCTAA 2160

TGTTAGAAAC TGACCCAGCT ACGATATCGA CTTTATATTT TTCAGCTAAA TGTTTAATAA 2220

ATGAAAAACT TTGTCCTAGA TTATTATCTG CTTTTTCATT TAAATGCTCT AAATCATAGC 2280

CATTATTCCA CATTTCAGGT AAAACGACTA CATCTACTTC AGCATTCATA TTTTTTTCGA 2340

ACCATTGCGT TATTTGAGTT TCATTTTTAG AACTATCTCC AAAAACAATC GGTAATTGAT 2400

AAATTTGGAC TTTCATAACA TCACATCCTT GATAGATCTT ATATATAACT TACTAAAAGT 2460

TATGTTGAAA CGCAAAAAAC GAGCACAAGA CATAAAATCA AAGTCCTAGG CTCTACAAAG 2520

TTATATTGAC AGTAGTTGAT GGGGCCCCAA CATAGAGAAA TTGGAACACC AATTTCTACA 2580

GACAATGCAA GTTGGGGTGG GCTCTAACAT AAAGAAATAC TTTTTCTTTA GAAATTAGTA 2640

TTTCTTATAC ATGAGTTTTA CTCATGTATT CCTATTCTTA AGTGCACATT AGCAGCGGCT 2700

AATGTGTAAG AACTACTACA TAATGAATAA CTAATGATTC TTTATCATTT CTGTCCCATT 2760

CCTAACAATA TATTGATTAT TTTTTTATTA CGAAACGATC TTCCACTGGA TTAAATGTTT 2820

TTTCGCCAGC AGCTTCACGA ATATCACCAA ATGGCATTTG AGCAATAAGT TTCCAACTTT 2880

TAGGAATATT AAATTCATTT GAAGTCATCT CATCAACAAG TGGATTATAG TGTTGTAATG 2940

AAGCACCTAT GCCTTTAGTA GCTAATGCAG TCCAAATTGC AAATTGATGC ATGGCATTTG 3000

TTTGAGTTGA CCATATTGCA AAATTATCAT AGTAGTTTGG CATTTGTTCT TGTAAACCAC 3060

TTACAACATC TTGATCTTCA TAAAACAAAA TTGTACCGTA TGAATGTTTG AAGTTATCAA 3120

TTTTTTGTTC AGTTGGCTCG AAATCACGAT TCTCTCCCAT GACTTCTTTT AAAATTGCTT 3180

TTGTGTTATC CCAAAATTTA TTATTGTTGT CATTTAACAA GAGAACAATT CTAGTTGATT 3240

CATCGCTAAT TGATATCGAA TCTTTCAAAT TATATATTGA ACGTCTTTCT TCCATTGCAT 3360

TGTCAAAAGT CATTGCTTTT TTATCTTTTT TAAATAAGCC CATAATTATT GCTCCTTCTT 3420

TAGTAAAGAA TACTTAATAG ACTAAGTATA AAATTTATAC TCGTACTTGT AAAGCAATAT 3480

TTACGAAAAT TTCAAGAATA TTAATATTCA TTTTCAAATT CCAAATATAA ATGCATTTTC 3540

AACGCATATT TATTATACTT AGATTAATAC TTACATGAAA AAGGGAGGTG TCTCGTGAAA 3600

TGTCATATCA TTGGTTTAAG AAAATGTTAC TTTCAACAAG TATTTTAATT TTAAGTAGTA 3660

GTAGTTTAGG GCTTGCAACG CACACAGTTG AAGCAAAGGA TAACTTAAAT GGAGAAAAAC 3720

CAACTACTAA TTTGAATCAT AATATAACTT CACCATCAGT AAATAGTGAA ATGAATAATA 3780

ATGAGACTGG GACACCTCAC GAATCAAATC AAACGGGTAA TGAAGGAACA GGTTCGAATA 3840

GTCGTGATGC TAATCCTGAT TCGAATAATG TGAAGCCAGA CTCAAACAAC CAAAACCCAA 3900

GTACAGATTC AAAACCAGAC CCAAATAACC AAAACTCAAG TCCGAATCCT AAACCAGATC 3960

CAGATAACCC GAAACCAAAA CCGGATCCAA AACCAGACCC AGATAAACCA AAGCCAAATC 4020

CGGATCCAAA ACCAGATCCA GATAACCCGA AACCAAATCC AGATCCAAAA CCAGACCCAG 4080

ATAAACCAAA GCCAAATCCG GATCCAAAAC CAGATCCAGA TAAACCAAAG CCAAATCCGA 4140

ATCCAAAACC AGACCCTAAT AAGCCAAATC CTAACCCGTC ACCAGATCCC GATCAACCTG 4200

GGGATTCCAA TCATTCTGGT GGCTCGAAAA ATGGGGGGAC ATGGAACCCA AATGCTTCAG 4260

ATGGATCTAA TCAAGGTCAA TGGCAACCAA ATGGGAATCA AGGAAACTCA CAAAATCCTA 4320

CTGGTAATGA TTTTGTATCC CAACGATTTT TAGCCTTGGC AAATGGGGCT TACAAGTATA 4380

ATCCGTATAT TTTAAATCAA ATTAATAAGT TGGGCAAAGA TTATGGAGAA GTTACTGATG 4440

AAGACATTTA TAATATTATT CGAAAACAAa ATTTCAGCGG AAATGCATAT TTAAATGGAT 4500

TACAACAGCA ATCGAATTAC TTTAGATTCC aATATTTCAA TCCATTGAAA TCAGAAAGGT 4560

ACTATCGTAA TTTAGATGAA CAAGTACTCG CATTAATTAC TGGTGAAATT GGATCAATGC 4620

CAGATTTGAA AAAGCCCGAA GATAAGCCGG ATTCAAAACA ACGCTCATTT GAACCGCATG 4680

AAAAAGACGA TTTTACAGTA GTTAAAAAAC AAGAAGATAA TAAGAAAAGT GCGTCAACTG 4740

CATATAGTAA AAGTTGGCTA GCAATTGTAT GTTCTATGAT GGTGGTATTT TCAATCATGC 4800

TATTCTTATT TGTAAAGCGA AATAAAAAGA AAAATAAAAA CGAATCACAG CGACGATAAT 4860

CCGTGTGTGA TTCGTTTTTT TTATTATGGA ATAAAAATGT GATATATAAA ATTCGCTTGT 4920

TCCGTGGCTT TTTTCAAAGC CTCAGGATTA AGTAATTGGA ATATAACGAC AAATCCGTTT 4980

TGTAACATAT GGATAATAAT TGGAACAGCA AGCCGTTTTG TCCAAACATA TGCTAATGAA 5040

AATATTAATG AACTTACTGT TGTAGCAATA ATAAATGCCA CGATACGATT ACCTTTAATC 5160

GCATTAAATA ATTCTCCAAA GATTACTTTT CTGAATACAT ATTCTTCTAA TAAAGGACCA 5220

ATAATAGATA CAAAGAAGAT AAATATAGGT ATTTTTCGAG CAATAATAAT TAGCTTTTCT 5280

GTATTAGGAC TTACTTGTTG TCCACCATAA ATTTGCGTTA ATACAATGCT CACTACCATT 5340

TGATAAATCA TTACCAATGC AAATCCAAGC AATGCCCATG GAATGATATA TTTTTTAGGT 5400

TCTTTAACTT CTAATTCTAA TTTTGTTGGA TTTTTAATTT TTAAATTAAT TAAAATAATC 5460

GTCGTGGCGG CGATTAAAAA TAGAACAAGT TGTATGTAAA TGACTGCTTT AGTCAGTTCT 5520

ATGCCACTAT ATTGTACAAA TGGTAATTTT TTTACAATGA GAAGCGGTAA AAATTGAGAC 5580

AATATATAAA TAATAACAGT TAGCAATGAT GCCCATAATC tTGTCATAAT TTTCCTCCAA 5640

ATATTTGTTT ATAATTTATT TTATCGTAAA TAACTTGAAG TTACAAAACT TAATTAAAAG 5700

GTTATGACTT GAAATTTTGA CCAAATTTGA TTATTATAAA TGTATGTTAG CACTCTTTAA 5760

TGTTAAGTGC TAAACTTTAG GTTTTTTAAG GAGGAACAAT CATGCTAAAA CCAATTGGAA 5820

ATCGTGTGAT TATTGAGAAA AAAGAACAAG AACAAACAAC TAAAAGTGGn ATTGTTTAAC 5880

TGATAGTGCT AAAGA 5895

(2) INFORMATION FOR SEQ ID NO: 2:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6796 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 2: 

TTTGAAAAAA CAAGGTACGA TTGGTTTAAT AACATATATG AGAACCGATT CTACACGTAT 60 

TTCaGATACT GCCAAAGTTG AAGCAAAACA GTATATAACT GATAAATACG GTGAATCTTA 120 

CACTTCTAAA CGTAAAGCAT CAGGGAAACA AGGTGACCaA GATGCCCATG AGGCTATTAG 180 

ACCTTCAAGT ACTATGCGTA CGCCAGATGA TATGAAGTCA TTTTTGACGA AAGACCAATA 240 

CCGATTATAC AAATTAATTT GGGAACGATT TGTTGCTAGT CAAATGGCTC CAGCAATACT 300 

TGATACAGTC TCATTAGACA TAACACAAGG TGACATTAAA TTTAGAGCGA ATGGTCAAAC 360 

AATCAAGTTT AAAGGATTTA TGACACTTTA TGTAGAAACT AAAGATGATA GTGATAGCGA 420

AAAGGAAAAT AAACTGCCTA AATTAGAGCA AGGTGATAAA GTCACAGCAA CTCAAATTGA 480

ACCAGCTCAA CACTATACAC AACCACCTCC AAGATATACT GAGGCGAGAT TAGTAAAAAC 540

AAAGCGTAAC TATGTCAAAT TAGAAAGTAA GCGTTTTGTT CCTACTGAGT TGGGAGAAAT 660

AGTTCATGAA CAAGTGAAAG AATACTTCCC AGAGATTATT GATGTGGAAT TCACAGTGAA 720

TATGGAAACG TTACTTGATA AGATTGCAGA AGGCGACATT ACATGGAGGA AAGTAATCGA 780

CGGTTTCTTT AGTAGCTTTA AACAAGATGT TGAACGTGCT GAAGAAGAGA TGGAAAAGAT 840 

TGAAATCAAA GATGAGCCAG CCGGTGAAGA CTGTGAAATT TGTGGTTCTC CTATGGTTAT 900 

AAAAATGGGA CGCTATGGTA AGTTCATGGC TTGCTCAAAC TTCCCGGATT GTCGTAATAC 960 

AAAAGCGATA GTTAAGTCTA TTGGTGTTAA ATGTCCAAAA TGTAATGaTG GTGACGTCGT 1020 

AGAAAGAAAA TCTAAAAAGA ATCGTGTCTT TTATGGATGT TCGAAATATC CTGAATGCGA 1080

CTTTATCTCT TGGGATAAGC CGATTGGAAG AGATTGTCCA AAATGTAACC AATATCTTGT 1140 

TGAAAATAAA AAAGGCAAGA CAACACAAGT AATATGTTCA AATTGCGATT ATAAAGAGGC 1200 

AGCGCAGAAA TAATATTTTT ATTTCCTAGA TACATTTTAA GATTGTTAAA TAGAATCATT 1260 

AGTGAATCTT ATTTTAAAGA TAGTAAAGGA TTAATCTAAA TAAGTGCGGA TAATATAAAC 1320 

ATAACAACAT AATTAAmAGA CATAAATGAC aATAAAAGGA GTATAGAAAT GACTCAAACT 1380

GTAAATGTAA TAGGTGCTGG TCTTGCCGGT TCAGAAGCGG CATATCAATT AGCTGAAAGA 1440 

GGAATTAAAG TTAATCTAAT AGAGATGAGA CCTGTTAAAC AAACACCAGC GCACCATACT 1500 

GATAAATTTG CGGAACTTGT ATGTTCCAAT TCATTACGCG GAAATGCTTT AACTAATGGT 1560

GTGGGTGTTT TAAAAGAAGA AATGAGAAGA TTGAATTCTA TAATTATTGA AGCGGCTGAT 1620 

AAGGCACGAG TTCCAGCTGG TGGTGCATTA GCAGTTGATA GACACGATTT TTCAGGTTAT 1680 

ATTACTGAAA CACTTAAAAA TCATGAAAAT ATCACAGTTA TTAATGAAGA AATTAATGCC 1740 

ATTCCAGATG GATACACAAT TATCGCAACA GGACCACTTA CTACAGAAAC CCTTGCGCAA 1800 

GAAATAGTGG ACATTACTGG TAAAGATCAA CTTTATTTCT ATGATGCGGC TGCTCCAATT 1860 

ATTGAAAAAG AATCTATTGA TATGGATAAA GTTTACTTAA AGTCCCGTTA TGATAAAGGT 1920 

GAAGCTGCAT ATTTAAACTG TCCTATGACT GAGGATGAAT TTAATCGCTT TTATGATGCA 1980 

GTATTAGAAG CTGAAGTTGC GCCTGTAAAT TCATTTGAAA AAGAAAAATA TTTCGAGGGT 2040

TGTATGCCTT TTGAAGTAAT GGCAGAACGC GGACGCAAGA CATTACTATT TGGACCAATG 2100

AAACCAGTAG GATTAGAAGA TCCAAAGACT GGGAAACGTC CTTATGCGGT GGTTCAATTA 2160

AGACAAGATG ACGCTGCTGG TACACTCTAC AATATTGTTG GCTTCCAAAC GCATTTAAAA 2220

TGGGGAGCTC AAAAAGAAGT CATTAAATTA ATTCCAGGCT TAGAAAATGT TGATATTGTT 2280

AGATATGGTG TGATGCATAG AAATACCTTC ATTAATTCAC CGGACGTATT AAACGAGAAA 2340

TATGTAGAAA GCGCAgcTAG CGGCTTAGTT GCAGGTATCA ATCTTGCGCA TAAAATATTA 2460

GGCAAGGGTG AGGTAGTATT TCCGAGAGAA ACAATGATTG GAAGTATGGC TTACTATATT 2520

TCTCATGCTA AAAACAATAA GAATTTCCAA CCTATGAATG CTAACTTCGG GTTATTACCA 2580

TCTTTAGAAA CTAGAATTAA AGATAAAAAA GAACGCTATG AAGCACAAGC TAATAGAGCT 2640

TTGGATTACT TAGAAAATTT CAAAAAAACT TTATAAAATA GTTAGAAAGA CTAGATATGC 2700

TATTCATTCT TAAGTCATCA ACGAGTAAGT AATGACTTTC TAAATGGAAA ATACTTATCC 2760

TAGTCTTTTT AATTTTGGAA TTGTTACGTA TTTCTGACAA TTTAGAATTC GCATTCAAAA 2820

AATATCTAAA TAAATAACAC GCAATAAGTT GATTGATGTA ACATGTAAGA GAATGTTTTA 2880

AATAAACTTT ATTTAAAAGG CAATGAAATA ATAAATGGCA AGGCTATTAA TAAAGACTTT 2940

TAGTAATTAA TTTAAAAAAG AGGTATTCTA ATTAACAGGT TTTCCGATTA GTTACAATTA 3000

TTTAATTCTC AAAAGATTTA GAATTGATTA TCAAATTACT GTAAGCCCTT TGCTGTATAT 3060

GCTAGAATTC TTATTGATGG AGGGTAAATG TATTGAATCA TATTCAAGAT GCGTTTTTAA 3120 

ATACATTGAA AGTTGAACGG AATTTTTCGG AACACACATT GAAATCATAT CAAGATGACT 3180 

TAATTCAGTT TAATCAATTT TTAGAACAAG AACATTTAGA GTTGAATACT TTTGAATACA 3240 

GAGATGCTAG AAATTATTTG AGCTATTTAT ATTCAAATCA TTTGAAAAGA ACATCTGTTT 3300 

CTCGTAAAAT CTCAACGTTA AGAACTTTCT ATGAATATTG GATGACGCTT GATGAGAACA 3360

TTATTAATCC ATTTGTTCAA TTAGTACATC CGAAAAAAGA AAAATATCTT CCGCAATTCT 3420

TTTACGAAGA AGAAATGGAA GCGTTATTCA AAACTGTAGA AGAGGACACT TCAAAAAATT 3480

TACGGGATCG AGTTATTCTT GAATTGTTGT ATGCTACAGG CATCCGTGTT TCGGAATTAG 3540 

TAAATATTAA AAAACAAGAT ATAGATTTTT ACGCGAATGG TGTTACCGTA TTAGGAAAAG 3600 

GGAGtAAAGA GCGCTTTGTA CCGTTTGGTG CTTATTGTAG ACAAAGCATC GAAAATTATT 3660 

TAGAACATTT CAAACCAATT CAGTCATGCA ATCATGATTT TCTTATTGTA AATATGAAGG 3720 

GTGAAGCAAT CACTGAACGC GGTGTACGAT ATGTTTTAAA TGATATTGTT AAACGAACAG 3780 

CAGGCGTAAG TGaGATTCAT CCCCACAAGC TCAGACATAC ATTTGCAACG CATTTATTGA 3840

ATCAAGGTGC AGACCTAAGA ACAGTACAAT CGTTATTAGG TCATGTTAAT TTGTCAACAA 3900

CTGGTAAATA TACACACGTA TCTAACCAAC AATTAAGAAA AGTGTATCTA AATGCACATC 3960

CTCGAGCGAA AAAGGAGAAT GAAACATGAG TAATACAACA TTACATGCAA CAACAATTTA 4020

TGCTGTAAGA CATAATGGGA AAGCAGCTAT GGCTGGAGAT GGGCAAGTAA CGCTTGGTCA 4080 

ACAAGTCATC ATGAAACAAA CGGCAAGAAA AGTGCGACGT TTATATGAAG GTAAAGTGTT 4140

ATTACAACAG TTTAGTGGTA ACTTAGAAAG AGCTGCTGTT GAATTGGCAC AAGAATGGCG 4260

AGGCGATAAA CAATTACGTC AATTAGAAGC TATGCTAATT GTAATGGATA AAGATGCTAT 4320

TTTAGTTGTC AGTGGAACTG GCGAAGTTAT TGCTCCAGAT GATGACCTTA TCGCTATTGG 4380

ATCAGGAGGC AACTACGCAT TAAGCGCAGG ACGTGCATTG AAACGCCATG CATCGCATTT 4440

GTCTGCTGAA GAAATGGCAT ATGAGAGCTT GAAAGTAGCG GCTGATATTT GTGTCTTTAC 4500

CAACGATAAT ATTGTTGTCG AAACACTATA ATAATCAGAG CACGATAAAT AATTACGAGC 4560

AATTAATTTT AGTTAAAAGA CGGAGGAATG AAATTAATGG ATACAGCTGG AATAAGATTA 4620

ACTCCAAAAG AAATCGTATC TAAATTAAAT GAATACATCG TTGGACAAAA TGATGCTAAA 4680

CGTAAAGTGG CAATTGCCCT ACGTAATCGA TACAGAAGAA GTTTATTAGA TGAGGAATCA 4740

AAGCAAGAAA TTTCACCTAA AAATATTTTG ATGATTGGAC CAACCGGCGT TGGTAAAACT 4800

GAAATTGCAA GAAGAATGGC CAAAGTTGTC GGCGCGCCAT TTATAAAAGT AGAAGCTACT 4860

AAATTTACTG AGGTAGGTTA TGTAGGACGA GATGTTGAAA GTATGGTTAG AGATCTTGTT 4920

GATGTTTCAG TAAGATTAGT CAAGGCGCAG AAAAAATCAT TGGTACAAGA TGAAGCAACA 4980

GCTAAGGCCA ATGAAAAACT TGTTAAGTTA TTAGTTCCAA GTATGAAAAA GAAAGCGTCT 5040

CAAACGAATA ATCCTTTAGA GTCACTTTTC GGAGGTGCAA TTCCAAATTT CGGACAAAAT 5100

AACGAAGATG AAGAAGAACC ACCTACTGAG GAAATTAAAA CAAAACGTTC TGAAATTAAG 5160

AGACAGCTAG AAGAAGGCAA ACTTGAAAAA GAAAAGGTAA GAATTAAAGT CGAACAAGAT 5220

CCTGGTGCTT TAGGTATGCT AGGTACAAAT CAAAATCAGC AAATGCAAGA GATGATGAAT 5280

CAATTAATGC CTAAAAAGAA AGTTGAGCGA GAAGTTGCTG TTGAGACGGC AAGGAAAATC 5340

TTAGCTGATA GTTATGCGGA TGAACTAATT GATCAAGAAA GCGCTAACCA AGAAGCGCTT 5400

GAATTAGCAG AACAAATGGG TATGATCTTT ATAGATGAAA TCGACAAAGT TGCGACGAAT 5460

AATCATAATA GTGGTCAAGA TGTCTCAAGA CAAGGTGTTC AAAGAGATAT TTTACCTATA 5520

CTTGAAGGTA GCGTTATTCA AACCAAATAT GGTACTGTGA ATACTGAACA TATGCTGTTT 5580

ATAGGTGCTG GAGCTTTCCA TGTATCTAAG CCGAGTGACT TGATACCAGA ATTGCAAGGT 5640

CGTTTTCCGA TTAGAGTTGA ACTTGATAGT TTATCGGTAG AAGATTTTGT AAGAATTTTG 5700

ACAGAACCAA AATTGTCATT AATTAAACAA TATGAAGCAT TGCTTCAAAC AGAAGAAGTT 5760

ACTGTAAACT TTACCGATGA AGCAATTACT CGCTTAGCTG AGATTGCTTA TCAAGTAAAT 5820

CAAGATACAG ACAACATTGG TGCACGTCGA CTTCATACAA TTTTAGAAAA GATGCTAGAA 5880

GATTTATCAT TCGAAGCACC AAGTATGCCG AATGCAGTTG TAGATATTAC CCCACAATAT 5940

AAATATACAA AAGGAGAAAA ATTCATGAGC TTATTATCTA AAACGAGAGA GTTAAACACG 6060

TTACTTCAAA AACACAAAGG TATTGCGGTT GATTTTAAAG ATGTAGCACA AACGATTAGT 6120 

AGCGTAACTG TAACAAATGT ATTTATTGTA TCGCGTCGAG GTAAAATTTT AGGATCGAGT 6180 

CTAAATGAAT TATTAAAAAG TCAAAGAATT ATTCAAATGT TGGAAGAAAG ACATATTCCA 6240

AGTGAATATA CAGAACGATT AATGGAAGTT AAACAAACAG AATCAAATAT TGATATCGAC 6300

AATGTATTAA CAGTATTCCC ACCTGAAAAC AGAGAATTAT TCATAGATAG TCGTACAACT 6360

ATCTTCCCAA TTTTAGGTGG AGGGGAAAGA TTAGGTACAT TAGTACTTGG TCnAGTACAT 6420

GATGATTTTA ATGaAAATGA TTTGGTACTA GGTGAATATG CTGCTACAGT TATTGGTATG 6480

GAAaTCTTAC GTGAGAAGCA TAGTGAAGTA GAAAnAGAAG CGCGCGATAA AGCTGCTATT 6540

ACAATGGCAA TTAATTCATT ATCTTATTCT GAAAAAGAAG CGATTGAACA TATCTTTGAA 6600 

GAACTTGGCG GTACGGAAGG CCTATTAATC GCATCAAAAG TTGCAGATAG AGTTGGTATT 6660 

ACTAGATCTG TAATTGTAAA TGCACTACGT AAATTAGAAA GTGCTGGTGT AATTGAATCA 6720 

CGTTCTTTAG GAATGAAAGG TACTTTCATT AAAGTTAAAA AAGAAAAATT CTTAGATGAA 6780 

TTAGAAAAAA GTAAAT 6796 

(2) INFORMATION FOR SEQ ID NO: 3: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2073 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 3: 

ATCCTAAAAT TnAAAATTAT CACGCCTTTT GaACAGCTTT GTAACCaTCt GGACGATCAT 60 

kAAATTCCaA TGTAAATCCT GGTTTAAaGT TGATCTTTAA CCTTATTTAA AyCACCAATT 120 

GTACGTATAT TATGTTGTTT AGCAAAATCA CGTTTTACAG CTAAAGCATA CGTATTGTTA 180 

TACTTCATTG GTTTTAACAT AGTCATTTGA TATTTCTTTT CAAGACTTTG CTTAGCTTGT 240 

TCATAAACTT TTTTCTCTTC TTTTGACTTC AATGGTTCTT TTGTTAATTC ACCTAAAACT 300 

GTTCCAGTAA ATTCTAAATA CCCATCTATA TCGTCAGATT TTAAAGCATT AAATAAAAAT 360 

GCTGTTTTGC CCATACCATC TTTCACTTCT ACAGTATTTT TGGTCTCTTC TTCTATTAAA 420

ATTTTATACA TATTTGTAAT AATCGATGGC TCGGAGCCAA GCTTTCCAGC TAACGTAATT 480

TTATCACCTT TTTGTGCAAA CATAGGAATA GCGATAGCCA GTATAATAAT CATCACTATA 540

TCAAATATAA TTGCCAATAA GGCTGCTGGA ATTGCACCTA ATAATATCAA CGATGCATTG 660

TTACGGTCTA TACCTAATAA AATTAAATCT CCTAGTCCGC CTGCACCAAT TAATGCTGCT 720

AGTGTTGCTG TACCTATAAT TAATACCATA GCCGTTCTTA CACCAGCCAT TATAACAGGC 780

ATTGCTATCG GAAGTTCGAC TTTAGTTAAA CGTCTAAATG GTTTCATACC TATACCTTTA 840

GCCGCTTCAA TGAGTGATGG ATCAACTTCT TTAATTCCAG TATACGTATT CCTTAAAATT 900

GGTAACAACG CATACACTAC AAGTGCAATA ATTGCTGGCA CACGACCGAT ACCAAATAAA 960

GGAATCATTA AACCTAATAA TGCCAACGAT GGTATGGTTT GAAGAATTGC CGCAATATTC 1020

ATTACGATTT CAGATATCGT TTTAGTCTTC GTTAATAAAA TACCTAATGG TACCGCAATA 1080

GCAGTTGCAA TCAATAATGC GATAAATGAT ATTTGAATAT GTTCTATCAT TGTCGAAAAG 1140

AGTTGCCCCT TACGTTCACT CAATATGTCg AAAAAGTTAG TCATGTTGAG CTACCTCCTT 1200

TTTCTGGGAC AAATATTTGA AGATATCTTT CCTATCAATA ACATATTGAC CTACGCTATC 1260

TTCTTGCATG ACAATGACAC GCTCGCTCTC TGATAAAAGT TGATACAATA CTTCAATTGG 1320 

TTGATTGTCA TAAACAATTG GATAAGCGCT CATAGATGTA ACCTCATCGA TTGGTTTCAT 1380 

AATATCCAAG TCACGGATAA TTGCGTTCTC TTCAACACAT GGCGCATCAT CTTCTAAATG 1440

ACTACCCATA AATTGTTTAA CAAATTCACT TTGAGGATTA TTTTTAAATC CTTCTGGTGT 1500 

GTCAATTTGT TCAATATGCC CTTCATTCAA AAGACAAATC TTATCACCAA GTTTCATCGC 1560 

CTCTTGAATA TCATGTGTAA CAAATATGAT TGTCTTCTTA ATTTTAGTTT GTAATTCAAT 1620 

TAAATCATCT TGAAGTTTTT CTCGGCTGAT TGGGTCTAAT GCACTAAACG GTTCATCCAT 1680

TAAAATAACT GGTGGATCAG CTGCTAACGC ACGTATAACT CCTACACGTT GTCGTTGCCC 1740

CCCTGACAAT TCATCAGGTT TTCTGTTTTT ATATTTTTCA GGTTCTAATC CAACCATTTC 1800 

AAGTRATTCA TCTACTCTTT TATCTATATC TTTTTCTTTC CACTTTTTCA TTTGTGGCAC 1860 

TTGTGCAAtA TTTTCTTTGa wTGTCaTATG TGGGAATAAT GCAATCTGCT GcAATACGTA 1920 

TCCAATATCC CAACkCATTT CGTATACTGG ATAATCACTT ATTGGTTTAT CTTTAAAATA 1980 

AATATAACCT TCACTTAAGT GAATGAGTCG ATTAATCATT TTTAATGTCG TAGTTTTTCC 2040

ACAACCTGAA GGTCCAATTA GCACAAAAAA TTC 2073

(2) INFORMATION FOR SEQ ID NO: 4:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13321 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 4:

ACTATTCTAG CTTCATCAGT TATCATATAT TCTTTGAAAC ACTTGTAAGA AAATATAATG 60 

AGTATTTACT ACATAATGAT ATTTCAAATT AGAAAAAAGG AAGTTATGAT TTAATGGCCT 120 

TGAGCCTATC ATAACTTCCT TTTATCATTT TATTGTTGTG TTGATGTTTC GATAACGTGG 180 

TACATCTTAT CAAACATCAA TTCGAAACCA TGCACCATGG CATCATGATA TTCTTTTTTC 240

TTTTGCTTGT ATTCTAAATT AGTAAATCGT CTTTCTTTTT CAACTAATGA ACGATAATAA 300

AATAGCATTT GGGTGCCACC TGTTTCACGT TCAAAAAATT CTACCTCAAT GACATCTTGC 360

GTTTCACTTA GTCCAGGCAT ACCGATAGTC ATCTTAACGT ATTCATCCAT AACTAAAGAT 420

TCATAAATGC CTTCAATCAC ATTTACTTTG CCATTACGTT GTTGATCTAC AATACGATAT 480

TTACCGCCTT CTTTAACGTC CGCTTCAATC TCTTTATTCG TTCTGGCTGA TGTCATAAAC 540

CATTGTTTCA ACAAATCTTT CTTTGTCCAA GCTTCGTATA CTAACTCTGG AGAAAATTTA 600

TAAAGCTTTT CAATTTCAAC TTCGACATGT TCATTCTCTA CATTAAATTT TGCCACTGTT 660

GTCCACCCAC TTTCGCTCTT ACTTTTATTT TAACGTATTT TTGCTCAGTT CCAAACATAG 720

ATGATCATCA TTTTTAAAAG ATTAGCGTTA TACGGTGAGT ACAACATGAT CTGTTAATAT 780

AACAAGCCAC CTTACTTGGC TACATCGATA TATTGTTAAG CATTAATGTT TCATTTCTTG 840

ACTAGTGTTC TTTTTTAGCT TTGGAAAATT AAATAAAATC GCAATAAGTC CGCATACACC 900 

TAATAATATA GGATAAATGC TGTATGGGAA TAACATTAAC GGTGAAATAC CAGCTACACC 960 

AGCCGCTGaA ATGACTTGCG GGCTATATGG TAATAAACCT TGGAAGCAGC CTCCAAATAT 1020 

ATCAAGAATA CTTGCTGATT TCCTTGAATC TACATCATAT TCATCTGCAA TATTTTTAGC 1080

TAAAGGACCT GACATAATAA TAGAGATGGT GTTGTTTGCC GTGGCAATAT CTGCGACACT 1140 

TACC&AACTA GCAATTCCTA ATTCTGCGCC ACGCTTTGAT TTCACTTTAG AGCGAACAAA 1200 

TTGCAACAAC CATTCAATAC CACCATTGTG TTGAATAATA CCGACTAAAC CACCAATTAG 1260 

CAACGCAATC ATAGCAATAT CTTCCATGCT TATAATACCT TTGGACACTG CATCTAGTAG 1320 

CCCCATCCAA CCGAATGAAC CATCTATGAG ACCAATGATT CCGGCTAATA ATGTTCCGCC 1380 

AATCAATACG ATAATGACAT TTACACCTAA TAATGCTAAT ACCAATACTA AGATATACGG 1440 

TACAACTTTA ATTAGATTAT AATCATAGTt TTTAGCATGA TTTAAAGAAA TGCCATTCGT 1500 

TAAGAAATAC AGAATAATAA TCGTTAAAAT AGCACCTGGC AATACAATTT TAAAGTTTAC 1560

TCTGAATTTA TCTTTCATTT TCGTATGTTG TGTTCTAACC GCAGCAATTG TTGTATCTGA 1620

AATCATTGAT AGATTATCGC CGAACATTGC ACCTCCAACA ACTGTAGCCa tTGctAGCGC 1680

TCCTACAGAC GTCCCCATAG ATATAGAAAC AAACATACAA ATCACAAACA ATCCTACAAT 1800

AATTAAATTT TCTGGGATTA ATGATAGTCC TAAATTAACT GTCGACTTTA CGCCACCCAT 1860

TTTTTCAGCT GTATTTGAAA ATGCACCTGC TAAAATAAAA ATCAACATCA TTAAAACAAT 1920

GTTTGAATGG CCTGCACCTT TCGTGAAGAC CTCAACTTTT TTAGCAAATG ATTCTTTTCG 1980

ATTCATTAAT AACGCCACAA TTACCGTTAT CGTAATTGCA ACATTTAATG GCATTGAAGT 2040

AAAATCACCT GTGATAXTAC CTACGCCTAA AAACAACGCC ACAAATAATA ACAAGGGGAA 2100

TAATGCCCAA GCATTGCTCT TTTTATGTAC TTCCATCCTT TTTACCTGCT TTCCAATTAA 2160

AAATACCTCT TTCTCACAAA CGATGAAGAA AGAGGTTTTC ATGTGCTTTA CCTGCTTATC 2220

TTCAAACCAT TACGGTTACT GGAATTGGCA CATTCGAGAT GTTGCCGAGG CTTCATAGGG 2280

CCAGTCCCTC CACCTCTCTA GATAAGTGAT GCTTATTTAG GTTTACGTTA CAAGATAATC 2340

CTTAGTACGT CAATCATAAA TTAATCAGGA GTCGTATAAT ATTTTTCATA AACAATCATT 2400

GCTACTGTAA TAATAATCAA AACAATAATG CTAATAACAA GTAAAAGCCA CCATTTAAGC 2460

ATTAATGCAA TAAAAATGAA CACGATAGAC ACACTTACTA ATATTAATGA TATGACTTTA 2520

AATTGCTGAA CACGTTGCTT GGAGATGACT TTCAACTGTT TGTTTGATAG ACGCGTATTT 2580

TTTATACTGA TTCCCAGTAT ATTTTCTAAT ATTTGAACCA ATACGATACT TATTGCAAAT 2640

ATAATAATTG GTAAAACATC ATAGCTCCCT ATAGTTAATG TATAAATTAC AAATCCAATG 2700

TAAAGTAACC CTGAGACAAA GGATAAAAAG TATGCGACGT ATTTGTTAAA CTTAATGATA 2760

TGCTTTTTAA CGTTTTGATG TGTAAACCAT ACATTCGAAA CGATCGCAAC TGCTACAAAT 2820

AATGTGAATA CTATATATAA TGGTAATTTT TGTTCAGGAA AAACAGTCGC TATTCCAAAA 2880

GCTAATGCTA AAATCAAAAA TAATATAGCT CTAGATACTA TTAATGCCAT AATAACAACC 2940

CCTTTGTTTA ATATCGAGTT TGCAAATTTA CGTTTATCAG CGTTTCTATG ATCAGTACTT 3000

CTACGGGTAG CGTTTCTATG TAATTTACAT CATCTTAACA TATAAATACT TCGCTATTTA 3060

ATTGAAAACA TATCCTATTA TTCTTTGTCC GTTCTGACGT TTAATATCTA GCCTTAGGCA 3120

TTTCACTTGT TAATGAATTT AACTTTCTTC CACTAACCGT CCCTAAACCC AATCCCGCAA 3180

CAGTTTTTAA CTTTTTCGTT GTTGTCCTGA CATCCTCATT AAGAAAGTTT ATTCTGCTTA 3240

AAACTTATAA TCCACACCCT GAGCAAACGC TCCTTATGAC AGAGTATTAA AATAAGCCGA 3300

TAAAGATACA CACCTTTACC GACTATTTAA AATACACTTC ACCAATTCAT TTTAATTTAA 3360

TGGATTGAAG TAACTAAATT AATATTATGT TGTTCAATTA AAAGCTTCAT ACAAACCTAA 3420

TCTATTTGCA CTCCACCGCT AACACCGAAC ACTTGTCCGG TTGTATAACT TGATTCTTCT 3480

GTTTTTTGAC CAAATGTTGG GATTTTACTT TGAGGTTGTC CACCAGAAAT TTGTAATGGT 3600

GACCAGAATG GACCAGGCGC TACACAGTTC ACTCTAATTC CTTTTGGTCC TAATTCTTCT 3660

GAAAAAGTTT TAGTTAATGA AATAATTGCT GCTTTTGAAG CGGCATAATC ATGAAGAATA 3720

GGACTAGGAT TATAACCTTG TACAGATGAT GTCGTTGTAA TTGACGCACC CGGTTTTAAA 3780

TATTCCAATG CTTTTTGAAC TGTCCAAAAT AGCGGATAGA CATTCGTTTC AAATGTTTCT 3840

GTAAATGCCT CAGTTGTAAA TCCATGAATA TCATCATGAT ACTGTTGATG TCCAGCAACT 3900 

AAAGTAACAT TATCTAAGCC ACCTAATTGT TGATATGCTT GTTCAACAAG GTCATAGTTG 3960 

AACTGTTCAT CTCTTATATC ACCAGGAATT AACACTGCCT TTTGACCACT TTCTTCAATC 4020 

ACTTGGCGTA CTTCTTGTGC ATCTTGTTCT TCACTCGGAA GATAGTTAAT CGCTACATCT 4080 

GCACCTTCTT TAGCATACGC AATTGCTGCT GCACGCCCTA TTGCTGAGTC ACCACCTGTG 4140 

ACTAATATTT TATAGCCTTG TAAGCGTTGA TGACCTTGGT AAGACGTTTC GCCACAATCG 4200

GGTGCTGGCG TCATTTCAGA TTGTAAACCC GGTACCTCTT GTTCTTGTTT TTCATAATCC 4260 

GTTGTTTTAA ATTTTGTTCT AGGATCTTGA GCTGCCATTT TTTTACATCT CCTTATTCGC 4320 

TTAATGGTTA TTATTTACCC AATCTTCCTA GGAACTTAAT CATGATTACA CTAAAAATTA 4380 

CTTTCTTCTT TATAAAAACA AGCTCGAATT ATTCATGCAA TAGTCTCTTT ACAAATTCAA 4440 

CAAAATACTC AGGTACTTTT TCCAGAATCC TTTCATCCGG TTTATATTGA GGATGATGTA 4500 

AATCATATTC ACTATGAGAA CCAATTAACG CAAATACACT TGGAAAATGT TGACTATAAC 4560 

CTGAAAAATC TTCTCCAATC GTAAGCGGCT GTTCCATCAT TCCCACCTTA TATCCAACAT 4620

GTTGGGCTAC TGCAATTGCT TTATGCGTCA ATGCCTCATC ATTCATCACA GCGCCAGGTA 4680

AATGCGTATA ATTTAAATTA ATTTTCATAT TATATGCTTG AGCCAATCCG TCCGCAATAT 4740

CTTGTAATCG TGTTTCTACA AGCTTTCGTA CCACAGGATC AAAACTACGC ACTGTGCCTT 4800

GTACATACGC ATGATCAGCA ATGACATTCC AAGTATTACC ACATGATATT TGTCCAATTG 4860

TTACTACCGC TTCATCAAAC GCAGATAGAT TTCTACTAAC TATGGATTGA ATACTATTAA 4920

TCAATTGCGC CAACACAATA ACTGGATCGT TGCATTGTTC TGGcTTTGCA GCATGACCAC 4980

CCACGCCTTT AATATGAAAC TCAAAACGAT CTACTGCTGA TGTAATTGCC CCTGTTTTGA 5040

TTGCAAATGT ACCTACCGAA CGCGATGGGT CATTATGAAA ACCCAATACT GCTTGTACAT 5100

CTTTTAATGC ATGTGTTTCA ATAATTTTAA AAGCGCCATG TCCTAGTTCT TCTGCTGATT 5160

GAAAAATGAA TTTAACACGC CCAGTAAGAG TGCCCTCAAT TTCTTTTAAT TTTACAGCTG 5220

TAGCCAAAAT ACTAGCCATG TGAATATCAT GACCACACGC ATGCATAACA CCTTCATTTT 5280

CAGCTATACA ACTCAGACCT TGTCCCACTT CAGCAACAAG CCCAGTCGCA AGTGGTAAGT 5400

CTAATATTCT AATATGATGT TCTGTTAAAA TATCTTTAAT TTTTTGTGTA GTCTTAAATT 5460 

CTTTATCGGA TAGTTCTGGA AATTGATGAA AATACCTTCT CCAGGTAACA GCTTGATCTT 5520 

TTAATCCCAT CGGTCATTCC CCTTCCTTAA GTCAATGATA TGTTGTCTAC CCTACGATGA 5580 

TCATCTTTGA CTATTAAACG ATGATTTCAC AACAATGTAC TCTTGTTAAT TGCTTTCGTT 5640

AATGATAGAC AGTTGTTTAA TAATATCGTA ACACTGTTGT CAAACTATTC TAACTTTTAT 5700

AATTGAGACT CTATACAAAA ACGTGTTCTC GAATATACTT GTTTTTACAA ACCACAAAAA 5760

GCTCTAAACA TTAGTTTAAA CCAATGCTTA GAGCTTTCTA ATTATTTTAT GCTTTAAAAG 5820

ATACTGTGTT ATCTACGATG ACCTTACCGT CTTTAATAAC TTTTTCTGCG TGATTGATAC 5880

CAAAATGATA TGGAATATAT TCATGATTTG GTGCATCCCA AATTACTAAA TTAGCCTTAT 5940

CACCTGTGTT AATTGTACCC GCGTTAATGT CTATTGCTTT AGCAGCATTG ACCGTAACAG 6000

CATTCCAAAC TTCATTAGGT GATAGCTTTA ATTTCAAGGC TGCAATCGCC ATAACAAGTT 6060

GTAAGTTGTT TGTGACACTA CTACCAGGGT TATAATCAGT TGCTAATGCA ATCGCACCGT 6120

TATTGTCAAG CATGCCTCTT GCATCTGCAT AATCTTCTTT ACCTAAATAG AACGTCGTTG 6180

CAGGTAAGAG GACAGCTACA GTATCACTAT TTCGCAACTT TTCTTTTCCT TTATCACTAG 6240

AAGCTACTAA GTGGTCTGCT GATATTGCTT GTTCATCAAT TGCTAATTCC AGTCCGCCTA 6300

ACGGATCAAT TTCATCCGCA TGTATTTTCA CTTTAAAACC TGCTTCTTTG GCTTTTTGCA 6360

TATAATGTTG CGATTGTTCT ATTGTAAATA CACCTGTTTC ACAGAAAATA TCCGCAAAGT 6420

CTGCATATTG TTTTACTTCC GGAAGTAACG CAATCATTTC TTCTAAAAAT GCCTCATTTG 6480

AACTTGCCTC TTTAGGTACA GCATGAGGCC CTAGGAAAGT ATGTTTCATG TCTAAATCAT 6540

ATTTCTCAGC TAAACGATTA GACACTTTCA ATTGCTTCAG TTCATTTTCT CTATCTAATC 6600

CATAACCACT CTTACTTTCA ACTGCAAGCA CGCCGTGTTT AATCATAGTA AGCAAATCAT 6660

GCTCTGCTTT TTTAAACAAG TCATCTTCGG ATGTTTCTCT AGTAGCATTA ACGGTAGATA 6720

ATATGCCACC ACCCATTTCT AATATTTCAA GGTAAGACTT ACCTTGACGT TTTAATGACA 6780

TCTCATGTTC TCGAGATCCA CCAAATGTTA AATGGGTATG TGCATCTACT AATGCTGGGG 6840

ACACTACCTT CCCACTAGCA TCAATCGTCT CAGTCGCATC GTAGTCATCT GTATGTGTTC 6900

CAGCATATAC AATTTTGCCA TCTTTAATGA CAACTGTACC ATTTTTCACA ACATTTAATT 6960

CATCTAATTC CTTACCCTTC AAAGGTTTAT CTGTTGATCT CGGTAAAATT AATTCTGCTA 7020

TATGATTAAT TATTAAATCA TTCATTACTT ATCACCTGCT TTATCAATCA TTGGAATATG 7080

AACACCCATA CCTGGGTCAG TCGTCAATAC ACGTTCCAAT CTTCTTTCAG CACGCTCTGA 7200

TCCATCTGCT ACAACAACCA TACCCGCATG AAGTGAATAT CCCATGCCAA CACCGCCACC 7260

GTGATGGAAT GAAATCCATG AACCACCTGC AGCTGTGTTA ATGAGTGCAT TCAATACAGC 7320

CCAATCACCA ACCGCGTCAC TACCATCTTT CATACTTTCT GTTTCACGGT TAGGACTAGC 7380

AACTGAACCA GCATCTAAAT GGTCTCGTCC AATAACAATT GGTGCTGAAA TTTCACCGTC 7440

ACGTACAAGA CGATTTAAAG CTAAGCCCAT TTTCGCTCTT TCTCCATAGC CTAACCAAGC 7500

AATACGTGAT GGTAGTCCTT GATATGAAAT TTTTTCTTCA GCTAAATCAA GCCATCTTAA 7560

TAACTTTTCA TTTTCTGGGA AAAGTTTGCG CATTTCTTCA TCCGCACGCT CGATATCTTT 7620

TGGATCACCA CTCAACGCAG CAAAGCGGAA TGGCCCTTTA CCTTCACAGA ATAATGGTCT 7680

AATGTAAGCT GGTACAAAGC CTGGGAAGTC AAAAGCATTT TTCACTCCGT TATTGAAGGC 7740

TACTTGACGA ATATTGTTAC CATAATCAAA TGCTACAGCG CCACGTTTTT GGAATTCAAG 7800

CATTAATTCA ACATGCTTTG CCATTGAAGC TTGTGACAGT TCAACATATT TTTTCGGATC 7860

TTTTTCACGC AATACTTTCG CTTCTTCTAC AGAGTATCCT TGTGGCACAT ATCCATTTAG 7920

CGGATCATGT GCACTTGTTT GGTCAGTAAT AATGTCAATT TTAAATCCTT TTTCTAGAAT 7980

CGCTTGATGG ATGTCTACAG CATTTCCAAC TAACCCGATT GATAATCCTT CTCCACGTTC 8040

TTTCGCCTCT TCTGCTAATT TTAATGCTTC ATCTAAATCA GCTGTTTTAA CATCACAGTA 8100

TTTCGTATCA ATTCGCTTAT CAACACGTGT TTCATCAACA TCCACGCAAA TTGCTACCCC 8160

ATGATTCATA GTAATTGCTA ACGGTTGCGC ACCACCCATA CCACCTAAAC CTGCTGTCAG 8220

TGTAACAGTG CCTGCTAAAT CTCCATTAAA GTGTTGATTA CCTAGCTCGG CAAATGTCTC 8280

ATAAGTACCT TGCACAATAC CTTGAGAACC AATATATATG CAACTACCGG CTGTCATCTG 8340

TCCATACATG ATTAAACCTT TTTTATCTAA TTCATTAAAA TGATCCCAGT TTGCCCATTC 8400

AGGCACTAAT ACTGAATTTG AAATTAATAC ACGTGGCGCT TCTTCATGTG TTTTAAATAC 8460

AGCAACTGGC TTTCCTGATT GTACTAACAT TGTCTCATCT GATTCTAATT CTCGTAACGT 8520

TTTCTCTATT GCTTCAAAAG CTTCCCAATT ACGTGCTGCT TTTCCAATAC CACCATAAAC 8580

AACTAAATCT TCTGGTCTTT CAGCAACTTC TGGGTCTAAA TTGTTGTATA ACATTCTAAG 8640

TACTGCTTCT TGTTCCCAAC CTTTACACTC AATACTCAAA CCTTTTTTTG CTTGAATTTT 8700

TCTCATAAAA TTCGCTCCTG TTCTTTTAAG AAGTTAATTC CACTAAATTT AAAACGCTTA 8760

CATTATTATC TTCAATATTC ATTATAGTAT GTTAAAATAT AGCCAACAAA TATAAATAAA 8820 

CTAATTATCC ATAGCTTGAA TCTATAAATA AAAGGAGCAA AACACATGAA AATTATTCAG 8880 
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CAT ATT AG CC AGCCATCTTT AACTGCTACG ATTAAAAAAA TGGAAGCAGA TTTAGGTTAT 9000 

GACTTATTTA CACGTTCAAC AAAAGACATC AAGATTACCG AAAAAGGAAT ACAGTTTTAT 9060 

CGTTATGCGA GCGAATTAGT TCAACAATAT CGATCCACGA TGGAAAAAAT GTATGATTTA 912 0 

AGCGTTACAT CAGAACCAAG GATAAAAATT GGGACTCTTG AATCTACGAA TCAATGGATT 9180 

GCGAATTTAA TTCGAAAGCA CCATTCCGAC TACCCTGAAC AGCAATATCG TTTATATGAA 9240 

ATACATGATA AACATCAATC TATAGAGCAA TTACTGAATT TTAATATTCA TTTAGCTATA 9300

ACAAATGAAA AAATAACCCA CGAAGATATA AGATCCATTC CTTTATATGA GGAATCTTAC 9360 

ATTTTATTAG CACCCAAGGA AACATTTAAA AATCAAAATT GGGTAGATGT TGAAAATTTG 9420 

CCACTCATAT TACCAAACAA AAATTCTCAA GTGCGCAAAC ACTTAGATGA CTATTTTAAT 9480

AGAAGAAATA TTCGTCCAAA TGTCGTTGTA GAAACAGATC GATTCGAATC AGCAGTTGGA 9540 

TTTGTTCATC TCGGCTTAGG TTACGCTATC ATTCCGAGAT TTTATTACCA ATCATTTCAC 9600 

ACGTCTAATT TAGAATATAA AAAAATTCGT CCAAACTTAG GCCGAAAAAT TTATATCAAT 9660 

TACCATAAAA AACGCAAACA CTCCGAACAA GTACATACAT TCGTACAACA ATGCCAAGAT 9720 

TATTTATATG GACTTTTAGA GGCTCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9780 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9840 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9900 

CTCAGTCAAC TGTATACCTT TTTCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 9960 

CTCAGTCAAC TGTATACCTT TTGCCTTTAA CTTAAGTTAT TAGTGCCTCT TATGTAGTTG 10020 

CGTAGTCAaC TGTaTACCTT TTGCCTTTAA CTTAAGTTAT TAGAGCCTCT TATGCAGTTG 10080 

CGCAGATCAT CGTATAAAAA TTAATGACGT CATTTCAAAA ATCGATACAA AAATAATTTA 10140 

TTATAAAAAT TCTAAGAAAG AAGTGAAGCA GATGTTAAAA TCTATTAATC ATATATGCTT 10200 

TTCAGTCAGA AATTTAAACG ATTCAATACA TTTTTATAGA GATATTTTAC TTGGGAAATT 10260 

GCTATTGACT GGTAAAAAAA CTGCTTATTT TGAGCTTGCA GGCCTATGGA TTGCTTTAAA 10320 

TGAAGAAAAA GATATACCAC GTAATGAAAT TCACTTTTCA TATACACATA TAGCTTTCAC 10380

TATAGATGAC AGCGAATTTA AATATTGGCA TCAGAGGTTA AAAGATAATA ACGTGAATAT 10440 

TTTAGAAGGA AGAGTTAGAG ATATTAGAGA TAGACAATCA ATTTACTTTA CCGACCCTGA 10500 

TGGTCATAAG CTAGAATTAC ATACTGGCAC ACTTGAGAAC AGATTAAATT ATTATAAAGA 10560 

GGCTAAACCA CATATGACAT TTTACAAATA AGGTGTCATT ATAAAAAGGC CTCTTGAACT 10620 

CCGTTAAAAT TTTAATTAAT TATTATATAA TAAGAGAACT TTTCAAACAA TACAGTTGTT 10680 
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TTACTGCAAT TATTTTTCAA ATATATCAAC GTTAATATAA CTTCTATTAA GAAATACTCA 10800 

CATTCTGCCC TGCAATGCAA ATCTCGTCAC ATATAAATAT TTTTAATTAT TTTAAAAAAT 10860 

GATGCACTAA ATTAGCAACG AGCTTAGCAG TTCTATTGTC AGCGTCATAT GTTGGATTCA 10920 
TCTCAGCAAT ACTAACTGAA GACACCTTAT CACTTGGAAT AATACGTTTT GCTAATTCAA 10980

GAACAGTATG TGGATACAAA CCTAACACTG CCGGCGCACT TACCCCAGGC GCAAACGCAC 11040 

TATCAATGAC ATCCATACAA ATCGTAAACA TAATGACATC ATGTTCATGT ACAAAACGTT 11100 

CAATCATATC TTTAATTGTT GGTGATACGT GACTCAATAA TTCATCTGCA AAGACATAAT 11160 

CAATCTTTTT CTCTTTAGCA TAATCAAATA AACTTTGCGT ATTACCACCT TGAGCAATAC 11220

CAAGCACTAA ATAATCTGTG TTTTCATCTT CTTCTAAAAT TTGTCTAAAG CTCGTTCCAG 11280 

ATGTAGATTG TTGTTCAGCA CGTGTATCAA AATGCGCATC AATATTTATC ACACCAATAG 11340 

ATTGTGTTGG ATAGACTTTA CGTGTTGCTA AATATTGAGC ATACGCAATA TCATGTCCAC 11400

CACCTAATAA AAATGTTTGT CTATGATTAG CAATTGACTT CGCTGCAAGC ATAGCAAATT 11460 

CTTTTTGAGT ATCAATTAAT TGCTCATGAT CATGATAAAC ATTTCCGTAA TCGACTAAAG 11520 

TTcACATTGA TTCAAATCCG GCAAACCTGC AAATGCTTGT TTAATCGCAT CTGGTCCTTC 11580 

TTTTGCACCA ATGCGCCCCT TGTTTAAAGC AACACCTTTG TCAACAGCAT AGCCTAATAT 11640 

ACCGACCCCT GATGGCATAC TACTCTTTTC CAGCTTAGAC AAATCTTCAA ATGTTACTGT 11700

TTGAAAATGT CTAAATTTTT TCGGGTCTGT TTCACTATCT AACCTTCCAG TCCATAAATT 11760 

TGGTTCACCT TGCTTGTACA CAGCATTTCC CCCTCTTATT TATGTGGCTT ATTAACAATT 11820 

AAAGTATAAC GTATAGGAAA TTTTGAATTC AATTCATAGT TAAATCCGTA TCTTAAAAAT 11880

ACTTATCTAC ATTACTTTTA CCCCTATTTT CTATGTAATA ACGAATACTT AGCTGATTTA 11940 

TGTTAATAAA ATACGTCAAG ACTATTACAT TTTCATTAAT ATTGACATAG ACAATTTATC 12000 

TCTCGGCTTG TAATATGTAT AATTGTTAGT AAAAGATATT TTGCTTGTTA CCTAATGGAG 12060 

GTTACATATA ATGAAGAACA ATAAAATTTC TGGTTTTCAA TGGGCAATGA CGATTTTCGT 12120 

CTTCTTTGTC ATTACAATGG CGTTATCCAT TATGCTCAGA GATTTCCAGT CTATAATTGG 12180

TGTCAAACAC TTTATATTTG AAGTTACAGA TCTAGCACCA TTAATTGCTG CAATCATTTG 12240 

TATACTCGTT TTCAAATATA AAAAGGTCCA ACTTGCAGGT TTAAAATTCT CAATCAGCCT 12300 

GAAAGTAATT GAACGTCTAT TGCTAGCTTT AATTTTACCT TTAATTATTC TAATTATTGG 12360

TATGTACAGC TTTAATACAT TTGCAGATAG CTTTATTTTA TTACAATCAA CAGGCTTATC 12420 

AGTACCTATT ACACACATTC TGATTGGACA TATTCTGATG GCGTTCGTAG TAGAATTCGG 12480 
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10 



TGTTGTTGGT TTGATGTATT CAGTTTTCTC AGCAAATACA ACTTATGGTA CAGAATTTGC 12600

TGCTTATAAC TTCCTTTATA CATTCTCATT CTCTATGATT CTTGGTGAAT TAATTAGAGC 12660

GACTAAAGGA CGTACAATTT ATATTGCAAC GACATTCCAT GCTTCAATGA CATTCGGACT 12720

TATTTTCTTG TTTAGCGAAG AAATCGGCGA TCTATTTTCA ATCAAAGTCA TCGCCATTTC 12780

AACAGCAATC GTTGCAGTAG GATACATTGG TTTAAGCTTA ATTATCCGAG GTATTGCATA 12840

TTTAACAACA AGACGAAACC TTGAAGAACT TGAGCCTAAT AATTATTTAG ACCATGTCAA 12900

TGACGATGAA GAAACTAATC ATACTGAGGC TGAAAAATCT TCTTCAAATA TTAAAGATGC 12960

TGAAAAAACA GGTGTAGCTA CTGCATCAAC GGTTGGTGTT GCTAAAAATG ATACTGAAAA 13020

TACAGTGGCT GACGAACCAA GCATTCATGA AGGTACTGAA AAAACAGAAC CTCAACATCA 13080

CATAGGTAAT CAAACTGAAT CTAATCATGA TGAAGATCAt GACATCACTT CGGAGTCAGT 13140

AGAATCAGCm GaATCAGTTA AACAAGCACC ACmAAGTGAC gATTTaACAA ACGATTCAAA 13200

TGAAGATGAA ATAGAGCAAT CATTAnAAGA ACCTGCGACT TATAAAGAAG ACAGACGTnC 13260

ATCAGTTGTA ATTGATGCAG AAAAACATAT CGAAAAAGCT GAAGAnCAAT CTTCAGATAA 13320

A 13321
(2) INFORMATION FOR SEQ ID NO: 5: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5:

ATGTGTTGTA AACTTTTATG TTGAAAAAGC TACTTATCTC AATGAAAACA AGTAGCATTT 60 

AATAAATTAA TTAGTATACA GCTAGTTTTT CTAATTGTTC TTTAACTTGA ATTAAGTTTG 120 

ACCGTATTAG AGAGGCAGAT TGATCCATCG TTTGAATTGC TTGTCCTTCA TTTTCGTTCA 180 

AGCCATTACA AACAACTTCA AACTGTTGTG CCATTTGATC AAGACGCGCA TGAGCTTGTG 240

TGTTTAAAAT AAACATATCG TCATAATGTG ATGGCGAATA GATAATTCGT CGTTGTATAC 300

AAACGTATAA AAACCTTGTC ATATCAACGG TTTTGGCATT TTTAAACCTC TGTGTTTTCC 360 

ACGCATGTTT GCCCTTATTT AAATAATTTG CCCTTTTTTC GCCCCGAAAA AAAAACACAA 420

AAAAATAACC ACACTCCTAA ATTAATAGGT GGTGTGGTTT TGTTGATTGT AGGGGTATAA 480 

AAATAACCGC ATTATTAAAG ATACGGTTAC TCTGTTATCT GTAAATATAA TAGTAGTTTA 540
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AAACAGGACT CCACATAAAA ATCAACTCCT TTATATACCA TAATGATACT ATATTTTCTA 660

GTTTATTTCA ATTTTTCAGT TTTTAAAAAT GAGTTTCTGT TTTTATTTAT ACGCTTTTCT 720

GTTTTCTTTT TAAATTTTAT CTTTTTGTTA TTCCATTCAT TGTAAAATTC TATTAAATTA 780

ACATAAAATT TTTCATGCCC TATTTTATTT GTTGATGAGA TATCAATGTA AAGACTCAAT 840

ATTGTTTTTA AATAGATTTG ATGCAACGAC TGATAAACCG TATTACTATC TGCTATGTTA 900

TTGGTAAAAT GCATAGAAAA ATATTCTAAT TTATTCATGC AATATATATG GGTTTCATTA 960

TACTTCTTAA TGAGTGTATT TATACCTTGC AATACGTCAT TACTTTTAAT AACAATTTCT 1020

TTTTCAOCTG TCGAAAAAGT CCACTGTTTA TCTCCTATAT TTTCTTTAAT TGTTTTCTTG 1080

TTGTCAAATT CTAAAATTAT AGCCCGTAAA CACTCTTCTT TATAATTCTC GTTCTTGAAA 1140

GTACGAAGCA AAATTTTTAT AAATTCGGTA TTGGTGACTT TTTTATAAGT GTGATATTTT 1200

GCAATCTCTT TATCAGTAAA GACTGTTCTT AGTTCGTGAT TATCAAAACT TAAATTCATC 1260

TTATTCTCTA ATTCATTAAT TTTATCTTGC AAACCAACAT TTTCTAAAAT TTTCTTGTTT 1320

ATCTCCCCTA TATCAAAACT CCTTTTCGAA ATTAATTTTG AAAACTCGTC TGCCATTTCA 1380

ACAGCCTTTT CTTTCCTTTT ATACCTTTTG TTAAATTTAT GAACCACCGT TGCAGCATAA 1440

TACGATATCC CACCAGATAA AATAGATGaT ATTATCGGTA TGTATATATC ACCTTTCATA 1500

TTTCCACCTC TTTTAACACA ATTAAGTATT ATGATACACA ACTTGCGCAA AAAGATGTAG 1560

ACAGAACATA ATGGCGAACA AAAACAACCA CCCAGTAACT AGTATGGGTG GCGTAgACTA 1620

TAACAACTCT ATGTTATCAA GATATATGTA TCGAGTGATG GCAAGGAAGA AGTCTCCTGC 1680

GGGACCAACA GTCAGATATA TGGCCTCTGC CGGGCTATAT AGTTCACTCC TACTATATAA 1740

AAGTAAGTAT AACATAAAAA GCACCCCGTA AACTGTTATA CGGGAATGCT AAAGTCATAT 1800

ATACTACGGG GAGTAGTATG AAAACTATGC TCTCTATCGT AAGAAAAAAC ACCCAGTGAC 1860

ATGCTTGGGT GAACAAGGAT AGATGTAAAT AGTTGATGCA TGTGTAcACA TCATAACAAA 1920

AAACTAGCCC GAAGcTAGCT ATAACATAAA AAAATAGGCA AGTACCGAAG TACCTGCCAG 1980

TTACGCACAT TTAAATCTTG AGAGTAATGT TAAAAAGTGT ATAGGAATAT TAACATCCAT 2040

CCAAATAGTT ATTTAATAAC TGTAAGATTC CCTATAATTA ATGTAGCaAA ATTTTTATTC 2100

TAAGTAAATA CTAAATCGTG CTAAACTTAC CAAAACTACT TATTCTATTA CCTGCCTTGT 2160

CTACCTCTCC TGTCGCTATA TAACGACGTT GTCCACTATT AGCAATATAA GTAATCCATC 2220

TATAGCCATT GATGCAATAT GCGCCGTCAT ATTTAATTGT TGCGTTATTA GGTAATACAC 2280

CTGTAATTCT TGAATTAGTT GAATAGCCGT CCCTTACGTT ATTACCTTTA ACATTGGCAA 2340








CTGGCACTGG TGGATTTTTT TGGTTTTTAG CTGATGTTTT AACATTACCA GCTACCAAAC 2460

CACCTATAGG CTTACCATGA ATCGCACCGG CTATTAATTT AGAATACAAG TCATAGTTTT 2520

TCTTAATCCA ATCCATATCA TTTTTATTAG TAATAAAACC TAATTCAGAT AAACGATAGT 2580

TTATATTTAT TTCTGCTGAT ACATTAACGT TTAGTAAATC ATTACGAGGT GTTACACCTC 2640

TTATTTGTCC TAAGTTATTT TTAATAACAT CTTGTATACT TTTATCAATA GTATCTGCAT 2700

TGAATTGACT TGAAATAATA ACATGCCCAC CACTTGCACT TTCTCCTGCT GCGTCTAAAT 2760

GAATCTCTAG AACAATGTCA TACCCATGTG ATTTAACCCA ATATAAGCCA TAATCTTTAT 2820

TATTTCCTAC ATTAACACCG TAAGCAGTAT CTTGATACAT ATCTTGTGAT TGACTTGAGC 2880

CACCATATAA TGCAACTTCG TGACCTGCAT GTCTTAAATA CTTAGCGATA TTTGGTGTTA 2940

TATATTTACG GATAAAATCA CGTTCATTTG TTCCGTTTCC GACTGCTCCA GGATCGTTAT 3000

AACCATGACC GGCTACAAGC ATAATTTTTT TAGGTTTAAT TACTGCTTGC TTTTTGGCAG 3060

TTGCTTGCTT AATAACGCTT TTAGCTTTAT CTCCAACACT TACTTTATCT GGGAAATTTA 3120

ATCTAATAAA ATACATTGGG TCATCGTAAT AATGAACATG TCTTGTAACG GTTTCGGGAC 3180

CCCAACCAGG TTGCGCAACG CCATTTGTCC AACCTTTACC ATTCCAATTT TGGCCAAACG 3240

ATGTGAAAGT GTTTAGATTA GCGCTCTCAA CAATTTCAAC ATGTCCaGct CCGCCACCAT 3300

ACTTTGACGG GAAAACGACA ATGTCCAACT TTTGCGGTAA AAAGCTATCA TAGTTTTTAA 3360

TTATTTGCCC GTATTTTTCA ATCCTTGCTT TATTATCAAA TGGAATATTA TAAGCGTATA 3420

AACCTTGTAA CcTTTCGCCT GTTGCTATCA TAAAAAACAT ATTTGCGTAA TCGTAACACT 3480

GAAATCCATA AAACAAATCA GGATTGAACT GCTTCCCTAA TGAATTATCA AACCATTTTT 3540

CTGCTTGGTT TTTTGTTATC AACATTGGTC AACACCTACC CTAAATCATT TGTGTCGTTC 3600

ATAOTCGTAG GTGTCATTAC TTCTTTAATT GGCGCTTGCC CTGTTGCTTT TCTATACTTG 3660

TTTTCAGCTT TATATTTCTT TAGCTTTTGA TTTGCCCATT TACCTTCTTG AGATGTTGGA 3720

TTATCTTTAT ATGTAGTATA TAAAGCAACA ACTGTTAAGA TAATCGATGA AACACTTTCT 3780

TCATCTACTG GTATCGGACT TATACCTTTA TTCGCTAAAA ACTGATTGAC TAATGCTAAG 3840

ATCAATACGA TGTATCTTGT TATTACTTTT GCATCCATTT GTTTGCTCCT TTTATCCAAA 3900

ATAAAAAGCC AGTGCCGAAG CACTGACTCT TAACTATTAC TTACACTTAC TAAACCAGAA 3960

ACACGACCAA AAGCTATATC CTAAAATTCC CTTAAGCATG GTAATCACCT CCTTTAAATG 4020

CCAAAAATAG TTTTTAACAA GGCTATAACA AATGTACTTA GAATCGTCCC TATTAATCCT 4080

AGAATCCACA TCTTGATGTC TCTAATATTT TTAGCATTTT TCTCTTTATT TTTTTCATCT 4140
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TGCGTTCTCA GACTGTCTTC TATTCTGTCG AATTTTTCAA ACATAGTCTT ATCATTTTCT 4260

TCTAATCGCG TTAAACGCCA ATCTTGTTCG TGTCGTTTGG TAAATCCAAA CATTACACCA 4320

CCCACTTTAT TCAAATTAAA AAGCCATAAG ATTATAACCT ATGACTCTAG ATTTTCTGGA 4380

TACTTTTCTC CTGTAATAAT TGCATATTCC TCTTTATCTA TAACTTCCAT ATCTACATAC 4440

CACGCTATAT CTTCTTTACT ATATTCTTTC AATTGATACC ATGTTTTAAT ATCTTCGAAT 4500

GTTGGTGAAA TTAATTTAAG CATTTTCAGT CTCTCCTTTA ACCTCTTCTA ATTTTTTATT 4560

AAGTGTCACA AGTTGTTTTG CCATTAGTGC ATTTTGCTTA TTAACTTGCA TCGATAACTT 4620

TGTACTTTGA ACAACTTGTT TCTGCATACT AGCAACCATT TTTCGTAAGA TGTCATCAGA 4680

AGCGACTGTG TTTTGTTCTT CACTGTCAAT CTGTTGATGC AAGTCATCTT TTTCTTCTGA 4740

ATAATCTTCG TTAAAAACTA TTTCCCCATT TGAATATTTA AAGGCTTTAG GTCTAAAAAC 4800

TTGAGAGAAA TT7TCTGGTA AATTTTCAAT ATCAATACCT TCTTCAAAGC CACCAATGAT 4860

AGCGTATGAA ATTATCTCAT TACGCTTGTT AACTAATATT TGCATTATTT TCTCACTCCT 4920

ATAATTTTGT TAATTGTCCC TCTATTTGCG TTCGCACCAG AGCCTCTTTG ACTTCCTAAG 4980

TCGAAATAGA CATCGTTTGA TATAGTTAAA GATGTACGAC TAGATTTAGT TAATCCAAAC 5040

TCATAAACAC CTCCACCATT TCCATCACCA TCTGGAAGAT TTGAGGGATT CAATGAAATC 5100

TTTCCTCCTC CAAAAGGACT GCCAAACTCT GTAAAGTCAC CACCTGGAAA AGTCCCATAA 5160

AAAATTAATA AAATAAATTG GTCTAAACTC TCATTTAAGT ACAATGTAGA GCCCACACCA 5220

TTTGCTGTTC CATCAAAAAT AACCGAATAC CTTTTATTAA ACTTGTCATC TGCGTATAAT 5280

TTAGCGTTAC TTTCGGCCAT ATTAGCTTTT GATTGGGCAC TTTGAACAGT TTCAAAAGGT 5340

GTATTGTAAT CATTAATAGC TAATTCTGAC CACTCAGACC ATGAACCCGC TTCTTTTCTT 5400

TTAftCAAATA CTTTATTTGT ACCGTTCGGT CGATAAGTCA TACGCTTGTA ATCTGAAGTT 5460

ACTACTAAAT ATTCGACAGT ACCGTTAGTA CTAACACCTC TTGGATAATT TATAGCTTGC 5520

GAAACATAAA TAAATTGGGT TGAATCACCT ATTCTTTGTT CTGGATTATT AAAATCAAAT 5580

CCAGTAATCT GCATTATCTT ACCATCATCT TTAGTAATCT TAGCTTTTTG CCAATTTGAA 5640

GTAGAACCAC TTGTGACTAA ACCACCACTA TTCACTGACT GCTTGAAGGC TTCATGTTTC 5700

TCATCCATAT ATCGCTTTTG CTCATCGAAT GTTCTTGAAT ATGCTTGCGC TTTATTTTCC 5760

AAATCAGATA TATGGCTATT AGCAAGTTGC TTTAATTCAT CTATACTTGA AGATTTTGCT 5820

ATTTGAATAT CTGATAGACC TTTTTCTTTA GCTTTTTCAA TCAGACTCGC ATAATCTTCA 5880

CCATTTTTTA TAGCCTCGTC CATTGCTTTC GCACGATCCA TAATAGTTTT TTCTAATTCC 5940
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TCAACGTTAA ATGTGATAGT TCTCTCGACA ACTACCACGT CTGAATTACC TAATTCTGCA 6060 

ACCGAAACTT GAGCTTGATA ACTTCCATCT CGTTTAATTA CATCATTAGG TAATTGAAAT 6120 

TTTAAAATAC CTTTAAATGG ATCTAATATT TCTAGTGGAG CAACTACCAT GACTCCTTTA 6180 

CCTGGAATCG CTATTCGTGC kTTGATATTT tCTTCACTCA ATAATAACGG TTGATTATTT 6240 

TTAGTGATAT TAAAAAGAAG AACAGAAGAA TCACTCTCTC CTGTTCTAAA AGTTATATCT 6300 

AGATTTGAAA TATTTCCATA ATGCGCTGTG TTTTCTAAAT TTATAGCTAC AGATTTCTCT 6360 

AAATTACTCA TTAACTTATA ATTCTCCCTT CGTGTAAAGT CCATGGCCCT GAACTTGTTT 6420 

TACTATCATA ATTTTTCAAT AGTATCTCAG CAGATGCTGT AACACTATTA CGAACTAGCC 6480

TATGAACAAA GCCACCTGTG TTTGAAGCTT CTACATATAA GTTCCAACCA GCTACCCCTT 6540 

TACGTTCAGT TGGAAAATCT GTAAAACGTT TTGTATCATC CGTAGTTAAA TAAAACGACA 6600

TGCCTACTAT GTTAATATCT GACATTTTTG TGATGAATGA AGGTACTCTC TCCCATTTAC 6660 

CACTATTTTT AGGCACATAA TTCCAGTCCG AAATGTCTGC AGTTCTTCCA GAAAGCACCC 6720 

TTTCAAAAGT CATCATATTC CTTGCATAAC TATTACGCGT CAATATCTGA ATTACATCAC 6780 

CGCCAGTTTG TGGTGGCTTA ACTTCCAAGA ACCAACCTGC ATCACGCCAT TCTCTTGGTA 6840 

ATGGGAAATC ATCGATTTGA ACTGTATGAT CAGTGTATAA ATAGTAAAGA CCTGGCTCTG 6900 

TTAACATCCC AAGATTCTTA AGTTTATCAG GCCTCATTGG TAAAGGTTTA ACTCTACCAC 6960

CTGTGTCACT CaTGATAAAA GGAACGCCTC TTGAGTGAAG TATTTCTAAA ATACCTCTTT 7020 

GCCCAATCAT GAAAATACGA TGTGTTCTAT TTCCaTCACC ACCGACAGTA ACACCTAGCA 7080 

TCAAAGCTTT TTTACCACTA TCTTTGTCAT AGTATATTTG CAAACCTTtC TgCTTCCGCA 7140

AATTCGCCAG GAAATGAATC tAgTGTTCCA CCATAGTCAG CATTAACCTG ATACGCTTCT 7200

TCTCCTGTTT CTAAATCGAA AGCCGTTAAA TAGTTTCTAT TATTTGGATT ACTGTCTCCT 7260 

GTATACCAAT ACAAGTATTT TTCATCAAAA GTCACACCCT GCATTGGTTG GGTTTCGTTT 7320 

GTTAGTCTCA TAGGGATACT GATTTTATGC AAAACTTTAT CAATATTTTT ATCAACATCG 7380

TCTAAACTTC TTATCTCTAT ATAAnTCATT GAGTTTTCAA GTTCCCACTG ACTTCTAGGT 7440 

CTCTCaATTC TGTATAGAAT TTTATTTTCT TTTTCATTTA TGACAGGGGT GATGTAGGGT 7500 

TTTTCTGGGT GTCCTGTAAA TACATCTTGC ATACCATACT TGCCATAGCT AATTTCCACA 7560 

TTAGGCGTAT ACTTGAAACG AACTAATGTA TTCTCATTAT TACCATTTAA GATAAAACTA 7620

TAAATCCATA ACTCATcATC AATATATCTA TAACCGTTAT GTGTACCATG ACCCCCACCT 7680 

ACAATCAATG AGCTGTCTAT AAATTGACCA TTAGGTCTTA GACGACTTAG CATATAGCCA 7740 
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ATTACTGCAT TTGTAAgAGG TGCAAGTTCT GTCACAAATA AAAATTCTTG CTTATCAGGT 7860

TCAAAACGAT ACTCGATATC AAGAATTTCT TGTTTGGTCT TATTTAATTC TCTTATAGTT 7920

TCCTCTTTAT TAATTTGAGT TTTGGTTTCC CAATCGTCTA AATGTTCTTT TAATGTGTCA 7980

AAGGTTTCGC CGTTTACATT AACTCGAGCT TGAACAATCT CATTAGCACT GTTATTACGT 8040

GGTGCCACAA CAAGTGCGTT AATTTGACTT TGTAAAGATT TGTTTACTGC TGCTTGCGAT 8100

CTACCATTAT AATAAATTTG CTCAGCGAAG TGTTGAATTG TTTTAGCTyT CTGATGCAAC 8160

TTAAACTCTG TTGTCAAGCC AAGCGCAAAT TGCTCTATTC TTTGTAAGTT TTGTATTTCC 8220

TTAGCTCTAT AATCTCGACC TGCTAAAGCT CCCAAATCCT TTATTAAATA CAAATTTTCC 8280

GTTATTTTCC ACAATTTACC GTTTGGGGAT TTCTGTACAA TGCTATTTTG AATAATTgcC 8400

TGctTCGCCT ATTTTTAAAT TATCTAATTT ATTTkTATCA TTTACCGAAA TGATACCGTC 8460

TTGAGGCAAT CCATCAATAn CACTACTGCC TGCATAAGGT ATCCCATTTA TAGCTTTCCA 8520

ATGTGTAGCT GGAAAGTACT GTTTATCGT 8549



(2) INFORMATION FOR SEQ ID NO: 6:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3601 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 6:

AGGCGTGTAG TGACTTACGG nTAGGAAACT ATGTATCCGA ATGATTTATT GAGACCAAAA 60 

AGGCATTAAA GTCCATTGAA ATATCnGGTA GCGmGTTGGT ACgTGGACGT GGGGGCCCTA 120 

GATGTATGAG TCAACCATTA TTCAGAGAGG ACATTTAACG TAATAAATTA TAGAmACGAG 180 

GGTGAAAATA ATGACAGAAA TTCAAAAACC GTATGATTTA AAAGGCAGAT CATTATTAAA 240 

AGAAAGTGAT TTTACCAAAG CAGAATTCGA AGGACTTATT GATTTTGCAA TTACATTAAA 300

AGAGTATAAG AAAAACGGTA TTAAGCATCA CTACTTATCT GGAAAAAATA TTGCACTACT 360 

ATTCGAAAAG AATTCGACGA GAACGCGTGC TGCGTTTACA GTTGCGTCTA TTGATTTAGG 420 

TGCGCATCCA GAATTTTTAG GAAAAAATGA TATTCAATTA GGCAAAAAAG AATCTGTAGA 480

GGATACTGCG AAAGTATTAG GTAGAATGTT CGATGGTATT GAATTCCGTG GTTTTTCACA 540
ACAAGCTGTT GAAGATTTAG CGAAGTTCTC TGGTGTACCG GTGTGGAATG GATTAACAGA 600
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TCTAGAAGGA ATAAACTTAA CTTACGTTGG AGATGGACGT AATAATATTG CGCATTCATT 720 

AATGGTAGCA GGTGCTATGT TAGGTGTTAA TGTAAGAATT TGTACACCTA AATCATTAAA 780 
TCCAAAAGAG GCATATGTTG ATATTGcAAA rGAAAAaGCG AGTCAaTATG GTGGTyCAGT 840

CATGATTACG GATAATATTG CAGArcCAGT TGAAAaTwCm GATGCTATAT ATmCAGATGT 900 

TTGGGTATCG ATGGGTGAAG AAAGTGAATT TGAACAcGTA TTAATTTATT AAAAGACTAT 960 

CAAGTGAATC AACAGATGTT TGATTTAACA GGTAAAGATT CAACGATATT CTTACATTGT 1020 

TTACCAGCAT TCCATGATAC AAATACACTT TATGGACAAG AAATTTATGA AAAATATGGA 1080 

TTAGCTGAAA TGGAAGTTAC AGACCAAATC TTTAGAAGTG AACATTCAAA AGTGTTTGAT 1140

CAAGCTGAAA ATAGAATGCA TACAATTAAG GCAGTAATGG CAGCAACATT GGGGAGTTAA 1200 

TCACTAAATG GAACGATATG AATATGATGT GTCTGATGAT ATAAGTGTCA TGTACAGACA 1260

CCTCATATTG GTATTAAAGG AGAAATGAAT ATGAACGAAT CAGGAGATAA CAAACTCAGT 1320 

AAATCTTCTT TAATTGGACT AGTTATAGGA TCCATGATTG GTGGCGGTGC GTTCAATATA 1380

ATGTCTGATA TGGGCGGTAA AGCCGGTGGA TTAGCCATTA TTATTGGTTG GATTATTACA 1440 

GCTATAGGAA TGATTTCATT AGCGTTCGTA TTTCAAAATT TAACCAATGA ACGGCCGGAG 1500 

CTAGACGGTG GTATTTATAG TTATGmTCAA GCAGGATTTG GCGATTTTGT AGGATTTATC 1560 

AGTGmTTGGG GATATTGGTT CTCAGCGTTT TTAGGCAATG TTGCCTATGC AACACTATTG 1620

ATGTCAGCAG TAGGTAACTT TTTCCCGATT TTTAAAGGAG GCAACACATT ACCAAGTGTT 1680 

ATTGTCGCCT CGTTACTACT CTGGGGTGTC CATTTCTTGA TTTTAAAAGG CGTTGAAACA 1740 

GCAGCATTTA TCAATAGTAT TGTTACTGTT GCAAAGTTAA TACCGATTTT ACTTGTAATC 1800

ATATGCATGA TAATTGCATT CAATTTTGAC ACTTTTAAAA CAGGCTTTTT CAGTATGACG 1860 

TCAGAGGGTG TATTGCCATT TAGTTGGGCG AGCACAATGA GCCaaGTtAA AAGTACGrTG 1920 

CTAGTGACAG TTTGGGTGTT TATCGGTATC GAAGGTGCAG TAATTTTTTC TAGTAGAGCT 1980

nAAAATGAGA AAGATGTAGG TAGTGCCACG GTTATAGGAC TTATATCAGT TTTAATTATC 2040 

TATyTCTTAT TAACTGTATT AGCTCAAGGC GTGATTTTGC AAAATCATAT TTCGCAATTA 2100 

GATTCGCCAA GTATGGCACA GGTGCTTGCA ACTATTGTAG GTGGTTGGGG ATCTACACTT 2160 

GTAAATATTG GTTTAATTAT TTCGGTACTA GGTGCATGGT TAGGATGGAC ACTGCTTGCT 2220 

GGTGAATTAC CTTTCATTGT TGCAAAAGAT GGATTATTTC CAAAATGGTT TGCTAAAGAA 2280

AATAAAAATG GAGCACCTGT AAATGCACTG CTTATTACCA ATATATTAGT ACAATTATTT 2340 

TTAATAAGTA TGCTATTTAC ACAGAGTGCG TATCAATTTG CATTTTCACT AGCATCAAGT 2400
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CGACAGCAAG CAACTACTAA ACAATGGACG ATTGGTATCA TAGCCTCAAT TTATGCTATA 2520 

TGGCTTATAT ATGCAGCAGG TATCAATTAC TTATTATTGA CGATGTTACT TTATATTCCA 2580 

GCTCTTCTTG TTTATACaAT CGkTCmAAAG rATwATCAGa CACGTTTGAT TAAATCAGrC 2640

TATATTCtTT TTATGATTAT tATCGTACTT GCAGTTATCG GGTTAATTAA GTTATTGATG 2700 

GGAACGATAA ATGTTTTTTA AAAGGAGCGA CAAAAATATG AAAGAGAAAA TTGTCATTGC 2760 

ATTAGGCGGT AATGCGATAC AGACAACAGA AGCAACAGCT GAAGCACAAC AAACAGCTAT 2820 

TAGATGTGCG ATGCAAAACC TTAAACCTTT ATTTGATTCA CCAGCGCGTA TTGTCATTTC 2880

ACATGGTAAT GGTCCACAAA TTGGAAGTTT ATTAATCCAA CAAGCTAAAT CGAACAGTGA 2940

CACAACGCCG GCAATGCCAT TGGATACTTG TGGTGCAATG TCACAGGGTA TGATAGGCTA 3000

TTGGTTGGAA ACTGAAATCA ATCGCATTTT AACTGAAATG AATAGTGATA GAACTGTAGG 3060 

CACAATCGTT ACACGTGTGG AAGTAGATAA AGATGATCCA CGATTTGATa ACCCAACTAA 3120 

AcCAaTTGGT CCTTTTTATA CGAAAGAAGA AGTTGAAGAA TTACAAAAAG AACAGCCAGA 3180

CTCAGTCTTT aAAGAAGATG CAGGACGTGG TTATAGAAAA GTAGTTGcGT CACCACTACC 3240 

TCaATCTATA CTAGAACACC AGTTAATTCG AACTTTAGCA GACGGTAAAA ATATTGTCAT 3300 

TGCATGCGGT GGTGGCGGTA TTCCAGTTAT AAAAAAAGAA AATACCTATG AAGGTGTTGA 3360 

AGCGGTTATA GATAAAGATT TTGCTAGTGA GAAATTAGCA ACGCTGATTG AAGCAGATAC 3420

CTTAATGATT CTTACGAATG TAGAAAATGT ATTTATTAAC TTTAATGAAC CTAATCAACA 3480 

ACAAATCGAT GATATTGATG TAGCAACACT GAAAAAAtAC GCGGCACAAG GTAAGTTTGT 3540

GGAAGGATCG tGTTGCCAAA AATAGAAGCT GCGtACgtTT GTTGAaAGtG GGGaAACCAA 3600 

A 3601 

(2) INFORMATION FOR SEQ ID NO: 7:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 573 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7:

CGACACTATT AAATGAATTA GAGCACAATC TAACAAATCA AATTCATTTT TCAAAAGATG 60 

AACGACTCAC ACATATCGCT TTAAAGTTAT TCGAAACAAC CGATCCTGTT TCAACAAAGC 120 

AACTTGCGCA AGATGTTAAT GTTTCGCGTC GGACAATTGC AGATGATATT AAAATGATTC 180 
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TTATTGGTGA GGAAGATCAT TATCGTAAAG CGTATGCACA CTTTATACAT CAATATATGA 300 

AACAAGCTGC ACCTTTTATA GAGGCGGATA TCTTTAATTC AGAATCAATC GCATTGGTTC 360

GCCGTGCCAT TATTAAGACA TTAAATAGTG AAAATTATCA TTTAGTTCAG TCGGCTATCG 420 

ATGGCTTAAT CTATCATATA CTCATTGCCA TTCAGCGTTT AAATGAAAAT TTTTCGTTOG 480 

ATATACCTAT CAATGAAATT GATAAATGGC GACATACTAA TCAGTATGCn ATTGCTTCAA 540

AAATGATAGA AAACTTAGAA CGCAGTGTAA TGT 573 
(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 8:

TTGATATTTA TAACGTTATA TTTTAATAGT TCACCTGGAT TATTAAATAA ATAGTCCGCC 60 

AAATTTTCTT TTTCTTTATC AATCTGaTkG TAATTAACaC TTTCGaCTTC TGTAGGAATT 120 

CTAATGTCAA CAGAAGCATT GATATAAGCT TGATGTTGCA TGCAATCACA CTCCTAATCC 180 

TTCATmTmAA ACGGAGAAGT AAACCCGTCA CTATTCAAAT TCAATCCTTT TGCCCAATCA 240 

ACAGGCTTAT TCATGATAGT TTCGATTTCC TTAAGTCCAT TTGAACCTCT AGGTATTTCT 300 

ACAATTACTT CATCATGGAC ATGGCCAACT ATTTTAAAAC CTAATGCTTC AAGCCTTGCT 360 

ATAGAAATCG CAAGTAAATC CCTTGCAGTT GCTTGAACAA TATTCTCGAC TAACTTCCCA 420 

CCATACGTTT TTAACTTTGA CCATTTACGG TTAAGATCTA ACCCCATAAA TTCAACAACT 480 

TGACSACCCC AACTATTTTC ACCAACTAAA GCTTTTGGAT AAGCTAAAGC TCTTCCACTA 540 

GGCAGTTCAA TCATTAGAAA ACCTTTTTTC ATATAAAATC TAAGTCCATG TGTATGATGC 600 

GTCTTTCGGG ATTTTACAGT ATTAATTGCA GCCTCTTGGC AAGCCTTCCA AAAATTAACT 660 

ATGTTAGGAT TTGCGTTACG CCAACTATCA ACTAAACCTT GTAACTCGTT TTCTTCAATG 720 

CCCATTTCCA ATGCACCCAT TGCTTTTAAA GCTCCAGCGC CACCTTGATA GCCTAAAGCT 780 

AATTCGGACA CTTTTCCTTT TTGTCTGAGA GGGTCGCCTT TAGTTATGCT TTCTACCGGT 840 

ACATTAAACA TTTGAGAAGC CGATGCTTCA TATATCTTTC CGTGTGTGTT GAATACATCT 900

AAACGCCATT GTTCTTTTGC ATACCATGCT ATGACTCTTG CCTCTATTGC AGAAAAATCA 960 

CTTACTGCTA GTTCATTACC TTCTTCAGCA GTAAATGTCG TCCTAACTAA TTGACTTAAT 1020 
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AGATCTCTTG CTATTTCTAA TTCAGTATCT GAAATATAAT GCTTTGTTAA ATTCTGAAGT 1140 
TGTACACCTC TACCTGCCCA TGTTCCAGTA CCGGCACCGT AAAATTGAAA CAGACCTCTT 1200

ACCCGTTCAT CACTGCACAT C 1221 

(2) INFORMATION FOR SEQ ID NO: 9: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1090 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 9:
TTTTGTTTGG TATGAGGTAG CAATGACGAC GTGTCATTGG TGGAGATTGT AAAAATACAT 60 

AATAAAAAGA AGCGGCAATG TATACCGCTC CTTTTTTATA CTACATACCG ATTTTCAACC 120 
ATCTCTTTCT ACTTAGTAAT AAGACAATAG TATTAACTAT AAATAGAAGA ACGAAGAATG 180 
ATACTATATT TATAATTTCA GTAGGACACA TAAATGTTGA CTCGTTATTC AATATTTTTT 240 
CTACGGCACG ATACATCGTA TTGCTCGCCT CAAATGGAGC AACGATACCA AATATATTTT 300

TATTAATGGC AACTAAGATG ACTGAACCAA TCCAATATAC AATGCTGATA CCTAAGCTGA 360 

TTAAAATGTT AGGTGAAACC ATACTAATCG TTCCAACAAC TAAGATATAT TGTAAGATAA 420

CGAGTGAAAA TAAGATTATT AATAGTAAGT AATGTGAGAA ATCCGAATAT ATAATTGAAA 480

TAATAGTGAT ACTTAGAATT ATGAACACTA AACATTCAAA AAATAACACT GCTACCTTTT 540 

TATAGAAGAA GGTAAAGATA TTATCGCCAA TCAATTTATA AAACAGGATA TTTTTATTCG 600

AATACTCTTT ATTAATAAAA TATGCAATAA CAAATGAAAA TAGTAAGAAC CCTAATTGCG 660 

TTGCAACAGT ATATGAACTG AAGAAAAACT GGCTATAGCT TAAACTTTTA ACTTTGTCTA 720 
TACCTATTGG TAAAAAATAC CCAAGTAAGA AAAGGAATGT GAATAGCACA ACAAGCGTGT 780

AAATAATTTT ATTGGAAATA CTTTTTTTAA ATTCTAATTT CAAAGTGGAC ACCTCAATTA 840 

TAAATTAATG TAATCATTTA TGACTTCTTC TTTTGATTGG TACTCTTCTA TTTGAAGGTC 900 

TTTAAAAATA AAGTATTTAC CCGGCAAAGC ACTTAAATCG GATAAATTaT GTGTAATATT 960 

GATAATAGTT TTAGTTTGAT GGCTTTGAAT AAAATCATTT AAAAATTCAT AAATTTCATT 1020 

AACTGTTTTC TTGTCTAAAG CGTTTGTAAC TTCATCTAAT ATGATTAAAT CATGATCTTC 1080

CAATAAGAAA 1090 
(2) INFORMATION FOR SEQ ID NO: 10: 
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is 



(A) LENGTH: 904 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 

TTAGGACTAT TTTATCATAT TCATTTAAAT TACGGCTAAA AATTTTAAAA ACGGGGATTA 60

ATATATGGAA TTAAGCTATG AAAGTTAATT GATACTTGCA TTTTACGCTG ATTTATATAA 120 

GAATAACTAT TGTATAGTTT TAAAAACGAA CGTACGTTTG CAGGAGGCGA AATCATTGGC 180 

AATGAATAAA CAAAATAATT ATTCAGATGA TTCAATACAG GTTTTAGAGG GGTTAGAAGC 240

AGTTCGTAAA AGACCTGGTA TGTATATTGG ATCAACTGAT AAACGGGGAT TACATCATCT 300 

AGTATATGAA ATTGTCGATA ACTCCGTCGA TGAAGTATTG AATGGTTACG GTAACGAAAT 360 

AGATGTAACA ATTAATAAAG ATGGTAGTAT TTCTATAGAA GATAATGGAC GTGGTATGCC 420

AACAGGTATA CATAAATCAG GTAAACCGAC AGTCGAAGTT ATCTTTACTG TTTTACATGC 480 

AGGAGGTAAA TTTGGACAAG GCGGCTATAA AACTTCAGGT GGTCTTCACG GTGTTGGTGC 540

TTCAGTTGTA AATGCATTGA GTGAATGGCT TGAAGTTGAA ATCCATCGAG ATGGTAATAT 600 

ATATCATCAA AGTTTTAAAA ACGGTGGTTC GCCATCTTCT GGTTTAGTGA AAAAAGGTAA 660

AACTAAGAAA ACAGGTACCA AAGTAACATT TAAACCTGAT GACACAATTT TTAAAGCATC 720 

TACATCATTT AATTTTGATG TTTTAAGTGA ACGACTACAA GAGTCTGCGT TCTTATTGAA 780 

AAATTTAAAA ATAACGCTTA ATGATTTACG CnwGGgTAAA GAGCGTCAAG AGCATTACCA 840 

TTATGAAGAA GGGAtCaAAG rGTTgTTAGT atGTCCAaTG ArGGAAAAGA AGTTTTGCCT 900 

GACG 904 
(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11271 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11:

GATTTCTAAA TCAAGATCTG TTTTACGATA ACCATTCAAA CCTTGACGTT CATCTTCTTC 60 

AGGTTGATTT TGTTGCTGTG TGTCTTTGTT GTCAGAAGTC GCTACTGTTT TTTTATTATC 120 

TGTTTCTTTA GTCATAACAA ACGCCTCCGT TATAAAACGC TATATTTAAT GATATGTGAT 180 
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w 



TTAATAAGAC GATTCAGCAA GTTTTAAAGT ATTATTTGAC TATGTTGGAT TAGGCATCTA 300 

GTCCTATAAT ATCACTGACA TTGTCAAAAT GATGATCTTT TAAGTAACGT GCGATGCCTT 360 

TGTTCATTTT CTTAGTTAAA CCTGGGCCTT CAATAACAAG TGATGAATAA ATTTGAATAA 420 

GTGACGCACC GTGACGCATC ATTTTGATTG CATCTTCAGT ACTGAATACG CCGCCTGTAC 480 

CTATAATTAA AAATTCACCA TTTGTTTGCT GATAAgCATa CTTAATCAAT TTTAAATTAC 540 

GTTCAAATAA TGGACGACCA CTCAAACCGC CTTCTTCGAC TTTATTAGCA GAAGTTAAAC 600 

CATCTCGTTG TCGCGTTGTG TTTGCTAAGA TGATACCGTC AAATGTCTCA GTAATCGCTG 660

GTAATAGTGC TTTTAAGCCA TCGAAATCCA TATCAGACGT TAGTTTTAAA TAAATTGGCA 720

CTGTTACATC ATGTTGTTTT TTAAATGCTG TTAAAGCTTG GCATAACATT GAAAATTCAT 780 

CTTTATCATG GAAGTTTTGA AGATTTTCAG TATTTGGAGA ACTGATGTTG ACTGTGAAAA 840 

ATGAAACGTC GTGTTTAAAC GTATCAATAA CCTTTATATA ATCTTGATAA CGCGCTTCAT 900

AAGGTGTCAT TTTATTCACA CCAACATTGA TACCAACAGG TACTTGATAA GCATTTTTAC 960 

GCAAATGACT TAGTGCTTTG TTCATACCAA TATTATTGAA GCCCATTCGA TTTATCAAGG 1020 

CGTCATCTTC TAATAATCTA AACATGCGTG GTTGAGGGTT ACCCGGTTGA GGTTTAGGTG 1080 

TGATACCACC TAATTCTAAA GCACCGAATC CAAGGTGTTC CAATGCTTTT GGTACTTCGC 1140

AAGATTTGTC GAAACCAGCT GCTAAgCCAA TTGGATTGTC GTACGTATTA CCTTGTATCG 1200 

TTTGTGATAA CGTTGGATTC TTATAAGTAA ATAGTTTATC GACGACTGGG AATAAAACCG 1260 

GaAACTTTTG TaACGTTTTT AATGCATCGA TAGTTAGTCC GTGTGCTTTT TCGGGTTCGA 1320 

TTTTGAATAA GAAAGGTTTA ATTAATTTGT ACATGAGTAT GCTCCTATTT CATTATATTT 1380

GAGGCTTACT ATCCTCAACT TAATATATGT GAAATATATT CTTTTAATAG ACTAGCATTT 1440 

CCATACATAA TTTCCTAGTT AAAACTAAAA AGTTTTGAAA ATTGACGCAA gTTTGAATAA 1500 

CGTTTTTAAG ATTAAATCAT CCTAATTAGG CAATATTATA GTATAAAGTA AGTAGATTGG 1560 

AAGGTGTTTG TATGAATGAA CAATGGTTAG AGCATTTACC TTTAAAAGAT ATTAAAGAGA 1620 

TTTCACCAGT GAGTGGTGGT GATGTAAACG AAGCATATCG AGTCGAAACA GATACGGATA 1680 

CATTTTTCTT ACTTGTCCAA CGTGGACGTA AAGAATCATT TTATGCTGCA GAAATTGCAG 1740 

GTTTAAATGA ATTTGAACGT GCAGGTATCA CGGCACCTAG AGTAATTGCA AGTGGCGAGG 1800 

TTAACGGTGA TGCGTATTTA GTGATGACGT ATTTAGAAGA AGGGGCTTCA GGGAGTCAAC 1860

GCCAATTAGG GCAACTCGTA GCTCAATTAC ACAGTCAGCA ACAAGAAGAA GGCAAATTTG 1920

GCTTCTCATT ACCTTATGAA GGTGGCGATA TTTCTTTTGA TAATCATTGG CAAGACGATT 1980 
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GGCTATGGGA TGCCAACGAT ATCAAAGTAT ATGACAAAGT GCGACGTCAA ATTGTGGCGG 2100 

AATTAGAAAA GCATCAAAGT AAACCGTCTT TATTACATGG TGACCTATGG GGTGGTAATT 2160 

ATATGTTCTT ACAAGATGGT CGTCCGGCGT TATTTGATCC AGCGCCATTA TATGGTGACA 2220

GAGAATTCGA TATCGGTATT ACAACGGTAT TTGGTGGTTT TACGAGCGAA TTTTATGATG 2280 

CGTATAATAA ACATTATCCA CTCGCAAAAG GTGCATCCTA TAGACTTGAA TTTTATCGTT 2340 

TATATTTATT GATGGTCCAT TTATTGAAAT TTGGTGAGAT GTACCGTGAT AGTGTTGCGC 2400 

ATTCTATGGA TAAGATTTTA CAAGATACAA CAAGTTAGTT AAGACGTTAG ATTGAGATAA 2460 

ATAGATAATA TGCACAGATA TTTTTACAAT GAGAAGCGAT ACAGCTGCCT CAATAAAAAT 2520

ATTTGTGCGT TTTTATTGTT GGAAAATAAA ATTTTAATCG CTATTGTTAA TTTCTGTAAT 2580

GTAAAACAAG GTTGAGTTAC AATAAAAGTG ATTTTATAAC TTTTTGTTCA ATAAAATTCT 2640

AGGAATGATA CATATTTATT GATACAATAA TTTTGAATAT AATCATAAAA CAATATTTAA 2700

GTATAATTGA ATGTTTGAAT ATCATATATT GATACAGTTT CTAATAATTT TAAAATAATT 2760 

TAAATGGAGA GAGGTGTAAA TGATGAGTAC AGTTCAAAGT GATATTTTTA AGACCAATAG 2820

TGCATCATCA TCTATTAAAA GCGCTGTTGA AACATGTAAT AATGTGTCGA AACCGGATAA 2880

AGATGAAAGT ACAACAGTAA GTGGAAATAA TAATGCTCAT AGTGTGATAG ATGATTTGAT 2940 

GAGTAAGAAT CAATCTGTTG CTGAAGCAAT ACGAACTGCG AGCGATAATA TACAAAAAGT 3000

TGGTGAGGCT TTTGACCAAA CTGACGTAAT GATTGGTAAT GAAATTGGTA AAAATTAAAA 3060 

CGTGGTGAAA TGATGTCGAA TAAACTGGAT GAAATCAATA AAATAATCAC AGCGAAACAT 3120 

GAGCAAATGG ATGACTTATA TGATGAAAAG CGAGAGGTTA AAGCATTGAT AGATGAAAGT 3180

GATGCGCTTA ATCATTCGAT AGATCAATTA TATCAACATT TAGGTGAGCG TTATTATAGT 3240 

AGCAATATGG CTAGTCGTAT GGAACAGTTC CGCGATGAAT TTCATTTTGC GAAACGACGT 3300

TCAACGGAAG CGTTATACGA GCAGCAACAG CAAATTCAAC ATGGCATTCG TAAAGTGGAA 3360

GAAGAGATGA TTGACTTGGA AATGCGAAGG AATGTTGAAA TTGAGACGGT GACAAAGGAG 3420

GAAAATAAAT GGAAACAATA GGAAGCATTA TTTATTTAAA AGAAGGTTCG CAAAAGTTAA 3480

45 

TGATTATTAA TAGAGGmCCA aTTGTAGAAA TTGAAAATCA AAAGTATATG TTTGACTATT 3540 

CTGCATGTAA ATATCCGATT GGTGTTGTAG AAGATGAAAT TTATTATTTT AACGAGGAAA 3600 

ATATAGATTC AGTTATTTTT AAAGGTTATT CTGATCAAGA TGAGGTTAGA TTTCAAGAGT 3660

TGTTTGAAAA TATGAAACAA AATTTGGATA GTGAAATACA ACGTGGAGAA GTTACACAAC 3720 

AATAAAGAAA TACTTTTTCT TTATTGGGGT GGGACGACGA AATAAATTTT GTAAAAATAT 3780

ATGTCATTCA TAATCATTTG AACTAAACGT AGCAGCCTTA AATTTTAAAA AAAGACACAT 3900

ACCAACTTCC GAAATGTAGA TGAATTCTCT ACAATAACGG AAGTTTTTCT TTTAATATTG 3960 

AAATTTCTCA AGGATAGGTC TATACTTTAT AAATCGTAAT TATTACGATT TATAATCAAA 4020 

AACAATAACT TGAAATAGAT CATTGAGGGA GTGTTAATAT GCAACATCAT AAAGTGGCTA 4080

TTATcGGTGC CGGTGCTGCA GGTATAGGTA TGGCCATTAC CTTAAAAGAT TTCGGTATAA 4140 

CAGATGTCAT TATTTTAGAA AAAGGAACAG TAGGACATTC ATTTAAACAT TGGCCGAAAT 4200 

CGACCCGTAC GATCACGCCA TCATTTACGT CTAATGGATT TGGCATGCCT GATATGAATG 4260 

CAATTTCCAT GGATACTTCA CCAGCATTTA CATTTAATGA AGAACATATT TCCGGAGAAA 4320

CATATGCTGA ATATTTACAA GTGGTTGCCA ACCATTACGA GCTGAATATC TTTGAAAATA 4380 

CAGTTGTCAC AAATATATCT GTAGATGATG CATATTATAC GATTGCAACG ACAACAGAGA 4440 

TATATCACGC GGATTATATC TTTGTCGCAA CAGGTGATTA TAATTTCCCT AAAAAgCCAT 4500

TTAAATATGG TATTCATTAT AGTGAAATTG AAGACTTTGA TAACTTTAAT AAGGGGCaAT 4560 

ATGTGGTTAT CGGAGGTAAT GAAAGTGGCT TTGATGCTGC ATATCAACTT GCAAAAAATG 4620

GCTCTGACAT CGCACTTTAT ACTAGCACAA CCGGTTTAAA TGATCCGGAT GCTGATCCTA 4680 

GTGTTAGATT GTCACCTTAT ACACGTCAGC GACTAGGTAA TGTCATTAAG CAAGGTGCTC 4740 

GCATCGAAAT GAATGTACAT TATACAGTTA AAGATATTGA TTTTAACAAT GGACAGTATC 4800

ATATCAGTTT TGATAGCGGA CAAAGTGTGC TTACACCTCA TGAACCAATA CTAGCAACTG 4860 

GCTTTGATGC AACAAAAAAT CCAATCGTTC AACAATTATT TGTGACAACA AATCAAGATA 4920

TTAAATTAAC AACACATGAT GAATCGACAC GTTATCCGAA TATTTTTATG ATTGGTGCAA 4980

CAGTTGAAAA TGATAATGCC AAATTATGCT ATATCTATAA ATTTAGAGCG CGATTTGCAG 5040

TACCTGCACA TCTTTTAACA CAGCGGGAAG GcTTACCAGC TAAACAAGAT GTCATTGAAA 5100 

ATTATCAAAA AAATCAAATG TATTTAGATG ATTATTCATG TTGTGAAGTG TCATGCACAT 5160

GTTAGAAGTG aaatatgata tgagaactgg GCATTATACG CCCATACCTA ATGAACCTCA 5220 

TTATTTGGTT ATTAGTCATG CGGATAAACT TACCGCAACA GAAAAAGCGA AATTAAGATT 5280

ATTAATCATA AAACAGAAAT TAGATATTTC ATTGGCAGAA AGTGTAGTTT CTTcGCCTAT 5340 

AGCGAGTGAA CATGTGATAG AACAATTGAC ACTATTTCAA CATGAGCGAC GACATTTAAG 5400 

ACCTAAAATA AGTGCGACAT TTTTAGCCTG GTTGTTGATA TTTTTAATGT TTGCATTGCC 5460

AATCGGTATC GCTTATCAAT TTTCAGATTG GTTTCAAAAT CAGTATGTGT CAGCATGGAT 5520 

AGAATATTTA ACTCAAACAA CATTGCTCAA TCACGATATA TTACAGCATA TATTATTTGG 5580

ATTGATTAGT TTATCAACTG CTATAATTGA TCAAACAGGA CTCAAATCAT GGATGATATG 5700

GGCAATTGAA CCGTCAATGT TATGGATAGG ATTACAAGGT AATGATATCG TGCCACTATT 5760 

AGAAGGGTTT GGATGTAATG CAGCAGCTAT TTCACAAGCA GCACACCAAT GCCATACCTG 5820

CACGAAGACA CAGTGTATGA GTTTAATAAG CTTTGGTAGT TCTTGTAGTT ATCAAATAGG 5880 

TGCGACATTA TCTATTTTTA GTGTAGCTGG AAAGTCATGG CTATTTATGC CGTACTTAAT 5940

ATTAGTACTT TTAGGTGGCA TCTTACATAA AGGATATGGT TGAAAAAGAA TGATCAACAA 6000 

CTTAGCGTTC CGCTACCTTA TGATAGGCAA TTACATATGC CAAATATACG TCAAATGTTG 6060 

CTACAAATGT GGCAAAATAT ACAAATGTTT ATCGTTCAAG CGCTACCTAT TTTTATCACA 6120

ATCTGTCTTA TTGTTAGTAT TTTATCACTA ACGCCAATTT TGAATGTTTT ATCACAAATA 6180 

TTTACACCTA TATTATCGTT ATTAGGCATC TCGTCAGAAT TGTCACCAGG GATTTTATTT 6240

TCAATGATTC GAAAAGACGG CATGCTCTTG TTTAATTTGC ATCAGGGCGC CTTATTACAA 6300

GGAATGACAG CAACACAGTT ACTACTACTT GTGTTTTTTA GTTCAACATT TACAGCGTGC 6360 

TCGGTCACAA TGACGATGCT TTTGAAACAT TTAGGTGGTC AGTCAGCACT AAAATTAATT 6420 

GGAAAGCAAA TGGTGACATC ATTGTCTTTA GTTATTGGTG TAGGCATCAT TGTTAAAATA 6480

GTAATGCTGA TTATTTAAAA AAAATGAACT ATAACTGAAT ATAGAGTCAT GTCAGTCAAT 6540 

AGGAGATCTA TCTTGGAATA TGCTATTCAT ATGAAGTATA AGAGGAGAGT CGCAGATGAA 6600 

AATAGTTATT ATAGGTGGGT TTTTAGGTGG CGGTAAAACG ACTGTCTTAA ATCATTTGCT 6660 

CGCTGAATCA TTAAAGGAAT CGCTGAAACC AGCAGTCATC ATGAATGAAT TTGGGAAAAT 6720 

GAGTGTTGAT GGTGCCTTAG TATCTGAAGA CATACCTTTA AGTGAACTGA CAGAGGGGTG 6780

TATCTGTTGT GCAATGAAAG CAGATGTATC AGAACAGTTA CATCAATTAT ATTTAAAAGA 6840

GCAACCAGAC ATTGTATTTA TTGAATGTAG TGGGATTGCA GAACCGGTCT CTGTCTTAGA 6900 

TGCTTGTTTA ACGCCTATTT TAGCTCCGTT TACAACAATT ACACATATGA TTGGTGTAAT 6960 

AGACGCAAGC ATGTATAAAC ACATTAAATC ATTCCCTAAA GACATCCAAG GCTTATTTTA 7020 

TGAGCAATTA GCATATTGTT CTGTCTTATT TGTTAATAAA ATAGATTCAG CAGATGTTGA 7080 

AACAACGAGC AAACTATTGA AAGATTTAGA AGTTATTAAC CCAGAGGCCG ATATACAAGT 7140 

CGGTATGCAT GGCAGCGTCA CTTTGCCAAT ATCAGTTAGA CAAATGACAG CAACTTCTGA 7200 

CAATAAACAT AAGTCTTTAC ATCAAATGAT TAATCATCAA TTTGTGCAAT CACCAGTCAA 7260

ATGTACTAAA GCAGAGTTTA TAAAACGTTT AGCATGCCTT CCGTCTCATA TTTATAGGTT 7320

GAAAGGGTTT ATGACATTTG AAGACACCGC ACATACGTAT CTCATTCAAT TTACACAAGG 7380

CGGAAAGGGT ATTTCAAAAG AAGACTATCA ATGTTTGGAA CAGTAGTGTT TTCAGTGGAA 7500

GAGAATGGTT AACATGCCTT CATGTATAAT AACGAGTTGA TTTGAACGTT TAAGCGTAAA 7560

TAAAAATAAG CTTGGTCAGC CATCAAATAT AATTTGAAAA CTGTCCAAGC TGTTTTATTA 7620

GAGAACAATC AATTAACCCC ACATATTTAA TAATACATCA GCAAAGCCTT CAGGTTTTTG 7680

AATATAACCT AAGTGACCGC CTGGAATATC TACAATAGGT ATGCCAGTTT CTTTATTTAT 7740

ATAAAAGTTA ACATCTTGTG GGAAGGAGCC TCTAGAATCT GTCCCATTTA GTAGGGTGAT 7800

TTTATCGCTG TATTTTGTGA AATCATCCAA AGTAATATCT GAATGCGTAT ATTGTCTAAT 7860

TTCAAATTCT GACCAGAACA TCGTACGTTT GTACTGTTCT ATACGTCCTT CTTCAGTATC 7920

AGCAGGTTGA GACATCATTT TTGCATCAAT TGGTGCGATA TTTAATGTTT CGCCAAATGT 7980

TTTCATGCCT TTTTCTAAGC CTTCTGTTAA AATTTGATGC ACAATGTCAT CATTTTTATC 8040

TTTCCAATAA GTACTGTCTG GTAAAAATGT ATTAATTGGT GGTTCGTGAA ATGCAATCTT 8100

TTTAACGACT TCAGGGTAAT CTTTTAACAC ATGCATCGCA ACGATTGAAC CTGAACTTGA 8160

ACCTAATATA TAGACAGGTT CATCACTTAA TGACTTTGCA AGTTCGGCAA TGTCCTGTGC 8220

GTCGCGTTTG ACACGATAAT CACTGTCAGG GTTTGAAGCG GAATCAGGGA GTGGTTCAGT 8280

TAACTCGCTT TCTCCATAAT CACGACGATC AACGGCTACA ACAGTAAAAT GGTCTTTTAA 8340

CTGTTCTGCA AGAGGCAGAA AAATGTCTCC GGTACCGTTT GCACCAGGAA TAAAGATGAG 8400

CACGGGTCCT TGTCCGACTT GGTGGTATCG TAATTTAGCG CCTTGTAATT CTAAAGTTTC 8460

CATATTCAAT GACCTCCATT TGTTAATTGT TAGGTGATAA ACCTAATAAT TTAGCACCAT 8520

TTGTATAACT TATTTTCTCT TTTTCTTCAT CTGTTAAACC CAGTTCATCT AAAAATACAC 8580

CTAATTTTTC AGGCTCAATA TATGGATAAT CAGCAGCATA AAGAATTCTA TCAATACCTA 8640

CTTCTTTCTT GACTAAATCA AACTGTGGCT TCGTTAACAT GCCACTCGGT GTGATATAAA 8700

AATTATTTTT AAAGTAATAG CTTACAGGGT GGTTCAAATG TTCAGCGAAT AAAGCTTCAT 8760

CCATACGTTC TAAGAAGAAT GGGATAAACT CACCCCAATG TCCAATAATC ATATTTAACT 8820

TTGGATAACG ATCAAAAATA CCAGATAATA CTAGATGTAT TGTATGAATG CCGACATCAA 8880

TGTGCCAACC ATAACCAAAA CAAGCAAATG TTGCCGCAGT TACTTCAGGA TAATTTCCTT 8940

TATAGTATGA TTGATAAATG TCACTGTTAA CTGGCGCGGG ATGTAGATAA ATCGGTACGT 9000

CTAAATTTTC AGCTGTTTTG AAAATAATGT CATATTTGTC TTGATCAAGA AAACCATCTT 9060

GTGCACGTCC CATAATGAGC GCACCTTTGA ATCCTAAATC ATTGATGCAA CGTTCGAATT 9120

CTCGCGCTGC GGCTTCAGGC TCATTGATAG GTAAAGTTGC AAAGCCTACA AAGCGATTGG 9180

TCTGACCAAC CAAATTTGAA GGAGAACCAT TTCCATAAGA TAAGACTTGA ATTTGAACGT 9300

CTTGATTATT CATAAATTGG ATACGTTCAT CATGATGTGA TAATTCGTCG GCATTTGTAA 9360 

AACCTGTCTT TTTTTcAAGG CCTTCTAACA TTACTTTCAT CGGTACACCT TTAGGATCTG 9420 

CTGATATCGC ATTCATCGTT TCTTTTTGAA TATCTTCAAT GACATAATGT TCTTCAAACG 9480

TAATACTTTT CATTTACTTC GCCTCCATAT TGTATTGCAT GTTTATTGCA TCTATTGCAG 9540

AAGCATTTTT TATATACCTC TAATTTCAAT GTTTGTAACA TAAAATTGAT CTACCAAGGC 9600

ATCTCTCCAT CGCCATTAAT AAATGTACCT GTTGGGCCAT CTGCACCAAT CGTTGCTAAT 9660 

TGAATGATTG GCTTGATTCC TTCAGAAACG TGTTTGGAAT TATTACTAAA ATCACCAACT 9720

AAATCAGTAT TTGTAGCGCC TGGATCAGCA GCATTGATTT GCATGTTAGG TAATCCTTTA 9780 

GCGTATTGTA GCGTTAGCAT TGTTACTGCC GATTTAGACG AACAATAAGC TAATGAATTC 9840

ACTTTAGATT CAGCTGTTTC GGGGTTTGTA ACCATTCCAA ATGAACCTAA ACCACTTGAT 9900

ACGTTGACGA CAACAGGTTG TTCAGATTTT TCTAAGAGAG GGACGAATGT ATTCATCATT 9960 

CGTACGATAC CGAATACATT CGTTTGATAT ACTTCTTCAA CGTCACGAGG TGTCAATTTG 10020 

GAAGGTGCTG AAAATTGACC AGATATACCT GCATTGTTAA TGAGGATATC AAGACGGCCT 10080 

TCTTTTTCAG CAATCATGTT ATAAGCATTT TTGACTGAGT AGTCACTTGT AACATCTAAT 10140

TGTACATAAT GAACACCTAA TTTTTGTGAT GCTTGTTGTC CTCTTACATC ATTCCGAGAA 10200 

CCTATATAAA CTTTGTAACC CAATGCTTTA AGTGCCTCTG CACTTGCATA GCCTAACCCT 10260 

TTATTGCCTC CTGTGATTAA CACAATTTTA GTCATTACGT CCCACCTCAT CTAAATAAAT 10320 

GTTTAATAAA TAATTTCTGT ACGCTTCAAT TGAAATATGG CGATGCTCTA TTTGGAAGGC 10380

AAATACACTA GTTGATAATG ATTGCAACAG CATATCTGTT TTGAAtTCGT GTAAGTGTCG 10440

TCATCGCTTT TAAATAAGTC ATAATAAAAA TCAAATAATT CTTGATAAAA TGCGCTTTGG 10500 

TAAAAACGTA ATTTATTGTT GCCTGCTTCA ATACATTGCA GTAGTGCCTT ATTATCGATT 10560 

TTAAATTGTA AAAGATAATC TAACGACACT TGCATAACCT CATAATTAGA ATGATAGTCA 10620 

TCTTTAATTT GCTTAAAATG AGTGATAAAA ATATCAAGGT CTCTTTGTAT GACGTAGTAG 10680 

CATAAATCGC TTTTATCTTT GAAATGTCGA TACAATGTCC CCATACCGAT ACCTAGTTCT 10740 

TTAGCAATAC GATTCATACT AATGTTTTCA ACGCCTTCTT CATCAAAAAG TTTGTGCGCT 10800 

ATTTCTTCAA TTCGTTGCCT ATTCTCTTTT GCATCTTTTC GCATGATTAC ACCTACTTAA 10860

AATTCTCTAA AATTGACAAA CGGATAACTC TCCGTTTATT ATAAAACGTG TTAAGAAAGT 10920 

TAGCAATGAA TTTGCAATAA CTATTAAATA TCATAAAAGA AAAGAGTGTT GATAATGTCT 10980

ACCTTATCGG TTCAAATGAT TGCTGAAAAA CTGAATGTCA CTACAGAAGA TGTGGAAAAA 11100

GTATTAGCTA TGACAGCGCC ACTAGGCATT TTTAGTCATC AATTACAACG ATTTATTCAT 11160 

TTAGTATGGG ATGTCAGAGA TGTAATAAAC GACAATATTA AAGGAAATGG ACAAACACCA 11220 

GAACCATATA GGTATTTAAA AGGTGAAAAA GAGGACTATT GGTTTTTAAG A 11271 



(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6261 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12:

CAACCCGTTC AGAACAAAAT AAAAACCGTA CAATTTTATC ATCTTAATGA TTATTGTACG 60

GAAAAACTTT TTTACATCAT ATCTGCATGT GCATAATCGA TATCGGTAAA TTTATTATAT 120

TGTTTCATAA AATGTAACTT AACTGTGCCT GTTGGACCGT TACGTTGCTT AGCAATGATA 180

ATTTCAATTT CACCGTTTTC ATCATTCGTT TGTGGCTCGA AACCACCATC ATCGTCATCA 240

TCTTCATCGC CGCCACGGTT ATAGTAATCA TCACGGTATA AGAATGCAAC GATATCGGCA 300

TCTTGCTCAA TCGAACCAGA TTCACGAATA TCACTCATCA TTGGACGTTT ATCTTGTCGT 360

TGTTCAACAC CACGAGATAA CTGACTTAAT GCGATAACTG GACATTTTAA TTCACGGGCT 420

AATGCTTTTA ATGTACGAGA GATTTCAGAA ACTTCCTGTT GTCTGTTATC GGACGCACGT 480

GAACCACTAC CTTGAATCAA CTGTAAGTAG TCAATCACAA TCATGTCTAA GCCATGTTCT 540

TGCTTTAATC GACGACATTT AGAACGTAAA TCATTAATTC GAATACCCGG TGTATCATCA 600

ATAAAAATCT TCGTACGTGA TAATTTACCT ACCGCTATAG TAAAACGACT CCAATCTTCC 660

TCAGTCATAG TACCCGTTCT TAAGCGGTTT GAGTCAACAT TTCCAGAACT ACAAATCATA 720

CGTGTGGCTA ACTGATCAGC ACCCATCTCT AGCGAGAAAA TACCAACTGT ATACATATCT 780

TCATGCGTTG CAACTTTTTG TGCAATATTA AGTGCGAACG CAGTCTTACC TACAGATGGA 840

CGCGCTGCAA GGATAATTAA ATCATTTCGG TTGAACCCTG CTGTCATTTG GTCTAAATCT 900

CGATATCCTG TAGGTATACC            CCACTATTTT GATCAAGCTC TTCAGCTGTT 960

TCATACACTT GTCCTAAGAC GTCTCGAATG TCTTTAAAGC CATCGCTTTC ACGAGAAGAT 1020

GATAGCTCTA AAATTCGACG TTCTGCATCA CTTAAAATCG CATCTAGTTC AAGTTCATCA 1080

TTATATCCAT CATTGGCAAT ACTATCTGCA GTTTGAATCA ATCTACGTTT TAATGCATGC 1140

TCTGCAAGAT ATTGCGGGCC ACCCGCTTcA TTCAACGTAC CTTCCGTCGA TAATTGATCC 1260

ATCAATGTTA CAACATCAAT TTCTTTATTA TCTTCATTTA AGTGCATCAT TGCACGGAAA 1320 

ATATGTTGAT GGGCACCCCT ATAAAACGAC TCAGGAAGCA AAACTTCCTG AGTAGTATTA 1380 

ATCAATTCTG GATCTATAAT AATTGAACCT AAGACAGACT GTTCAGCTTC ATTGTTATGC 1440

GGCATTTGAT TTTGCTCATA CATTCTATCC ATGAATGGTT ACACCTCTTA TTTCAATCCA 1500 

ACTTTATTGT TCAACTGTGT GTACGCGAAT TGTACCTTCA ACTTCTTTAT CTAATTTAAC 1560 

AGGTACATTC GTATATCCTA GGGAATGAAT TCCATTTGGT AAATCCATTT TACGTTTATC 1620 

AATTTTAATA TCATGTTGTG CTTTTAGTGC TTCGGCAATT TGTTTTGTAC TTACTGACCC 1680

AAACAATTTA CCACCTTCAC CAGTTTTTGC TGaTACTTCA ACTTCAATGT TTGATAACGT 1740

TTCTTTTAAT GCTTTAgCAT CTTCAATTTC TTGTTGGCGT TCTTGTTTTG CACGTTTTTT 1800

CTGTAACTCT AATTGTTTAA GGTTACCTGG TGTTGCTTCT ACAGCATAAT TCTTTTTCAA 1860

TAAGAAGTTA TTTGCATAAC CTACTGGTAC TTCTTTAACT TCACCTTTTT TACCTTTACC 1920 

TTTACCTTTA ACATCTTGTG TAAAAATTAC TTTCATGCAT CTTCACTCCT ACTTAATTGT 1980 

TCTGTAATTG CTTGTTGTAA TTGTGCTATC GCCTCTTCGA CTGTCACACC TTTAAGTTGT 2040

GTTGCCGCAT TGGTTAAATG TCCACCGCCA CCAAGTGCTT CCATTGTTAA CTGGACATTT 2100 

ACTGAACCGA GTGAACGCGC AGATATACCA ATCAGATTAT CTTCACGTCT CGCAACAACA 2160 

TATGATGCTT CAATACCTTC TAAACTTAAC AGTTCATCTG CTGCTTGTGC AACTGTTACT 2220 

GGATGATAAA TTTTATCGTC TGAACCATGC GcAATGGCTA TGCCATTATC TTCAACTTTT 2280 

ACAGTTCGAA TTAATTCAGA TCGATTAATG TAAGTATCCA CATCATCTTT TAAGAAATGT 2340

TGCGTTAAAA TCGTATCTGC ACCATGTGCA CGTAAATAAC TCGCTGCATC GAATGTTCTT 2400

GATCCTGTTC GTAATGTAAA GTTTCTTGTA TCTACAATAA TACCTGCATA CATCACTGTT 2460

GATTCAAGAC GTGTTAAACG TTGTTCTGTT GGTTGATATT CCAGTAACTC TGTTACCAAT 2520 

TCAGCTGTCG AACTTGCGTA TGGTTCCATA TATATCAACA ATGGATTAGA GATGAAGCTT 2580 

TCACCACGTC TATGATGATC GATAACAACT TTACGGTTTG CTTTATTTAA GACATTTTCA 2640

TCTAAAACCA GTTCCGGTTT ATGCGTATCA ACAATCACTA CGGTTGTCTT AGATGTCATC 2700 

ATATCCCAAG CATCATCTGA TGTAATAAAT CGCTCTCTTA ACTCTGGCTT TTTATCTATT 2760 

TCGTTCATCA CGCGTCGTAA TGTTGGATCA ATGTCAGTCT CATTTAATAC GATGTATGCT 2820

TCTAAATTAT TCATCATTGC AAATCTAGAC ACACCGATTG CTGCACCAAT TGCATCTAAG 2880

TCAGGACGTT TATGTCCCAT GATAATGACT TTGTCACCCT CTGCAAGGAT ATCTTTTAAC 2940

CCATAGAAAC GCACATTACC ATTAATACTT TTAATTGCAA CTTGGTCGCC ACCGCGTCCT 3060

AATGCTAAGT CTAGGCCTGA TTGTGATAAT TCACCTAAGT CGATTAAATT TTCAGTACCT 3120

TCACCAACAC CGATACTTAA TGTTAATTGG GCACGATAAC CAACACTTTT TTCACGTAAT 3180

TGACTCAAGA TATCAAATTT AGATTCTTCT AAGTCAGCTA ATATTTTTTG ATTTAAATAG 3240

GCTACGAATT GATCGGAACT GTATCTTTTG AAAAATATAT TATACTCAGT TGCCCATCGA 3300

CTAATGACAC GCGTTACCAT TGAGTTGATT TCCGAACGCT GCGTATCATT CATATTTTGC 3360

GTAATCTCAT CGTAGTTATC TAAAAATAAT GTCGCAATGA TTGGTTTAGA ATTTTCATAT 3420

AGTTCATTTG TTTGTACTTG TTCAGTTATA TCAAAGAAAT AGAGGCAGTG ATCATTCTCA 3480

GAATAACGTA CTTGGAAATG ATACTGATTA TATTCTATTT cAACGGATTT CACTCTATCT 3540

AATTGCTTTA AAATGTTTGG AAATACTTCA TTTACAGATT CAGAAATGAC ATTCGCTTCC 3600

ATATGATCTG TCATAAATTG GTTAACCCAT TCGATGTGAT CATTTTCATC TAAAACAATG 3660

ATACCAATTG GTAAATGTTT GATTGCTTTA TTATTTGTTG TTGAAATTTG AGCACTCAAA 3720

CCATCTACAT AACTATCCAT TTTCATTAAA GCTTGTCTGA ATAAAATGAT GCTAACAATA 3780

ATCATCACGA CAAGAACGAT AGATGCAATT AGTGCTATAA GACTATTAAA GATAAACCAT 3840

ACACCCATTA AAACAATTGC TGTGATGATC ATGATGACAA ATGGTATTAG TAAAGCTTTC 3900

TTAGTGGACT GCCGATTCAT TATTCCACCT CTATTCACTT TTTAGAATTA TTTTTCATGA 3960

TTCGCTTCAA ATTCAAACTT AAATCGATAA CACCAAGTAG TCCTACAATA TGTGTCGTAG 4020

GTGTCAGTAT TGTACCGATA ACCAATAGTA AAATCGTTAC TGCATTCGGC AAACCTTTCG 4080

CTTTACCAAA GAAATGAATA ACACTTAAAC CTTGAATATA CATTACTAAT GATAACACAA 4140

GTTGGAAGTT TAAAAGAATG CTCTGGAACA CACTCGGTTG ACCTGTAAAT AATAAACATA 4200

TGATAACAAT AATGTATATC CATAATAAAA TACCGCTCAT TTGCCACGCG AAAAGTGGCT 4260

TAAATACAGG TGTAGCGATT TTAAATTTTC GTAAAATCGG AAATGTAACG ATTAAGTTAA 4320

TTAAGACGAT TAAAAATGTA ATGATAATGA TGAAACCTGG TAATTGAACG GTCGCTTGTC 4380

TAAACCCTTC TTCTAATATT TGGGTCATAT TCGCATCGGC ACCGCTCATC GTAATCGCTT 4440

CATGTAATGT TTGCTTGAAA GGTTTTACTA TGCTCGCTGA TGGTGGAATC CTTCCGAATG 4500

TTTGTAGTAA CATAAAAGCG ATTAATGAAA TTnArCTCAT CGCTACTGTT GTTACGTATA 4560

ATATTCTTTC TTTAGACGTT CTTTCTTTGA GCAATTGACC AATAATTAAA CTTGCAATTA 4620

AGACTAATAT GATGGCACTT AAAACGAAAG TATTACCTAA AACAGTTGTT ATAATTACTG 4680

TAATAAGTGC ACTAATCCCG AAAGATTGTA TTGATTTATT CCATAAAACG ATACCTGGTA 4740

CAAATACCAA CGCAATCGTT GCAATTATTG TTGCTTTAGG TTGTATTTTT GAAAACACAT 4860

AAGCCACTCC CATATTTTTA ACTATAGCTA TTATTTTAAC CTCTTTAATG AAAATTAACA 4920

ATTTATAGAT TGTATGCTTC TATTTCATTT AATTGAATAA TAACTTTCAT GTTTTATAAG 4980

TAATTAACAT ACTCATTTGA ATCGCTTTTG TGTGCTTTCA TTTTCAACAT GATTATTTAA 5040 

TCCCACTACA TAGCAATCAA GCTTGATTTA GATTTACAAT ACATTTCCAC TCTCATGTAC 5100 

TCTAGATGTT TTTGAATATG ATAACTGTGA TTTAGTGGCT TCATTCTTTG AAAATATATA 5160 

TTATTACTTA CGCTTAAAAT GCTTTAAATT TAAGAAATGA TATAAGTTAG GTGCCCAGGT 5220 

ACTAAAGTTT AGTAGGaATC CATCATGCCC AACATTATCA GGCACGAAGA AATGACGATG 5280 

ATATTTAAAA CGTTCACCTA ATGCACGAAC TTGATCATCC GGATATAGCA AATCATCTAT 5340

GAACCCCATC GTTAACACTT TTGTTTCTAA ATTTTTAAAA ACATGCGTTA CGTCTGTGCG 5400

ACCTCGGTCA ATGTTGTGAC TATCCAATAC ATCTAGCAGT GTCAGATAAC AATTCAAATC 5460

AAAATGTTCT TTAAATTTAT TACCTTGATG TTGTTGGTAT GCGACTACTT CATCCGGCGT 5520 

AAAACGTTCA TCATAACTTT TTGATGATCG ATATGTCAAA AAACCTAATT GGCGTGCAAT 5580 

ACTTAGACCT TCCTTACCAC CAAGATGAAT GGCTTGCCTT GCAATTTCAT TGAAAGCTCT 5640

ACTATAAGAT GATGTTCGAC TTGTTGCAGC AAGGATAATG GCTTTATCTA CTTCAAACTG 5700 

TTGATTGTAG AGTAGTTCCA TTGCTTGCAT ACCTCCAAGA CTTCCCCCTA TTAAAATATT 5760 

AATCTTATCA TAACCAAGGG CTTGTATACC TCGTTCATTC GCTCTGACTA TATCTCTTAA 5820 

TGTTAATTTT TTAGGAAAAT GAGGGTCGTT TAAAGGTGAA CTTGAACCGA AAGGACTACC 5880 

AATAACATCA AATGTTAAAA ATTGATAATC GTGAATGGGT ATATATCCCC CATCAATAAT 5940

TTCTCGCCAC CAACCCGGAT AATCATCTGT TCCATATGTT AAATGATTGC CAGTTAATGC 6000 

ATGACAAACT ACAACTAATG GTTGTCCATG ATAACCGACA TGCTCATATC TCAAACGCAA 6060 

GTnATCTATG ACTTCCCCAG ATTCTGTAAT AAATTCCCCT AAATTTAAAG TATCTACTGT 6120

GTAATTTGTC ATTGTTCTTT CCTCCTTAAA CAAAAAAACT TCTCACCCTA TTGAAAAGTA 6180

AGAAGTCTTT ATACTTATCA TTCGAGTAAC TCGTTGGTTT TAGCACCGTG CTATAAAGTC 6240

GGTTGCTGAA GTATCACAGG G 6261 

(2) INFORMATION FOR SEQ ID NO: 13: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1222 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 13:

ATGCGATTAA CTCTGGAAAT ATCTTTTCCA TATTTACGTn TTAAATTATT CAGCAAATTC 60 

ATACGAGaTT CATACTCGTT yAACACTTGT TCGTCGAATT CTGTATTAGC CATTTCATCA 120 

TATAACTCAT GTTTTGCATC TTCTAAAATG TAGTAAAATT GATCAATATC TTCTTTTAAT 180

TTGTCATATT TGTTTGGAAC TATATCGTTT ATTGTTAACA AATGGTTGCT TAGTTCATAT 240

AAACGATCAG TGATAGCATT TTCATCCGTT AATGTCATAT ATGCGTTATT AAGCGCTAAG 300 

CTTAATTTTT CAGAGTTTTG AATGCGTTTA ATATCTATTT CAAGTTGCTC TATTTCGCCT 360 

TCTTTTAGAT GTGCTTCAGA CAATTCTTCT AATTGGAATT TCATTAAATC TAAACGCTGT 420 

AGCAATGCTT GGTCTGCTGA TTCTAAATCT TCTAACTCTT GCTTTTTGGC TTTATAATTT 480

TGAAAAGTTT GGTGATATTT ATCCAACAAA TCTTGATAAC GTGATTCTGC GTAATTATCC 540

AATAATGTTA AATGGTATTT TTGTTTCAAC AAAGACTGCG TTTCATGTTG GCCATGAATA 600

TCTAATAATT CTTGCATAAC TTTTCGTAAA TCTTGTAAAG TAACTGTTTG ATTATTAATT 660 

TTACAAAGAC TTTTACCAGA GCTGAAAATT TCCCGTTTAA CTAATAAAAA ATCTTCATCT 720 

ACATCAATAT CCATATTTTT CAATATATGT ATAGCATCTT TACTCTCGTC AATATCAAAT 780 

ATACCTTCGA TGACAGCCTT TTTTTCACCA TGTCTTACAA AATCAGATGA AGCTCTCATT 840 

CCAATTAATT GTCCAATTGC ATCTATAATA ATTGACTTAC CTGAACCCGT TTCACCACTT 900

AAAACAGTTA AACCATCAGA AAATTGAATT TCTAATTCTT CAATAATAGC AAATTGCTTG 960 

ATTGATAAGG TTTGTAACAT AAACTCATCG CATCCTTATA ACAAATTGAA AATTCTTGAC 1020 

TTGATTTCAT CACTTGCCTC TTTGCTTCGA CAAATAATTA AACAAGTATC ATCACCACAA 1080 

ATTGTGCCTA GTACTTCTTC CCAATTGATT TGGTCTAATA TAGCTCCAAT AGATTGTGCA 1140 

TTACeAGGTA TGTTTTTAGA ACAAGTAAAT TATCAGTACC ATCTATATTA ACAAAGGAAT 1200 

CCATTAAATA ACGTCCCAAT TT 1222

(2) INFORMATION FOR SEQ ID NO: 14:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 14: 

TTTGTTATTA TTACnTnAAA TAATTGCATT ACTTTTTACT GATGGTACAA CTTTCCATCC 60

TTCTTTTGGC ACGACATAAT TGTCTTTATC TTGAACTAAA TATCCGCCAG ATACTGAAAC 180

AAACTCTTCT TCGTTACTGT CTATAGTCAT ATCAATTTCT AATAATCTTA CATTCTTCTT 240 

TTGTTTTAAA ATATCTAATG CTTCATCTGT AAATTTTGGT GCAATAATGA CTTCCAAAAA 300 

GATACTATGC AATTGCTCTG CTAACTCAGG TGTTACAGCT CGGTTTAATG CAACAATTCC 360

ACCAAATATT GATTGACTAT CCGCTTCATA CGCATGTTGA AATGCTTGTT CTATCGTGTC 420 

ACCGATACCA ACACCACATG GATTCATGTG TTTAACCGCA ACTGTAGCAG GTGTATCAAA 480 

CTTTTTAACT AAAGCTAGTG TAGCATCTGC ATCTTTAATA TTGTTATAGC TTAATTGTTT 540

CCCATGTAAT TGTTTAGCGC CTGCAATCGT GTGCTTAGCA TTCGAAGTTC TCACAAAATA 600 

CGCTGATTGT TGTGGATTTT CTCCATATCT TAAAGTTTCT TTATCCCCTT TAAAGAAACG 660 

TACAATCGCT TCATCATATT CTGCAGTATG CTCAAAAACT TTAATCATTA ATGATTGTCT 720 

ATATGACTCA TCTAACGAAT CGTTTCTTAA TCGCGTCAAT ACTTCTTGAT AATCTGCCGG 780

ATGTACAATT GTTGTTACAT GTTTATAGTT TTTAGCTGCA GCACGTAACA TTGTTGGACC 840 

ACCAATATCA ATATTTTCAA TTGCTTCGTC CATCGTCACA TCAGGGTTTG CAACAGTTTG 900

TTGGAATGGA TATAAATTAA CTACTACCAT ATCAATTAAA TCTATATGTT GTTCTGATAA 960 

TTCATTTAAA TGCTGCGGTT TATTTCGATC AGCTAAAATG CCACCATGAA CAGCCGGATG 1020 

T 1021 

(2) INFORMATION FOR SEQ ID NO: 15:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 15:

TCATTCACTC CTAAATTGTT ATTACACTAT TACACaTAGC TAATCATCAA TGTGAAATCA 60 

CCTTCAAAGA CACTATCCAA ATCTTCAGAA GTCAAAATAA AGTTTGTACC AGTAGTCAGT 120 

TTGAAAATTT CACCATCGAC AATCATTTGC CCTTCGCCTT CCAACACTGT AACTAAACAG 180 

AACTCTCTAG GCTTCATATA ATTTAACGTG CCAGAAATTT CCCATTTAAC CAATGTAAAG 240 

AAATCATTCG ATACAATGTG TGTACACTTA TGGTTTTCAA TAATTTCGCT TTCAGGCAAA 300 

ATATTAGGTA ATGGTGCATT GTACTGAATA ACGTCTAAAG CTTTTTCAAT ATTTAACGGT 360 

CTATCATTAT ATTGATTATC TTGACGATTG AAATCATAAA GTCTATATGT AATGTCTGAC 420

ATAAAAtAGa ATTCyCCAGG kTTTACtTTA AtatATCyAA gTAtCGaCtC tATCGTTCCG 540

TGTTGAACAT GATTCGCAAC TTCTTCTCTA GACTCTGCTA ATGTCCCtAT AACTATTTCT 600

GCATCTTCTT CTGCATCTAT AATATACCAA CATTCAGATT TGCCATATTG CCCgTTTTCA 660

TGCTCATAAG CATAAGAATT ATCAGGGTGC ACATGAATAG AAAGTGATTC TCTTGCATCC 720

ACTATTTTAG TTAGAAGCGG AAAATCTTTG CTTGGGAAAT CACCAAACAA TTCACGATGT 780

TCTGACCAAA TACGGTCTAA TGTTTGACCT TGATATGGTC CATTAATAAT CTCGCTCGTA 840

CCATTTGGAT GTGCTGACAC ACACCAACAT TCCCCCAGTT GTATCATTGT CTAATTGATA 900

TCCAAACTCA CTTAGACGTT GACCGCCCCA TAATTTTGTT TTTAAAATTG GTTGTAAAAA 960

TAATGGCATT GTTGCACCTC CATTGTGATT AAGTAAGCAA TAGAACTCTG ATGTTGTTGT 1020

TCCATTATAT TTTGATTTTG TTCTCATTTA CATCGTATTA TTAACTTCCA CATTTCAAAT 1080

TAACTATTAG TGATTGTACC ATATTTACTA ACATTGCAGT ACTGCCAATT AAAAGnGCTT 1140

CACTTAAATT TACAGTACTT TAACATTTTC AAAAATTTAT AGCATAGAGA TTATATCTCT 1200

CTTACATTTG TACATATTTC CCTTTAAATT TACTCGCCCA TTATACCAAT TAATAaACAA 1260

CTTTAATAGT TGTGCCATAC ATTGTTCAAA TTCTTTGTAA AAGGCATAGA CAATACGTAC 1320

TTATTCATAC TTATAATTCA TCATTTTCAA AAAATAACGA GTTACGAAAA AGTAACCCGC 1380

TTCAAATCAT ATTTACTATC CTTATTAATC CGTTTCATTT TCAAATTGAG TTAAAGCATC 1440

TTTAATGTCC TGATCACCAC TAATAATTTG AAACTCTTGG TGATTAAAAT GATTGGATGT 1500

GACAATTTCT TTTAATACTG TCGCAACATC TTCTCTAGGA ATTTCACCTT TACCATCAAA 1560

ATATTGTGCA GCTTCTATCT TTCCAGATCC TGCTGCATTT GTAAGTGCCC CTGGATGTAA 1620

AATTGTATAA TTGAAACCTG nAACGTCTTA AATAGTCATC AGCGTAATGT TTAGCTATTG 1680

TATAXGGCTT TAAATCACCG CTATCATCAA AAGCCTGACG TCTCGAATCA TATGTTGAAA 1740

CCATGACATA GTGTTTAATA TTGGCCTCTT TACTCGCAAT CATTGATTTA ACAGCACCAT 1800

CTAAATCGAC AATAATTGTT TTATCTGCAC CCGTGTTCCC TCCAGAACCT ACTGAAAAGA 1860

TAACTTTATC GAATGGTTTA AACGTCTCAG TTAAAGTCTC TATTGAATCA TTTTCAACAT 1920

CAACAAGAAT TGCTTTCATA CCTTGTGATT TTAACGCATT AAGTTGATCT GATTGCCTAA 1980

CACCAGCAGT AAATGGTACA TTTTCTTTTG CTAATTGTTG CACTAGTAAC GAACCTACAC 2040

CGCCATTAGC ACCTATAACC AAAATATTCA TTTACAACAC TCTCCTATkT ATTATTCTCT 2100

ATGCCATACC ACTTTATGAG ATATGTAAAA CTTGTTACAA CTATAAAAAT CAATTGACAT 2160

ACTACTGGGA ACGTATTAAA TTAATATATG AACAAATATT CATATGAAAG GATTGTCATA 2220

tCaAGGCATT AGcGATTACA ATCGAATACG TATCaTGGAA TTGTTATCaG TCAGCGAAgC 2340

AAGTGTTGGT CACATTtCAC ATCAATTGAA TTTATCTCAA TCAAATGTCT CGCACCAATT 2400

AAAATTACTT AAAAGTGTGC ATCTTGTGAA AGCAAAACGA CAAGGCCAAT CAATGATTTA 2460

TTCATTAGAT GACATCCACG TAGCAACTAT GTTAAAGCAA GCCATACATC ACGCGAATCA 2520

TCCTAAAGAA AGTGGGTTAT AATATGTCTC ATTCACATCA TCATCATGAC CATATGCATA 2580

GTCATGTAAC TACAAATAAT AAGAAAGTAT TGTTTATATC GTTTTTAATA ATCGGTCTAT 2640

ATATGTTTAT CGAAATCATC GGCGGTCTCC TTGCTAACAG CTTGGCATTA CTATCTGACG 2700

GTATCCATAT GTTTAGCGAC ACATTCTCAT TAGGTGTTGC ACTTGTCGCA TTTATTTATG 2760

CTGAAAAGAA TGCCACAACT ACAAAAACAT TTGGTTATAA ACGTTTCGAA GTACTCGCAG 2820

CGTTATTTAA CGGTGTAACG CTTTTTGTAA TAAGTATTTT GATTGTTTTT GAAGCGATTA 2880

AACGTTTCTT TGTTCCTTCT GAAGTTCAAT CAAAAGAAAT GTTAATCATT AGTATTATCG 2940

GTTTAATTGT CAATATCGTT GTTGCATTCT TTATGTTTAA AGGCGGCGAC ACTTCACACA 3000

ATTTAAATAT GCGTGGTGCT TTTCTACATG TTATCGGAGA CTTATTAGGT TCAGTTGGCG 3060

CCATTACTGC AGCTAkTTTA ATTTGGGCAT TTGGATGGAC AATCGCCGAT CCTATCGCAA 3120

GTATTTTAGT TTCCGTTATT ATTTTAAAAA GTGCTTGGGG TATCACAAAA TCTTCAATTA 3180

ACATTTTAAT GGaAGGCACA CCAAGTGATG TTGATATAGA TGAAGTTATA ACTACTATTA 3240

AAAAGGATTC ACGAATACAA AGTGTGCATG ATTGCCATGT TTGGACAATT TCAAATGATA 3300

TGAATGCATT GAGTTGTCAT GTTGTTGTAG ACCATACATT GACAATGAAA GAATGTGAAT 3360

TATTATTAGA AAaCATTGAG CATGATTTAT TACATTTAAA TATTCACCAT ATGACTATTC 3420

AATTAGAAAC GCCTAATCAC AAACATGATG AATCGATTAT ATGTTCAGGA ACACATAGTC 3480

ATTCACATAA CCATCATGCT CATCATCACG CGCATGTACA TTAATAATTT TAACCTACTG 3540

CCATTGCATC GATTAAACTT TTCAATGGCA GTAGGTTTTT TATGTCTTTA TGGCGACTTG 3600

TTTGGTCTTT GATGATGCAA TGTTTATTAA CAAATTTTCA ACTATTATTT CTTACATTAG 3660

TCATATTTTT GACAATTTAC TATTATAATT CTCTAACTTT AGTCACTTTA ATTAATTTTT 3720

ATTAGATATT AATATGAAAA TAACGTGTTT TTTGTTATT 3759



(2) INFORMATION FOR SEQ ID NO: 16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13086 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 16:

TAATTATCGC GCATAACAAA ACATTAGCAG GACAATTATA TAGTGAGTTT AAAGAATTTT 60 

TTCCTGAAAA CAGGGTGGAA TACTTTGTAA GTtACTATGA TTATTATCAn CCAGAGGCAT 120

ACGTACCGTC TACTGACACT TTTATTGAAA nAGATGCCTC AATCAnTGAT GAAATTGATC 180 

AACTACGACA TTCTGCTACA AGTGCATTAT TTGAACGCGA TGATGTAATT ATTATTGCTA 240 

GTGTAAGTTG TATATATGGT TTAGGTAATC CTGAAGAATA TAAAGATTTA GTAGTAAGTG 300 

TTCGAGTTGG TATGGAAATG GATAGAAGTG AATTACTTAG AAAACTTGTc AGATGTGCAA 360 

TATACACGAA ATGACATCgA TTTcCAACGA GGAACGTTTC GAGTGCGTGG TGATGTAGTG 420 

GAAATATTCC CAGCCTCTAA AGAAGAACTT TGTATAAGGG TTGAGTTTTT CGGCGATGAG 480 

ATTGACCGTA TCCGAGAAGT TAACTACCTA ACAGGTGAAG TGTTGAAAGA AAGAGAACAT 540 

TTTGCGATAT TCCCAGCTTC TCACTTCGTA ACACGTGAAG AAAAGTTGAA AGTTGCGATT 600

GAACGTATTG AAAAAGAATT GGAAGAACGA TTGAAAGAAT TACGAGATGA GAATAAATTA 660 

CTAGAAGCGC AAAGGTTAGA ACAGCGTACC AACTATGATT TAGAAATGAT GCGAGAGATG 720 

GGATTCTGTT CAGGAATTGA AAACTATTCC GTACATTTAA CTTTGCGACC ACTGGGTTCG 780

ACACCATATA CTTTATTGGA TTACTTTGGC GATGATTGGT TAGTAATGAT TGATGAATCA 840 

CATGTGACAT TACCGCAAGT TCGAGGCATG TATAACGGAG ACAGAGCGCG TAAACAAGTT 900 

TTGGTGGATC ATGGGTTTAG ATTACCGAGT GCATTAGATA ACCGTCCACT TAAATTTGAA 960

GAATTTGAAG mAAAGACAAA ACAACTTGTG TATGTATCTG CAACGCCTGG ACCATACGAA 1020

ATTGAACATA CGGATAAGAT GGTTGAACAA ATTATTCGTC CTACTGGTTT ACTGGATCCT 1080

AAGATTGAGG TTAGACCTAC TGAAAATCAA ATTGACGATT TATTAAGTGA AATTCAAACA 1140 

AGAGTgAGCG TAATGAACGC GTACTTGTTA CAACGCTCAC TAAAAAGATG AGTGAAGATT 1200 

aACCACATAC ATGAAAGAaG CGGGTATTAA aGTtAATTAT CTGCATTCAG AAATCAAGAC 1260 

ATTAGAACGA ATTGAAATAA TTAGAGACTT ACGAATGGGT ACATATGATG TTATCGTAGG 1320 

TATTAATTTA TTAAGAGAGG GTATTGATAT ACCAGAAGTT TCTCTAGTTG TCATATTAGA 1380 

TGCAGATAAA GAAGGGTTTT TACGTTCTAA CCGCTCATTA ATTCAAaCAA TAGGTAGAgC 1440

TGCGCGTAAC GATAAaGGTG AAGTCATTAT GTATGCCGAT AAAATGACTG ATTCGATGAA 1500

GTATGCAATT GATGAGACAC AACGTCGTCG AGAAATACAG ATGAAACATA ATGAAAAACA 1560

50 TGGTATTACA. CCTAAAACAA TTAATAAAAA AATACATGAT TTAATTAGTG CTACTGTTGA 1620 

AAATGACGAA AATAATGACA AAGCACAAAC TGTGATACCT AAGAAGATGA CGAAAAAAGA 1680 
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TTTCGAGAAA 


GCTACAGAAT 


TAAGAGATAT 


GTTATTTGAA 


TTAAAAGCAG 


AAGGGTGACA 


1800 




AGTAAATGAA 


AGAACCATCC 


ATAGTAGTAA 


AAGGTGCTCG 


TGCGCATAAC 


TTGAAAGATA 


1860 


5 


TTGATATCGA 


ACTACCTAAA 


AaTAAATTAA 


TTGTTATGAC 


AGGTTTATCT 


GGGTCAGGTA 


1920 




AATCGTCATT 


AGCATTCGAT 


ACTATATATG 


CTGAAGGACA ACGACGTTAT 


GTTGAATCAT 


1980 




TAAGTGCCTA 


TGCGCGTCAA TTTTTAGGCC AAATGGACAA ACCAGATGTT GATACAATTG 2040

AAGGATTATC GCCAGCAATT TCAATAGATC AAAAAACAAC AAGTAAAAAT CCAAGATCAA 2100

CTGTAGCAAC AGTAACAGAA ATATATGATT ATATACGTTT GTTATATGCA CGTGTTGGTA 2160

AACCTTACTG TCCAAATCAC AATATAGAAA TTGAATCGCA AACAGTACAA CAAATGGTTG 2220

ACCGCATTAT GGAATTAGAG GCACGTACAA AGATTCAATT ATTAGCACCT GTCATCGCTC 2280

ATCGTAAAGG TAGTCATGAA AAGCTAATCG AAGATATTGG TAAAAAAGGT TATGTACGTT 2340

TAAGAATCGA TGGCGAAATT GTTGATGTAA ATGATGTACC TACTTTAGAT AAGAACAAGA 2400

ATCATACAAT AGAAGTTGTT GTAGACCGAT TAGTTGTTAA AGATGGAATT GAAACACGAC 2460

TAGCTGACTC TATAGAAACT GCCTTAGAGC TTTCAGAAGG ACAATTAACA GTCGATGTCA 2520

TTGACGGGGA AGACCTTAAG TTTTCAGAAA GCCATGCTTG TCCTATATGT GGATTTTCAA 2580

TCGGAGAGTT AGAACCAAGA ATGTTTAGCT TTAACAGTCC TTTTGGTGCT TGTCCGACAT 2640

GTGATGGCTT AGGCCAAAAG TTAACAGTCG ATGTAGACTT GGTTGTTCCC GACAAAGATA 2700

AGACGCTAAA CGAAGGTGCA ATAGAACCTT GGATACCGAC GAGTTCTGAT TTTTATCCAA 2760

CATTGTTAAA ACGTGTTTGT GAAGTTTATA AAATCAATAT GGATAAACCT TTTAAAAAGT 2820

TAACAGAACG TCAACGTGAT ATTTTATTGT ATGGTTCTGG TGACAAAGAA ATTGAATTTA 2880

CATTTACACA ACGTCAAGGT GGTACTAGAA AACGAACAAT GGTTTTCGAG GGTGTAGTTC 2940

CTAATATAAG TAGACGATTC CATGAATCTC CTTCAGAATA TACACGTGAA ATGATGAGTA 3000

AATATATGAC TGAACTACCT TGCGAAACTT GTCATGGAAA GCGATTGAGT CGTGAAGCkT 3060

TATCTGTTTA TGTAGGTGGT TTAAATATTG GTGAAGTAGT CGAATATTCA ATCAGTCAAG 3120

CGCTGAACTA TTATAAAAAC ATTGATTTGT CAGAACAAGA TCAAGCGATT GCAAATCAAA 3180

TATTGAAAGA AATTATTTCC CGACTCACTT TTTTAAATAA TGTGGGACTT GAATATTTAA 3240

CGTTAAACAG AGCTTCAGGT ACACTTTCAG GTGGTGAAGC ACAACGTATT CGATTAGCAA 3300

CGCAAATTGG GTCGCGTTTG ACTGGTGTCT TATATGTATT AGATGAGCCA TCAATTGGAC 3360

TGCATCAAAG AGATAATGAT CGATTAATTA ATACACTTAA AGAAATGAGA GATTTAGGAA 3420

ATACTTTAAT TGTAGTTGAA CACGATGATG ATACAATGCG TGCGGCTGAT TACTTAGTGG 3480
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AGGTAATGAA AGATAAAAAA 


TCATTAACAG GACAATACTT GAGTGGTAAG AAACGTATTG 3600

AAGTACCTGA ATATCGCAGA CCGGCTTCAG ATCGTAAAAT TTCTATACGT GGAGCTAGAA 3660

GCAACAATCT TAAAGGGGTT GATGTGGACA TACCACTATC AATCATGACG GTTGTTACAG 3720

GTGTATCAGG TTCTGGTAAA AGCTCATTAG TAAATGAAGT ATTATACAAA TCATTAGCTC 3780

AAAAAATTAA TAAATCTAAA GTAAAGCCAG GATTGTACGA TAAGATTGAA GGTATTGATC 3840

AACTTGATAA AATTATTGAT ATTGATCAAT CACCAATAGG TAGAACGCCA CGCTCTAATC 3900

CAGCAACATA TACTGGTGTG TTTGATGATA TACGTGATGT GTTTGCGCAA ACAAATGAAG 3960

CTAAAATTCG AGGATATCAA AAAGGGCGTT TTAGTTTTAA TGTAAAAGGT GGACGCTGTG 4020

AAgcTTGTAA AGGTGACGGT ATTATTAAAA TTGAAATGCA TTTTTTACCT GATGTTTATG 4080

AGTGTGTGAT GGTAAACGAT ATAATCGTGA GACACTAGAG GTTACTTACA 4140

AAGGTAAAAA TATTGCTGAC ATTTTAGAAA TGACTGTTGA AGAAGCAACA CAATTTTTTG 4200

AAAATATTCC TAAGATTAAG CGCAAGTTAC AAACACTAGT TGATGTTGGT CTTGGATACG 4260

TCACATTAGG TCAACAAGCT ACAACGTTAT CAGGTGGTGA GGCTCAACGT GTGAaACTTG 4320

CATCTGAACT TCATAAACGT TCAACTGGTA AATCTATTTA TATCCTAGAT GAACCGACAA 4380

CAGGGTTACA TGTTGACGAT ATTAGTAGAT TATTAAAAGT ATTAAACCGA TTAGTTGAAA 4440

ATGGTGATAC TGTTGTAATT ATTGAACATA ACCTAGATGT TATCAAAACA GCAGACTATA 4500

TTATAGACTT AGGTCCTGAA GGTGGTAGTG GCGGTGGTAC TATTGTTGCG ACTGGCACAC 4560

CCGAAGATAT TGCTCAGACA AAGTCATCAT ATACAGGAAA GTATTTAAAA GAAGTACTTG 4620

AACGAGATAA ACAAAATACT GAAGATAAAT AAGATTAAAA GAAGTGAAGG ATGTTATAAA 4680

TTTATCCTTC GCTTCTTTTT ATTAATTTAG TAATGAATAG TAGAAAGAAA AGATGCGTAA 4740

AAAGAATTAT GTTAAGATAG GGTCAATCTA GAGTAGTTAA ACATAAATCG AACTGGGAGT 4800

GGGACAGAAA TGATAAAGAA TCACTAATGA TTTATTATGT AGTGGTTCTT TGTCATTAGC 4860

CACAGCTATT GTGTACTTAA AAATAGGaat GCaTgAGTGC AACTCATGCA TAAGaAATAC 4920

TAATTTCTAA AGAAAAAGTA TTTCTTTATG TTGGGGCCCC GCCAACTTGC ATTGTTTGTA 4980

GAATTTCTTT TCGAAATTCT TTATGTTGGG GCCCCGCCAA CTTGCATTGT TTGTAGAATT 5040

TCTTTTCGAA ATTCTTTATG TTGGGGCCCC GCCAACTAAT TCCAATATAT CATTGTAGAG 5100

CTTAGGTCAT TGATTTTTGG CTCGGACTTT TATGGCGATA TGAACCATGT AAATTAAGCA 5160

AGCAATAAAT TAATGATTGA TATTGACTTG TAAAATAATA ACAATAATGA ACAATTAATA 5220

TTTATTTTAG CTTTTCAATG TAGATTGGTG TTATATTTTT GATATGATAA GAAGAGATGT 5280
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.ACATTAAAGT TAGATTTAAT CGCTGGTGAA GAAGGACTAT CGAAGCCAAT TAAAAATGCT 54 00 

GATATATCAA GACCGGGCTT AGAGATGGCA GGTTATTTTT CACATTATGC GTCAGATAGA 5460

ATACAACTAT TAGGAACAAC GGAACTATCG TTTTACAATT TATTACCAGA TAAGGATCGC 5520

GCAGGTCGTA TGCGTAAACT ATGCAGACCA GAAACGCCTG CAATTATTGT GACACGTGGA 5580 

TTGCAGCCAC CAGAAGAATT AGTTGAAGCT GCAAAAGAAT TAAATACCCC ACTTATAGTT 5640

GCTAAAGATG CGACTACAAG TTTAATGAGT CGCTTAACAA CGTTTTTAGA GCATGCACTT 5700 

GCAAAGACGA CATCTTTACA TGGTGTTTTA GTAGATGTTT ACGGTGTTGG TGTACTAATT 5760 

ACCGGTGATT CAGGAATAGG TAAAAGTGAG ACTGCGTTGG AATTAGTTAA ACGTGGGCAT 5820 

AGATTAGTAG CAGATGATAA TGTAGAAATA CGTCAAATTA ATAAAGATGA ACTAATAGGG 5880 

AAACCACCAA AGTTAATAGA ACATCTATTA GAAATACGTG GACTAGGTAT TATCAATGTT 5940

ATGACTTTAT TTGGCGCGGG TTCAATATTA ACTGAAAAAC GAATTAGATT AAATATTAAT 6000

TTGGAAAACT GGAACAAGCA AAAGTTATAT GACCGCGTAG GTCTTAATGA AGAGACGCTA 6060 

AGTATTTTAG ATACTGAAAT CACTAAAAAA ACAATACCTG TAAGACCTGG TAGAAATGTT 6120 

GCGGTAATTA TTGAGGTCGC TGCAATGAAC TATCGATTAA ATATCATGGG CATTAACACG 6180

GCCGAAGAAT TTAGTGAAAG ATTAAATGAA GAAATTATCA AGAACAGTCA TAAGAGTGAG 6240 

GAGTAGGTTG AATGGGTATT GTATTTAACT ATATAGATCC TGTGGCATTT AACTTAGGAC 6300

CACTGAGTGT ACGATGGTAT GGAATTATCA TTGCTGTCGG AATATTACTT GGTTACTTTG 6360

TTgCACAACG TGCACTAGTT AAAGCAGGAT TACATAAAGA TACTTTAGTA GATATTATTT 6420

TTTATAGTGC ACTATTTGGA TTTATCGCGG CACGAATCTA TTTTGTGATT TTCCAATGGC 6480 

CATATTACGC GGAAAATCCA AGTGAAATTA TTAAAATATG GCATGGTGGA ATAGCAATAC 6540

ATGGTGGTTT AATAGGTGGC TTTATTGCTG GTGTTATTGT ATGTAAAGTG AAAAATTTAA 6600 

ACCCATTTCA AATTGGTGAT ATCGTTGCGC CAAGTATAAT TTTAGCGCAA GGAATTGGAC 6660 

GCTGGGGTAA CTTTATGAAT CACGAGGCAC ATGGTGGATC GGTGTCACGC GCTTTTTTAG 6720 

AACAATTACA TTTGCCTAAT TTTATAATAG AAAATATGTA TATTAACGGC CAATATTATC 6780 

ATCCAACATT CTTATATGAA TCCATTTGGG ATGTCGCTGG ATTTATTATC TTAGTTAATA 6840

TTCGTAAACA TTTAAAATTA GGAGAAACAT TCTTTTTATA TTTAACTTGG TATTCAATTG 6900 

GTCGATTCTT TATAGAAGGA TTACGTACAG ATAGCTTAAT GCTCACAAGT AATATTAGAG 6960 

TTGCACAATT AGTATCAATT CTTTTAATTT TAATAAGTAT AAGTTTAATT GTATATAGAA 7020

GGATTAAGTA TAATCCACCG TTGTATAGCA AAGTTGGGGC GCTTCCATGG CCAACAAAAA 7080 
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TTATGGCGTG TATACCGTCT TGTTAAATTT TCGAAAGTTT TTAAGAATGT AATTATCATT 7200 

GAATTTTCGA AATTTATTCC AAGTATGGTA CTGAAAAGAC ATATATATAA ACAACTTTTA 7260 

AATATTAATA TCGGTAATCA ATCGTCGATA GCTTATAAAG TAATGTTAGA TATTTTTTAC 7320

CCAGAACTGA TTACGATTGG TAGTAACAGT GTTATTGGTT ACAATGTAAC AATTTTGACG 7380

CATGAAGCAT TAGTTGATGA ATTTCGTTAT GGACCAGTGA CGATAGGATC TAACACTTTG 7440

ATTGGTGCAA ATGCTACCAT TTTACCCGGT ATAACGATTG GTGACAATGT AAAAGTTGCA 7500 

GCTGGTACGG TTGTTTCAAA AGATATACCG GATAATGGAT TTGCATATGG CAACCCTATG 7560 

TATATAAAAA TGATTAGGAG GTGACAATTT TATGGCGCAA AAGAATAATA ATGTAATTCC 7620 

AATGACTTTT GATGATGCAT TTTATCGTAA AATGGCTAAA CAGAAGTTTA AACAAAGAGA 7680 

ATATAAACGA GCTGCTGAAT ACTTTGAAAA AGTGTTAGAA TTGTCACCTG ATGATCTGGA 7740 

AATTCAAATT GATTATGCAC AATGTCTAGT GCAACTTGGT ATTGCTAAAA AAGCAGAACA 7800

TTTATTTTAT GACAATATTA TTTATAATAG GCATCTAGAA GATAGCTTTT ATGAATTGAG 7860 

TCAGCTCAAC ATTGAAGTTA ACGAACCAAA CAAGGCATTC TTGTTTGGTA TTAATTATGT 7920 

TATTGTTAGC GACGACCAAG ATTATAGAGA TGAATTAGAT CAAATGTTTG ATGTGAAATA 7980

TCAAAGTGAA GAACAAATTG AACTTGAAGC TCAATTGTTT GTAGTTCAAA TACTATTCCA 8040

ATATCTTTTT TCTCAAGGTC GATTAAAAGA TGCAAAGAAT TATGTCTTAC ATCAACCACA 8100 

AGAAGTTCAA GATCATCGTG TAGTACGTAA TTTATTGGCA ATGTGTTATT TATATCTCGG 8160

TGAATATGAT ACgGCTAAAG CATTGTACGA aGCACtATTA CAAGAGGATA GTACaGATAT 8220 

ATATGCATTA TGCCATTATA CTTTGCTACT TTATAACACT AAGGAAAATG AACAATATCA 8280

AAAATATTTA AAAATATTAA ACAAAGTTGT ACCTATGAAT GACGATGAAA GTTTTAAATT 8340 

AGGTATTGTA TTAAGTTATT TAAAGCAGTA TCGTGCATCA CAACAATTGT TGTACCCTTT 8400

ATATAAAAAA GGGAAATTTT TATCAATTCA AATGTACAAT GCTTTAGCAT ATAATTATTA 8460 

TTATTTAGGT GAAGAAGACG AAAGTCATTA CTACTGGGAT AAATTGAAGC AAATTTCTAA 8520 

AGTGGAAATT GGACATGCGC CTTGGGTAAT TGAAAATAGC AAAGAAGTTT TTGACCAACA 8580

TATTTTGCCA TTACTTCAAA GTGATGACAG TCATTATCGT TTATATGGTA TTTTTTTATT 8640

GGATCAATTA AATGGTAAAG AAATTGTGAT GACGGAAAGT ATTTGGCAGG TTTTGGAAAA 8700 

TCTAAATAAT TATGAGAAAT TGTATTTAAC GTATTTAGTT CAAGGTTTAA CGCTCAATAA 8760 

ATTAGACTTC ATTCATCGCG GCTTATTAAC GCTTTACCAT AATGAATTAT TTGTAAGTGA 8820

AAATGATGTA ATGGTTGCAT GGATTAATCA AGGTGAACTC ATAATTGCTG AAAAAGTAGA 8880 
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TCGAAACGTT ACAAAGAAGC AAATTACAAC ATGGTTAGGC ATAACACAAT ATAAACTGAA 9000 

CAAAATGATT GAATTTCTCT TGAGCATATA GATTTATGAA AAGTTAGATT TATTATATAA 9060 

TGCGCATAAT GATTAATAAT GAGGAGGCGT TAATAAAATG ACTGAAATAG ATTTTGATAT 9120

AGCAATTATC GGTGCAGGTC CAGCTGGTAT GACTGCTGCA GTATACGCAT CACGTGCTAA 9180 

TTTAAAAACA GTTATGATTG AAAGAGGTAT TCCAGGCGGT CAAATGGCTA ATACAGAAGA 9240 

AGTAGAGAAC TTCCCTGGTT TCGAAATGAT TACAGGTCCA GATTTATCTA CAAAAATGTT 9300

TGAACACGCT AAAAAGTTTG GTGCAGTTTA TCAATATGGA GATATTAAAT CTGTAGAAGA 9360 

TAAAGGCGAA TATAAAGTGA TTAACTTTGG TAATAAAGAA TTAACAGCGA AAGCGGTTAT 9420 

TATTGCTACA GGTGCAGAAT ACAAGAAAAT TGGTGTTCCG GGTGAACAAG AACTTGGTGG 9480

ACGCGGTGTA AGTTATTGTG CAGTATGTGA TGGTGCATTC TTTAAAAATA AACGCCTATT 9540

CGTTATCGGT GGTGGTGATT CAGCAGTAGA AGAGGGAACA TTCTTAACTA AATTTGCTGA 9600 

CAAAGTAACA ATCGTTCACC GTCGTGATGA GTTACGTGCA CAGCGTATTT TACAAGATAG 9660 

AGCATTCAAA AATGATAAAA TCGACTTTAT TTGGAGTCAT ACTTTGAAAT CAATTAATGA 9720 

AAAAGACGGC AAAGTGGGTT CTGTGACATT AACGTCTACA AAAGATGGTT CAGAAGAAAC 9780

ACACGAGGCT GATGGTGTAT TCATCTATAT TGGTATGAAA CCATTAACAG CGCCATTTAA 9840

AGACTTAGGT ATTACAAATG ATGTTGGTTA TATTGTAACA AAAGATGATA TGACAACATC 9900

AGTACCAGGT ATTTTTGCAG CAGGAGATGT TCGCGACAAA GGTTTACGCC AAATTGTCAC 9960

TGCTACTGGC GATGGTAGTA TTGCAGCGCA AAGTGCAGCG GAATATATTG AACATTTAAA 10020 

CGATCAAGCT TAATTCGAAG TCGAATTAAG ATGTTGAGCT GTAAATTATT TGGATATTTA 10080 

TTTTAATAGT GTCATCACAG CGTTAAAATA ATGTCTTACT TTTAAATTAA AGCAAATTAT 10140

ATAGfiAAACT AGAACTTAGT ACGTATCATT TGTGCGTTTC AATGAGTTCT AGTTTTTTTA 10200 

TATGtTATAT TAAACTTATA ACTTTATGGG AGTGGGACAG AAATGATAAA GAGCCACTAA 10260 

TGATTTATTA TGTAGTGGTT CTTAAACATT AGCCACAGCT AATGTGTACT TAAAAATAGG 10320 

AATACATGAG TAAAACTCAT GCATAAGAAA TACTAATTTC TATAGAAAAA GTATTACTTT 10380 

ATCGTTGTCC CACCCCAACT TGCACATTAT TGTAAGCTGA CTTTCCGCCA GCTTCTGTGT 10440 

TGGGGCCCCG CCAACTTGCA CATTATTGTA AGCTGACTTT TCGTCAgCTT CTGTGTTGGG 10500 

GCCCCGCCAA CTTGCACATT ATTGTAAGCT GACTTTTCGT CAGCTTCTGT GTTGGGGCCC 10560

CGCCAACTTG CATTGTCTGT AGAAATTGGG AATCCAATTT CTCTATGTTG GGGCCCACAC 10620

CCCAACTCGC ATTGCCTGTA GAATTTCTTT TCGAAATTCT CTGTGTTGGG GCCCACACCC 10680
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ACTCGCATTG CCTGTAGAAT TTCTTTTCGA 


AATTCTCTGT GTTGGGGCCC CTGACTAGAG 10800

TTGAAAAAAG CTTGTTGCAA GCGCATTTTC ATTCAGTCAA CTACTAGCAA TATAATATTA 10860

TAGACCCTAG GACATTGATT TATGTCCCAA GCTCCTTTTA AATGATGTAT ATTTTTAGAA 10920

ATTTAATCTA GACATAGTTG GAAATAAATA TAAAACATCG TTGCTTAATT TTGTCATAGA 10980

ACATTTAAAT TAACATCATG AAATTCGTTT TGGCGGTGAA AAAATAATGG ATAATAATGA 11040

AAAAGAAAAA AGTAAAAGTG AACTATTAGT TGTAACAGGT TTATCTGGCG CAGGTAAATC 11100

TTTGGTTATT CAATGTTTAG AAGACATGGG ATATTTTTGT GTAGATAATC TACCACCAGT 11160

GTTATTGCCT AAATTTGTAG AGTTGATGGA ACAAGGAAAT CCATCCTTAA GAAAAGTGGC 11220

AATTGCAATT GATTTAAGAG GTAAGGAACT ATTTAATTCA TTAGTTGCAG TAGTGGATAA 11280

AGTCAAAAGT GAAAGTGACG TCATCATTGA TGTTATGTTT TTAGAAGCAA GTACTGAAAA 11340

ATTAATTTCA AGATATAAGG AAACGCGTCG TGCACATCCT TTGATGGAAC AAGGTAAAAG 11400

ATCGTTAATC AATGCAATTA ATGATGAGCG AGAGCATTTG TCTCAAATTA GAAGTATAGC 11460

TAATTTTGTT ATAGATACTA CAAAGTTATC ACCTAAAGAA TTAAAAGAAC GCATTCGTCG 11520

ATACTATGAA GATGAAGAGT TTGAAACTTT TACAATTAAT GTCACAAGTT TCGGTTTTAA 11580

ACATGGGATT CAGATGGATG CAGATTTAGT ATTTGATGTA CGATTTTTAC CAAATCCATA 11640

TTATGTAGTA GATTTAAGAC CTTTAACAGG ATTAGATAAA GACGTTTATA ATTATGTTAT 11700

GAAATGGAAA GAGACGGAGA TTTTCTTTGA AAAATTAACT GATTTGTTAG ATTTTATGAT 11760

ACCCGGGTAT AAAAAAGAAG GGAAATCTCA ATTAGTAATT GCCATOGGTT GTACGGGTGG 11820

ACAACATCGA TCTGTAGCAT TAGCAGAACG ACTAGGTAAT TATCTAAATG AAGTATTTGA 11880

ATATAATGTT TATGTGCATC ATAGGGACGC ACATATTGAA AGTGGCGAGA AAAAATGAGA 11940

CAAATAAAAG TTGTACTTAT CGGTGGTGGC ACTGGCTTAT CAGTTATGGC TAGGGGATTA 12000

AGAGAATTCC CAATTGATAT TACGGCGATT GTAACAGTTG CTGATAATGG TGGGAGTACA 12060

GGGAAAATCa GAGATGAAAT GGATATACCA GCACCAGGAG ACATCAGAAA TGTGATTGCA 12120

GCTTTAAGTG ATTCTGAGTC AGTTTTAAGC CAACTTTTTC AGTATCGCTT TGAAGAAAAT 12180

CAAATTAGCG GTCACTCATT AGGTAATTTA TTAATCGCAG GTATGACTAA TATTACGAAT 12240

GATTTCGGAC ATGCCATTAA AGCATTAAGT AAAATTTTAA ATATTAAAGG TAGAGTCATT 12300

CCATCTACAA ATACAAGTGT GCAATTAAAT GCTGTTATGG AAGATGGAGA AATTGTTTTT 12360

GGAGAAACAA ATATTCCTAA AAAACATAAA AAAATTGATC GTGTGTTTTT AGAACCTAAC 12420

GATGTGCAAC CAATGGAAGA AGCAATCGAT GCTTTAAGGG AAGCAGATTT AATCGTTCTT 12480






EP0 786 519 A2 






GCGTTAATTC ATTCTGATGC GCCTAAGCTA TATGTTTCTA ATGTGATGAC GCAACCTGGG 12600

GAAACAGATG GTTATAGCGT GAAAGATyAT ATCGATGCGA TTCATAGACA AGCTGGACAA 12660

CCGTTTATTG ATTATGTCAT TTGTAGTACA CAAACTTTCA ATGCTCAAGT TTTGAAAAAA 12720

TATGAAGAAA AACATTCTAA ACCAGTTGAA GTTAATAAGG CTGAACTTGA AAAAGAAAGC 12780

ATAAATGTAA AAACATCTTC AAATTTAGTT GAAATTTCTG AAAATCATTT AGTAAGACAT 12840 

AATACTAAAG TGTTATCGAC AATGATTTAT GACATAGCTT TAGAATTAAT TAGTACTATT 12900 

CCTTTCGTAC CAAGTGATAA ACGTnAATAA TATAGAACGT AATCATATTA TGATATGATA 12960 

ATAGAGCTGT GAAAAAAATG AAnATAGACA GTGGTTCTAA GGTGAATCAT GTTTTAAATA 13020

AGAAAGGAAT GACTGTACGA TGAGCTTTGC ATCAGAAATQ AAAAATGAAT TAACTAGAAT 13080 

AGACGT 13086 
(2) INFORMATION FOR SEQ ID NO: 17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1350 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 17: 

CATTAGTCAT GAAAATAGCC GACAACTTCA TCTGTGAAAT CACCGGCCTT TTATTTTAGC 60 

TAACTTTATT TCTGATTTTA CGATTTTAAT TGATCATACA GAGAAAGTGA TCTTTTTACA 120 

ATTTCTAAAA ACTCATGATC TATATTGGAC ATTTGATGAA AATAAGACAA AATGTTTTCT 180

GTTAGCTTCT CTTGTTTTGG GAATGAATCA TCTTCTTTAA TCCAAATCGC TAATTCGCCT 240

AATGGTGTTT TATCATCTTT AAATGTTTGT ATATATTCGT AAAAGCTCAT AGTATTCCTT 300 

CTCTCAATTT ACTTATATAA ATCCTACCAC GAAAGCTTTC AAGAAAACAC AATTAAATGT 360 

CTATTTAGTG AACTTTTTAA GGTTGTGCAC TCTTTTAATG TCTGCCAATT AGGTCAATTA 420

ATCATCACAA TGTACAATTA ACTCTATTTT CAGTTCATAT ACTCACACAC CGTTTTTGAA 480 

CAACACATTA ACTTCTCATT TAGATAAAAC GCAAAAAAGC CTGGCACCAA TACAATAGAT 540

GCCAGACTAA GAGTCTACTA TATAAATTTA TTTAGCGTAT GGTTTTACTT CGATTGCACC 600 

TTCATTTTCA TCATGAACAC CATGCTTATA ATAATCAATA TATTGTGGCT CTAAAGGCTT 660 

TCTGCCACGT ATAATGTCTG CTGCTTTTTC AGCTAACATT AAAACAGGTG CGTGTATATT 720 

GCCATTTGTC GTACGTGGCA TAGCTGATGC ATCAACTACA CGTAAATTTT CCATACCGTG 780 
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ACTACAAGAT GGGTGTAATG CTGTTTCACC ATCTCTACGA ACCCAATCAA GAATTTCTTC 900 

GTCTGTTTGC ACTTCTGGTC CTGGTGAAAT TTCTCCACCA TTGAATGGAT CCATTGCTTT 960 

TTGAGATAAG ATATTTCTTG CTACACGAAT TGCTTCTACC CATTCTTTTT TATCTTCTTC 1020 

TGTTGATAAA TAATTAAAGC GGATACTTGG TTTTTCGAAT GGATCTTTAG ATTTGATTTT 1080 

CAAGCTACCA CGAGAGTTTG AATACATTGG TCCTACGTGA ACTTGATAAC CATGTGCGAC 1140

CGCTGCCTTT TGACCATCAT ATCTTACAGC TATTGGTAAG AAATGGAACA TTAAGTTAGG 1200 

ATAAtCAACT TCGTTATTTG AACGTACAAA TCCGCCACCT TCAAAATGGT TAGATGCTGC 1260 

TGCACCTGTA CGTGTGAAAA TCCATTGTAA ACCAATAAAT GGcATGCGCT TGAtATCTAA 1320

GCTTGGCtGt AATGATACAG GTTCCTTACA 1350 
(2) INFORMATION FOR SEQ ID NO: 18: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 18: 





TAATGCTATT GGCAACACCA TATATGAAAn CTCCAAACGA TCCTAAACCG ACTATAGATT 60

CACCAAATTT nACAATCCAT GAATAAAGTA GTGGCCATAA GAATAACAAT ATGACAACTA 120

AAAATGTACA GTAAAATGCA GTCATAATTG GAACTAGACG TTTACCACTA AAAAATGATA 180

ATGCTAATGG TAATTCTGTT TCACTAAACT TATTGTATGC ATAAGCTGCT ATTAAACCTA 240

TTACAATACC AACAAAGACA TTGCCATTAT TCATCTTTTC AAAAGCTGAA TTTATTTCCG 300

ArGCTTTCAT TCCTAATAAA GGCGCTAATT TCATTGGTGA TAATACAACT GTAACTAAAA 360

AATATCCTAA CGTrGCTGCA rGCGsGACTG CACCATCATT TTTCTTTGCC ATTCCTATAG 420

CTACACCAAT TGCAAATAAA ATACCTAATT GCTCTAAAAT CGTAGTACCT ACCGTAGTAA 480

AGAACATTGC GATTTTCGGC GTCGCATGAA GTGCATTTAA CGTATTACCA ATTCCGGCAA 540

TAATTGCTGC AGCCGGTAAA ATGGCAACTG GTAACATTAA CGAACGCCCT AAATTTTGGA 600

AAAATTTATA CATTGAATGT CATCCTTCTT AAAATAATGT AGAAATATAA AGATTACTAA 660

TGTAACTAGA ATAACTACTT CGATACTCCG TTATAGTCAC CTAGGCTTAC TAACCAGCTA 720

TATTTCTACC TCAAGTTATT TTATAAACTT TTTACAATTT CATGCAATTC TTGTTGTAAC 780

TTTGCTGTTC GTGTTTCAAT CTCTTTTGTA ATATAATCGA TACGCTCGTT TCGTTTTAAA 840
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AAAGACCGTG AATCTTAGTA GGACCAACAT AAGCAACAGG TAATATTGGT 


GACTTACTTA 960

ACATTGCAAT TGTTGAAGCA CCaCGTTTCA AAGGTGCACC TTCTTGCGAT GTGCGAGAAC 1020

CTGTTGGGAA GATACCAACT GTCTTATTAT CTTTCAACAA ATTGATTGGG CGTTTTAAAG 1080

TACTAGGTCC TGGATTTTCA CGATCTACAG GAAATGCATT TAAAGACGTT AAAAATTTAC 1140

CAATCCATTT ATTTTTGAAT AATTCTTTTT TAGCCATATA ATGAATTTGA TTAGGATATA 1200

ATGCCATACC TAGCATAATG ACTTCGTTAT AACTTTCATG CGTACAAGTT ACGACATATT 1260

TACTATCCTT AGGAATATTA TCTTTACCGA TTACGTATAA TGATTTTGAC ATTTTAACTA 1320

AAATGAAATT CAAAATCTTA CTAATCACTG AATACATTGT GCCACCTACT TAACTT 1376


(2) INFORMATION FOR SEQ ID NO: 19: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7363 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 19:






TTGTCATACC AATATTTTGT AAAATATGGA ACACAAGTAA AGTGACGAAA CCAACGATAA 60

AGATTTTGTT AAATTGATCT TCAATTTTCG CAGCTAATCT TATTAGATGG AAGATTAAAA 120

ATAAAAATAT TAAGATCAAT ATGACAGAAC CGATAAAGCC AAGTTCCTCT CCAATCACTG 180

AAAAGATAAA GTCAGTATGA TTTTCAGGTA TATAAACTTC ACCGTGATTG TATCCTTTAC 240

CTAGTAACTG TCCAGAACCG ATAGCTTTAA GTGATTCAGT TAAATGaTAG CCATCACCAC 300

TACTATATGT ATAGGGGTCA AGCCATGAAT TGATTCGTCC CATTTGATAC AGTTGGaCAC 360

CTAAJAAATT TTCAATTAAT GCGGGTGCAT ATAGaATACC TAAAATGACT GTCATTGCAC 420

CAACaATACC TGTAATAAAG ATAGGTGCTA AGATACGCCA TGTTATACCA CTTACTAACA 480

TCACACCTGC AATAATAGCA GCTAATACTA ATGTAGTTCC TAGGTCATTT TGCAGTAATA 540

TTAAAATACT TGGTACTAAC GAGACACCAA TAATTTTGAA AAATAATAAC AAATCACTTT 600

GGAATGATTT ATTGAATGTG AATTGATTAT GTCTAGAAAC GACACGCGCT AATGCTAAAA 660

TTAAAATAAT TTTCATGAAT TCAGATGGCT GAATACTGAT AGGGCCAAAC GTGTACCAAC 720

TTTTGGCACC ATTGATAATA GGTGTAATAG GTGACTCAGG AATAACGAGC AAGCCTATTA 780

ATAATAGACA GATTAAGAAA TACAATAAAT ATGTATAATG TTTAATCTTT TTAGGTGAAA 840

TAAACATGAT GATACCTGCA AAAATTGCAC CTAAAATGTA ATAAAAAATT TGTCTGATAC 900








TTGCTAAAAC AGCTATAGTG GCTACTAATA CCCAGTCTAC TTTGCGAAnC aATGCTTATC 1020

CGGCTGTTGA CGAGATGAAT AATTCATTGC AAACTCCTTT TATACTCACT AATGTTTATA 1080

TCAATTTTAC ATGACTTTTT AAAAATTAGC TAGAATATCA CAGTGATATC AGCTATAGAT 1140

TTCAATTTGA ATTAGGAATA AAATAGAAGG GAATATTGTT CTGATTATAA ATGAATCAAC 1200

ATAGATACAG ACACATAAGT CCTCGTTTTT AAAATGCAAA ATAGCATTAA AATGTGATAC 1260

TATTAAGATT CAAAGATGCG AATAAATCAA TTAACAATAG GACyAAATCA ATATTAATTT 1320

ATATTAAGGT AGCAAACCCT GATATATCAT TGGAGGAAAA CGAAATGACA AAAGAAAATA 1380

TTTGTATCGT TTTTGGAGGG AAAAGTGCAG AACACGAAGT ATCGATTCTG ACAGCACAAA 1440

ATGTATTAAA TGCAATAGAT AAAGACAAAT ATCATGTTGA TATCATTTAT ATTACCAATG 1500

ATGGTGATTG GAGAAAGCAA AATAATATTA CAGCTGAAAT TAAATCTACT GATGAGCTTC 1560

ATTTAGAAAA TGGAGAGGCG CTTGAGATTT CACAGCTATT GAAAGAAAGT AGTTCAGGAC 1620

AACCATACGA TGCAGTATTC CCATTATTAC ATGGTCCTAA TGGTGAAGAT GGCACGATTC 1680

AAGGGCTTTT TGAAGTTTTG GATGTACCAT ATGTAGGAAA TGGTGTATTG TCAGCTGCAA 1740

GTTCTATGGA CAAACTTGTA ATGAAACAAT TATTTGAACA TCGAGGGTTA CCACAGTTAC 1800

CTTATATTAG TTTCTTACGT TCTGAATATG AAAAATATGA ACATAACATT TTAAAATTAG 1860

TAAATGATAA ATTAAATTAC CCAGTCTTTG TTAAACCTGC TAACTTAGGG TCAAGTGTAG 1920

GTATCAGTAA ATGTAATAAT GAAGCGGAAC TTAAAGAAGG TATTAAAGAA GCATTCCAAT 1980

TTGACCGTAA GCTTGTTATA GAACAAGGCG TTAACGCACG TGAAATTGAA GTAGCAGTTT 2040

TAGGAAATGA CTATCCTGAA GCGACATGGC CAGGTGAAGT CGTAAAAGAT GTCGCGTTTT 2100

ACGATTACAA ATCAAAATAT AAAGATGGTA AGGTTGAATT ACAAATTCCA GCTGACTTAG 2160

ACGAAGATGT TCAATTAACG CTTAGAAATA TGGCATTAGA GGCATTCAAA GCGACAGATT 2220

GTTCTGGTTT AGTCCGTGCT GATTTCTTTG TAACAGAAGA CAACCAAATA TATATTAATG 2280

AAACAAATGC AATGCCTGGA TTTACGGCTT TCAGTATGTA TCCAAAGTTA TGGGAAAATA 2340

TGGGCTTATC TTATCCAGAA TTGATTACAA AACTTATCGA GCTTGCTAAA GAACGTCACC 2400

AGGATAAACA GAAAAATAAA TACAAAATTG ACTAACTGAG GTTGTTATTA TGATTAATGT 2460

TACATTAAAG CAAATTCAAT CATGGATTCC TTGTGAAATT GAAGATCAAT TTTTAAATCA 2520

AGAGATAAAT GGAGTCACAA TTGATTCACG AGCAATTTCT AAAAATATGT TATTTATACC 2580

ATTTAAAGGT GAAAATGTTG ACGGTCATCG CTTTGTCTCT AAAGCATTAC AAGATGGTGC 2640

TGGGGCTGCT TTTTATCAAA GAGGGACACC TATAGATGAA AATGTAAGCG GGCCTATTAT 2700
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AAACCCTAAA GTAATTGCCG TCACAGGGTC TAATGGTAAA ACAACGACTA AAGATATGAT 2820 

TGAAAGTGTA TTGCATACCG AATTTAAAGT TAAGAAAACG CAAGGTAATT ACAATAATGA 2880 

AATTGGTTTA CCTTTAACTA TTTTGGAATT AGATAATGAT ACTGAAATAT CAATATTGGA 2940 

GATGGGGATG TCAGGTTTCC ATGAAATTGA ATTTCTGTCA AACCTCGCTC AACCAGATAT 3000 

TGCAGTTATA ACTAATA7TG GTGAGTCACA TATGCAAGAT TTAGGTTCGC GCGAGGGGAT 3060

TGCTAAAGCT AAATCTGAAA TTACAATAGG TCTAAAAGAT AATGGTACGT TTATATATGA 3120 

TGGCGATGAA CCATTATTGA AACCACATGT TAAAGAAGTT GAAAATGCAA AATGTATTAG 3180

TATTGGTGTT GCTACTGATA ATGCATTAGT TTGTTCTGTT GATGATAGAG ATACTACAGG 3240 

TATTTCATTT ACGATTAATA ATAAAGAACA TTACGATCTG CCAATATTAG GAAAGCATAA 3300 

TATGAAAAAT GCGACGATTG CCATTGCGGT TGGTCATGAA TTAGGTTTGA CATATAACAC 3360 

AATCTATCAA AATTTAAAAA ATGTCAGCTT AACTGGTATG CGTATGGAAC AACATACATT 3420

AGAAAATGAT ATTACTGTGA TAAATGATGC CTATAATGCA AGTCCTACAA GTATGAGAGC 3480 

AGCTATTGAT ACACTGAGTA CTTTGACAGG GCGTCGCATT CTAATTTTAG GAGATGTTTT 3540 

AGAATTAGGT GAAAATAGCA AAGAAATGCA TATCGGTGTA GGTAATTATT TAGAAGAAAA 3600

GCATATAGAT GTGTTGTATA CGTTTGGTAA TGAAGCGAAG TATATTTATG ATTCGGGCCA 3660 

GCAACATGTC GAAAAAGCAC AACACTTCAA TTCTAAAGAC GATATGATAG AAGTTTTAAT 3720 

AAACGATTTA AAAGCGCATG ACCGTGTATT AGTTAAAGGA TCACGTGGTA TGAAATTAGA 3780

AGAAGTGGTA AATGCTTTAA TTTCATAGAG ATTAGTCGAG GGACCTTTTA CTTATAAAAA 3840

TGATTTGAAT TAATACTAAA AGATTACAAA GAAGAGGTGG TTTTGTGTGT AAATACAAAA 3900

TTGCCTTTTT CTTTTTATGT TAAATCTATA AATTTGAAAC TAAATCAAGG TTAATTCTAT 3960 

GTACACACTT TATATAGGAA GTAGTTTGAA TGTTTATATA ATGTTTTACA AAAAGATGTA 4020

GTATTATAAT GTCTAATTTC ACATGTGTTT CAGTAAAATT TGTTGTGGAA TGTTAACGAT 4080 

ATACGTATTT TATAAAAaAT TTTTTATAAT GATTATTCGA ATGATGCGTA ACGCTTACAT 4140

CTTATCTAAT GCTAGCTTTT TGACAAAAAT ATGACAATCA ATTAATGTGA TTCTAATAAA 4200 

TATTCGCAAA TTGCTTTATT GCGATTAAAT TTTTTTGGTG GTACTATATA GAAGTTGATG 4260

AAATATTAAT GAACTTATAT GCAAAAGTAT ATTGAGAAAT AAACAGGTAA AAAGGAGAAT 4320 

TATTTTGCAA AATTTTAAAG AACTAGGGAT TTCGGATAAT ACGGTTCAGT CACTTGAATC 4380

AATGGGATTT AAAGAGCCGA CACCTATCCA AAAAGACAGT ATCCCTTATG CGTTACAAGG 4440

AATTGATATC CTTGGGCAAG CTCAAACCGG TACAGGTAAA ACAGGAGCAT TCGGTATTCC 4500 
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AGAATTGGCA ATGCAGGTAG CTGAACAATT AAGAGAATTT 


AGCCGTGGAC AAGGTGTCCA 4620

AGTTGTTACT GTATTCGGTG GTATGCCTAT CGAACGCCAA ATTAAAGCCT TGAAAAAAGG 4680

CCCACAAATC GTAGTCGGAA CACCTGGGCG TGTTATCGAC CATTTAAATC GTCGCACATT 4740

AAAAACGGAC GGAATTCATA CTTTGATTTT AGATGAAGCT GATGAAATGA TGAATATGGG 4800

ATTCATCGAT GATATGAGAT TTATTATGGA TAAAATTCCA GCAGTACAAC GTCAAACAAT 4860

GTTGTTCTCA GCTACAATGC CTAAAGCAAT CCAAGCTTTA GTACAACAAT TTATGAAATC 4920

ACCAAAAATC ATTAAGACAA TGAATAATGA AATGTCTGAT CCACAAATCG AAGAATTCTA 4980

TACAATTGTT AAAGAATTAG AGAAATTTGA TACATTTACA AATTTCCTAG ATGTTCATCA 5040

ACCTGAATTA GCAATCGTAT TCGGACGTAC AAAACGTCGT GTTGATGAAT TAACAAGTGC 5100

TTTGATTTCT AAAGGATATA AAGCTGAAGG TTTACATGGT GATATTACAC AAGCGAAACg 5160

TTtAGAAGTA TTanAGAAAT TTAAAAATGA CCAAATTAAT ATTTTAGTCG CTACTGATGT 5220

AGCAGCaAGA GGACTAGATA TTTCTGGTGT GAGTCATGTT TATAACTTTG ATATACCTCA 5280

AGATACTGAA AGCTATACAC ACCGTATTGG TCGTACGGGT CGTGCTGGTA AAGAAGGTAT 5340

CGCTGTAACG TTTGTTAATC CAATCGAAAT GGATTATATC AGACAAATTG AAGATGCAAA 5400

CGGTAGAAAA ATGAGTGCAy TcGTCCACCA CATCGTAAAG AAGTACTTCA AGCACGTGAA 5460

GATGACATCA AAGAAAAAGT TGAAAACTGG ATGTCTAAAG AGTCAGAATC ACGCTTGAAA 5520

CGCATTTCTA CAGAGTTGTT AAATGAATAT AACGATGTTG ATTTAGTTGC TGCACTTTTA 5580

CAAGAGTTAG TAGAAGCAAA CGATGAAGTT GAAGTTCAAT TAACTTTTGA AAAACCATTA 5640

TCTCGCAAAG GCCGTAACGG TAAACCAAGT GGTTCTCGTA ACAGAAATAG TAAGCGTGGT 5700

AATCCTAAAT TTGACAGTAA GAGTAAACGT TCAAAAGGAT ACTCAAGTAA GAAGAAAAGT 5760

ACAAAAAAAT TCGACCGTAA AGAGAAGAGC AGCGGTGGAA GCAGACCTAT GAAAGGTCGC 5820

ACATTTGCTG ACCATCAAAA ATAATTTATA GATTAAGAGC TTAAAGATGT AATGTCTTGA 5880

GCTCTTTTTT GTTTTCAATA ATTGATTCTC TGTAGATATC aAAGTaCTAA CGTTTTAAAG 5940

GTTAAATATT TAATTGGATT GAGATCTGTA TGCGGTTATA TCaTTCTGTG TAAATATGGT 6000

TCTCCACCAA ATGTGGTGAG TATATAATTT AAAGAACTAT TTTTAAATTA AGAATAATCG 6060

AACATAAATA AACTTTATGA AATTTCAGTA TCATGTTCTT ATAAAAAACA ATAGGGCTTT 6120

TTGctGACGC TAGTGCGCGA TAAATAATAA GTTGAATATA AAAAAGATCA CTGCCAATCA 6180

TTCGTTTAAT GGCAGCGATC TTTTTTATTT AATTATTTCT CTTTCCACTG CAACATTTGA 6240

TAACCAATGC GTGGATGTGT TTTAATAATA TCTTTTGCGT CCTCATGACA TTGTGAAAGT 6300
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CCATATATTC GTTTTAATAT CATCTCATAA GTGAGTACTT TTCCTTTATG ATTTGACAAT 6420 

AGTTCTAACA AGCTAAATTC ATTTGGCGTC AAATGTACCT CCTGATTATT AATAACAACA 6480

GATTTGGAGC CAAAGTCGAT GCTTAGCAAA CCGTTAGTAA ATACAATGTT AGTTTCTTGA 6540 

TGTGACTTAG CGATTCTCTC GATGACTCGT ATTCGTGCCC GAAGCTCATC AACATTAAAA 6600 

GGTTTAGTCA TATAGTCATT CGCACCGTTA TCTAAAGCTT GAATAATTGT TTGTTCTTCT 6660 

TGTCTTGCAC TTATTACAAT GATAGGAATG TCAGTATGTT GCCTGATTTC TGAAATCAAA 6720 

CATAATCCAT CTTTATCTGG TAAACCTAAA TCTAATAAAA TGACATCTGG TTTATCAATT 6780 

TGAATTTTAA AGTGTGCTTG TGTGGCATTG TCGGCTGTAG TTACATTGTA ATAATCTAAA 6840

GTTAATGCAA CATCAAGTAA ATGTGTGATT GCGTGATCAT CTTCAATTAT CAATATTTTA 6900 

GATTGCATTA TACGTCTCCT TCGTTAAAGT CTGTATATAT ATTGAAATAG AATATACTGC 6960 

CGTGTGGTTG GTTCGGTTTA TATTGTAAGT TTGATTGATG TTTGTGTAGG ATAGTCTGTA 7020

CTAAATATAA GCCTAGTCCC ATGCTTTCTT TTTGGTTATC TTTAAAATAT TTATTTGATC 7080 

CTGTGTAAAA AGGCTCGAAT ATCTTTTGTt GTTCTTCTAA ACTAATTCCA GGTCCTTCGT 7140

CTATAACGGC AAATTCGATT TGTTCATAGC TAGCATAACG AATAGATAAA TTGATTTTGG 7200

TGTCAGTAGA AGTGTGTTTA ACTGCATTTT CAATCAAATT GAAtAAAgCT TGTAAAATCA 7260 

ACTTACTGTC AATGTGTATA AACtGTAAAT TTACTGAGGA TGATACAGTT ATACGCTTTT 7320 

TTAAATGGCG ACGTTCTAAA ATACATATCG ATTTCTTATA CTA 7363 

(2) INFORMATION FOR SEQ ID NO: 20: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10470 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 20:



TTAACAATCG ATAACCACAA TACTTCTATT GTAATTGTTT AACGATTTCn CGATTAAAAT 60 

CATCTAAATC GTCTGGTACT CGACTTGTTA CAATATTGTT GTCTACAcTa CTGACTCATC 120 

AACTACATGT GCGCCTGCAT TTGATAAATC TTTGCGTACA TTTAATACTG CTGTTAACGT 180

ACGACCTTTT AAATCGTCTG TATCTATTAG TATTTGTGGC CCATGACAAA TGGCAAATGT 240

TGGTACATCA TTTTTAGTAA AGTATTTAGC AAATGTGCCA TATCGACCTT CTGTATCTCC 300

ACGTAAATGA TCTGGTGAAA ATCCTCCAGG AATTAATAAT GCATCATAAT CTTCTGGTTT 360 
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ATTTGCAGTA TCTCCAATCA 


CTACAGTATT AAAGCCTGCA TTCTCTAATG CCTCTTTAGG 480

GCTTGAATAT TCTATATCTT CAAATTCGTT TGCTAGAATA ATTGCTACTT TTTTAGTCAT 540

TGAAAATCAC CTTTCTATAT ATCA7TGATA TAATTACTAT AGACAAGTAA ATCAGTGATT 600

AAACATACAA GATATAAAAA ATATTAAGCG ACTGTCGCGA TATCTAACCC TAACACATCT 660

TATGTGGCAT TTACTTAGAT ACTAATTTAA CCTTTTCTTC AAGCTGATCT AACAATCCAA 720

TCCATTCATC TATATCTTCA ACACGTACTT CATCAGGATT TACATGATCG ATATCCTCAA 780

TAAACTTATT TAAACGCGCT TTTATCTGTT CGATTGTTTG CTGTTCATTC ATAAAAAGTT 840

AACTCCTTTT ATTTTGTTTT CTTTTTCATT ATTATCCTAA CAGAAATTGC GTTAAAGCGA 900

TATAATCTTA GCTATATTTA TGACATTCAA ATTATTTTGA CTTTTAAAAA TCCCCTTTTC 960

AATTAACTAA AATTAAGAGA TAATTTGTTA CGAGTGATAA TACGAaGkGG TaTCATACCG 1020

ATATGAACCA AATAGAAAGA AGGAAGTTTA AGACGATGAA TAGCGTCAAA TTGAAGCAAC 1080

CTGTTAGCAT TTACAATGAT CCATGGGAAG TGAAATTTAT ATACATTTAA ATTTCATGAG 1140

ACAATAAACG TTGATTTAAT GCGTTTTTTT GCCTTTTTTA TTTTCCTTAT TTTTTCTGTT 1200

TTACAACAAA ATGGTATCAA AAATGGTATC ATTTGTAGTT ATTTTAGCTT CACATATTAA 1260

AACAACCACA CTCCTAAATT AATAGGTGGT GTGGTTTTGT TGGTTGTGTG GGGATAAAAA 1320

TAACCGCATC AGTTAAGATG CGGTTATCTA GCAAGGGCCA CGTATTTATA AATACGTTTA 1380

GAATCTCTTC GGCAACTTTG CTATAGACAG TCTATGCTGT TACTAAATTA TACCACCACA 1440

CAAACCTACT CCCATTCAGG AACACAGAGC TTTGTCGCTC GTCAGCAACG TCATATGAAT 1500

TCTCAGTTCA TGTTGTGGTG ACACTTTAAA CGGTCTGTGC CAGTAGCGAC CGAGTCATTT 1560

CAAGAATGAC CATTTCACAT TTATATTATA ACACTTGTCG TGCGTAACTG TATAGTTTTT 1620

CAGft'GTATT TAAAGTTAAG TTATCTACTT CGCGCTTTCC TTGCCTTAAT TGTGAAATTA 1680

CATATTGCGC TACGCCAGTT TGTTTGTGAA TTTGGTAACC TGTTATATCA CTTTTGATCA 1740

ATTCAATTAT TTTTAATTTA TAATCACTCA TATTATCTAC GTCCATTCTT TTTATCTAAA 1800

CAATAAAAAT GTGTCTTTCT CCCGATAAAT AATAACAATG GTAGGCTTAA TAAAAACAAT 1860

ATTAAATACA TTTGTTCTGT CATAATTGAA AACCTCCAAA TAATATTATA TTATATAAGT 1920

GTAAGGAGGA GCCATCAGGC TCCAAGCATA ATGTTAATCT GCTTTCGGTC 1980

TAGGTAGCCG AGATGCCaTT CTCTAAGTTG TTTTAACACT TCTGGAATTA TCAGTACTGC 2040

CAATACTTGA TGTTCTAGAA GTGTTTTTAT TATGTCTAGC ATGAGGCTTT TCACCTCCTT 2100

ACACATAATT TGTAAGTCAT CAACTAACCT ACAAATATAA TTATACTAAA CAAATGTTTA 2160






GTTATCTACA TTTAAATCTT GAGAGAAATG TTAAAAAGTT CTAGTAAAAT AATAGCACAT 2280 

TTTATCTTTA AATGTAAATA GAAAGCAGGT ATGTAACGCA CCTGCTTAAA TAGaCATGAC 2340 

TATGTCATTC TAACTGATTT CTCCCCATAA GTCACCTAAT ATCTGATTAG GTGGGGCAGA 2400 

ACCATTCCAT GTTCTAATAG GCAAGTAATA ACGTTGCCCC TCCCATGTAT ATCCTACCCA 2460 

AACATGACCA TCTTGTAACA TCACTTCTGT ATAATCACAA TACCCACCAG GTTGGAACTG 2520 

ATAACCCACT GGACAAGATA AGAATGGCCC CACTTTTCTT ACTGTGATTG GTTGATTGCC 2580 

GTTTGTGAAT CTAGCACTTT CTTCCATGTA GTAAGTACCA TATTTATTAC GTTTCCATGC 2640 

ACTTGCAACT GGTTTAACTG TATTACTTGA AGCGCTTGAC TCATTAGAGA CAGTGGCAAC 2700 

CGGTATTTTA CCATCCATGT ACGCCCTAAT CTGCTTGATA AAGTAGTCTT TAAGTTGCAA 2760 

CCGCTTGTCT TCTGGCAATA GACCGCGAGT TACTGGGTCA AAACCAGTGT GTAAAACCGA 2820

ACTTCTATGA GGGCATGATG TTGAAGTAAA TTCATTGTGC AATCTGATTG TATTTCTGTT 2880

TGCTGGTAAT CCCCATTTTT TCAACAATCT AGCGCATTCT TGGAAAGTTG CCTGTTCATT 2940 

TTTTAAGAAT GTCGCGTTAT CTGCGCCCAT TGATTGACAT ACTTCAATAC CGTAATAATA 3000 

TTTATTACCT ATTTGATTAG CGGTATGCCA ACCTACTTGT GATTCATCTA AGGCTTGCCA 3060

AACTGTGTTG CCTGATACGT AACTATGCGC AATGCCCGCT TCTAATCTTG ATAAAGGTGC 3120 

ATTTACTAAT CCGTTACGAT ATGCTTCAGC AGTCGCCCCT TTGCTCCCTG CGTCGTTGTG 3180 

TATAACTATA CCTTTAGGGT TACTACCACG CTTAGGTAGG TCATAACCTT TAACCACATC 3240

TTTGATGATT TTAAGTTCTA CTGCTTTAGG TTGTGGCTTA GCTGTTTCTT TTTTAGGTGC 3300 

TTGTGTAGGA GATTGAACTG ATCGTGGCGC TGTCTCACTT TTAAAATTCG GACGGATAAA 336 0 

35 

CCACATAGGG AAATCATAAG CATGTTGTCG TCTTGTAACT TTTTCCCAAC CCCAGCCGGG 3420 

TTGTTCGATT CCGTCAGTCC AGCCACCGCC TAGCCAATTC TGCTCATATA CAATGATGTA 34 80 

ATCTAAAGTT GCTTCAATTA CCCATGCAAC GTGACCATAT CCAGCACCGT AGTTGCTACC 354 0 

40 

GAATACCACC ATGTCGCCAG GTTGTGCTAA GAAGTCCGGT GTATTTTGGT ATACAGTAGC 36 00 

TAATCCGTCG AAGTTGTTAG CGAACGGAAT ATCTTTTGCA CCTAAACCTT TTAGAAGTAA 3660 

45 TCCAAACAAA ACTTTCCAAC CAGCATTGGC ATAATCAAAG CATTGAAATC CATACCATAA 3720 

GTCCACATTG AATTGTTTTC CCTCAGAAGT TTTCAACCAC TCTATAAACT CATTTTTAGT 37 80 

TAATTTTGCT TGCATTGTCG CCACCTCCAT GATGATACTC ATTCACATCA AAGCCAACAT 3 84 0 

50 CGTTAGAGGC GTCTGTGAAA GGTTGTGATG TATCATATTC TTTTGGTGcT TTCGCGCTTA 3 900 

ATTCCGGCGT TAAACTACTG TCTTGTGATG ATTTCCACGT AACTTGTTGT TCTTCTTTTT 3960 
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TTGGGTCAGT AATAACGCCA ATACCTGTAA 
TAGCTTGATT TAATTGAGTA GATAAATCTA 
5 * ATAGCAACAA TGCTCCAACT AAACCAGTTA 
AGTTAATATC CATTTGTTTG CTCCTTTTAT 
GTTTAAAATT ATTCAATGGT CAATGTCGGA 

10 

TACAACATCC CTGAAGGATT ACTAAAGTTG 
CCTGATATTC CTAAATCACT TGACCCTAAA 
CGTACATTTT CTATTGTCAC CTGATAACTT 

15 

TTTGCTGTTG ATTCTTCTAT GCTATCCGGA 
AAAGATGAAG GCTTTTTCCA TACTTGGATA 

20 CCTTCGTAAA TAAACTTCTT TACATTTTTA 

TTAAATATAA CGTATTCGGG TCTTTTTGAT 
TCCAAATTTT AACCGTCGGT TGAGATGCGC 

25 GTTTAGTAAA AGCTTGAGAT GACAAAAGAT 
CATATAAATC ATTTAGTGTT TGTTTGAATT 
CAATCTGTTG CAATACACTT TCTGAAATAG 

30 TTAATGTGTT CATAGATTCA GGCGCGCTAT 
ACGTTTTAGA GTCGTTGAGA GTTGTATCTT 
CTAACCCTTC AACATTTGCG ATATTGATTT 

35 

CTGTATCTTT ACCATCAATT TGCCACATTT 
CGTT3TTACC GGGTGCGCCT TGTTCTCCTT 
GTCCCGGTTC ACCTTTATCA CCTTTCGCAC 

40 

AAGAAATGAC ATCTTTATCT ATTTTCTCTT 
TATCTTTTAA AATTCTCGTA ATAGCATCAT 
CAGCAGATTC AATACCACTA TCAACGATAT 

45 

CTTCTTCTTT CTCTAAAAAC AGCTTACAGC 
TAGGTATCTT GTAGGTAAGG AAACCTTTTA 
50 TGAATATAGA GCCATCTTCC ATAAACAAAT 
GATCGATACG ACCTTGTTTG TCATTGATAC 



GTAACGTG AG GATAGCGCCT ATAATTGCGC 4080 

ATCCGAATAA ATCCGTGACT TGCTTGATAA 4140 

GTACTGCTTT GTTTTTGAAT CTCAATTTCC 4200 

CCAAAATAAA AAAACGACTA AAAATTAGTC 4260 

GATCCTGAAT AAACATCACT TATAGTGACG 4320 

ATATTTTTAC TTGCAACTCC GCTATTGACT 4360 

TTAGTTTGCG AAATCCTCAT TATACCGCTA 4440 

TTATTGGGTT CAACTCCATT TATTGTCCAT 4500 

TATTTATTTT TAGGTAAGGG TTTTATTACA 4560 

TTTCCAGCAT ATACTTTTGT ATATTCTTCA 4620 

AAATTACCTT CCATAAAAAT CACCCTTTAA 4680 

ATATATAGTT ATATTCATTT TCTGTTCCTG 4 740 

TTTTTAGTTG ATATAAATTA TCCGCTTGTT 4 800 

ACCGCTCGTC ATGATTATGA TTTTTTGGAG 486 0 

CCTCAAAATC TTCTGTATTA ACTTTTGAGC 4 92 0 

AGTTGTTTTG TATTG CTTCT GCTAATTCTC 4980 

CAACTAGTTC AGCAATTTTT GTATCCGTAT 504 0 

TGATTTTTTC pj^^q^vqq AATTTATTTT 5100 

TGTCCAATAA CTCAGGTTCT GCTTTGATAT 516 0 

TAGTGTCAGG ATTGATTGAT ACTACAGTAC 5220 

TTTTACCTGC TTCACCTTTT GCTCCAGGTT 5280 

CTTTAAATCT ACTTTCATTC TTTTCGATGT 5340 

TAAAGTCTTT GCTCAATAAA TCTGTCGCGT 5400 

CTACCAATTT AACATCGATT TCTTTTGCTA 5460 

TGAAAGAAAA GTTTGCGACA TGTATTTTTT 5520 

GAACATAACC AGCGTGTTTG ATAACCTTTT 5580 

CAACATCGTC GATAATAAGG GGCTCATTTT 5640 

GTAATCTAGG TGTTAAGCCA TGTGCTTTTA 5700 

CTATTCTTAT AGATGCTGTA TTTTCATCTT 5760 
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CAACATCTTT TATTTTGTAC ATTTACACAC CTCTTTATTT ATATTTATCC CTTGTGAAGT . 5880 

AGATACCTTT TAAGCCGATT TGTTTATATA ACTTAGCGAT TGTACTTGCT TGATGTTGGC 5940 

5 

ACCACTCTA7 AGCAGTAGCG TATTGGTGGG TAGCTGGATT CTTAGGATTC CATCTAATTC 6000 

GGTACAATGT GTTTTGACCT TTATTGATGT AATCCTTTCT TACGAAGCTA GCACCGCCCA 6060 

TGATTGCTTT TGCTGGAGAT GTCCAACCTT TATTCCTTGC AAACGTCATT GCGTAGTTAG 6120 

GATTGTTGTC GTAAGCGCCA ATGCCGAAGT AGTTGTATAC TCCATCTTTT CCGTTAGCGA 6180 

AGTTACTTGT TCCATATCCA CTTTCTAAGA AAGCATGCGC GATTAAATAA ATTTCATTAA 6240 

TGTTGTGCTT TTTACAAGCT TCTGCGAACG CTTTACCTTG ATTATTCAAT GTTCCCTTAC 6300

CTTTAAGTAT CTTATTAAGT GCGCTAACTG AAACACCTTG ATACTTGCCT AAATTAAGCA 6360 

TTTGGTAGCA TTGTGTGTTA CTTTCCCATA TACGCTTTAC ATTCATTGCT GAACTCGTTT 6420 

20 GTGCTCGTGT AGCGTTAscC AACCCCAAGC ATTAGATTTT TTCGGGTTAC CTCTTGCCAT 6480 

TTGTTTATCC AGTGCTTGTT TGAATGTATA AGGACTCGTT TCTGTTATGA TCTGCGGTTG 6540 

TTTAGATGCC GAACCATTGT TGGCTGTTGG TCACGAGTCT CTTACATTAG CTATATCAGC 6600 

GTTTTTATTA TCTACCATAA CTTTTATTCT AGATTTTGTT ACTGTTGGCT TAGTTATAGA 6660

ATTTAATAAT TTTTCTCTGT TTTTAAATAT ATTAAGTAAT GCCTTTTCTA ATGCTTCGTA 6720 

TTTATCTTTA GGAGGAACAC CGTTGTCAAT CATATTCCAA TTAACATGTT CCAACATTGA 6780 

30 ACGCCAAATG CTGTCGTCTA CTTTTAAATT TTCAATACTT AGAGGTATCT CATATTTGGC 684 0 

CATCATATCT ACAGCTACAA CCATTGCGTG AATCTCATTA AAAATAAATT CATTTTTACT 6900 

CGCACTATAA TCTTCACATA CGTCTATAAC TATATAATCA GGTTCATTAG GAACTTCAAA 6960 

35 

TACAGCTCTT CTAGGTGCCC AAATATTATG TCTATCAACA TAAAAGTGGG GATATTCTAC 7020

ATCCTGTTTG TATTTCTTCC TACTGTTATA TAAACTTTCT ACCGAGCTCA TCGTTTGTGC 7080 

GTTTCTAATC ATTATTCCTT TAGGTTTTTC GAGTCGTCGA TTACCTTCTA CTATAAAGTG 714 0 

40 

ATAAATATAT TCTGGATAAT TAACCTCTTG GCTAGAAATA GTGTACTTTA TAGTTGTTAC 7200 

ATCTTTCCAA ATTGGAACTT TTTTATTATT TTTTTCGTTA TCATCACTAT CATCTTCTGG '7260 

TTTAGGTGCC GGTGTAGTTT TGTCTGGATG ATATGGTGGT CTAACAAAAT ATTTAACCCC 7320 

45 

TCCACCTGGT CCATCATGAT AAGAGTGTTT AATTTTATAA GGTGGACTTC CTGTTGCGTT 7380 

ATTTGTATAC CAGTTTTGAT CTACGCCATA CCAATAGTCT TTTGTGCATG GTCCCACTAC 744 0 

50 AATGTTTACA TGTCCTGCCC AACCACCAGT CCAAACACCC CAGTCGCCTG GTTGTGGTAC 7500 

AAAATCTTTT GTATTTCTAA TTATCTTGAA ATCTCTACCT CTATAATTGG ATTTTTGAGC 7560 
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TAAATCCCAG CATTGTGCTC CCATTCCAGA ACCAGGTACA TCAATAGCTA TTTTGTTTTT 76 BO 

AGCGATATAT AACGCCCATT CAACCACTTC ACTAGCTGTG GGCTTTCTAT TTTTCGGATT 7740 

5 . AGGTAATCCC ATGTATGCAC CTCATTTCAA TCAAAATAAA AAGCCAGTGC CGAAGCACTG 7800 

ACTCTTAACT GTTATTTACA TTTACCAAAC CAGAAGCACG CCCAGAAGCT ATATCCTAAA 7860 

ATCCCTTTAA GCATGGTAAT CACCTCCTTT AAATACCAAA AACAGTTCTT AGTAAAGCTA 7920 

10 

TGACAATCGT ACTGAAGATA GTCCCTATCA AACCTAGAAT CCACATTTTT ATGTCTCTAA 7980 

TATTCTTGGC ATTCTTTTCT TTATTCTTTT CATCTTCTAC CTTGTCGCGC TTTAATTCTT 804 0 

CAAAATTTGT ATCTAATTTG TCATAAATCT TTTCTTGCGC TCTAAGACTA TCTTCTATTC 8100 

IS 

TGTCGAATTT TTCAAACATA GTCTTATCAT TTTCTTCTAA TCGCGTTAAA CGCCAATCTT 8160 

GTTCATGTCG TTTGGTAAAT CCAAACATTA TGCCACCCAC TTTATTCAAA TTAAAAAGCC 8220 

ACAAGCATTA CACCTGTGAC TTTTCATCTT TTGTTTCTGG ATATTTTTCT CCAGTGATTA 8280 

20 

AAGCGTATTC TTCTTTATCG ATTAAACCCT TGTCTACGTA CCACTTAATT TGCTCGTTTT 8340 

TATAGTAACC CCAAACATAA AAAGTTTTAA TGTCTTTAAA AGTTGGATAA ATCATCTTCA 8400 

25 TTATTTAAAC GTCCCCCTCA GTACTTGTTT TGTTAGTTTT CAGTTCAGTC AACTGTTGTG 846 0 

TTAACATAGC GTTTTGTTGA GCTAATTCCA TTGTTAATAC GTTTACTTGT GCCACCTGCA 8520 

TTTGCATACT CGCAACCATT CCGCGAAGTT CCTCATCACT TAAATCTGAC GCACTTTGTT 8580 

30 GGTTTGATGC ATTCGGTACG TCTTCTTTTT CGAAATTGCT ATTGTATTTA ATTTCGCCGT 864 0 

TAGTGAAAAC AAACTTTCTA GGTTCGAACT CTTCTTTAAA TTTAATAGGC ACATTGTTAT 8700 
. CATCTACATC TAAACTATTG CGTAAACCGC CAGTATTAAC GAATCCGATA ACTTCGTTTT . 8760 

35 

TATCGTTTAC TGTGATTTTG ATTATTTCCA CCCCATAATT TTAGTTATAG TAACTTTGTT 882 0 

GGCAJTCGCT CCAGAACCTG ATGTTTTACC TAAATCAAAG TACACATCGT TATCTATTCT 8880 

TAAAGTAGTG CTACTTGTTT TGGATAGTAA GCACTCATAA ATACCGCCAC CGTTGCCGTC 894 0 

40 

TGAGTCAACT ACATTCGCTT TACTCAATTG AATCGCGTTA GGTAATGCGG TTAGTCCGAA 9000 

TCCCTCAATA ACGCCACCTG GATAAGTTCC ACTTACCAAC AAAATAGAAT AGTTTGTGTA 9060 

CGGTTCAGTT AGATTGATTG TTGTACCTAC ACCATTTGCG CCACCGTCGA ACAATACCGT 9120 

45 

TGATTTATGT TCATTAGGAA CTGTCCACTG TTGCTCAAGT CTGCCGTTTG TGATTGATCG 9180 

TGTGTAAATC TTTTTAGAGT TATAAGGTGT GAAGTTAAAT AGCTTGTTTG TATCATCTTT 9240 

50 AACGAATACC GATAAATAAC CCTCATAACT TTCAACGCTA CCTGGTAAAT CCGGCACTCT 9300 

TGTTGCATAG TAATTACCAG CAGTTAAATA TCCCAAATCG CCTTGCGCAT TATTTAAGTT 9360 
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10 






GAATTTATCA TCTACATACT GCTTAGCTTG ATTTAAAGCG TTGTTAGACG TTTCTTCAAC 9480 

AAATTGCTTA GTTAAGTTTC CATCATTCTT TTTATAAAAC GGGTACCATG TGCCGTAGAT 9540

TTTGTATTTT GTGTACTCAT CGTTTGAATC GTCTGGGTAC CATGTTGCAC GAGCAGTATT 9600

ATTATCAACA ACATAAACAA CTAACACACC AGATTTGCTT GATGTATAAG TTGATTCATC 9660 

GAACGAAGAA CCGTCATCAA CACCATCTTG TCCAGGCTTC TCTAACGTGC CTATATCCGT 9720 

CTTTTCTGGC GCATCTGTTG CATTAGTAAT ATGAATAATC CTAGATGTGT TAACTGCGCT 9780 

TAAAACGCTA TCTATGGACT GCTCATACGA TTCAATTGCT TTACCGTAAT CATCTGTAAG 9840 

TTTAGACTTT TGCCAATTCG TTGTTGAATT ACCTTTAACA AGGTCAGCGC CATTGATTTG 9900 

TTGTTCAACT TCGTTAACAC GTTCAAAAAT CGCTTGCTCT TTTTCAACTA TTTTATCGAA 9960 

TTCAGCTGTA ACAGCTTGTG TTGCACTAGT TTGCGTCGCA GTAATAGCTT GTATAGCTTC 10020 

GTTTTGCTTG ATTTCGATTT GTTGAATGCC TTTTGTCGCA CTATCATTCA CTTTTGCTAT 10080 

TAACGTTTGT GTATCAGCCA TATTTTGCTT TAATTGGTTA AAATCTTTAC CGACAGCTTC 10140 

GATAGTATCT TGAATAGATT TGATATAAAC AAGCTTTGTT ATACCATCAA ACCCACTAAC 10200 

TAAATCATTT TCAATATTGA AGCTAAATTG ACGTTCAACA ACAACATTAT TACTCCCGTT 10260 

TTGTGTAAAG AATGCCTGAG CATGCACCTT GCCTGAATGT TTTAAAAATT CATTCGGTAT 10320 

CACATACTGC AAACGCCCAT TAATTGCGTC TACTATCGTT AATTCGTCTG AAATATAAGC 10380

GCCTCTATCT ACGTTATAAT CATCGGTTTT TAAnACGATA GATGTTTTAA CATGTTCAGA 10440 

ACTTATAGAT AAGGGTCTGT TATnCTTAGT 10470 

(2) INFORMATION FOR SEQ ID NO: 21: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3647 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 21:

ATCAGATCTT GAGAATCGAG TTATTAAGTC TATCGAAGAC TTAACTAAAA TCCAACCATT 60

CATGCCTACA CAAGATTTTG ATTTTAAAAC TAAAGAAATT CAATCAAACA TTTCTGAAGA 120 

AAGATTTATC GAAATGATTC AGTATTTCAA AGAGAAAATA ACAGAAGGGG ATATGTTCCA 180 

AGTTGTGCCA TCAAGAATTT ACAAATATGC GcATCATGCT AGTCAGCATT TAAATCAACT 240

TTCGTTTCAA CTGTATCAAA ATTTAAAACG ACAAAACCCA AGTCCATATA TGTATTATCT 300 
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TCAAATTGTA 


ACAACTAATC 


CTATTGCAGG 


TACGATTCAA 


CGTGGTGAGA 


CGACACAAAT 


420 




AGATAATGAG AATATGAAAC AACTACTTAA TGATCCAAAA GAATGCAGCG AACATCGTAT 480

GCTAGTTGAT TTAGGACGTA ATGATATTCA TAGAGTAAGT AAAATCGGTA CCTCAAAAAT 540

TACTAAATTA ATGGTTATTG AAAAATATGA ACATGTTATG CATATCGTAA GTGAAGTCAC 600

AGGTAAAATA AATCAAAATT TATCGCCAAT GACAGTTATT GCGAATTTAT TACCAACAGG 660

TACCGTTTCA GGTGCACCAA AATTACGTGC AATTGAAAGA ATATATGAAC AATATCCACA 720

TAAACGGGGC GTTTATAGTG GTGGTGTTGG ATACATAAAT TGTAATCATA ACTTAGATTT 780

TGCATTAGCA ATTCGAACGA TGATGATAGA TGAGCAGTAT ATCAACGTAG AAGCTGGTTG 840

TGGCGTTGTA TATGATTCTA TTCCTGAAAA AGAACTGAAT GAAACGAAAT TGAAAGCTAA 900

AAGCTTATTG GAGGTGAGCC CATGATCTTA GTTGTAGATA ATTATGATTC CTTTACATAT 960

AACCTAGTGG ATATTGTTGC TCAACATACT GACGTCATTG TTCAATACCC TGATGATGAT 1020

AATGTGCTGA ATCAATCGGT GGACGCTGTT ATTATATCTC CTGGTCCAGG GCATCCATTA 1080

GACGATCAAC AGTTAATGAA AATCATATCA ACCTATCAAC ACAAACCCAT TTTAGGTATT 1140

TGTTTAGGGG CTCAGGCACT GACTTGTTAC TACGGTGGAG AAGTCATTAA AGGCGACAAG 1200

GTTATGCACG GCAAAGTTGA TACACTAAAG GTTATATCGC ATCATCAACA TCTGTTATAT 1260

CAAGATATAC CAGAACAGTT TTCAATTATG AGATATCATT CATTAATAAG TAACCCTGAC 1320

AATTTTCCAG AAGAATTGAA AATTACTGGA CGTACCAAAG ATTGTATACA GTCATTCGAG 1380

CATAAAGAAA GACCGCATTA TGGTATTCAG TACCATCCTG AATCATTTGC TACAGACTAT 1440

GGTGTCAAAA TAATTACAAA TTTCATTAAT CTAGTGAAGG AAGGATGAAA ACCATGACAT 1500

TACTAACAAG AATAAAAACT GAAACTATAT TACTTGAAAG CGACATTAAA GAGCTAATCG 1560

ATATfiCTTAT TTCTCCTAGT ATTGGAACTG ATATTAAATA TGAATTACTT AGTTCCTATT 1620

CGGAGCGAGA AATCCAACAA CAAGAATTAA CATATATTGT ACGTAGCTTA ATTAATACAA 1680

TGTATCCACA TCAACCATGT TATGAAGGGG CTATGTGTGT GTGCGGCACA GGTGGTGACA 1740

AGTCAAATAG TTTCAACATT TCAACGACTG TTGCTTTTGT TGTAGCAAGT GCTGGcGTAA 1800

AAGTTATAAA ACATGGtAAT AAAAGTATTA CCTCaAATTC aGGTAGTACG GATTTGCTAA 1860

ATCAAATGAA CATACAAaCA ACAACTGTTG ATGATACACC TAACCAATTA AATGAnAAAG 1920

ACCTTGTATT CATTGGTGCA aCTGAATCAT ATCCAATCAT GAAGTATATG CAACCAGTTA 1980

GAAAAATGAT TGGAAAGCCT ACAATATTAA ACCTTGTGGG TCCATTAATT AATCCATATC 2040

ACTTAACGTA TCAAATGGTA GGCGTCTTTG ATCCTACAAA GTTAAAGTTA GTTGCTAAAA 2100
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5 



AAGCAACACT ATCTGGTGAT AATTTGATAT ATGAATTGAC TGAAGATGGA GAAATCAAAA 2220

ATTACACATT AAATGCGACT GATTATGGTT TGAAACATGC GCCGAATAGT GATTTTAAAG 2280

GCGGTTCACC TGAAGAAAAT TTAGCAATCT CCCTTAATAT CTTGAATGGT AAAGATCAGT 2340

CAAGTCGACG TGATGTTGTC TTACTAAATG CGGGTTTAAG CCTTTATGTT GCAGAGAAAr 2400

TGGATACCAT CGCAGAAGGC ATAGAACTTG CAACTACATT GATTGATAAT GGTGAAGCAT 2460

TGGAAAAATA CCATCAAATG AGAGGTGAAT AATATGACGA TTTTATCAGA AATTGTTAAA 2520

TATAAACAGT CACTTTTACA AAATGGCTAT TATCAAGACA AACTTAATAC CTTGAAAAGT 2580

GTGAAGATTC AGAATAAAAA ATCTTTTATA AACGCAATTG AGAAAGAACC AAAGCTAGCA 2640

ATTATTGCAG AAATTAAATC GAAGAGTCCT ACAGTTAATG ACTTACCTGA ACGAGATTTA 2700

TCGCAACAAA TCTCAGATTA TGACCAATAT GGTGCAAATG CCGTGTCCAT TTTAACTGAT 2760

GAAAAGTACT TTGGTGGTAG TTTTGAAAGA TTACAAGCAT TGACGACAAA AACAACATTA 2820

CCCGTATTAT GCAAAGACTT TATTATAGAC CCGCTTCAAA TTGATGTTGC TAAACAAGCT 2880

GGTGCATCTA TGATTTTATT GATCGTTAAC ATCTTATCTG ATAAACAATT GAAAGATTTA 2940

TATAACTACG CTATATCGCA AAATCTAGAA GTGTTAGTTG AAGTACATGA TCGCCATGAA 3000

TTAGAACGTG CCTATAAGGT TAATGCTAAA TTGATTGGTG TAAATAACAG GGACTTAAAA 3060

CGATTTGTTA CAAATGTGGA ACATACAAAT ACTATTTTAG AAAATAAAAA AACAAATCAT 3120

TATTATATTT CTGAAAGTGG TATTCACGAT GCATCTGATG TAAGAAAAAT CTTGCATAGT 3180

GGTATCGATG GCTTACTAAT AGGTGAGGCG CTTATGCGTT GTGACAATCT ATCTGAATTT 3240

TTACCACAAC TGAAAATGCA AAAGGTGAAG TCATGATGAA ATTGAAATTT TGTGGCTTTA 3300

CATCAATAAA GGATGTTACA GCGGCCAGTC AATTACCTAT TGATGCGATA GGTTTCATCC 3360

ATTATGAAAA AAGTAAAAGG CATCAAACAA TTACCCAAAT AAAAAAGTTA GCGTCTGCTG 3420

TTCCAAATCA TATCGATAAA GTATGTGTCA TGGTAAATCC TGATTTAACA ACAATTGAAC 3480

ACGTATTAAG CAATACGTCA ATTAACACAA TACAGTTACA CgGCACAGAA TCTATTGATT 3540

TTATACAGGA AATTAAAAAG AAATATTCAA GCATTAAAAT CACTAAAGCT TTAGCTGCaG 3600

ATGgAAAACtn TwATCCCAAA caTtAAtnAA tnTTAgGGGG TCCGTGG 3647



(2) INFORMATION FOR SEQ ID NO: 22: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5966 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 22: 

CcAcCTTGAC CACCTTTACG TGGAATCTTT TCmCCTkGAG CAACaTCGaT AATaTATATT 60 

GAAAgTCAAC AAGTTCTGGA CTAAATGTTG CTGCTAAGTT ATCGCCACCA GATTCTATGA 120 

AAATTAGTTC TATATCGTCA TGACGTTCTA ATAATTCGTC TATTGCTGCA AAGTTCATAG 180 

ATGCATCTTC ACGAATCGCA GTATGAGGAC ATCCACCAGT TTCAACACCA ATGATACGAC 240 

TTTCAGGTAG AACTCCTGAA TTTACTAATA TCTTTTCGTC TTCTTTTGTA TATATATCAT 300 

TTGTAATAAC GCCGATACTC ATTTCTTTTG AAAGACGTTT TACAACTTTT TCAATTAATT 360 

GTGTTTTACC TGCACCTACA GGACCACCAA TACCAATTTT AATCGGATTT GCCACAATTA 420 

TAACCTCCTA TGATATGAAA tTCTAACATT GaCGTTCTCA TGCGCCATTT GATTTAGTTC 480

TAAACCAGGC GCTGTCATGC CAAAATCTGC TTCTTTTAAT TCGAAAATCT GCTTTCTTGT 540

TCCTTCTATA TAAGGAATCA TGTGAGTAAC TATCTTTTGA CCAGCAGTTT GTCCAAGTGG 600

AATAGCACGA ACAGCATTTT GAGTTAAACT TGAAACATTT TGATATAAAT AGTAATCAAT 660

AATCGTTTCA ATATCTACAC CTAAATGATG GCCTAGCATA GTAAAACAAA TAGCTGGATT 720

TnACTTTGCT TTCTTATCTT GCATTTGTTG ATGATACCAA GCAATCCATG GGCTATtATA 780

AAGTTCTAAA GCCAATTTAA CCATGCGAGT CCCCATTTGT kTTGCACCAA CACGTGTTTC 840

TTTAGGTAAG TTTTGrACAr ACATCAGTTT ATCTATGTGT AATACTTTTT GTGTATCATC 900

ATTTTCCAAT GCATCATAAA CTAaACGCAT GGCTAAACCA TCAGAATAGG TAAGTTGCTC 960

TTGTAAAAAC ATTTTTAACC AAGCAATAAA AGTATGATCG TCATGAATTA TATTTCGTTG 1020

AATATATGTT TCAAGACCAA ATGAATGACT GAAAGCACCT GTTGGAAACT GTGAATCACA 1080

GAACTGAAAT AATCTTAAGT GTGTATGATC AATCATGAGA ATGCCCTATA TGTCTGAAAG 1140 

CCTTATTAAC TTTACGGTCT TCTCGAACAT ATGGGATGCC TAAACTTTTT AATAAATCTT 1200 

CAACTAAATA ATCATATTGT ACTAGCATTT CAGTCTCTGT AAATTGTGCT GGCAAATGAC 1260 

GATTTCCTAA TTGATGGGCT ATATCTCCCA TTTCTTGCAA TGTTCTTGGT TGAATCACTA 1320 

AAAGATCTTC TGAATTAACA TCCACAATAA TCATATTATG GTCATCTGCG TATAAAATAT 1380 

CTCCATATTG TAAGTCAATA GGTTGTTTTA AACGAATGCC TATTTCAGTG CCATGGTCTG 1440 

TAACGACTCT TTGAATACGT TTAACAAGAT CTGAATTTTC AAGGTATACT TTTTCGACGT 1500 

GCTTTTGTTT TTCTGAATTT GACAAATTGG CAATATTGCC TTGGATTTCT TCAACAATCA 1560 

TTCTATGTTC CTCCTAGAAT AAGAAGTATC TTTGAGTTAA TGGTAACTCA GTTGCTGCAT 1620 

TACTTGTAAT TTTTTCTCCA TCTACATATA CTTCATATGT TTGTGGATCA ACGTCTAATT 1680 
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GACGCACCAT GCGTTTTAAA TTTAATGCAC 
ACACGAATGT CATTGACGTA CTTGTAAGGT 

5 

ACTTCATCGG TTCAGATGTA GGTATAGAAC 
TTAATCCGCC CTTTACAACT AATTCAGGTT 
CAATGTCAGC TAGTTTGCCC GGCTCGATAG 

10 

TTGCTGGGTT AATTGTATAT TTAGCGATAT 
CAAAATCACC ATCTAAAGGA CCACGTTGTT 

15 GTGTAATTAC TTCACCTACA CGGCCCATTG 
CACCCATATC TTGCAGAACA TCTT.CTGCTG 
CGAATGCGAT ATCTTCAGGA ATAGCCGCAT 

20 AATGTTCATC TACAGTATTA TGTGTATAAG 
TATTTGAAAA TGCAGCGGAT TTAATTAAAT 
TATGGTACAT ATGAAGTACA CGGTCTTTAA 

25 

CTTCATTTAA AGTATCTGCA TGTAATGCAA 
ATGCATGACT CAAAGCAGAT GGTGTTGCAC 

30 CTCCGGCATT GATTTGTTCA ATGAGTGCAG 
AACCGACATT AATCGGTAAA CcTTCGGCAG 
CTGGAGTTAC AGTTGTTGCT TTAGAACCTT 

35 TCGTAATACC ACTTTCTAAT GCGACCTCTG 
TATCAATACC ACCAGCAGTG ACGATTTTAC 
CTATAATAAT GTCGACATTA TCCATTATAT 

40 

TATAACCATT TTTAATGCCT ATATCAGCTT 
. CATTAGAAAT GACAAGGTCT GCAACGTTCA 
CCATACCGTC TCTAATAGAT TTACCACCAC 

45 

AGTCTTTTTC TATTTGAGCA AATAGATTCG 
TTGGACCGTA TAAGCTCGTA TATTGATTTT 
50 TCCTTTTTTA TTCACGTTTT CAGCACCGTT 
TTCATCAGTT GGGCGATAGA CACGTGACTC 
ACCAAAAATT TTACGTTTGC CAGCATATTC 

55 



GATTGATACC ATTTTCATAA GCAGTTTTAG 1800 

TTCCGCCGTA TTGACCATAC ATTTTACGGT 1860 

CATTTGCATC GCCATTTACG GCAGAGTTAA 1920 

TAACCCCAAA GAAAATTGGG TCCCATAAGA 19 BO 

ATCCTACATA TTCAGAAATA CCATGTGTAA 204 0 

AACGTTTGAT GCGATTATTA TCATTATGTT 2100 

CTTTCATGCG ATGTGCTACT TGCCATGTTC 2160 

CTTGTGAATC GGAACTAATC ATACTGAATA 2220 

CAATCGTTTC TTTACGAATA CGTGAATCTG 2280 

TTAAATGGTG AGTAATCATT ACCATATCTA 2340 

GCAAAGTTGG ATTTGTAGAT GAAGGTAAAA 2400 

CAGGCGCATG ACCGCCACCA GCACCTTCAG 2460 

CAGCAGCCAT TGTGTCTTCC ATAAATCCTG 2520 

TTTGAACATC AAATTCATCA GCAACATCTA 2580 

CCCAGTCTTC ATGTACTTTT AATCCAATTG 2640 

TTGGATTTGT TGCTTGTCCT TTACCTGTAA 2700 

CTTCTAACAT TCTATGAATA TGCCATGGAC 2760 

CTGAAGCACC AGTACCACCA CCAATATGAG 2820 

CTTGTTCAGG ATTAATAAAA TGAACATGAG 2880 

CTTCAGCGGC AATGATATCT GTTGTTGAAC 2940 

CTGGGTTGCC GGCATTACCT ATGGCGAAAA 3000 

TAACCACTTT ATCGTAATCG ATAATAACGG 3060 

CGTCATCACG TGTTACACGA GGATTTTGCG 3120 

CAAAAGTAGC TTCTTCACCA TAAACCGCAT 3180 

TATCACCTAA ACGAATGGAA TCTCCAACAG 324 0 

GCGTCATTTT AAAGCTCATG ATCTTTTTCC 3300 

ATCTCCGAAT ACACCTGCAT ATTCATCATT 3 360 

ATCGATAGGA CCATTGACCA TACCACGAAA 3420 

AACTAATTGA ACTTCTTTTT TATCCCCAGG 34 80 
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TTCGAAATCT AATGCTGCAT 


TTGCTTC&TA 


AAAATGAAAA TGTGAGCCCA CTTGAATTGG 


3600 




TCGATCTCCT GTATTTTCAA CTTCGATAAC TGTTTCAGGA TGATGGTTAT TAATTTCAAC 3660

CTCTGTACTT TTTGTAATAA TTTCTCCTGG TATCATTTGA CTGCCTCCTT TAAACAATAG 3720

GGTGATGTAC TGTGATTAAC TTAGTACCAT CGGGGAACGT AGCCTCGATT TCGATATCTG 3780

TAATCATGTG TTCGACACCA TCCATGACAT CTTCTTTGTT TAGAATTTGT CTACCATAAC 3840

TCATTAACTC TGCAACGGTC TTACCATCGC GTGCACCTTC TAATAATTCA TCGCTGATTA 3900

AAGCTAATGC CTCAGGATGA TTTAGTTTCA AACCACGTGC TTTACGACGA CGTGCAACTT 3960

CCGCCGCCAC TACAATCATT AATTTGTCTT GCTCTCGTTG TGTAAAATGC AAATTAAAAC 4020

CCCCAATTTC ATATTAGATA CaATTTACAA AATTTATATT AATCCTAATT GTTGTGATAA 4080

ACAAGTAATA TACAAAGTTC AATGTGTAAT TAGAAAATTA TATTTTTAGC ATATCCGATA 4140

TTGAAGCAAA CAATCTAATC GAAAACAAAT AGTGGAATAT ATTTATGTAA AAACCAAAAT 4200

AGTTTTTAAT ATAACTTTTC ATAGAATAGT AGTATATTAA TGAGTAATGA TTCAAAGGAA 4260

AGGTGAAAGA TTTGAAGATA ATAGATGTGC TTTTGAAAAA TATATCTCAG GTTGTGTTAA 4320

TTAGTAATAA ATGGACAGGA TTATTTATCT TAATAGGATT ATTTGTAGCC GATTGGACAA 4380

TTGGATTAGC GGCTATTGTA GGTAGCATCA TCGCCTATAC TTTTGCGCGT TTTATAAATT 4440

ATAGTGAGGC AGAGATTAAT GATGGGTTAG CTGGATTTAA TCCAGTGCTA ACTGCCATTG 4500

CGTTAACAAT CTTTTTAGAT AAGTCAGGAT TAGATATTGT TATAACAATG ATAGCAACTT 4560

TATTAACGTT ACCAGTTGCT GCTGCAGTGA GAGAAGTTTT AAGACCATAT AAAGTTCCGA 4620

TGCTGACGAT GCCTTTTGTC ATTGTGACTT GGTTTACAAT TTTACTTTCA GGACAGGTTA 4680

AATTTGTAGA TACATCGTTA AAGTTAATGC CTCAAAACAT TGAAACGGTT AATTTTAGCA 4740

ACAATGATAG AATaCATTTC ATTCAGTCAT TATTTGAAGG ATTCAGTCAA GTATTTATCG 4800

AAGCGAGTGT AATTGGTGGC GTATGTATTT TAATCGGCAT ATTGATAGCA TCAAGAAAAG 4860

CAACACTCTT AGCTGTTATA GCTAGTTTGT TAAGCTTTAT CATTGTAGCT CTATTAGGTG 4920

GTAATTATGA TGATATTAAT CAGGGATTAT TCGGTTATAA GTTTGTATTA ATGGCAATCG 4980

CACTAGGATA TACATTTAAA ACAGCGATTA ACCCTTATAT TTCGACTTTT TTAGGTGTGT 5040

TATTAACAGT AGTGGTGCAA CTAGGTACAA CAACATTGCT TGAACCGTTT GGCTTACCTG 5100

CATTAACATT GCCATTTATT ATCGTGACAT GGATTTTATT ATTTGCTGGT ATTAAACATG 5160

ACAAAGTAGA TGCTTGATAG TTAAATCAAA CCTAATATTG TTTGAATATC ACCTTAAACT 5220

ATACAGCGAA TTGTATAGTT TAAGGTGTAT TTTTATGGAT AAAATTAAGT GCATACTTAA 5280
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GTGTTAAACT AGGAATAAAT AATTTATATT GTGTGTTGTG TGGGGTGACT AATATGAATG 54 00 

ATATGGATAA TTCCTTTTTA ATAACAACGG AAATTCAAAG AAAATGGATT GAAAAATTCA 5460 

AAGTAATTAG AGATACATTT AAGGCTAAAG CTGAATATAA TGATCAACAT AGCCAATTTC 5520

CATATAAAAA TATTGAATGG TTAATTAAAG AAGGTTATGG AAAATTAACG TTACCAAAAG 5580

CATATGGTGG TGAAGGTGCG ACCATAGAAG ACATGGTTAT TTTGCAATCA TTTTTAGGCG 5640

AACTTGATGG TGCCACAGCA TTATCTATTG GTTGGCATGT GAGTGTCGTA GGACAAATTT 5700

ATGAACAGAA ATTATGGTCT CAAGATATGT TGGAGCAATT TGCTGTTGAA ATTAATAATG 5760

GTGCATTAGT TAATAGAGCA GTTAGTGAAG CTGAAATGGG TAGTCCAACA AGAGGGGGAA 5820

GACCAAGTAC ACATGCTGTT AAAGCTGATG ATGGGTATAT TTTAAATGGT GTGAAGACAT 5880

ATACATCAAT GAGTAAAGCA CTAACACATA TTATTGTTGC TGCTTATATA GAAGAATTAG 5940

AAAGTGTTGG TTTTTTCTTA GTAGAC 5966

(2) INFORMATION FOR SEQ ID NO: 23: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 17310 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 23: 

CTGTGTCATC GCGAAATAGT TAGGGTCATT CATTAATCCT TTTGAACGTA TTTCATCAAA 60

ATATAACAAT TTCATTAGTA AAGGGGACTT GTTCAAACCA GCTATAATAC AAAATAGACC 120

TATAGTCACA CTGCTTATAA TATAAGAGGT AACGATCACT TTTTTGCTAT TACCTAACTT 180

AAAGffTGATC ATCCCTAAAT AGAAATAAAT GACTACAAAT GCATATTTAA CTGTAGATGC 240

AAGAACTTCC TTAACCGTAA TAAATATCAA ATCATCAAAA AATaGCaAAC AArGCGTAAT 300

AATCATACGA TATGTATACA AAATAATGAm AAACTGTmAA AAATGATTTG CCTTTAATAA 360

ATGGTTAGCG AAAAACAGTA AATAAACTAA TATTAGTAAT GTGATAAAGT CAGCTATAGA 420

AACATTCACA CCGGCAATAA CCGAAGATTG CTGAATAAAA ACCGCTAAAC CGATAAGTAA 480

CAATGTTAGT AATTTACTAT TGTGTTGATT TTCCATTATA AACGTCTTCC ACTTCTTTAA 540

TCATTTTCTC CTCAGTAAAA CATTCTAAAT AACGTTTTCT AGATTGATTA CTCATTTTGA 600

TGTAATCACT GTCTATTAAA TATTTTTCCA GGACTTTAGC AATAGTTTCG GGTTGGTTGT 660

TCATCATACA TATACCATTA TCAGCTACTA ATTCTGAAAT ACCGCCAACA TGACTGGCTA 720
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TTATTAAAAT AAACGTATCG TATTGT&ATA ATAAATGACT CGCATTAATG ACATTGCCCA 840 

AAAATGTGAC ATCATTTTCT AACCCAGCTT GTACAACTTG TTGCTGACAA TCATTTAATG 900 

TAGGTCCATC GCCTATAAAT GTAAAATGCG CATGATTACT GTTATGTAAT TTCAATATCT 960

CTATTGCCGC GATTAGATTT TGTGGCAATT TTGGATAAGC AAATCTTGCA ATCATAACAA 1020

ATTGATGCTT TGTCGGGGCA TTAATCTGTA AATCTTGTTT ATTAGGCAAC ATTCCAACTA 1080

CTTCGCCAAT ATTGTTATGT GATTGGCTTT TTAGCGTTTG CTTAACAGCG GGAACATCTG 1140

CAATACCATT ATGTATTGTG GTTAATTTCA ATCGATTAAA TCGATATTTT AACGCTAACT 1200

GTTTATCGAA ATCTGAAACA CAAATAATGC TATCTGTAAT AAGTGACATT AATTTTTCGA 1260

TAACTAAATA TAGAAATTTT TTAGCTGGTT TAACACCCTC TGTAAAAGCC CATCCATGTG 1320

CAGTAAAAAC TATACGTGTG TCTTTCGATT TCGAAATGAa CTtCGCAATT CGTCcGACCG 1380

TLCCAGCTTT GGAAGAATGT AAATGGATAA CATCAGGTTT AATTTTCGAG AATAACTGTG 1440

CTAACACTTT GACAGCTAAA ATATCTTGTT TAAAGTCAAT TGGACCTACT AAATGTTCGA 1500

TAATAATTAC ATTAACTCTT GCATCTAGTT GTTCAATCAT TGGTCCATGA TTGCCTACAA 1560

TGACATAAAC ATCATTGTGT ACGCAAAAAT GGTTGGCGAG TTGAATGAGA TGTGTTTGTG 1620

CACCACCATT GTCTGCTTTA GTAATACAAT ATATAATTTT CAACTGTTAC AAACCCCTTT 1680

AATGCTATAC TTTCAATTTC TTAACATGGC TATCTCATCA GATGAATAGT ATTTATAGCC 1740

ATGCAAATCA ATGATGGCAC ATATTTCTTA ATGCCATTTG ATACTGTCTC AAGGGATTCC 1800

TCGTTATACT GTAACAATTG GTCACAATCT TTAAAATATA ACTTTTATTT GAACTTATTA 1860

AGTAAATTAA GACTACCTTG AGCCTTCCCC TGTAATAACA ACCATCAATG TTCTAATTGA 1920

TATATATAGT TCCATCATTA AACTACCTTT ATGTATATAT TTCATGTCAT ATTTCAGTTT 1980

TTGTTGCGGT GTTAAGTCAT ATCCACCTTG AATTTGCGCA AGTCCTGTTA ACCCTGGTGT 2040

AACAAGACAT CTTTGCTCGA AACCTATCAC TTCTGAACTA AATAATTCTA CAAATTCCGG 2100

ACGTTCCGGG CGTGGTCCAA TAAAACTCAT TTCCCCTTTA ACAACATTAA TTAGTTGTGG 2160

TAATTCATCA ATGCGTGTTT TACGAATAAA CTTCCCGACA TTTGTTATAC GATCATCATC 2220

TTTATCAGCC CATTGCGCAC CGTTTTTCTC TGCGTTTTTG CACATCGAAC GTAATTTGTA 2280

TATTTTAATT aatTTACCCA TCTTCCCAAC TCTAACCTGA CTATAAATAG GGTTTCCTGG 2340

CGAATCTATG ACGATAGCAA TGGCGAATAT AACCATAATC GGTAAAGTTA AAAATAATAA 2400

AACAATGCTT AAAATTAAGT CAATCGCACG TTTAATTGGG TAATAGCTTT TTCTCACTTC 2460

TTCTAGTTTG TCTAATTTTC TTTGATAGGC ATAACCCTTA TTATTATGGA CAGCTTCAAT 2520
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AATTAAAGTA 


ATCCTTTAAA CCTGTTTCTA CTGTATATTT 


AGGAACAAAT 


CCTAATGCCT 


2640 




TTAAGTTAGA AATATCTGCA TAAGAATGCT TAATATCTCC TTTTCGTGCT TCTTTAAATT 2700

CATGCTCGAC TGATTTTCCA TATAATTCAC CAATAATACG ATAAACCTCT AATAAATTAG 2760

TAAAAGTGCC TGTACCAATG TTATAACCGT GTCCAATTGC ATCTTTGTGT TCCATAATTA 2820

AGCGTACAGA TTGAACAACA TCATATACAT ATACAAAATC TCTAGTTTGC AGTCCGTCAC 2880

CAAAAAATGT AAATGGCTTG TTATGCTCAA ATGAATCGAA CATCTTTGAA ATCACACCTG 2940

AATATTGTGA CTTAGGATCC TGTCTTGGCC CAAATACATT AAAAAATTTA ACAACCGCTG 3000

TTGGTATGTT ATATAACGAA CAATAATTTA ATGTCGTCCG TTCGCCGTAA TATTTATCTA 3060

TTGCATATGG TGATAATGGT AAGATTAATG ATTGATCACT TTTAGGCAAA TCAGGAAGAT 3120

CACCATAAAC AGCTGCTGAC GAAGCAAAGA TAAAACGTTT TATATGATTA TTATATTTTT 3180

TAATGATTTC TAACAATCTT AATGTTGCTA CGACGTTTAT TTCTTGAGAT AAGATAGGTT 3240

TCTCAACCGA CTCAGCAACA CTAACTAATG CTGCTAAATG AATAACATAA TCAAATTGAT 3300

ATGTCTTCAT GATTTGTTCA ACTGCATCAT ATTCACGAAT ATCTAATTCA AACACATGAT 3360

CGTCAGCCAA ACTTTTAATA TTTTCTCGTT TACCTGTTCT ATAGTTATCT AGAACATAAA 3420

CATCATAATC TTGTTGTAAA TCATCTACTA AATGCGACCC AATAAAACCA GCCCCACCAG 3480

TTATCAAAAG TCTTTCCAAA TCTTCCACCT CATTTATACA TTAAAAATAT ATCATAAAAA 3540

CATAAAGTAT TGTAAGCTTT TTATCGATAT TTTTTATTTA TAAAAATAAA ATGAGATAAC 3600

TTTGTGAATT TTTATTGAGA TAAATTAGAT AGTGGTGTTT TTGTGATGTT TTATAATATC 3660

TTGGGTGTGT TAATACTAAT AATGCTTTCA ACTGATGCAT TAGACTGTGA CATCATAACT 3720

CACTTAAGAA CTTCGCTTAT TAATTTTCTA CCAATACACT CCCTTCTAAG TGCACTAAAA 3780

AATCCTTACT GCTAAGTGAT TAAACTTAAC AATAAGGATT TATTTATCAT TAGTGGATGA 3840

TTATTAACGG AATCTCATAC CACCATCTAC AATAATTGTT TGTCCAGTAA TGTAATCAGA 3900

GTCTTTACCA GCTAAGAAGC TCACTACATT TGAAACATCT TCTGGTTGAG AAACTCTGCC 3960

CAAAGCAATC TGACTTGTAA ATTGTTCCCA ACCCCATGCT TCAGGTTTAC CTGCTTCTTC 4020

GGCTGTTGCC ACTGCGATAC TTTCCATCAT TGGTGTTTGA ACGATACCAG GTGCGAATGC 4080

ATTCACAGTA ATACCTTCAG ACGCTAAATC TTGTGCGGCT ACTTGTGTTA AACCTCGCAC 4140

TGCGAATTTT GTACTGCAAT ATAAAGACAA GCCTGGGTTA CCCTCAACGC CTGCTTGAGA 4200

TGTTGCATTG ATAATTTTAC CGCCATGATT GAATTTTTTA AATTGTTCAT GTGCGGCTTG 4260

AATACCCCAT AGCACACCTG CAACGTTCAC GCCATATACT GTTTTAAACT GTTCTTCAGT 4320
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GCCAAATTGC GCGGCAGTTT GTCTTAcTGC GTTAAATACA TCATCACGGT TTGATACATC 444 0 

TGCTTTGATA GCAATAGCTT TTGTACCATC- ACTTGATAAT TTAAGTGCAG CTGCTTTTGC 4500 

5 

CCCTTCTTCA TTGAAATCAA CAACTGCTAC TTTGAAACCA TCTTCCACTA AACGTTCTGC 4560 

AATTTTAAAA CCAATCCCTT GTGcTCCGCC AGTTACTAAT GCTACTTTGT TGTTTGTCAT 4620 

AAAGATCACT CCTCAAATTT CTTTCCTTTA ATTACATTTT ACTCCTCTTC ATTTGAATAG 4680 

10 

TACAACAAAG GTAGCTCCAT TTAACAAAAT ATTCAGATAT TTAAGGTATA GTTAAACGCA 4 74 0 

CTACCATTAG TGATTGGCAA TGCGTTTAAA TGTCGTTTTA AAAGTTCTTA TGTTGAATAT 4 800 

TATTTTTTTA AGTCTCTCGA TTAGTTTGTC ATCAATCTTT TTTCGAGACA TGGTCTTTTG 4860

ATTCAATAGG CGGTTCCGTG TTATCACTGA CAACTTTAGT TGTAGCTTCA TCTTTATGTA 4 920 

TTTCTTCGTT AAATCCTTCA AGGTTTTTAG TCGTGGGATT TTTAACCTCA GGATGTTCCA 4 980 

20 TCATGTCTTG ACTATCAAGT TCCTTTTTAC ACGTGTCTTT ATGTGATGCT TGATTTGCGT 5040 

TCCCTTTACT TTTTTGAATA GTGGTAGTAT CTGCTGCAGC TACTAATTTT TTTCTACCTA 5100 

AAATAGATAT GGCTGAAACA AACCAGAGTA TTGCAGATAC AAAGTTGCAT AATACTAAAG 5160 

25 

CGATAATAGC CAATACAATT AATATGACAC CTTTTGAAAT CCTTTCTTTA AATAAGTCAG 5220 

ATGCCAATAC GATGACAGGT ACGATTGAAA GTATAATTAC AAATATAGAA ATTATTGCCG 5280 

ATATAACTAT TGTTACTATT AAATAATCAG CTCTGCTACC TGATAATAAA TAGAAAAGGC 534 0 

30 

CGAAAATTAG TCCATAGCAA ATTACAAACC CACATAAAGT TATAGCCATG AGTACTATAT 5400 

AAGCTATTTG AAAATATAAA CCTATCTTTA TGAATGATTT TTCTACATTT TTTTCCATGT 54 60 

35 CTATTCCCCA TTTATTTAAA ATTTATACTT TACCTTAAAT ATTCTCTTTA TTCTTTAGTG 5520 

ATTTTATCTT TAGATTCAAA TTGATTCTCT GTACTTTCAA TATCAACTTT TTCATTTTCG 5580 

TCTGTCGATT CATCTTTTGA GTATTTATTC * CAAATCAGCA AAATACCACC AATCAGCCAT 5640 

AAAATTGACG AAAGGAAATT ATATAAACAC AGTGCAATAA TAGCATAAAC AATAAAAAGT 5700 

GCACCTCCGA TTACAGAGTA ACTTTCCATA TAAATCGCAG TAAAGATGGT TGGTAAAACA 5760 

GTGAAAAGAG CCAATATTAA TCCTAATAAA AAAATTGTTT CGTAATCAGA TCCTCCAGCA 5820 

45 

ATATTAATAG ATATCATCCT AACAAAAACG ACACTAAAAT ATATTTGAGC TACGATGCCT 5880 

ATCCAAATTG CTATTTTTCC TATAATTGAG CTCATACTCA TTCCCCATTT ATTTAAAATT 5940 

50 TATACTTTAC CTTAATATAC CTTATTTTAT TTAATTTTTA TATGCAAAAT ACAAAAATGG 6000 

AGAACTTCAA TATTTATAAA ATATCAAAAG TTCTCCACAC TATATTGTTT TATTATATTT 6060 

TCGCTATCAA TACGCTAAAT CATCATATTT CCCTCAACAT CACAGTAAAA CTATTGCTCC 6120 
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TTCCAATTGC GCAGTTGTTC AACATCATCA TCTTGTTTAA GTAATGCCAG TGGTACTTGA 6240 

AGATTAAGAC ATCGTCCTGA AATATTAAAG CGTGTCACAC CTGCTGGCAC AGTTTCCCCT 6300

TTATGAACAA CCGCTTCAAT TTCCTTATAA CTCAATGGCT GATACTTCAT GAGTACATCT 6360 

TGTTGAGAAA GACAAGGATA TGTACCTTGT GCAATTCTCT CTACAGAACA ACAACCACTA 6420 

TAACTTGCGA CAACCTTTTC CCATACTTGA AAATGTGCTT CGCCTAAATC TTTTGTATAC 6480 

AAATATTGTT CTGTATCACC ATGACACATT GTAATAAATG GCGCTTCTTG TCTTGTCTCA 6540 

GTAGTCCATG GCAAGCGATG TTCTTGTTGT AACGTTTCCC ACCACACACC AAATGGAACT 6600 

75 TTATGTTGCC ATGTACTAAT TGAATATTGT GTTTCATGGA TTTCTTGCAC TGGAACTTTC 6660 

TTACATCCTA ACGCTTTCAA ACTTGTATAC CGATGCACAC CATCTATAAC CATATATCTA 6720 

CCATGTTGCA TCGCTGTCAC TAAAATAGGA TGACGTATAA AATCATCTGC TTCAATACTA 6780 

20 CTTTTCGTTT TTTCCAATCT TAAAGGTTCG AATGTTTCGT GAAGATCAAT CTTATCTACT 6840 

GGTACCAATT TTAAATGTTC ATGAATATGA TTCAATAGTT ATTCATCCTC CTTTGTTTGT 6900 

GTTAAATAAA TAAATTCAGG ATGTGGATGG CTTAAGAAAT CGTGATGTGA AATAGACCAT 6960 

CCGTATGCAC CTGCATATTT GAAAACAATA ACGTCGCCTG TACTGATTGC GTCTATCTGT 7020 

ACTTCTCTAG CAAAGACATC TTTCGGTGTA CATAATTGAC CGACTAACGT TGTGTCCTGT 7080 

CTCGAAATTG AAACTTTTTC AAATGAATAT GGATTGTCCT TATAGCGATA AATGTCAAAA 7140 

GGATGGTTAT GTTGCCAAGA TACCGGCAGT CTAAATTGTT GCGTACCTCC TCTTAATATG 7200 

GCATACCAAG CACCATGTAC TTTCTTAATG TCTAGCACTT CTGTCACATA GTAACCAATA 7260 

TGTGCCACAA TAAAGCGCCC ACATTCAAAG TTCAATGTCA CATCTTCCAT TTCTTGCTCA 7320

ACGATAAGTG TTTTAAAACG TTCTACAAAA TTATCCCATT CAAATTGGTT AGTTAAATCT 7380 

GCAXAGTTAA CGCCTATGCC ACCACCAAGA TTGATATGTT TGAGTGGAAA TCGATGTTTT 7440

TCAGACCATG CCTTTGCTTT TTTAAAATAA AGTTTCACTA CATCGACATG TAAATTCGAG 7500 

TCTAAATTGT TAGAAATAGA ATGAAAATGA AATCCATCTA GATGAATCTT TGGCATTGCG 7560 

AGCGCAgcTT CAATGACATC ATCAACTTCG TCTTCAGAAA TACCAAATTG TGTTGGGCGT 7620 

CCTGCCATAT GCAACGTTGC ATTGGGAAAT GGTCCTGCTA AATTAACACG CAATAAAATG 7680 

TGTTGTGTCT TATCTTCATC TTCTAAGATG GCATTTAGCC GTTGTAATTC ATGCATACTT 7740 

TCAACATGAA TACGCTGAAC ACCTTCACTT ACTGCATATC TTAGTTCCTC GTCTGTCTTA 7800

CCAGGGCCAC CAAAAATAAT ATGATTTGCT GGTTTAAAAG CAAGACCTTT TGCTATTTCA 7860 

CCTTGAGATG CAACTTCGAA TCCTTCAACA TACTGACTAA TTGTATCTAG GATTTTTCGT 7920 
TGTTGCAAAT GATGTTCCAG TCCGACTAAA TCATAGATAT AATGACAAAC TGGATGAGAT 8040

TGTGCTTTTA ATTGTTCAAT AACAGGTTGA ACTATACGCA TTAGCCTTCA TCCCCTTTCT 8100

GTTTAGACGT CGCTAGAGAT GCACTTAAAT GGCGATATAT TTTTCCGCGA TCATCACCTA 8160

AAATAAATGT TTGTACACCT TGTGCCTGCC ATTTTGCAAT ATCTTCATCT TCACGTGGTA 8220

ATGCACAAAA ATGTTTACCA TGTGCATTCA CAACTTCAAA AATATGTTGA ACATGTGATG 8280

TTACTTGATC ATCACGCGTT TGCCATGGTA TGCCAAGTGA CTGCGATAAA TCTGCGGCAC 8340

CTTCGACTAT CATGTCTAAA CCTTCGACTT GTGCTATATC GTCAATGGCC ATAACCCCTT 8400

CAACATCTTC TATCATGGCA ATCACCATAA TATGCTCATT AGCCATCTCC ATTGCATCAA 8460

GTAATGGTGT ACGTCCAAAT CTTGCCATGC GACCACCATT CAAACTTCTT AATCCTTGCG 8520

GGTAATAACG ACTTAATTTC ACAATATGCT CAACTGTCTC ACGATCTTTA ACGTGTGGCA 8580

CAATAATACC TCTCGCACCC ATATCCAACA CTTTAATGAT ATCTCTATCT ATCACTGCAG 8640

TGACACGTAC AATTGGTATA ATATGCGCTG CTTCAGCTGC ACGAATTAAA TGCGCTAGTG 8700

TCTCATCATT AATCGCCACG TGTTCTGTAT CAATCACAAC AAAGTCATAC CCGCTTGCTG 8760

CGATAACCTC GATCATCAAT GGGTCCGGTA TAGAATTAAA AATGCCATAA ACTGAATCAC 8820

CATTGTTTAA TCTATGTTTC AGAGATAGTT GTTGCATCAT TGATACCTCC TACACCTAAT 8880

GGATTTGTAA CATGATGAAT TCTTAACTCG GAGTCACTTA ATAATCGACG TGTCGTTAAC 8940

TTTTCAACTT GAATCGTAGG TTCAAACAAA TCGAAATGTT GATAGTTATT CAACTCTGGA 9000

AATGCTTCTT GATACGCCTC GATGATGCCT TTAACCCATT GCCATTGCAG CTCCTCATCG 9060

ATACCATATT GCTTTTCAAT AAATAAGATG ATTTCGGCGA TATTAATAAA GAAAAATGCA 9120

TCATGTAAAA AGTCGCGTAC TAAACGTTCG TCATCTGTTT CAATAAATGA ATTACTATTC 9180

ACTT7TTTAT GTGCTTCTGG CATTGGCTTT AATGTCAGGT GTGAAGCAGC TTCACTTAAA 9240

TGCtCACGCT TAAAACGAAC ACCATCATGG AAATCTTTTA AGGCAATACG TGTAGGCCAA 9300

CCATTTTCAT GAATGAGCAT CATATTTTGT GCATGCGATT CAAAGGCAAT ACCGTGATAA 9360

TAAAGCATAT GAATCATTGG ACGAATCGCT ACAGCTAAAA ATTGCTTTGT CCAAGCTTCA 9420

GAACCATATT GTTTAATCCA ATTTTCAATG AATGGTACAC CATCCTTATC ACTTGCATAA 9480

AGTGCATTAA ATGGTATCGC ATCCTCTTCA TCGATTAACA TATGATATAT ATTTTCACGC 9540

CATATAACAC CTAACGCACC ATAAACTTGA GTTTGTTTAT AAGGCGAAAG TTGTGTATTT 9600

AAATAAGACT GTCCTAAGAC TTCCCCTAGA AAAACTGTCT TTAATTCATC TTTTAAATAC 9660

ATATCTTGTT GCTGTATCTG CTTTAACCAA TCCGTAATTT GCGCTGCATT TTCAATTGTA 9720
TATTTTGTCG TGTCTATTGG CGACATCGTA CGAATCGATT GTTGAGGGTG ATATAGCTCA 9840

TCACTTTCCC CTAACCATAG TACTGTGCCA TTAAGCCTTT CTTCAGCCAA ATCAACTTGG 9900

ATGACATGTT CAAACTGCCA TGGGTGTACA GGTATCATCT CAACATCATT TACATGTTTG 9960

CCAGATGCTT CAATTTGCTG TACAAAATGT TCATAAGTCT TATCGCCAAC TTGTTGACGT 10020

AACATTTCGT TAACTACAAC ATTTCTTGAT ACCGTCGTTT CTACTTTATC TTTGTCGATA 10080

GCTAACCACT GCAGTTTAAC GTTTGGTACA AAATCAGGAC CAAATTTCAA ATTATCACTC 10140

AACGTAAATC CTAAACGTGA TTTGTAACTT GGATGATACT GATGCCCTTC CATCGCATAA 10200

AATTCATAGT CGTTAAATGT CTCAGGTGTT GCTGGTGGGT TTGATTCTCG ATACTGCATA 10260

CTTTGCGTAT CTTTTAATTC TGTCTGTAAT AACTCGACAA TAAATTGTTC TAGCTTTTCA 10320

TCATTTTTAG GAAATGTAAA TACAACCTCT CTCAATAATT GTGTATAGTC TGTTGTTGTA 10380

TCTGCCTCAT CTCCTACGAC ACGCTCAATT GGTGATGTGA TACGTATACG ATCAAAGCTA 10440

TGTGTCTTTT CAGCAGTAAA AGGATACTCT GAATCATGTC CTTCTATTGT AAAATGACCG 10500

ACACCGTCTT GATATGACGC TTTATACACA ACAATATTCT CATAAATAAG TGATGATACC 10560

AGTTGGTGCA TCACTCTAGT CTTTACACGA TTAAGAATTG TTTGATTCAC AATACGATAC 10620

CTCCTTGTTA TGACAAATTG GATTTGGTAT ATGTGTATAA ATAGGGTTTG CACCACAATC 10680

ATTCAATTTA CTCATCAAAT TCGCTTTAGC CGcAATGGTC GGCGTTTGAT ATAAATCTTC 10740

TACACAGTCA ACAAATACTG CGTTATTCGC GTATTCTTTT TTCCAAGTCA TAAGACGATG 10800

CGCTACAAGT TGCCATAACA CAACTTCATT TCTAGTCGCT TTACCAATAG TTGATACTAA 10860

ATGTCCTAAG TGATTTACTA CAACGTAATA TTTAAGACGA TGCCATGCTT CATCATGTGC 10920

ATATACAACA GGGCTTGATG CTGCCACAAC ATTTGGCACA AGCTGTTTTT CAGTAGCAAT 10980

CGTTSTAGAT AGACAAATGC CTTCAAGATC TCTGACAAAG CATACGTCGG GTATGCCATC 11040

TTTTAATTCA ATTAATGTAT TTTGTACATG TGCTTCTAGA CTAATGCCTG TGTTACTAAA 11100

CAGCTTTAAT ATCGGCAATA ATGTACGATT CAAATAACAT TCAAGCCATG 11160

TAAACCACTT TGCTCAATCA CTTGTGATAA CTTAGACATC GGTGAATCAG GCATCGTTTC 11220

AAATAATGAC GCCAATACAT GAATATCTTT ATCAGCATGG TAATTCGGTA TCCCTTCACG 11280

AACAATCATG GCACTATTTG TTAATAAATC CATTTCAGGT TCAACTGTTT GCCCTAATGG 11340

ATTCGGTAAC AATGCACGAT ATCCTTCTTC AAACATCAAT TTAAAATGGG GTGTTTCAAC 11400

CTCATCTTTG ACTGATGCGA TAACTTGCGC GGCATCAATT GTCCGTTCAA TCTGTTCAAG 11460

GTCATTCGTA CGTATAAAAT TAGTGATTTT AACGTGTATC GGTAATTTTA AATAAATGTT 11520
GCCAAGGTCT TTTATTAAAC CTTGTTCACT ATATTGCATA TACTGTGGAT GCTGTCGCAA 11640

CACATTGATT TGATAAGGAT GTGTTGGTAA TAAAATAAAA TCTTTGGGTA TCTCTGATAT 11700 

ATCTATGTCT GCTAATTGAT ACAACACTTT CTCAACCTGA TCTTCTTTAC CTTCTACATA 11760 

GCGCGTGAGC AGAACATCTT GATGCACAGC TAAATAATGC AATTGGAATG ATGTATGACA 11820 

TTCGGGTGCA TATTTCTCTA AATCTGCTTC TGAAAACCCA CTTGCACTCT TAGGAGTCGG 11880 

ATGAAATGGA TGACCTAAGT ATAAAGATTG TTCTGAAACG ATATAACGAT CCTCTACGTA 11940 

GTCTATTGTG TTACTTTGCA AATAACGTGC CGTGCGATGA ATGCTATTAT CGATGTCAGA 12000 

CATAATTTGC GCCATATGTT GTTGCACTGC CGTTTGATTA TCTGCACTTT GAGCCATATG 12060

TTGCAAAATA CGCGCAATTG CTTCTTTATA AGTTGTTATT TTTTTACTTT TTCCATCGAT 12120 

AAGCCATACC TCTGGATGAT ACATATGATG CCCCATCGCA GACCAATAGC GAAATTCACC 12180 

CGTTAAAGTT TCGAGCTCTG ATAATTGTAT AGACCATTGA TGATTTTGAG GTGGTACTTG 12240

ATATAAATTT TCTTCTCTAA AATATTCATT TAAAATGCGT TCGATAGCCG CATACGCTGC 12300 

ATGTTGTATT AATTCTTTAT TTTGCACTTT TTTGTTTCAA CTCCCATAAT TTCATTAATG 12360 

TGTGATCGTT GATTTGATTA GTGATGGTTG AACAAATTAA AAATAAACTA CTTACTGCAA 12420 

ATACTACGCC CATAACGATA AACGTAGTAG CTGGTGTAGT ATAACTTGTA ATGGCAGCGC 12480 

CACTaAGACT GCCAATAATT TGACCAACAA CTAACATACT GTTCGTCGTT CCAACAAATG 12540

TGCCTTTAAG TTGTTGATGA CACGCATTCA CGACAACAAA CATGACACTT TGAATCAATG 12600 

CACTATATGT TAATCCTTGA AGTATTCTTG CAGCCATTAA AAACTCTATA TTCGTCGCTA 12660 

AACCTTGCAG TATCGCACTA CAACCACATG CAATCGTGGC AAATATATAT ACTGATTTAA 12720

CATATGATTT ATCATTAAAG CGTCCCCATA AAGGGGCGCT TAATATCGAA GCCGTCCAAA 12780 

ATGCGGACTG TAAAAATCCA ATCACACTAC GGTCATCTAT CGCTGTATGA TTCACTGATG 12840 

AAGCAAGTGG TGATAATGCA GTTAGCATGC CATACATAGC AAAGTTTGCT AAAACGCCAA 12900 

CGATAATAAA TCGACATGTT TGTTGTGTGC ATAATAGACA TTGAAATGAA CGGCGAATAC 12960 

CTTTATTAAT ATTTGGTGTT TGTGATTTTG GCATATGTGT CGTTTCAATC AATTTTAATG 13020 

CACCGAAAAT ACAGACAATA AAAGTAATAA CGGCAATACT CATCAGTAAC GCACTAAAAC 13080

CTAATATCGA AGCTGTAACA CCGCCAATTA ATGGCCCCAC AAGAGACCCT GCGCTGACTG 13140 

AACTTTGCAG TCTTCCTAAT ACCTTTCCAC GATCTTCAGC TGGCGCCTCT GCACTCGCAA 13200

ACGCACTTGA TGCATCAACA ACACCACCAA ATAGTCCCTG CAATAACCTC ACAAGTACAA 13260 

ACTGTAATGG TGTCGTACAC AATGCCATTA AAAATAAGCA TACCGCCAAA CCAAGTAACG 13320 
CtATCATCGT CGTTACAGCT GGAGCAGCAA TCGCTATACC ACTCCACAAC TGTATTTCTA 13440

CGACTGATAG ATTTTGTAGT GATGCCATAT AAATTGGCAA TAATGGCACA AGTACTGTCA 13500 

GTCCAGCAAT CGCTATAAAC TGACTGAGCC ATAAAATGCG AAAGTTACTG CGCCATATAG 13560 

ACTGATTAAT CATATGTCAC CATTGGATTT GGTACGGTAG TTAAACCTGA AGGCATACTA 13620 

CCTCCACCAC TATCACGTTG ATATAGCAAT GGTAATAAAA TTTGTTTGAA TGGCCACGTC 13680

TGTTTATCAA ATAAAATGTG TCTGACAGCT AGCTGATCAG TTGTAACCCA GGAAATAGTT 13740

GCCACTTCAT TTTTTAAAAT TTGTTTTAAC AACGACATAA GTTCATGCTC ACTTACACCA 13800 

AATAAATCTT GAATTGCATC AATAATGGCA TATAGATTTA CCGATACAGC TAATGTTTGA 13860

AAATAAGCAA AGAATGTTTC CAAATCCTCA TTAATTAGCG TATTAGGTGT ATCTTCTCTG 13920 

ACGACATACT TCGGCAATGA AAGCTGATGT GCTGTTAGCC ATGGTTTATA AATTCTGACA 13980 

GTATCATGAT CACGTAACAC GCATTTTTGT ACACGTCCAT CTTCAAATGA CAACAATATA 14040 

TTTTGACCAT GCAACTCTGG TAATGCGCCG TATTGCATAA ATGATAGTGT TACCTTTAAA 14100

AAGACTTGCG CGATATCTTC AAATAACGTC ATGACATCAT TTTTAGAAAT ATTATCTTTT 14160 

CCACAAATCA TTTGATATAA AGTGCGATCA TTTGCCGCGA GTGCTGCCAT TGACACTAGC 14220 

TGTTGCGTAT CATTTTTGGC TAGCACTTCG GGATACTTTC TTAGCTGAAC AGTTAGATGA 14280 

CCTAATTGAT CTTTGAAAAT ATCATTATCT TGACCCATAT ATGACCACCA AGCTGTTTCA 14340

TCACAAACCA TGACATACTT AGCTAGTGCT TCATCTTTTT CTATAAGCTG ACGTAATAAT 14400 

TGTTCTGCTT GTTCTCCGTT TTTCATGTAA CGCGTAGGCG TTAGCCTTAA TGCGCCTAAT 14460 

GACTGCATTG CAAATGGTAC TTTGACATGG TTATACGGTG CGCCAATATC AATTAATGAA 14520

CGCATACTTG AAGACGACAG ATAATCTCCA AATTTTAACG GTAATAGTAC AACCAACTTT 14580 

TCACTAATCT CTTTCGCAAA GACGTTCGGC AGAATATGCT GATATTGCCA AGGATGTACC 14640 

GGAAATAGTA CATAGTCATC TATTGATAAC CCTTGATCAT TTAACATGTC TGTCGCTTGT 14700 

TCTTTTATAG GTACTGTCAA ATTTTCTAAT TCATCGATAT TTGCAGTATC GCCATGAATC 14760

ATATGTGTCT TTTTAACTGC TGCAACCATT AAAGGAAATG ATTGATTTAA TTCAGCTTGA 14820

TACACTTGAT AATCCGCTTC TCTTAATCCT CTTTTTTCTT TAGCTAATGG ATGAAATGGA 14880 

CGATCTTTTA AACTTGCAAA CTGCTCTGAC ATCACAAAAG GATGTGACGC TAAATCTAAT 14940

TCTGATAATT GTTTAGCAAG CTGTGTGGCA GCAGTAGTCA GTCCTTCTTC AACGCGAGCC 15000

ACTTCCCATT CATGACTTAG ATCACAATTC ATATTAGCAA TTGTTTGCCA AAATTCAGCT 15060 

GCCGTTAAAG GTTGCTTAGA CACCCTTCCC TCTATCGTAA TTGGTTGTGA ACTTTCGTAA 15120 
TATATCAAAA GCGTTTGTCC GTTTTCTTTA GTAATCTCAC TATTCGATAC AATTCCGGCT 15240

ATATCTTCAA ATAATAATGC ATCAACTAAA TCTCTTAATA TTATCGCTTG TGCTGTATTG 15300

ACTGCTGTAT GATTCTGCAA TGTTCAGACA CCTCGCATTC TTAATATAGG TTCAATGTTG 15360 

TCCCAATATT TTGTTGTTGT GCCTGTTGAT AAATAAAATA AGCACTTGAA ATATCTTCGA 15420 

TAGCCATACC CATCGGATTA AGTAATATGA TCTCATCATC GTCTTCACGT CCTGGTATGT 15480 

CACCTGTCAC AAGTTGTCCT AGTTCAGCAT GAAGAGCTTC TTTGCTGAAT TTACCTTCTA 15540 

ACACCAATTG GTTAATAGTT TTCTTTTCTC GATTACATTG TGACCAGTCA TCTACTACGA 15600 

CTTTGTCAGC TTTAATAAAG ACTTCTTTAT GCACATCCAT GATAGAAATG TTGCTAATAA 15660

ATGCACCCTT TTGTAACCAA TCATATTCAA TGTATGGTTG ATCCGTTACG GTACATGTAA 15720 

TGACTACTTC ACCATTTGAT ACTGCTTCTT TAGCATTTTC TGTCGCAATA AAATTAATTT 15780 

CCGGACGCTG TTGTTGCCAT CTATCAACAA AGCGTGCACA TGCTTCAGAG AATTGATCGT 15840 

AAACAAACAC GCGTTCAATA TGATCGAATT GCTCTAACAT ACTTTGTAAT TGCTTGTCTC 15900 

CGATTAGCCC GCATCCAATG ATTGTTAAGT CTTTAAATCC TTTTTTAGCC AAATGCTTTG 15960 

CTGCAATCAC TGAAACTGCT GCAGTACGCA TACTACTAAT TAAACTTGCT TCCATAACTG 16020 

CAATTGGATA ATTCGTTTCT GGATCATTCA AAATAATGAC GCCACTTGCA CGCTCCATAT 16080

TACGTTTCGA TGGATTGTCG TGCTTACTAC CTATCCACTT AATACCTGAA ATTGCGTGTT 16140

CACCACCGAT ATGACTTGGC ATTGCAATAA TTCGATCTGC GATGTGTCCA TTTTCAGGAT 16200 

CCtGTCTTAA ATACGGCTTA AGCGGTTGTA CAAAATCATT GTGCGCATGG GCTGTTAATG 16260 

CTTCTGTTAA TGCGTCCACA TAAACTTGTG AATGATTACC TCCCGCTTGT TCAATATCTG 16320

ATCTATTTAA ATACAACATC TCTCTatTCa TTCTGaTTTA ACTCCTTGTC TTGATTTCAT 16380 

TTTTTCTAAC CATGTATCTG AATAAACTAA ATCTAAGTAA CGATCGCCTC GATCTGGTAA 16440

AATCGTGACA ATTGTTGCAC CTTCTTCAAT TGACGTTATC AACTGCTCAA TCGCTGCAAT 16500 

AATCGAACCT GTTGAAcCTC CGGCAAATAT GCCTTCATAA TCAATCAGTT TTCGACAGCC 16560 

CAAAGCAGAT TGATAATCAT CTACATGGAT CACTTGATTA ATTTCTGATC TATTCAATAT 16620 

TTCGGGTACA CGACTAGCAC CGATACCAGG TAATTCTCTA TTAATAGGTT TGTCACCAAA 16680

AATGACTGAC CCTTTCGCAT CAACAGCAAC AATTTGTGCG TTTGGATGCA CTTCTTTTAT 16740 

TTTTCTACTC ATACCCATAA TGCTACCTGT CGTGCTGACT GGCGCGACAA AATAATCTAT 16800

AGGTTGCTTA ATTGTTTCAA CAATCTCTGT GCCTGCACCA TGATAATGGG ATTGCCAATT 16860

TAACTCATTC GCATATTGAT TAATCCAATA TGCATCX5TCA ATAGTGGCTA ACAGTTCTTG 16920
TACATTGGCA CCATAACTTT TAATAATTTT CAAATTTGTT GGTGATATTT TAGGATCAAC 17040

AACACACGTG AGTTTTAATC CCTTGATTTT AGCTATCATT GCCAACGCAA TGCCTAAATT 17100 

ACCAGAAGTA CTTTCAATTA AATGTGTATT CTCAGTGATT AAACCATGTT TAATACCATG 17160 

TTCAATGATG TACTTGGCAG GTCGATCTTT CATGCTGCCT CCAGGATTCA TATACTCTAA 17220

CTTTGCAAAC ACTTCATGTT TCGGAAATAG TTGATGAAGT TGAACCATAG GTGTTTGCCC 17280

TACAGAATCT AAGAATGAAT CGTGCACATG 17310 
(2) INFORMATION FOR SEQ ID NO: 24: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5423 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 24: 



ATACTAGTAA GCGCATCGGT TATTGACATC GAATTCAACT TTAACAGTTT TCATGTTCGG 60

TGATGTTTCa ATAGAATGTG TGTGTTGTAC TTGCGCATTT ATATTTCCAC CTAAATTACT 120

TAAGTTTCCT GTAATACTAG AAATGTCAGG TGCGTTTAAT GTAGGTTGAA ATGCATCAAC 180

TACTTTATCT GCAACATTAG AAACATTACG GATAACTTTA CTTGAATGAT TATCTATACC 240

TTTAACGAAA CCTAACATTG AATACATACC AACATCCATG AATTCACGTG AAGGTGAGTG 300

AATACCTAGC GCTCTTTTGG CTGCATTTAA AGCACCTTTT GCTACACTAG CTGCTTTTTC 360

AGCTAAGTCT CTAGCCATAT TACCAATACC TCTCATCAAA CCACGGATCA TATCAGCACC 420

TGCTGATACA AAGTCATCCA CAAAGCTTTT AACTTTATTT ACTGCATTTG TCATACCTTG 480

ACTAACTTTG TTTACAACAT TAACGAATCC TTGAACAACT CTATTAACAA rGTTAATTAG 540

CGTACtTGTt ATAGTAGATA CCCaTnGCAT ACCTTTAGTG ACmATGAAGT TCCAAGCTTG 600

AGACATTTTG TCTGATATAG TTGAAACAAC TTGTGTGAAT ATGCTTACAA CTTTATTCCA 660

AATTGTCGTT AATATACCAG ATAAGAAACT CCAAATCGTA TTCCATATAT TAGAAATAAA 720

ACTCCATGCC GCTTGTAACG CAGTAGATAT AGCTGTAGTG ATAGCGTTCC AAACCTTAGT 780

TGCCACAGTA ACTATAGTGT TCCACAACGT TTGTAAGAAC GTCCAAATAG CGTTCCAAAT 840

TGTTATTGCG ATAGTCATAA TTGTGGTAAA CACTGTAGTT ATTACAGTGA CTAACAAATT 900

CCAAATCGTA GTAGCGATTG TAATTATCGT ATTCCAGATT GTACTTAAGA ACGTCCAAAT 960

AGCTGTCCAT ATCGTCATAA CTATTGTCAT TATCGTCGTG AAAACAGTTG TAATGATTGT 1020



ATAAGCGACT ATTTGATTCC AAACAATCAT TATAAAATTG TAAACATTCG ATACTGCTGT 1140 

AGTGATAGCT GTTAAAATAG CATTCCATAC AACCGAAGCT ACAGCTTTTA ATACATTCCA 1200 

AACATTAACC ATAAACGTTT TTATCGCATT CCAAGCATTT ATAATAAAGT TTCTGAATCC 1260 

TTCATTTTTA TTCCACAATA AAACGAATAT AGCTATTAAT GCAGCAATTA CACCAATTAC 1320 

TATTGTTATT GGACCGCCTA AAATACCAAA CACAGTTACT AGTCCTGTGA TAGCATTTCT 1380 

AATTAATCCA ATCTTACCGA ATAACAATTG GAATATAACT GATATAATTT TTAATGGTCC 1440 

TTTTAATAAC ATGAACGCAC CTTTTAAAAT TGTTAATCCC GCTCTTAATA AACCGAACTT 1500 

ACTTACTAAT GCAATGrTTC TACCTATTAA TCCGCCACCC ATAAAGTTAG ATACAGCAAG 1560

AATAATCGGT ATTAAAAATC TAAATGCACC AACTAAAGTT ATAATGACAC CAACTAATTG 1620 

TGCTGTAGCT GGATGCGCCT CAAACAAGTT AGCTATCCAA CCAGTTATTG CAACTGCAAC 1680 

GCGTAATACT GCACTAGCTA TAGGAGCCAT TGCTGTTGCG AATGCArmTA ATCCTCTTGC 1740

GATGTTTCCA ATCAATTGCA TTATTAGTGG TCCATTTGTT TGTATATAAC TGACAAAGTC 1800 

TTTAAACCCT TGAGATTGTC CTACTTGTTC AGACCATTCC CTAAACTTAG CTGTCATTTG 1860 

TTCAAGAGAT TGGAATATGC CAGTTGATGA TCCGCTGAAT GCATTCATCA AATTGTTAAT 1920 

TCCAACGAAA ACATTTTTGA AAATATTACC AATGATAGGT AAGTTTGTTT TTGTGTATTC 1980 

AATAAAACGA GTTATCGAAT TTTCTCCAGC TGCACTATTA GCCCAGTTAG AGAAAGATTG 2040

ACCTAATCTA TCCAACCAAT CAGCCGACCA TTGAAACAGT GGTGCTAATT GCGTGAATAC 2100 
ATTGACTAAT CCGTCACCAA AACCACCTGC AGCACTTAAT AGCTTGTTAA ATACCGAAAC 2160

ACCCGTTGTA TTCATCATAT TAAAGAATCT TGAAGCTACA CTGCTATTTT CAGCCCATTT 2220

AAGCACGCTT TGAGAGGCTT CTTCCATTCC TCTTGAAATA CCACTAAAAA ACGGTTGTAA 2280 

GCTCTGCATT GCAGTTTTAA CAGTATTTAA ACCATTTGCA AGAGTTGTGA AGATAGCGGA 2340 

TTGATTTTGC TTTATAATAT CAGTCCATGC TGACTTTACG CCATCTAACG CTTTTTTGTA 2400

TTCGTTTGTT GCTGAGCTAG CTTGTAAAGT GCCATCATTA AGCATCTTTA TAGCGCTGAT 2460 

AGCCATTGCG CCAAACGCTA CAAATCCTGC TCCCGCTATT GCTACGGCAC CACCTAAAGC 2520 

AAGTACACCA CCAGTTAACA CTTTGATAGC GTTTAATAGC GCAAATACTA CAGGTACTAC 2580 

GCTCGCTATT ACAGGTATTA AGATACTAAA AGATGATGTA AGTAATCCAC CAACCATATT 2640

AGAACCTACA GTACCGAACA CACGGAACAT ATTAGCTAAA TTCCCCATCT GTCTTTGAAA 2700

ATTGTCATTT GCTTTTATTA TGTAGGCATA AGCTTTCTTT AAACCATTAG TATCGACATC 2760 

TACCTTTGTT GTTTTTTTGT TCGGCAATGC GTCTAATGAT TTTTTAAACG CATAAATAGT 2820 
AAGTTCTTCT TTAGTACGTT TGATTTTAGA GTTAGCAACA CCATTGTGCA CGTCTATAAT 2940

AGCTTTGGCT TTAGACCTAT TTAATGCTTC GAGACTAGCT TTAGATACTT TTAACACTCG 3000 
ATTGAATTTA CTGTTATCTG CATTGACGTC AATATTGACA CGTTTCTTTT CTAATTCTGA 3060

TAATTTAGCT TCTGTTTCAG CGATATCTTT AATCAACTTT TGTTTTTGCA ACTTAACTTC 3120 

TGGTGTAACT TCTTTAGAGT TTAGTTTGTC TAGTTCAAAA TTCGATTCTA GTACCTTTTG 3180 

TTGTAAATCT TGTATACTAG CATCTAATTT AGCTTTTACA TTTTTGTTAC TAAAGGCATC 3240 

TAAAGACTTT TTAGCAACTT TGATAGTTTT TTGTAAATTT TTATCGTTAG CGTTTAATTC 3300 

AACATCTTTA GTTTGATCTG CTACTCGTTT AAATCTTTGC ACAGACTTAA CCGCACTATC 3360

AATTTGCCTT TTGAATTTGG CTACACTAGC TTCAATAGTC GCTTTAATTT TATATTCCGT 3420 

CACATTAACA CCTCTCTTTC TATTGCTTAT TAAATTCTGC TATAACTTTA AAGAATTCAT 3480 

TATTTTGTGG TTCGTATTCA TCACGTTCGC TACTAAATCT TATATCTTTA CCTTCGTTAA 3540

GCCGTTGGAT ATTTTCTTCA TAAGGCAATA CGTCGTTTGC ATTGTTAAAA ACATATTCCT 3600 

CTTTAGGTTT ATTTTCTGTC CCAACATTTT TAGTAGCTGC AGCATCACGA ATAGCAAACG 3660 

CAAGTTTGTA ACGTTCGAAT TCTTGGGTTA GCATTTCATA CTCTTTCGCA TACATTCGAT 3720

AGTTATATTC TGTTAATGTC ATTTGCTCAA TAACGTTCAA ATCTGTAATA CCAAGTGTTG 3780 

ACATACAAGT TATAACGATT CTGTCGTAAG TTATTAGGcT TCCGCTGGTT TTTCTTCCGT 3840

TTCCACTACT TCGACTAGGT TTCGGGTCAT AGGTCGCTTT CCCAAcTCCG TTAAAATATC 3900

CGAACCGAAT TCTTCTAGTC CGATATTTTC TGCGATTTCA TCTAATGCTT CATCAATGTT 3960 

ATTAATAGTA ATTGCTTGTT TTTTTAAGTG AGATGTAGCT GCGATTAAAA CTTCGCCAAT 4020 

CACAACCGGA TTTCCACTTT CTAAACCTAC AGGCAACATT GATACACCTT GACCGATAGA 4080 

AGCTTGTTCA ACTTTTAAAC CTAATCGGTT ATCGATTTCT CTTAAAAATT TAAAACCAAA 4140

ACTTAATTCT AATGACTTTC CGTTAATTTC TACATTCATA ACTTAAAATC TCCATTCATA 4200 

ATTAATTTAA ACAAAATAAA mArGCTTAAC GCCCTATTTT TATACCTCTC TTGGTGCAAC 4260

CGGTGGTGAA TCTACTTTAG GTTGTGGAAT TGCTGTTAAA TCTTCGCCAG TTAATGCATC 4320 

TGCTTTTGTA GTGTCGTGGA ATCTGTATcC AGTCGCCTTA AGTTTCTTTG TTACAGCCTC 4380

AGGTAGTGTT GCAAATCCAC GTTGGAAACG ACCATTCACT CCATATTCAT ATTCATATTC 4440

ATCAATACCG TTAGCTTCTG CTTTTAATTC AAATTTATTG TGGAAACCTT GGAAATATTT 4500

CGCTTTAAAT TTAGCGGAAT CCCCATTTTT GCCTGGTATT CTACTTTCAA CTTCCCAAGC 4560 

TTCATACAAT ACGCGATCTA CAACTGCATC TTCAATTTCA TCTGCAAAAT CGTCACCATA 4620 
GTCCATTGTA TCCTCTGTAT CTGTATCAGC TTCATGTGAT AAGCCGTATT CAGTTAAAAA 4740
AAGCATTTTA GTAGCATCTA CTTTTTCGCC AGCTTTTCTA AATAAAATAA TACGATCATT 4800 

ACTATTTTTC ATATTTGCCA TTCAATATTC CTCCGTTTTT TAAAATGTTT TGTAAGATAT 4860 

CGTTACTGAT GTGTGTAGCA ATTCTTGATT GGTAGTATCA TCAACTAACT GTGTGATGTT 4920 

AGTATCTTCT TCTTCAAAGT CATAATCGTT TGTTTTAACG CTAGGTGTTA AATCATCAAT 4980 

ACATCTTTTA ACAAGTCCGT CATGATGTCC TAAATCATCG CTTACACTCC AAATATCAAT 5040

AACTAAATTC GTATCGCCAG AATAACTATC AAACGTGTAC TTACTTCTAT TTGACTCCGG 5100 

CATTTTTATT ACAAAAAAAG GATACGGAAT CTCTTGTTGC ATCTCTTTAC GAGAAATAAC 5160

AGGGAATCCA TATCCTTGTA GCGTTTCATA CGCTTTATTA TAAAGTTGTA AGTTCGGTGT 5220 

CATGCTTTTA TCTCCTATTC AAACAACGCT TTCAATTCTT CTACAGTTGA TTTCCTAATC 5280 

ACTTCGTATA CCGGCCACAT AAAAGGTTCA GCCTCCATGT ATCGAGTACC AAATTCTAAG 5340 

AAACCACTAT AAGCTGCGTG CGATGTGATA GTGTATTGCA AATCGCCAGT TTTTTTATAT 5400 

CTGATATTGC GTGATaAATT ACC 5423 

(2) INFORMATION FOR SEQ ID NO: 25: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 25:

AAACGCAGAT GTTCAATTAG AACCAGTCTA TCGTATTAAG GAAGGTATTA AACAAAAGCA 60 

AATACGAGAC CAAATTAGAC AAGCGTTAAA TGATGTGACA ATTCATGAAT GGTTAACTGA 120 

TGAACTAAGA GAAAAATATA AATTAGAGAC CTTGGACTTT ACTTTGAACA CATTACATCA 180 

TCCTAAAAGT AAAGAGGATT TATTACGTGC TCGTAGAACC TATGCATTTA CTGAACTGTT 240 

TTTATTCGAA TTACGTATGC AATGGCTAAA TAGATTAGAA AAGTCATCTG ACGAAGCAAT 300 

TGAAATTGAT TATGACATAG ACCAAGTTAA ATCATTTATT GATCGTTTAC CTTTTGAACT 360 

AACTGAAGCA CAGAAATCCA GTGTTAATGA AATTTTTAGA GATTTAAAAG CACCAATACG 420 

TATGCATCGA TTACTTCAAG GTGATGTAGG TTCAGGAAAA ACAGTAGTTG CTGCAATTTG 480

TATGTATGCG TTAAAAACTG CTGGTTATCA ATCAGCATTG ATGGTACCAA CTGAAATTTT 540 

AGCAGAGCAA CATGCTGAAA GTTTAATGGC TTTATTTGGA GATTCTATGA ACGTTGCATT 600 
TACGATTGAT TGTTTAATTG GAACCCATGC TTTGATTCAA GATGATGTGA TTTTCCATAA 720
TGTTGGTTTA GTAATTACAG ATGAACAACA TCGATTTGGT GTGAATCAAC GCCAGCTTTT 780 

AAGAGAAAAA GGTGCAATGA CGAATGTGTT ATTTATGACA GCAACGCCGA TACCAAGAAC 840

ACTAGCAATA TCAGTTTTTG GTGAGATGGA TGTGTCTTCA ATTAAACAAT TACCAAAAGG 900
TCGTAAACCT ATCATTACTA CTTGGGCAAA GCATGAGCAA TACGATAAAG TTTTGATGCA 960 

AATGACCTCA GAGTTGAAAA AAGGTCGTCA AGCATATGTC ATTTGCCCGC TAATAGAAAG 1020 

TTCTGAGCAT CTCGAAGATG TTCAAAATGT TGTCGCATTG TACGAGTCTT TACAACAGTA 1080 

TTATGGTGTT TCCCGTGTAG GGTTATTGCA TGGTAAGTTA TCTGCCGATG AAAAAGATGA 1140

GGTCATGCAA AAGTTTAGTA ATCATGAGAT AAATGTTTTA GTTTCTACTA CTGTTGTTGA 1200 

AGTAGGTGTT AATGTACCGA ATGCAACTTT TATGATGATT TATGATGCGG ATCGCTTTGG 1260 

ATTATCAACT TTACATCAGT TACGCGGTCG TGTAGGTAGA AGTGACCAGC AAAGTTACTG 1320

TGTTTTAATT GCATCCCCTA AAACAGAAAC AGGAATTGAA AGAATGACAA TTATGACACA 1380

AACAACGGAT GGATTTGAAT TGAGTGAACG AGACTTAGAA ATGCGTGGTC CTGGAGATTT 1440

CTTTGGTGTT AAACAAAGTG GaTTGCCAGA TTTCTTAGTT GCCAATTTAG TTGAAGATTA 1500 

TCGTATGTTA GAAGTTGCTC GTGATGAAGC AGCTGAACTT ATTCAATCTG GCGTATTCTT 1560 

TGAAAATACG TATCAACATT TACGTCATTT TGTTGAAGAA AATTTATTAC ATCGTAGTTT 1620

TGACTAATTG CCATGCTGAT TTGTCAATTT GAGTGCAACa CTTCGTTAAT TGAGTGATAT 1680

GACACTTGAA CTATTTAAAT GTAAAGTGGT ATTTTAACAA TTTATAAATT TTCGACTAAA 1740

TAATAGCTAA ATATTACAGT TATTTGTTGA GTCGGTTAAA TAGAAAGTGT TATGATATGT 1800

GAGGAATGTT TAAGACTAGG TACTAAAAAA TGAGGGGTGA GACGTTGAAA CTAAAGAAAG 1860 

ATAAACGTAG AGAAGCAATC AGACAACAAA TTGATAGCAA TCCCTTCATC ACAGACCATG 1920

AACTAAGCGA CTTATTTCAA GTGAGTATAC AAACAATTCG TTtAGaTCGC ACTTATTTAA 1980

ACATACCAGA ATTAAGGAAG CGTATTAAAT TAGTTGCTGA AAAGAATTAT GACCAAATAA 2040 

GTTCTATTGA AGAACAAGAA TTTATTGGTG ATTTGATTCA AGTCAATCCa AATGTTAAAG 2100 

CGCAATCAAT TTTAGATATT ACATCGGATT CTGTTTTTCA TAAAACTGGA ATTGCGCGTG 2160 

GTCATGTGCT GTTTGCTCAG GCAAATTCGT TATGTGTTGC GCTAATTAAG CAACCAACAG 2220 

TTTTAACTCA TGAGAGTAGC ATTCAATTTA TTGAAAAAGT AAAATTAAAT GATACGGTAA 2280

GAGCAGAAGC ACGAGTTGTA AATCAAACTG CAAAACATTA TTACGTCGAA GTAAAGTCAT 2340 

ATGTTAAACA TACATTAGTT TTCAAAGGAA ATTTTAAAAT GTTTTATGAT AAGCGAGGAT 2400

TTAGAAGCCG TACAAAAGGC TGTTGAAGAC TTTAAAGATC TAGAAATTAT ACTTTTCGGT 2520 

GACGAAAAAA AGTATAATCT GAACCATGAA CGAATCGAAT TTAGACATTG TTCTGAAAAG 2580 

ATTGAAATGG AAGATGAGCC TGTTAGAGCG ATTAAACGTA AAAAAGATAG CTCAATGGTA 2640 

AAAATGGCTG AAGCTGTGAA ATCTGGTGAA GCAGATGGAT GTGTGTCAGC AGGTAATACT 2700 

GGTGCTTTAA TGTCAGCTGG TTTATTCATT GTTGGACGTA TTAAAGGTGT AGCTAGACCG 2760 

GCTTTAGTAG TAACATTGCC AACGATTGAT GGAAAAGGTT TTGTCTTTTT AGACGTTGGT 2820

GCAAATGCTG ATGCTAAACC TGAACACTTA TTACAGTATG CGCAACTAGG GGATATTTAT 2880 

GCTCAAAAAA TTAGAGGTAT TGATAATCCG AAAATCTCAT TATTAAATAT AGGAACCGAG 2940

CCAGCTAAAG GTAATAGTTT AACGAAAAAA TCATATGAGT TATTAAATCA TGATCATTCA 3000 

TTGAATTTTG TTGGGAATAT TGAAGCGAAG ACATTAATGG ATGGCGATAC AGATGTTGTA 3060 

GTTACCGATG GCTATACTGG GAACATGGTC CTTAAAAATT TAGAAGGTAC TGCAAAATCA 3120 

ATCGGTAAAA TGTTAAAAGA TACGATTATG AGTAGTACTA AAAATAAATT AGCAGGTGCA 3180 

ATATTGAAGA AAGATTTAGC TGAATTCGCT AAAAAGATGG ATTACTCAGA ATACGGTGGT 3240 

TCCGTATTAT TAGGATTGGA AGGTACTGTA GTTAAAGCAC ACGGTAGTTC AAATGCTAAA 3300 

GCTTTTTATT CTGCAATTAG ACAAGCGAAA ATCGCAGGAG AACAAAATAT TGTACAAACA 3360 

ATGAAAGAGA CTGTAGGTGA AtCAAATGaG TaAAACAGCA ATTATTTTTC CGGGACAAGG 3420 

TGCCCAAAAA GTTGGTATGG CGCAAGATTT GTTTAACAAC AATGATCAAG CAACTGAAAT 3480

TTTAACTTCA GCAGCGAACA CATTAGACTT TGATATTTTA GAGACAATGT TTACTGATGA 3540

AGAAGGTAAA TTGGGTGAAA CTGAAAACAC AGAACCAGCT TTaTTGaCGC aTAGTTCGGC 3600

ATTATTAGCA GCGCTAAAAA ATTTGAATCC TGATTTTACT ATGGGGCATA GTTTAGGTGA 3660 

ATATTCAAGT TTAGTTGCAG CTGACGTATT ATCATTTGAA GATGCAGTTA AAATTGTTAG 3720 

AAAACGTGGT CAATTAATGG CGCAAGCATT TCCTACTGGT GTAGGAAGCA TGGCTGCAGT 3780 

ATTGGGATTA GATTTTGATA AAGTCGATGA AATTTGTAAG TCATTATCAT CTGATGACAA 3840 

AATAATTGAA CCAGCAAACA TTAATTGCCC AGGTCAAATT GTTGTTTCAG GTCACAAAGC 3900 

TTTAATTGAT GAGCTAGTAG AAAAAGGTAA ATCATTAGGT GCAAAACGTG TCATGCCTTT 3960 

AGCAGTATCT GGACCATTCC ATTCATCGCT AATGAAAGTG ATTGAAGAAG ATTTTTCAAG 4020

TTACATTAAT CAATTTGAAT GGCGTGATGC TAAGTTTCCT GTAGTTCAAA ATGTAAATGC 4080

GCAAGGTGAA ACTGACAAAG AAGTAATTAA ATCTAATATG GTCAAGCAAT TATATTCACC 4140 

AGTACAATTC ATTAACTCAA CAGAATGGCT AATAGACCAA GGTGTTGATC ATTTTATTGA 4200
AACATCAATT CAAACTTTAG AAGATGTGAA AGGATGGAAT GAAAATGACT AAGAGTGCTT 4320

TAGTAACAGG TGCATCAAGA GGAATTGGAC GTAGTATTGC GTTACAATTA GCAGAAGAAG 4380 

GATATAATGT AGCAGTAAAC TATGCAGGCA GCAAAGAGAA AGCTGAAGcA GTAGTCGAAG 4440 

AAATCAAAGC TAAAGGTGTT GACAGTTTTG CGATTCAAGC AAATGTTGCC GATGCTGATG 4500

AAGTTAAAGC AATGATTAAA GAAGTAGTTA GCCAATTTGG TTCTTTAGAT GTTTTAGTAA 4560 

ATAATGCAGG TATTACTCGC GATAATTTAT TAATGCGTAT GAAAGAACAA GAGTGGGATG 4620

ATGTTATTGA CACAAACTTA AAAGGTGTAT TTAACTGTAT CCAAAAAGCA ACACCACAAA 4680 

TGTTAAGACA ACGTAGTGGT GCTATCATCA ATTTATCAAG TGTTGTTGGA GCAGTAGGTA 4740 

ATCCGGGACA AGCAAACTAT GTTGCAACAA AAGCAGGTGT TATTGGTTTA ACTAAATCTG 4800 

CGGCGCGTGA ATTAGCATCT CGTGGTATCA CTGTAAATGC AGTTGCACCT GGTTTTATTG 4860

TTTCTGATAT GACAGATGCT TTAAGTGATG AGCTTAAAGA ACAAATGTTG ACTCAAATTC 4920

CGTTAGCACG TTTTGGTCAA GACACAGATA TTGCTAATAC AGTAGCGTTC TTAGCATCAG 4980 

ACAAAGCAAA ATATATTACA GGTCAAACAA TCCATGTAAA TGGTGGAATG TACATGTAAT 5040 

ATATTTGAGC TAAAGCTCAT TGACGCAGTG GTTGACTGGT CATCCAATGG AGAATTGTCT 5100 

GACCTAGTCA ACTTTGCGGG GGAAATTCTA AGCAACCTAG ATAAGGTTCC AGAATTTCTC 5160 

CCTAAGAAAC ACTAATCAAT aAATTGwTAA GTGTTTCTAA AATTTCTACT TGTTTTTTAG 5220 

AATTTAAAAT GGGAAAATAT AGTAGTCTAT GTATAGGCAT TTTTAAAGGA GGTGAATCGA 5280 

CGTGGAAAAT TTCGATAAAG TAAAAGATAT CATCGTTGAC CgTTTAGGTG TAGACGCTGA 5340

TAAAGTAACT GAAGATGCAT CTTTCAAAGA TGATTTAGGC GCTGACTCAC TTGATATCGC 5400

TGAATTAGTA ATGGAATTAG AAGACGAGTT TGGTACTGAA ATTCCTGATG AAGAnGCTGA 5460 

AAAAffTCAAC ACTGTTGGTG ATGCTGTTAA ATTTATTAAC AGTCTTGAAA AATAATAAAT 5520 

CTTACATCTG GGTCGTCAGT ATTGTCGACT CAGTTTTTTT CTTTAATTAT CAATAGTTTT 5580

AACGTAAAAT TAAAGATGAT TCAAGAGCAA CACATAAAGG AGATAAAATA ATGTCTAAAC 5640

AAAAGAAAAG TGAGATAGTT AATCGTTTTA GAAAGCGCTT TGATACTAAA ATGACAGAGT 5700 

TAGGCTTTAC TTATCAAAAT ATTGATTTAT ACCAACAAGC ATTTTCGCAT TCGAGTTTTA 5760 

TTAATGATTT TAATATGAAT CGTTTAGACC ATAATGAGCG TTTAGAGTTT TTGGGTGATG 5820 

CGGTATTAGA ATTGACGGTT TCACGATATT TATTTGATAa ACATCCCAAC TTGCCAGAAG 5880

GGAATTTAAC AAAAATGCGT GCCaCTATTG TATGTGAGCC CtCACTkGTA ATATTTGCGA 5940

ATAAAATTGG ATTGAACGAA ATGATTTTAC TTGGTAAAGG TGAAGAGAAA ACAGGGGGAC 6000 
ATCAAGGACT AGATATAGTT TGGAAATTTG CTGAGAAAGT CATTTTCCCA CATGTAGAAC 6120

AAAATGAGTT ATTAGGCGTG GTAGATTTTA AAACACAATT CCAAGAATAT GTGCACCAGC 6180

AAAATAAAGG TGATGTAACC TATAATTTAA TAAAAGAAGA GGGACCGGCA CATCATCGTC 6240

TATTCACTTC A 6251 
(2) INFORMATION FOR SEQ ID NO: 26: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4920 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 26: 

ACCTACTGAA GTTGCTAATT TTTTGGAGCA ACTAAGCACT GAAATTGAAC GTCTTAAAGA 60 

AGATAAAAAA CAACTTGAAA AAGTAATCGA AGAGAGaGAT ACTAATATTA AGTCTTATCA 120 

AGACGTGgCA TCAATCTGTA AGTGaTGCTT TGATACAAGC TCAAAAAGCT GGTGAAGAAA 180 

CTAAGCAAGC TGCAGAGAAA CAAGCTGAAG CGATTATAGC TAAGGCAGAA GCGCAAgcTA 240

ATcAAATGGT TGGTGACGCG GTAGAAAAAG CACGCCGTTT AGCATTCCAG ACTGAAGATA 300 

TGAAACGTCA ATCAAAAGTA TTTAGATCGC GTTTCCGTAT GTTAGTTGAA GCGCAATTAG 360

ACTTATTAAA AAACGAAGAT TGGGATTACT TGTTGAATTA TGATTTAGAC GCTGAACAAG 420 

TGACGCTTGA AAATATTCAT CATTTGCATG AAAATGATTT AAAGCCAGAT GAAGTTGCAG 480

CAAATGCACA AAATAATGCA TCAAATACAC CAGACAATAA TCAACAATCC AATGATTCAG 540

AAACAACTAA GAAGTAAGAA TTAAATAAAG ACAGACGCGT AATATACATT TAACTTTTCA 600 

CAGCGAATTA GGTAATGGTG AGAGCCTAGT AAAAGCATGT ATGTTATATC ACTGGCTTTT 660 

TAATATTTAA ATAATGTAAT GAGAGAACTC TAAGTTGAGT TAATAAGGGT GGTACCGCGA 720 

GCAATCGTCC CTTTTAATTT AACTTAGAGT TTTTTAAATT TTTAAGGAGT GAAAAAAATG 780 

GATTACAAAG AAACGTTATT AATGCCTAAA ACAGATTTCC CAATGCGAGG TGGTTTACCA 840

AACAAGGAAC CGCAAATTCA AGAAAAATGG GATGCAGAAG ATCAATACCA TAAAGCGTTA 900 

GAAAAAAATA AAGGTAACGA AACATTCATT TTACATGATG GCCCACCATA CGCGAATGGT 960 

AACTTACATA TGGGACATGC CTTGAACAAA ATTTTAAAAG ACTTTATTGT ACGTTATAAA 1020

ACTATGCAAG GGTTCTATGC ACCATACGTA CCAGGTTGGG ATACACATGG TTTACCAATT 1080 

GAACAAGCAT TAACGAAAAA AGGTGTTGAC CGAAAGAAAA TGTCAACAGC TGAATTCCGT 1140 
TTAGGTGTTC GTGGTGACTT TAATGATCCA TATATTACAT TAAAACCTGA ATACGAAGCT 1260

GCACAAATTC GTATTTTTGG AGAAATGGCA GATAAAGGTT TAATTTATAA AGGTAAAAAG 1320 

CCAGTTTATT GGTCTCCTTC AAGTGAGTCT TCATTAGCAG AAGCAGAAAT TGAATATCAC 1380 

GATAAACGTT CAGCATCAAT TTACGTTGCA TTTGACGTTA AAGATGACAA AGGTGTCGTT 1440

GATGCAGATG CTAAATTTAT TATCTGGACA ACAACGCCAT GGACAATTCC ATCAAATGTT 1500 

GCGATTACCG TTCATCCTGA ATTAAAATAT GGTCAATACA ATGTAAATGG cGAAAAATAT 1560 

ATTATTGCAG AAGCCTTGTC TGACGCTGTA GCAGAAGCAC TGGaTTGGGA TAAAGCATCA 1620 

ATCAAATTAG AAAAAGAATA CACAGGTAAA GAATTAGAGT ATGTTGTAGC ACAACATCCA 1680

TTCTTAGACA GAGAATCGTT AGTGATTAAT GGTGATCATG TTACTACAGA TGCTGGTACA 1740 

GgTTGTGTAC ATACAGCACC AGGTCACGGG GAAGATGACT ATATTGTTGG TCAAAAATAT 1800 

GAATTGCCAG TAATTAGTCC AATCGATGAT AAAGGTGTAT TTACTGAAGA AGGCGGCCAA 1860 

TTTGAAGGGA TGTTCTATGA TAAAGCTAAT AAAGCCGTTA CTGATTTATT AACAGAAAAA 1920 

GGTGCACTAT TAAAATTAGA CTTTATTACA CATAGCTATC CACACGACTG GAGAACAAAA 1980 

AAACCTGTAA TCTTCCGTGC TACACCACAA TGGTTTGCCT CAATCAGTAA AGTAAGACAA 2040

GATATTTTAG ATGCAATCGA AAATACAAAC TTCAAAGTAA ATTGGGGTAA AACACGTATT 2100 

TACAATATGG TTCGTGACCG TGGCGAATGG GTTATTTCTC GTCAACGTGT GTGGGGTGTA 2160 

CCGTTACCAG TATTTTATGC TGAAAATGGC GAAATTATCA TGACGAAAGA AACAGTGAAT 2220 

CATGTTGCTG ATTTATTTGC AGAACACGGT TCAAATATTT GGTTTGAAAG AGAAGCGAAA 2280 

GACTTACTAC CAGAAGGATT TACACATCCA GGCAGCCCTA ACGGTACATT TACTAAAGAA 2340

ACAGACATTA TGGACGTTTG GTTTGATTCT GGTTCATCAC ACCGTGGCGT GTTGGAAACA 2400 

AGACCGGAAT TAAGTTTCCC AGCGGATATG TATTTAGAAG GTAGTGACCA ATATCGTGGT 2460 

TGGTTCAACT CTTCTATCAC AACTTCAGTT GCTACAAGAG GAGTATCACC TTATAAATTC 2520

TTACTTTCTC ATGGTTTTGT TATGGACGGT GAAGGTAAGA AAATGAGTAA ATCTTTAGGT 2580

AATGTGATTG TACCTGACCA AGTGGTTAAA CAAAAAGGTG CTGATATTGC GAGACTTTGG 2640 

GTAAGTAGTA CGGACTATTT AGCTGATGTT AGAATTTCTG ATGAAATTTT AAAACAAACA 2700 

TCTGATGTTT ATCGTAAAAT CAGAAATACA TTAAGATTTA TGTTAGGTAA CATTAACGAT 2760

TTCAATCCTG ACACAGATAG CATTCCTGAA TCAGAGTTAT TAGAAGTGGA TCGTTACTTG 2820

CTAAATCGTT TACGTGAATT TACTGCAAGT ACGATTAACA ACTATGAAAA CTTTGACTAC 2880 

TTAAATATTT ATCAAGAAGT TCAAAACTTT ATCAATGTTG AGTTAAGTAA TTTCTATTTG 2940
CAAACAGTGT TATATCAAAT TTTAGTTGAT ATGACGAAGT TGTTAGCACC AATCTTAGTG 3060

CATACAGCTG AAGAAGTTTG GTCTCATACA CCACATGTTA AAGAAGAAAG TGTTCACTTA 3120 

GCAGACATGC CTAAAGTTGT AGAAGTAGAT CAAGCTTTAT TGGATAAATG GCGTACATTT 3180 

ATGAATTTAC GTGATGATGT GAACCGTGCA TTAGAAACTG CTCGTAATGA AAAAGTTATT 3240 

GGTAAATCAT TAGAAGCTAA AGTTACGATT GCTAGTAACG ATAAATTTAA TGCATCTGAA 3300

TTCTTAACTT CATTTGATGC ATTACATCAA TTATTTATCG TGTCACAAGT TAAAGTTGTA 3360 

GATAAGTTAG ACGATCAGGC AACAGCTTAT GAACATGGTG ATATTGTCAT CGAACATGCA 3420 

GATGGTGAAA AATGTGAAAG ATGTTGGAAC TATTCAGAGG ATCTTGGTGC TGTTGATGAA 3480

TTGACGCATC TATGTCCACG ATGCCAACAA GTTGTAAAAT CACTTGTATA ATTGAAATTG 3540 

TATAAAGTAC TCATACAGAT GATATAAATT AAAGCTCTCT TCATAATCAT GTTGTAGTTT 3600 

TTGTTGACAT GATGAAGAGA GTTTTTTTGT GAATAAAAAA ATGACCAAGT TACCGGTCAT 3660

ATATGTAAAA AATGTGCGAT TTACTAAAAT AAAAATTATT CAGGAATGGT ACAAATTCTC 3720 

TGAGGCATAT AAATGCGTTA TAGTTGCTAT TCTCAATTAT GTTCGCGATA ATTTTAAGTA 3780 

AAAGTAAGCA CAGATATTGA ATTTGATAGG AGTTAATTGA ATGTATCATA ACAGTAACGC 3840 

AAACTTTGTC AATGGTATCA CTTTAAATGT GAGAGATAAG AATGAATTAA AGCCATTTTA 3900 

TGAGGACATA TTAGGATTAA ATATTATAAA TGAGACATTA ACATCGATAC AATATGAAGT 3960

AGGTCAAAAT AATCATGTCA TTACACTTGT TGAATTACAA AATGGACGTG AACCTTTAAT 4020

GTCCGAAGCG GGACTGTTTC ATATCGCAAT TAAACTACCT CAAATTAGTG ATTTAGCTAA 4080

TTTACTAATT CATTTAAGCG AATATGATAT TCCAGTTAAC GGAGGTATAC AGCCTGCTTC 4140

GTTATCATTA TTTTTTGAAG ACCCGGAAGG AAACGGTTTT AAATTTTATG TTGATAAAGA 4200 

CGAASCGCAA TGGACGAGGC AAAATAATTT AGTAAAAATT GATATTAGAC CATTAAATGT 4260 

ACCGAGATTA GTGAGTCATG CAACAAAATT GTTATGGTTA GGTATTCCAG ATGACGCTAT 4320

TATAGGTGCA TTGCATATTA AGACAATTCA TTTATCAGAG GTAAAAGAGT ACTACCTCGA 4380 

TTATTTTGGA TTAGAGCAAT CGGCATATAT GGATGATTAT TCAATATTTT TAGCATCGAA 4440

TGGCTATTAT CAACATTTGG CCATGAATGA TTGGGTATCA GCAACGAAAC GTGTAGAAAA 4500 

TTTTGATACG TATGGATTAG CAATTGTTGA CTTTCATTAT CCTGAAACAA CACATTTAAA 4560 

TTTACAAGGT CCGGATGGTA TCTATTATCG CTTTAATCAT ATCGAAGTTG AAGATTAGTA 4620

TATACTTTGA ATGGACGAAC CATATAATGA ATCGTTTTTA ATGATCTTTT TATACAAGTT 4680 

ATGAAGGAGG CTGGGACATT AAGTTCTTAG GCAATGTAAA AAGCTGATTT CTATTAATTA 4740 
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TTTTCCTTAT ATTAATTGCC ATT AATACAA AACCTAGCTC TCGTTTAACT TTATTTATTC 4860 
CTCGAACTGA CATTCGnGTG AACTCAAAAT nGCCTACTTn CTTAAATTAC CAATATCTAT 4920
(2) INFORMATION FOR SEQ ID NO: 27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 27:

TGGATTGCCA TTACATGGAC AAGATTTAAC TGAATCAATT ACACCATATG AAGGTGGTAT 60 

CGCTTTTGCA AGTAAACCAT TAATTGATGC TGATTTTATT GGTAAATCTG TATTAAAAGA 120 

TCAAAAAGAA AATGGTGCAC CAAGAAGAAC AGTGGGATTA GAATTACTTG AAAAAGGAAT 180

TGCAAGAACT GGTTATGAAG TTATGGATTT AGATGGAAAT ATTATTGGAG AAGTAACTTC 240 

AGGAACACAG TCTCCATCAT CAGGAAAATC AATTGCACTT GCAATGATAA AAAGAGATGA 300 

GTTTGAAATG GGTAGAGAGT TGCTTGTTCA AGTTCGTAAG CGTCAATTAA AAGCGAAAAT 360 

TGTTAAGAAA AATCAAATTG ATAAATAATT AAAAAGGGGT GTGCATTGTG AGTCATCGTT 420 

ATATACCTTT AACTGAAAAA GACAAGCAAG AAATGTTACA AACAATTGGT GCAAAATCTA 480

TAGGAGAATT ATTCGGTGAT GTACCAAGTG ACATTTTATT AAATAGAGAT TTAAATATTG 540 

CTGAAGGCGA ACGGAGAACA ACGTTACTTA GAAGATTnAA TCGCATTGCA AGCAAGAGTA 600 

TCACTAGAGG AACGCGTACA TCGTTT 626

(2) INFORMATION FOR SEQ ID NO: 28: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1126 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 28:
nGGAAGTGGT GTATATATTT GTAATGAGTG TATTGAATTA TGCTCAGAAA TCGTCGAAGA 60 
AGAATTAGCT CAAAACACTT CTGAAGCGAT GACAGAATTA CCTACTCCTA AAGAAATTAT 120
GGATCATTTA AACGAATATG TTATTGGTCA AGAAAAAGCT AAAAAATCTT TAGCTGTAGC 180 
TGTTTATAAC CACTATAAGC GTATTCAACA ATTAGGACCA AAAGAAGATG ATGTTGAATT 240 
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10 



AACCTTAGCC AAGACGTTGA ATGTACCATT TGCAATTGCA GATGCGACAA GTTTAACTGA 360

AGCTGGTTAT GTAGGCGATG ATGTTGAAAA TATCTTGTTG AGATTAATTC AAGCAGCTGA 420

CTTTGACATT GATAAAGCCG AAAAAGGTAT TATTTATGTA GATGAAATTG ATAAAATTGC 480

ACGTAAATCT GAAAACACAT CTATAACACG TGACGTTTCA GGTGAAGGTG TTCAACAAGC 540

ATTGCTTAAA ATCTTAGAAG GTACGACTGC AAGTGTTCCG CCACAAGGTG GACGCAAACA 600

TCCAAACCAA GAAATGATTC AAATTGATAC AACAAATATC TTATTTATTC TTGGTGGTGC 660

CTTTGATGGT ATTGAAGAAG TGATTAAGCG CCGTCTTGGT GAAAAAGTTA TTGGTTTCTC 720

AAGCAATGAA GCTGATAAAT ATGACGAACA AGCATTATTA GCACAAATTC GCCCAGAAGA 780

TTTGCAAGCC TATGGTTTGA TTCCTGAATT TATCGGACGT GTGCCAATTG TAGCTAATTT 840

AGAAACATTA GATGTAACTG CGTTGAAAAA CATCTTAACG CAACCTAAAA ATGCACTTGT 900

GAAACAATAT ACTAAAATGC TGGAATTAGA TGATGTGGAT TTAGAGTTCA CTGAAGAAGC 960

TTTATCAGCA ATTAGTGAAA AAGCAATTGA AAGAAAAACA GGTGCGCGTG GTTTACGTTC 1020

AATCATAGAA GAATCGTTAA TCGATATTAT GTTTGATGTG CCTTCTAACG AAAATGTAAC 1080

GAaGGTAGTT ATTACAGCAC AAACmATTAA TGrAGaACTG AACCAG 1126



(2) INFORMATION FOR SEQ ID NO: 29: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4392 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 29: 



ATTGACTTCT TAGCAATnAA TaTGAGTGAA GAACGTACTG TTGAAGTACC AGTTCAATTA 60

GTTGGTGAAG CAGTAGGCGC TAAAGAAGGC GGCGTAGTTG AACAACCATT ATTCAACTTA 120

GAAGTAACTG CTACTCCAGA CAATATTCCA GAAGCAATCG AAGTAGACAT TACTGAATTA 180

AACATTAACG ACAGCTTAAC TGTTGCTGAT GTTAAAGTAA CTGGCGACTT CAAAATCGAA 240

AACGATTCAG CTGAATCAGT AGTAACAGTA GTTGCTCCAA CTGAAGAACC AACTGAAGAA 300

GAAATCGAAG CTATGGAAGG CGAACAACAA ACTGAAGAAC CAGAAGTTGT TGGCGAAAGC 360

AAAGAAGACG AAGAAAAAAC TGAAGAGTAA TTTTAATCTG TTACATTAAA GTTTTTATAC 420

TTTGTTTAAC AAGCACTGTG CTTATTTTAA TATAAGCATG GTGCTTTTTG TGTTATTATA 480

AAGCTTAATT AAACTTTATT ACTTTGTACT AAAGTTTAAT TAATTTTAGT GAGTAAAAGA 540
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CTTACTAAGC TAAAGAATAA TGATAATTGA TGGCAATGGC GGAAAATGGA TGTTGTCATT 660 

ATAATAATAA ATGAAACAAT TATGTTGGAG GTAAACACGC ATGAAATGTA TTGTAGGTCT 720 

AGGTAATATA GGTAAACGTT TTGAACTTAC AAGACATAAT ATCGGCTTTG AAGTCGTTGA 780 

TTATATTTTA GAGAAAAATA ATTTTTCATT AGATAAACAA AAGTTTAAAG GTGCATATAC 840

AATTGAACGA ATGAACGGCG ATAAAGTGTT ATTTATCGAA CCAATGACAA TGATGAATTT 900 

GTCAGGTGAA GCaGTTGCAC CGATTATGGA TTATTACAAT GTTAATCCAG AAGATTTAAT 960 

TGTCTTATAT GATGATTTAG ATTTAGAACA AGGACAAGTT CGCTTAAGAC AAAAAGGAAG 1020 

TGCGGGCGGT CACAATGGTA TGAAATCAAT TATTAAAATG CTTGGTACAG ACCAATTTAA 1080

ACGTATTCGT ATTGGTGTGG GAAGACCAAC GAATGGTATG ACGGTACCTG ATTATGTTTT 1140

ACAACGCTTT TCAAATGATG AAATGGTAAC GATGGAAAAA GTTATCGAAC ACGCAGCACG 1200 

CGCAATTGAA AAGTTTGTTG AAACATCACG ATTTGACCAT GTTATGAATG AATTTAATGG 1260
TGAAGTGAAA TAATGACAAT ATTGACAACG CTTATAAAAG AAGATAATCA TTTTCAAGAC 1320

CTTAATCAGG TATTTGGACA AGCAAACACA CTAGTAACTG GTCTTTCCCC GTCAGCTAAA 1380 

GTGACGATGA TTGCTGAAAA ATATGCACAA AGTAATCAAC AGTTATTATT AATTACCAAT 1440

AATTTATACC AAGCAGATAA ATTAGAAACA GATTTACTTC AATTTATAGA TGCTGAAGAA 1500 

TTGTATAAGT ATCCTGTGCA AGATATTATG ACCGAAGAGT TTTCAACACA AAGCCCTCAA 1560 

CTGATGAGTG AACGTATTAG AACTTTAACT GCGTTAGCTC AAGGTAAGAA AGGGTTATTT 1620

ATCGTTCCTT TAAATGGTTT GAAAAAGTGG TTAACTCCTG TTGAAATGTG GCAAAATCAC 1680 

CAAATGACAT TGCGTGTTGG TGAGGATATC GATGTGGACC AATTTCTTAA CAAATTAGTT 1740

AATATGGGGT ACAAACGGGA ATCCGTGGTA TCGCATATTG GTGAATTCTC ATTGCGAGGA 1800 

GGTATTATCG ATATCTTTCC GCTAATTGGG GAACcAATCA GAATTGAGCT ATTTGATACC 1860 

GAAATTGATT CTATTCGGGA TTTTGATGTT GAAACGCAGC GTTCCAAAGA TAATGTTGAA 1920 

GAAGTCGATA TCACAACTGC AAGTGATTAT ATCATTACTG AAGAAGTGAT CAGCCATCTT 1980

AAAGAAGAGT TAAAAACTGC ATATGAAAAT ACAAGACCCA AAATAGATAA ATCAGTGCGC 2040 

AATGATTTGA AAGAAACGTA TGAAAGCTTT AAATTATTCG AAAGTACATA CTTTGATCAT 2100

CAAATACTAC GTCGCTTAGT AGCGTTTATG TATGAAACAC CTTCGACAAT TATTGAGTAT 2160 

TTCCAAAAAG ATGCAATCAT TGCAGTTGAT GAATTTAATC GTATTAAAGA AACTGAAGAA 2220

AGTTTAACAG TAGAGTCTGA TTCGTTTATT AGCAATATTA TTGAAAGTGG TAATGGATTT 2280 

ATAGGACAAA GTTTTATAAA ATATGATGAT TTTGAAACAT TGATTGAAGG CTATCCTGTC 2340 
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TCATGTAAAC CTGTCCAACA ATTTTATGGG CAATATGACA TTATGCGTTC TGAATTTCAA 246 0 

CGATATGTTA ATCAAAACTA TCATATCGTG GTTTTGGTCG AAACCGAAAC TAAAGTTGAA 2520 

CGTATGCAAG CGATGTTAAG TGAAAtGCAT ATTCCATCAA TAACAAAATT GCATCGCTCA 2580 

ATGTCATCGG GGCAAGCAGT GATTATTGAA GGCAGTTTAT CTGAAGGATT TGAACTACCT 2640 

GATATGGGAT TAGTTGTCAT TACTGAGGGT GAgcTTTTTA AATCAAAACA GAAAAAGCAA 2700 

CGAAAACGTA CGAAAGCTAT CTCAAATGCT GAAAAAATTA AGTCTTACCA AGATTTAAAT 2760 

GTGGGAGATT ATATTGTTCA TGTGCATCAT GGTGTTGGTA GATATTTAGG TGTTGAGACG 2820 

CTCGAAGTGG GGCAAACGCA TCGTGATTAT ATTAAATTGC AATATAAAGG TACGGATCAA 2880

CTATTTGTTC CAGTAGATCA AATGGATCAA GTTCAAAAAT ATGTAGCTTC GGAAGATAAG 2940 

ACGCCAAAAT TAAATAAACT CGGTGGCAGT GAATGGAAAA AAACAAAAGC TAAAGTTCAA 3000 

CAAAGTGTTG AAGATATTGC TGAAGAGTTG ATTGATTTAT ATAAAGAAAG AGAAATGGCA 3060 

GAAGGTTATC AATATGGGGA AGACACAGCT GAGCAAACAA CATTTGAATT AGATTTTCCA 3120 

TATGAACTTA CGCCTGACCA AGCTAAATCT ATCGATGAAA TTAAAGATGA CATGCAAAAA 3180 

TCGCGTCCAA TGGATCGCTT GCTATGTGGT GATGTTGGTT ATGGTAAAAC TGAAGTTGCA 3240

GTGAGAGCAG CATTCAAAGC TGTAATGGAA GGAAAGCAGG TTGCATTTTT AGTTCCTACA 3300 

ACTATTTTAG CTCAGCAACA TTATGAGACG TTAATTGAGC GTATGCAAGA TTTTCCTGTT 3360

GAAATTCAAT TAATGAGTCG TTTTAGAACG CCTAAAGAGA TAAAACAAAC TAAGGAAGGA 3420 

CTTAAAACTG GATTTGTTGA CATAGTTGTT GGTACACACA AATTACTTAG TAAAGATATA 3480

CAGTATAAAG ATTTAGGGCT GTTGATTGTA GATGAAGAAC AACGATTTGG TGTACGCCAT 3540

AAAGAGCGTA TTAAAACATT AAAACATAAT GTAGATGTAC TAACATTGAC TGCAACCCCA 3600 

ATAGCTAGAA CATTGCATAT GAGTATGCTA GGTGTGCGGG ATTTGTCAGT GATTGAAACG 3660 

CCGCCAGAAA ATCGTTTCCC AGTTCAAACA TATGTATTAG AACAGAACAT GAGTTTTATC 3720 

AAAGAAGCTT TAGAAAGAGA ACTATCCCGT GATGGCCAAG TGTTTTATCT TTATAATAAA 3780 

GTGCAATCCA TTTATGaAAA ACGAGAACAA CTCCAGATGT TAATGCCAGA TGCTAACATT 3840 

GCAGTTGCTC ATGGACAAAT GACAGAGCGC GATTTAGAAG AAACGATGTT AAGTTTTATC 3900

AATAATgAAT ATGATATTTT AGTAACGACG ACGATTATTG AAACAGGTGT CGATGTCCCA 3960

AATGCAAATA CTTTGATCAT TGAAGATGCA GATCGCTTTG GATTGAGTCA GTTGTATCAA 4020

TTAAGAGGTC GTGTTGGTCG TTCAAGTCGT ATTGGTTATG CATACTTCTT ACATCCAGCA 4080 

AATAAGGTAC TAACTGAGAC TGCAGAAGAT CGATTACAAG CGATTAAAGA ATTTACGGAG 4140
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TTAGGTAAAC AACAGCACGG CTTTATTGAT ACAGTTGGAT TTGATTTGTA CAGTCAAATG 
TTAGAAGAAG CTGTAAATGA AAAACGTGGT ATTAAGGAAC CAGAATCTGA GGTGCCAGAA 4320
GTCGAAGTTG ATTTAAACTT GGATGCATAT TTGCCAACAG AATATATTGC AAATGAACAA 4380
GCTAAAATTG AA 4392

(2) INFORMATION FOR SEQ ID NO: 30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 30: 



TTTCTTTTGA ATCTATATCG AGGTGGTTGG TAGGTTCATC TAAAATAAGT ACATTGTCAC 60

GTTGCAACAT AAGTAGTGCT AGTTGTAAAC GTGCTTTTTC ACCACCAGAT AAATCATTAA 120

TTATCTTTTT AACATCGTCT TGTACAAATA AGAAACGTCC AAGAACTGCT CGAATATCTT 180

TTTCATTCAT TAACGGATAT TGATCCCACA CATAATCTAA AATCGTTTTA CTAGATTTAA 240

ATTCTGCTTG CTTTTGATCA TAATAACCAA TTTGTAAATT TGCGCCGAAA GTAATATCGC 300

CATTAAGCGC TTTTTGTTGA TTAGCAATAG TTTTAATTAA GGTCGATTTT CCAATACCAT 360

TTGGCCCAAT GATTGCTATA TGATCGCCTT TAGAGACCTC TATACTCATA GGTTTGGTAA 420

TTGCAGTTTG ATAACCGATT TCTAAATTTT TTACATGCAT GACGTCATTA CCTGTATTCC 480

GGTCAAAGCC AAATTGAATA TTTGCACTTT TGGCATCTAA CATTGGTTTA TCAATGCGTT 540

CCATTTTTTC TAAAATCTTA CGTCTACTTT TTGCCATTCC ACTTGTTGAA GCACGGGTAA 600

TATTTTTCTC AACAAAAGTT TCTAATCGTT TTATTTCTGC TTGTTGACTT TCATATTCTT 660

GCATTCGTTT TTGATAATAT AAATCCCGTT GCTGTATAAA TTCCTCGTAA TTACCAACAT 720

AGCGTTTGA 729

(2) INFORMATION FOR SEQ ID NO: 31: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13856 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 31: 
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TGATGTTTCG ATACATTTGT TGCACCTTGT GGATATACTT TAAAGGTTGT GTCGTATGTT 120 

TCCTTACTAT CTTTAGCTTC AGATTCCTGT GATTCAACCG TTTTATATTT TTCAAGTGCA 180 

TGTCCTTCAA TATCAACTCG TGGAATAATG CGATTCAACC ATGCTGGTAA ATACCACGAA 240

CCTTTtCCAA ACAATTTCGt TAATGCAGGA ATTAACATCA TtCTGACTAC GAAGGCATCA 300 

AAGAGTACAC CAAACGCTAA TGCCATACCC ATTGATTTAA TCATGACATC TTCTTGGAAT 360 

ACAAACGCAA AGAAGACACT AAACATAATT AATGCAGCTG CTACAATAAC AGGACCGCTT 420 

TCTTTCAATC CTACTTTGAT AGAATAATCA TTATCCCCTG TTTTACTATm yyCTTCATGr 480 

ATTCGCGACA TAAGGAAGAC TTCATAATCC ATCGCTAATC CAAATAAGAT ACCTATAGTA 540

ATAACCGGTA AAAATGCTAG CATTGGTCCT GTCGTTTCAA TACCAAACAG ACCTTTCATA 600 

AAACCATCTT GCATTACTAA TGTTGTAAAT CCTAATGTTG CCATTAATGA CAAGACGAAT 660 

CCTAAAACTG CTTTTAATGG TATTAGAATT GAACGGAAGA CAATCATTAA TAAGAAAAAT 720

GCTAATACAA CAATGACTGA GGCAAATAAA GGTATCGCCT CATTTAACTT TTTAGACATA 780 

TCAATATTAA TGACACTTTG TCCCGAAATC TCCGTTTTGA ACCCATATTT ATCTTGTGCA 840

TCTTTATGAT AATCTCGTAA ATCATGCACT AAATCATTTG TACTCTCTGC ATTAGGCCCT 900 

TGCTTAGGTA TCACGACCAT CAAAGCGTAA TCATTATCTT TACTCATTTG TGGTGGCGTA 960

ACGATATCTA CATTTTTCTT ATCTTTAATA TCTTTATATA CAGACTGTAA ATCTTGTTGT 1020 

AATCCTTGTG GATCATCCTT TTTATCTTTC ACATTTATCA ACATCGGTAT TTGGCCATTA 1080 

AATCCTTCAC CAAATTTATC CGAGATAATA TCGTAAGCTT TTTTCTGTGT AGAATCTGCT 1140 

GGTTTAACAC CGTCATCTGG AATACCAAGT CGCATATGAC TAACTGGTAT TGCAGCTGCT 1200

ACTAATATGA TTAAACCTAG TAATACTGCC GCAAGTGCAT TTCCTGTAAT AAATTTAGAC 1260 

CATGGCGTAT CAATATCTTT TTTGAATTTA GACTGTAATT TATTCACTTT AATGCGTTtA 1320 

TGGAAAATGC TTATTAATGC AGGTAATAAA GTTAAAGCGC TAAGTACTGC AAAAACAACA 1380 

CTAATTGCCG AAGCAAATCC CATTACCGCT AAGAAGTCAA TGCCTACTAA TGATAAACCA 1440

CATACTGCAA TTACAACTGT TACACCAGCA AAAACAACTG CACTACCTGC TGTTCCTATT 1500

GCAAGACCAA TGCCTTTAAT GTAATCTGTT TCAGTTTTCA TAACTTGTCG ATATCTGAAT 1560

AAAATAAATA ATGCATAATC GATACCAACT GCTAGTCCAA TCATTACGGC TAATGTCAGT 1620

GTGACATTTG GTATATCGAA TGCATAAGTT AACAAACTGA TAATACCTAC ACCAGAGGCT 1680

AGACCAATCA ATGCACTTAT AATTGGTAAT CCTGCAGCAA TGACTGAACC GAATGTGATT 1740

AACAGTACAA CAAATGCAAC AATAATACCA ACTAGTTCAG AATTACCGCC TACTTCTGTA 1800 
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AAATGACTTT TAACATTATC TCTAGAGCCA TCTTTTAAAG ATGTTTGACT AACGTCATAT . 1920 

GTGATATCTG CAAATGCAGT TGTTTTATCT TTACTAATTT GCTTATTTTC ATAAGGATCT 1980 

GATATTTTAT CAATGTGCTT GTCATCTTTT TTAATATCAT CTAACGTTTT CTTAATATCT 2040 

TTAGTAATGT TCGGTTGCAC AATACCATCA TCTTTAGTCG TCTTAAAGAC AACACGTATT 2100 

TGTGCCTTTT CACTATCTTG ATTAAAATGT TTTTCAATCT TTTTATTCGT ATCTAACGAC 2160

TCTAATCCTG TCATTTTAAT ATCATTGTCA AATTTCGGTG CATTTGTAGC AAGTGGTATC 2220 

AATATTGCAG CTACAATCAC TATCCATGCA ATGACCGCGG ACCATTTATG TTTTGCGATG 2280 

AATGTCCCCA TCTTATATAA AAATTTTGCC AAAGTATATT GCCTCCTTTT AAAATCAACG 2340 

TTATAGTTTA AATATACAGT GTAGATTATT GTTCGATTAT AGTATCTATC CCCGACCTCT 2400 

TAAAGAATCA ATTGGAAAAT TTTGTATATT AAACTACACA CAAAGGAGAA ATGTAGATGA 2460 

AAGAGACTGA TTTACGAGTT ATAAAGACAA AAAAAGCATT GTCGAGTAGC TTGCTACAAT 2520

TGTTAGAACA GCAATTATTC CAAACGATTA CTGTCAATCA AATTTGCGAC AACGCACTCG 2580 

TACACCGTAC AACATTTTAT AAACATTTTT ATGATAAATA TGATCTTCTA GAGTACTTGT 2640

TCAATCAATT GACTAAAGAC TACTTTGCTA GAGATATCAG TGACCGTCTT AATCATCCAT 2700 

TCCAAACGAT GAGTGATACG ATTAATAATA AAGAGGATTT GAGAGAAATC GCAGAATTCC 2760 

AAGAAGAAGA CGCTGAATTT AATAAAGTAT TAAAAAATGT CTGCATTAAA ATTATGCATA 2820 

ACGATATCAA AAATAATAGA GACCGTATCG ATATTGACAG CGACATCCCA GATAATCTCA 2880

TATTTTATAT TTATGACTCG TTGATTGAAG GTTTTATACA TTGGATAAAA GATGAAAAAA 2940 

TTGATTGGCC TGGCGAAGAT ATTGATAACA TTTTCCATAG ATTAATCAAT ATTAAGATTA 3000

AATAGTAGAT GAGAAACTCA TGAGCGTTAC CAACATTCAT AATAAAAACG ATAGTGkACA 3060 

CGTTAATGAA TTCGTGTACT ACTATCGTTT TTTATTTTTA TCGTGCTTAT CGCTATTAAA 3120 

ACAACTGATA CACAACACAT AAACTATGAA GAAAAAAATA AATCCGCTAT CTAAATGACT 3180 

TTGACTCAGT TGTTTAAATG ACCAAATTGC TAATACAATT CCCATTATTA TTGAAATAAC 3240

GTATCTCACA TTCTTATACC TATAATCCTT TTCTAAAAAT ATGGTTGCTA TTACTTAATT 3300 

TTTAAAGTTA TAAATAAAAA GAGCCAACCG CAATGGATGG CCCTTGTTCA TTATGAAGCA 3360 
TTAGAACATT TCTGAAACAA CCTTTTGTTC TAAGAAGTGT AATAAGTAGT CTGGACTACC 3420

TGTTTTAGCG TCCGTACCTG ACATTTTGAA ACCACCAAAT GGATGGTATC CAACAACTGC 3480

TGAAGTACAG CCTCTGTTAA GGTATAAATT GCCTACATCA AATTCGTTTA CCGCTTTAAT 3540 

CCAATGCTCG CGATTATTTG TAATCACTGC ACCAGTTAAA CCGTAATCTG TATCATTTGC 3600 
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TTCTTCTTGC ATGATTCTAT CTTTAGATTT 
AAGTCCTGAA ATGATTGTTG GTTCTACAAA 3720

GTAACCTTTT GAATCATCAG TGCCGCCACC TTGTTCTAAT TTACCTTCTT CTTTACCAAT 3780

CTCAATATAA TTTTTAATCT TATCAAATTG TTTTTTATTA ATAACTGGGC CCATATACGT 3840

ATTGTCTACA GTATTGCCCA ACGTTAATTC TTTTGTTAAT TTGATTGATT TCTCTAATAC 3900

TTCGTCATAA ACGTCTTTAT GCACAATTGC ACGTGAACAT GCTGAACATT TTTGACCAGA 3960

AAAACCAAAT GCTGACGTTA CAATAGCTTC TGCTGCCATA TCTGTATCAA TATTTTCATC 4020

AACTACAATG GCATCTTTAC CACCCATTTC AGCGATAACA CGTTTCAAGA AGTTTTGACC 4080

TTCTTGAACA ACGGCACTAC GTTCATAAAT TCTAGTACCT GTCGCACGTG ATCCTGTAAA 4140

TGTAACGAAA TGCGTATCTT TATGATCAAC TAAGTAATCA CCAATTTCTT TCGGATCACC 4200

AGGAACAAAG TTAACTACGC CTTTTGGTAA TCCTGCTTCT TCTAAAATTT CCATTAATTT 4260

ATAAGCGATA TAAGGTGTAT CCTCAGCAGG TTTCAATAAC ACTGTATTAC CTGCCACAAC 4320

TGGTGCTAAA GTTGTACCAG CCATAATCGC AAACGGGAAG TTCCACGGCG GAATTGTAAC 4380

ACCTGTACCA ATTGATTTAT AGAAATATTT ATTGTGTTCA CCTTCACGAT CAAGTACTGG 4440

CTTACCTTGA GCCAAGTCCA TCATTGAACG TGCATAGTAT TCAATAAAAT CAATACCTTC 4500

AGCTGCATCA CCAACTGCTT CATCCCATGG CTTACCTGCT TCATAAACCA TAATTGCTGC 4560

AATTTCCGCT TTTCGACGAC GAATAATTGC CGAAACACGT AACATAAGCT CTGCACGATC 4620

ATTTGCTGAC CATGTTTTCC AAGATTTATA AGCTTCGTTT GCTGCTTTAA ACGCATCTTC 4680

AACATCTTGT TTTGTTGCCT TTGATGCATT TGCAATCACT TGTGATGTGT CTGCAGGATT 4740

GATTGATTTA ATTTTGTCAT CTTTGAAAAT CTTCTCTCCA TTAATCACTA ATGGTATGTC 4800

TTGACCTAAT TCTTTTTCCA CGTCTTTCAA TGCTTTCTTA AACATATCCA CATTTTCTTG 4860

GACTGAAAAA TCGTAACCAG GTTCATTTTT AAATTCTACT ACCATGTACA CTTACCCCCT 4920

ATAAATTTTG AAAGTGGTTT AACCCTTTGA TTTAATGATA TAACATCATT TAAACTCATT 4980

TTACTATGAT TAAGGTTAGT TTTGCAATCG CTTTCATTTT TATGTTTTAT CACTTATTCT 5040

CAAGTATTTT GAAATTGATT GGTTACTTTT TAAAATTTAT ATGGGTCGCA ACTGCTACTT 5100

TATCGTTTCG TCATTTAATG TTTCGGATGG TAGGTCATTA TCAATTTTAC GAACGACTTT 5160

ACAAGGGTTT CCAACCGCTA AGCTGTGTGG CGGAATATCT TTAGTGACAA CACTACCAGC 5220

ACCAATCACA CTGCCTTCTC CAATCGTCAC CCCTGGTAAC ACGGCTACAT GACCGCCAAA 5280

CCAAGTATTA CTGCCAATAT GAATGGGTCC GGCTTTTTCA AAACCTTCAT TTCTATGATG 5340

GAAATTAAGT GGATGTGTCG CTGTGTAGAA TCCACAATTA GGTCGTATAA AAACATTATC 5400



EP 0 786 519 A2 



TCCTAGTTTA ACGTTCCAAC CATAATCTGT ATCAAAAGGA ATCGAAATAC TTACATTGTC 5520

TGTTGTTGTT TGAAATAATT GATCAATTAA TTCCTTTCTT TTATTTGTAG CACTCGGTCT 5580

TGTATGATTT AATTCAAAGC AAATATCTTT CGCTCGTGCA CGTTCATTGA TTAAGTATTG 5640

ATCAAAGTTT GCATCGTACC ATTTTTCTGC TAACATTTTT TCTTTTTCAG TCATTACACC 5700

TTTCAACTCC TAATAACTTA TTTACTTGTT TAAAAGTTAA TCAAATAAAC CTTCGCCTAT 5760

GCAACTAATA CGCTATAACA TTATGAAATC ATGACCTTAT CACCCTTATC TATACAATTC 5820

TCGCATCAAA TACTGCTAAA GTAGTAGATA AATTCAATAC TACAGACGCA TTCATTTTTT 5880

AATCTATTAA CGTACAATGT GAGTAAGAGA AATATAAAGG AGTATGATAG CGATGAGAAT 5940

ATTAATTACA GGCACAGTTG CTATCTTAAT CATTCTAGGT TTGGTCAAAA CGATACAAGA 6000

TTACGAAATG ACAAACGACA CGAGTCGTcA GTTGTCAGAC AACAAAGATG ATGATAAAGT 6060

CATCCATCTT AATAATTTTA AAAATTTACA TGCGAAAGAA TTTAACCCAT CTGATTTCTT 6120

TTAAGTCACC TAAGAATTGC AAATCCAGAA GTCATTTAAG TTTTACCTTT CATTCATACA 6180

TCCTTTAATA TTAATTACGA CTTCTTTTAT ATAGATGCTA AGTAGAGAGA TTGTTGTGCA 6240

ATGTTTGCAC GGCAATCTCT CTTTTTCTTT TTAAAATTGG TAAAAGTAAA ACGCAACGAT 6300

TGACTTATAT ACCTATAGGG GGTACATTAG ACGTGTAACA ATGAATCACA GGGAGGCAAT 6360

AATGTGGCTA ATACGAAAAA AACAACATTA GATATCACTG GTATGACTTG TGCCGCATGT 6420

TCAAATCGTA TCGAAAAGAA ACTGAATAAA CTTGATGACG TTAATGCCCA AGTGAATTTA 6480

ACTACAGAGA AAGCAACTGT TGAGTATAAC CCTGATCAAC ATGATGTCCA AGAATTTATT 6540

AATACGATTC AACATTTAGG TTACGGTGTC GCTGTAGAAA CTGTCGAATT AGACATTACA 6600

GGTATGACTT GTGCTGCATG CTCAAGCCGT ATTGAAAAAG TGTTAAATAA AATGGACGGC 6660

GTTCAAAATG CAACGGTCAA TTTAACAACA GAGCAAGCTA AAGTTGACTA TTATCCTGAA 6720

GAAACAGATG CTGATAAACT TGTCACTCGC ATTCAAAAAT TAGGTTATGA CGCGTCTATT 6780

AAAGATAACA ATAAAGATCA AACGTCACGC AAAGCTGAAG CGCTACAACA TAAATTGATT 6840

AAGCTTATCA TATCAGCAGT ATTATCTTTA CCACTATTAA TGTTAATGTT TGTACATCTT 6900

TTCAATATGC ATATACCAGC ACTATTTACG AATCCATGGT TCCAATTTAT TTTAGCTACA 6960

CCTGTACAAT TTATTATTGG ATGGCAATTT TATGTAGGTG CTTATAAAAA CTTAAGAAAT 7020

GGTGGCGCCA ATATGGATGT ACTTGTTGCT GTTGGTACAA GTGCAGCATA TTTTTACAGT 7080

ATTTATGAAA TGGTTCGTTG GCTAAATGGC TCAACAACGC AACCGCATTT ATACTTTGAA 7140

ACAAGCGCCG TACTAATTAC CTTAATCTTA TTCGGTAAGT ATTTAGAAGC TAGAGCGAAG 7200
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TTAAAAGATG GTAATGAAGT GATGATTCCT CTAAATGAAG TACATGTTGG AGATACACTT 7320 

ATCGTTAAAC CAGGTGAAAA GATACCTGTT GATGGCAAAA TTATTAAAGG TATGACTGCC 7380 

ATCGACGAAT CTATGTTAAC AGGTGAATCT ATCCCTGTTG AGAAGAATGT TGATGATACT 7440 

GTAATTGGTT CAACGATGAA CAAAAACGGT ACTATTACTA TGACAGCAAC AAAAGTTGGC 7500 

GGGGACACTG CGTTGGCAAA TATTATTAAA GTTGTCGAAG AAGCTCAAAG TTCTAAAGCG 7560 

CCGATTCAAC GATTGGCAGA TATTATTTCT GGTTATTTCG TTCCTATCGT TGTTGGTATC 7620 

GCACTATTAA CATTTATCGT GTGGATTACT TTAGTTACAC CAGGTACATT TGAACCTGCA 7680 

CTTGTTGCGA GTATTTCCGT TCTCGTCATT GCTTGTCCAT GCGCATTGGG ACTTGCTACA 7740

CCAACTTCTA TTATGGTAGG TACTGGTCGC GCTGCTGaAA ATGGTATTTT ATTTAAAGGT 7800 

GGCGAGTTTG TTGAACGCAC ACATCAAATT GATACCATCG TTTTAGATAA GACGGGTACC 7860 

ATTACAAATG GTCGTCCAGT CGTGACAGAT TATCATGGTG ACAATCAAAC GCTACAACTA 7920

CTTGCTACTG CTGAAAAAGA TTCTGAACAC CCATTGGCAG AAGCCATTGT CAATTATGCA 7980 

AAAGAAAAGC AATTAATATT AACTGAGACA ACAACATTTA AAGCAGTACC TGGCCATGGT 8040

ATTGAAGCAA CGATTGATCA TCACCATATA TTGGTTGGTA ACCGTAAATT AATGGCTGAC 8100

AATGATATTA GCTTGCCTAA GCATATTTCT GATGATTTAA CACATTATGA ACGAGATGGT 8160 

AAAACTGCTA TGCTCATTGC TGTTAATTAT TCATTAACTG GTATCATCGC AGTGGCAGAT 8220

ACTGTCAAAG ATCATGCCAA AGATGCTATA AAACAATTGC ATGATATGGG CATTGAAGTT 8280

GCCATGTTAA CTGGCGATAA TAAAAACACT GCTCAAGCCA TTGCAAAACA AGTAGGCATA 8340 

GATACTGTTA TTGCAGATAT TTTACCAGAA GAAAAAGCTG CACAAATTGC GAAACTACAG 8400

CAACAAGGTA AGAAGGTTGC GATGGTTGGT GACGGTGTAA ATGATGCACC TGCATTAGTT 8460 

AAAGCTGATA TCGGTATCGC CATTGGTACA GGTACAGAAG TTGCCATTGA AGCAGCTGAT 8520 

ATTACTATTC TTGGTGGCGA CTTGATGCTT ATTCCTAAAG CCATTTATGC AAGTAAAGCA 8580 

ACCATTCGTA ATATTCGTCA AAATCTATTT TGGGCATTCG GCTATAATAT TGCCGGTATC 8640 

CCTATAGCTG CATTOGGCTT ACTTGCGCCA TGGGTTGCTG GTGCTGCAAT GGCACTAAGT 8700

TCAGTAAGTG TTGTCACAAA CGCACTTAGA TTGAAAAAGA TGCGATTAGA ACCACGCCGT 8760 

AAAGATGCCT AGATTCCTTA ATAATGAAGG ATTCGTTGGT GATTCTGAGA TAGGCTAGTG 8820 

ATTGGCTCTA TAATGTCGCG GTTTAyaGTt GGATCTTCGC TCCAACTGCA TATATAGTnA 8880

CACTTTTCGC TTGGCGAATT AGTGTATCTT ACCTAATAGc TCCGCCTATT AGGTTCCATC 8940

ATTATTATAA ATAATAAGTA CACTACGGtT TACAGTTGGA TCTTCGCTCC AACTGCATAA 9000 
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GAAATTTTAA ATGTTGAAGG TATGAGCTGT GGTCACTGCA AAAGTGCTGT TGAATCTGCA 9120 

TTAAATAATA TTGACGGTGT CACTTCAGCT GACGTTAACC TTGAAAATGG TCAAGTAAGT 9180 

GTTCAATATG ATGACAGTAA AGTTGCTGTA TCTCAAATGA AAGACGCAAT TGAAGATCAA 9240

GGTTACGATG TCGTTTAATT AGGCAATATT CAACGTCATC AACACCAAAT TAAAAAATCG 9300 

AACTGATGAG AATCCCAACA ATCCAAATTA TCTCATCAGT TCGATTTTTA ATTTACTCGT 9360 

AACCTAGTAT CTCCAGTCTG CAATACATCT AATGTTGCAT CTAATGCATC GACAATTAGA 9420 

TTTTTAACTG CAGCTTCAGT ATAAAACGCA ATATGTGGTG TTAATATGAC ATCTTCCCTG 9480
TCAATCAACG ATTCTAACAA TGGATCGTTC AGTGTTTTGC CCCTTTGATC ACTTGGGAAA 9540

AGTTTGCGTT CAAATTCATA CGTATCAAGT GCTGCACCTT TAATCACACC ATTGTCTAAT 9600

GCGTCTAATA ACGCCTTAGT ATCTACTAAA GAACCTCTCG CACAATTGAC AAATACTGCG 9660 

CCCTTTTTAA AATGTTTAAA TAATTCAGCA TTAAATAGAT AATGATTATA TTTCGTTGCA 9720 

GGTACATGTA ATGTCACGAT ATCAGCACCT TCAACCGCTT CCTCAATCGT ATCTTTGTAA 9780

TCGACATACG TTGCAATTTT AGCATTAGGA AACGGtCGTA TGCGACCACA TCACTTTGAT 9840

AACCATTGGC AAATATATCG GCTACTACAC GGCCAATTCG ACCTGTACCA ATAACAGCTA 9900 

CTTTTAAATC TTTAATGGAT TTCGATAAAA TAGTAGGTTC CCATCTAAAA TCATGcTCCC 9960 

GCACTTTCGT TTGAATTTGA TTAAAATGAC GAACCACATT AATAGCCTGG TTCACAGCAA 10020 

ACTCCGCAAT TGAATTCGGA GAGTATGACG GCACATTTGA CACAATAAAG TTATACTTGT 10080 

TTGCTAACTC CAAATCATAT GTATCAAATC CAGCACTACG TTGTGCGATT TGTTTAATAC 10140

CTAGTTCATT TAATCGTTTA TAAACATGCT CTGATAATGG TATTTGTTGT GATAGCGATA 10200 

AGCCATCATA ACCAGCGACA CCTTCAACAT TGTCATCAGT TAATGCTTCT TTAGTAATAT 10260 

CTACCTCAAC ATGATGTTTC TCTGCCCACG CCTTGATATA AGGCATATCT TCATCACGTA 10320 

CACTCATGAT TTTAATTTTT GTCATTTTAA CATCACCCTT AACTTTATTA TTCATATAAA 10380 

TATGCTAGTT CTGTTAATCT TATTGCAGCT TCGTCTAATT TCTGGTCATC TAACGCCAAT 10440 

GAAATTCTCA CATAACGATT ACCATTCTCT CCAAATGGTT TCCCTGGAGC AACAAGTATT 10500 

GACTTCTCTT GCACTAAAAA TTGCTCAAAT TGCTCGCTGT CATAACCAGG CGGTGTTTCC 10560 

AACCATACAT ATATGCCACC TTTAGCATGA ACAAATGGCA AATCAGCTTT TGCAAGCATG 10620 

GCTTCGAATC GGTCACGACG TGTTTTAAAT ACATTGCTTT GTTCTTCTAA AAAATCATCA 10680 

TAATGATTCA AAGCATATAT TGCGGCATCT TGTAATGCAC CAAACATCCC AGCATTTGTG 10740 

TGCGTTTGGT ACTTTTTCAA AGCTTGAATC ATATCTTTAT TACCAACTGC AAAACCGACT 10800 
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CCATTTTCCG AAGCAAGTAT ACTAGGATTT TTAGCGTCGA AACCGAAAGC ACCATAAGCA 10920 

AAATCATGCA CGATTTTAGT GTCTGTACCT TTAAATTTAG CTATCGCTTC ATCAAAAACT 10980 

TCTTTCGTAG CTGTCGATCC AGTTGGATTA TTTGGATACG TTAAATAAAT GAGTTTTGTT 11040 

TTATCTATTA TTTGTGAATC AACTTTGGAC CAATCTGGCA AATAATGTGG CGGTTCTAAA 11100 

TTAAGCGGGA CTGGCTTGCC ATCAGCTAAA AGTACACCTG CTAAATAATC CGTGTAGCCT 11160 

GGATCAGGTA GTAATACATA GTCTCCTGGA TTGATAACAC ATGTTGGTAC TGCCACTAAT 11220 

CCATTTTTTG TACCATATAA AATGCATACT TCATCTTCTT TATCTAACGT CACATTATAT 11280

TGTCTTTGAT AAAAATCTAC AATAGCTTGC TTGAACGCTT CTTTACCATG AAAAGCACCA 11340

TATTTTTGAT TTTCAGGAAT AGTTAGTGCT TTTTGAAAAT GATCAATAAT ACCTTGTGGC 11400 

GTGGGCCCAT CAGGGATTCC AACTGCCATA TTAATTAATG GCAATGGTCC ATGTTCGATT 11460 

TTACGTCCCA TCGTTTTCCC GAAATAACTA TCAGGGATAT TTGCTAATTT GTTAGAGATC 11520

ATCAAATTCC TCCTCTATCA TTAAACATAG CCTGGGCGAC TATCATAATC CTAACAACTT 11580

GTATCACTCT CATTTAGATG GTTACAATGA CATCGCCATT CACCGTTATG TTCAACAGAA 11640 

CTTATGACAC ACGTTGTATT GAATGAATTT ATTTTCATTT TAGGTAGGTA TAATATTATT 11700 

GTCAATATTA GGAATTTTCA GATTAATATG CACTCAATCG TTATGATTTA ACTGTCATGC 11760 

ATATCCGCAT GCGCAACCAG TTAGATATGC TTATATAAAG TATAACGCCC ATCAAGGTAC 11820 

GTATTCAAAC GTGAACCTTA ACAGGCGTCA TTCATTGTTA AATAAAACTT CTTAAGCACA 11880

TACTTATTTC ACTATGCCTT TTACGTTCCC CTTATACTTT TCTCACATCT TTCTCTTAGA 11940 

CTACTCCCTT ATACGCCCCG CTCAATATCT TTAATCATTT CATCTACAGT TATTTTCGCA 12000

CTCGTTAAGA CAATAGGAAC GCCTGCACCT GGATGCGTAC TTGCACCTGC AAAATATAAA 12060 

TCTTTATAAT CTCGCGATAC ATTTTGTGGA CGATAATAAT TACTTTGCGC TAAAGTTGGC 12120

ATTAAACCGA ATGCCGAACC AAATTTCGCA TGATACGTTT GCTCAAAATC ATTTGGCGTA 12180 

AAGATTGTTT CTGAAACAAT ATGCGATTTT ATATCTTCAA ATACTTCAAT CGTTGCTAAT 12240

TTACGATAAA TAATTTCCTT TATTTGTTGC GTCAAAGCTT CATCTGACCA ATCGATTCCG 12300 

CTACCTGTTT TAAGTTCCGG CGTCGGCATT AGCACATAAA TACCAGTTTT GCCTTCTGGC 12360 

GCAAGTGATT TATCAGCGAC CGCTGGTACA TACACATAAA TAGAAGGATC ATATGATAAA 12420

CGTCCCTCAA ATATTTCTTC AATATTGCCT CTAAAGTCAT CTGAAAAAAT AACATTATGA 12480

AGTCTCACTT GATCTGTCAC ATCAATATCT ATACCGATAT ACATTAAAAA TGCTGAACAA 12540

GAGTAATCTA AGTCTGCAAT TTTATGTGGT GGATACTTTT TAATAGGTGC AAAATGTGGC 12600 
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ATGTCACCAT TCACTTTTAT CGCATCGGCC CGTTTGAATT TAGGATCAAT AATAATTTGC 12720 

TCAATTTCAG CATTTAGTTC AATATTAACG CCTAAGTCTT TATTTAATTG CGCTAGcCCT 12780

TGAGCCATGC CATACATACC GCCTTTAATA AAATGCACAC CAAACATCAT TTCAATCATA 12840

GGAATAATTG AATATAGTGA CGGGCCTCGT TTTGGATCAA TTCCTATGTA TAACGTTTGA 12900

AACGCTAAAA GCTTTTGTAT CTTTTCGTTA TCAATATAAT GTTCAATTAG CTGATCTGCA 12960

TGATTTAACG TTTTTAACTT AGCACCTTGC ACAAGTGACG TCATATTATA AAAGTCACTC 13020 

GGTTTGCGAT ACGTTCTTTC TAAGAAATAG CGACGTGCAA TTTCATATTT TTTATAAACA 13080 

TCCGTTAAAA AGGACATAAA ACCATGCGTT GAACCAGGTT CTATACTTTC TAGCATTTGC 13140
TGTAATTCAG CTAAATCTGT AGGCACCGTT ATACGATCAT CGTGGTCAAA ATACACATCG 13200

TAAATATAAC GTAATTGTCT CAATTCAATA TAATCTTCAT AATTTTTACC ACACGCTGTA 13260 

AAAACATCTT TATAAACATC TGGCATCATG ACAATTGTGG GACCCATATC AAATGTAAAG 13320

CCGTCTTTCT TTAATTGATT CATACGCCCG CCTACATTAT TATTTTTTTC AAATATCGTC 13380 

ACTTCATGAC CTTGAGAAGC AATACGGGCT GCCGCTGCTA ATCCTGTGAC ACCTGCACCA 13440

ATTACTGCAA TCTTCATTAT TCAACCACCT ATATTCTATG ATATTTACTA TTTATTTCAT 13500 

GAAACAACTT TGCCTTTTTC CTCTTATCCA CAAAAACACG TTCATGTAAT GTATAGTTAG 13560

CCTGTCTCAC TTCGTCCAGT ATTTCAATAT ATATACGTGC TGCTAATTCT ATGATTGGTT 13620 

GTGCTTCAAT ACTAAATACT TTGATTTGAT CCATAACATC TTGAAAATCT TTTTCTGCGA 13680 

TAGCTGCATA ATATTCCCAT AAGTCAATAT AATGATTATT AACACCATTT TGGTACACTT 13740 

CAGCAATATC AACTTCATAT TGCTTTAATC GTTGCTTACT AAAATATATC CGTTCATTGT 13800

CAAAATCTTC ACCGACATCT CTTAATATAT TAAnGGGATC CTCTAGAGTC GACCTG 13856 
(2) INFORMATION FOR SEQ ID NO: 32: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10088 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 32: 

ATATATAAAT ATAGATTAAG TATATAGATT AATCAACTTT TTTGGAAGAG CAAATCACGC 60 

AATCAACAAA TAATATAAGA AGTTTTTGCG ATAGTTTTAA AATAGCTGTA ATAGAATACT 120 

AAATGTGACA AACTTAGAAC TAATATCAAG TGTTGATGTT TTGAATATAA AAATGCTAAT 180
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ATAATTGGTT AATATATGAG TAATTAGAAA 
ATAGACAAAG GATGACGATT TATGTATATC 300

AATATGAAAG ATTATGGGTT AACAGGCATA AACAAAACTA AAGATACTCG AGCAATACAA 360

CGTGCGTTAA ATCGTGGAAG ATGTAAACCA ACGACAGTTT ATATACCGAA AGGGACGTAT 420

GATATTTGCA AACCATTAAC GATATATGGC AATACAACAC TTTTGTTAGA TAATGAAACT 480

ATTTTACGCC GATGTCATTC TGGTCCTTTA TTAAAAAATG GTCGTCGCTT TGGTTTTTaT 540

CGTGGTTATA ATGGACACAG TCATATTCAT ATTAAAGGCG GCAAGTTTGA TATGAATGGT 600

GTATCGTATC CTTATAACAA TACAGCTATG TGCATTGGGC ATGCTGAAGA TATTCAATTA 660

ATAGGTGTGA CCATTAAGAA TGTAGTGAGT GGTCATGCAA TTGATGCTTG TGGGATTAAC 720

GGACTCTATA TTAAAAGCTG TTCATTTGAA GGATTCATAG ACTATAGTGG CGAACcTTTT 780

ATTCTGAAGC AATACAATTA GACATTCAAG TACCTGGTGC TTTTCCAAAA TTCGGAACgA 840

CAGATGGTAC GATAACGAAA AATGTCATTA TCGAAGATTG TTATTTTGGA CCTTCAGAAT 900

TGCCCGAAAT GGGAAGTTGG AATCGTGCTA TTGGCTCACA TGCAAGTAGA CATAATCGAT 960

ACTATGAGAA TATTCATATT AGAAATAATA TATTTGAAGA TATACAAGGT TATGCATTAA 1020

CTCCCTTGaA GTATAAAGAT GCTTTCATTA TTAATAATAA GTTTATTAAC TGTGaGGGTG 1080

GCATTAGATA TTTAGGAGTT AGAGATGGTA AAAATGCAGC AGATGTGaTG ACAGGaAAAG 1140

ACTTAGGTTC CCAAGCAGGC ATAAATATGA ATATAATTGG AAATGAATTT AAAGGATCAA 1200

TGTCTAAAGA TGGGATACAT GTACGTAATT ATAATAATGT TAAACATAAA GATGTATTAA 1260

TCGTTGGGAA TACATTCAAT AATTCGACTC AATCAATTCA TTTAGAAGAT ATTGATACAG 1320

TGTTTTTAAG TCCTGTTGAA GCGGGTATTC AAGTTACTAC AATCAATGTA GATGAAATAA 1380

AAAAGTAAAA AGTTTCGCAT GACATTAGGA TTAAGAATAG TAGATAATTT TTGAAAGCGC 1440

ATTGATAAAA CGGTATAAAT ATGCTATAAT AAACCCAATT ATCTGATAAA AGGGGTATTT 1500

TGACGGTAAT GATAATACAA GATAGACAAC TTTCTATACT CTAATATAGT GAGTTGAAGT 1560

AGCTTGTCAT AATCATCATG AGGGGGAAAT TTATGGCTTA TTTCAATCAA CATCAATCAA 1620

TGATATCGAA AAGGTATTTA ACATTCTTTT CAAAATCAAA GAAAAAGAAA CCGTTTAGTG 1680

CGGGACAACT TATTGGACTA ATATTAGGTC CATTACTTTT CCTATTAACA TTATTATTCT 1740

TTCATCCACA AGACTTACCT TGGAAAGGCG TCTATGTTTT AGCGATTACT TTATGGATTG 1800

CGACTTGGTG GATTACTGAA GCAATTCCTA TTGCAGCAAC GAGCTTATTA CCAATTGTGT 1860

TATTACCATT AGGTCATATA CTTACACCAG AACAAGTATC ATCCGAATAT GGCAATGATA 1920

TTATCTTTTT GTTTTTAGGT GGATTTATTT TGGCAATTGC AATGGAAAGA TGGAATTTAC 1980
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TTGGATTCAT GGTGGCAACA GGATTCTTAT 
CTATGTTTGT ATCGAACACT GCAGCTGTAA 2100

TGATTATGAT TCCGATTGGT TTAGCAATTA TTAAGGAAGC ACATGATTTA CAAGAAGCCA 2160

ATACGAATCA AACAAGTATT CAAAAGTTTG AAAAATCTCT AGTTTTAGCA ATTGGCTATG 2220

CAGGTACGAT TGGTGGCTTG GGTACATTAA TCGGAACCCC GCCATTAATT ATTTTAAAAG 2280

GACAATACAT GCAACATTTT GGACATGAAA TTAGTTTTGC TAAATGGATG ATTGTAGGGA 2340

TTCCAACGGT CATTGTTTTG TTAGGTATTA CTTGGCTCTA TTTAAGATAT GTTGCGTTTA 2400

GACATGATTT GAAATATTTa CCTGGTGGTC AGACGTTAAT TAAACAAAAG TTAGACGAGC 2460

TTGGCAAAAT GAAGTATGAA GAAAAGGTAG TACAAACTAT CTTTGTACTT GCTAGCTTAT 2520

TATGGATTAC AAGAGAGTTT CTTCTGAAAA AATGGGAAGT TACGTCATCT GTTGCAGATG 2580

GTACGATTGC TATTTTTATA TCAATATTAT TATTTATTAT TCCAGCTAAA AATACTGAAA 2640

AACATCGCCG TATCATTGAC TGGGAAGTTG CAAAAGAGCT CCCTTGGGGT GTATTAATTT 2700

TATTTGGTGG CGGTTTAGCA TTAGCGAAAG GTATTTCTGA aagTGGTTTA GCAAAATGGT 2760

TAGGCGAACA GTTGAAATCA TTAAATGGTG TTAGTCCGAT TCTTATTGTA ATTGTCATAA 2820

CAATCTTTGT CTTATTTTTA ACTGAAGTGA CATCTAATAC TGCAACTGCA ACGATGATTT 2880

TACCGATTTT AGCAACGTTG TCTGTTGCTG TTGGAGTGCA TCCATTACTA CTTATGGCAC 2940

CTGCAGCTAT GGCGGCTAAC TGTGCATACA TGTTACCAGT AGGGACACCA CCGAATGCAA 3000

TTATCTTTGG TTCTGGTAAA ATATCTATCA AACAAATGGC ATCAGTAGGA TTCTGGGTAA 3060

ACTTAATCAG TGCAATAATT ATTATTTTAG TCGTGTATTA TGTAATGCCT ATAGTTTTAG 3120

GTATTGATAT AAATCAACCA CTGCCATTGA AATAGTAATT GCAGATTAGA ACGAAAAATA 3180

AAAGGTTACA TTAGCAATTG CTTGGACGAG TGGTAACGAA ACGTATACCG CAGCATCGTG 3240

TAASAACAAT ACAAACAAAA GAAAGTCAAC CAAGGATGGA TTCCTATTTT AATCCTTGGT 3300

TGACTCTTTA TTTTATTTAA ATTGTAGAAC CTAGAAAATA AAGTTTAATT AAAAGCACCA 3360

ATCATTTCTA CTTTGAAATC TAAGGTTTCT AAAATAGCAA TGACTTTCTT TATATCGGTT 3420

GTAATTGCAG AATCAGCCTG AACGAAAAAT CGATACATAC CTAATTGTGT TTTTAAAGGA 3480

CGAGACTCAA TCCAGGATAA ATTAATATTA AACAAAGCAA ATGTATTAAG CACACTTGCT 3540

AACAACCCAG GTTTATCATG CATTGGTGTA ATTAAAAACA TCAATGATGT CGCATTTTGA 3600

TCAAATTGCT GCTGATTTTT TATAACTAAA AAACGTGTCA CGTTATGTGG ATAGTCTTCA 3660

ATATGTGTAT CAATAGGTGT AAAACCATAA GctTCGCCAC TACCTAAAGG TGCAATTGCT 3720

GCAACGCCAT TTTCAATTTT AGTCAAACTT TGAATTGTAC TGTCGACATA ATCATAGTCA 3780
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TTTTTAATAT CAGAAATGGA ATCTGTTCCA TT AC CAT AT A ATGCAAAGTT AATATCTAAA 3 900 

CGTATTTCAC CGTGTGCAAA GACATCTTGC TGTGCAAGTG CATCTGCCAC AATGTTGATT 3960 

5 

GTTCCTTCTA TAGAATTTTC AATAGGGACA ACACCAATCG ATGTGTCATC ATCTGCAACT 4020 

GCCTTGATGA CTTCAAATAA ATTTGACTTT GGTTGAAAAG TTGCTTCATT TTCAGAAAAA 4080 

TACTGACGAC AAGCCAAATA TGAAAATGTA CCTTTAGGGC CTAAATAATA TAATTGCATA 4140

TGCTACACCT CTACTAACTT AATGATGGAA AGGGCACTGG TTAGCATTTG ATTCTTTCTT 4200 

TTTATAGAAA AAGTTTGGAT CTTTTACTGT ATTGTCATAT CCGTGATGAT AATTTGACGT 4260 

CAATGTTGGA GATAATGGCG GTGCTAGCCA AGACCATTTT CCGGTAACTT GACGACCTTG 4320

TTGTGCTTCG TTACGTTCGA ATAGTTCGAA TTGCTTTGCA GCGGTCAAAT GATCGACAAT 4380 

TGATACGCCT TCTTTTTTAA AGGAATGATA CACAGCATAG TTCAATTCAA CAAGTGCTCG 4440 

20 

ATCTTTATTA AATGAATTAT TTTTAAGTGT ATCAAATTCA AACGCATCTG CAACTTTTTC 4500 

TAGTAAATTG TAACGGTAAT CATCAATAAA GTTACGTACG CCAATTTCAG TTACCATATA 4560 

CCAACCGTTA AAGGGTGCAG TTGGATATAC AATGCCACCG ATTTTTAAGT CCATATTGGA 4620 

25 

AATGATAGGG ACTGCATACC ATTTTAAGTT CAATTTTCTT AATTTTGGAT AATGATTATG 4680 

TTCAATAGGT ACTTCTTTAA TTAATGAAGT AGGATATTCG TAAAATTTAA CTGACTCATT 4740 

AGGTAATTGG TAAATCAGTG GTAACACGTC AAAATTAGTA CCTTTTCCTT TCCAACCTAA 4800

GTGATTTGCT AAGCGTGTAA CTTCTTTTTC AGCAGGATCA CCACAATTGT CATAGCCAGC 4860 

ATAGCGAATT AATTGATTGT TGAAAATTTT AGGTCCATCC TTTGGAGCAT ATATAGTAAT 4920 

ATACGGCTTT AATTTACCTT CATTTGTAGC CTGTGTAATA TGATAAGTAA TTGATGATAA 4980

GAACGATGCT TCGTCAGTAA CATCTCTTGC ATCAATGACA TTTAACGAAT CCCAAAATAA 5040

ACGACCAATG CAACGATTTG AATTACGCCA AGCCATTTTA GCACCATAAA TAAGTTCTTC 5100 

40 

TTCTGTATGT GTATATGTCC CAGTTTCTTT TATTTCTAGT TCAATGTCAT GTAAACGTTT 5160 

ATTGATAATT TGCGTTTCAT AATGACACTC TTTATACATG TTTTCTATGA AAGCTTGAGC 5220 

CTCTTTAAAT AACATTAACA ACACCTCGCT TTATATTATA GTCTACATTA TTAAAATACT 5280

45 

CTTAAAAATT ATGTATATGT CATTAAATTG TTGGTTGATT TTAATTAAAA GTATGGAAAT 5340 

TAAGGGGCTC TTATGTATAT AAAAAAATGA ATTATGATAA AATGTAAGAA AATATTTAGG 5400 

TCGATTGGAG AGATACAAGT GTACCAATTA GAAGACGACA GTTTAATGTT ACATAATGAC 5460

TTATATCAAA TAAATATGGC TGAAAGTTAT TGGAATGATA ATATTCATGA AAAAATGGCT 5520 

GTATTTGATT TGTATTTTAG AAAAATGCCA TTTAATAGTG GCTATGCTGT TTTTAATGGT 5580 

55 
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TTAAAGTCTA TTGGCTACAA GGATGATTTC TTATCATATT TAAAAGATTT AAAATTCACA 5700 

GGCAGCATCC GTTCGATGCA AGAAGGCGAA TTATGCTTTG GTAACGAACC ATTGTTACGC 5760

GTAGAAGCAC CATTGATTCA AGCGCAATTA ATAGAAACAA TTTTATTAAA CATTGTAAAT 5820

TTCCATACAT TAATTACAAC AAAGGCTAGC AGAATTCGTC AAATTGCATC AAATGATAAA 5880

TTAATGGAGT TTGGTACACG TCGTGCGCAA GAAATTGATG CAGCATTGTG GGGCGCTAGA 5940

GCTGCTTACA TCGGGGGCTT TGATTCTACA AGTAATGTTA GGGCGGGGAA ATTATTTGGT 6000 

ATACCTGTGT CTGGTACACA TGCACATGCA TTTGTCCAAA CTTATGGAGA CGAATATGTT 6060 

GCCTTCAAAA AATATGCTGA AAGACATAAA AATTGTGTGT TCCTAGTAGA TACATTCCAT 6120

ACTTTAAAAT GTGGCGTGCC AAATGCAATA AAAGTTGCAA AAGAATTAGG TGACAAAATT 6180 

AACTTTGTAG GTATTCGATT AGATTCTGGA GATATCGCTT ATTTATCTAA AGAGGCAAGA 6240 

20 

CGTATGCTTG ATGAAGCAGG ATTTACTGAA ACTAAAATTA TCGCGTCTAA TGATTTGGAT 6300 

GAAGAAACGA TTACGAGTTT GAAAGCACAA GGTGCAAAAG TAGATTCTTG GGGCGTTGGT 6360 

ACAAAGCTGA TTACAGGATA CGATCAACCA GCATTAGGTG CAGTATATAA ACTTGTAGCT 6420 

25 

ATTGAAAATG AAGATGGTTC ATATAGTGAT CGTATTAAAT TATCAAATAA CGCTGAAAAG 6480 

GTTACGACGC CAGGTAAGAA AAATGTATAT CGCATTATAA ACAAGAAAAC AGGTAAGGCA 6540

GAAGGCGATT ATATTACTTT GGAAAATGAA AATCCATACG ATGAACAACC TTTAAAATTA 6600

TTCCATCCAG TGCATACTTA TAAAATGAAA TTTATAAAAT CTTTCGAAGC CATTGATTTG 6660 

CATCATAATA TTTATGAAAA TGGTAAATTA GTATATCAAA TGCCAACAGA AGATGAATCA 6720 

CGTGAATATT TAGCACTAGG ATTACAATCT ATTTGGGATG AAAATAAGCG TTTCCTGAAT 6780

CCACAAGAAT ATCCAGTCGA TTTAAGCAAG GCATGTTGGG ATAATAAACA TAAACGTATT 6840 

TTTGAAGTTG CGGAACACGT TAAGGAGATG GAAGAAGATA ATGAGTAAAT TACAAGACGT 6900 


TATTGTACAA GAAATGAAAG TGAAAAAGCG TATCGATAGT GCTGAAGAAA TTATGGAATT 6960 

AAAGCAATTT ATAAAAAATT ATGTACAATC ACATTCATTT ATAAAATCTT TAGTGTTAGG 7020 

TATTTCAGGA GGACAGGATT CTACATTAGT TGGAAAACTA GTACAAATGT CTGTTAACGA 7080 

45 

ATTACGTGAA GAAGGCATTG ATTGTACGTT TATTGCAGTT AAATTACCTT ATGGAGTTCA 7140 

AAAAGATGCT GATGAAGTTG AGCAAGCTTT GCGATTCATT GAACCAGATG AAATAGTAAC 7200 

AGTCAATATT AAGCCTGCAG TTGATCAAAG TGTGCAATCA TTAAAAGAAG CCGGTATTGT 7260

TCTTACAGAT TTCCAAAAAG GAAATGAAAA AGCGCGTGAA CGTATGAAAG TACAATTTTC 7320 

AATTGCTTCA AACCGACAAG GTATTGTAGT AGGAACAGAT CATTCAGCTG AAAATATAAC 7380 
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TAAACGACAA GGTCGTCAAT TATTAGCGTA TCTTGGTGCG CCAAAGGAAT TATATGAAAA 7500 

AACGCCAACT GCTGATTTAG AAGATGATAA ACCACAGCTT CCAGATGAAG ATGCATTAGG 7560 

5 

TGTAACTTAT GAGGCGATTG ATAATTATTT AGAAGGTAAG CCAGTTACGC CAGAAGAACA 7620 

AAAAGTAATT GAAAATCATT ATATACGAAA TGCACACAAA CGTGAACTTG CATATACAAG 7680 

ATACACGTGG CCAAAATCCT AATTTAATTT TTTCTTCTAA CGTGTGACTT AAATTAAATA 7740 

10 

TGAGTTAGAA TTAATAACAT TAAACCACAT TCAGCTAGAC TACTTCAGTG TATAAATTGA 7800 

AAGTGTATGA ACTAAAGTAA GTATGTTCAT TTGAGAATAA ATTTTTATTT ATGACAAATT 7860 

CGCTATTTAT TTATGAGAGT TTTCGTACTA TATTATATTA ATATGCATTC ATTAAGGTTA 7920

GGTTGAAGCA GTTTGGTATT TAAAGTGTAA TTGAAAGAGA GTGGGGCGCC TTATGTCATT 7980 

CGTAACAGAA AATCCATGGT TAATGGTACT AACTATATTT ATCATTAACG TTTGTTATGT 8040 

20 AACGTTTTTA ACGATGCGAA CAATTTTAAC GTTGAAAGGT TATCGTTATA TTGCTGCATC 8100 

AGTTAGTTTT TTAGAAGTAT TAGTTTATAT CGTTGGTTTA GGTTTGGTTA TGTCTAATTT 8160 

AGACCATATT CAAAATATTA TTGCCTACGC ATTTGGTTTT TCAATAGGTA TCATTGTTGG 8220 

25 

TATGAAAATA GAAGAAAAAC TGGCATTAGG TTATACAGTT GTAAATGTAA CTTCAGCAGA 8280 

ATATGAGTTA GATTTACCGA ATGAACTTCG AAATTTAGGA TATGGCGTTA CGCACTATGC 8340 

TGCGTTTGGT AGAGATGGTA GTCGTATGGT GATGCAAATT TTAACACCAA GAAAATATGA 8400 

30 

ACGTAAATTG ATGGATACGA TAAAAAATTT AGATCCGAAA GCATTTATCA TTGCGTATGA 8460

ACCTCGAAAC ATACATGGTG GATTCTGGAC TAAAGGCATT CGTCGTAGAA AGCTTAAAGA 8520

TTATGAACCA GAAGAACTGG AAaGTGTAGT AGAaCATGAA aTTCmAAGTA AaTGAGAaTG 8580

AAmCAATtGC TGATTGTTTG TCACGAATGA AAtGCAAGGG TATATGCCGG TAAAACGTAT 8640 

TGAfiAAACCC GTGTTTCAAG AGGAAAAAGA TGGCACGGTT GAAGTATCAC ATCAAGAAAT 8700 

40 CGTTTTTGTA GGTAAGAAAA TCCAATAACA TAATCCAATT TAAATAAAGA CTATTTGAAG 8760 

AGGAAAGGCT ATTCAAAGTT TGAGTAATTT TACTTTGAAT AGCCTATTTG TTTATACATG 8820 

CAAGATGCTC GATCCATATT GTATGAGAAA CCCCCAGCAA GCTATATAAA GCATATGCTG 8880 

45 

GGGGTTCTTA ATATTTTAAA AATTATTGTT AGATTATATA TATCGTCGCT TTTTCTAAAA 8940 

CAATCTCATC GCATGAAATT TTTTCTTCCT AGAGACCTTT AATAAGATTA ATAGTTTACT 9000 

TAATCATATC TAGATAGTCT TATGACTTAT GCTTAATGAA AGTCATTCTA GGAGAAGTTC 9060

CCAAAGCTTC TGTGTTCATA ATTGTTAGTA GTATTTTATT ATCATTTGGT ATAAATATTT 9120 

CAATAACAAT TGAGCTATTA TTTTTATTAT ATAATGTGAG TTGTTTGTGT TCTGTATTTA 9180 
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10 



CATTTAAATC TTGAGGATGC CATTCTCCCT CAATAATATT AAGATAATAC TTAGCCTCTG 9300

AATTACATTT GAATTTATCA ATACTAAATA ATTCAATTTG TTCCATAATA TTATTTACCT 9360

TTCTAAAATA CAAATTTTAA TAACCATAAA TAGATGAATA CCATCGATAA TGGTCGCCAT 9420

TGGATACTGG AATAACATTG TTTTTAGCAT CTTGAGTCAT AAAACCATTA TCCCATGGAT 9480

TCCATATAAT TATAACCTCT TGTCCATTAT CTAATTTAGC GTTCCCAACA ACTGCCATGG 9540

CATGCCCTGC GTGCATACCA TTTCTTGATT CTACTCTACT ACCTAAAACA GCAATTCCTT 9600

TATTATTTTT AGTAAGATTG TCAACTTCAT TATATGTAGT CATTCTATTA AGAAGTTGTG 9660

GACTTCTTCC CTGAGTTTGT CCAAAATAAA TCATCTCTCT TGGCGTTAAA CCAGTAAATT 9720

GGAATCGTTG TCCTTGTAAG TTTGGGTGTA AAAATCTCAT CACAGCTTCT GCATGATATT 9780

TGTTAGTATT ATAAGTCGCA TTTAGTAATT CAGACATCGT ATAGCCTGCA CACCAACCAT 9840

TGTTACCTTG AGTTTCTCTT ATCTTGAAAT TCTCAAGTTT ATTTATATAT TGsTCGTTGT 9900

AAGTATAATT ATTACTTTTA AATTGACTAG TTGGCATAGT GACAGAAGCT TTTTGCTTTA 9960

GTTGCGTTAG ATTATTGCCA GTAGGTATAC TCTCAGTCTT TnTnAACTnT nTATCTTCTA 10020

GACGTGGTGT TTTTAGTACT AGTTTAGCTT TATGATTTTG AGTACCACAT AGTAACCTTT 10080

TGAGTTGT 10088

(2) INFORMATION FOR SEQ ID NO: 33:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7563 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 33:

CGGAAACGnA CCCnATGCGT ATGCTTGACG TGCCAAAATT AAATACGAAG TTCATAGCTT 60 

TGAGGTACCA GAAGAACATT TATCTGGTCA AGAAGTCGCA GnACTCATAC AAGCAAATGT 120 

TAAAACAGTA TTTAAAACGC TTGTTCTAGA AAATACAAAA CATGAACATT TTGTATTTGT 180 

TATCCCAGTA AGTGAAACTT TAGATATGAA AAAGGCAGCT GCTTTGGTTG GAGAGAAGAA 240 

ATTGCAGCTT ATGCCTTTAG ATAATTTGAA AAATGTAACG GGATACATTC GTGGTGGGTG 300 

TTCGCCTGTT GGTATGAAAA CATTGTTTCC AACAGTCGTT GACAAATCGT GTGAAAATTA 360 

TAGTCATATC AGTGTGAGTG GTGGGCTTCG AACAATGCAA ATCACAATAG CTGTTGAGGA 420 

TTTGATTACA ATAACTAAAG GCAAAATTGG AGCAGTTATC CATGAATGAT TAATAACAAC 480 
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TGCCACACTC CTTTTTGATT GAATTAGCAT 
TTTACGATCA TAAACAGTCA TTATAATTGA 600

GTATTTGAAC ATAAAAATGT AATTTTATCG TAACAATTTG AGTGTTTGTG ATTGTTTTTG 660

GTAATTTATG ATTGAAAAGT GAAAGCGTAC TCATTATAAT ACAAAGTGAG ATGGGGTGAT 720

GATGATAATT ACTGaAAAAA GACACGAGTT AATATTAGAA GAACTTTCGC ACAAAGATTT 780

TTTGACTTTA CAAGAATTAA TAGATCGAAC TGGTTGCAGT GCTTCAACAA TACGArGAGA 840

TTTATCTAAA CTACAACAAT TAGGGAAATT GCAACGTGTG CATGGTGGTG CAATGTTAAA 900

AGAAAATCGT ATGGTTGAGG CGAATTTAAC TGAAAAATTA GCAACGAATC TTGATGAAAA 960

GAAAATGATT GCTAAAATAG CAGCTAATCA AATCAACGAT AATGAATGCT TATTTATCGA 1020

TGCTGGTTCA TCTACATTGG AGCTAATTAA ATATATTCAA GCGAAAGATA TCATTGTGGT 1080

AACCAATGGT TTAACACATG TAGAAGCTTT ACTTAAAAAA GGTATTAAAA CAATTATGCT 1140

AGGTGGTCAA GTTAAAGAAA ATACACTTGC TACGATTGGT TCTAGTGCTA TGGAGATATT 1200

AAGACGATAT TGTTTCGATA AAGCTTTTAT CGGGATGAAT GGATTAGATA TTGAACTTGG 1260

ATTAACTACT CCCGATGAGC AAGAGGCATT AGTTAAACAA ACAGCAATGT CATTAGCCAA 1320

TCAATCATTT GTACTTATAG ATCATTCTAA GTTTAATAAA GTATATTTTG CTCGTGTACC 1380

TTTGCTAGAA AGTACGACAA TCATCACATC TGAAAAAGCA TTAAATCAAG AATCGTTAAA 1440

AGAATACCAA CAAAAGTATC ACTTTATAGG AGGGACTTTA TGATTTATAC AGTGACTTTC 1500

AATCCTTCAA TTGACTATGT CATTTTTACG AATGATTTTA AAATTGATGG TTTGAACAGA 1560

GCAACAGCAA CATATAAATT CGCTGGGGGG AAAGGTATTA ATGTCTCGCG CGTCTTAAAG 1620

ACATTGGATG TTGAGTCAAC TGCCTTGGGA TTTGCAGGTG GATTTCCTGG GAAATTCATT 1680

ATAGATACAT TAAATAACAG TGCAATTCAA TCGAATTTTA TTGAAGTTGA TGAAGATACA 1740

CGTAJTAATG TGAAATTAAA AACAGGACAA GAAACAGAAA TCAATGCACC GGGTCCTCAT 1800

ATAACGTCAA CACAATTTGA ACAACTGTTA CAACAAATTA AAAATACAAC AAGCGAAGAT 1860

ATAGTTATTG TTGCTGGAAG TGTACCAAGT AGTATTCCAA GCGATGCGTA TGCGCAAATT 1920

GCACAAATTA CAGCACAGAC AGGTGCTAAA TTAGTAGTCG ACGCTGAAAA AGAATTGGCT 1980

GAAAgCGTTT TACCATATCA TCCACTATTT ATTAAACCTA ATAAAGATGA ATTAGAAGTG 2040

ATGTTTAATA CAACAGTGAA CTCAGACACA GATGTTATTA AATATGGTCG TTTGTTAGTT 2100

GATAAAGGTG CGCAATCTGT TATTGTCTCG CTTGGCGGTG ATGGTGCTAT TTATATTGAT 2160

AAAGAAATCA GTATTAAAGC AGTTAATCCA CAAGGGAAAG TGGTTAATAC AGTTGGCTCT 2220

GGTGATAGTA CAGTTGCAGG CATGGTGGCT GGAATTGCTT CAGGTTTAAC GATTGAAAAA 2280
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CGGGACGCTA TAGAAAAAAT 


AAAATCACAA GTTACGATTA GCGTACTTGA TGGGGAGTGA 2400

AAATAATGAG AGTAACAGAG TTATTAACAA AAGATACAAT AGCAATGGAT TTAATGGCAA 2460

ATGACAAAAA TGGTGTTATT GATGAGTTAG TAAATCAATT AGACAAAGCA GGTAAATTAA 2520

GTGATGTCGC GTCATTTAAG GAAGCGATTC ACAATCGAGA ATCACAAAGT ACAACTGGTA 2580

TCGGCGAAGG TATTGCCATT CCACATGCCA AAGTGGCCGC AGTTAAGTCA CCAGCTATTG 2640

CGTTTGGTAA ATCTAAAGCA GGCGTAGATT ATCAAAGTTT GGATATGCAA CCAGCACACT 2700

TATTCTTTAT GATTGcAGcG CCAGAAGGTG GCGCCCAAAC ACATCTAGAT GCTTTAGCTA 2760

AGTTGTCTGG TATTTTAATG GATGAAAATG TACGTGAGAA ATTATTACAT GCTTCATCAC 2820

CTGAAGAAGT ACTAGCGATC ATAGATGAGG CTGATGATGA AGTGACAAAA GAAGAAGAGG 2880

CAGAAGCTGA AGCACAACAA GTTGCAACTG CAGAACAATC ATCTAAACAA TCTAATGAGC 2940

CATATGTGTT AGCAGTAACT GCTTGTCCAA CAGGTATTGC ACACACATAT ATGGCACGTG 3000

ATGCATTGAA AAAGCAAGCG GATAAAATGG GTATTAAAAT TAAAGTAGAA ACGAATGGTT 3060

CAAGCGGCAT TAAAAACCAT TTAACTGAAC AAGATATTGA AAATGCAACA GGTATCATTG 3120

TTGCTGCTGA TGTTCATGTT GAGACGGATC GCTTCGATGG TAAAAATGTC GTAGAAGTAC 3180

CAGTAGCAGA TGGTATTAAA CGCCCAGAAG AATTAATTAA TAAAGCATTA GATACAAGTC 3240

GTAAACCTTT TGTTGCCCGT GATGGTCAAA GAAAAGGTAA CTCAAATGAC AGTCAAGAAA 3300

AATTAAGCCC AGGTAAAGCA TTCTATAAAC ACTTAATGAA CGGTGTTTCT AACATGTTGC 3360

CACTTGTAAT ATCTGGTGGT ATTTTAATGG CAATTGTATT TTTATTTGGA GCAAATTCAT 3420

TTAATCCAAA AAGCTCAGAG TACAATGCGT TTGCAGAGCA GCTTTGGAAC ATTGGTAGTA 3480

AAAGTGCATT CGCGTTAATC ATTCCAATTT TATCTGGATT CATTGCACGT AGTATTGCGG 3540

ATAAACCTGG TTTCGCTTCA GGTCTTGTAG GTGGTATGTT AGCAATTTCA GGTGGTTCAG 3600

GATTTATTGG TGGTATTATT GCAGGTTTCT TAGCAGGTTA CTTAACACAA GGTGTTAAAG 3660

CCATGACACG TAAGTTACCA CAAGCATTAG AGGGATTAAA GCCAACATTA ATTTATCCAC 3720

TATTAACAGT GACGGCTACA GGCTTATTGA TGATTTATGC CTTTAATCCA CCAGCATCTT 3780

GGTTAAATCA TTTGTTATTA GATGGATTAA ACAATTTATC AGGTTCTAAT ATTGTATTAT 3840

TAGGTTTAGT TATTGGCGCT ATGATGGCGA TTGATATGGG CGGTCCATTC AACAAAGCGG 3900

CATATGTTTT TGCAACAGGT GCGTTGATTG AAGGTAATGC AGCACCAATT ACAGCTGCAA 3960

TGATTGGTGG TATGATTCCA CCGTTAGCAA TTGCGACAGC GATGTTAATT TTTAGACGTA 4020

AATTTACAAA AGAACAACGT GGTTCAATTA TCCCTAACTA TGTGATGGGT ATGTCATTTA 4080
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• TGATTGGTTC AGGTATAGGT GGCGCAATTG 
CTTTAGGCTT AGGTTCAGGA ATTACTGCGC 4200

CACATGGTGG TATTATTGTA ATTGTTGGTA CTGATGGTGC ACACTTACTT CAAACTCTTA 4260

TTGCACTTCT AGTTGGCACA TTAGTTTCAG CATTAATTTA CGGTTTAATC AAACCAAAGT 4320

TAACTGAAAC AGAAATCGAA GCTTCAAAAT CAATGGACGA GTAGTTTTAA TGATGTAAAA 4380

TGATTGTTAG CAAAGAGCTT CATATTAAGT TGTATGTTCA ATGAATATAT GTTAGTTTTA 4440

TATATCGTGT TAACGGTAGC TTATACAAAG CTGTAAAAAC ACTTTCTATT AATTCAGTTT 4500

TTATGAATTG ATATGAAAGT GTTTTTATTT TTAGATAAAT GAATGAAGAA ATAGACACCA 4560

CAAATGTATA GACTTTTTTA ATATTTTGCA AAAAGTTATG CCAAACGAAG CAGATATAGT 4620

AAAATATGAG TGTCTTAAAG TGAAAATTTA TAAATAAAGA AGGGTTTATA CGTGTCAGAA 4680

TTAATTATAT ATAACGGCAA AGTTTATACT GAAGATGGCA AAATCGATAA TGGTTACATT 4740

CATGTGAAAG ATGGACAGAT TGTTGCAATT GGAGAAGTGG ATGATAAAGC AGCAATTGAT 4800

AATGATACGA CAAATAAAAT TCAAGTGATT GATGCTAAAG GTCATCATGT ATTACCAGGT 4860

TTTATTGATA TACATATTCA TGGTGGTTAT GGTCAAGATG CAATGGATGG GTCATACGAT 4920

GGCTTAAAAT ATCTATCCGA AAATTTGTTG TCTGAAGGGA CGACATCATA CTTGGCCACT 4980

ACAATGACGC AATCGACTGA TAAAATAGAT AATGCACTTA CAAATATTGC TAAATATGAA 5040

GCGGAgCAAG ATGTTCACAA TGCAGCGGAA ATTGTAGGTA TACATTTAGA AGGACCATTT 5100

ATATCTGAAA ATAAAGTTGG TGCTCAACAT CCGCAATACG TTGTACGCCC ATTTATCGAT 5160

AAAATTAAAC ATTTTCAAGA GACTGCTAAC GGATTAATAA AGATTATGAC GTTTGCACCT 5220

GAAATTGAAG GTGCAAAAGA AGCGCTTGAA ACGTATAAAG ATGACATTAT TTTTTCAATT 5280

GGTCATACAG TAGCAACATA CGAAGAAGCA GTTGAAGCTG TTGAGCGAGG AGCTAAACAT 5340

GTCACGCATT TATATAATGC AGCGACGCCA TTCCAACATA GAGAACCAGG TGTTTTTGGA 5400

GCAGCATGGT TGAATGATGC TCTACATACC GAAATGATTG TTGATGGCAC TCATTCTCAT 5460

CCGGCATCGG TTGCAATTGC TTACCGTATG AAAGGTAATG AACGTTTTTA TTTAATTACC 5520

GATGCAATGC GTGCAAAAGG TATGCCTGAA GGAGAATATG ATTTGGGTGG ACAAAAAGTA 5580

ACTGTTCAAT CGCAACAAGC ACGTCTTGCA AATGGTGCGC TTGCTGGTAG TATTTTAAAA 5640

ATGAATCATG GGTTACGTAA CTTAATATCA TTTACAGGTG ATACATTAGA TCATTTATGG 5700

CGAGTAACAA GTTTAAATCA AGCCATTGCA TTAGGTATCG ATGATAGAAA AGGTAGTATT 5760

AAAGTAAATA AGGATGCAGA TCTTGTTATT CTAGATGATG ATATGAATGT AAAATCTACA 5820

ATAAAACAAG GCAAGGTTCA CACATTTAGC TAATAAATAA TCATAATTAA ATGTATGCAA 5880
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TTTTCTGGGG GTGTCTAAAT GGGAAGGCGA TAACATGTAG TTGTAATTTA AGTCATAGTG 6000 

ATAAATTTGA ATGCGTGTTA CCCATGAGTG ACACATATAA CATGGAGGTG AATCCCTAGA 6060 

5 AATAGGGAAT TAATTGGAAA CTTCGACCAT AATTAGTTTG ATTATATTTA TTCTATTAAT 6120 

TGCATTAACC ACTGTATTTG TTGGTTCAGA ATTTGCATTA GTAAAAATTA GAGCAACAAG 6180 

AATTGAACAG CTAGCAGATG AAGGAAATAA ACCTGCTAAA ATAGTAAAAA AGATGATTGC 6240 

10 

TAATCTAGAT TATTATCTTT CTGCTTGTCA GTTAGGTATA ACAGTAACAT CTTTAGGGTT 6300 

AGGTTGGCTT GGTGAACCAA CGTTTGAAAA GCTATTACAC CCAATATTTG AAGCAATCAA 6360 

TTTACCAACT GCATTAACGA CGACGATTTC GTTTGCAGTG TCATTTATAA TCGTTACGTA 6420 

15 

TTTGCATGTA GTACTTGGTG AATTAGCGCC TAAATCTATA GCTATTCAAC ATACTGAAAA 6480

GCTTGCTTTA GTATATGCAA GACCATTGTT CTATTTCGGT AACATTATGA AACCATTGAT 6540 

20 TTGGCTGATG AATGGTTCTG CACGTGTTAT TATTAGAATG TTTGGTGTAA ATCCTGATGC 6600 

CCAAACTGAT GCAATGTCAG AAGAAGAAAT CAAAATTATT ATTAACAATA GTTATAATGG 6660 

TGGAGAAATC AACCAAACTG AATTGGCATA TATGCAAAAT ATCTTTTCAT TCGATGAAAG 6720 

25 ACATGCAAAA GATATAATGG TACCTAGAAC TCAAATGATT ACACTAAATG AACCTTTTAA 6780 

TGTAGACGAA TTACTAGAAA CAATAAAAGA ACATCAATTT ACGCGTTATC CAATTACTGA 6840

TGATGGTGAT AAAGACCACA TTAAAGGATT TATTAACGTC AAAGAATTTT TAACTGAATA 6900 

30 

CGCTTCTGGA AAAACGATTA AAATAGCAAA CTATATaCAT GAGTTGCCAA TGATTTCAGA 6960

GACAACACGT ATCAGTGATG CATTAATTAG AATGCAACGT GAACATGTAC ATATGAGTCT 7020

TATTATAGAT GAATATGGTG GAACGGCAGG TATTTTAACG ATGGAAGATA TTTTAGAAGA 7080

AATCGTTGGA GAAATTCGTG ATGAATTTGA TGATGATGAA GTGAATGATA TCGTTAAAAT 7140

TGAT5ATAAG ACATTCCAAG TAAATGGCAG AGTACTATTG GATGATTTAA CTGAAGAGTT 7200 

40 CGGTATAGAA TTTGATGACT CTGAGGATAT TGATACGATA GGTGGATGGT TACAATCTCG 7260 

TAATACCAAT TTACAAAAAG ATGATTACGT GGATACAACT TATGATCGCT GGGTTGTTTC 7320 

AGAAATCGAT AACCACCAAA TTATTTGGGT GATATTAAAC TATGAATTTA ATGAAGCGAG 7380 

45 

ACCTACTATC GGACAGTCTG ATGAAGATGA AAAATCAGAA TAGATATTAA TATATAAACC 7440

AACTAAGAAT GATTTAATTC ATTTTTGGTT GGTTATTTTT TTGACTAAAA TTAAnGAAAA 7500 

GTGAAAATAG TATTGGAACT CAATATCTTT AATGATTTAA TGAATAAnTT TTATTGAAAG 7560 

50 

CGA 7563 
(2) INFORMATION FOR SEQ ID NO: 34: 

55 
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25 



(A) LENGTH: 3492 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TTATATCAAC TTCATGGCGG AACCATTGAT GACCCATTAG ACGAAACAAT AAGCGCATTT 60 

SATGAATTGA AACAAGAAGG AATTATACGT GCTTACGGTA TTTCTTCTAT TCGCCCAAAT 120 

GTAATTGATT ATTATTTAAA ACATAGTCAA ATCGAAACGA TAATGTCTCA ATTCAATTTG 180 

15 ATTGATAATC GTCCAGAATC ATTATTAGAT GCAATTCACA ACAATGATGT TAAAGTATTG 240 

GCAAGAGGAC CTGTGTCTAA AGGATTATTA ACTTCAAACA GTGTTAATGT GCTCGACAAT 300 

AAATTTAAAG ATGGTATTTT TGATTATTCT CATGATGAAT TGGGTGAAAC AATAGCCTCT . 360 

ATTAAAGAAA TTGAAAGTAA TTTATCTGCA TTGACATTTA GTTATTTAAC ATCACATGAC 420 

GTGCTTGGTT CCATCATTGT AGGTGCAAGT AGCGTCGACC AATTAAAAGA AAATATTGAA 480 

AACTATCATA CTAAAGTTAG TTTAGATCAG ATTAAAACAG CAAGAGCTCG TGTAAAGGAT 540 

TTGGAATATA CCAATCATTT AGTGTAGAAG TCATTTTCAG TAATAAAAAC AGCAGCATGA 600 

GGCGTTTCAT TATAAAAATG CCTTACTGCT GTTGTTTATG TACAATTCGC TATAATTTAT 660 

30 GATTATGATT ACTCACTTAT GATAGAAATT AAAGCGTTGT CCTCACGCAT CAGTATTTAG 720 

TAATTTCGCC TTGCGGCATT GCCTTAAGCA AACTTCTGCC ACTTCATCTC TTAATAATTT 780 

TATTAAAACA TCTTTCTATA TTTCACTTCG CATGTTGATT CATCATTATT AGTTATTATT 840 

35 TGTACACCCA GCACATTTCC TTGCAACACA AGTAGTTTGA ATTTTTCACA AGTATAATAT 900 

AATGTACCGT CTGAAATTTG GTCTACAGAA ATATCGCCTA AAATATCCAG CACTGTAAAT 960 

TCTTCAAATA CTGATAGTTG TTCCGCATAT CGTACACAAA GTCTTACCAC ACTCTCCGAT 1020 

TGACAGTTCA TTGCCATCCC ACCTATTTAT GCTTTATTTT TAAATAATTT AGGGAAACAT 1080 

CGTTCAAAAA ATCTAGGCGC AATTTGATAC ATTTTCAACG CATGaTGCAT CCATTTAGGC 1140 

CGATTAATTT CCAATTGTTT TGTTTTAATG CCATAAATGA TATCTTCTGC AAGCTGATTA 1200

GCATCAAGCA TAATTTCCCC CATCTTTTTA gCATACTTCA TTGATGGGTC GGCTTTTTGA 1260

TGAAAAGGTG TATCAATCGG GCCAACATTA ACTGTCATGA TATGTAAGTT TGGTGACTCT 1320 

AGTCTTAAAG CATTCATTAA TGCATAAAAC CCTGCTTTCG ATGCCCCATA ATGTGCAGCA 1380

TTTGCTTGTG TGGAAAATGC AGCTTGACTT GAAATACCTA CAATATGTGC GTTAGATGTT 1440

AAATATGGTC TCAACACAGT ATATAAAACA TTAAAACTAA TTAAATTAAG CTGATACGTT 1500 
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TAAATGAATC CATCGAATGA TGTATTGTCT 
TCAAATTGCA GTGCCTGTAT CGACTTCAAA 1620

TCATTTAAGT CACAAGGAAT AACATTTATA GTTTTCCCCA ATTCCTGTTC AAAGATTCTA 1680

GTTGCTTTAT CAACATCACG CACCAACAAC GTTACATGCA CTTTATTTTC TAGTAACTTT 1740

CGGACAATCG ATAAACCTAA ACCACTCGTA CCACCAGTCA CTATAAAATG TTGTCCTTTC 1800

ATCAATTAAC CTTCCTTTTC AATTATATAG AATGCAATTT ATCAACTTTA CATAATTGAG 1860

ACAAGTTGAT TATCTTTCCT AATATATATA CAATAATAAG AAAATATAAC ATACAAATCA 1920

AAAACTAAAG GGATGTGaCG TTAATGrAAC TCGTATTTTA TGGAGCTGGT AATATGGCAC 1980

AAGCTATATT TACAGGrATT ATTAACTCmA GCAACTTAGA TGCCAATGAT ATATATTTAA 2040

CAAATAAATC TAATGAACAA GCTTTAAAAG CATTCGCTGA AAAACTAGGT GTTAACTATA 2100

GTTATGAtGA TGCGACATTA TTAAAAGATG CAGAyTATGT ATTTTTAGGT ACCAAACCAC 2160

ATGACTTTGA TGCTCTAGCA ACACGCATCA AACCACATAT TACAAAAGwC AATTGCTTCA 2220

TTTCAATTAT GGCAGGTATT CCGATTGATT ATATTAAACA ACAATTAGAA TGCCAAAATC 2280

CaGTTGCTAG AATTATGCCA AACACAAATG CGCAAGTTGG ACACTCTGTT ACTGGCATTA 2340

GTTTTTCAAA CAACTTTGAC CCTAAATCTA AAGATGAAAT TAACGATTTA GTTAAAGCAT 2400

TTGGTTCTGT AATTGAAGTA TCAGAAGATC ATTTACATCA AGTAACAGCT ATCACCGGAA 2460

GCGGCCCAGC ATTTTTATAT CATGTATTCG AGCAATATGT TAAAGCTGGT aCsAAACTTG 2520

GTCTAGAAAA AGAACAAGTT GAAGAATCTA TACGCAACCT TATTATAGGT ACAAGTAAGA 2580

TGATTGAACG TTCAGAtTTG AGCATGGCTC AATTAAGAAA AAATATTACC TCTAAAGGTG 2640

GTACGACACA AGCTGGCCTT GATACATTGT CACAATATGA TTTAGTATCT ATTTTCGAAG 2700

ATTGTCTAAA CGCTGCCGTC GACCGTAGTA TTGAACTTTC TAATATAGAA GACCAATAAA 2760

AACAgACCCG CCAACACATG TATGCATCAT CGCAAGCACT GTGTTTGACG GGTTATTTTT 2820

ATAATTTATT GTTATTTGGC AAGCATTGTT TATTACTTTG TCATTAGATT TTAAAACTAT 2880

CAAAATCTTT TACAAAATTA AAATTAGGTG TATCTTCATT TTGTATCAAT GTTTGATAAA 2940

TTTCATTTAT ATCTTCTGTA TTATAGCGAT TGCTCAAATG TGTAATCAAC GTACGTTTAA 3000

CATTGGCTTC TTTTATCAAT GCAAATACGT CTTCAATATG GCTATGATGA TAATTGTTGG 3060

CTAAATGCTT TTCACCATCT ATATAGGTCG CTTCATGTAC CATCACATCA GCATCTCTAG 3120

AAATCACACG TTCATTAGAA CATGGTTTTG TATCACCAAA AATTGCTACA ACTGGACCCT 3180

GTTTGGACTC ACCTCTAAAA TCTTTTGATT GATAAACTTG ACCATTATGT TCAAATGTAT 3240

CATGAGATTT TACTTCTTGA TATTTAGGAC CTGGTTCAAG ACCAATGTTT TTTAACGCTT 3300
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CATGATTAAG TAAATGCGCC TCTACAGTAA AACCATCCAT GATGATATGT CAGATGATCA 3420 

TCGATTTCAA TATATGtAAT TGGATAGTTT AAATGTGACT CTGATAAATT CATAGACATT 3480

TCCACATATG CT 3492

(2) INFORMATION FOR SEQ ID NO: 35: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1973 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 35: 

ATCTAGCGGT ACAAGCGTCT TGGAGGCTAG TATGTTGAAC ATTGTAAACC CTGAAGATCA 60 

20 CTTCGTTGTC ATTGTTTCAG GTGCCTTTGG TAACCGATTT AAACAAATTG CACAAACTTA 120 

TTACAAAAAT GTGCATATTT ATGACGTAAC ATGGGGAGAA GCTGTAGATG TCAAAGATTT 180 

CATCAATTTC CTTTCAACTT TAAATGTTGA AGTTAAAGCA GTATTTAGTC AATATTGCGA 240 

AACATCTACG ACAGTGCTAC ACCCTATTCA CGAGTTAGGA AATGCCATTA ATCAATTTAA 300

TAGTAATATT TATTTTGTAG TTGACGGCGT AAGTtGCATT GGTGCTGTTG ATGTTGACAT 360 

TAACAAAGAT AAAATTGATG TACTTGTTTC TGGTAGTCAA AAAGCAATTA TGTTACCTCC 420 

AGGATTAGCT TTTGTAGCTT ATAGCCACCG TGCAAAAGAA CATTTCAAAG AAGTAACTAC 480 

GCCAAAATTT TATCTAGACT TAAATAAATA CATTTCGTCA CAAGCTGACA ATTCTACACC 540 

GTTCACACCA AATGTGTCTT TATTTAGAGG TGTAAATGCA TACGTTGAAA CCGTAAAAGC 600 

AGAAGGTTTC AATCACGTAA TAGCACGACA CTATGCAATT AGAAATGCAT TAAGAAGCGC 660 

CTTAAAAGCA TTAGATTTAA CTTTATTAGT CAATGATAAA GATGCATCTC CAACGGTTAC 720 

40 AGCATTCAAA CCTAATACAA ATGATGAAGT GAAAATAATC mAAGATGAAC TTAAAAATnG 780 

CTTTAAAATA ACAATTGCnG GTGGTCAAGG CCATCTTAAA GGTCAAATTT TnAGAATTGG 840 

TCATATGGGG AAAATTAGTC CTTTCGATAT TTTATCGGTA GTATCTGCTT TAGAAATTAT 900 

TTTAACTGAA CACCGTAAAG TTAACTATAT CGGTAAAGGT ATATCAAAAT ATATGGAGGT 960

TATTCATGAA GCAATTTAAT GTACTCGTTG CAGATCCCAT ATCAAAAGAT GGTATCAAAG 1020 

CATTATTAGA TCACGAACAA TTCAATGTAG ATATTCAAAC TGGCTTGTCC GAAGAAGCAT 1080

TAATCAAAAT TATACCTTCA TACCATGCTT TAATCGTTCG TAGTCAAACT ACGGTTACTG 1140 

AAAATATCAT AAATGCTGCT GATTCTTTAA AAGTAATCGC ACGCGCCGGT GTTGGTGTAG 1200 
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GTAATACGAT TTCAGCTACT GAACATACAC TGGCAATGTT ATTATCAATG GCACGAAATA 1320 

TTCCGCAAGC ACACCAATCA CTTACAAATA AAGAATGGAA TCGAAATGCA TTTAAAGGTA 1380

5 

CTGAGCTTTA TCATAAAACA TTAGGTGTCA TTGGTGCTGG TAGAATTGGT TTAGGTGTTG 1440 

CTAAACGTGC GCAAAGTTTC GGAATGAAAA TACTAGCTTT TGACCCTTAC TTAACGGATG 1500 

AAAAAGCAAA ATCTTTAAGC ATTACGAAGG CAACAGTTGA TGAGATTGCC CAACATTCTG 1560

10 

ATTTCGTTAC ATTACATACA CCACTAACAC CTAAAACAAA AGGCTTAATT AATGCTGTCT 1620 

TTTTTGCCAA AGCAAAACCT AGTTTGCAAA TAATCAATGT GGCACGTGGT GGTATTATTG 1680

15 ATGAAAAGGC GCTAATAAAA GCATTAGACG AAGGACAAAT TAGTCGGGCA GCTATCGATG 1740 

TGTTTGAACA TGAACCTGCA ACTGACTCGC CTCTTGTTGC ACATGATAAA ATTATTCTTA 1800 

CACCTCATTT GGGTGCTTCA ACAGTCGAAG CTCAAGAAAA AGTGGCAATT TCTGTTTCAA 1860 

20 ATGAAATCAT CGAAATTTTA ATTGATGGTA CTGTAACGCA TGCAgTGAAT GCACCTAAAA 1920 

TGGACTTAAG CAATATAGAT GATACTGTAA AATCATTCAT CAATTTAAGC CAA 1973 
(2) INFORMATION FOR SEQ ID NO: 36: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 36:

35 GGTGTTTCAG ATGTCACTGG TTGATTTTTA ATTGTAGACG GGTATTTTGG GCTTTCGCCA 60 

TATTTATTTG CCGGCTTACT GTCAAAGCAT AGGAATACTA TCATAACAAT TGTTAGGCCT 120 

AAATfiAACAA AATAAAGAAG TACTAACAAA ATATTAAGAC CCATCGGCAT TAATGTAAAA 180

40 TCACTGTCAT AATAACTATC GATAATCTGT AATACTATAT AAAATATAAT ACTGAATACT 240 

GTCATAATCA TTGGAAATAA CATTGTTCTT GATATATCGT GAAATCTTCG AACGCACAAC 300 

GCTAAATTTG GAATAAACGT TGCCAAACTA TAGACAAAAG TATACACAGA TGTAAGGATA 360 

45 ATCATCAATA TACTCATAAC TATTAATGTT TCGTTATCCG CCGCTATAGA AATAAAGAAT 420 

AGAAATAGGT TTATTATTAG CACACACACA GCTGGAACCA TAAGTATCAA ATGCCATAGT 480 

GCCATATACC AATATTCACT ACGTCTTGAT CTCCCCTTAA AATTTACATA ATTTTTCCAA 540 

50 

AATAAAACGA ATGATTTCAT AAAACCTACT TGAGGTAATT GTTCCATTGT AATCTCCCTT 600 

TCGTTAATCA TATTTATATT TTTAATTATT GTTACCGTTA TAATTTACAA GATTCATTAT 660 
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GTAAAATGAA AACCCGCTAC AAGTACACAT CTATATGGAG ACTCATTTGA AAGTCAACGC 780 

TTCGTTAACT ATACTAAAAA TATGTCATAC TGCAATGTTC ACGTTTAAAA GAGTCTCAAT 840 

5 

CTATGCAAAT AAAATATTCC ATAACAAAGT ATATACTTTA CATTTTTATA ATTCTTAACA 900 

ATACTATTTT ATCAAACATT TACCACAATA AAAATATCTT TTTCATTTTT ATTTAAATTA 960 

ATCATATAAT TGCGAGGAGA ATATTATGGA TTTCGTTAAT AATGATACAA GACAAATTGC 1020 

10 

TAAAAACTTA TTAGGTGTCA AAGTGATTTA TCAGGATACC ACTCAAACGT ATACAGGCTA 1080 

CATCGTGGAA ACGGAAGCTT ACTTAGGTTT GAATGATCGT GCGGCTCATG GCTATGGCGG 1140 

15 TAAAATAACA CCTAAAGTCA CGTCATTATA TAAACGTGGT GGTACAATTT ATGCACATGT 1200 

CATGCATACG CATTTACTCA TTAATTTTGT AACAAAATCT GAAGGTATAC CTGAAGGCGT 1260 

ACTTATCCGC GCAATTGAAC CAGAAGAAGG TTTATCCGCT ATGTTCCGTA ACAGAGGTAA 1320 

20 GAAAGGCTAC GAGGTAACGA ATGGCCCAGG AAAATGGACT AAGGCATTTA ACATTCCACG 1380 

GGCTATCGAT GGCGCTACGT TAAATGACTG TAGATTGTCT ATTGATACTA AGAATCGTAA 1440

ATATCCTAAA GATATTATTG CTAGTCCACG AATCGGTATT CCAAATAAAG GTGATTGGAC 1500 

25 ACATAAATCT TTACGTTACA CAGTGAAAGG TAATCCATTT GTGTCTCGCA TGCGTAAATC 1560 

AGATTGTATG TTTCCCGAAG ATACTTGGAA ATAAATGCCA TCTTTCATTG ATTACTATCA 1620 

TGAAAATGAA ATCTATCTCC TTATAAGTCA ATCAATCGTG CCGTCAACAT GCGGATGGGT 1680 

30 

TGATTGTTTT TCTTTGTATC CATCATATTT TTTGATTCAT CTCCTCTTAT TGAACTTGTT 1740 

CTTAATTATA AAATATAACA ATAGAATTAT TTATAATTAT 7AAATTTAGA TGCATTAATA 1800 

TTATTGATAT TATTTTCAAA AACTAGAAAT ATTGATTTGT TGCATGTATA ATGTTAAAAG 1860 

35 

CGCCCTTTTA TAACGCTTAC ATATAAAAGC TTATTTAGGG AGAGGGATAT TCAACAAGGG 1920 

GGATTTGAAA ATGATAGAAC TTAATGCAAT TACAACATTA TGTTTAGCTT GTATCCTTTA 1980 

40 TTTACTTGGT AAGGCTATCG TTAATCACGT TAATTTTTTA AAACGTATTT GTATACCAGC 204 0 

ACCAGTGATT GGCGGCTTAA TCTTTGCTAT TTTAGTTGCG GCTTTGGATT CATTTGGCAT 2100 

GGTTAAGATT AAATTAGATG CTTCATTCAT TCAAGATTTC TTCATGTTAG CATTCTTTAC 2160 

45 GACAATCGGT CTTGGTGCAT CATTGAAATT ATTTAAATTA GGTGGCAAAG TCTTGCTATT 2220 

ATACTTTATG TTTTGTGCTA TCATTTCAGT CATTCAAAAC ATAGTTGGTG TATCACTAGC 2280 

AAAAGTATTA AATATTAAAC CTTTGTTAGG ATTAACAGCA GGTTCCATGT CTATGGAAGG 234 0 

50 

CGGTCATGGT AATGCTGCTG CTTATGGTAA GACAATTCAA GATTTAGGTA TTGATTCGGC 2400 

ACTGACAGCG GCTCTTGCAG CTGCAACTTT AGGTCTTGTA TTTGGAGGGC TTATCGGTGG 246 0 
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ATTTAAAGAT TATAGCCAAG TAGCATATAA CGAACATTTA CATAGTAAAT TTAATGCCAC 2580 

TGAAGTATTC TTCATTCAAT TTACAATCGT TGTATTCTGT ATGGCAGTTG GAAGTTATTT 264 0 

5 

CAGTCATTTG TTTACAGCTC AAACAGGGAT TAATGTTCCA ATTTACGTTG GCTCATTATT 2700 

TGTAGCTGTT ATTGTCCGAA ATATCTCTGA AAGTTTTAAT TTTAATATTG TAGATTTAAA 2760 

AATTACTAAT CAAATTGGCG ATGTCGCATT AGGTATTTTC TTATCTCTTG CGCTAATGAG 282 0 

10 

CATTCAATTA ATCGAAATTT ATAAACTTGC TATACCTCTT ATTATTATCG TTTTAGTTCA 2880 

AGTTGTCGTT ATGATTTTAT TTGCTGTTTT AATTTTATTT AGAGGTTTAG GAAAAGATTA 2940 

15 TGATGCTGCA GTAATGGTAG GTGGTTTTAT CGGTCATGGG CTTGGTGCAc GCCAAATGCC 3000 

ATGGCAAATT TAGATGTTAT TACTAAAAAA TATGGAAACT CACCTAAAGC ATATTTAGTT 3060 

GTACCTATTG TTGGTGCATT CTTAATCGAT TTAATTGGTG TTATAGTCAT TATGGGATTC 3120 

20 ATACAATGGT TTAGTTAAAC ACCAAACTCA TAAATAAAAG AGGAGGCCTT CGCCTCcTcT 3180 

TTTATTTATC CTCGATGTAT ATTCAAGTTA CGTTGTTCTA TCCATGACAA TATTTCCGGA 3240 

CTAAATACGA TTTGTTTTTG TGTTAAGTCG TCAATATTTT TAGCATCTAA CATCGTCATT 3300 

25 

ATTGATTTCA TGTGTTCAAT AAATGATTCT ACATAAGCTA CTGTATGTGC AATGCCATTA 3360 

TTTTCAACTT GATTTAAAAA CGGACGTGAC ATACCAGTTG CCTTTGCACC AAGTGCTAAA 3420 

CTTTTAATTG CATCGAGTGG TGTACGTAAA CCACCACTCG CGAAAACTGA AATTTCGCTT 34 80 

30 

TGATAAGCCG TTGTTTCAAG TAATGACTCA ACTGTAGACT GTCCCCATGA TGATAAGTAA 3540

TCCATATCTT TATTTGCACG ACGTTCATTT TCAATATCTA CAAAGTTAGT ACCACCTTTG 3600 

35 CCACTAACAT CGACATACTT GACGCCTATT TGTTGTAAGT CATGCATTAA TTCTTTGCTC 3660 

ATACCAAATC CAACTTCTTT TATAATGACT GGAACAGACA CTCGTGATAC AATCGACGCT 3720 

ATAl3t*ATCTA ACCAAGTCAC AAATTCACGA TTCCCTTCAG GCATAACTAA TTCTTGAGGA 3780 

40 GAATTAACAT GGATTTGTAA CGCTTGTGCC TCAAGTAATT CAACTGCTTC CAAAGCCTTT 3840 

TCTACTGGTA CGTCCGCACC AACATTGCTA AAAATCATGC CTTCAGGATT CATTTTTCGC 3900 

GCAATCGTAA ACGTCTCAGC CATGCGTGGA TTTCTCAATG CCGCATGTGT TGATCCAACT 3960

GCCATCGCTA AGCCAGTTTC TCTTGCAACT ACAGCTAGCT TTTCATTGAT GTTTTTCGTC 4020 

CACTCGCTAC CACCCGTCAT TGCATTAATA TAAACCGGAT ATGCCATCGT TAAGTCAGGC 4080 

GTCTGTGATG TCAAATCGAT ATCATTTACA TTAATTGATG GGATAGAATG ATGCACAAAA 4140

CGCATCTTAT CAAAATCTGA ATGCATTGCG TCAGATTGGG CCATTGCTAT TTCAACATGT 4200 

TCATTTTTTC TCTGTTCTCT TTGAAAATCA CTCATGATTA AACCTACCTT TTCGTCATTT 4260
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ATTACAGCTA AGCAAATATA ATATCCATAA TGTAAATGTA ATGCCGGCAT ATTTACAAAG 4380 

TTCATACCAT AAATCCCAGC TATGAATGTT AACGGTGAAA ATATAACTGA TACTAATGTC 4440 


AGTACTTGCA TAATACTATT CATTCTAAAT GACGTGTATG ACTCAAAATT TTCTCGTATT 4500 

TCGTTTGTCA TTTCTTGAGC AGTACGAATG ATATTACGTT GCTTAATCAA GTGGTCATCG 4560 

ATATGTTGAA TGTATAGCGA ATGTTTATTA TCTATAATCA AATCACCATT TTGTTTCATT 4620

GTATCAATTA GCTCTTGCAT AGGAAACAGT ACACGTTTTA CTTTAATCAA ATCCGAACGT 4680 

AACTTAAAGA CACTATCCAT GACCATTTTA TTAAAGCGAT CATCTACATG GCGGTCTTCA 4740

AAATGATAAA CACTATCTTC AAGTGCATAT ACAAAGTTGA AATATTTATC AACCATCATA 4800

TCTAAAATTA ATATGACGAC ATCTGCACAA TCTAATTCTG CATCTAATGT ATTCATATAC 4860

TTATAGACTA CTTTATTTAA TGATTCCAAC GTTTGATGAT GATATGTTAC TAATACATTG 4920 

TCTTGTATAA AAATATTTAG TGCTATTGGT GAATAGTTTG ACCCCATAAT ACTATGGAAT 4980

ACTAAGTATT GATAATCTTT ATAAGATTTA TATTTAGCTC GTGGCATACC GTTAATTGCA 5040 

TCATCCACTT CTAAATCATT AAAATTAAAA TGTGCTTTAA ACCATTCATT TTCTTGTTCA 5100 


TTCGGTTCAT CAAAATCATA CCAAACAATA GTCGCATCTT TTGGTATCTC TTTGATATCA 5160 

TCAACTACTT TAAACGGTTC ATATGTAGTT TGATACCGTA TCTTTAAAGC CATCGATACT 5220 

CCCCCTAAAT AACGAATTCT CTATTATTTT ATCATGAATT AAATAACGTG TATGTCTTAA 5280 


TTTATTTTAG TATGATAGTC ACTAAGGAGA TGGTTATTAT CAAACAACTT TTTACACATA 5340

CTCAAACCGT AACATCTGAA TTCATTGACC ATAACAATCA TATGCATGAT GCAAATTATA 5400

ATATCATTTT TAGTGACGTC GTGAATCGTT TTAATTACAG CCACGGTCTT TCTTTAAAAG 5460

AACGCGAAAA TTTAGCATAT ACGCTATTTA CACTAGAAGA ACATACGACA TACCTCTCAG 5520 

AATTSTCTCT TGGCGATGTA TTTACTGTTA CTTTATATAT TTATGATTAC GATTATAAGC 5580 

GGTTGCATTT ATTTTTAACA TTAACTAAAG AAGATGGTAC ACTAGCATCA ACAAATGAAG 5640

TAATGATGAT GGGAATTAAT CAGCACACAC GTCGTTCTGA TGCTTTTCCT GAATCATTTT 5700

CAACACAAAT AGCACACTAT TATAAAAATC AATCAACTAT CACTTGGCCT GAACAATTAG 5760 


GACATAAAAT AGCAATTCCA CACAAAGGAG CATTAAAATG ACAGATGCAT TACAACAAAA 5820 

GATTCATATC GAATTACTAG ATTTATTAGA TGATGTTAAG TTTGAATTAA CAGAATTAAA 5880

TGCACAAAAA GGGTTATACA TTAACGGACC AGCAAATCAG CTACTTAAGC GTGGCGTGCA 5940

TATGGCTTAT GTTCAAGGAC AAAAGCAAGC CATCGATAAT ATTATGACTA TTGTGGAACA 6000 

ACAGCTTGAA AGATCAACAT TTCCTAGAAC ATTATGATAA ATTTCAAAAT GAGGTTGCTC 6060 
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ATAATTTTTT AGATCAATTT TATCAAATTA AAGGGCAATA CTTTATCATC ACACATATCA 6180 

ATACACTTAT TGGTGATTTT CACTCAGAAG CTCATTAACA ATTAGTCTAT ATAACCCTTG 6240

CTATATTTTC AAAAACAAAA CCCAATTACG TTTTCATGTC AAATATCATC TTGCATGAAA 6300 

TCGTAACTGG GTCATTTATA TGTTATTAGT TATTTTGTGT TACATCCTCA TCTATCGATT 6360 

TGGCAATTTG TTTAATAGCT TTATGTGATT GTCTAATTGG ATAAATTGGA AAATCATGTA 6420 

CCATCTTAGG ATAATCATAA AACTCAATGT ATTGATGATG TTGCAACATC ATTTGTTCAA 6480

ATAGCTTCAT ATCAGGATGT GTCATTTCAC GTCCACCACC AAACATATAA ACTGGTGGCA 6540 

ATCCTTCTAT TGTGCCATTA ATTGGCGATA TGCGCTTATC TGTTAATGGT AGGCCATTCG 6600

CCCATTTTTT CATAATCTCA TTGACACCAA ACTGACTTAG aACCGCATCT TGTTCGATTA 6660 

AGGCGTCCGA AATATCTTTA TTAGATAGTG TTGCATCTAA AATTGGTGAG ATTAAATACA 6720 

ATTTATTCGG TAATGGCTGT TGATTAkCTA AAAGAGATTG TACAAAGGAT AATGCCAGTG 6780

CACCACCTGA ACCATCACCC ATGACTACGA CATTTTGATG TCCTACTTCA GATACTAATT 6840 

GaTCATAAAC ACGTTGTATC GCTTGGnAAA GTATCGTCaA TATGnAAACT CTGGTGTCTT 6900 

TGGATAGATA GGCAGTACAA CCTCATATAA TGtACTTAAA GTGATTTTAT CCCAACAATC 6960

TCCAATGGAA CGGTGATGGT TGTAGTGCAT TGAATCCACC GTGAATATAT AAAATTTTCT 7020 

TATCAATTTG ATGTCTGAAA TTAAAGCGAA AGACTTGCAT ATCATCTAAT GACAATTTTT 7080 

CTAAATTTGC TTTAACATTT AATGTTGAAG GCTGCTTATG TTTTTTTCTA TTTTCAATTT 7140

CTCTTTTATA AAAAAATCTT TCAACATCTT GATCATTTTT AAACATAATC GAGCGATTGT 7200 

GAAGCAAATA TTTATTGACA ACGCTATTCA TAACACGGTT TCTAATCAAT GTCTTAACCT 7260

ACCTTTATAT ATTTTATGTA TCCAATGATk GTCTATCCCC TACATTCTTT GCCAAAAAAA 7320 

GTATATAATG TAGAAGATAT TTTCTTTTTC ACTTTCAAAT TTAAGACTAC AATTGAACAG 7380 

TGATTTTTCA TCATTATAAC AGACAACTAG ACATATTGAT AAGTAAAGAA AAGAACTTTA 7440

TACGGAGGTA CCTTGCATGA CAAATCCAAA TCAACGATTA GAACCATTTG ATGAGACATT 7500 

TCAACAACCG AATATTCATC GTGGTAAGCG ATATGGTAAG AAAAAACGTT CATTGGTAAG 7560 

CATGATTATT CAAATCATTG TTGTwATATT AACCACCATC GCTGGAATAC AGCATGGTGG 7620

(2) INFORMATION FOR SEQ ID NO: 37: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9834 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 37: 





GTCATtACCG amTTTCtTAG AaTCATTTAA AGATGATAAA TATACAAACG TTGGTAATTT 60

AAAAGAAGTG AATTTTGATA AAATTGCTGC GACGAAACCC GAAGTAATCT TTATCTCTGG 120

ACGTACAGCT AATCAAAAGA ATTTAGATGA ATTCAAAAAA GCTGCACCTA AAGCGAAAAT 180

TGTTTATGTT GGTGCAGATG AAAAGAACTT AATTGGTTCA ATGAAACAAA ACACTGAAAA 240

TATCGGAAAA ATTTACGATA AAGAAGATAA AGCTAAAGAA TTAAATAAAG ATTTAGATAA 300

CAAAATTGCT TCAATGAAAG ATAAAACGAA AAACTTCAAT AAAACTGTTA TGTATTTACT 360

AGTTAACGAA GGTGAATTAT CAACATTTGG ACCTAAAGGT CGTTTTGGTG GATTAGTTTA 420

CGATACATTA GGATTCAATG CAGTTGATAA AAAAGTAAGT AATAGCAATC ATGGACAAAA 480

TGTTTCTAAC GAATATGTTA ATAAAGAAAA TCCAGATGTT ATTTTAGCGA TGGATAGAGG 540

TCAAGCGATA AGTGGTAAAT CAACTGCGAA ACAAGCATTA AATAATCCTG TATTAAAAAA 600

TGTTAAAGCA ATTAAAGAAG ACAAAGTATA TAATTTAGAT CCTAAATTAT GGTACTTTGC 660

AGCTGGATCA ACTACAACTA CAATTAAACA AATTGAGGAA CTTGATAAAG TTGTAAAATA 720

ATTTTAAAAG AGGGGAACAA TGGTTAAAGG TCTTAATCAT TGCTCCCCTC TTTTCTTTAA 780

AAAAGGAAAT CTGGGACGTC AATCAATGTC CTAGACTCTA AAATGTTCTG TTGTCAGTCG 840

TTGGTTGAAT GAACATGTAC TTGTAACAAG TTCATTTCAA TACTAGTGGG CTCCAAACAT 900

AGAGAAATTT GATTTTCAAT TTCTACTGAC AATGCAAGTT GGCGGGGCCC AAACATAGAG 960

AATTTCAAAA AGGAATTCTA CAGAAGTGGT GCTTTATCAT GTCTGACCCA CTCCCTATAA 1020

IXiTTTTGACT ATGTTGTTTA AATTTCAAAA TAAATATGAT AGTGATATTT ACAGCGATTG 1080

TTAAACCGAG ATTGGCAATT TGGACAACGC TCTACCATCA TATATTCATT GATTGTTAAT 1140

TCGTQTTTGC ATACACCGCA TAAGATTGCT TTTTCGTTAA ATGAAGGCTC AGACCAACGC 1200

TTAATGGCGT GCTTTTCAAA CTCATTATGG CACTTATAGC ATGGATAGTA TTTATTACAA 1260

CATTTAAATT TAATAGCAAT AATATCTTCT TCGGTAAAAT AATGGCGACA scgTGTTTCA 1320

GTATCGATTA ATGAACCATA AACTTTAGGC ATAGACAAAG CTCCTTAACT TACGATTCCT 1380

TTGGATGTTC ACCAATAATG CGAACTTCAC GATTTAATTC AATGCCAAAT TTTTCTTTGA 1440

CGGTCTTTTG TACATAATGA ATAAGGTTTT CATAATCTGT AGCAGTTCCA TTGTCTACAT 1500

TTACCATAAA ACCAGCGTGT TTGGTTGAAA CTTCAACGCC GCCAATACGG TGACCTTGCA 1560

AATTAGAATC TTGTATCAAT TTACCTGCAA AATGACCAGG CGGTCTTTGG AATACACTAC 1620

CACATGAAGG ATACTCTAAA GGTTGTTTAG ATTCTCTACG TTCTGTTAAA TCATCCATTT 1680
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AGTGTTCTTT TTGAATAATG CTATTACGAT AATCTAACTC TAATTCTTTT GTTGTAAGTT 1800 

TAATTAACGA GCCTTGTTCG TTTACGCAAA GCGCATAGTC TATACAATCT TTAACTTCGC 1860

CACCATAAGC GCCAGCATTC ATATACACTG CACCACCAAT TGAACCTGGA ATACCACATG 1920 

CAAATTCAAG GCCAGTAAGT GCGTAATCAC GAGCAACACG TGAGACATCA ATAATTGCAG 1980 

CGCCGCTACC GGCTATTATC GCATCATCAG ATACTTCGAT ATGATCTAGT GATAATAAAC 2040

TAATTACAAT ACCGCGAATA CCACCTTCAC GGATAATAAT ATTTGAGCCA TTTCCTAAAT 2100 

ATGTAACAGG AATCTCATTT TGaTAGGCAT ATTTAACAAC TGCTTGTACT TCTTCATTTT 2160 

TAGTAGGGGT AATGTAAAAG TCGGCATTAC CACCTGTTTT AGTATAAGTG TATCGTTTTA 2220 

AAGGTTCATC AACTTTAATT TTTTCATTTG GGATAAGTTG TTGTAAAGCT TGATAGATGT 2280 

CTTTATTTAT CACTTCTCAG TACATCCTTT CTCATGTCTT TAATATCATA TAGTATTATA 2340 

CCAATTTTAA AATTCATTTG CGAAAATTGA AAAGAAAGTA TTAGAATTAG TATAATTATA 2400 

AAATACGGCA TTATTGTCGT TATAAGTATT TTTTACATAG TTTTTCAAAG TATTGTTGCT 2460 

TTTGCATCTC ATATTGTCTA ATTGTTAAGC TATGTTGCAA TATTTGGTGT TTTTTTGTAT 2520 

TGAATTGCAA AGCAATATCA TCATTAGTTG ATAAGAGGTA ATCAAGTGCA AGATAAGATT 2580

CAAATGTTTG GGTATTCATT TGAATGATAT GTAGACGCAC CTGTTGTTTT AGTTCATGAA 2640

AATTGTTAAA CTTCGCCATC ATAACTTTCT TAGTATATTT ATGATGCAAA CGATAAAACC 2700 

CTACATAATT TAAGCGTTTT TCATCTAAGG ATGTAATATC ATGCAAATTT TCTACACCTA 2760 

CTAAAATATC TAAAATTGGC TCTGTTGAAT ATTTAAAATG aTGctACCGC CAATATGTTT 2820 

TGTATATTTT ACTGGGCTGT CTAAGAGGTT GAATAATAAT GATTCAATTT CAGTGTATTG 2880 

TGATTGAAAA CAATTAGTTA AATCACTATT AATGAATGGT TGAACATTTG AATACATGAT 2940 

AAAOTcCTTT GATATTGAAA ATTAATTTAA TCACGATAAA GTCTGGAATA CTATAACATA 3000 

ATTCATTTTC ATAATAAACA TGTTTTTGTA TAATGAATCT GTTAAGGAGT GCAATCATGA 3060 

AAAAAATTGT TATTATCGCT GTTTTAGCGA TTTTATTTGT AGTAATAAGT GCTTGTGGTA 3120 

ATAAAGAAAA AGAGGCACAA CATCAATTTA CTAAGCAATT TAAAGATGTT GAGCAAAAAC 3180 

AAAAAGAATT ACAACATGTC ATGGATAATA TACATTTGAA AGAAATTGAT CATCTAAGTA 3240

AAACTGATAC AACTGATAAA AATAGTAAAG AATTTAAGGC ACTACAAGAA GATGTTAAAA 3300 

ACCATCTCAT ACCTAAATTT GAAGCATATT ATAAGTCAGC AAAAAATTTG CCTGATGATA 3360 

CAATGAAAGT TAAGAAATTA AAAAAAGAAT ATATGACGCT TGCAAATGAG AAGAAGGATG 3420

CGATATATCA ATTAAAAAAA TTCATAGGTT TATGTAATCA ATCTATCAAG TATAACGAAG 3480
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10 






AATTAGCTGA TAATAAAAGT GAAGCAACTA ATCTTACGAC AAAATTAGAA CATAATAATA 3600 

AAGCGTTAAG AGATACTGCG AAGAAGAACC TAGATGATAG TAAAGAAAAT GAAGTAAAAG 3660 

GCGCGATTAA AAATCACATT ATGCCAATGA TTGAAAAGCA AATTACCGAT ATTAACCAAA 3720 

CTAATATTAG TGATAAGCAT GTTAATAATG CAAGGAAAAA CGCAATAGAA ATGTATTACA 3780 

GTCTGCAGAA CTATTATAAT ACACGTATTG AAACAATAAA GGTTAGTGAG AAGTTATCAm 3840 

AAGTCGATGT AGATAAGTTG CCGAAAAAGG GTATAGATAT AACTCACGGC GATAAAGCCT 3900 

TTGAAAAAAA GCTTGAAAAA TTAGAAGAAA AATAACTATA ATCATTTTTC AAAGTTAAAA 3960 

ATTTTGAATT TATGGTTAAC ATGTGAACTT ACTATGTGTA TAATGGTAAA CATTGATATT 4020 

AACTATATGT ATAAAAATGT CACGCAGATG CTATTTAAAT GTGATAAATA TTTTTAGAGG 4080 

TGAATAGAGT GGCTATAAAG CTAAGTTCAA TTGACCAATT TGAACAGGTT ATTGAGGAAA 4140

ATAAATATGT TTTTGTATTA AAACATAGTG AAACTTGTCC AATATCGGCA AATGCGTACG 4200

ATCAATTTAA TAAATTTTTA TATGAACGCG ATATGGACGG TTATTATTTG ATTGTCCAAC 4260 

AAGAACGCGA TTTGTCAGAT TATATTGCTA AAAAAACGAA CGTTAAACAT GAATCACCTC 4320 

AAGCATTTTA TTTTGTAAAT GGTGAAATGG TTTGGAATCG AGACCACGGT GATATCAATG 4380

TGTCGTCATT AGCACAAGCA GAAGAATAAT GAAACTATAG GGTTGGAACA TTTTGCCTTA 4440 

CACTACTAGA CGTGAATAGC ACAACTTAAA TTCGTGTGAA TCAGAGTAGT TTGGCTATAA 4500 

TGATGTTCTG ACCTTTTATT TTATGTCACC TTTAGAAGCA GTTAAGTTAG TACTTTTTTA 4560 

CAAACATATG TATAATATAT TCGAGTATTT TTATTGAAAa tATTTTGGAA AACGACGAAT 4620 

CCAATAAGAA AATTTAAACA TGATTTGTAA GTTAGTTTAA TAGGAAATAT ATGCTAAACC 4680 

AAAAGAAGCA TATTGTTATT TACTGGAATA ATTAATAATC ATGTCATGTT AAATGTTAGC 4740

ATATAATCAC GAGATAAAAT CTAAAATTTA AGATTAATCT TTTATGAATA AAAAACGTAT 4800 

CACAACAAAT AATAAAGTAA GGTGGTCAAG GTTATGAAAG TATTAGTAGC CATGGATGAG 4860

TTTCATGGAA TTATTTCAAG TTATCAAGCT AATAGATATG TTGAAGAGGC AGTTGCAAGC 4920 

CAAATTGAAA CTGCAGATGT AGTTCAAGTA CCATTGTTTA ATGGAAGACA TGAATTATTA 4980

GATTCTGTAT TTTTATGGcm ATCTGGGcaA AAGTATCGTA TACCAGTACA TGATGCAGAT 5040

ATGAATGAAG TTGAAGGTGT TTACGGACAA ACTGATACAG GGATGACCGT TATCGAGGGG 5100 

AATTTATTTT TAAAAGGTAA AAAACCAATT GTTGAACGAA CAAGTTATGG TTTAGGAGAA 5160 


ATGATTAAAC ATGCATTAGA TAACGACGCA AAACATGTTG TAATTTCACT AGGTGGGATT 5220 

GATAGTTTTG ATGCTGGTGC AGGTATGTTA CAAGCATTAG GTGCTCAATT CTATGATGAC 5280 
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GATATGTCGA ACTTACACCC TAAAATGGAA ACAGCAAGAA TTCAAGTAAT GTCGGATTTT 5400

TCAAGTCGAT TATATGGTAA GCAAAGTGAA ATCATGCAAA CTTATGATGC GCATCAGTTG 5460 

AATCATAATC AAGCAGCAGA AATCGATAAT TTAATTTGGT ATTTTAGTGA GTTATTTAAA 5520

AGTGAATTGA AAATTGCAAT TGGTCCAGTT GAACGTGGTG GTGCTGGTGG TGGAATTGCA 5580 

GCAGTCTTGA ATGGACTGTA TCAAGCTGAA ATATTAACCA GTCATGCATT AGTAGACCAA 5640

CTAACACATT TAGAAAATTT AGTTGAACAA GCGGATTTAA TTATTTTTGG AGAAGGATTA 5700

AATGAAAATG ATCAGTTGCT AGAAACGACA ACATTGCGTA TTGCAGAACT TTGTCATAAA 5760 

CATCAAAAGG TTGCCATTGC AATTTGTGCA ACTGCTGAAA AGTTTGATTT ATTTGAATCA 5820 


CAAGGGGTTA CAGCAATGTT TAATACATTT ATCGATATGC CAGAAACTTA TACTGACTTT 5880 

AAAATGGGtT ACAAATTAGG CATTATACGG TTCAGTCTTT AAAACTGTTG AAAACACATT 5940 

TTAATGTTGA GGTTTAGTAA AGAAGGACTA AATTGGTGAT GCTGTCATGA TGGTTAATAA 6000

CATTTATGAT GGTTAGCAAA ACGAATTAGA AGATCGAAAG TATACGTAAA AAATATGAAA 6060 

AATCACGCTA TCATTGCACT GAATGTTAGC GTGATTTTTA TATATTAATT AAGCCTGAGT 6120 

TGAACTAGTA TATAATCGTT GGTTTTTAGT GATTTTCAGC GATATCTTCT ACAATTCCAA 6180

TGATTACTTG TACTGCTTTT TCCaTAACAT CAATGGATGC aTATTCATAT GGGCCGTGGA 6240 

AGTTACCGCA ACCTGTAAAG ATGTTTGGAG TTGGTAACCC CATAAATGAC AATTGTGAAC 6300

CATCTGTACC ACCGCGAATA GGTTCAGTGT TTGCTGGAAT ATCTAATTTG GCAAAGACAC 6360

GTTTAGGTAT ATCAATAATA TGAGGCAATG GTAATATTTT TTCTGCCATA TTGAAATATT 6420 

GATCCGATAT ATCAACTTTA ACTGGATAAT TTTCAAAATG GGCATTGATA TCGTCACGTA 6480

TTTCTAAAAT ACGTTTCTTA CGCAATTCGA ATTGTTTTTT ATCATGATCA CGAATAATGT 6540

ATTGCAAAGT TGCTTTTTCA ACAGTTCCTT CAAAGTTCAT TAAGTGATAA AAGCCTTCGT 6600 

ATCCTTCTGT TCGCTCCGGA ACTTCACTAT CAGGTAGCAA ACTATCGAAT TGTTCACCTA 6660

AACGTATTGC GTTTACCATT GCATTTTTAG CTGAACCAGG ATGAACATTT ACACCGTGGC 6720 

ATGTAATAAC CGCTTCAGCA GCGTTAAAGC TTTCATATTG TAATTCTCCA TATTGACTAC 6780 

CATCCATAGT ATAAGCAAAA TCAGCATTGA AGCGGTCAAC ATCAAATTTA TGTGGACCAC 6840

GACCGATTTC TTCGTCTGGT GTAAATCCAA TGCGAATGGT ACCATGTTTA ATTTCTGGAT 6900 

GTTCTTGTAA ATAACAAATA GCTTCCATAA TTTCCACAAT ACCCGCTTTA TCGTCTGCAC 6960

CTAGTAACGA TGTACCATCA GTTACCATTA ATGTATGACC AACTAAACTG TTAAGTTCTG 7020

GAAATACTTT AGGATCTAAG ACACGTTTAG TATTGCCTAG TTTGTATGGC TTACCATCAT 7080 
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GCGCCAAAAA TCCAACTGTT GGGACGTCGA CATCGATGTT ACTTTCTAAT GTAGCAAATA 7200 

AGTAGCCATT TTCATCTAAA TCAGTTGGCA ATCCTAATTG TTGTAATTCT TTTTCTAATA 7260 

AATGTAACAA ATCCCATTGC TTTTCAGTTG AAGGTGTTGT TGTAGATTTT GGATCAGATT 7320

GCGTATCAAT TGTCGTATAT CTTGTTAATC TATCTATCAA TTGGTTCTTC ATTATATTCG 7380 

ACCCCTTAAA CTCTATTATT CATGTTGTAA GATTTTTTAT ATGTCTTACC TTTGATTTTA 7440 


CCATACAGTT GTTTGATACG TGTGTATAGG TAATATAGAA TTTCAGAAAC TAATATACCG 7500 

AAAGCAATCG CACCTGAAAT CAGTGTAcTT CTAAAAATGT ATTTACAGCA CTTGTATAAT 7560 

CATTTGATAC TAAAAAACGA GTCGCTTGAT AAGCTGCACC ACCAGGTACT AATGGTATAA 7620 


TGCCTGGCAC TATGAATATA ATTACCGGTC GTTTATATCT GCGACTCATA GTATGACTCA 7680 

TTAAGCCTAA AATTAAGCTT CCCAAAAATG AAGCGCCAAC TTTTCCAAAC TCTAAATCTA 7740 

CCGTTAATTG GTAAATCGTC CATGCAATGG CACCCACAAA TCCACATGCT ACTAAGAGGC 7800

GTTTGGGTGC ATTGAAAATG ATAGAGAAAA GTACTGTTGA TATAAAGCTG ATTGTAAAAT 7860 

GAAATAAATA AAATAGCATG CTTTAACAGT CCTTCCTTAA ATGATTAATA AAACGATTGC 7920 

GACACCAGCA CCGATTGCGA ATGCTGTTAA TGCAGCTTCA ACACCGCGAG ACATACCTGC 7980

AAGTAATTCA CCCGCTAATA AATCTCGAAT GGCATTGGTA ATTAATATAC CAGGGACAAG 8040 

TGGCATGACA CTGGCTATAG TAATGATATC TTGATTGGTT GCAATGCCTA ATTTAGTAAA 8100 

TGTGGCTGCA ATGGATATGA CCACAGCGGC TGCAACAAAC TCTGAGAAAA ATTTAATTTG 8160

TATATAGCGT tGCACAAAGC TGAATGTTAA AAATGCGGAT CCGCCAGCAA TGACTGCAAT 8220 

CCAACAATCT GATGCGACAC CACCAAACAT AAATAGGAAG AAGCCACATG CAATGGCAGC 8280 

TGCAAAGAAA TTCGTTAAAA AAGAATATTG TAATGATGCA TGCTGTAAAT GAATAAATTC 8340

AGATTTAGCT TCATCAATTG TGAGTTCTTT ATTTGATATT TTACGTGAAA GACTATTCGT 8400 

TAAAGCGATT TTCTCTAAAT GTGTTGTACG CTCTTGTACA CGAATTAATC TTGTACTTGT 8460

TCGATCGTTT AATGAAAAAA TAATTGCAGT TGAACTGACA AAACTATATG TATTATGAAG 8520 

ACCATAACTA TGTGCGATAC GGTTCATTGT ATCtTCAACT CGATATGTTT CAGCACCTGA 8580 

TTCaAGTAAA ATTCTACCTG CAATTAATAC AACATCAATC ACTTTGTTTT CATCTATAAT 8640

TGTGATTGAA TCTGGCATAT CAATTCACCT CCAATGATAT GTGTTATTTA TTTGAACAAT 8700 

TGaAGTTTAC AACTTGTTGT TACAACTTTC AATAGTGAGA CTTTGTGTTA GTATGATGAA 8760 

CTTGTATGGT TCAAATTTAA ATAAGAAAAA CTGTTAATCT TTGCTATTAT ACTATGATTT 8820

AATAATAGCA AAGGATTAAC AGTTTTGTCG TTGTTATAAA TTGATAATAG GGTTAAACAT 8880
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TTTACGCTGT GATTTTGGAT CGTCATCTGT TAAATAACCA ACACCGATAG ACACTGACAA 9000

TTTAATAACT TCTTTGTTTG GTAAATGGAA TGATGATTTT TCAACACCCG AACGAATATT 9060

TTCAGCTAAT TTAACACTTT GATCAAGTGA ATAATTGTGA ATGACAACTG AGAACTCTTC 9120

GCCACCATTT CTAAAAATTT TAAATTGATT CGGCACATAG TTTTTAAGTA ATTGAGACAT 9180

TTGTTTTAAT ACAGCATCAC CTGATTTGTG TGAGTAGGTA TCATTGaCAT CTTTAAATCC 9240

ATCGATATCG ATTAATAATA ATGCGATACT TTGATGTTCT TTTTCAGCTT TTCGTGAAAT 9300

TTCATTTAAA TGTCTATCAA ATTCTTTTAC ATTACCTAAG CCTGTTAAGT AATCATATTT 9360

ATCTTCGTTT TCATAACGAT TTACGAGTGA GAAGAAATGC CAAATATCGA CAAATGTTAT 9420

CGCTGAAGCT AAAGTGATAA TTAATGAAAT TGGTATTAAA ATGATAACTT CCGATAGTGT 9480

GTAAATAGGA CTCACTAACG CGACACCAAA TAAAATGATT ATTGTAACAA CATTAAGTAT 9540

TAATAATGAT AGCACATCAT TTTGTTTTAA AAATGGTCCA ATAGCACTTG TTACTGCAGC 9600

AATAACAATC AACGTAACAC CGTACATAAT CGAGTTGTTA AATACTACAA TTTCAACAAT 9660

TGCTACAATT ACTGTGGCAG ATAATGTATA GACCATATTT GTAAATCTAC CTAAAAACAA 9720

TAAAGGAACG AATGTTAAGT GAATTAAATA ATCTTCACGA TAAGGGATAG GGTAGACAGA 9780

TAATAATAAT GATACGATTG TCATTAAAAC AGTGACATAA GCCTTAGAAA AAAC 9834



(2) INFORMATION FOR SEQ ID NO: 38: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 23439 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 38:
TCTCAATCAG ATGAAAAATT GCATATCGTA GGTTTTACAG AAAGTGCAAA ATATAATGCG 60 


TCATCAGTCA TTTTCACGAA TGACGCTACC ATTGCCAAGA TCAATCCTAG ATTGACTGGA 120 
GATAAAATTA ATGCAGTTGT TGTACGTGAT ACAAATTGGA AAGACAAAAA ATTAAACCAA 180 

GAGCTTGAAG CGGTAAGTAT TAATGACTTT ATTGAAAATT TACCAGGTTA TAAACCACAG 240
AACTTAACAT TAAACTTTAT GATTTCATTC TTATTTGTCA TTTCAGCTAC AGTTATAGGC 300 
ATTTTCCTAT ATGTCATGAC ATTACAAAAG ACGAGTTTAT TTGGCATATT AAAAGCTCAA 360 

GGATTTACGA ATGGCTATTT GGCGAATGTG GTAATTTCGC AGACGGTCAT ATTAGCACTA 420
TTTGGTACGG CATTTGGCTT ACTGTTAACA GGCGTTACAG GTGCATTTTT ACCTGATGCA 480 
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TCTGTATTAG GAAGTTTATT CTCCATTTTA ACAATTAGAA AAATAGATCC GTTAAAGGCG 600 

ATTGGGTAGG AGGTGTAGCA AATGTTGAAA TTTGAAAATG TAACAAAGTC ATTTAAAGAT 660 

GGGAATCGTA ACATTGAAGC GGTTAAAGAT ACAAATTTTG AGATAAATAA AGGTGATATT 720

ATAGCATTGG TTGGACCTTC TGGCTCTGGT AAAAGTACAT TTCTAACTAT GGCAGGTGCT 780 

TTACAAACAC CGACATCTGG GCACATTTTA ATCAATAACC AAGATATTAC GACAATGAAG 840 


CAAAAAGCAT TGGCAAAAGT TAGAATGTCT GAAATAGGTT TTATTTTACA AGCTACAAAC 900 

CTTGTACCAT TTTTAAOGGT AAAGCAACAA TTTACATTAT TGAAAAAGAA AAATAAGAAT 960 

GTTATGTCTA ATGAAGACTA TCAGCAACTT ATGTCACAAT TAGGTCTAAC TTCATTGCTT 1020 


AATAAGTTAC CTTCAGAAAT TTCAGGTGGT CAGAAACAAC GTGTGGCGAT AgCaAAGCGT 1080 

TATATACGAA TCCGTCGATT ATTTTAGCGG ATGAACCTAC CGCGGCGTTA GATACTGAAA 1140 

ATGCGATTGA AGTCATTAAA ATTCTACGTG ATCAAGCCAA ACAAAGAAAG AAAGCATGTA 1200

TTATTGTTAC ACATGATGAA CGACTTAAAG CATATTGTGA TCGTTCATAT CATATGAAAG 1260 

ATGGCGTCCT TAATCTTGAA AATGAAACAG TAGAATAGTT TTATTAAGCC GGTACATCAT 1320 

GTGCCGGTAT TTTTATGTTT ATGTATTATT TGAATAAACT TTCACATTCA ATTAATAATA 1380

ATTATTATCG AAAATCAGAA ATATTCCGTG AAATATAATA TTTTTTGTAG TAAAATGGCC 1440 

TCTAAGTATT CAATATTTAA ATATGGGGAT TGAATATAAA ATTATCGTAA TGGGGGTCAA 1500 


TGGTTATGGA TTTATTGATA GGTACTTTAT TTTTATTTTT GGTCTTAGTG ATTTTTACAT 1560 

TATTTACATA TAAAGCGCCT AATGGTATGC GTGCCATGGG AGCATTAGCT AATGCAGCAA 1620 

TCGCAACATT TTTAGTGGAA GCATTTAATA AATATGTTGG TGGCGAAGTA TTCGGTATTA 1680 


AATTTTTAGA AGAGCTAGGA GACGCTGCGG GAGGTCTAGG TGGTGTCGCT GCCGCTGGAT 1740 

TAAGAGCATT AGCTATCGGT GTGTCACCAG TATATGCATT AGTTATAGCA GCCGCGTGCG 1800 

GTGGTATGGA TTTATTACCA GGTTTCTTTG CGGGTTATAT GATTGGATAT GTGATGAAAT 1860

ATACAGAGAA ATATGTGCCG GATGGTGTCG ACTTAATTGG ATCGATTGTC ATCTTAGCGC 1920 

CATTAGCTCG TCTTATTGCA GTATTATTAA CGCCAGTAGT GAATAGTACA TTGATTCGAA 1980 

TTGGTGATAT TATCCAAAGT AGTACGAATA CGAATCCAAT TATCATGGGT ATCATTTTAG 2040

GTGGTATTAT TACGGTTGTC GGCACAGCGC CATTGAGTTC AATGGCATTG ACAGCATTAT 2100 

TAGGTTTAAC GGGTGTACCT ATGGCTATTG GTGCCATGGC AGCATTTAGT TCGGCATTTA 2160 

TGAATGGGAC GCTATTCCAT CGCTTAAAAT TAGGTGATCG TAAGTCTACG ATTGCAGTAA 2220

GTATTGAACC TTTATCACAA GCAGATATTG TATCAGCCAA TCCAATTCCA ATCTATATTA 2280 
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ATGCGACAGG TACAGCTACA CCGATTGCAG GATTTTTAGT TATGTTTGGA TTTAATCATC 2400

CGACGACAAT TGTGATTTAT GGTGTAGTAA TGGCGATTGT AGGTGCGCTT GCAGGTTATC 2460

TTGGTTCAAT TGTATTTAAA AAATATCCAA TTGTTACTAA GCAAGACATG ATTAATCGAG 2520

GTGCAGTAGA CGCATAGCAT CATCATATTG AATAGTAAAA ACAAATAAAA CATAGTAACG 2580

TGATTCAGTC GATGTAACAG TCGATAATGA GTCACGTTTT TTTATAGAAA AATACAAGAC 2640

ATAAAAATGT CATAATTTAT TGTCGACAAA TATCATACTG TATAAACATT TATCATTTTC 2700

TCAAGTACCT TTTACACGAT GGAATGAACT TACTTTTTAC GAAATTATGC GTATTTTATA 2760

AACAAATATC ATTGATATAA CGGTAAATGT AAGCGTTTAC AACAGAAATA ACAGCATGCT 2820

ACGATATTTT TGTAAATTCA CTGATTCAAG TATTTTAAGT CAATATGAGG AGGGATGTTA 2880

TGAGCGATTC TGAGAAAGAA ATTTTAAAAA GAATTAAAGA TAATCCGTTT ATTTCACAAC 2940

GTGAACTTGC TGAGGCAATT GGATTATCTA GACCCAGCGT AGCAAACATT ATTTCAGGAT 3000

TAATACAAAA GGAATATGTT ATGGGAAAGG CATATGTTTT AAATGAAGAT TATCCTATTG 3060

TTTGTATTGG CGCAGCGAAT GTAGATCGTA AGTTTTATGT GCATAAAAAT TTAGTTGCAG 3120

AAACATCAAA TCCTGTAACG TCAACACGCT CTATTGGTGG CGTAgCAAGA AATATTGCTG 3180

AGAACTTAGG TAGGCTTGGC GAAACGGTCG CTTTTTTATC TGCTAGTGGA CAAGATAGTG 3240

AATGGGAAAT GATTAAACGA TTGTCCACAC CATTTATGAA TTTGGATCAT GTTCAACAAT 3300

TTGAAAATGC GAGTACAGGT TCATATACAG CTTTAATTAG TAAAGAAGGC GACATGACAT 3360

ATGGCTTaGC AGATATGGAA GTGTTTGACT ACATTACGCC TGAATTTTTA ATTAAGCGTT 3420

CACACTTATT GAAAAAGGCT AAGTGCATTA TTGTAGATTT GAATTTAGGC AAAGAGGCAT 3480

TAAACTTCTT ATGTGCCTAT ACCACGAAAC ATCAAATCAA ATTAGTTATC ACCACGGTTT 3540

CTTCCCCAAA AATGAAAAAT ATGCCTGATT CATTACATGC TATTGATTGG ATTATCACGA 3600

ATAAAGATGA AACAGAAACA TACTTAAATT TAAAAATAGA ATCTACTGAT GATTTAAAAA 3660

TAGCTGCTAA ACGCTGGAAT GATTTAGGTG TTAAAAATGT TATTGTGACA AATGGCGTGA 3720

AAGAACTCAT TTATCGAAGT GGTGAGGAAG AAATCATTAA GTCAGTTATG CCATCAAATA 3780

GTGTGAAAGA TGTTACAGGT GCAGGCGATT CATTCTGTGC TGCAGTAGTG TATAGCTGGT 3840

TAAATGGGAT GTCTACTGAA GATATATTAA TTGCTGGTAT GGTTAACGCA AAGAAAACGA 3900

TAGAAAGGAA ATATACAGTT AGGCAAAACC TAGATCAACA GCAACTTTAT CACGATATGG 3960

AGGATTATAA AAATGGCAAA TTTACAAAAG TATATTGAGT ATTCTCGAGA AGTTCAGCAA 4020

GCACGGGAGA ACAATCAACC GATTGTAGCA TTAGAATCAA CAATTATTTC GCATGGTATG 4080
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10 






GCCATTCCAG CAACCATAGC CATTATAGAT GGCAAAATTA AAATTGGTTT AGAAAGCGAA 4200 

GATTTAGAAA TACTGGCAAC TAGTAAAGAC GTTGCTAAAG TATCTAGAAG GGATTTAGCA 4260 

GAAGTTATTG CGATGAAGTG TGTTGGTGCT ACTACTGTAG CGACGACGAT GATATGTGCT 4320 

GCAATGGCTG GTATTCAATT TTTTGTTACA GGAGGTATTG GGGGCGTCCA TAAAGGTGCA 4380 

GAACATACGA TGGACATTTC AGCAGACTTA GAAGAACTGT CTAAAACAAA TGTCACTGTT 4440

ATCTGTGCAG GTGCCAAATC AATTTTAGAC TTACCTAAGA CGATGGAGTA TTTAGAAACA 4500 

AAAGGCGTTC CAGTTATTGG ATATCAAACG AATGAATTGC CAGCATTCTT CACTCGCGAA 4560 

AGCGGTGTTA AGTTAACAAG TTCGGTTGAA ACGCCAGAAC GACTTGCTGA CATTCATTTA 4620 

ACAAAACAGC AGTTAAATCT TGAAGGTGGC ATTGTTGTTG CTAATCCAAT TCCATATGAG 4680 

CATGCCTTAT CAAAAGCATA TATTGAGGCA ATCATAAATG AAGCTGTTGT TGAAGCGGAA 4740 

AATCAAGGTA TTAAAGGTAA GGACGCCACA CCGTTCTTGT TAGGGAAAAT TGTAGAAAAA 4800 

ACGAATGGTA AAAGTTTAGC AGCAAATATA AAACTTGTTG AAAACAATGC GGCGTTGGGT 4860

GCTAAAATTG CTGTCGCTGT TAATAAATTA TTGTAGGTGA TGATACATGA ATATTTTATT 4920

CGCTATCACA GGGATAGCAT TTGCACTATT TGTTGCGTTT TTATTCAGTT TTGATCGTAA 4980

AAAAATAGAC TTCAAAAAGA CGTTAATAAT GATATTTATT CAAGTGTTGA TCGTGTTATT 5040

TATGATGAAC ACAACGATTG GTTTGACAAT TTTAACTGCA CTAGGTTCAT TTTTTGAAGG 5100 

GCTAATAAAT ATTAGTAAAG CAGGCATAAA TTTTGTTTTT GGAGATATAC AAAATAAAAA 5160 

TGGCTTTACG TTCTTTTTAA ACGTATTACT GCCATTAGTT TTTATTTCTG TATTAATAGG 5220 

CATCTTTAAT TATATTAAGG TATTACCATT TATTATCAAA TATGTAGGTA TCGCTATTAA 5280

TAAAATAACT AGAATGGGGC GCTTAGAAAG TTATTTTGCT ATTTCAACAG CAATGTTTGG 5340

GCAACCAGAA GTATATTTAA CAATAAAAGA TATTATTCCA AGATTATCTA GAGCGAAATT 5400 

ATATACAATT GCGACGTCTG GTATGAGTGC TGTTAGTATG GCAATGCTAG GTTCATATAT 5460 
GCAGATGATT GAACCCAAGT TCGTAGTTAC AGCAGTAATG TTAAATATTT TTAGTGCGCT 5520

TATCATCGCC AGTGTAATCA ATCCCTATAA ATCTGATGAT ACTGATGTTG AAATTGATAA 5580 

CTTAACGAAA TCCACAGAAA CTAAAACATT GAATGGAAAA ACAGGAAAAC CTAAGAAAGT 5640

TGCCTTTTTC CAAATGATTG GTGATAGTGC GATGGATGGG TTTAAAATCG CTGTTGTAGT 5700 

AGCCGTAATG TTGTTAGCAT TTATTTCATT AATGGAAGCA ATTAATATCA TGTTTGGTAG 5760 

TGTTGGTTTG AACTTTAAAC AGCTTATTGG CTATGTGTTT GCACCAATCG CATTCTTAAT 5820

GGGGATTCCA TGGAGCGAAC TGTTCCAGCT GGCTCTTTAA TGGCGACTAA ATTAATTACA 5880 
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CAAGGTATCA TTTCAGTTTA CTTAGTAAGc TTCGCTAATT TTGGTACGGT TGGTATCATC 6000

GTAGGTTCAA TTAAAGGCAT TAGTGATAAA CAAGGAGAAA AAGTTGCATC CTTTGCAATG 6060 


AGGTTGCTAC TTGGTTCAAC TCTAGCTTCA ATCATTTCAG GATCAATCAT TGGCTTAGTA 6120 

TTGTAAATGA ATCGAAGTAC CTAAATTAAA TTCATGGCAA AGCTAAACCC CGTCACCAAG 6180 

TTGGCGCAAC AGCGcATgcA TAACTTAGTG ACGGGGTTTT ATCATAACAA TCTACTTTTT 6240 

CGTAGCCGTT TTTGAAATGT ATGTTGATGG TTTATCTTTT TCAAAAATTG TTAATCCCGT 6300

TATATCTTTT TTATGTTTTG AAGGGACAAT GAAGCTAAGT ATATAAGCAA AGACAAAAGC 6360 

AACTGTAAAT GAAATGGTAG ATACATAGAA AGGTGAGTTA CCTTTGCCAA CACCATTATA 6420

GACATAAGCA AAGATGATAC CCAATATTAA TCCACAAATA ACACCGAATG TATTCGTACG 6480 

TTTAGTGAAA ATACCAACTG CAAATACACC AGCCAATGGA ACGCCGAATA ATCCAGTCAC 6540 

AAACAAGAAT AAATCCCATA AGTCATTTGA ATTAGAAGCA ATTAAGTATA GTGACATTCC 6600

AAAACCGAAA ATACCTGCAA TGATAATAAT GAAACGTGCA AAGTTAACTT CGTGTCGCTC 6660 

GCTACCTTTT CCGAAGAAGC GTTGCTTAAT GTCGATTGAA ATACAAGCAG ATATAGAATT 6720 

TAAACTAGAT GAAATGGTAG ACTGTGCAGC GGCGAAAATG GCTGCAATAA GTAATCCTGC 6780

TACAAATGGT GGCATCTCAG TCAAAATGAA ATATGGCACT ACAGATGATG TATTGAAGCC 6840

TTTTGGTAAA ACAGCTTCAT GTGTATAAAA TGAATACAGC ATTGTACCCA TACCATAAAA 6900 

TAAGGGTGCT GAAATTAAAG CTAGGATACC ATTTGTCCAT AACGATTTAT TTGTTTCTTT 6960

TAAACTATCA GAAGCTTGAT AACGCTGCAC GACGTCTTGA CTCGCTGTGT ATTGATACAA 7020 

GTTGTTGAAA ATATTTCCTA GGAAAATAAT TGGAATGGCA GCTGCCGCAG TATTTAGTTT 7080 

CCAATTGTCT GCACTAATTA ATTTTTTGTG CTCAATCGCA TCTGCAAAGA CAGTGCCGAA 7140

ACCGtCTTTA ATGTTCACAA CACCTAGAAT AATAATAACT AAAGCGCCGC CTAATAAAAT 7200 

GACGCCTTGA ATGAAATCAC TCCAAACCAC ACCTTCGAAA CCACCTAAAA ATGTATATAA 7260

AATACATAGT AAACCAACGA GTGATGCAAC GATATAAGGG TTCATGTCTG ATACAGATGT 7320 

GATTGCTAAT GTTGGTAAGT AGATAACAAT TGCAACACGC CCTAAATGGT AAACGACAAA 7380 

TAATAATGAG CCAATGACAC GTATGCTAGG GCCAAATCTA GCTTCTAAAT ATTCATATGC 7440

AGATGTTACC TTTAACTTTT TAAAGAAAGG GACATAGAAA TAAATAAGTA ATGGAATAAT 7500 

TGCGACGATA GCAATGTTAC CAGCGATATA TGACCAATCT GTTAAAAATG CTTTCTCTGG 7560 

TGTCGACATA AATGTAATCG CACTTAACGT AGTAGCATAA ATTGAAAAGC CAACTACCCA 7620

AGATGGCAAG CGACCACTTG CGGTAAAGAA ACTATTGGTA CTTTGGCTCG CGCGCTTGGT 7680 
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TGTGCCAAAT CCAACTTCTT TCATGGGCAA CATCCCCTTT ACAATGTATT GATTCTTTGA 7800 

TGTCTATAAA TCGTATTTTG CAATGAGTTG ATCTAATGTT TGTCGATGTG CTTCGTTAAA 7860 

AGGTTTGAAA GGTCTTTTCG GTAATCCTGC ATCAATGCCA CGATGACGTA ATATTTCTTT 7920

CAATGTTGGA TAAATCCCCA TTGATAACAC TGTTTCGATA ATGTCGTTTG AATCATGTTG 7980 

CAGTTGGTAA GCTTCTTGAA TTTGACCTTG TCGTGCTAAG TCGAAGATTT TTCTTGCACG 8040

GCGACCATTA ACGTTATATG TAGAACCAAT TGCACCATCT ACGCCAGAAA TCGTAGCTTG 8100 

AACTAACATT TCATCAAAGC CAGATAAGAT TAATTTGTCT GGGAATGCTT TTCTAATACG 8160 

TTCGAGTAGG AAGAAGTTTG GCGCTGTATA TTTAACACCA ACAATTTTTT CATGATTAAA 8220 


TAGCTCGCTG AATTGTTCAA TAGAAATATT CACACCTGTT AAATCTGGTA TTGCATAAAT 8280 

AATCATATTG TTCTGAGTTG CTTCGATAAT ATCGAAATAG TAATCTCTAA TTTCTTCAAA 8340 

AGTAAATGGA TAGTAGAATG GTGTTACGGC AGAAAGTGCA TCATAACCGA GTTCTGTGGC 8400
ATATTTTCCA AGTTCAATGG CTTCATTTAA ATCTAACGAA CCTACTTGAG CAATCAATTT 8460

CACTTTATCC CCAACTGCCT CTTTGGCAAC CTTGAAAACT TGCTTCTTCT GCTCTGTATT 8520 

TAATAAAAAG TTTTCGCCTG AGCTACCATT TACATAAAGA CCGTCTAATT CTTCAGTTTC 8580

AATGGCATTT TGAGCAATTT GTTTAAGTCC TTGTTCATTT ACTTGACCAT TTTCATCAAA 864 0 

AGGAACGAGT AACGCTGCAT ATAAACCTTT TAAATCTTTG TTCATTATGA AGTCCCTCCA 8700 

AAAATCATTT GATAATATAG TTTACAGCTA TAATTGTAAA CGCTATCATA AAATGTAACA 8760

ATATCTTTTT GAAAATTGTA GTCATATTTA TGTATAATTA ATGAAAATGT TTTTCAAAAT 8820 

CAATAGAAAT GGAGTGAGTA AGGTGTATTA CATCGCAATC GATATTGGAG GCACTCAAAT 8880 

TAAATCGGCA GTTATTGATA AGCAATTGAA TATGTTTGAC TATCAACAAA TATCAACGCC 8940

GGACAACAAA AGTGAGCTTA TTACTGACAA AGTATATGAG ATTGTAACAG GATATATGAA 9000 

GCAATATCAG TTGATCCAAC CTGTCATAGG TATTTCATCA GCAGGCGTTG TTGATGAACA 9060

AAAAGGCGAA ATTGTATACG CAGGGCCAAC CATTCCGAAT TATAAAGGTA CTAATTTTAA 9120 

GCGATTATTA AAATCACTGT CTCCTTATGT CAAAGTAAAA AATGATGTAA ACGCTGCATT 9180 

ACTAGGCGAA TTGAAATTAC ATCAATATCA AGCAGAACGG ATCTTTTGTA TGACGCTTGG 9240

TACAGGCATT GGGGGTGCGT ACAAGAATAA TCAAGGTCAT ATTGATAATG GTGAGCTTCA 9300 

TAAGGCAAAT GAAGTTGGGT ATTTATTGTA TCGTCCAACT GAAAATACAA CGTTTGAGCA 9360 

ACGTGCTGCA ACGAGTGCAT TGAAAAAGCG CATGATTGCC GGAGGATTTA CGAGAAGCAC 9420

ACATGTGCCA GTATTGTTTG AAGCAGCTGA AGAAGGTGAT GATATTGCAA AACAAATATT 9480 
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AGGGCTTATA TTAATTGGGG GCGGTATATC TGAACAAGGA GATAATCTCA TTAAATATAT 9600 

CGAGCCGAAA GTTGCACACT ATTTACCAAA AGACTATGTT TATGCACCAA TACAAACGAC 9660 


TAAGAGTAAA AATGATGCAG CATTATATGG CTGTTTGCAA TGATAGTTGA AAGAAGGAGT 9720 

CATTCTAAAA TAGAATTTGA AACCGTTACG AGAGATGAGA GCTGTTGTTA GTTCCACACA 9780 

TCACACTCTA TCTAGGACCA ATCTAAACTA TATCAACCAA CAGTGTGCCA CGGGCAAATT 9840 


AAATTGAAGA AGCTGAGATA TTAAAATTTT AGAAAATGTA AAAAAATATT TGGTATTGAA 9900 

ATTAAAAAAG CACCTAGCAA CTCGTTGGGA CAATCACGAT GATTGTCTAC AGTTGCAGGT 9960 

GGATTTGAAT ATACTACTAG TTATTTGTTG TCTAGGATAA TAGATTTAGT ATGTTGATAA 10020

GTTTGACTCA GATTCGTATT TTCTAATAAA TGATAACTCA CGATATCGAT TAAAAAGAGT 10080 

GTCGCAATTT GTGTGTTGAT AAATTGATGG TCGGTATTAC GCGATTGATC CGTTGTTAAA 10140 

AGTACTAAAT CTGCACAATC TGTAAGTTTA CTACCTTCAA AATTTGTGAT GGCAACGACA 10200

TATGCACCAT GAGATTTGGC GACTTCCGCT GCAGAAATTA ATTCCGAAGT ATTACCACTA 10260 

TTTGACATAG CAATAAACAT ATCCGAATGA GATAGTAGGG ATGCCGATAT TTTCATTAAA 10320 

TGTGAATCGG TAGTAACATT ACCTTTTAGC CCCATACGAA TCATACGATA ATAAAATTCA 10380

GTCGCTGATA AACCAGAGCT ACCTAGTCCA GCAAAGAGTA TATGTCGACT TGATTGAAGT 10440 

TTGTCGATAA AGGTTTGGAT AATGTCGTTA TCAATAAATT CACCAGTTTG TTGAATGATT 10500 

TGTTGATGAT ATTTATGAAT TCTTTGAATA ATTGGGCTAT TTTCAATAAC TGTCTCTGTC 10560

ATTTCTTGTT GAATATTAAA TTTTAAATCT TGGAAATTCT CATAATCCAG CTTATGACTA 10620 

AAGCGTGTCA TCGTTGCTGG TGATGTACCA ATCGCATGGG CTAAGGAGTT AATCGTTGAA 10680 

AAGGCATCGC TATAACCATT TTGTCTTATA TAATTGACGA TGCGTTTATC AGTTTTTGTA 10740

AATAAATGTT GATAACGTTG AACACGATTC TCAAATTTCA TTGTGTCACC CCTTCATCTT 10800 

AATGATTACT ATTATATATG AAAAATATTT TCAAGATAGT AAAAAGCATT GATAAAAATT 10860 


ATCTTAATGA TATATTGTAA ATGACTTTAC GTGAAAAAAC GACTTATGGA GTGAGGAATA 10920 

ATGTTACCAC ATGGATTAAT AGTATCTTGT CAGGCACTAC CAGATGAACC ATTGCATTCA 10980 

TCTTTTATTA TGTCGAAAAT GGCATTAGCT GCGTATGAAG GTGGTGCTGT TGGTATTCGC 11040

GCAAATACTA AGGAAGACAT TTTAGCAATT AAAGAAACGG TAGATTTACC AGTTATTGGC 11100 

ATTGTGAAAC GTGACTATGA TCACTCAGAT GTTTTCATTA CTGCAACGTC AAAAGAAGTT 11160 

GATGAACTGA TAGAAAGCCA ATGTGAAGTC ATTGCATTGG ATGCAACGTT ACAGCAACGT 11220

CCGAAAGAAA CGTTAGACGA ATTAGTATCA TATATTAGAA CACATGCACC GAACGTTGAA 11280 
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

TATATTGGCA CGACGTTACA TGGCTATACT AGTTATACGC AAGGACAATT ACTTTATCAA 11400

AATGACTTCC AATTTTTAAA AGATGTACTA CAAAGTGTTG ATGCAAAAGT TATTGCGGAA 11460

GGTAATGTCA TTACACCGGA TATGTATAAA CGTGTGATGG ACTTAGGCGT TCATTGTTCA 11520

GTCGTTGGTG GTGCGATAAC ACGACCAAAA GAAATTACGA AACGTTTTGT TCAAATTATG 11580

GAAGATTAAA TGATAACGAT AAAAAAACGA GATGACCATC ATTAATTAAA GGCACCTAAT 11640

TATCTTAGGT GGCTGAATGA ATGTAATGGG TTCATCTCGT TTTGTTTGTT TATGATAGTG 11700 

ATTTTATTTT CAACTTTATC CAAAAATAAG TAAAGCGACG GGGATGGTGA TTAATAGCGA 11760 

CAACGCCACG CGTAAAAACC AAATGATGAT GAGTTTCCAG ACAGGTATTT TAATTTCAGT 11820 

TGCTAGTATA CATGGCACTA ATGCTGAGAA AAAGATAATG GCTGATACGC TTACTACACC 11880

GACGACAAAT TTAGTACTCA TTGCAGCTTT AGTTACTAAC AAAGATGGTA GAAACATCTC 11940 

TACAATAGAA AckCTGACGC TTTTGCTAGT AAAGCCTGAT CAGCAATTGG GAAAATATAA 12000 

ATAAATGGAT AGAAGATATA GCCAAGCCAA TCAATGAATG GTGTATAGTT CGCTACAATC 12060 

AGTCCTAAAA AACCAATCGA TAATATAGAA GGTAAAATAC CAACAGTCAT TTCTAAACCG 12120

TCTTTCAAAT TGTCCCAAAC GTTCTTCACG AGAGATGGTG TTAATGCATT TTGTTTCATC 12180 

GCCTCTGCAT ATGCAGTTTT CAGTCTGCTT CCTTCAATAG CAACTTCTTG TTCTCCTTCT 12240 

TGTCCGTTAT AATATTCTGT TGATTCATTG CTGATTGGCG GTAGCCATGC AGTAATTGCA 12300 

GTCACGACAA ATGTGATGAC TAAAGTTATC CAAAAGTATA AATTCCAATG CGGCATTAAT 12360

CCTAAAGTTT TAGCAACGAT AATCATAAAA GTTGCTGAAA CTGTTGAAAA GCCAGTCGCA 12420 

ATAATCGTGG CTTCTCGTTT GTTGTACATC CCTTGCTTAT AGACACGATT AGTAATCAAT 12480 

AATCCTAAGG AATAACTGCC GACAAACGAA GCCACTGCAT CGACAGCGGA TTTTCCTGGT 12540 

GTTTTAAAAA TAGGTCTCAT AATAGGCTCC ATATAAACAC CGACAAATTC TAATAAGCCA 12600 

TAGCCCACTA ATAAAGAAAG CGcAATTGCA CCTACTGGAA TTAAGATACT TAATGGCATC 12660 

ATTAATTTTT CAAACAAAAA CGGACCATAG TTAGCTTTAA ATAGTATTGA TGGACCGATT 12720 

TTAAATACAT ACATTATACC GATCATTGCA CCTGCAACTT TAAATAATGT AATGACCAAG 12780 

TTTGTGATTG AAGTCATAAA AGTACGTCTC ACTATTGGTA ACGCTGTACC AATTAAAATC 12840

ATAATCAGTG CAACATAGGG CATAAGTGGA CCTATGATTG AGCGAATGGC TAGATGAACA 12900 

TGATCGACGA AAATAGTGTT GTTACCATTA ATCGTAAAAG GAATAAAGAA ACATAGTATG 12960

CCCACTAAAC TATAGACAAA AAAACGCCAT GCACTTGGTT GTTGTGCATT AGAATGATAT 13020 

TGATTCATTA AAGCAACCCC TTTGTTTAAA TGAATACACA AAACTGTATG ATGCATCTTC 13080 
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ATAGTTTGAA TTATTTTCAT ACCAATACAA ATTAACTAAT TATATATAGA TTGAAACTAT 13200

ATTACTTAAT AAAATATTTA TCTTAAATGT TGTTGTGTTG ATTCAACACC ACAACTAAAA 13260

GTGTTTATAA ATTATTTGGA AATACACATA TTTGTAAATG ATTAGTATCG ATTTAATATC 13320

GTATTATTAA ATTTTTATTA ATTTTGTAGT CTTAATCmAA AAATAATATA TGTCATGTTA 13380

TATTGAAGGT GCAGTTGTTT TTCATTCTCA AGAGGGGGTC AAAAAAATAC TTTTGAGGTG 13440

ATTATATGTT AAGAGGACAA GAAGAAAGAA AGTATAGTAT TAGAAAGTAT TCAATAGGCG 13500

TGOTGTCAGT GTTAGCGGCT ACAATGTTTG TTGTGTCATC ACATGAAGCA CAAGCCTCGG 13560

AAAAAACATC AACTAATGCA GCGGCACAAA AAGAAACACT AAATCAACCG GGAGAACAAG 13620

GGAATGCGAT AACGTCACAT CAAATGCAGT CAGGAAAGCA ATTAGACGAT ATGCATAAAG 13680

AGAATGGTAA AAGTGGAACA GTGACAGAAG GTAAAGATAC GCTTCAATCA TCGAAGCATC 13740

AATCAACACA AAATAGTAAA ACAATCAGAA CGCAAAATGA TAATCAAGTA AAGCAAGATT 13800

CTGAACGACA AGGTTCTAAA CAGT*CACACC AAAATAATGC GACTAATAAT ACTGAACGTC 13860

AAAATGATCA GGTTCAAAAT ACCCATCATG CTGAACGTAA TGGATCACAA TCGACAACGT 13920

CACAATCGAA TGATGTTGAT AAATCACAAC CATCCATTGC GGCACAAAAG GTAATACCCA 13980

ATCATGATAA AGCAGCACCA ACTTCAACTA CACCCCCGTC TAATGATAAA ACTGCACCTA 14040

AATCAACAAA AGCACAAGAT GCAACCACGG ACAAACATCC AAATCAACAA GATACACATC 14100

AACCTGCGCA TCAAATCATA GATGCAAAGC AAGATGATAC TGTTCGCCAA AGTGAACAGA 14160

AACCACAAGT TGGCGATTTA AGTAAACATA TCGATGGTCA AAATTCCCCA GAGAAACCGA 14220

CAGATAAAAA TACTGATaAT AAACAACTAA TCAAAGATGC GCTTCAAGCG CCTAAAACAC 14280

GTTCGACTAC AAATGCAGCA GCAGATGCTA AAAAGGTTCG ACCACTTAAA GCGAATCAAG 14340

TACAACCACT TAACAAATAT CCAGTTGTTT TTGTACATGG ATTTTTAGGA TTAGTAGGCG 14400

ATAATGCACC TGCTTTATAT CCAAATTATT GGGGTGGAAA TAAATTTAAA GTTATCGAAG 14460

AATTGAGAAA GCAAGGCTAT AATGTACATC AAGCAAGTGT AAGTGCATTT GGTAGTAACT 14520

ATGATCGCGC TGTAGAACTT TATTATTACA TTAAAGGTGG TCGCGTAGAT TATGGCGCAG 14580

CACATGCAGC TAAATACGGA CATGAGCGCT ATGGTAAGAC TTATAAAGGA ATCATGCCTA 14640

ATTGGGAACC TGGTAAAAAG GTACATCTTG TAGGGCATAG TATGGGTGGT CAAACAATTC 14700

GTTTAATGGA AGAGTTTTTA AGAAATGGTA ACAAAGAAGA AATTGCCTAT CATAAAGCGC 14760

ATGGTGGAGA AATATCACCA TTATTCACTG GTGGTCATAA CAATATGGTT GCATCAATCA 14820

CAACATTAGC AACACCACAT AATGGTTCAC AAGCAGCTGA TAAGTTTGGA AATACAGAAG 14880
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ATTTAGGATT AACGCAATGG GGCTTTAAAC AATTACCAAA TGAGAGTTAC ATTGACTATA 15000 

TAAAACGCGT TAGTAAAAGC AAAATTTGGA CATCAGACGA CAATGCTGCC TATGATTTAA 15060 

CGTTAGATGG CTCTGCAAAA TTGAACAACA TGACAAGTAT GAATCCTAAT ATTACGTATA 15120 

CGACTTATAC AGGTGTATCA TCTCATACTG GTCCATTAGG TTATGAAAAT CCTGATTTAG 15180 

GTACATTTTT CTTAATGGCT ACAACGAGTA GAATTATTGG TCATGATGCA AGAGAAGAAT 15240 

GGCGTAAAAA TGATGGTGTC GTACCAGTGA TTTCGTCATT ACATCCGTCC AATCAACCAT 15300 

TTGTTAATGT TACGAATGAT GAACCTGCCA CACGCAGAGG TATCTGGCAA GTTAAACCAA 15360 

TCATACAAGG ATGGGATCAT GTCGATTTTA TCGGTGTGGA CTTCCTGGAT TTCAAACGTA 15420

AAGGTGCAGA ACTTGCCAAC TTCTATACAG GTATTATAAA TGACTTGTTG CGTGTTGAAG 15480 

CGACTGAAAG TAAAGGAACA CAATTGAAAG CAAGTTAAAT TCATCTTCTG AATTTAATAT 15540 

GCTATGTAAA TCGTGCTGTT ATCATGGCAC ATCAGATATA AGTAGCATCA CAGTGTTGAA 15600

TTTAAAAATA GTAAAGTGAA ATAAAGCGCC TGTCTCATTA GCGAAAACTA AAGGGACAGG 15660 

CGTATCTGTT TATGAGCTTA ATAAATTGTA TGAATAATAT GGTTGATCGA ATAACTGTTT 15720 

ATCATGATGA TAAATTGAGT TTTTTAAAAT AATGATATAT TACATCATTG TTATAGCGTT 15780

TAAGAAATCA ACAACTTTAC GATAAATAGT GATTGCTTCG TCATTAGGTC TACGATCAAA 15840 

ATCATGCTCG TTTTTATTCA CGCGTTCAAA TGTTGAATGT GGAACATGAT TCATGATATG 15900 

TTCGCTTTCC TCAACGGGAA CATCATAATC GCCATTACAA TGCGCAATGA AAACAGGTGG 15960

AAGTGTTTTA AGTTCATCTG GTGCAATATT ATATTTTGAA TTAGTATAAT CAGCAATGTT 16020

AATCATATTT ATCCATTTAC CTGTGCCACG TGCATAAACG TAGATTAAAA AACGTTGTGC 16080 

GATTTGATCT TGAACAACCG GTGTTGGTGA AGTGAGTTGT GCAATCATTG TTTCGTTTAC 16140 

GCTTTGAGCT ATTTTTGCGT AATAACTATT AGTTGTTTTA AAAGGTTCAG TGTTGATGCG 16200 

ACTATAACCA TAAAAATCAA TAACACCATC AATATCTCTG TCTCGTGCAA TTAATAGACT 16260 

TAAATATGCA CCTGATGATC TGCCAAAGGT AAAAATAGGG CAATTAGAAT ATTGTGATTG 16320 

AATCGCATCG AATGAtGCgn AGnACATCCT CAATAATGCA ATCGAGACTT ACTTCTGGTA 16380

ATAAACGATA ACTTAGTTGA ATTAAATCGT AATGTTCCGT AAgATATCGA TATACTGTGG 16440

GGATAAATCG TTAGCTTTAC CGAACATTAA TCCACCACCG TGGATGTAGA CAATAGCGCC 16500 

TTTTGTTGGT TGATTTTTTG CTTTAATAAT TGTGTAAGGT AATGCAAATG CATCTTTAGT 16560 

AATTACTTTA TCTTTAATTT CAGTCACGAT TTAATAGGCT CCTTATTTTT GATATTGATG 16620

TCATTATAAC ACTGTCTTAA ATTTCCATGA AAAATAGTCT TAAGACGATG AGTCATGATA 16680 
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CATCATTTTA ACAATATCTT TAAAAGCAGC ATGTGGAATG GCTAAATCTT CTAAATCTGC 16800 

CATAGAAAAT TCAAGATTGA TATCATGTGG TCGCTGTTCA GCAAGTTTAT GCACAAAGTC 16860 

AGGTTCTGTG ACAAAAGGCG AAGACATGCC GACCATATCT GCATGTTGTA AAGCATCTAA 16920 

AGCAGACTCT GGAGAATTAA TCCCGGCACT TGCAATTAAA GGGATACGAC CTGCTAAATG 16980 

TTCATAGACA ATTTGGTTAA CTGGTCGACC GAAATGATCA CCTGGTGTAC GAGACGTATT 17040 

TTGATAAATA TGTCGACCCC AGCTAGCGAT TGCTAAGTAT TGGATGTTTG AAACGTCCAT 17100 

GACCCAATTG ATTAATTGGT TGAAGTCGTC AATGGTATAT CCTAAATCAC TGCCTCTGGT 17160 

TTCTTCTGGC GTTGCTCGAA ATCCTAAAAT AAAATTGTCA GGTGCTTCTT TATCAATCAC 17220 

TTCTTGTACC GCACGCATAA CTTCTAAACA TAATCTTGCA CGATTTTTTA ATGAGTCGGC 17280 

ACCGTAATGG TCTGTACGTT TATTCGAAAA AGTTGAGAAA AATGTTTGAA TCAGCAAACG 17340 

TTGTGCAATC GAAATTTCCA CACCATCAAA ACCTGCTTTA ATCGCGCGTA ATGTAGCATC 17400

GCGATACTGC TGAATGATGC TATTGATTTT CTCATGAGAC ATGGCGATAA CATCGTGTTC 17460 

AATCGGTGAA TGCAATGTCA TAGGGCTTGG TCCATACACC TTTCCAAAAT TTAAAATGGC 17520 

TTGATTTGAA AAACGACCAG CATGCGCTAg CTGGATAATA GCGAGGCTAC CATGTTGTTT 17580

CATCGTAGAT GCCATGTTAG TTAATCCAGG GATACAAGCA TCATGATCAA TATTAAAGCC 17640

ATATTCAAAC AATTGACCAT AAGGTTCAAT GTAAGCAGCG CCGGTGACTT GCATTCCAGC 17700 

TGAATTAGAG CGACGTGCAG CATAAGCCAA GTCTTCTTTT GTAATATAGC CTTCTTTTGT 17760 

TGATGTGTTT ACGGTCATTG GTGATAATAC AAAGCGATTC GAAATTTTGA TGCCATTAGG 17820 

TAAGTGGATT GATTGTAAAA GTGGTTTGTA TCGGTACATA CTATGATTCC TTTTCTATTC 17880 

AATATTGTTT TCAAAGTACC ATGGAAAGAA TGAATAATCA ATGATGAACA GTCTTGATAG 17940 

AATAGAATTG GTACATGGAA AGTATTTTTA AAATTAAACT AATGAATGGC ATTTGTAGGT 18000 

CTGAAAATAT GAATATGAAA AAGAAAAATA AAGGCGAAAA GATATAAAAG TTAATTGAAA 18060

AACGTTATCA TATACGTGGG TATATGAAGA GGGAATGGTA TTAAGAACGC TAAAATGTTA 18120 

TGTCGGTTTG ACATGACAGG ATAAGTTTGG AGATGACGGA TTGGTTAAAT TAAGCGTATT 18180 

AGACTATGCC TTAATAGATG AAGGTAAGGA TGCACAAAAG GCATTGCAAG ATTCAGTGAC 18240

ACTTGCAAAA TTAGCAGATC GACTTGGCTT TAAGCGAATT TGGTTTACGG AACATCATAA 18300 

TGTACCAGCG TTTGCGTGTA GTAGTCCAGA ACTTTTGATG ATGCATACAT TGGCGCAGAC 18360 

AAATCACATA CGAGTTGGCT CTGGTGGTGT GATGCTGCCG CACTATCGAC CTTATAAAAT 18420

TGCTGAGCAT TTTAGAATGA TGGCAGCGTT ATATCCAAAT CGTATTGATT TAGGTATTGG 18480 
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TAGTTACGAT GAATCGATTT CGTTATTACG TGATTATCTT ACAATAAAGG ATAAACCAAG 18600 

TGCGCATACG TTAGGTGTCC AACCACACAT TGATCATTTT CCAGAAATGT GGTTATTAAG 18660 

TAGTAGCGCA ACATCTGCCA AAATAGCTGC CGAACTAGGT ATAGGGCTTT CTGTTGGAAC 18720

ATTTTTGCTA CCAGATATAA ATGCGATACA TACAGCGAAG GATAACATTG ATATTTACAA 18780 

AAAACATTTC CAAGCATCAA CGATTAAAAT GGACGCAAAG GTGATGGCAT CTGTATTTGT 18840

CATTGTAGCT GATAACGAAG CGGAAGTAGC AGCATTACAA CATGCCTTAG ATGTTTGGTT 18900 

ATTAGGTAAA TTACAATTTG CAGAATTTGA AGATTTTCCT TCAGTAGACA CAGCACAAAA 18960 

GTATAAGCTT AATGATCGAG ACAAAGAGAT GATTCAAGCA CATCAAGCAC GCATCATTGC 19020 

AGGTACACAA GAAAAGGTTA AAGCACAATT AGATGATTTC ATTGCTACGT TTGAAGTTGA 19080 

TGAGGTGTTA GTAGCACCGC TTATTCCAGG TATTGAACAG CGTTGTAAAA CATTAAAATT 19140 

ACTCGCGGAA ATTTATTTGT AGCATTTTAA ATAGAAGAGA AAGGATGAAG ATAAGATGAA 19200

AAAGTTAGCC AATTATTTAT GGGTAGAAAA AGTAGGAGAT TTGTATGTGT TTAGTATGAC 19260 

ACCTGAATTG CAAGATGATA TTGGGACAGT AGGTTATGTT GAATTCGTAA GTCCAGATGA 19320 

AGTTAAAGTG GATGATGAAA TTGTGAGTAT CGAAGCATCG AAAACGGTCA TTGATGTGCA 19380

AACGCCATTG TCAGGAACGA TTATTGAGCG AAATACAAAA GCGGAAGAAG AACCGACAAT 19440 

TTTAAACTCT GAAAAACCAG AAGAAAATTG GTTGTTCAAA TTGGATGATG TCGATAAAGA 19500 

AGCATTCCTA GCATTACCGG AGGCTTAAAT GGAAACGTTA AAATCAAATA AAGCGAGACT 19560

TGAATATTTA ATCAATGATA TGCATCGAGA GAGAAATGAC AATGACGTAT TGGTAATGCC 19620 

ATCTTCATTT GAAGATTTGT GGGAATTATA TCGAGGCTTA GCAAATGTCA GACCGGCATT 19680 

ACCTGTAAGT GATGAATATT TAGCTGTACA AGATGCTATG TTAAGTGATT TGAATCGTCA 19740

ACATGTTACG GATTTGAAGG ATTTGAAGCC GATAAAAGGT GACAATATCT TTGTTTGGCA 19800 

AGGTGATATC ACGACGTTAA AAATCGATGC TATTGTTAAT GCTGCAAATA GTCGTTTTCT 19860 

AGGATGTATG CAAGCTAATC ATGACTGCA7 TGATAATATT ATTCATACAA AAGCGGGTGT 19920 

TCAAGTTCGA CTTGATTGTG CAGAGATCAT TCGACAACAA GGGCGCAATG AAGGTGTAGG 19980 

TAAAGCCAAA ATAACACGTG GATATAATTT GCCAGCAAAG TATATAATTC ATACGGTTGG 20040 

TCCGCAAATA CGTCGATTGC CTGTTTCAAA GATGAATCAG GACTTGTTAG CTAAATGTTA 20100 

TCTTAGCTGT CTTAAATTGG CTGATCAACA TAGTTTAAAT CATGTCGCTT TTTGCTGTAT 20160

ATCTACAGGT GTATTTGCTT TTCCTCAAGA TGAAGCAGCA GAAATTGCTG TTCGAACAGT 20220

AGAAAGCTAT CTCAAAGAAA CAAATTCAAC ATTGAAAGTC GTGTTCAATG TATTTACAGA 20280 
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CAATGTCTCT GTTAATGGAT GACAAGACAA AGCAGGCTGA AGTATTGCGT ACTGCGATTG 20400

ATGAAGCAGA TGCGATAGTG ATTGGAATTG GTGCAGGCAT GTCTGCATCT GACGGATTTA 20460

CATATGTAGG AGAGCGTTTT ACGGAAAATT TCCCAGATTT TATTGAAAAA TATCGCTTCT 20520

TTGATATGTT GCAAGCGAGT TTACATCCTT ATGGCAGTTG GCAAGAGTAT TGGGCATTTG 20580

AGAGTCGTTT TATTACATTA AACTATTTAG ATCAACCTGT AGGTCAGTCT TACCTCGCTT 20640

TAAAATCCTT GGTGGAAGGT AAACAGTACC ACATTATAAC TACGAATGCA GATAATGCTT 20700

TCGATGTAGC TGATTATGAT ATGACTCATG TATTTCATAT ACAAGGGGAG TATATACTGC 20760

AACAGTGTAG CTCAGCATTG TCATGCTCAA ACGTATCGCA ATGATGATTT AATTCGTAAA 20820

ATGGTTGTTG CGCAACAAGA TATGCTTATA CCTTGGGAGA TGATTCCAAG ATGTCCAAAA 20880

TGTGATGCCC CAATGGAAGT GAATAAACGT AAAGCGGAAG TTGGGATGGT TGAAGATGCT 20940

GAATTTCATG CGCAACTACA TCGTTATAAT GCTTTTCTAG AGCAACATCA AGATGATAAA 21000

GTGTTGTATT TGGAAATTGG AATTGGTTAT ACTACACCAC AATTTGTGAA GCATCCTTTT 21060

CAGCGTATGA CACGTAAAAA TGAAAATGCC CTTTATATGA CGATGAATAA AAAGGCATAT 21120

CGCATTCCGA ATTCAATTCA AGAACGTACC ATACATTTAA CTGAGGATAT CTCAACATTG 21180

ATTACAGCAG CACTCCGGAA CGACAGCACA ACGAAAAATA ACAACATTGG AGAGACAGAA 21240

GATGTACTTA ATAGAACCGA TTAGAAATGG AGAATATATT ACTGATGGTG CGATTGCACT 21300

CGCTATGCAA GTTTATGTTA ACCAGCATAT CITITTAGAT GAAGATATTT TATTCCCTTA 21360

TTATTGTGAT CCAAAAGTGG AAATTGGACG TTTTCAAAAT ACTGCTATAG AAGTGAATCA 21420

AGATTATATA GATAAACACA GTATTCAAGT AGTTCGCCGA GATACTGGTG GTGGCGCTGT 21480

GTATGTTGAT AAAGGTGCCG TTAATATGTG TTGTATTTTA GAACAAGACA CTTCAATTTA 21540

TGGT5ATTTT CAACGATTTT ATCAACCAGC TATAAAGGCG TTGCATACAT TAGGTGCAAC 21600

AGATGTGGTA CAAAGCGGTA GAAATGATTT AACATTGAAT GGTAAAAAAG TGTCAGGCGC 21660

CGCAATGACA TTAATGAATA ATCGTATTTA TGGCGGTTAT TCGCTATTAC TTGATGTTAA 21720

TTATGAAGCA ATGGATAAAG TGTTAAAGCC TAATCGCAAA AAGATTGCAT CGAAAGGGAT 21780

TAAATCTGTG CGCGCACGTG TTGGTCATCT TAGAGAAGCA CTGGATGAAA AGTATCGTGA 21840

TATAACCATT GAAGAATTTA AAAATTTAAT GGTGACGCAG ATTTTGGGAA TCGATGACAT 21900

TAAAGAGGCG AAACGATATG AATTAACGGA TGCAGATTGG GAAGCGATTG ATGAATTAGC 21960

TGATAAAAAG TATAAAAATT GGGATTGGAA TTATGGCAAG TCACCCAAAT ATGAATACAA 22020

TCGAAGTGAA AGATTATCTT CAGGTACGGT AGACATAACA ATTTCTGTTG AACAAAATCG 22080
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AGAAGCATTA CAAGGAACAA AAATGACAAG AGAAGATTTA ACGCATCAGT TAAAGCAATT 22200 

AGACATCGTT TATTATTTTG GCAATGTTAC GGTAGAAGCA TTAGTGGATA TGATTTTAAG 22260 

TTAATATTGT TATTTTATGT ATGCTGAATC ATTGGAAGTG TTTGCTTGCT CTTGAAAAGG 22320 

TGACAATAGT GTTTGGTGAA GGTTGAACAT ATGAGTGGAA ATTATTGCCT TTAACTATTC 22380

AAAGTATGAT ATATATATGG TTTTTGTTTC TAAATGATTG GGTATTTGAA AATAGATGAG 22440 

TTTAATATTT TAAGGAATAT AATGATGTTT ACTTTTATAA TTCATATAGA ATATTAAGCA 22500

ATATAAGTCT GTTGATATAT ACAAAATATA ATGACTGCTA TAATGAGTAA TCAATAGACA 22560 

CAAAGAGGAG ATTATGTGAT GAATAATAAA GTATTAGTAA CCGGTGGTAC AGGGTTTGTT 22620

GGCATGCGAA TTATTTCACG ATTATTAGAA CAAGGTTATG ACGTACAAAC GACGATACGT 22680 

GATTTAAGTA AAGCTGATAA AGTAATTAAA ACAATGCAAG ACAATGGCAT TTCCACAGAG 22740 

CGATTAATGT TTGTCGAAGC GGATTTATCA CAAGATGAAC ATTGGGATGA AGCAATGAAA 22800

GATTGCAAGT ATGTCTTGAG TGTAGCATCT CCGGTGTTTT TCGGTAAAAC AGACGATGCA 22860 

GAAGTGATGG CGAaCTGcAA TTGAAGGTAT ACAACGTATT TTAAGAGCTG CAGAACATGC 22920 

GGGTGTTAAA CGTGTGGTAA TGACTGCAAA CTTTGGTGCA GTTGGTTTTA GTAATAAAGA 22980

TAAAAATTCA ATCACAAATG AAAGTCATTG GACAAATGAA GATGAACCAG GCTTATCAGT 23040 

ATATGAAAAA TCAAAATTGT TAGCTGAAAA GGCAGCGTGG GATTTTGTTG AGAATGAAAA 23100 

TACAACAGTA GAATTTGCCA CAATCAATCC AGTTGCAATT TTTGGGCCAT CATTAGATGC 23160 

ACACGTTTCA GGAAGCTTTC ATTTATTAGA AAATTTATTG AATGGTTCAA TGAAACGTGT 23220 

ACCGCAAATT CCGTTAAATG TTGTTGATGT GAGAGACGTA GCTGAACTGC ACATTTTGGC 23280

AATGACAAAT GAACAAGCTA ATGGCAAGCG ATTTATTGCG ACGGCTGATG GACmAATTwA 23340 

tTTGTTGGGA ATTGcCAAAt TAATTAAAGA AAAGGGCCTG GAAATAGCTC CAAAAGTTCC 23400 

TACTAAAAAA TTACCCAGCT TTATTTTGAG CnAnGnGCC 23439 

(2) INFORMATION FOR SEQ ID NO: 39:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4522 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 39:

CCCTTTGAGA GTATATCATC TAGTCAAATT ATGCCTGTCA TTAGAGCGAC TAGCTTTGAT 60 
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

TATTATGCAG TCGATTTAGG GAAATCATAT CGTCTAATTG ACGAAAGCAT GTTAGAGGAT 180 

TTGAAGTTAA CTGAACAACA AATAAGAGAA ATGTCTCTGT TTAATGTTAG AAAATTGTCA 240

AATTCATATA CGACTGATGA AGTAAAAGGT AATATTTTTT ATTTTATTAA CTCAAATGAC 300

GGGTATGATG CAAGTAGGAT ACTAAATACT GCATTTTTAA ATGAAATTGA GGCACAATGT 360

CAAGGCGAAA TGCTCGTAGC AGTGCCACAC CAAGATGTGT TAATTATTGC AGATATACGC 420

AATAAAACAG GATATGATGT GATGGCACAT TTAACAATGG AATTTTTCAC TAAAGGTCTA 480

GTTCCAATTA CATCATTATC CTTTGGATAT AAACAGGGTC ATCTTGAACC GATATTTATT 540 

TTAGGTAAAA ATAATAAACA AAAAAGAGAT CCAAACGTGA TTCAGCGTTT AGAAGCAAAT 600 

CGTCGTAAAT TTAATAAAGA TAAATAGAAA TAATTGGATA AGGAGTTTTG TCATAATGAA 660 

TTTATTTTAC AATCCTAAAT ATGTAGGAGA TGTCGCATTT TTACAAATTG AACCAGTTGA 720 

AGGTGAATTA AACTACAATA AAAAAGGTAA TGTTGTTGAA ATTACtAATG AAGGTAATGT 780 

TGTAGGTTAT AATATTTTTG AAATTTCAAA AGATATAACA ATTGAAGAAA AAGGTCATAT 840 

TAAATTAACT GATGAACTTG TAAATGTATT CCAAAAGCGT ATTTCAGAAG CTGGTTTTGA 900 

TTATAAATTA AATGCTGATC TATCACCGAA ATTTGTAGTT GGCTACGTTG AAACTAAAGA 960

CAAACATCCT GATGCAGATA AATTAAGTGT ACTAAATGTA AACGTTGGAA ATGACACATT 1020 

ACAAATTGTA TGTGGCGCGC CTAACGTTGA AGCTGGACAG AAAGTTGTTG TTGCTAAAGT 1080 

AGGTGCAGTG ATGCCTAGCG GTATGGTAAT TAAAGATGCT GAATTACGTG GTGTTGCCTC 1140

AAGCGGTATG ATTTGTTCAA TGAAAGAATT GAATTTACCT AATGCACCTG AAGAAAAAGG 1200 

TATTATGGTA TTAAATGACA GCTATGAAAT TGGACAAGCA TTtTTTGAAT AATTAAGGAA 1260 

GGTAGTGAAA ATATGAGCTG GTTTGATAAA TTATTCGGCG AAGATAATGA TTCAAATGAT 1320

GACTTGATTC ATAGAAAGAA AAAAAGACGT CAAGAATCAC AAAATATAGA TrACGATCAT 1380

GACTCATTAC TGCCTCAAAA TAATGATATT TATAGTCGTC CGAGGGGAAA ATTCCGTTTT 1440

CCTATGAGCG TAGCTTATGA AAATGAAAAT GTTGAACAAT CTGCAGATAC TATTTCAGAT 1500

GAAAAAGAAC AATACCATCG AGACTATCGC AAACAAAGCC ACGATTCTCG TTCACAAAAA 1560

CGACATCGCC GTAGAAGAAA TCAAACAACT GAAGAACAAA ATTATAGTGA ACAACGTGGG 1620 

AATTCTAAAA TATCACAGCA AAGTATAAAA TATAAAGATC ATTCACATTA CCATACGAAT 1680 

AAGCCAGGTA CATATGTTTC TGCAATTAAT GGTATTGAGA AGGAAACGCA CAAGCCAAAA 1740

ACACATAATA TGTATTCTAA TAATACAAAT CATCGTGCTA AAGATTCAAC TCCAGATTAT 1800

CACAAAGAAA GTTTCAAGAC TTCAGAGGTA CCGTCAGCTA TTTTTGGCAC AATGAAACCT 1860
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AAACAAAAAT ATGATAAATA TGTAGCTAAG ACGCAAACGT CTCAAAATAA ACAATTAGAA 1980

CAAGAAAAAC AAAATGATAG TGTTGTCAAA CAAGGAACTG CATCTAAATC ATCTGATGAA 2040 

AATGTATCAT CAACAACAAA ATCAATGCCT AATTATTCAA AAGTTGATAA TACTATCAAA 2100

ATTGAAAATA TTTATGCTTC ACAAATTGTT GAAGAAATTA GACGTGAACG AGAACGTAAA 2160 

GTGCTTCAAA AGCGTCGATT TAAAAAAGCG TTGCAACAAA AGCGTGAAGA ACATAAAAAC 2220 

GAAGAGCAAG ATGCAATACA ACGTGCAATT GATGAAATGT ATGCTAAACA AGcGGAACgC 2280

TATGTTGGTG ATAGTTCATT AAATGATGAT AGTGACTTAA CAGATAATAG TACAGATGCT 2340 

AGTCAGCTTC ATACAAATGG CATAGAGAAT GAAACTGTAT CAAATGATGA AAATAAACAA 2400 

GCGTCAATAC AAAATGAAGA CACTAATGAC ACTCATGTAG ATGAAAGTCC ATACAATTAT 2460 

GAGGAAGTTA GTTTGAaTCA AGTATCGACA ACAAAACAAT TGTCAGATGA TGAAGTTACG 2520 

GTTTCGAATG TAACGTCTCA ACATCAATCA GCACTACAAC ATAACGTTGA AGTAAATGAT 2580

AAAGATGAAC TAAAAAATCA ATCCAGATTA ATTGCTGATT CAGAAGAAGA TGGAGCAACG 2640

aATAAAGAAG AATATTCAGk AAGTCAAATC GATGATGCAG AATTTTATGA ATTAAATGAT 2700 

ACAGAAGTAG ATGAGGATAC TACTTCAAAT ATCGAAGATA ATACCAATAG AAACGCGTCT 2760

GAAATGCATG TAGACGCTCC TAAAACGCAA GAGTACGCAG TAACTGAATC TCAAGTAAAT 2820

AATATCGATA AAACGGTTGA TAATGAAATT GAATTAGCAC CGCGTCATAA AAAAGATGAC 2880 

CAAACAAACT TAAGTGTCAA CTCATTGAAA ACGAATGATG TGAATGATAA TCATGTTGTG 2940

GAAGATTCAA GCATGAATGA AATAGAAAAG AATAACGCAG AAATTACAGA AAATGTGCAA 3000

AACGAAGCAG CTGAAAGTGA ACAAAATGTC GAAGAGAAAA CTATTGAAAA CGTAAATCCA 3060 

AAGAAACAGA CTGAAAAGGT TTCAACTTTA AGTAAAAGAC CATTTAATGT TGTCATGACG 3120

CCATCTGATA AAAAGCGTAT GATGGATCGT AAAAAGCATT CAAAAGTCAA TGTGCCTGAA 3180 

TTAAAGCCTG TACAAAGTAA GCAAGCTGTG AGTGAAAGAA TGCCTGCGAG TCAAGCCACA 3240 

CCATCATCAA GATCTGATTC ACAAGAGTCA AATACAAATG CATATAAAAC AAATAATATG 3300 

ACATCAAACA ATGTTGaGAA CAATCAACTT ATTGGTCATG CAGAAACAGA AAATGATTAT 3360 

CAAAATGCAC AACAATATTC AGAGCAGAAA CCTTCTGTTG aTTCAACTCA AACGGAAATA 3420 

TTTGAAGAAA GTCAAGATGA TAATCAATTG GAAAATGAGC AAGTTGATCA ATCAACTTCG 3480 

TCTTCAGTTT CAGAAGTAAG CGACATAACT GAAGAAAGCG AAGAAACAAC ACATCCAAAC 3540

AATACTAGTG GACAACAAGA TAATGATGAT CAACAAAAAG ATTTACAGTC ATCATTTTCA 3600 

AATAAAAATG AAGATACAGC TAATGAAAAT AGACCTCGGA CGAACCAACA AGATGTTGCA 3660 
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CCAAGTGTTT CATTACTAGA AGAACCACAA GTTATTGAGT CGGACGAGGA CTGGATTACA 3780

GATAAAAAGA AAGAACTGAA TGACGCATTA TTTTACTTTA ATGTACCTGC AGAAGTACAA 3840

GATGTAACTG AAGGTCCAAG TGTTACAAGA TTTGAATTAT CAGTTGAAAA AGGTGTTAAA 3900

GTTTCAAGAA TTACGGCATT ACAAGATGAC ATTAAAATGG CATTGGCAGC GAAAGATATT 3960

CGTATAGAAG CGCCTATTCC AGGAACTAGT CGTGTTGGTA TTGAAGTTCC GAACCAAAAT 4020

CCAACGACAG TCAACTTACG TTCTATTATT GAATCTCCaA GTTTTAAAAA TGCTGAATCT 4080

AAATTAACAG TTGCGATGGG GTATAGAATT AATAATGAAC CATTACTTAT GGATATTGCT 4140

AAAACGCCAC ACGCACTAAT TGCAGGTGCA ACTGGATCAG GGAAATCAGT TTGTATCAAT 4200

AGTATTTTGA TGTCTTTACT ATATAAAAAT CATCCTGAGG AATTAAGATT ATTACTTATC 4260

GATCCAAAAA TGGTTGAATT AGCTCCTTAT AATGGTTTGC CACATTTAGT TGCACCGGTA 4320

ATTACAGATG TCAAAGCAGC TACACAGAGT TTAAAATGGG CCGTAGAAGA AATGGAACGA 4380

CGTTATAAGT TATTTGCACA TTACCCATGT ACGTAnTATA ACAGCATTTA ACnAAAAAGC 4440

CCCATATGAT GAAAGAATGn CAAAAATTGT CATTGTAaTT GATGAGTTGG CTGATTTAAT 4500

GATGATGGTC CGCAAGAAGT TG 4522

(2) INFORMATION FOR SEQ ID NO: 40:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 751 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 40:

TCAAGTTTAC GGATACGTAT ATATTTTGCA TGACATTTAG TGCAATAATA TTCATAATTT 60

GCCCGTTGTT GATAGCTTTC AATGCTGTTA CAAAATCTAG GCGCTCCAAC CTGTTGGCTC 120

AATCGTTTAA AATCTTGATC TTTATGTTGA TAACCTTTAC CAGCAATATG CAAGTGATAA 180

TGACACAATT CGTGCAGTAT AATTTTTACA ACAGCATCTT CTCCATAATG CTCATATTGT 240

TTTGGATTAA TTTCAATATC ATGGGACTTT AAAAGATAAC GTCCGCCTGT TGTACGTAAC 300

CTTTTATTAA AATATGCACA ATGTCGAAAC GTACGTCCAA ATTTTTCTTC CGAAAGATTC 360

TCAACCATTC GCTGAAGTTT GTCATTATTC ATGTGGATCA ATCATCGTTA ATGATACTTT 420

GTCTTTATTT TTGTCAATAC TGTAAATCCA AACGTCAACG ATATCACCAA CACTGACAAT 480

ATCCATTGGA TTTTTTACGA ACTTCTTAGA AAGTTTCGAA ACATGGACAA GTCCATCTTG 540



EP 0 786 519 A2



TTTCATTCCT TCTTGTAAAT CTTCAATTGA TAGCACATCG GATTTAAGGA TTGGTGTTTC 660 
AAACTCGTCC CTTGGATCTC GATTAGGTGC GTTCAAGGAT TTAATAATAT CCTGTAATGT 720 

AGGTACACCG ACTTGTAATT CAATCGCCAG T 751

(2) INFORMATION FOR SEQ ID NO: 41:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1076 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 41:

TCTCCAGCTT TAACTTGATC TGGCACTTTA ACAATTGTCT GATCCATACA TACGCGACCA 60 

ATAACTTCGC ATTGATGACC ATTTACATTT ACAAAGCTAC CTTGCATTAT GCGTAAATGG 120

CCATCTGCAT ATCCAATAgG TAACAATGCT ATTGTAGTTG GGTCAGTAGC TGTATAAGTT 180 

GCACCATAAC TTACAGACTC ACCCGCTTGT AGCGTCTTTG TTTGAACTAC ATTAGCAATT 240

AATTGCACAC TTGGTTTAAG GTGTACTTTA ACTTTTTGCT GTACATACTC TGATGGATAA 300

TATCCATAAA GGGAAATTCC TGGTCTTATT GCATTACAGA ATTGGCAATC CATTAATAGA 360 

GAGCCTGCTG AGTTCTGACA ATGTATATAT TCAGGTTTAA TTGCTTCATT GACCATATCT 420 

TTAAAACGTT GATATTGTTC AGTTGTCATA TCTCCTGGTT CGTCAGCACA GGCAAAGTGT 480

GTAAACACGC CTTCAAATAC AAGTTGCTCA TATTGTTGAA TGATTTCAAT CACTTCTTGA 540

TACGTTTTAG TATCTTTAAT ACCTAAACGT CCCATTCCTG TATCTAATTT AATGTGCAAC 600 

CATAACTTTT TCTCTTGCTC ACCAGAAATG TTTTTAATTG CTTCTTTCAA CCACTGTTTA 660 

GACGSAACCG TTAAGGCAAC TCGGTGTTGT ATCGCTTTAT CAATATCTTT AGCTGGTAAC 720 

ACACCTAAGA CTAAAATTTT AGCAGTAATC CCATGCATTC TAAGTTCTAT CGCTTCATCT 780 

AACGTTGCTA CAGCAAAAAA TGTGGCGCCA TTTTCCATTA AATGACGTGC TACTTTAACA 840

CTACCTAGTC CATAGGCATT GGCTTTAACG ACAGCCATCA CTGTTTTATT TGGATGCAAT 900 

GTACTGAATA CTTTGAAATT TGATGCAACA GCGTTTAAAT CTACATTCAT ATACGCAGAT 960 

CTATAATATT TATCCGACAT ATTACTTCCT CCTGTAATTC CCACACGTTT TAAAACTAGA 1020 

TCTTAATTAT CATTGTATAA CAAATTTAAA ATGCTGACTT TTCTAAAACA ACTTGG 1076 

(2) INFORMATION FOR SEQ ID NO: 42:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2930 base pairs 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 42: 
TGACCACAAT GCCCAATACA ACCATCCCAT GGTAAAGCCA AGAGATGAGT CAATAAAGCG 60

TGTTGAATAA GAGCTGAATG AACCTGATAC TGGATAAAAT GTTGCCAACT CTCCAATTGA 120

TGACATTAAG AAATATAGCA TGACACCAAT AACAAGATAA GCGAGTATAG CGCCTCCAGG 180

ACCAGCTTGA GAAATGATAT TACCAGTAGC TACAAATAGA CCAGTCCCAA TTGCACCACC 240

TATAGGAATC ATGGAAATGT GTCTTGAGTT AAGACTACGG TTCATTTTAT TATCTTCCAT 300

ATTTAGTCTC CCATCTATTT AAATATACCC ATTATTGTAA GCTTTTTAAG TGTACTATTC 360

AATAACTATT TAGTACTGTA AAGCGAAAAA ATTAAAATTT TCTGATTTTT TAATCATCTT 420

GAGCATGTTT AATTGTAATT TTGATGGGGT TAAATTATAA TATGTATTAA ATTATAATTA 480

TnATAAATTG TGGAGGGaTG ACTATGTCAC AACAAGACAA AAAGTTAACT GGTGTTTTTG 540

GGCATCCAGT ATCAGACCGA GAAAATAGTA TGACAGCAGG GCCTAGGGGA CCTCTTTTAA 600

TGCAAGATAT TTACTTTTTA GAGCAAATGT CTCAATTTGA TAGAGAAGTA ATACCAGAAC 660

GTCGAATGCA TGCCAAAGGT TCTGGTGCAT TTGGGACATT TACTGTAACT AAAGATATAA 720

CAAAATATAC GAATGCTAAA AtATTCTCTG AAATAGGTAA GCAAACCGAA ATGTTTGCCC 780

6TTTCTCTAC TGTAGCAGGA GAACGTGGTG CTGCTGATGC GGAcGTGACA TTCGAGGATT 840

TGCGTTAAAG TTCTACACTG AAGAAGGGAA CTGGGaTTTA GTAGGGAATA ACACACCaGT 900

ATTCTTCTTT AGAGATCCAA AGTTATTTGT TAGTTTAAAT CGTGCGGTGA AACGAGATCC 960

TAGAACAAAT ATGAGAGATG CACAAAATAA CTGGGATTTC TGGaCGGGTt TCCAGAAGCA 1020

TTGCACCAAG TAACGATCTT AATGTCAGAT AGAGGGATTC CTAAAGATTT ACGTCATATG 1080

CATCjGGTTCG GTTCTCACAC ATACTCTATG TATAATGATT CTGGTGAACG TGTTTGGGTT 1140

AAATTCCATT TTAGAACGCA ACAAGGTATT GAAAACTTAA CTGATGAAGA AGCTGCTGAA 1200

ATTATAGCTA CAGATCGTGA TTCATCTCAA CGCGATTTAT TCGAAGCCAT TGAAAAAGGT 1260

GATTATCCAA AATGGACAAT GTATATTCAA GTAATGACTG AGGAACAAGC TAAAAACCAT 1320

AAAGATAATC CATTTGATTT AACAAAAGTA TGGTATCACG ATGAGTATCC TCTAATTGAA 1380

GTTGGAGAGT TTGAATTAAA TAGAAATCCA GATAATTACT TTATGGATGT TGAACAAGCT 1440

GCGTTTGCAC CAACTAATAT TATTCCAGGA TTAGATTTTT CTCCAGACAA AATGCTGCAA 1500

GGGCGTTTAT TCTCATATGG CGATGCGCAA AGATATCGAT TAGGAGTTAA TCATTGGCAG 1560
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GGTCAAATGC GCGTAGTTGA CAATAACCAA GGTGGAGGAA CACATTATTA TCCAAATAAC 1680 

CATGGTAAAT TTGATTCTCA ACCTGAATAT AAAAAGCCAC CATTCCCAAC TGATGGATAC 1740

GGCTATGAAT ATAATCAACG TCAAGATGAT GATAATTATT TTGAACAACC AGGTAAATTG 1800

TTTAGATTAC AATCAGAGGA CGCTAAAGAA AGAATTTTTA CAAATACAGC AAATGCAATG 1860 

GAAGGCGTAA CGGATGATGT TAAACGACGT CATATTCGTC ATTGTTACAA AGCTGACCCA 1920 

GAATATGGTA AAGGTGTTGC AAAAGCATTA GGTATTGATA TAAATTCTAT TGATCTTGAA 1980

ACTGAAAATG ATGAAACATA CGAAAACTTT GAAAAATAAA TTTGATATGT AGTTTCTATA 2040

TTGCGTAGTT GAGCAGTTTA TGATATCATA ATAAATCGTA AAGATTCCTA ACAAGAGAGG 2100 

GTGTTTAACG TGCGCGTAAA CGTAACATTA GCATGCACAG AATGTGGCGA TCGTAACTAT 2160 

ATCACTACTA AAAATAAACG TAATAATCCT GAGCGTATTG AAATGAAAAA ATATTGCCCA 2220 

AGATTAAACA AATATACGTT ACATCGTGAA ACTAAGTAAT TCTTATCATT CAAATACGAC 2280

GATTTGAAAA TAAAGCGGGC TTACCTATTA TATTGGGGAG CTCGCTTTTT TATGAAATTT 2340

TTGTGAAGAG TGATTAATGG ATTGAGTTTC ATCGGTAGAA CAATATATGA TTATATTAGT 2400

TGTTACTTTA TTAAAaTTTG AGAATATTTA TAGAAGGAAA TAGATTACTG ATTTTATAAA 2460 

GTCACTTTGT TAGCGAATGC TTGAAAGAGT ATTTAATATA GTAGAATTTA AAATTTCAAA 2520 

GCGGAATTTA ATAAGTACGA AGTAGTTCTG GGTATGTTTT ATAAATGTTC GATAATACAC 2580

TTTAATCTTA AATATGATGG TTTAGAAAAT GATTTAACAA AGAAATGAaA CTTTACTGTT 2640

GAATTATGTG AGGATTGTGT TATTATATAA ATCGTAATAA TTACGATTTG ATAAAAAGTG 2700 

AGGTAACTAT ATATGGCTAA GAAATCTAAA ATAGCAAAAG AGAGAAAAAG AGAAGAGTTA 2760 

GTAAATAAAT ATTACGAATT ACGTAAAGAG TTAAAAGCAA AAGGTGATTA CGAAGCGTTA 2820

AGAAAATTAC CAAGAGATTC ATCACCTACA CGTTTAACTA GAAGATGTAA AGTAACTGGA 2880

AGACCTAGAG GTGTATTACG TAAATTTGAA ATGTCTCGTA TTGCGTTTAG 2930

(2) INFORMATION FOR SEQ ID NO: 43:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3606 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 43: 

CTTCTTGCCA TGGCTCTCTT TATTTAAAAA TGCTTCCAAC TTGTCCATTT GATTGTTTCT 60
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TTATAAAAAA CTAATTTTAC AAATGCTTTT GCGTTCTTAC AAAAAATGCA TTTGACTATT 180

ATTATAATAA GCGTATAATT GTCGCATATT ATTTTTTGTA TTTTTGGCAA TAACGAAGGA 240 

GTATTTATGA ATAAAGACAA GCAATTGCAC AACGACAAAA TCAATCTATC CCAATTAGTC 300

TTATTAGGGT TAGGCTCTTT AATAGGATCT GGTTGGCTAT TTGGTGCGTG GGAAGCATCA 360

TCAATAGCTG GACCAGCAGC AATCATATCA TGGGTTCTTG GATTCCTAGT CATTGGAACC 420 

ATTGCCTATA ACTACATTGA AATCGGCACA ATGTTTCCTC AATCAGGTGG CATGAGTAAC 480

TATGCCCAGT ATACACATGG CTCATTATTA GGCTTTATTG CTGCTTGGGC GAATTGGGTG 540 

TCTTTGGTGA CAATAATACC TATCGAAGCT GTGTCAGCTG TTCAATATAT GAGTTCTTGG 600 

CCGTGGCATT GGGCGAAACC AATGAGATAT TTAATGGAAA ATGGCTCTAT TAGCACATAC 660 

GGATTGCTAG CTGTATATCT CATCATTGTT ATTTTTTCAT TATTAAACTA TTGGTCCGTA 720 

AAACTTTTAA CATCATTTAC GAGTTTAATT TCTGTATTTA AATTAGGCGT ACCCATGTTA 780 

ACCATCATCA TGTTGATGCT ATCAGGATTC GACACTTCAA ATTACGGCCA TTCGGCAAGC 840 

ACATTTATGC CTTACGGAAG TGCACCGATT TTTGCTGCAA CAACAGCATC AGGGATTATT 900 

TTTTCATTCA ATTCATTCCA GACAATTATT AATATGGGTT CAGAAATTAA AAATCCTGAA 960 

AAAAATATCG CAAGAGGCAT CGCTATCTCA CTGTCAATCA GTGCAGTGTT GTACATCATT 1020 

TTACAAAGTA CGTTTATCAC TTCTATGCCT CAATCAATGT TACAACATAG TGGATGGAAT 1080 

GGCATCAACT TCAATTCACC ATTTGCTGAT TTAGCTATCT TATTAGGAAT TAATTGGCTC 1140

GCAATTTTAC TATACATTGA AGCTTTTGTA TCACCATTCG GTACTGGCGT GTCATTTGTC 1200 

GCCGTTACAG GTCGAGTTTT ACGAGCAATG GAGAAAAATG GACATATCCC TAAATTTCTT 1260 

GGGAAGATGA ATGAAAAATA TCATATCCCA CGTGTAGCAA TCATCTTTAA TGCCATCATT 1320

AGT/CTGATTA TGGTTACATT ATTTAGAGAT TGGGGTACGC TAGCAGCAGT TATTTCTACT 1380 

GCAACTTTAG TAGCCTATTT AACTGGCCCA ACGACAGTGA TTGCATTAAG AAAAATGGGA 1440 

CCAACAATGA CTCGTCCATT TAGAGCAAAA ATTTTAAAAG TAATGGCACC ATTATCATTT 1500

GTATTAGCTT CATTAGCTAT ATATTGGGCA ATGTGGCCAA CAACGGCTGA AGTTATTTTA 1560 

ATCATTATAC TTGGATTACC AATCTACTTC TTCTATGAAT ATCGTATGAA TTGGCGTAAT 1620 

ACAAAGAAAC AAATTGGTGG TAGCTTATGG ATTATTGTAT ATTTAATCGT GCTATCAATA 1680

CTGTCATTTA TAGGAAGCAA AGAATTTAAA GGCTTAAATA TGATTCACTA TCCATTTGAC 1740 

TTTATCGTTA TTATTATTGT GGCACTTATC TTCTATTACA TCGGTACAAC GAGTTCATTT 1800 

GAAAGCGTCT ATTTCCGTCG CGCAACACGA ATCAATACGA AGATGCGTGA GTCACTAAAT 1860
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CACACACATT AACCAACCAT TGATTTCAAC ATCTTGGTTG GTTTTTTATT TTGAAAATCG 1980

GTTATAAATA ACTAACATAA CAAGATGATG ATCAGGCTGG GACATAAATC AATGTTCTAT 2040

GCTCTACGAA gTTATATTGG CAGTAGTTGA CTGAACGAAA ATGCGCTTGT AACAAGCTTT 2100

TTTCGATTCT AGTCAGGGGC CCCAACACAG AGAATTTCGA AAAGAAATTC TACAGGCAAT 2160

GCAAGTTGGG GTGGGACGAC GATAAAGAAA TACTTTTTCT ATAGAAATTA GTATytCTTA 2220

TGCATGAGTT TTACTCATGT ATTCATATTT TTAAGTACAC ATTAGCTGTG GCTAATGTAT 2280

AAGAACCACT ACATAATAAA TCATTTGTGG CTCTTTATCA TTTCTGTCCC ACTCCCGTAG 2340

AAGTACATCA TATAATGCTG AAAATGGTTT GAGTTAAAAC AGATATCAAG CTCGTCTGAT 2400

TCAGTCACAA AATTGTCTTG TTATACTTGT CACCTATCAT CTATAGACCG TGGTATGATT 2460

AAATTGGGGA TGATAAAGGA GGTTAATAAA TATGAAGATT AATACTACAG GTGGTCAAAT 2520

TCATGGTATT ACACAAGATG GTTTAGATAT CTTCTTAGGC ATTCCTTATG CAGAACCACC 2580

AGTTCATGAC AATCGCTTTA AACATTCTAC GTTAAAAACA CAATGGTCAG AGCCAATTGA 2640

TGCAACTGAA ATACAACCCA TCCCACCGCA ACCAGACAAC AAATTAGAAG ATTTTTTCTC 2700

CTCAGAATCT ACAACTTTTA CTGAACATGA AGACTGTTTA TATCTAAATA TTTGGAAACA 2760

ACATAATGAT CAGACGAAGA AACCTGTCAT CATTTATTTT TATGGTGGTA GTTTTGAAAA 2820

TGGTCATGGT ACAGCCGAAC TCTATCAACC GGCACATTTA GTACAAAATA AOGACATTAT 2880

CGTTATTACA TGCAATTATC GTTTAGGCGC ATTAGGATAT TTAGACTGGT CATATTTTAA 2940

TAAAGATTTT CATTCCAATA ATGGCCTTTC AGATCAAATC AATGTCATAA AATGGGTGCA 3000

TCAATTTATT GAATCCTTCG GTGGCGACGC TAATAACATT ACTTTAATGG GTCAGTCTGC 3060

AGGCAGTATG AGCATTTTGA CTTTACTTAA AATACCTGAC ATTGAGCCAT ACTTCCATAA 3120

AGTQGTTCTA CTAAGTGGCG CACTACGATT AGACACCCTT GAGAGTGCAC GCAATAAAGC 3180

ACAACATTTC CAAAAAATGA TGCTCGATTA TTTAGATACA GATGATGTTA CATCATTATC 3240

GACAAATGAT ATTCTTATGC TGATGGCGAA gcTAAAACAA TCTCGAGGAC CTTCTAAAGG 3300

GCTTGATTTA ATATATGCGC CTATTAAAAC AGATTATATA CAAAATAATT ATCCAACAAC 3360

GAAACCAATT TTTGCATGTT ATACAAAAGA TGAAGGCGAT ATTTATATTA CTAGTGAACA 3420

GAAAAAATTA TCGCCGCAAC GCTTTATCGA CATTATGGAA TTAAATGATA TTCCTTTAAA 3480

ATACGAAGAT GTTCAGACGG CGAAGcAACA ATCTTTAGCG ATTACACATT GTTATTTCaA 3540

ACAGCCGATG aAGCAATTTT TACmACmACT CAATATACmA GATTCCAACC GCACCAACTA 3600

TGGCTT 3606






EP 0 786 519 A2 



(2) INFORMATION FOR SEQ ID NO: 44:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 15109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 44: 

GAAATTAAAA AAGCAATTGG nACAAGATGC AACAGTGTCA TTGTTTGATG AATTTGATAA 60

AAAATTATAC ACTTACGGCG ATAACTGGGG TCGTGGTGGA GAAGTATTAT ATCAAGCATT 120 

TGGTTTGAAA ATGCAACsAG AACAACAAAA GTTAACTGCA AAAGCAGGTT GGGCTGAAGT 180 

GAAACAAGAA GAAATTGAAA AATATGCTGG TGATTACATT GTGAGTACAA GTGAAGGTAA 240 

ACCTACACCA GGATACGAAT CAACAAACAT GTGGaAGAAT TTGAAAGCTA CTAAAGAAGG 300 

ACATATTGTT AAAGTTGATG CTGGTACATA CTGGTACAAC GATCCTTATA CATTAGATTT 360 

CATGCGTAAA GATTTAAAAG AmAAATTAAT TAAAGCTGCA AAATAATTCA GCTATATAAG 420

TTAGTGAAAT GAGAGTCTGA AACATATCAA TCTTTTGATA TTGTATTAGG CTCTTATTTT 480

TATAGCTAGA AAGTTAGATA TTTGTATTTT TTTAAATAAT AAGTGCCGTT GTTATCGTTC 540 

AATTTAATTA ATGATAGATT AGTATTATTA TAGCTAAAGT AGTATACCTG AGAAAATAGC 600 

TCAATGTATC TCTTTATTAA TAAGTTATAT CATAATTATT TTAGTGCATA CTTTATGGAA 660 

GGGATATCAG GGAATGGCTT TCAATTAAAG AAGAGGTTTA AAAGGATTAC AACAGAATGT 720 

TATGATTTTG TAGAAAGATA TATAACAACG TTTTATAAAA ACATAATATT GTTAATGGAA 780

AATGAAATGT AAGGGGGATT TCGAGTGACT AAGAAAGTTT ATTTTAACCA CGATGGTGGT 840

GTAGATGATT TAGTATCTCT ATTTTTATTA TTACAAATGG AAAACGTTCA ATTGATAGGG 900

GTCAGTACAA TTGGTGCTGA TTGTTATTTA GAGCCATCTT TGAGCGCATC AGTAAAAATT 960 

ATTAATCGTT TTTCAAATGA AGATATTCAA GTTGCGCCAT CATATGAACG AGGAAAAAAT 1020 

CCATTTCCTA AAGAATGGCG TATGCATGCC TTTTTTATGG ACGCATTGCC AATTTTAAAT 1080

GAGCCAGTCA AACATGTTGC TTCAAATGTG AGCGACAAAG AAGCCTTTGA AGACATTATT 1140 

CAAACTTTAA AGAGACAATC AGAAAAAGTA ACATTATTAT TTACAGGCCC GCTTACAGAT 1200 

TTAGCAAAAG CACTACAAAA AGATTCATCT ATCGTTCAGT ATATAGAAAA ATTAGTTTGG 1260

ATGGGTGGCA CCTTTTTACC AAAAGGAAAT GTTGAAGAAC CTGAGCATGA TGGTTCTGCA 1320

GAATGGAATG CATATTGGGA TCCAGAAGCG GTTAAAATTG TTTTTGATAG CGATATAGAG 1380

ATTGATATGG TTGCTTTAGA AAGTACGAAT CAAGTACCGC TAACGTTAGA TGTTAGACAA 1440
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GTACCACCAT TAACACACTT TATAACAAAT TCTACTTACT TTTTATGGGA TGTTTTAACG 1560 

ACTGCTTATA TTGGTAACAA GGACTTGGTT CATTCAATTG AGAAAAAAGT CGATGTAATA 1620 

AGTTATGGAC CAAGTCAAGG TAAGACATTT GAGTGTAAAG ATGGGCGCAA AATTAATGTC 1680

ATAAATCATG TAGATAACAA CGCATTTTTT GATTATATAA CTGCACTTGC TAAAAAAGTA 1740 

AATTAACAGC TGTGTAGAAT AATTAAGGTT TTAATTTATA TAGAACAACT TATTGTAAAC 1800 

TTTTCATTTC TTAAAGTTTA CAATGGTGCT ATAATAATGG TCATGAAATA CGAAAGGAAG 1860

TAAAAAATGA CAACAAAACA GTTAGTATAT ACAGCTTTAA TGACAGCGAT TATCGCTATT 1920 

TTAGGATTGG TACCGGTAAT TCCACTACCA TTTTCTTCAG TACCAATTGT ACTTCAAAAC 1980 

ATTGGTATTT TCTTAGCAGG TGCGATTTTA GGACGTAAAT ATGGCACATT AAGTGTTATC 2040

GTCTTTTTAT TATTAGTAGT TGCTGGCTTG CCATTGTTAT CAGGTGGTCG CGGTGGCATC 2100 

GGTGTATTCG CAGGTCCTTC AGCAGGGTTT TTACTATTAT ATCCAGTTGT AGCATTCATG 2160 

ATTGGGGCGA TTCGAGATAG ATTCATCAAT GAAATTAATT TCTGGATTTT ATTCGTTGGT 2220 

ATTTTAGTTT TTGGTGTTAT AGCATTAGAT GTTATTGGTA CATTOATTAT GGGCATGATT 2280 

ATTAACATAC CATTTACGAA AGCTATTTCA ATTTCATTAG CTTATTTGCC TGGTGATATA 2340

TTAAAAGCAA TTGTAGCAAG TTTGATTGGT ACAGCTTTAC TTAATCACTC GCAGTTTCGT 2400

CAAATTATGG GAATAAAATA ATCATATTTA AGATAGTAAA GTAATTGAAT AAGTTGCTTT 2460 

GAAATTTATA AAAGTGAAAG GAGTAGGTGT CAATGGCTAG TATAAGTATG TCAGATATAT 2520 

ATTGTAACGG CACTATATTT GAAAATGACG ACGAGCAGTT GATTTATTTA ACGCCTTCTT 2580 

TTCCACAACG ATACACAAGT AACACATGGA TATATAAAAA GACGCCTACC CAAGAGCGAT 2640 

GGCTGAAAGA CTTAGAACGT CAACATCAAT TACATACAAA TCAAGGTTCA AATCATTATG 2700 

CGTTTAGTTT CCCGGAAAAT GAACAACTTG ATAATCATTG GATGGCTATG TTTAAAGATA 2760 

TGAATTTTGA ACTAGGTATT ATGGAATTGT ATGCCATAGA AAGTGATGCG CTTGCCAATT 2820 

TGCCGCGTAA CTCTGACGTT GAAATTGCCA TCGTTGACGA GTCGCATATA GATGCCTATT 2880

TAAAAGTTGC ATATCAGTTT AGTTTGGCAT TTGGAAAAGA CTATGCAGAT GCACATGAAG 2940

AAATGGTAAG GGAACATTAT CAAAAAGATG TGATTAAACG CTTAGTAGCT TATTTAAATA 3000

ATGAACCTAT TGGCGTTGTA GATGTCATTb AAAGTGAAAA TTACATTGAA TTAGATGGAT 3060

TTGGTGTATT AGAACAATTT CGGCACCAAG GAATTGGATC TACAATTCAA TCGTTGATAG 3120 

GTGAATACGC CATATCAAAA AATCACAAAC CAATCATATT AGTTGCAGAT GGTGAAGATA 3180 

CAGCAAAAGA TATGTATGCA AAGCAAGGTT ATGTCTATCA ATCGTTTTGT TATCAAATAT 3240
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TAAGCTGGTT TCGAGTAGAA ATCAACTTAC TGCTTTTTAA ATTGTTTTGA GCTACTTATA 3360 

CTTATAAAAA TAGTGCGTTT AAATTGTTGA TTCATGTAGA ATATCGTTCA TTATGACACA 3420 

CTATAATGAA TATGTTATTG TTCAGAATCA ATGATACGTT CTGGATGACT GTATATATTA 3480

AAGCCACCAT TTCGAATAAA TCCAACTGCC GTAATATTTA GGTCATTAGC TAAGGTTACA 3540

GCAAGCGTTG TCGGAGCTGA TTTAGATAAA ATGACGCCAA CACCAATTTT TGCGGCTTTA 3600 

ATTAAAATTT CTGATGAAAT ACGTCCACTA AAAATTAATA CTTTATCTCG GACAGTAATA 3660 

TGTCGCTGAA TACAAAATCC ATATAATTTA TCTAGAGCGT TATGTCTACC AATGTCTTGT 3720 

CGATGTACAA AAAATGTCAA ACCATCGCTT ATAGCAGCAT TATGTAAGCC ACCTGTTTCT 3780

TGGTAAATAT GACTTGCACT TTGTAATCGA GTCATCATGT TAATAATTTG CATTGGAGTT 3840

AAAGTGATTT TAGACATAGA TGTTTTAGCG ATAGCAGCAT CATTTTGAAA ATAAAACTCA 3900 

CGACTCTTTC CGCAACAAGA TGCAATCATT CGTTTTGTGG AATATTGAAA GCGATCGCCT 3960

AAATCTTTAT TAAGTTCAAC ATGGGCAAAA CCTTTACTAT CATCAATCAG TACAGATTTT 4020 

AATTCATCTC GCTTTAAAAT GGCACCTTCC GAAGCCAGAA ATCCAATGAC TAACTCCTCA 4080

AGGTTTGTTG GACTGCATAT AACAGTCGCA AATTCTTCAC CATTCACCAT AATTGTAAGT 4140

GGAAATTCTG TCACATATTG ATCTGTTGTA TTGAATAATT TTCCATCTTC ATATCTAACA 4200 

ATTGGTTGAC CTAAAGATAC ATCTTTGTTC ATTATCTAAC CCCTTTAATT AGCTTAAACT 4260 

TTATTTTAAA GCAATTTGCT TAAAATTTTA ACATATTTGC TTAAGTTTGA AATTTGATTG 4320 

ATAAAAATTA ATAGCGAGCA ATCTGTTTGA TTTAAATTGA ATTCGAGAAT ATACATACTA 4380 

GGGCATCAAT TAATAAATAT CAATCTTATG CAAATTTGAC AATTGTTTGA ATCAATATAT 4440 

AAACAGGCAA CGGTTCTTTT CAAATATAAT AGTAAGTGTA TAATGAAAAT GTAAATATTA 4500

TTAAAAATGG GGGTTCACTC AATGAAATTG AAACGTTTAT TTGCTGTTGT GATTGCAATG 4560 

CTTTTAGTAT TAGCTGGTTG CTCTAATTCT AACGATAATA ATGAAAGTAA AAAAGATGAC 4620

GCAGACAATG GTAAGAAACA AGAGATTCAA GTTGCAGCGG CAGCAAGTTT AACAGATGTA 4680

ACCAAGAAAT TAGCTTCAGA ATTTAAAAAA GAGCATAAAA ATGCTGATAT TAAATTTAAC 4740

TATGGTGGAT CAGGGGCATT AAGAAAACAA ATTGAATCAG GCGCACCTGT TGACGTATTT 4800

ATGTCTGCAA ATACTAAAGA TGTAGATGCA TTAAAAGACA AGAATAAAGC GCATGATACA 4860

TATAAATATG CGAAAAATAG TCTAGTATTA ATTGGTGATA AAGATTCAAA TTACACTTCA 4920

GTAAAAGACT TAAAAGACAA TGATAAATTA GCATTAGGTG AAGTGAAAAC TGTACCAGCA 4980

GGAAAATATG CGAAACAGTA TTTAGATAAC AATAACTTAT TTAAAGAAGT CGAAAGTAAA 5040 
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CAAGGTTTTG TGTATAAAAC TGACTTATAT AAACAAAATA AAAAAATTGA TACTGTAAAA 5160 

GTAATTAAAG AAGTAGAACT TAAGAAGCCA ATCACATACG AAGCTGGTGC TACATCAGAT 5220 

AGTAAATTAG CAAAAGAGTG GATGGAATTC TTAAAATCAG ATAAAGCTAA AGAAATACTA 5280

AAAGAATACC ACTTTGCAGC ATAAGGAGTT GTAATCCATG CCTGACTTAA CACCTTTTTG 5340

GATATCAATA CGAGTTGCTG TAATCAGTAC GATTATTGTA ACGGTTTTAG GTATTTTTAT 5400

ATCTAAATGG TTGTATCGTC GTAAGGGTTC GTGGGTTAAA GTATTGGAAA GTTTATTGAT 5460

ATTACCTATT GTTTTGCCGC CAACGGTATT AGGTTTTATT CTATTAATCA TCTTCTCGCC 5520

AAGAGGACCA ATCGGTCAAT TCTTTGCGAA TGTACTACAT TTACCTGTAG TGTTCACTTT 5580 

GACAGGTGCT GTGATAGCAT CTGTCATTGT TAGTTTTCCA CTAATGTATC AACATACTGT 5640

GCAAGGCTTC AGAGGTATAG ACACGAAAAT GATTAATACA GCTAGAACGA TGGGAGCAAG 5700 

TGAAACGAAA ATTTTCCTCA AATTAATTTT ACCATTAGCT AAACGCTCTA TTTTAGCAGG 5760 

TATAATGATG AGTTTTGCTC GTGCATTAGG TGAGTTTGGT GCTACATTAA TGGTTGCAGG 5820 

ATATATTCCA AATAAAACGA ATACACTACC TTTAGAAATA TACTTCTTAG TGGAACAAGG 5880 

TAGAGAAAAT GAAGCGTGGT TATGGGTATT AGTGCTAGTC GCATTCTCTA TTGTGGTTAT 5940

ATCTACAATT AATTTATTGA ATAAAGATAA ATATAAGGAG GTCGACTAGA TGCTTAAAAT 6000 

CAATGTGAAA TATCAATTAA AGAACACTTT AATTCGCATC AATATAGATG ATACTGAACC 6060 

AAAAATTTAT GCAGTTCGTG GTCCATCTGG CATTGGTAAA ACTACTGTTT TAAATATGAT 6120

TGCCGGATTA CGTAAAGCAG ATGAAGCTAT TATCGAAGTG AATGGGCAAT TACTTACTGA 6180

TACGGCAAAA AACGTGAATG TTAAAATTCA ACAACGACGT ATTGGATATC TGTTTCAAGA 6240 

CTACCAATTG TTTCCTAATA TGACGGTCTA TAAAAATATT ACTTTTATGG CTGAACCATC 6300 

TGAAeACATC GATCAATTAA TTCAAACTTT AAACATTGAT CATTTGATGA AACAATATCC 6360 

TATGACATTG TCAGGTGGAG AGGCACAACG TGTAGCACTT GCACGTGCAC TTAGCACrAA 6420 

ACCAGATTTA ATTTTATTAG ATGAACCTTT TTCTAGTTTG GATGATACTA CAAAAGATGA 6480

GAGTATTACA TTAGTTAAAC GTATTTTCAA CGAATGGCAA ATACCAATCA TATTTGTGAC 6540 

ACATTCAAAC TATGAAGCAG AACAAATGGC TCATGAAATT ATTACAATTG GGTAATCATT 6600

TATTTGCCAT TAAAGAGTTT AGAACGTATT TAAAATTGTA GAAGTGAATG CTTCTATCAG 6660 

CATTTTAATG ATGTTTTAAA CTCTTTTTTA GGGGCAGTTT TTTTGAGAGA CATTGACGCG 6720 

CGTCATATAA TGAAAGTAAT GATAAAAAGA AAGGATAACT TAATGTGAGT CAAGAACGTT 6780

ATTCAAGGCA AATTTTATTT AAACAAATAG GTGAAATAGG TCAAAGCAAA ATAAATCAAA 6840
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5 






GAGCAGGCAT TGCCAAACTA ATCATTGTTG ATAGAGATTA TATTGAATTT AGTAATTTAC 6960

AAAGACAAAC ATTGTTTACT GAAGAAGATG CTTTGAAAAT GATGCCTAAG GTGGTTGCAG 7020

CTAAAAAGCA TTTGCTAGCG TTACGTAGTG ATGTTGATAT TGATGATTAT ATTGCCCATG 7080

TGGATTATTA TTTTTTGGAA ACACATGGAC AGGACGTTGA CGTTATTATT GATGCAACCG 7140

ATAACTTTGA AACACGACAA CTGATTAATG ATTTTGCATA TAAATATCGT ATACCTTGGA 7200

TTTATGGTGG TGTTGTACAG AGTACATATA CAGAAGCTGC ATTTATACCT GGTAAAACAC 7260

CTTGCTTTAA CTGTTTGGTA CCACAATTGC CAGCATTAAA TTTAACATGT GATACAGTAG 7320

GGGTCATTCA ACCTGCCGTG ACGATGGCAA CAAGTTTACA ATTAAGAGAT GCGATGAAAG 7380

TATTAACGGA ACAACCAATT GACACAAAAA TAACTTATGG CGATATTTGG GAAGGTAGTC 7440

ATTATTCATT TGGTTTCAGT AAAATGCAAC GTTCAGACTG TACAACTTGT GGAGATGTAC 7500

CAAGTTATCC GTATTTAAAC AAGAATGAAC AACGTTATGC AACATTGTGT GGTAGAGACA 7560

CTGTACAGTA TGAAAATGCA TCAATTACAC ACGACATTCT TGTTCAATTT TTAAAACAAC 7620

ATCAGTTAAA TTATCGCAGT AATTCGTATA TGGTTATGTT TGAATTTAAA GGACACCGCA 7680

TTGTTGCTTT TAAAGGTGGA AGGTTTTTAA TACATGGCAT GACACGCACA TCAGATGCCA 7740

CACATCTAAT GAATTTATTG TTTGGATAAA AAAAGATAAG ACAAAAGGAG TGTAATATTA 7800

TGGGCGAACA TCAAAACGTT AAATTGAATC GTACAGTTAA AGCAGCCGTA CTAACGGTAT 7860

CAGATACTAG AGACTTTGAT ACAGATAAAG GTGGTCAATG CGTGCGCCAA CTATTACAAG 7920

CAGATGACGT TGAAGTGAGT GACGCACATT ATACAATTGT GAAAGATGAA AAAGTAGCCA 7980

TCACGACGCA GGTGAAGAAG TGGTTAGAAG AAGATATTGA TGTCATCATT ACGACTGGTG 8040

GAACAGGTAT TGCACAACGT GATGTGACGA TTGAAGCAGT AAAACCACTT TTAACTAAAG 8100

AGAtfcGAAGG CTTTGGGGAA TTGTTTAGAT ATTTGAGTTA TGTTGAAGAT GTTGGCACGC 8160

GTGCATTATT GTCTCGTGCT GTAGCAGGTA CAGTTAATAA TAAATTGATA TTTTCGATTC 8220

CAGGATCAAC AGGCGCAGTT AAATTAGCAT TAGAAAAGCT CATTAAACCA GAATTAAATC 8280

ATCTGATTCA TGAGCTTACA AAATAATTTA TTGATTTGAT TGGCGTTGAA AATCTCCAGA 8340

TTTACCGCCA GACTTGCTTT CAAGGTAGGT TTCGCCAATA ATCATACCTT TATCAACTGC 8400

TTTCGTCATG TCGTAAATGG TTAAAGCCGT TGCTGATGCA GCGGTTAAAG CTTCCATTTC 8460

AACACCGGTT TTGCCAGTTG TAGAGACAGT TGTTTGAATG TTTAAAGTAT AAAGGGGTGC 8520

ATTTGTTTCA TCCCAGCTGA AGTGAACATC TATGCCAGTC AATGGTAATG GATGGCACAT 8580


CGGAATAAGT GTTGATGTAT TTTTGGCAGC CATAATACCA GCGATTTGAG CAGTGTTCAA 8640
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AATGCTTGAA TGAGCGACAG CAGTTCTTTT TGTAATTTGT TTGTCTGATA CATCGACCAT 8760

TTTGGCGTGG CCTTGTTGAT TAATATGAGT AAACTCAGTC ATTTTACCCC TCCTAGTGCA 8820

TCTAGTATAT CATGAAAAAA TAAAAGTTTT GGAGATGATT TTTAATGGTA GTAGAAAAAA 8880

GAAACCCAAT CCCAGTTAAA GAAGCAATTC AACGTATCGT TAATCAGCAG AGTTCAATGC 8940

CGGCAATTAC GGTAGCACTT GAAAAAAGTC TAAATCATAT CTTAGCAGAA GATATTGTAG 9000

CTACTTATGA TATACCAAGG TTTGATAAAT CACCTTATGA TGGTTTTGCA ATTCGCAGTG 9060

TTGATTCACA AGGGGCAAGT GGTCAGAATC GCATTGAGTT TAAAGTGATT GATCATATTG 9120

GTGCAGGTTC AGTTTCTGAT AAATTAGTTG GGGATCACGA AGCGGTGCGT ATTATGACTG 9180

GAGCACAAAT ACCTAATGGC GCAGATGCTG TTGTTATGTT TGAACAAACG ATTGAACTAG 9240

AAGATACATT TACAATTCGT AAACCATTTT CAAAAAATGA AAATATATCT TTAAAAGGTG 9300

AAGAAACAAA GACAGGCGAT GTTGTTCTAA AAAAAGGACA AGTAATTAAT CCAGGGGCTA 9360

TCGCGGTCCT TGCAACATAT GGCTATGCAG AGGTTAAAGT TATTAAGCAA CCGAGTGTCG 9420

CTGTTATTGC AACAGGAAGC GAATTATTAG ATGTTAATGA TGTATTAGAA GATGGGAAAA 9480

TTCGTAACTC TAATGGCCCA ATGATTCGTG CCTTAGCAGA AAAATTAGGT CTTGAAGTTG 9540

GTATTTACAA AACACAAAAA GATGATTTAG ATAGTGGCAT CCAAGTCGTT AAAGAAGCTA 9600

TGGAAAAACA TGATATCGTT ATTACAACGG GCGGAGTTTC TGTTGGAGAT TTTGACTATT 9660

TACCTGAGAT TTATAAGGCT GTAAAGGCGG AAGTGTTATT TAATAAAGTA GCAATGCGTC 9720

CTGGTAGCGT AACAACGGTT GCATTTGTAG ATGGaAAGTA TTTGTTTGGa TTATCTGGAA 9780

ATCCATCAGC TTGTTTTACA GGATTTGAAC TATTTGTGAA nCCAGCTGTT AAACATATGT 9840

GTGGCGCACT AGAAGTCTTC CCGCAAATAA TTAAAGCAAC ATTAATGGAA GATTTTACCA 9900

AGGGAAACCC ATTCACACGA TTTATACGTG CTAAAGCAAC GTTAACAAGT GCTGGAGCTA 9960

CTGTAGTACC TTCAGGATTC AATAAATCAG GTGCGGTTGT AGCGATTGCA CATGCTAACT 10020

GTATGGTCAT GTTACCAGGA GGGTCACGTG GTTTTAAAGC GGGGCATACA GTAGATATTA 10080

TATTGACTGA ATCTGACGCT GCTGAAGAGG AACTTCTTTT ATGATTTTAC AAATTGTAGG 10140

TTACAAAAAG TCTGGTAAGA CAACATTGAT GAGGCATATT GTCTCTTTCT TAAAGTCACA 10200

TGGTTATACA GTTGCTACTA TTAAACATCA TGGGCATGGT AAGGAAGATA TTCAATTACA 10260

GGATTCAGAC GTCGATCACA TGAAGCATTT TGAAGCGGGG GCAGATCAAA GTATTGTACA 10320

AGGTTTTCAA TATCAGCAAA CTGTAACACG TGTAGATAAT CAAAATCTTA CTCAAATTAT 10380

TGAAAAATCT GTTACAATTG ACACCAATAT CGTATTAGTT GAAC3GCTTTA AAAATGCTGA 10440
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70 






GAATGTTTGT TATAGCATTA ATGTAAGGGA GCATGAAGAT TTTACAGCAT TTGAGCAATG 10560 

GTTATTAAAT AAAATTAAAA ATGATTGTGA TACACAATTA ACATAGAGGA TTGAAATGAA 10620 

TGAAACAATT TGAAATCGTG ACAGAACCGA TACAAACAGA ACAATATCGT GAATTCACTA 10680 

TAAATGAATA TCAAGGTGCA GTAGTTGTTT TTACCGGTCA TGTTCGCGAA TGGACTAAAG 10740 

GCGTCAAAAC GGAATATTTA GAATATGAAG CGTATATTCC AATGGCTGAA AAGAAATTGG 10800

CACAAATTGG AGATGAAATA AATGAAAAAT GGCCTGGAAC GATAACGAGT ATTGTTCATA 10860

GAATAGGGCC ATTACAAATT TCAGATATGG CTGTATTAAT TGCGGTTTCT TCACCGCATC 10920

GTAAAGATGC CTATCGAGCA AATGAATATG CAATTGAGCG TATAAAAGAA ATTGTTCCGA 10980 

TTTGGAAAAA AGAAATTTGG GAAGATGGTT CAAAATGGCA AGGGCATCAA AAAGGGAATT 11040 

ATGAAGAAGC AAAGAGGGAG GAATAAGAGA GATGAAGGTA CTTTACTTCG CAGAAATTAA 11100 

AGATATATTA CAAAAAGCAC AGGAAGATAT TGTGCTTGAA CAAGCATTGA CTGTACAACA 11160 

ATTTGAAGAT TTATTGTTTG AACGTTATCC GCAAATCAAT AATAAAAAGT TTCAAGTTGC 11220 

TGTAAATGAG GAATTTGTAC AAAAATCGGA TTTCATTCAA CCTAATGATA CTGTTGCATT 11280 

AATTCCACCG GTTAGTGGAG GTTAAGGGAG CATGAAAGCA ATAATTCTTG CAGGTGGTCA 11340 

TTCAGTGCGA TTTGGTAAGC CCAAAGCTTT TGCGGAAGTG AACGGTGAGA CCTTTTATAG 11400 

TAGAGTAATT AAGACATTAG AATCAACAAA TATGTTCAAT GAAATTATTA TTAGTACAAA 11460 

TGCGCAATTG GCAACGCAAT TTAAATATCC AAATGTTGTT ATAGATGATG AGAATCATAA 11520 

TGATAAAGGT CCATTAGCAG GAATTTATAC AATCATGAAG CAACATCCTG AAGAAGAATT 11580 

GTTTTTTGTC GTTTCTGTTG ATACACCAAT GATTACTGGT AAAGCTGTAA GCACGTTGTA 11640 

TCAGTTTTTA GTTTCTCATC TTATTGAAAA TCATTTAGAT GTCGCAGCTT TTAAAGAAGA 11700 

TGGACGTTTT ATTCCAACAA TTGCATTTTA TAGTCCGAAT GCATTAGGCG CTATAACTAA 11760 

AGCACTACAT TCTGATAATT ACAGTTTTAA AAATGTATAT CATGAATTAT CAACGGATTA 11820 

TTTGGATGTA AGGGATGTAG ATGCGCCCTC ATATTGGTAC AAAAATATAA ATTATCAGCA 11880 

TGATTTGGAC GCTTTAATTC AAAAATTGTA AGCTGTTAGG AGGTCCACAA ATGGTAGAAC 11940

AAATAAAAGA TAAACTAGGA CGTCCCATCC GTGACTTACG GTTATCTGTG ACAGATCGGT 12000

GTAACTTTAG GTGTGATTAT TGCATGCCTA AAGAGGTATT TGGAGATGAT TTCGTATTTT 12060 

TACCTAAAAA TGAACTTTTA ACGTTTGATG AAATGGCTAG AATCGCTAAG GTATATGCAG 12120 

AATTAGGTGT AAAAAAAATA CGCATTACAG GTGGAGAACC ATTGATGCGA CGGGATTTAG 12180 

ATGTACTTAT AGCTAAATTA AATCAAATCG ATGGTATTGA AGATATTGGT TTGACTACAA 12240
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ATGTCAGTTT GGATGCTATT GATGATACGC TATTTCAATC AATCAATAAT CGTAATATTA 12360

AAGCGACTAC GATTTTAGAA CAAATTGATT ACGCGACGTC TATTGGTTTG AATGTAAAAG 12420

TAAATGTTGT TATACAAAAA GGTATTAACG ATGATCAAAT CATACCAATG CTTGAATATT 12480

TTAAAGATAA ACATATAGAG ATTCGATTTA TAGAATTTAT GGATGTTGGT AATGATAATG 12540

GATGGGATTT CAGTAAAGTT GTAACTAAAG ATGAAATGCT TACAATGATA GAGCAGCACT 12600

TTGAAATCGA TCCTGTAGAA CCAAAATATT TTGGGGAAGT AGCAAAATAT TATCGCCATA 12660

AGGATAATGG TGTTCAATTT GGTTTGATTA CAAGTGTTTC ACAATCATTT TGTTCTACAT 12720

GTACACGCGC AAGGCTGTCA TCAGATGGGA AGTTTTACGG ATGTTTATTT GCAACTGTCG 12780

ATGGATTTAA CGTTAAAGCG TTTATTCGTT CTGGCGTGAC CGACGAAGAA TTAAAAGAAC 12840

AATTTAAAGC TTTATGGCAA ATAAGAGATG ATCGATATTC AGATGAGAGA ACTGCTCAAA 12900

CAGTTGCCAA TCGTCAACGT AAAAAGATAA ACATGAATTA TATTGGTGGT TAATGTGTAG 12960

GGACCACTAC ATATTAAATC ATTAGAGATG TTTTAATATT TCTGTCTTAC TCCCTAAAAT 13020

ACAATATTAT TTATTAAAGT AAAAACGGTC ATATCTATGC CAGATTTAAT AGAAATGATC 13080

GTTTTTAAAG TTTTTACAAG TTGGCGGGGC CCCAACACAG AAGCTGACAG AAAGTCAGCT 13140

TACAATAATG TGCAAGTTGG CGGGGCCCCA ACATAGAGAA TTTCAAAAAG AAATTCTACA 13200

GACAATGCAA GTTGGGGAAC GGGGCCCCAA CACAGAAGGT GACGAAAAGT CAGCATACAA 13260

TAATGTGCAA GTTGGCGGGG CCCCAACATA GAGAATTTCA AAAGAAATTC TACAGACAAT 13320

GCAAGTTGGG GATCAACGAA ATAAATTTTA TGAGAATATC ATTTCTATCC CACTCTTAAG 13380

AATCACTACA TAATAAATCT TTAGTGGTTC TTTAACATTG ATGTCACACT CCATGCCATT 13440

GAGTTGTAAT ATATCTTTTT TAGGTATAAA TGTTGTCGAA TAAACAACAA GTTGTCCAAA 13500

AGATATAAAT CTAAACAAGA TATAGCCAGC AATTTAATAT TTGTAATAGA TAAAATGCTA 13560

AGTTTGATAT ATAATAAATT TAAGTAATTG TATAATAATA TGAATTACAA ACATCTAAGA 13620

AGAAACATAG GAGGCATCAT ATTATGAGTA ATAAAGTTCA ACGTTTTATA GAAGCAGAAA 13680

GGGAGTTAAG TCAGTTAAAG CACTGGTTAA AAACAACACA TAAGATTTCA ATTGAAGAAT 13740

TTGTAGTCCT TTTTAAAGTG TATGAAGCTG AAAAGATTAG CGGTAAAGAA TTGAGGGATm 13800

CATTACATTT TGAAATGCTA TGGGATACAA GTAAAATCGA TGTGATTATC CGTAAAaTCT 13860

ATAAAAAAGA GCTTATTTCT AAATTGCGTT CTGAAACGGA TGAAAGACAA GTATTCTATT 13920

TCTATAGTAC TTCTCAAAAG AAATTGTTAG ATAAAATTAC TAAAGAAATA GAAGTGTTAA 13980

GCGTTACAAA CTAAAAACTT aAAAAgcaTG CCAATCTCTA TTCATCATAA TTGCGTCTTG 14040
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10 






GTTCATGGCA TTTCTAGTTA CATGACGTCC ATGAATTAAG AAGTAAACAA GCATAGTAAT 14160 

GATTGCTAAA GCGGCCATAA AGCCGAAGAT TTCACTATAT GAAAACATAT GAGTAAATAA 14220 

CCCAAGGAAT GATGGACCGA AGCCGACACC TGCATCTAGA CCAACGTAAA AAGTAGATGT 14280 

CGCGATACCA TATTTAATCG GGGGTGAGAC TTTTATCGCA ATAGATTGCA TTGCAGATGA 14340 

TAAATTTCCA TACCCTAAAC CTAGGCAAGC ACCAGCAAGT AATATTAACC AGCTTTGATA 14400 

GCTTGAAATT AAGCATACAA ATGAAAGGAA AAGCATGATA AATGCTGGGT AGACAATAAT 14460 

ATTTTCATTT TTATCATCCA TCAATCTACC AGCAATAGGT CTAGTAATTA ACGATGCTAT 14520 

AGCATAGCAA ATAAAGAAAT AGCTTGCTGC AGTGACTAGG TGTCGCTCTA AAGCAAATGC 14580 

TTGTAAATAA GTTAGGATGG ACGCATAGGT AACGCCAATT AAAAGCATAA TTACAGCAAC 14640 

AGGAATGGCC TCTTTTGCAA TAAATTGATG AATACTAAAT CTTGGTTTAT CAATGACATT 14700 

AGTTTCAGTT TTGTTATTTG TTACTTCGAA ATCAACTTTT ATAAATAATG AGATAATGAG 14760 

TCCGAGTATG CCTAATATGA CACAAATAAT AAACAGTAAG TCAATTGCGT ATTTTGTAAT 14820

AAGTAACATG CCTAGAAATG GGCCAATCGC TGTACCTAAT ACTAAACTTA AGGAAAATAA 14880

ACTGATGCCT TCACTTTTTC TATTAACAGG GGTAACGTAT GCCGCAATAG TACCTGTTGC 14940

AGTTGTCACA ACTGCAGTTG CGATACCGTT TATGAGACGT ACAAAGATTA AAAAAGCTAA 15000 

AGATCCATCA ATAAAATAAA GTAATTGCGT GATAATTAAA GCAATTAAAC CAATAAATAA 15060 

TAATCGTTTA GGTCCrATTT SATTTACAAA TTTACCTGTA GCAAATCGA 15109
(2) INFORMATION FOR SEQ ID NO: 45: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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45 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 45:

GAGAGTCAAT GGCAAGAAGA ATATAAATAT TTGAGAGCGT TAATCTTTAA TGAAACAGAA 60 

TTAGAGGAAG CGTATAAATG GATGCATCCT TGTTACACGT TGAATAATAA AAATGTAGTA 120 

CTTATCCATG GCTTCAAAAA TTATGTTGCA CTATTATTTC ATAAAGGTGC CATTTTGGAG 180

GATAAATATC ATACACTCAT TCAACAGACT GAAAAGGTGC AAGCAGCTCG TCAGTTACGA 240

TTTGAAAATT TAACAGAGAT TCAAGCACGT ACCGAAGAAA TTAAATATTA TCTAGCCGAA 300 

GCAATTAAAG CTGAAAAAGC TGGTAAAAAA GTTGAAATGA AGAAAACAGA GGAATATGTT 360
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AAATTAACGC CAGGCAGACA ACATCAATAT ATATATCATA TTGGACAAGC TAAACGCAgT 480

GgAACAAGAC AAAAGCGTGT TGAAAAGTAT ATTAACCAAA TACTAGAAGG TAAAGGGATG 540

CATGATAAGT AATTAATGAG TAAAGCATAC CGGTTATACA ACAACATACA AGATGACACG 600

AAACAACCAA TGGCTCATGC TGTTGGTTGT TTTTTTAGGT GTGTCTGTCA TGGGCAACAC 660 

TTTGACGTTG GAATTCCGTT ACAGGCTTGG GAGTAGAAAA TGTTAGCAAA AGGCAAGGGT 720

GTCTACAATG AATGATGAAG ATATTAAAAT ATAAGGATGA CTTTGTGAGT GGCGGATGGG 780 

CGGTTGTCCG TCTGTAACAA TGGATGCGTG TGCATTATTA CAAAAATTCG ACTTTTGTAA 840 

TAATATTTCA CATTTTCGAC ACTTTTTTGC TATAAAACAA CCAATTGAGC GATAATAAAT 900 

TCGCTTTTAA AAAATATGAG TTATCTATTT AGTTGCCAAA GATAAAATAA TAATGTTTAA 960

TAACATCATA TAGAGTATGT TAGTTTTAAA TGTCGAATAT ACGAATGTGc AAACAAAGTA 1020 

ATCGGTAGAA ATTCAACATA CATAGCGCCG TTTACTGTTA AGTATTCACA TTACAGATGA 1080 

AAAATATAAA ATTCTACATA ATCAAGACCA TGATGTGTAC TTGTTTAACT TATGACTCTA 1140 

TTTGTTTAAC AATTGCGATA ATGGTCTTTT TATTTTATGC GTATCATTCG TCATATTTTT 1200 

TATGAGGAAG GAGAAATGAT TATGTTAAGT ATTAAGCATT TAACGAAAAT TTATTCTGGT 1260

AATAAAAAGG CAGTAGATGA CATCTCTTTA GATATTCAAT CTGGGGAATT TATCGCATTT 1320 

ATTGGAACCA GTGGAAGTGG CAAAACGACT GCTTTAAGAA TGATAAACCG TATGATTGAA 1380 

GCGACAGAAG GACAAATTGA AATTGATGGT AAAGATGTTC GGAGTATGAA TCCTGTCGAA 1440

TTGCGTAGAA ATATTGGCTA TGTTATTCAA CAAATTGGCT TAATGCCTCA TATGACGATT 1500

AAAGAGAATA TTGTGTTGGT ACCCAAATTG TTGAAATGGA CTAAAGAGGA AAAGGATAAA 1560

CGTGCAAAGG AATTAATTAA ACTTGTGGAT TTACCGGAGT CATTTTTAGA GCGTTATCCA 1620

GCAGAACTAT CAGGTGGGCA ACAACAACGT ATCGGTGTTG TAAGAGCACT TGCGGCCGAA 1680

CAAGATATTA TTTTAATGGA TGAACCTTTT GGTGCATTGG ATCCTATTAC GAGAGATACG 1740 

TTACAAGATT TAGTTAAAAC GTTAGAACGA AAATTAGGCA AGACGTTTAT CTTTGTAACA 1800

CATGATATGG ATGAAGCGAT TAAATTAGCA GACAAAATTT GTATTATGTC AGAAGGTAAG 1860

GTGGTGCAAT TTGATACGCC AGACAATATT TTAAGACATC CCGCAAATGA TTTTGTACGT 1920 

GATTTTATAG GACAAAATAG ACTGATTCAA GACCGTCCCA ATGACAAGAC TGTAGAAGGT 1980

GTAATGATTA AACCAATCAC GATACAAGCA GAAGCAACAC TGAATGACGC CGTTCATATT 2040

ATGAGACAAA AACGTGTTGA TACTATTTTT GTAGTAGATA GTAATAACCA TTTACTAGGT 2100

TTCTTAGACA TTGAAGATAT AAATCAGGGT ATACGTGGAC ACAAAAGTTT ACGAGACACC 2160 
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ATTTTAAAAA GAAACGTTAG GAATGTACCT GTCGTAGATG ATCAACAGCG TTTAGTAGGA 2280 

CTGATTACGC GTGCCAATGT TGTTGATATT GTATATGACA CGATTTGGGG CGATAGTGAG 2340 

GATACAGTGC AAACAGAACA TGTGGGGGAA GACAcTGCGT CCTCAAAAGT GCATGAGCAA 2400

CACACTACTA ATGTCAAAGT ACGTGACATA GGAGATGATA AATCATGATT GAGTTCCTAC 2460

ATGAACATGG TGGACAGTTG ATGTCGAAAA CACTGGAACA TTTCTATATT TCTATAGTGG 2520

CATTATTACT TGCCATCATT GTTGCAGTAC CTATAGGCAT TTTATTATCA AAAACAAAGC 2580 

GAACTGCCAA TATTGTATTA ACTGTGGCAG GTGTCTTACA AACTATTCCA ACACTAGCTG 2640 

TACTTGCTAT TATGATACCG ATTTTTGGTG TTGGTAAAAC GCCTGCAATT GTAGCGCTAT 2700 

TTATTTATGT ATTATTACCT ATTTTAAATA ACACGGTACT CGGTGTTCAA AATATTGATA 2760 

GCAACATTAA AGAAGCTGGA AAAAGTATGG GAATGACACA ATTTCAATTG ATGAAGGATG 2820 

TTGAATTGCC GTTAGCATTG CCGCTTATCA TTGGTGGCAT TCGTTTGTCA TCTGTGTATG 2880 

TAATTAGTTG GGCTACACTT GCAAGTTATG TAGGTGCGGG TGGATTAGGT GATTTCATTT 2940

TCAATGGTTT AAATTTATAT GATCCACTGA TGATTGTAAC TGCAACGGTA CTCGTTACTG 3000

CACTAGCATT AGGTGTTGAT GCCTTATTAG CTTTAGTTGA AAAATGGGTA GTTCCCAAAG 3060

GCTTAAAAGT ATCTGGATAA TTAGGAGGCT AAGATAATGA AGAAAATTAA ATATATACTT 3120 

GTCGTGTTTG TCTTATCGCT TACCGTATTA TCTGGATGTA GTTTGCCCGG ACTAGGTAGT 3180

AAGAGCACGA AAAATGATGT CAAAATTACA GCATTATCAA CAAGCGAATC GCAAATTATT 3240

TCACATATGT TACGGTTGTT AATAGAGCAT GATACACACG GTAAGATAAA GCCAACATTA 3300

GTAAATAATT TAGGGTCAAG TACGATTCAA CATAATGCCT TAATTAATGG GGATGCTAAT 3360

ATATCAGGTG TTAGATATAA TGGCACAGAT TTAACGGGAG CTTTGAAGGA AGCACCAATT 3420

AAAAATCCTA AGAAAGCAAT GATAGCAACA CAACAAGGAT TTAAAAAGAA ATTTGATCAA 3480 

ACGfTTTTTG ATTCGTATGG TTTTGCGAAT ACGTATGCAT TCATGGTAAC GAAGGAAACC 3540 

GCTAAAAAAT ATCATTTAGA GACAGTTTCA GATTTAGCAA AGCATAGTAA AGATTTACGT 3600 

TTAGGTATGG ATAGTTCATG GATGAATCGT AAAGGCGATG GCTATGAAGG ATTTAAAAAA 3660 

GAGTATGGTT TTGACTTTGG TACAGTGAGA CCAATGCAAA TAGGTCTAGT CTACGACGCA 3720 

TTAAACTCAG AGAAGTTAGA CGTTGCATTA GGTTATTCTA CAGATGGTCG AATTGCGGCG 3780

TATGATTTGA AAGTACTTAA AGATGATAAA CAATTTTTCC CACCTTATGC TGCGAGTGCT 3840

GTTGCAACAA ATGAATTATT ACGGCAACAC CCAGAACTTA AAACGACGAT TAATAAGTTG 3900

ACAGGAAAGA TTTCGACTTC AGAGATGCAA CGCTTGAATT ATGAAGCGGA TGGTAAAGGT 3960
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AAAGGTGGTC ATAAGTAATG GAAGGTAATT TATTACAGCA ATTATTCAAT TATTATGTTA 4080 

CGAACTTTGG TTATCTATGG GATTTATTTT TCAAACACTT ATTAATGTCT GTCTATGGTG 4140 

TGCTGTTTGC AgCTTTAATT GGTATTCCAT TGGGAATCTT GCTTGCaAGA TACACAAAAC 4200 

TTTCTGGATT TGTAATTACA ATTGCAAATA TAATTCAAAC AGTTCCAGTC ATTGCAATGT 4260

TAGCTATTTT AATGTTAGTC ATGGGCTTAG GTTCAGAAAC AGTAGTTTTA ACAGTGTTTT 4320

TATATGCGTT ACTTCCAATT ATAAAAAACA CTTATACTGG TATAGCTAGT GTTGATGCGA 4380 

ATATTAAGGA TGCTGGCAAA GGTATGGGAA TGACACGCAA TCAAGTGCTA CGAATGATTG 4440 

AATTACCGTT ATCTGTTTCG GTTATTATCG GTGGCATTCG TATTGCCTTG GTTGTTGCGA 4500 

TAGGTGTTGT TGCCGTTGGA TCATTTATAG GAGCACCTAC GCTTGGTGAC ATTGTGATTC 4560 

GTGGTACAAA TGCGACGGAT GGCACAACGT TTATTTTAGC AGGTGCGATT CCGATTGCTA 4620

TCATTGCAAT CGTCATTGAT GTACTATTAA GATTTTTAGA AAAACGATTA GACCCAACAA 4680

CACGACATCG TAAAAATCAA TCTAATCATC GGCCGCAAAG TATTAATATG TAATAGTAGA 4740

AGATGTTTAT AATTTAGCGA TTTCGTTTCA TGATTTATAA AAAATGAGGC TACTCAAGGA 4800

GCTCAAATAA TCTTTGAGTA GCCTTTTTAT AGGTTGTGTT TGTATGCGTT TACACTAAAA 4860

TAGCAATTAT TATCATGAAA GTTTTTGGAT AAAAAGCGTT AATTATTGTA AAAATACTAA 4920

AAAATGAGAT GTTTTATTTA TAATTTTCTG CAAATTTATG ATATTGTTTC TTAATATATC 4980

ATATTAAAAA TTTGTTTTTC TTAAACATAG GAGGCTTATC TAATTCATGG ACACATCAAA 5040

ACAATTTAGA GGTGACAACC GATTGCTTTT GGGTATCGTT TTAGGGGTTA TTACCTTTTG 5100 

GCTATTCGCG CAGTCACTTG TTAATCTTGT TGTCCCATTA CAATCAACAT ATAGTAGTGA 5160 

CGTTGGAACG ATAAATATCG CTGTTAGCTT ATCTGCCTTA TTTGCTGGTT TGTTTATCGT 5220

AGGTGCTGGT GATGTTGCTG ATAAATTTGG TCGCGTCAAA ATTACTTATG TAGGATTGAT 5280

ATTAAATGTT GTAGGTTCAT TACTCATCAT CATTACACCT TTGCCAGCAT TTTTAATTAT 5340

AGGTAGAATA ATTCAAGGTT TGTCTGCAGC ATGTATTATG CCATCAACAC TTGCTATTAT 5400 

TAACGAATAT TATATTGGTA CAAGAAGACA ACGTGCCTTA AGCTATTGGT CTATTGGTTC 5460

TTGGGGTGGT AGTGGTATTT GTAGGTTGTT TGGTGGCTTA ATGGCTACAT ATATAGGTTG 5520

GCGTTCAATA TTTGTTGTTT CAATTCTATT AACATTATTA GCAATGTACT TAATCAAACA 5580

TGCACCTGAG ACTAAAGCAG AACCAATCAA AGGTATGAAA GCAGAAGCTA AAAAGTTTGA 5640

CGTTATTGGT TTAGTCATTT TAGTAGTGAC GATGTTAAGT TTAAATGTAA TCATCACACA 5700

GACGTCTCAT TTTGGTTTAG TTTCACCGTT AATTCTAGGT TTAATTGTTG TGTTTATCTG 5760 
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AATTTTTAAA AATAGAGGAT ACAGTGGTGC AACTATTTCA AACTTCTTAT TAAATGGTGT 5880 

AGCAGGTGGT GCACTTATCG TTATTAACAC GTATTATCAA CAACAATTAG GATTTAATTC 5940 

TTCGCAAACG GGTTATATTT CATTAACGTA TTTAATAACA GTGTTGTCAA TGATTCGTGT 6000

AGGTGAAAAG ATTTTATCTC AACATGGTCC GAAGCGCCCA CTATTACTAG GAAGTGGCTT 6060 

TACAGTGATT GGGTTAATCT TATTGTCGTT AACATTTTTA CCAGAAGTGT GGTATATCAT 6120 

ATCTAGTATA GTTGGATATT TATTGTTTGG TACTGGTTTA GGATTATATG CTACACCATC 6180

AACTGATACA GCAGTTGCTA GTGCGCCAGA TGATAAGTCG GGTGTTGCTT CAGGTGTGTA 6240

TAAAATGGCG TCATCATTAG GAAATGCATT TGGAGTAGCA GTATCTGGTA CGGTTTATAC 6300

TGTGTTAGCA GCTAATTTAA ATTTGAACTT AGGTGGTTTC ACAGGTATGA TGTTTAATGC 6360 

CTTGCTAGCA ATTGTTGCAT TTTTAGTCAT TTTACTATTA GTTCCTAAAA ATCAAACGAA 6420 

TTTGTAAAAC TGAAATGAAA GCAAGTTATT ATGTAGGGAT TTTAAAGGAA ATTTTGTGAA 6480 

AGTAAGTTTA TCATACACAC TTAATGTTGC GTATTGACGT TTAATGTTAG GTGTGTTCTT 6540

TTATAGACGA TAAAAGCTGT GTGCATATTA AGCGAATGAT TTTCAAATTG ACGCTAATAT 6600

GCGAAAGTAG TATTTTTAAA ATGAACAACA ACGATGAAGA GGGGTTTATA GGATGAAAAT 6660

TGCAATTGCT GGATCGGGTG CATTAGGTAG TGGCTTTGGT GCCAAACTAT TTCAAGCAGG 6720 

ATATGATGTC ACACTTATTG ACGGATATAC ATCTCATGTT GAAGCGGTTA AGCAACATGG 6780 

ATTAAATATA ACGATTAATG GAGAGGCATT CGAGTTAAAC ATTCCGATGT ATCATTTTAA 6840

TGATCAACCG GACGAAAGCA TTTACGATGT TGTCTTTCTA TTTCCAAAGT CTATGCAATT 6900 

AAAAGAAGTG ATGGAAGATA TGAAGCCACA TATTGATAAT GAAACGATCG TCGTATGTAC 6960 

GATGAATGGT CTGAAGCATG AAGAAGTCAT TGCGCAGTAT GTTGCTCAAT CACAAATTGT 7020

CAGAGGTGTT ACGACTTGGA CGGCAGGTCT TGAAAGCCCT GGACACAGTC ATTTACTTGG 7080

TAGTGGACCA GTTGAAATAG GTGAACTAGT GGATGAAGGT AAAGAAAATG TTATAAAAGT 7140

TGCTGATTTA CTTAACGAAG CGGAATTGAA TGGTGTCATT AGTAAAGATT TATACCAATC 7200 

GATTTGGAAA AAGATTTGTG TTAATGGTAC GGCAAATGCA TTAAGCACAG TGTTGGAGTG 7260 

TAATATGGCA TCGCTGAATG AAAGTAGTTA TGCGAAGTGT TTGATTTATA AATTAACGCA 7320 

AGAAATAGTG CATGTAGCGA CGATTGATAA TGTTCATTTA AATGTTGATG AAGTATTTGA 7380 

ATATTTAGTT GATTTAAATG AAaAAGTTGG TGCGCATTAT CCATCCATGT ATCAAGATTT 7440 

AATTGTTAAT AATAGAAAAA CTGAAATTGA TTATATTAAT GGCGCAGTTG CAACATTAGG 7500 

SO 

TAAACAACGT CaTATTGAAG CGCCAGTCAA TCGCTTTATT ACTGATTTAA TTCATACTAA 7560 
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CAATCACGTG 


ATATTACGGT CATTATTAAG ATTGAAATGT AATAAATAAA GAACAGCAGT 7680

AAGGTACTTT CAAATTGAAA TGATCTTGGT GCTGTTTTTC TTGATTGATC TTCGTCATAA 7740

TTCAGATTTG TCATAGGCTA CGACATACTA TTAGTATTTA CTAGACAGTT TTTACGACGA 7800

CACTTTGAAA AATTTTGAGG CAAATCATTT GGAAGTCTCA CGTGAATTTT GTAAACTCAT 7860

CAAGCAAGTA ATTATATTAA AAAGACAAAT AGAGAAAAGG TGTTTATAAT GAGTAAAATT 7920

TTTGTAACTG GTGCAACGGG CCTTATTGGC ATTAAATTAG TTCAAAGACT AAAAGAAGAG 7980

GGGCATGAGG TTGCTGGTTT TACTACATCT GAGAATGGTC AACAAAAGCT AGCTGCTGTT 8040

AATGTAAAAG CATATATTGG TGATATATTA AAAGCTGATA CTATTGATCA AGCGTTAGCA 8100

GATTTTAAAC CAGAAATCAT TATCAATCAA ATTACGGATT TAAAAAATGT TGATATGGCA 8160

GCAAATACGA AAGTACGTAT TGAAGGTTCT AAAAACCTAA TTGATGCGGC GAAAAAGCAT 8220

GACGTTAAGA AAGTAATTGC CCAAAGTATT GCCTTTATGT ATGAACCTGG CGAAGGATTA 8280

GCAAATGAGG AAACTTCACT TGATTTTAAC TCAACTGGCG ATAGAAAAGT AACGGTTGAT 8340

GGTGTGGTTG GTTTAGAAGA AGAAACGGCT CGTATGGATG AATACGTTGT TTTACGTTTT 8400

GGCTGGTTAT ATGGCCCAGG TACTTGGTAC GGAAAAGATG GCATGATTTA TAATCAATTT 8460

ATGGATGGTC AAGTGACACT TTCAGATGGC GTAACATCAT TTGTGCATCT TGATGATGCA 8520

GTTGAAACAT CTATTCAAGC TATTCATTTT GAAAATGGTA TCTATAATGT AGCAGATGAT 8580

GCACCTGTTA AAGGTTCTGA ATTTGCAGAA TGGTATAAAG AACAACTTGG TGTTGAACCA 8640

AATATTGATA TTCAACCTGC GCAACCATTT GAACGTGGCG TAAGCAATGA GAAGTTTAAA 8700

GCGCAAGGTG GTACTCTGAT TTATCAAACT TGGAAAGATG GCATGAATCC AATTAAATAA 8760

TAATTTATCC GTTTAATATA CAAAGAATAA AGACTTGGTC GAATCGTGGA TGATATATTA 8820


TCAAACbl-AC 




Au hi 1 1 1 n 


e\ 1 ±f\± \s i- til 


CGTTATCTTT GTATGAAGGA 8880

ATAACAGAAT TACAATTAAT GTACTGAATA ATGCAATTAA TGTTGTGATT AGTGCTAATT 8940

TAATTTCTAT TGGTAGCCAA GTCAGTACAA AAGACCAATT ATTGCTACCG AGAATGAGAT 9000

ATGGTAATGC ATATAATATG AGCGCTAAAG CGATACATAT ACATAATGAT AACCAACTCA 9060

ATACAGCAAT CC 9072



(2) INFORMATION FOR SEQ ID NO: 46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16826 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 46: 

GTGGAACAGC TGTAACTATA TCATTTCTTT CAACATTTAT TGGGAAAATG TTAGCTACAT 60 

TTCTATATCC GATTAATAAT GTAGTACTTT CATATATnTC TGTAAATGAA AGTGACAATA 120 

TAAAGAAGCA ATATTTGaAA ACTAATCTAA TTGCTATAGC TGCCCTATGT TTAGTCATGA 180 

TTATATGTTA TCCAATTACA ATAATTATTG TCTCTTTACT GTATAACATT GATTCAAGTT 240 

TATATTCGAA GTTTATTATT TTAGGTAATA TAGGTGTTTT ATTCAATGCA GTGAGTATTA 300 

TGATCCAAAC TTTAAATACA AAACACGCAT CAATAACATT ACAAGCGAAT TATATGACGC 360 

TTCACACGAT TACATTTATA TTCATAACTA TTTTAATGAC AATTGCGTTT GGTCTAAATG 420 

GATTCTTTTG GACAACGCTG TTCAGCAACA TTATTAAGTA TGTGATTTTA AATATTATAG 480 

GTTTAAAGTC TAAATTCATT AATAAAAAGG ACGTCGATTA GATGAGTGAA AAAAAGATTT 540 

TGATTTTATG TCAGTATTTT TATCCGGAAT ATGTATCTTC TGCGACGTTA CCAACTCAAT 600 

TGGCGGAAGA TTTAATTGCG AATCACATTA ATGTCGATGT CATGTGTGGA TGGCCATATG 660 

AATATAGTAA TCATAAACAG GTTTCTAAAA CCGAGATGCA TCGTGGTATT CGCATTCGAC 720 

GTCTCAAGTA TTCGAGGTTT AATAACAAAA GTAAGGTTGG AAGGATCATC AATTTCTTTA 780

GTTTATTTTC AAAATTCGTG ATTAATATAC CTAAAATGTT GAAATATGAT CAGATTCTTG 840 

TTTACTCTAA TCCACCAATC TTGCCATTAA TACCAGACGT TTTACACAGA CTGCTTAAGA 900 

AAAAATATTC TTTTGTGGTG TATGATATAG CACCTGATAA TGCGATTAAG ACAGGTGCAA 960

CTCGTCCAGG TAGCATGATT GATAAGCTGA TGCGTTACAT TAATAGACAT GTCTACAAGA 1020 

ATGCTGAAAA TGTCATTGTC CTTGGTACGG AAATGAAAAA CTACTTACTA AATCATCAAA 1080 

TTTCTAAAAA TGCTGACAAT ATCCATGTGA TTCCTAACTG GTATGACATG CGTCAATTAC 1140 

AAG^CAATCG TATCTATAAT GACACATTTA AAGCTTACCG TGAGCAATAC GACAAAATTT 1200 

TATTGTATAG CGGTAATATG GGGCAGTTAC AGGATATGGA GACACTTATC TCATTTTTAA 1260

AATTAAATAA GGATCAGTCT CAAACGTTAA CAATACTTTG TGGTCATGGT AAGAAATTTG 1320 

CAGATGTCAA AACGGCAATA GaAGACCATC GTATTGAAAA TGTTAAAATG TTTGAGTTTT 1380 

TAACAGGTAC AGACTATGCT GACGTATTAA AAATTGCGGA TGTATGTATT GCATCGCTGA 1440 

TTAAAGAAGG CGTCGGTTTA GGCGTGCCGA GCAAGAATTA TGGCTATCTT GCAGCTAAGA 1500 

AAGCGTTGGT ACTCATCATG GATAAGCAAT CTGATATCGT TCAACATGTT GAACAATATG 1560 

ATGCGGGTAT CCAAATTGAT AATGGCGATG CACATGCCAT TTATAACTTC ATCAACACTC 1620 

ACTCGAGTAA GGAATTGCAC GAGATGGGTG AGCGCGCACA TCAACTGTTT AAAGATAAAT 1680 
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AAGCGATTAT TCGATGTAGT GAGTTCAATA TATGGTTTAG TAGTTTTAAG TCCGATTCTG 1800 

TTAATTACAG CATTACTAAT TAAAATGGAa TCACCTGGAC CAGCCATTTT CAAACAAAAA 1860 

AGACCGAGGA TTAATAATGA ATTGTTTAAT ATTTATAAGT TTAGATCAAT GAAAATAGAC 1920 

ACACCTAATG TTGCAACTGA TTTAATGGAT TCAACATCGT ATATAACAAA GACAGGGAAG 1980 

GTCATTCGTA AGACCTCTAT TGATGAATTG CCACAATTAT TGAATGTTTT AAAAGGAGAA 2040 

ATGTCAATTG TAGGTCCTAG ACCAGCGCTT TATAATCAAT ACGAATTAAT CGAAAAACGT 2100 

ACAAAAGCGA ACGTGCATAC GATTAGACCA GGTGTGACAG GACTAGCTCA AGTGATGGGG 2160

AGAGATGATA TCACTGATGA TCAAAAAGTA GCGTATGATC ATTATTACTT AACACATCAA 2220 

TCTATGATGC TTGATATGTA TATCATATAT AAAACAATTA AAAATATCGT TACTTCAGAA 2280 

GGTGTGCATC ACTAATGAGA AAAAATATTT TAATTACAGG CGTACATGGA TATATCGGTA 2340

ATGCTTTAAA AGATAAGCTT ATTGAACAAG GACATCAAGT AGATCAAATT AATGTTAGGA 2400

ATCAATTATG GAAGTCGACC TCGTTCAAAG ATTATGATGT TTTAATTCAT ACAGCAGCTT 2460 

TGGTTCACAA CAATTCACCT CAAGCAAGGC TATCTGATTA TATGCAAGTG AATATGTTGC 2520 

TGACGAAACA ATTGGCACAA AAGGCTAAAG CTGAAGACGT TAAACAATTT ATTTTTATGA 2580

GTACTATGGC AGTTTATGGA AAAGAAGGTC ATGTTGGTAA ATCAGATCAA GTTGATACAC 2640

AAACACCAAT GAACCCTACG ACCAACTATG GTATTTCCAA AAAGTTCGCT GAACAAGCAT 2700 

TACAAGAATT GATTAGTGAT TCGTTTAAAG TAGCAATTGT GAGACCACCA ATGATTTATG 2760

GTGCACATTG CCCAGGAAAT TTCCAACGGT TAATGCAATT GTCAAAGCGA TTGCCAATCA 2820 

TTCCCAATAT TAACAATCAG CGCAGTGCAT TATATATTAA ACATCTGACA GCATTTATTG 2880 

ATCAATTAAT ATCATTAGAA GTGACAGGTG TGTACCATCC TCAAGATAGT TTTTACTTTG 2940

ATACATCGTC AGTAATGTAT GAAATACGTC GCCAATCACA TCGTAAAACG GTATTGATCA 3000 

ACATGCCTTC AATGCTAAAT AAGTATTTTA ATAAGTTGTC GGTCTTTAGA AAATTATTCG 3060 

GCAATTTAAT ATACAGCAAT ACGTTATATG AAAATAATAA TGCACTTGAA ATTATTCCTG 3120 

GAAAAATGTC ACTTGTTATT GCGGACATCA TGGATGAAAC GACAACCAAA GATAAGGCAT 3180 

AAGTCATCTA TTAAATAAAA TCAACATACA AATCGTTTTA TTTGGAGGTT ATAGTATGAA 3240 

GTTAACAGTA GTTGGCTTAG GTTATATTGG TTTACCAACA TCAATTATGT TTGCAAAACA 3300 

TGGcGTCGAT GTGCTTGGTG TTGATATTAA TCAGCAAACG ATTGATAAGT TACAAAGTGG 3360

TCAAATTAGT ATTGAAGAAC CTGGATTACA AGAGGTTTAT GAAGAGGTAC TGTCATCGGG 3420

AAAATTGAAG GTATCTACAA CGCCAGATGC ATCTGATGTT TTTATCATTG CCGTTCCGAC 3480
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TAGTATTTTA TCATTTTTAG AAAAAGGAAA TACCATTATT GTAGAGTCGA CAATTGCGCC 3600 

TAAAACGATG GATGATTTTG TAAAACCAGT CATTGAAAAT TTAGGGTTTA CAATAGGTGA 3660 

AGATATTTAT TTAGTGCATT GTCCAGAACG TGTACTGCCA GGAAAAATTT TAGAAGAATT 3720

AGTTCATAAC AATCGTATCA TTGGCGGTGT GACTGAAGCT TGTATTGAAG CGGGTAAACG 3780

TGTCTATCGC ACATTCGTTC AGGGAGAAAT GATTGAAACA GATGCACGTA CTGCTGAAAT 3840 

GAGTAAGCTA ATGGAAAACA CATATAGAGA CGTGAACATT GCTTTAGCTA ATGAATTAAC 3900 

AAAAATTTGC AATAACTTAA ATATTAATGT ATTAGATGTG ATTGAAATGG CAAACAAACA 3960 

TCCGCGTGTT AACATCCATC AGCCTGGTCC AGGTGTAGGC GGTCATTGTT TAGCTGTTGA 4020 

TCCGTACTTT ATTATTGCTA AAGACCCTGA AAATGCAAAG TTAATTCAAA CTGGACGTGA 4080

AATTAATAAT TCAATGCCGG CCTATGTTGT TGATACAACG AAGCAAATCA TCAAAGTGTT 4140 

GAGCGGGAAT AAAGTCACAG TATTTGGTTT AACTTATAAA GGTGATGTTG ATGATATAAG 4200

AGAATCACCA GCATTTGATA TTTATGAGCT ATTAAATCAA GAACCAGACA TAGAAGTATG 4260 

TGCTTATGAT CCACATGTTG AATTAGATTT TGTGGAACAT GATATGTCAC ATGCTGTCAA 4320 

AGACGCATCG CTAGTATTGA TTTTAAGTGA CCACTCAGAA TTTAAAAATT TATCGGACAG 4380

TCATTTTGAT AAAATGAAGC ATAAAGTGAT TTTTGATACA AAAAATGTTG TGAAATCATC 4440

ATTTGAAGAT GTATCGTATT ATAATTATGG CAATATATTT AATTTTATCG ACAAATAAAA 4500 

TGTGTCAAAC TAGGGCATAC ATGATTAAGG AAAGATAAGC TGTCATGTGT TTGAACTTCA 4560

GAGAGGATAA TGTTATGAAA AAAATTATGG TTATTTTCGG TACGAGACCC GAAGCAATAA 4620 

AAATGGCACC ATTAGTAAAA GAAATTGATC ATAATGGGAA CTTTGAAGCG AACATTGTGA 4680 

TTACAGCACA ACATAGAGAT ATGTTAGATA GTGTGTTAAG TATATTTGAT ATTCAAGCTG 4740 

ATCAXGATTT AAATATTATG CAAGATCAAC AAACATTAGC AGGCCTTACG GCGAATGCAC 4800 

TTGCTAAACT TGATAGCATC ATTAATGAGG AACAACCGGA TATGATTTTA GTACATGGTG 4860 

ATACTACAAC GACTTTTGTA GGAAGTTTGG CAGCATTTTA TCATCAAATT CCGGTCGGAC 4920

ATGTAGAAGC TGGACTTCGA ACACATCAGA AATACTCACC ATTTCCTGAA GAGTTAAATC 4980

GAGTCATGGT AAGTAATATT GCTGAATTGA ATTTTGCGCC AACAGTAATT GCAGCTAAAA 5040

ATTTACTTTT TGAAAACAAA GACAAAGAGC GTATCTTTAT TACTGGAAAT ACAGTTATTG 5100 

ACGCATTGTC AACAACAGTT CAAAATGATT TTGTTTCAAC GATTATTAAT AAACATAAAG 5160 

GCAAGAAAGT TGTTTTACTA ACAGCGCATC GTCGTGAAAA TATTGGGGAA CCGATGCATC 5220

AGATTTTTAA AGCAGTAAGA GATTTGGCAG ATGAATATAA AGATGTTGTC TTCATTTATC 5280 
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GGATTGAATT AATTGAGCCA 


TTAGATGCGA TTGAGTTCCA TAATTTTACA AATCAATCGT 5400

ACCTCGTGCT GACAGATTCT GGTGGTATTC AAGAGGAGGC TCCTACATTT GGAAAACCTG 5460

TGTTGGTATT AAGGAATCAT ACAGAGCGTC CCGAAGGCGT TGAGGCGGGA ACATCGAGAG 5520

TAATTGGCAC AGATTATGAC AATATTGTTC GAAATGTGAA ACAATTGATT GAGGATGATG 5580

AAGCGTATCA ACGTATGAGT CAAGCGAATA ATCCATATGG TGATGGACAA GCATCACGAC 5640

GTATTTGTGA AGCAATAGAA TATTATTTTG GATTGCGCAC AGACAAGCCG GATGAATTCG 5700

TACCTTTACG TCACAAATAA TAAAAAACCC CTAATCATGA AGTTGGTTTA GACAACCAGC 5760

GGTGACTAGG GGTTTTTAAT ATATTTATTT TTGATAGTGG TAGCCAATAT CATATTTGAA 5820

TACTTTATTT GATAATATTG GACTTTGCTG TCCATCGTCA TCACTTTTTA AACGTACATT 5880

TTTATGAGCT TCTTTAAATA CATCGGAATT CAACCAATTA TTAAAGCTAT CTTCAGATTC 5940

CCAAATAGTT AAGATTTTAA CTTCGTCTGT ATCCTCGGTA TTTAATGTTT TAGTGACAAA 6000

CATTTGTTGG AAGCCTTCAA TAGTTTCAAT ACCTTGTCTA TTGTAAAAAC GTTCAATCGT 6060

TTCTTCCGCA CTGCCTTTTT GTAATTGTAA TCTATTTTCT GCCATAAACA TGGGCAATCA 6120

CTCCTCTATT TTATGATTTG ATTTGGGTAA TGTTTTTACA AATGTAAAGA GTACAGCGGT 6180

TTGTATGATA ACCATTATGA TTAATCCTAC ACGGACTGCA AGAACATCCA CCATATAAAT 6240

TGAAAAACCT ATTACAATGT ATAAGCTAAT TAAAATTTTA ATTTTCTGTT GTAGCGTGTA 6300

GCCTCGATGT AAATAAAAGT TTTCTACATA TTCTTTATAA AITITTOAT TAATAAGCCA 6360

ATTGTAAAAG CGATCTGAAC TTCGAGCAAA GCAAAAAACT GCTACGAGTA AAAAAGGGGT 6420

CGTTGGCAGT AAAGGTAATA CGGCACCTGC AATACCAAGC GCTGTAAATA TTAAGCCAAT 6480

GACGATTAAA ATAAGTCGCA TTGAAAAAAC TCCATTCTAG TACTAATGCG CATGTAATAT 6540

TGTTTTAGTA ATATAACTCA TGCTAAATAT AATGTGTATG ATAAGTGCAA TGACTCAGTA 6600

AAATGAAACG ATGTTGAATT ATCCTTGTCA CATTAACGCA TTTTAAGCGC GACTTTCATA 6660

ACAACCAAAC TATTTAATGA GAATTATTCT CAAGTATTAT AGTTATATTA TGTGTTTTAT 6720

TTTTGAAAAG TGCAATATGT TTTCGAAAAT AAGATTATTT TTATGTGCAA AAACGACGCA 6780

AAAGTTTTAA AAATGAGACT TCTGTGAGCT GATTATTTTA TAAAATGTAA ACGCTTACTA 6840

TATAATGTGA ATCATATCGT TTAAAAGCAT TATTAAATAT GATGCTAAGA GATTTATATT 6900

ATAGCCAATA AACAAAGGAG AGATAATATG GCAGTAAACG TTCGAGATTA TATTGCAGAG 6960

AATTATGGTT TATTTATCAA TGGGGAATTT GTTAAAGGTA GCAGTGACGA AACAATCGAA 7020

GTGACTAATC CAGCAACTGG AGAAACACTA TCACATATTA CAAGAGCAAA AGATAAAGAT 7080
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TCAGAACGTG CACAAATGTT GCGTGATATT GGTGATAAAT TAATGGCACA AAAAGATAAA 7200 

ATTGCAATGA TTGAAACATT AAATAATGGT AAACCGATTC GTGAGACAAC AGCAATTGAT 7260 

ATTCCATTTG CTGCAAGACA TTTCCATTAT TTCGCAAGTG TTATTGAAAC AGAAGAAGGT 7320 

ACAGTGAATG ATATCGATAA AGACACAATG AGTATCGTAC GACATGAGCC GATTGGCGTC 7380

GTAGGTGCTG TTGTTGCTTG GAACTTCCCA ATGCTATTAG CTGCATGGAA GATTGCGCCA 7440

gCCATTGCTG CAGGTAATAC AATTGTGATT CAACCTTCGT CTTCAACACC ATTAAGTTTA 7500 

TTGGAAGTTG CTAAAATTTT CCAAGAGGTA TTACCTAAAG GTGTTGTCAA TATACTAACG 7560 

GGTAAAGGTT CAGAATCAGG TAATGCAATT TTCAATCATG ATGGTGTAGA TAAATTATCA 7620 

TTTACGGGCT CAACTGATGT AGGTTATCAA GTTGCCGAAG CTGCAGCAAA ACATCTAGTA 7680 

CCCGCTACAT TAGAGCTTGG TGGTAAAAGC GCCAATATCA TATTAGATGA TGCTAATTTA 7740 

GACCTTGCAG TTGAAGGTAT TCAGTTAGGT ATTTTATTCA ACCAAGGTGA AGTATGTAGT 7800

GCAGGTTCTC GATTATTAGT TCATGAAAAA ATTTATGATC AATTGGTGCC ACGTTTACAA 7860 

GAGGCATTTT CAAATATTAA AGTTGGAAAT CCACAAGATG AAGCTACACA AATGGGTAGT 7920 

CAAACTGGTA AGGATCAATT AGATAAAATT CAATCATATA TTGATGCAGC AAAAGAATCA 7980

GATGCACAAA TTTTAGCAGG CGGTCATCGC TTAACTGAAA ATGGATTAGA TAAAGGGTTC 8040 

TTCTTTGAGC CGACATTAAT TGctGTGCCA GACAATCATC ACAAATTAGC ACAAGAAGAA 8100 

ATATTTGGAC CAGTGTTAAC AGTGATTAAA GTGAAGGACG ATCAAGAAGC AATTGATATA 8160

GCTAATGATT CTGAGTATGG TTTAGCAGGC GGTGTATTTT CTCAAAATAT CACACGTGCA 8220 

TTAAATATTG CTAAAGCTGT ACGTACAGGA CGTATTTGGA TTAACACTTA CAACCAAGTA 8280 

CCAGAAGGCG CACCATTTGG TGGTTATAAA AAATCAGGTA TCGGTCGAGA AACTTATAAA 8340

GGTGCGTTAA GTAACTATCA ACAAGTTAAA AATATTTATA TTGATACAAG CAATGCTTTA 8400

AAAGGTTTGT ACTAGAATAA ATATCGTTTC TGAAGCGTGT TTGTAGGTCA GTCTAGCGGT 8460 

AAGTCTTAAC ATTTAACGGC GTTGTTTAGA TTTTAAGCAA AACAAAATAT ATAGGAACAC 8520

GTATCATGAT ATTAGGATAT AATGACTAAA ATAATAGCAG TAGGATGGTT TTTAATTGCA 8580 

AATCATCTTA CTGCTGTTTT TAATTATGCT AATTTGCGAT GCGGCTATTA TAAGGACAGA 8640

GTTGTTTATT AATTATGGTG ATTTAGAAAT ATGAAGTTCA ATATGCAAAG TCATCGTTTG 8700 

TTTTAATATG CGGAACAATC ATTAAAGTTA TTGCGATTTT TTGAACTTAA TGAAACTAAA 8760 

CAATAAATTT GAGATACTTT TTTGTCATTT TTATGTAACT AACACAATAA TCTCGTACAT 8820

TATTAAAATT TTCTATATGA TAGGAATAAA GCAAAGCGCG AGTGTGCTGT AAAAGTTTTC 8880
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GATGATGTAT AAATCATGGT TAATTACGGA 
AGCATTAATA TTAACCTGAG AAGCTATAAA 9000

GAATTATTTT TAAAAGCGAC AATATTAAAT ACGACGCATT TATTTAGGAG TGGCAAACGT 9060

ATGAATGGGA AAAAGGCGAA TACGATAAAC AGATACAAAT ATTTTCATCA TGTCAATCAT 9120

CAAAAAATTC AACAAAGTTC TAAAAAGACG CTGTGGGCAT CACTAATCAT CACATTGTTA 9180

TTTACAGTGA TTGAATTTGT GGGAGGTTTA GTATCTAATt CATTGGCATT ACTGTCAGAT 9240

TCATTTCATA TGCTTAGTGA TGTATTAGCA CTTGGTTTAT CTATGTTGGC CATTTATTTT 9300

GCAAGTAAAA AGCCGACTGC ACGATACACA TTTGGATATT TAAGATTTGA GATATTAGCT 9360

GCATTTTTAA ATGGTTTAGC ATTAATTGTA ATTTCAATCT GGATTTTATA TGAAGCTATT 9420

GTACGTATTA TTTATCCGCA ACCAATTGAA AGTGGCATTA TGTTTATGAT TGCTAGTATT 9480

GGTTTACTCG TCAATATTAT TTTGACTGTT ATCCTTGTAA GGTCTTTAAA ACAAGAAGAC 9540

AATATCAATA TTCAAAGTGC ATTATGGCAT TTCATGGGAG ACTTATTGAA CTCTATTGGT 9600

GTCATCGTTG CAGTTGTATT GATTTACTTT ACAGGATGGC GCATCATCGA CCCAATCATT 9660

AGTATTGTAA TTTCACTCAT CATTTTACGT GGTGGTTATA AAATTACGCG TAATGCgTGG 9720

tTAATTTTAA TGGAAAGTGT GCCTCAACAT TTGGATACTG ATCAAATTAT GGCAGATATT 9780

AAAAACATAG ATGGCATATT AGATGTACAT GAATTTCATT TGTGGAGTAT TACAACAGAG 9840

CATTATTCAT TAAGTGCCCA TGTTGTGTTA GATAAAAAAT ATGAGGGTGA TGATTATCAA 9900

GCGATTGATC AAGTATCATC ATTGTTGAAA GAAAAATATG GCATTGCACA TTCAACGTTG 9960

CAAATTGAAA ACTTGCAATT GAATCCATTA GATGAGCCAT ACTTCGACAA ATTAACATAA 10020

ATAAAACATT GTAGCGCCTA AAACATTAAT CTATGTCATA GGCGCACGTT TCGTTTTATA 10080

CTTATGTTGC ATCATTTAAA TGATTTTCGT CAATTTCTTT GATGCTATCT ACATCTAACA 10140

CGAC&TCTTT AGGTTTCAAA ATATGAATAT GTTTTTCATC ATTTGTATGT AAAATGCGTT 10200

CTATGATGTA CCTTTGACCG GCCATTGTTT CTACAGCAAT CTTTTTGTTT CTAGCTAAAC 10260

TTGCTACGAC AGATTCTTTA TCCATAATGA TAGCCCCCTA TATATATGTT TATTTACTTA 10320

TACCCTAACA TGATTTT7AT ACTCTTTGAA AATATATTTT ACAGAATTTT ATCTAAATAT 10380

TTAAAAAAAT ATCTTAATAT CCTTGTAATC CGATAAGAAT TATAGTAATA TTTTTTCAAC 10440

CATtGTTATA GGAGGTCTTA TTAATGACAT TATTTTTATT AGAAGCTAAC AATCTTGATT 10500

TTGCATCAAC GAAAGAAGAA CTAGAAGCAA AGGCAGCATC ACTATCTACG AAGACAATTC 10560

CAACATTAAT TGAAGTACAA GCTACTGAAA ATTTAACTCA TGGTTATTTT ATTGTGGAAG 10620

CAAATGACGA aGCAGAAGCT AAACAATTTT TAACAGAAGC AGATATTAGT ATTCAATTAG 10680
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TTGATTACCT TGTAACTTGG AACATTCCGG AAGGCATTAC GATGGATCAA TATTTAGCAC 10800 

GTAAAAAGAA AAATTCTGTT CATTATGAAG AAGTGCCAGA AGTTGAATTT AAACGCACAT 10860 

ATGTATGTGA AGATATGTCT AAATGTATTT GTTTATACAA CGCACCTGAT GAAGAAGCGG 10920 

TACGTCGCGC GCGCAAAGCA GTTGATACAC CGATTGATGG CATCGAAAAA CTTTAATAAG 10980 

ACAACAAGTT GATGAGATAT ATGTATATAG GTTTGGCATG GATTTCGATT GCAGTTAATT 11040 

AGAATAGCTC AATGCTATAA ATGTAAGTAG TTGATATGAA GAAACTAATG AACTAAATGC 11100 

AAGTATTGTC TAAAACAATC ATTTTATTGA AATTTAGTAG AGCTGAAATT AATATAACGT 11160 

CGTTAATTGA ATAACGCTTA TGTTATAAGA GCACTCATAC CAAACCATAA TCATCTATAG 11220 

ATATAACAAT TCACGATATA AGGGCTGTGT TTGGCATAGC CCTTTAGATA TACACTTAAT 11280 

TCCTATTAAA ATAGTAGGGA TTAAAAGGGG GCTTGTCATG ATTAAAATTC AACAATTACA 11340 

ACATCACTTT GGATCACATA AAGTAATTCA TAACTTTAAT TTGGACATTA GCAAGGGAGA 11400

AATAGTCACT TTCATAGGGA AAAGTGGTTG CGGAAAGTCT ACTTTACTCA ATATTATCGG 11460 

TGGATTTATT CATCCATCGT CTGGTCGTGT CATTATTGAT AACGAAATTA AACAACAGCC 11520 

ATCTCCAGAT TGTTTAATGC TATTTCAACA TCATAATTTG CTGCCATGGA AAACGATTAA 11580

TGACAACATT AGGATTGGAT TACAACAGAA AATTAGTGAT GAAGAGATTA ACGCACAGCT 11640 

TAAATTAGTT GATTTAGAAG ACAGGGGAAA GCATTTTCCC GAGCAACTGT CCGGGGGTAT 11700 

GAAACAACGT GTGGCACTAT GTCGAGCGCA TGTGCATAAG CCTAACGTTA TATTGATGGA 11760

TGAGCCATTA GGTGCATTAG ATGCATTTAC ACGTTATAAA CTTCAGGATC AACTAGTGCA 11820

aCTAAAACAT AAAACGCAAT CAACTATTAT TTTAGTGACG CATGACATTG ATGAAGCTAT 11880

TTATCTTTCC GACCGCATTG TTCTGTTAGG TGAAGGGTGC AATATTATTT CTCAATATGA 11940

AATTACAGCA TCACATCCAC GCAGTCGTAA TGATAGCCAC CTACTTAAGA TTCGTAATGA 12000 

AATTATGGAA ACATTTGCAT TGAATCATCA TCAAGTTGAA CCTGAATATT ATTTATAAGG 12060 

AGTGAGTGAC GATGAAAAGG TTAAGCATAA TCGTCATCAT TGGAATCTTT ATAATTACAG 12120 

GATGTGATTG GCAAAGGACG TCTAAAGAAC GGTCTAAAAA TGCCCAAAAT CAGCAAGTGA 12180 

TTAAAATTGG ATATTTGCCG ATTACACATT CAGCTAATTT GATGATGACT AAAAAATTAT 12240

TATCACAATA CAATCATCCG AAATATAAAC TAGAATTAGT TAAATTCAAT AATTGGCCAG 12300 

ATTTAATGGA CGCATTAAAC AGTGGTCGTA TTGATGGTGC ATCAACTTTA ATAGAGCTAG 12360 

CGATGAAATC AAAACAGAAG GGCTCAAATA TAAAGGCTGT GGCATTGGGC CATCATGAAG 12420

GCAATGTCAT TATGGGACAA AAAGGTATGC ACTTAAATGA ATTTAATAAT AATGGCGATG 12480 
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GTAAACAATT AAAGATTAAA CCGGGGCATT TTAGCTATCA TGAAATGTCG CCAGCAGAAA 12600 

TGCCAGCCGC ATTGAGTGAA CACAGAATTA CAGGGTATTC TGTAGCCGAA CCATTCGGTG 12660 

CACTGGGTGA AAAGTTAGGC AAAGGTAAGA CTTTGAAACA TGGTGATGAC GTTATACCTG 12720 

ATGCGTATTG CTGTGTGCTA GTACTGAGAG GGGAATTGCT TGATCAACAC AAGGATGTAG 12780 

CGCAAgCATT TGTACAAGAT TATAAAAAGT CTGGCTTTAA AATGAATGAT CGCAAGCAAA 12840

GTGTAGACAT TATGACGCAT CATTTTAAAC AAAGTCGTGA CGTTTTAACA CAGTCAGCGG 12900 

CATGGACATC CTATGGTGAT TTAACAATTA AGCCATCCGG CTATCAAGAA ATTACGACAT 12960 

TGGTAAAACA ACATCATTTG TTTAATCCAC CTGCATATGA TGACTTTGTT GAACCGTCAT 13020 

TGTATAAGGA GGCATCGCGT TCATGACACG TCCCACAAAT AACAAATTTA TATTACCTAT 13080 

TATCACATTT ATTATTTTCT TAGGCATTTG GGAAATGGTC ATTATTATTG GGCATTACCA 13140 

ACCTGTATTG TTACCGGGTC CTGCTCTTGT AGGAAAAAGT ATATGGTCTT TCATTGTTAC 13200 

TGGAGAAATT TTCCAACATT TAGCAATTAG TTTATGGAGA TTTGTAGCGG GCTTTGTTGT 13260 

CGCATTGTTG GTTGCTATTC CATTGGGCTT CTTGCTTGGA AGGAATCGTT GGCTATACAA 13320 

CGCTATCGAA CCGCTATTTC AATTGATTAG GCCGATATCT CCGATAGCAT GGGCACCATT 13380 

TGTTGTTCTA TGGTTTGGTA TTGGTAGTTT GCCAGCGATT GCGATTATTT TTATCGCTGC 13440 

TTTTTTCCCA ATTGTGTTCA ATACTATTAA AGGCGTTAGA GACATTGAAC CTCAATATTT 13500 

AAAAATAGCA GCAAATTTAA ATTTAACTGG GTGGTCATTG TATCGCAATA TATTATTTCC 13560 

CGGGGCATTT AAACAAATCA TGGCTGGGAT ACATATGGCG GTAGGAACAA GTTGGATATT 13620 

TTTAGTTTCT GGTGAAATGA TTGGTGCACA ATCGGGATTA GGTTTTTTAA TCGTTGATGC 13680 

ACGAAATATG TTGAACTTAG AAGATGTTTT AGCAGCAATA TTCTTTATCG GATTATTTGG 13740 

TTTTATTATT GATCGATTCA TTAGTTATAT TGAGCAGTTT ATACTTAGAA GATTTGGTGA 13800 

ATAAGGAGAG ATGATGATGA CTTTAGAAAC GCTTATCAAA GAACAATTAG ATCCTCATTT 13860

AGTAGAAGTT GATGAAGGGA CGTATTATCC GAGAACATTT ATTCAGCAAT TATTTGTAGA 13920

TGGTTATTTC GGTGAGGCGG CATTGAGAAA AAATGCTGAA GTAATCGAAG CTGTATCGCA 13980 

GTCTTGTTTG ACAACAGGAT TTTGTTTATG GTGCCAATTA GCTTTTTCAA CGTATTTAGA 14040 

AAATGCCACG CAGCCACATT TAAATAATGA CTTACAACAG CAATTGTTAT CTGGAGAAAT 14100 

ATTAGGTGCT ACCGGATTGT CTAATCCGAT GAAGTCATTT AATGATTTAG AAAAGTTGAA 14160

CCTTGAACAC ACTTATGTTG ATGGACAATT GGTTGTCAGT GGACGTATGC CAGCTGTAAG 14220 

TAATATTCAA GAAGACCATT ATTTTGGTGC GATTTCGAAA CATGAATCAT CAGATGAATT 14280 
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TTTAGGAGTC 
AACGGGTCAG CAACGTATCA AATCACATTG AATCAAGTCG TAGTGCCACA 14400

ATCACAAATT ATCACGCATG ATGCGAAGCA GTTTGCGGCA ACTATTCGCC CGCAATTTAT 14460

TGCTTACCAA ATTCCAATAG GATTAGGCTC AATTAAAAGT TCTTTAGAGT TAATTGATGC 14520

ATTTTCAAAT GTGCAAAACG GAATAAATCA ATATTTAGAG TATGATGTTG AAGCTTTTAA 14580

AAAACGTTAT CGTCAACTTA GAGAGGAATA TTATGCAATA TTAGATGACG GTAACTTAAC 14640

TTCACATTTA AATGAATTAA TATCATTGAA GAAGGACATC GGCTATTTAT TGTTAGATGT 14700

AAATCAAGCT TCTGTTGTCA ATGGTGGTTC TAGAGCGTAC ACACCATATT CGCCACAAGT 14760

TCGCAAGTTA AAAGAAGGAT [missing] AGCATTGACA CCGACATTAA GACATTTAGG 14820

TAAACTTGAA GCAGAGTTGA AGGGGTAAGT GTGATAAGCT GATTTTTTGT TTAGATGCGT 14880

TTGTTGAAAC ATTTTTTAAA ATAATATAAA TCTTAGTTTA TAAACATTTT CTGTTAATTT 14940

GTTATATCCT TTTAACTAGG AAAATATACA TTTCGTAATA ATAATAATCG TTATCATTGA 15000

AAAAGTGTTA ATAAGGTGTA TAATGAAAAT GTGAACAATT AATGAACTTC TTATTTTAAA 15060

GAAGGTGAAT ACTATAGATA CGCATACTAA AGAACAACAA TTCTCGAATC TAGTAAGATC 15120

TTATCGTAAA GAATACGTGG GTAAAGGACC CAATAGTATT CGAGTGTCGT TTAAAGATAA 15180

TTGGGCGATT GCACATATGA CAGGTGTTTT GAGTAAAGTT GAGAGTTTTT ACCTAAACGA 15240

CAAACGCAAT GAATCGATGC TCCATTATAC ACGCACAGAG AAGATTAAAC AGATGTATAA 15300

AGAAATAGAT GTAAATGAGA TGGAAAGTCT TGTAGGCGCT AAGTTTGTAA AATTATTTAC 15360

AGATATTGAT TTGAATGATG ATGAAGTCAT TTCAATATTT GTTTTCGATA AGTCAATAGA 15420

ATAAGTGTTG CTGGTGTAAG GTACACGGTG CTGTTTGCTA ACTTCGCTTT GAATTTAACA 15480

ATAATTCAAG GGGGTGGTAT GTCAAACGGT GccGTrrrrr TGTCATATTT TTAAAACAAG 15540

CAAdATGCAA CACGTACTTT AAGGAAGTCA AAATTTATCA TTTAGGAGAG ATGGATATGA 15600

AAATCGTAGC ATTATTTCCA GAAGCAGTAG AAGGTCAAGA AAATCAATTA CTTAATACTA 15660

AAAAAGCATT AGGATTAAAA ACATTTTTAG AGGAAAGAGG ACATGAGTTC ATTATATTAG 15720

CAGATAATGG TGAAGACTTA GATAAACATT TACCAGATAT GGATGTGATT ATTAGTGCGC 15780

CATTTTATCC TGCATATATG ACTCGTGAAC GTATTGAAAA AGCACCGAAC TTGAAATTAG 15840

CAATTACAGC AGGTGTAGGA TCTGACCATG TAGATTTAGC GGCAGCAAGT GAACACAATA 15900

TTGGTGTCGT TGAAGTTACA GGAAGTAATA CAGTTAGTGT GGCAGAACAT GCGGTTATGG 15960

ATTTATTAAT ACTTCTTAGA AACTATGAAG AAGGTCATCG TCAATCAGTA GAAGGTGAAT 16020

GGAACTTGTC TCAAGTAGGT AATCATGCGC ATGAATTACA ACACAAAACA ATTGGTATTT 16080
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TACAACACTA TGATCCAATC AATCAACAAG ACCATAAATT GTCTAAATTT GTAAGCTTTG 16200 

ATGAACTTGT TTCAACAAGT GATGCGATTA CAATTCATGC ACCATTAACA CCAGAAACTG 16260 

ATAACTTATT TGATAAAGAT GTTTTAAGTC GTATGAAAAA ACACAGTTAT TTAGTGAATA 16320

CTGCACGTGG TAAAATTGTA AATCGCGATG CGTTAGTTGA AGCGTTAgCA TGCGAGCATT 16380

TACAAGGATA TGCTGGTGAT GTTTGGTATC CaCAACCtGC ACCTGCTGAT CATCCATGGA 16440 

GAACAATGCC TAGAAATGCT ATGACGGTTC ACTATTCAGG TATGACTTTA GAAGCACAAA 16500 

AACGTATTGA AGATGGAGTT AAAGATATTT TAGAGCGTTT CTTCAATCAT GAACCTTTCC 16560 

AAGATAAAGA TATTATTGTT GCAAGTGGTC GTATTGCTAG TAAAAGTTAT ACAGCTAAAT 16620

AGAATAAGGA TGCTGGGCTA GCGATTAACG CTTTCAATTT TATATAAATG AATCATATAA 16680 

GCACTACTGC TGTTGTAAAG ATGGCAGTAG TTTTTTTATG ATTACATCTA AGTATAGTCA 16740 

CGGCTATGTT AGGACAATGA TTTAACATTT ACGCACATAT GTGTTCACTT ACGCAATTAT 16800 

TGAliAAATnT CATTCATGTG GnAATC 16826 
(2) INFORMATION FOR SEQ ID NO: 47:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4012 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 47:
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TTCAATGAGA 


GTAGTGGGCT GATGTTTAGC GATATCGCGT AAGATTAACC ATTGGCCATA 60

ATATATATTG TGTTTTTCTA AAATCGGCTC GGCTAATTTT AAATAGGGGC GATATATTGT 120

TATAAAACTA TTGAAAAATT CTTGTGATAG CATAGTGACA TCTCCTAAGA CAAAATAGTT 180

AGCTTAGCTA mCCTTTTTAC AACAATAGTA ATTATAAAAC GGGAGCAATT AGAAATCAAT 240

ATATAATTAT TAAGAGCAAA AATAATTATA CTTTGTTAAA ATAAGCGTAA TTACATGTAA 300

ATAGGGGGAT ACTAATGATA TTGAAATTTG aTCACATCAT TCATTATATA GATCAGTTAG 360

ATCGGTTTAG TTTTCCAGGA GATGTTATAA AATTACATTC AGGTGGGTAT CATCATAAAT 420

ATGGAACATT CAATAAATTA GGTTATATCA ATGAAAATTA TATTGAGCTA CTAGATGTAG 480

AAAATAATGA AAAGTTGAAA AAGATGGCAA AAACGATAGA mGGCGGAGTC GCTTTTGCTA 540

CTCAAATTGT TCAAGAGAAG TATGAGCAAG GCTTTAAAAA TATTTGTTTG CGTACAAATG 600

ATATAGAGGC AGTTAAAAAT AAACTACAAA GTGAGCAGGT TGAAGTAGTA GGGCCGATTC 660
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ATCAGGATGA TGATGAAATT AAGCCACCAT TTTTTATTCA ATGGGAAGAA AGTGATTCCA 780. 
TGCGTACTAA AAAATTGCAA AAATATTTTC AAAAACAATT TTCAATTGAA ACTGTTATTG 840

TGAAAAGTAA AAACCGATCA CAAACAGTAT CGAATTGGTT GAAATGGTTT GATATGGACA 900

TTGTAGAAGA GAATGACCAT TACACAGATT TGATTTTAAA AAATGATGAT ATTTATTTTA 960 

GAATTGAAGA TGGTAAAGTT TCAAAATATC ATTCGGTTAT CATAAAAGAC GCACAAGCAA 1020 

CTTCACCATA TTCAATTTTT ATCAGAGGTG CTATTTATCG CTTTGAACCA TTAGTATAAA 1080

TATACGTAAG TGCTATGAGC GAGAATGCCC ATATGAATAA TGACAAGCAC AATGGAAAGA 1140

ATCGTTAATA TATTATTTAA TCGTGATGAC TTAATTAAAA TGAAAAAGAT TGATAATATA 1200 

AATGTGAAAA AGATAAGTAT AACCCGTAAA CTAAAGTAAT TCACGGTGAG AGGTTGACTC 1260 

AATGTCATAA TGATTGCAAC GATGTTCATA ATTATAAATA GACTTAAAAT AATTGTTCTC 1320 

ATATCAAACA CCTCATTGTT AGATTATTGA CATTATAACA GGGGTAATTG TATATGAACA 1380

TTAATGTGGT TGCTTGAGGA AAAATTTATT CATTGAAGTC AAGTTGGTTC ATTTTAGAAA 1440 

TGAATATCGT GTTAGATGAT GAAAGTATAT TGAAGTATAG GTAACTAGTT GAAAAGTATT 1500 

AATTGTACGA TAACATTAAA TTTAACACGA AACATAGATA TAAAATGATT CACAATTAAA 1560

ATGGGTAAAT TTGAACTTGC TAAACTATTA ATTGGAGCAT GGACATTTCA AAAATAAGAG 1620

TTCAAATCTT ACACAAGCTC TGAATCGACA CTATAAGATA CAAACTGTAT AATTAAAGGT 1680 

ATTGTTAAAT AGAAGGAGAT ATCATAAATC ATGGAAAAGA TGCATATCAC TAATCAGGAA 1740

CATGACGCAT TTGTTAAATC CCACCCAAAT GGAGATTTAT TACAATTAAC GAAATGGGCA 1800 

GAAACAAAGA AATTAACTGG ATGGTACGCG CGAAGAATCG CTGTAGGTCG TGACGGTGAA 1860 

GTTCAGGGTG TTGCGCAGTT ACTTTTTAAA AAAGTACCTA AATTACCTTA TACGCTATGT 1920

TATMTTCGC GTGGTTTTGT TGTTGATTAT AGTAATAAAG AAGCGTTAAA TGCATTGTTA 1980 

GACAGTGCAA AAGAAATTGC TAAAGCTGAG AAAGCGTATG CAATTAAAAT CGATCCTGAT 2040 

GTTGAAGTTG ATAAAGGTAC AGATGCTTTG CAAAATTTGA AAGCGCTTGG TTTTAAACAT 2100

AAAGGATTTA AAGAAGGTTT ATCAAAAGAC TACATCCAAC CACGTATGAC TATGATTACA 2160 

CCAATTGATA AAAATGATGA TGAGTTATTA AATAGTTTTG AACGCCGAAA TCGTTCAAAA 2220 

GTGCGCTTGG CTTTAAAGCG AGGTACGACA GTAGAACGAT CTGATAGAGA AGGTTTAAAA 2280

ACATTTGCTG AGTTAATGAA AATCACTGGG GAACGCGATG GCTTCTTAAC GCGTGATATT 2340 

AGTTACTTTG AAAATATTTA TGATGCGTTG CATGAAGATG GAGATGCTGA ACTATTTTTA 2400

GTAAAGTTGG ATCCAAAAGA AAATATAGCG AAAGTAAATC AAGAATTGAA TGAACTTCAT 2460
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CAAAATATGA TTAATGATGC GCAAAATAAA ATTGCTAAAA ATGAAGATTT AAAACGAGAC 2580 

CTAGAAGCTT TAGAAAAGGA ACATCCTGAA GGTATTTATC TTTCTGGTGC ACTATTAATG 2640 

TTTGCTGGCT CAAAATCATA TTACTTATAT GGTGCGTCTT CTAATGAATT TAGAGATTTT 2700

TTACCAAATC ATCATATGCA GTATACGATG ATGAAGTATG CACGTGAACA TGGTGCAACA 2760 

ACTTACGATT TCGGTGGTAC AGATAATGAT CCAGATAAAG ACTCAGAACA TTATGGATTA 2820

TGGGCATTTA AAAAAGTGTG GGGAACATAC TTAAGTGAAA AGATTGGTGA ATTTGATTAT 2880

GTATTGAATC AGCCATTGTA CCAATTAATT GAGCAAGTTA AACCGCGTTT AACAAAAGCT 2940 

AAAATTAAAA TATCTCGTAA ATTAAAACGA AAATAGATTA ACGACTGAAA TCTGAACGCT 3000 

CATAAGACTG TCATTTGCGT TCAGATTTTT TTACACAATA TAGAATGGTT GAGTAAAATA 3060 

TTTTTGAATA TAGTGAAAGA GGGGGAAGTA CTGTGATAAA AAAGCTATTA CAATTTTCTT 3120 

TAGGGAATAA GTTTGCTATC TTTTTAATGG TTGTTTTAGT TGTCTTGGGC GGTGTATATG 3180 

CGAGTGCTAA ATTGAAATTA GAATTACTAC CAAATGTACA AAATCCAGTT ATTTCAGTTA 3240 

CAACAACAAT GCCGGGTGCA ACGCCACAAA GTACCCAAGA TGAAATAAGT AGTAAAATTG 3300 

ACAATCAAGT AAGATCATTG GCATATGTGA AAAATGTTAA AACGCAATCC ATACAAAATG 3360

CTTCAATTGT AACAGTTGAA TATGAAAATA ATACAGATAT GGATAAAGCA GAAGAACAGC 3420 

TTAAAAAAGA AATCGATAAA ATTAAATTTA AAGATGAAGT TGGTCAACCA GAATTAAGAC 3480 

GTAATTCGAT GGATGCTTTT CCGGTTTTAG CATATTCATT TTCAAATAAA GAGAATGACT 3540

TGAAAAAAGT AACGAAAGTA CTGAATGAAC AATTAATACC AAAATTGCAA ACGGTAGATG 3600 

GTGTGCAAAA TGCGCAATTA AATGGGCAGA CGAACCGTGA AATCACCCTT AAATTTAAGC 3660 

AAAATGAACT TGAAAAATAT GGGTTGACTG CTGATGATGT AGAAAACTAT CTAAAAACGG 3720 

CAACSAGAAC AACGCCACTT GGATTGTTCC AATTTGGTGA TAAAGATAAT CAATTGTTGT 3780

TGATGGTCAA TATCAATCTG TTGATGCTTT TAAAAACATA AATATTCCAT TAACGTGGCA 3840

GGAGGACCAA GGGCATCTCA TCCCAAAGTG ACCATAAACC AAATTCAGCC ATGTCAGACG 3900

TTATCAGGCA TCACCACAGC AAATTCAAAG CGTCAGCnCC AATATATAGT GGATGCCGCA 3960 

nGAACTAGGG GTTTAGCGnT ATCAGTGGTG TGGCGACTCT ATTCTAAACG AT 4012 
(2) INFORMATION FOR SEQ ID NO: 48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7778 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 48: 

CAATATAGGT CGCCGAGTTT CAACTaCATC AACTGGTTCA GTTACATTAG ATAATGCGCT 60 

AGGTGTAGGT GGCTATCCTA AAGGACGAAT TATTGAAATT TATGGTCCTG AAAGTTCTGG 120

TAAGACAACA GTAGCGCTTC ACGCTATTGC TGAAGTACAA AGTAATGGCG GGGTGGCAGC 180 

ATTTATCGAT GCTGAACATG CTTTAGATCC AGAATATGCT CAAGCATTAG GCGTAGATAT 240

CGATAATTTA TATTTATCGC AACCGGATCA TGGTGAACAA GGTCTTGAAA TCGCCGAAGC 300

ATTTGTTAGA AGTGGTGCAG TTGATATTGT AGTTGTAGAC TCAGTTGCTG CTTTAACACC 360 

TAAAGCTGAA ATTGAAGGAG AAATGGGAGA CACTCACGTT GGTTTACAAG CTCGTTTAAT 420 

GTCACAAGCG TTACGTAAAC TTTCAGGTGC TATTTCTAAA TCAAATACAA CTGCTATTTT 480 

CATCAACCAA ATTCGTGAAA AAGTTGGTGT TATGTTCGGT AATCCAGAGA CTACACCAGG 540 

TGGACGTGCA TTAAAATTCT ATAGTTCAGT AAGACTAGAA GTACGTCGTG CAGAACAGCT 600

TAAACAAGGA CAAGAAATTG TAGGTAATAG AACTAAAATT AAAGTCGTTA AAAATAAAGT 660 

GGCACCACCA TTTAGAGTAG CTGAAGTTGA TATTATGTAT GGACAAGGTA TTTCTAAAGA 720 

GGGTGAACTT ATTGATTTAG GTGTTGAAAA CGACATCGTT GaTAAATCAG GAGCATGGTA 780

TTCTTACAAT GGCGAACGAA TGGGTCAAGG TAAGGAAAAT GTTAAAATGT ACTTGAAAGA 840 

AAATCCACAA ATTAAAGAAG AAATTGATCG TAAATTGAGA GAAAAATTAG GTATATCTGA 900 

TGGTGATGTT GAAGAAACAG AAGATGCACC AAAGTCATTA TTTGACGAAG AATAGTACAC 960

AAATTTATAT CTATAGTTAA ACTTAGCAAA TATCCTTATA GGATTGATTG AAAGTGATAT 1020 

TCATCTCATA AAGCTAGAAT AATATCTAAC TTTATGGGAT ACACTACAAA TCGAGACTAT 1080

AAGGTTTTTT ATTTTATTTA TTATTACATT ATCAATAGTT TTATAATCGA GCTTCAAAAC 1140

tttaSaaaat AGTAGAAATA GCATTCAATA TAGTGCAAAA GTGCAAATTG ATAACTTGAC 1200 

ACTTATCTCC TATAAACCGT ACAATTAATT TGTATGATTT ATATATAATT TCATAAAGTC 1260 

ATATTGAATT TCATATAAAG AGCAAACCCT AGAAAAGGAG GTGTTTGTGT GAATTTATTA 1320 

AGCCTCCTAC TCATTTTGCT GGGGATCATT CTAGGAGTTG TTGGAGGGTA TGTTGTTGCC 1380

CGAAATTTGT TGCTTCAAAA GCAATCACAA GCTAGACAAA CTGCCGAAGA TATTGTAAAT 1440

CAAGCACATA AAGAAGCTGA CAATATCAAA AAAGAGAAAT TACTTGAGGC AAAAGAAGAA 1500 

AACCAAATCC TAAGAGAACA AACTGAAGCA GAACTACGAG AAAGACGTAG CGAACTTCAA 1560 

AGACAAGAAA CCCGACTTCT TCAAAAAGAA GAAAACTTAG AGCGCAAATC TGATCTATTA 1620

GATAAAAAAG ATGAGATTTT AGAGCAAAAA GAATCAAAAA TTGAAGAAAA ACAACAACAA 1680 
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CGCATCTCCG GTCTCACTCA AGAAGAAGCT ATTAATGAGC AACTTCAAAG AGTAGAGGAA 1800 

GAACTGTCAC AAGATATTGC AGTACTTGTT AAAGAAAAAG AAAAAGAAGC TAAAGAAAAA 1860 

GTTGATAAAA CAGCAAAAGA ATTATTAGCT ACAGCAGTAC AAAGATTAGC AGCAGATCAC 1920 

ACAAGTGAAT CAACGGTATC AGTAGTTAAC TTACCTAATG ATGAGATGAA AGGTCGAATC 1980 

ATTGGACGAG AAGGACGAAA CATCCGCACA CTTGAAACTT TAACTGGCAT TGATTTAATT 2040 

ATTGATGACA CACCAGAAGC GGTTATATTA TCTGGTTTTG ATCCAATAAG AAGAGAAATT 2100 

GCTAGAACAG CACTTGTTAA CTTAGTATCT GATGGACGTA TTCATCCAGG TAGAATTGAA 2160 

GATATGGTCG AAAAAGCTAG AAAAGAAGTA GACGATATTA TTAGAGAAGC AGGTGAACAA 2220 

GCTACATTTG AAGTGAACGC ACATAATATG CATCCTGACT TAGTAAAAAT TGTAGGGCGT 2280 

TTAAACTATC GTACGAGTTA CGGTCAAAAT GTACTTAAAC ATTCAATTGA AGTTGCGCAT 2340

CTTGCTAGTA TGTTAGCTGC TGAGCTAGGC GAAGATGAGA CATTAGCGAA ACGAGCTGGA 2400

CTTTTACATG ATGTTGGTAA AGCAATTGAT CATGAAGTAG AAGGTAGTCA TGTTGAAATC 2460

GGTGTAGAAT TAGCGAAAAA ATATGGTGAA AATGAAACAG TTATTAATGC AATCCATTCT 2520 

CATCATGGTG ATGTTGAACC TACATCTATT ATATCTATCC TTGTTGCTGC TGCAGATGCA 2580

TTGTCTGCGG CTCGTCCAGG TGCAAGAAAA GAAACATTAG AGAATTATAT TCGTCGATTA 2640 

GAACGTTTAG AAACGTTATC AGAAAGTTAT GATGGTGTAG AAAAAGCATT TGCGATTCAG 2700 

GCAGGTAGAG AAATCCGAGT GATTGTATCT CCTGAAGAAA TTGATGATTT AAAATCTTAT 2760

CGATTGGCTA GAGATATTAA AAATCAGATT GAAGATGAAT TACAATATCC TGGTCATATC 2820 

AAGGTGACAG TTGTTCGAGA GACTAGAGCA GTAGAATATG CGAAATAATT TTTGTCTCCC 2880 

TCACAAATTA GTGAGGGAGC TTTTTTAAGT TGTAGTCTTA AtCTAGTTAG ACAGCACTTT 2940

ATCGGTAATA ACTATATTAA ACAGTAGTTA TTTGAAAGTA AGACGGACCT TATATTAAAT 3000

AAGAAGTTAT TGCTTTTAAT AAAAATGTTT TAGGCTTCGT AATTACTATA TTTATATTAT 3060

GTAAACCTAT AAAGATGATT GGTTTTCTAT CCAATAAAAA AGAAGAGAAG ATGTAACACA 3120

TCTTCTCTTC yGCAATATTA ATTAGGATTT ATTTCTAAGT TGAGTTATTT TAATTGTAAA 3180 

TCTGTTTTCT TTAATTCTTT TATAACTTCT GCAGTATCAT AACAATTTGT TGCAATTGTT 3240 


GAATATCTCT CTGCTAAACG ATATGCATTA ATGTAAAGCT TTAAACTTTC TTTAGCTATA 3300 

TCCTCTGCAT CTTCGAATTT TGATGGGTTA GACATAACCA CTAATTCTGC AAATTTTTCT 3360 

GGATCAATAT TAATAGACAT GTATTTATTT ACAACTCCTA TTTATTTTGA TGTCTTAATA 3420

CTAACATATT GAAGTTTTCA GACAAAGTAA TGTCTCTCTA TAATTGAAGA AAAATAATTC 3480 
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GGATGAACAA AACATGAGAA TAATGTTTAT 
AGGGGATATC GTAGGTAAAA TTGGACGAGA 3600

CGCAATTGAA ACGTACATAC CTCAACTGAA GCAAAAGTAT AAACCAACAG TTACAATTGT 3660

AAATGCTGAA AATGCAGCAC ATGGTAAAGG TTTGACTGAA AAAATATATA AACAATTACT 3720

AAGAAATGGT GTAGATTTCA TGACTATGGG TAATCACACA TATGGTCAAC GTGAAATTTA 3780

TGATTTTATA GATGAAGCAA AACGACTAGT AAGACCAGCG AATTTTCCGG ATGAAGCGCC 3840

GGGAATTGGT ATGAGATTTA TACAAATTAA TGATATTAAA CTTGCAGTTA TTAATCTGCA 3900

AGGAAGAGCG TTTATGCCAG ATATTGATGA TCCTTTTAAA AAGGCAGATC AATTAGTCAA 3960

GGAAGCACAA GAACAAACTC CGTTTATATT TGTTGATTTT CATGCAGAAA CAACTTCTGA 4020

AAAGTATGCA ATGGGATGGC ATTTAGATGG TAGAsTAGCG CTGTTGTTGG AACGCATACA 4080

CACATTCAAA CAGCAGATGA ACGTATTTTA CCAAAGGGGA CAGGGTATAT AACGGATGTT 4140

GGTATGACAG GTTTTTATGA TGGCATTTTA GGAATAAATA AAACAGAGGT AATTGAGCGT 4200

TTTATCACTA GTTTGCCACA AAGACATGTT GTTCCAAATG AAGGTAGAAG TGTATTATCT 4260

GGTGTTGTTA TTGATTTAGA CAAAGAAGGT AAAACAAAGC ACATCGAACG TATATTGATA 4320

AATGATGACC ATCCATTTTC AACATTTTAA AATTACGTAA GTAAACATTC GAATTGGACC 4380

CTATCGTCCA TTAGTATGAA TTTAATATAG TACCACTGTT TACATAGTAA ATCGGTGGTT 4440

CTTTTTGTTA TCATTTAATA TGAAATATAT CCATAGGAGG CATATAACTA TGAAACCACA 4500

ATTATCGTGG AAAGTTGGCG GTCAACAAGG CGAAGGTATT GAATCAACTG GGGAAATCTT 4560

CGCTACGGCT ATGAATAGAA AAGGATATTA TTTATATGGA TATAGACATT TTTCAAGTCG 4620

TATCAAAGGT GGACATACGA ATAATAAAAT TAGAGTTTCT ACGACGCCTG TTCATGCAAT 4680

TAGTGATGAT TTAGATATTT TGATTGCATT TGACCAAGAA ACAATTGATG TTAACCATCA 4740

TGAAATGAGA GAAGACAGTA TTATTTTArC TGATGCCAAG GCTAAACCTG TGAAaCCAGA 4800

AGGATGTCAT GCACAGCTTA TTGAATTACC TTTTACAGCA ACCGCTAAAG AATTAGGTAC 4860

AGCATTAATG AAAAACATGG TTGCAATAGG TGCTACTAGC GCATTGATGA ATTTGAATAC 4920

AAATACATTT GAAGAACTTA TTACTAATAT GTTTTCTAAA AAAGGTGACA AGGTAGTTGA 4980

AGTCAATATC CAAGCATTAA ACGAAGGTTA TCAATTAATG CAATCTCGCT TACCTGAAAT 5040

CTACGGGGAC TTTGAATTAG AGTCAACAGA TGCACTACCA CATCTATATA TGATTGGTAA 5100

CGATGCCATT GGATTAGGTG CAATTGCTGC AGGTTCACAA TTTATGGCGG CATATCCTAT 5160

TACACCTGCG TCTGAAGTTA TGGAATATAT GATTGCCAAT ATATCTAAAG TAAACGGAGC 5220

GGTTATTCAA ACAGAAGATG AAATTGCTGC TGTAACTATG GCTATTGGTG CAAATTATGG 5280
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TGGATTATCT GGTATGACTG AAACGCCATT AGTCATTATT AATACCCAAC GAGGTGGACC 5400 

TTCTACTGGA TTACCTACGA AACAAGAACA GTCAGATTTA ATGCAAATGA TTTATGGTAC 5460 


ACATGGTGAT ATTCCAAAAA TTGTTGTAGC ACCAACAGAT GCAGAAGATG CATTTTATTT 5520 

AACTATGGAA GCATTTAATT TAGCAGAACA ATATCAATGC CCTGTTATAG TTCTAAGTGA 5580

TTTGCAATTA TCTTTAGGTA AACAAACTGT TGAAAAATTA GATTATAATC GTATTGAAAT 5640

TAAACGTGGT GAAATCATTC AATCTGATAT TGAACGTGAA GAAGATGATA AAGGTTATTT 5700 

CAAGCGTTAT GCGTtAACAT CCGATGGTGT TTCTCCTAGA CCTATCCCCG GTGTTAAAGG 5760 

AGGTATTCAT CATATAACTG GTGTGGAaCa CAATGAAGAA GGTAAACCTA GTGAATCTGC 5820

GTCAAATAGA CAACAACAAA TGGAAAAACG AATGCGTAAA ATTGAGCAGT TACTAATTGA 5880 

ATCGCCAGTA GAAGCTAACT TACAACATGA GGATGCAGAT ATTCTTTATA TCGGTTTTAT 5940

TTCTACAAAA GGTGCAATTC AAGAAGGTAG TAACCGTTTG AATCAACAAG GCATAAAAGT 6000 

TAACACTATA CAAATTAGAC AATTGCATCC ATTCCCAACA AGCGTTATTC AAGATGCAGT 6060 

TAATAAAGCG AAGAAAGTCG TTGTAGTGGA GCACAATTAT CAAGGACAAT TGGCTAGTAT 6120

TATAAAAATG AATGTCAATA TTCATGATAA GATTGAAAAT TATACAAAGT ATGATGGGAC 6180 

ACCTTTCCTA CCACATGAAA TCGAAGAAAA AGGCAAAATA ATTGCTACTG AAATAAAGGA 6240 

GATGGTATAG ATGGCGACAT TTAAAGATTT TAGAAATAAT GTTAAGCCTA ACTGGTGCCC 6300

CGGATGTGGC GATTTCTCAG TACAAGCTGC AATTCAAAAA GCAGCCGCAA ATATAGGGTT 6360 

AGAACCTGAA GAAGTAGCTA TCATCACCGG TATAGGATGT TCTGGCCGTC TTTCAGGATA 6420

TATTAATTCT TATGGCGTTC ATTCTATTCA CGGACGTGCA TTACCTTTAG CTCAAGGTGT 6480

AAAAATGGCG AATAAAGATT TAACTGTTAT TGCATCGGGA GGAGATGGTG ATGGTTATGC 6540 

TATAGGTATG GGGCATACAA TCCATGCTTT AAGAAGAAAT ATGAACATGA CGTATATAGT 6600 


CATGGATAAT CAAATTTATG GTTTGACAAA GGGACAAACA TCGCCGTCAT CAGCAGTAGG 6660 

ATTTGTTACT AAAACAACGC CAAAAGGTAA TATAGAAAAA AATGTTGCGC CTTTAGAATT 6720 

AGTATTATCA TCTGGTGCCA CATTTGTAGC CCAAGGTTTT TCAAGCGATA TTAAAGGATT 6780

AACAAAACTA ATTGAAGATG cAATTAATCA TGATGGATTT TCATTCGTTA ATGTCTTTTC 6840 

ACCATGTGTG ACTTATAATA AAATTAACAC ATACGATTGG TTTaAAGAAC ATTTAACAAG 6900 


TGTTGATGAC ATTGAAAATT ATGATTCTAC AGATAAACAA TTAGCGACTA AAACTGTTAT 6960 

TGAACATGAA TCTTTAGTAA CTGGTATTGT TTATCaAGAT AAAGAAACAC CATCATATGA 7020 

ATCtCAAATT AAAGAGTTAG ATGATmCACC ACTTGCTAAA AGAGATATCa AAATTaCTGA 7080 
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TGTATTTATA ACAGATC CAT TTATGCTACT CAGTTTTTTA CTATTACAAA AAATAAAGGA 7200 

GTTTTTAAAA ATGAAAGACA CATTAATGAG TATACAAATA ATTCCTAAAA CACCAAACAA 7260 


TGACAATGTT ATACCTTACG TAGACGAGGC GATTAAAATA ATTGACGAAT CTGGTTTGCA 7320 

TTTTAGAGTA GGTCCGTTAG AAACGACAGT ACAAGGAAAT ATGAATGAAT GTTTAATTTT 7380

AATACAATCA TTAAATGAAC GAATGGTGGA ACTTGAATGT CCAAGTATTA TTAGCCAAGT 7440

TAAGTTTTAT CATGTGCCAG ATGGCATCAC TATTGAAACT TTAACTGAAA AATATGATGA 7500 

ATAACATTAA AAGTGAAGTA AACTGGATTT GAATTGGCTT GTTAGAGATG ACGTATAACT 7560 

TTAACTGTTT TTGCACTTTA TAGTTAAATT TAATATAATT ATTAAATGAT ACGGGCAAAT 7620

AGAAAGGATT TTGTAAAGTG AACGAAGAAC AAAGAAAAGC AAGTTCTGTA GATGTTTTAG 7680 

CTGAGAGAGA TAAGAAAGCA GAAAAAGATT ATAGTAAATA TTTTGAACAT GTTTATCAGC 7740 

CGCCTAATTT AAAAGCAAGC GCAAAAAAAG AGGTnAAA 7778

(2) INFORMATION FOR SEQ ID NO: 49: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1128 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 49:

AGATGAAGTT GTTACgAAAA TTGCGTACGC TGTTTCAGAA CATGTCAAAA TAGAAACAGG 60

TAATCCATTC TTTCAAACAT CACATAGTGG TTGTGCGACG GGCGGATCCT GTAATTGTTC 120

ATTATAAAAA ACATCGAGTC AGAAAAAGGT GGTTATTGAA cCACTAACTA GCATCTGACT 180

CGATGTTTTT ATTTATTCGG GATTGTTTGT TTGAATTGTT GTGCTAAATC TGGTCGATCT 240

GTCACAATCG TGTGTGCACC TTTTTGGTAT AAATCATTCA TCAGATTTAT ACTATTTACG 300

CCATAATAGC CTGGAATGAT ATTCATATCA TTTAACCATT TGATAAAACG AGATGAAGTC 360

AAATCAATGC CTTTAAAATG AGTAGGCATT TGGAACGTTT GTGCTAATGG TTGGTAGTAC 420

CTACCACCTA ATAAATGATA TTTTAAAAAT GCTTCTGTAA CTTCCTGTTG GCTAGCACCA 480

ATTGCGACGG ATCCTTGTGC AATTTTATTA AAACGAACGA TTTGTTCTTT ATAAAAACTT 540

GTCACAAGAA CGCGGTCAAA TGCTTGATTT TCTGCAATTG TATCAAACAT AATTTGTGGT 600

GCGATTGAGC CTTCATAGGA TTCAGGAGCA TCTTTTAAGT CTACGTTTAT ATACATATCA 660

GGATATTGCT TCAGCAACTc ATCGAAGGTT AGTATAGCTG TGTGTGCATG ACCACGATAT 720
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AATGTATGGG CACTAACTTT TCCAGAGCCG TTCGTCGTTC TATCAACAGT TGCGTCATGA B40 

AAAACGATAA GCTGTTGATC TTTTGTGAGT CTCACATCTG TTTCAAAGCC ATCAACGCCT 900 


AATTGTTTAG CATAGTCAAA TGCAAGTTGC GTTTGCTCTG GTCTTAAAGC CATACCACCG 960 

CGATGCGCAA ATATATATGG TGCATTGCCT TTGAAAAAAG CAGGGATGGT TTGCTTTTTA 1020 

GTAATCACTT TATTTTTATT GATCATTAAT AGACTACTTA AAAATCCAGC ACCGACTAGT 1080

ACCGCATTTA AAATGTTTCT GTTTACnTTT TTCATAAAAA ATTCCTCC 1128 
(2) INFORMATION FOR SEQ ID NO: 50: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6252 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 50: 
CAAGCAAACA ATCGTCGATA AAATTGCTAA AATAATAAAA GTAATTCGAA CTTTCATCAT 60 

GATCATCCTT TGTTTATAGA GTCAATATAA GTATGGAATA TGTTAGGTAT ATAGTCAAAT 120

GCGTCAACTA ATGGGAATTT TGGCATAGAT AGAGAATTTA AGGCAATTAA AAAGGCATCA 180

AACAGTAATA TGCTGCTTGA TGCCCAAATG ATGACTTTAG CTAAATTGAT TAGTCACTTT 240

TAAAGATAAA GAATTGTCAT GAATTAAAAC TCATGTAATG ATGTGTTACA TTTCGCAATG 300 

ATGGCTTTCA GTTATTTATC GATAACATCA CTCTTGATAC CTTTAGATTT TAAGAAATCT 360 


TTAATTTTAT CTTGTTGCTT TTTATTAACA TCACCGGCAT ATTTTGTTGG CACGTCGACA 420 

ACATTGATTT TATTTTGCGG TTGATAGCTA AGCTTTTCAA TATCTTCATC AACATTGGCG 480 

AITGTACTAT TTAAAGCTTT GAAGTAATTC ATCATTAATT CAACGGGTTT CTTATATTCT 540 

TTAGGAATAT TGTTTTCAGT GACAAATTTC TTGAAATGCA AATCGTTTTT AACAGCTAAG 600

TTAGATAAGT GGCTAAGTGT TTCTGCTTGT TTTTCAGTCA CTTTTGTTTG ACTGTCAATT 660

TGTTTATCTA GTTTATGTTG CATAATATAT TTGTTATCAA GTATATCGCT ATTTACAGAC 720

AAATACTTTT CTATAGCTTG CTTCATCTCT GCATCACTAA TATCACTATT TTTCTTATCT 780 

GAGTTAAAGA TATCTTTTGT tTCTAATTTT TTAGCGCTTT TAGGTGCATG GATGCCAGTA 840 


CTTGTATGAT GATCTTCGTT ATCAGATTGA TCGGACGCGC AACCTGTAAG AATTAATGTC 900 

GATGCTAAAA ATGTACTTAG TAGTAATCTC TTTTTCATAA TGTAATATAA CTCCTTAGTT 960 

TATCTTTAAT TGAAAAAATA TGTATTCATG TTTAATAGAG TAACATTGAA TTAGTTTGGA 1020 
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TCTATCAATA ATGCATCATT TTGGACGTTG TTAAGGATAG CTTTATCTAT AAATAACTGC 1140 

ATAATTGGTT GTACTAATTT AGACGTAGGT ATCGTACGTA AAAGCATAAT AATTTCGTTC 1200 

ACATACTTTT CTTTCTCAAT ATCATTTTTC ATATTGATTT GTTTGCGAGA GGTACATACT 1260

TTAAGCATTA TCGCACATCT CGTTGTATAT ATTAAGTTTA TCATAACATG ATTTTATGTC 1320

GGGATAAAAA AATAACAGCA TCTTAACAAA TGTAAGATAC TGTCAGTGAA ATGAATGAAA 1380

CTTTAGTTTC TGaTAATATA GTCAAAGGCA TTTAATGCTG CATTTGCACC AGCGCCCATT 1440 

GAAATGATAA TTTGTTTGTT CTTCTGATCT GTGACATCGC CAGCAGCAAA TATTCCAGGA 1500 

ACATTCGTAT TATTGTTACG ATCAATCACA ATTTCACCAC GTTCGTTTAA TTCAACAGCA 1560

TCGTTTAACC ATGATGTGTT TGGAAGTAAA CCAATTTGAA CAAAGATACC ATCTAAGTTA 1620 

AGTAGATGTT CTTCGCCGGT GTTCATGTCT TCGTAACGTA TACCTGTAAC ATGGTCTTCT 1680 

CCGACAACTT CAGTAGTTTT GGCATTTGTT TTGATATCAA CATTTGATAA AGAACGTAAA 1740

CGATCTTGTA ACACGTTGTC TGCTTTTAAT TCGCTAGCGA ATTCGAATAA TGTAACATGA 1800 

TTAACGATAC CAGCAAGGTC AATTGCTGCT TCAACCCCAG AGTTACCGCC ACCGATAACT 1860 


GCTACGTCTT TATTTTCAAA TAGAGGTCCG TCACAGTGAG GGCAGAATGC AACACCTTTA 1920 

TTAATCAATT GCTCTTCACC TGGAATGTTT AGCTTACGCC AACCTGCACC AGTAGCAATA 1980 

ATGACTGTTT TACTTTCTAA GACAGCACCG TTTTCTAACG TAACTTTAAT TGCTTCGTCA 2040

GTCTTTTCGA TATCTGTAGC ACGTATACCT GTCATTGCAT CAATGTCATA TTGATCAATG 2100 

TGCGCTGCTA AGTTAGAAGA AAATTCAGAA CCAGTTGTTT CTTTAACAGT AATGAAGTTC 2160 

TCAATACCAG CAGTATCATT AACTTGGCCA CCGATACGAT CAGCAACTAT ACCAGTACGT 2220 

AAACCTTTAC GTGCTGTGTA AATCGCTGCA CTACCACTAG CAGGACCACC ACCAACGATT 2280 

AAGACATCAT AAGGTTCTTT ATTTTCAAAC TCAGATGCAT CTGCCGTACT GCCTAGTTTC 2340 


GAAAGAATAT CTTGGATTGT CATACGACCA TTGCCAAATT CTTCGCCATT TAAAAAGACA 2400 

GCAGGGACTG CCATGATGTT TTCAGATTCT TCACGGAACA CTGCACCATC AATCATAGAA 2460 

TGCGTGATGT TAGGGTTGAT CACACTCATT AAGTTAAGTG CTTGAACGAC ATCAGGACAT 2520

TTTTGACACG TTAAACTAAT GAATGTTTCA AAATGGAATG AACCTTCTAA TTTTTTAATT 2580

TGGTCAATGA TTGACTGTTT TTCTTTAGGT GCACGACCAC TAACCTGTAA AATTGCTAAA 2640

ACAAGTGAGT TAAACTCGTG ACCTAATGGA ATACCTGCAA ATGTTACACC TGTTTCTTCG 2700

CCAGGACGAT TGACTGAGAA ACTTGGTGTA CGTTTTAAAG ATTTTTCAGA AAGAGATAGT 2760 

CTAGGTGACA TATCAGTAAT TTCTGTCAAC AAATCTTTAA GTTCTTTGGA TTTATCATCT 2820 
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TGTTGTTTTA AATCAGCATT AAGCATGGTT GTAATGCCTC CTTAGATTTT ACCTACTAAA 294 0 

TCTAAACCAG GTTGCAATGT TTTAGCGCCT TCTTCCCATT TAGCTGGGCA TACTTCGCCA 3000 


GGGTTTTTAC GAACATATTG AGCTGCTTTG ATTTTGTGAG CTAATGTACT AGCGTCACGG 3060 

CCAATTCCGT CAGCGTTAAT TTCAGATGCT TGTACAACAC CGTCTGGGTC GATAATGAAT 3120 

GTACCACGTT GAGCTAAACC AGTAGCTTCA TCTAATACAT CAAAATTACG AGTGATTGTT 3180

TGTGATGGGT CACCAATCAT AGTGTAAGTG ATTTTGCTAA TTGCATCTGA ATGGTCATGC 3240 

CATGCTTTGT GTACGAAGTG AGTATCAGTT GATACTGAGA ATACATTTAC GCCTAATTTT 3300 

TGTAATTCTT CATATTGGTT TTGTAAGTCT TCTAATTCAG TTGGACAAAC GAATGAGAAG 3360

TCAGCAGGAT AGAAGCATAC TACGCTCCAA GAACCTTTTA AATCTTCTTG TGTAACTTCT 3420 

TTAAATTGAT CTTTTTTTGG ATCGAAArCT TGCGCTGTAA ATGGTAAGAT TTCTTTGTTA 3480 


ATTAATGACA TAAATATCTT CCTCCTAAGA ATTTAAGTAT GAATTAGAAC TATCAATTGA 3540 

TTGCGCTTAA TTATAATAAT TCTAATCTCT TAGTTAGCAT TATTACATTT TGATCCAGAA 3600 

TAGTCAACTG GATAACTTTG TAAAGTGAAT GATTACTTTT AAAATAAAGA AAGATAATAT 3660 


AAAGTGCTTT GATAATGGAT TTTGTAGTTG ATGATTTAAA AGGTTGTGTC TATATTTAAT 3720 

ATCTTGATTT TAATGTAAAA AATGTAAAAA AAGAAGATTT GTATTCTCAA CTAAGTCAAC 3780 

CTTATTGATA ATGGTATGAG AATATTTGTT CGAGATGGAT GAAGGTAATG AGTGAGAAAC 3840

TGGATTTTTA AAGTATGAGA CAATATTTTA AAAAGTTCAA TTATTAACTT ATAAGCAAAT 3900

AATTGCTATA AAAAAGTTTG GACGTGTAGA ATTGCAATAT GAAGATTTTA AATTAATTGT 3960

AAAGTATCGA GGAGTGGGTA ACGTGTCAGA ACATGTATAT AATCTTGTGA AAAAGCATCA 4020 

TTCTGTTAGA AAATTTAAGA ATAAACCTTT AAGTGAAGAC GTTGTTAAGA AATTGGTAGA 4080 

AGCT35GACAA AGCGCTTCGA CGTCAAGTTT CCTGCAAGCA TACTCAATTA TTGGTATCGA 4140 

CGATGAGAAG ATTAAAGAAA ATTTACGAGA AGTTTCTGGA CAACCTTATG TTGTAGAAAA 4200

TGGCTATTTA TTCGTCTTTG TTATTGATTA TTATCGTCAT CATTTAGTTG ATCAACATGC 4260

TGAAACTGAT ATGGAAAATG CATATGGTTC AACGGAAGGT TTGCTAGTAG GTGCAATCGA 4320

TGCAGCATTA GTTGCCGAAA ATATTGCGGT AACTGCTGAA GATATGGGGT ATGGCATTGT 4380 

CTTTTTAGGA TCATTAAGAA ATGATGTTGA ACGCGTTCGA GAAATTTTAG ACTTACCTGA 4440 

CTATGTCTTC CCGGTATTTG GTATGGCAGT AGGGGAACCc GCAGATGACG AAAATGGTGC 4500

AGCCAAGCCA CGCTTACCAT TTGACCATGT CTTCCATCAT AATAAGTATC ATGCTGATAA 4560 

GGAAACACAG TATGCACAAA TGGCAGATTA CGACCAGACA ATCAGCGAGT ACTATGATCA 4620 
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10 



CAAAGCAAGA TTAGATATGT TAGAACAATT GCAAAAATCA GGCTTAATAC AGCGATAgCA 4740 

AGATACCAAA ATAACCCGCC CCCCTCTAGC TTAAAATGAT AAGTATAGCT AGAGGGGGCG 4800 

GGTATTTCTT GCAATGAATT AGTGTGAAGT TAATGCAGCA TTATCATTTG AATCGAAAGT 4860

ATCTTTATCC CAATGTTTAG TTAACTTGGC GGTACCTGTA CCAGCTAGCA TTGAATCGTT 4920

CACGTTTAAT GCTGTTCTAC CCATGTCAAT CAATGGTTCA ACGGAGATGA GCACGCCGGc 4980

TAAAGCGACT GGCAAGTTTA ACGTTGACAA CACCAATATG GATGCAAATG TAGCCCCGCC 5040

ACCGACGCCA GCAACGCCGA ATGAACTAAT AATCACGACA GCGATTAACG TTACAATAAA 5100

TTGTAAATCA ATTTCTACAT TAGCGACGGG TGCGACCATA ATTGCAAGCA TGGCAGGGTA 5160

AATGCCTGCA CAACCATTTT GTCCAATCGA CAATCCAAAT GTCGCAGCGA AATTGGCAAT 5220 

ACCTTCTGGC ACGCCTAGAC GTCTTGTTTG TGTTTGTACA TTCAATGGTA AGGCACCCGC 5280 

GCTTGAGCGT GATGTGAATG CAAAGATTAA TACTTCCAAA GTCTTTTTAA CATAGCGAAT 5340 

TGGGCTAATA CCTAACAGGC TTAAAATAAT TAAGTGAATG ATATACATCG TAATTAATGC 5400 

AGCGTACGAT GCGATTAAGA ATTTTCCTAA AGTCCAAATG GCGCCAAAGT CACTTGTCGA 5460 

TAATGTGTTG GCCATAATTG CTAATACACC GTATGGCGTT AAACGTAAGA CGAACGTCAC 5520 

AATCGCCATT ACTAGTGAAT AGATAGCGTC AATCGCACGC TTAAGCAATT CACCATGATC 5580 

AGGTTGTTTG CGTnTACGCG TAAATAAGCA AATCCTATAA ACGAAGCAAA TATCACGACA 5640 

GCAATCGTGG aAGTTGCACG TTGTCCaGTG AAATCTAAGA ATGGATTTTT AGGCAATAAT 5700 

TCCAAAATTT GTTGTGGTAA CGTATGTGCT GTTAAATCTT TCGCTTGTTT AGCAATTTCG 5760 
CTTCCACGTG CTTGTTCAGC GTTAGCAAGG TTAATTGTTG ATGCATCTAA ACCAAACACC 5820

AAGGCATACA CAACACCAAC AATCGCAGCA ATGGTGACAG TGCCAATTAA AAAGATAAAA 5880 

ATGASACTAC CAATTTTAGC AAACTTTTCT CCGATTTGAA TTTTAGTGAA TGCAGCTACA 5940 

ATAGAAATGA AAATTAAAGG CATAACAATC ATTTGCAACA ATGCAACGTA ACCTTGTCCG 6000 

ACAATGTTGA ACCAGTCACT TGTTGATGTA ATAACATTCG AATGTGTGCC ATAAATAAGA 6060 

TGCAATAACA CACCGAATAC TATACCAATC CCTAAAGCTG TAAACACACG TTTCGCAAAA 6120 

GATATATGTT TGCGAGCCAT CATGTGCAAT ATTACGATGA AAATCACCAA TACAATAATA 6180 

TTAATCAGTG TAAGAAAAGC ATTCATGAAC GTCACTCCTT AAATTTTTGA ATATAATTCC 6240 

GACTAGTATG CT 6252

(2) INFORMATION FOR SEQ ID NO: 51: 
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 6730 base pairs 
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20 






(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51: 

ATCAAATCnC AAAATATTTA TTAATnAnAA GGGGATTATC CaTGTgAGAA ACAAAGTAAT 60 

GCTCTTTTTT TACCTCTTGT GGGTTGAAAA aTGGATCATC AGAGATAGAC TTCTTCTTTT 120

TCGAAGATGA CATTTGATAC TTTAATCTTC TAAAACCATA ACTTGTCGCA TCAAAAATGC 180 

CTTCTTGTAC AAGTAAAATC AAAAATATGC TAATAAAAAT AATTAATGAA ACATAAAACA 240 

ATATATTTAA ATATGTAATG ATAGTATGGC TATTAAAAAG CCATATAATA AACGTTAATA 300 

TTGGCGTTAT TAGTGCCATT CCAAGCCATT TTTTCAACAT TTGATCACTC CCACTTATAG 360 

AAAACTCTTA CGCATAGTTT ACATTAAAAT CAGACATTGA GGAATGATTT TTTAATTTCT 420 

TCAGCTTTAT TGAAATTCTA AAATCAATCA TTCTTCATTA GTTTAAAGCA AAAAAATATT 480

GATATATAGT AAATATTGTA TATATAATAT TAGTTAAGAT TTCaGAAAAT TTTGAAGGGA 540 

ATGGAAATTT AGAAATCGGA ATTTGTTAGA GGAGGGGATT AGATGGGGAA ATATATTTTC 600 

AAACGATTTA TTTATATGCT TATTTCTTTA TTTATTATTA TTACAATTAC ATTTTTCTTA 660 

ATGAAATTAA TGCCAGGTTC GCCATTTAAC GATGCTAAAT TAAATGCTGA ACAAAAAGAA 720 

ATTTTAAATG AAAAATATGG ATTAAATGAT CCTGtAGCTA CGCAgTATTT ACATTATTTA 780

AAAAATGTTG TTACAGGCGA TTTTGGTAAT TCATTCCAGT ATCATAATCA ACCTGTGTGG 840 

GATTTGATTA AACCGAGACT ACTACCTTCT TTTGAAATGG GTCTTACAGC AATGTTCaTC 900 

GGTGTGATAC TGGGACTTAT TTTAGGTGTT GCAGCAGCTA CTAAACAAAA TTCTTGGGTT 960 

GACTATACAA CTACAGTTAT TTCAGTTATT GCAGTATCTG TACCATCTTT TGTACTTGCT 1020 

GTACfTTTAC AATATGTATT TGCAGTTAAA TTAAGATGGT TCCCAGTAGC TGGATGGGAA 1080 

GGTTTTTCGA CCGCGGTATT ACCGTCACTT GCATTATCTG CAGCTGTTTT AGCAACTGTC 1140 

GCCAGATACA TAAGAGCAGA GATGATAGAG GTATTAAGTT CAGACTATAT TTTATTAGCG 1200 

AGAGCTAAAG GTAATTCGAC AATGCGTGTA CTTTTTGGAC ATGCACTTAG AAATGCTTTA 1260

ATTCCAATTA TTACAATTAT CGTTCCCATG TTAGCAAGTA TTTTAACAGG CACTTTAACA 1320 

ATTGAAAATA TTTTTGGAGT TCCTGGATTA GGGGATCAAT TCGTACGTTC AATTACAACA 1380 

AATGATTTCT CAGTAATCAT GGCAATCACA CTATTATTTA GCACACTGTT TATCGTTTCT 1440

ATTTTTATTG TAGATATTTT GTACGGTGTG ATAGATCCAC GAATTCGTGT TCcAAGgAGG 1500 

TAAAAAATAA TGGCTGAAAA TAAAAACAAT TTGTCGATTA ACGACGATCA TTCTAATGCA 1560 

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TGAATCAGGA 


ACCTGAAATG CAACGAGAAA GCAAAAACTT TTGGCAAGAT GCTTGGGCTC 1680

AGTTAAAACG AAATAAGTTA GCTGTTGTCG GTATGATAGG TTTAATTATC ATTGTAATAT 1740

TTGCTTTTAT CGGTCCAGTT ATAAATAAAC ATGATTATGC TGAACAAAAT GTAGAACATA 1800

GAAATCTTCC GGCAAAAATA CCTGTATTAG ACAAAGTTCC ATTTTTACCT TTTGATGGTA 1860

AAGATGCAGA TGGCAAGGAT GCTTATAAAG CAGCAAATGC TAAAGAAAAT TATTGGTTTG 1920

GTACTGATCA GTTGGGTCGA GATTTATGGA CAAGAACATG GAAAGGTGCT CAAATTTCAT 1980

TGTTTATCGG TGTTGTTGCA GCGATGTTAG ATATTTTTAT TGGTGTTGTA TATGGTGCGA 2040

TTTCTGGATT CTTCGGTGGA CGTGTCGATA CGATTATGCA ACGTATACTT GAAGTCATAG 2100

CATCTATTCC GAATTTAATT GTCGTAATTT TATTTGTATT AATTTTTGAA CCATCCATTT 2160

GGACAATTAT ATTGGCTATG TCTATCACAG GCTGGTTAGG CATGAGCAGA GTTGTACGTG 2220

GAGAATTTTT AAAATTAAAA AATCAAGAGT TTGTCATGGC TTCGAAAACA TTGGGGGCTT 2280

CAAAATTCAA ATTGATATTT AAGCATATTT TACCTAATAC ATTAGGTGCT ATCGTGGTTA 2340

CATCAATGTT TACAGTACCT AGTGCTATTT TCTTCGAAGC ATTTTTAAGT TTCATTGGTA 2400

TAGGTGTACC CGCACCTCAA ACATCGTTAG GGTCATTAGT AAATGATGGG CGCGCAATGT 2460

TATTAATTTA TCCACATGAA TTATTTATAC CAGCAATGAT TTTAAGTTTA TTAATTCTAT 2520

TCTTTTACTT ATTTAGTGAT GGATTACGTG ATGCATTTGA TCCGAAAATG CGTAAATAAA 2580

AAGGGGGCAT AGCATATGAC TGAAAGAATA TTAGAAGTAA ATGATTTGCA TGTTTCCTTT 2640

GATATTACAG CAGGGGAAGT GCAGGCAGTG AGAGGCGTAG ATTTTTATTT GAACAAAGGG 2700

GAAACATTGG CAATTGTTGG TGAATCAGGT TCAGGTAAAT CTGTAACAAC AAAAGCAATT 2760

ACAAAATTAT TCCAAGGGGA CACAGGAAGA ATTAAAAAGG GAGAAATTTT ATTTTTAGGG 2820

GAAGATTTAG CAAAAAAACC TGAAAATGAG TTGATTAAAT TACGTGGCAA AGATATTTCA 2880

ATGATCTTTC AAGATCCAAT GACATCTTTA AACCCAACGA TGCAAATTGG TAAACAAGTC 2940

ATGGAACCAT TAATTAAGCA CAAAAATTAT AGTAAAGCAC AAGCTAAAAA GCGCGCATTG 3000

GAAATACTAA ATCTTGTAGG TTTACCAAAT GCAGAAAAAA GATTTAAAGC ATATCCTCAT 3060

CAATTTTCAG GTGGACAAAG GCAAAGAATT GTTATTGCAA CCGCATTAGC TTGTGAACCT 3120

AAAGTGCTCA TTGCTGATGA ACCAACGACT GCATTAGACG TAACGATGCA GGCACAAATT 3180

TTAGATTTAA TGAAAGAACT ACAACAAAAA ATCGATACAG CAATTATTTT TATAACGCAT 3240

GATTTAGGGG TTGTTGCGAA TATTGCTGAT AGAGTGGCAG TTATGTATGG TGGTCAAATG 3300

GTTGAAACAG GAGATGTTAA CGAAATATTT TATGATCCAA AGCATCCATA TACATGGGGA 3360
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GGAGCGCCAC CTGATTTATT ACACCCACCT AAAGGTGATG CATTTGCGAG ACGTAGcAAT 34 BO 

ATGCATTAGA TATTGATTTT AAAGTAGAAC CACCGTGGTT TAAAGTTTCA CCGACACATT 3540 


TTGTGAAATC TTGGTTATTA GACGCACGTG CACCAAAAGT TGAACTACCC GAGCTGGTAA 3600 

AACAACGTAT GAAACCGATG CCTAATAATT ATGAAAAACC ACTCAAGGTA GAAAGGGTGT 3660 

CGTTCAATGA AAAATGATGA AGTGCTATTA TCTATTAAAA ATTTAAAGCA ATATTTTAAC 3720

GCAGGAAAGA AAAACGAAGT GgaGCGATTG AAAATATTTC GTTTGATATA TACAAAGGGG 3780

AAACATTAGG TTTAGTAGGA GAATCGGGGT GTGGTAAATC TACAACTGGT AAATCAATTA 3840

TTAAACTTAA TGATATTACA AGTGGAGAAA TTTTGTATGA GGGTATTGAT ATACAAAAGA 3900

TTCGTAAACG TAAAGATTTG CTTAAATTTA ATAAAAAGAT ACAGATGATT TTTCAAGACC 3960 

CATATGCGTC TTTAAATCCT AGGTTAAAAG TAATGGATAT AGTAGCTGAA GGTATTGATA 4020 


TCCATCATTT AGCAACTGaT AAGCGTGACC GAAAAAAACG TGTCTATGaT TTACTTGaAA 4080 

CTGTTGGATT AAGTAAAGAA CATGCCAATC GCTATCCTCA TGAATTTTCA GGTGGaCAAC 4140 

GCCAACGTAT TGGaATTGCC CGTGcATTAG CCGTTGaACC AGAATTCATT ATCGCGGACG 4200 


AACCAATATC GGCATTGGAT GTTTCAATCC AAGCTCAAGT AGTTAATTTA TTATTAAAAT 4260 

TACAACGTGA AAGAGGGATT ACGTTCCTAT TTATAGCTCA TGATCTATCA ATGGTGAAGT 4320 

ATATTTCAGA TCGTATTGCA GTCATGCATT TTGGGAAAAT AGTTGAAATT GGACCGGCAG 4380

AAGAAATTTA TCAAAATCCA TTACACGATT ATACTAAGTC TTTATTATCA GCCATTCCAC 4440 

AACCTGATCC TGAATCAGAA CGCAGTCGCA AACGATTTAG TTATATTGAT GATGAAGCAA 4500 

ATAATCATTT AAGACAATTA CATGAAATTA GACCGAATCA CTTTGTCTTT AGTACTGAAG 4560

AAGAAGCGGC ACAACTACGA GAAAATAAAT TGGTGACACA AAATTAAGGG GAAGGGGGAA 4620 

ATGCAATGAC GAGAAAATTT AGAACACTTA TTTTAATTTT GATTGCTACA ATTGCATTAA 4680 


GTGGTTGTGC TAATGACGAT GGTATTTATT CAGATAAAGG TCAAGTATTC AGAAAAATTT 4740 

TGTCATCAGA CTTAACATCC CTTGATACAT CATTAATAAC GGATGAAATA TCTTCTGAAG 4800 

TGAcTGCGCA AACATTCGAA GGTTTATACA CATTAGGAAA AGGTGACAAA CCGGTGTTAG 4860

GTGTTGCGAA AGCTTTTCCT GAAAAGAGTA AAGATGGTAA AACTTTAAAG GTTAAATTAA 4920 

GAAGCGATGC TAAATGGAGC AATGGTGACA AAGTGACTGC ACAAGACTTT GTTTATGCTT 4980 

GGAGAAAAAC AGTTGACCCT AAAACAGGTT CTGAATTTGC ATACATTATG GGGGACATTA 5040

AAAATGCGAG TGATATTAGT ACTGGTAAGA AACCTGTAGA GCAATTAGGT ATCAAAGCAT 5100 

TAAATGATGA AACATTACAA ATTGAATTAG AAAAGCCGGT TCCATATATT AATCAATTAT 5160 
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w 






ACGGTACGGC AGCTGATAGA GCGGTATACA ATGGTCCaTT TAAAGTTGAT GATTGGAAAC 5280 

AAGAAGATAA AACCTTACTA TCTAAAAATC AGTATTATTG GGATAAAAAG AATGTAAAAT 5340 

TAGATAAAGT GAATTATAAA GTTATTAAAG ACTTACAAGC CGGTGCATCA TTGTATGATA 5400 

CTGAATCAGT AGATGACGCA TTTATTACTG CAGATCAAGT AAATAAATAT AAAGACAACA 5460

AAGGATTAAA CTTTGTGTTA ACGACTGGGA CATTTTTTGT AAAAATGAAT GAAAAACAAT 5520 

ATCCTGATTT TAAAAACAAA AATTTAAGAT TGsTATCGCA CAAGCAATAG ATAAAAAAGG 5580 

ATACGTTGAT TCAGTGAAAA ACAATGGCTC AATTCCTTCC GATACACTAA CAGCCAAAGG 5640 

AATTGCGAAA GCGCCTAATG GCAAAGATTA TGCGAGTACC ATGAATTCGC CTTTAAAATA 5700

TAATCCTAAA GAAGCAAGAG CACACTGGGA CAAAGCTAAA AAAGAGTTAG GTAAAAATGA 5760 

AGTGACATTT TCAATGAACA CAGAAGATAC ACCAGATGCA AAAATATCTG CTGAATATAT 5820 

CAAATCGCAA GTTGAGAAAA ATTTACCAGG AGTTACTTTG AAAATTAAGC AATTACCGTT 5880 

TAAACAAAGA GTATCACTAG AACTGAGTAA CAATTTTGAA GCATCACTTA GTGGTTGGTC 5940 

TGCAGATTAC CCTGATCCTA TGGCTTATTT AGAAACAATG ACCACAGGTA GCGCACAAAA 6000 

TAATACAGAC TGGGGTAATA AAGAATATGA TCAATTACTT AAAGTAGCAA GAACCAAATT 6060 
GGCACTTCAA CCGAACGAAC GATATGAAAA CTTGAAAAAA GCAGAAGAAA TGTTCCTAGG 6120

AGATGCACCG GTAGCACCAA TTTATCAAAA AGGTGTtGCA CATTTaACAA aTCCTCAAGT 6180

AAAAGGATTA ATTtACCATA AATTTGGTCC AAATAACTCA CTTAAACATG TATATATTGA 6240 

TAAATCGATA GATAAAGAAA CAGGTAAGAA GAAAAAATAA TATGCTTTGT AAATTAGGCT 6300 

GGAGACATAT CTCCAGTCTT TTTGTGTTGG ATAAAAaCTT TGGGAATAAA AATTTAAAAT 6360 

AAGTCGTTTT TTAAATTACT GAAATTGATT AAATGCATAA ATAACTGAAT ATTCTAAAAA 6420 

TAAACTTGTA ATAATTTTTT CTATGAGTAA ACTAAAAAGA AAAAATTAGA TTGAAAGTAG 6480

GAGGCATATG TATGGGGAAG CTAATTAAAT ATATTTCAAT ACTTCTTATT GTCGTTTTAG 6540 

TGTTGAGTGC TTGCGGAAAA AGCAGTAATA AAGATGAAGG AGTAAAAGAT GCTACTAAAA 6600 

CGGAAACCTC AAAACATAAA GGTGGTACCT TAAATGTAGC ATTAACAGCA CCGCCAAGTG 6660 

GTGTTTATTC TTCGTTATTA AATAGTACAC ATGCAGATTC TGTAGTTGAG GGATATTTTA 6720 

ACGAAAGCTT 6730 
(2) INFORMATION FOR SEQ ID NO: 52:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6482 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 52: 

AATTTTTGTC ATTATTAAAA ACCTCGCTTT TAAAAGATTG AAAAGTAAAT GAGTGAAATT 60 


AAAGATTATG CACATTAAAA TCACGCCACA ATTTAATTGT GAAAAATATC ACAAATATAT 120 

TATAACACTA AATTTCCCAA AATTCAAAAG TGTGTTTTAT TGCAGAAAAC TTATAACAyG 180 

TGCACAAGTT ATAGTGAATT GCAAACGGAT TACTTTAGTC TTTTTAAAAC ATGAAGTATA 240

ATTTGTATAG CAATAAATAT AAAAATGGGA GGCTATGTTC AATGAGCAAT ATGAATCAAA 300 

CAATTATGGA TGCATTTCAT TTCAGACATG CGACTAAGCA ATTCGATCCA CAAAAGAAAG 360 

TTTCGAAAGA AGATTTTGAA ACAATATTAG AGTCAGGTAG ATTGTCTCCA AGTTCTCTTG 420

GGTTAGAACC TTGGAAGTTT GTCGTGATTC AAGATCAAGC GTTACGTGAT GAATTAAAAG 480

CGCACAGTTG GGGCGCAGCA AAACAATTAG ATACAGCGAG CCATTTTGTG CTAATTTTTG 540


CGCGTAAAAA TGTAACGTCA AGATCACCGT ATGTACAACA TATGTTAAGA GATATTAAAA 600 

AATATGAGGC ACAAACGATT CCAGCTGTTG AACAAAAATT CGATGCATTC CAAGCAGATT 660 

TCCATATTTC TGATAATGAT CAAGCCTTGT ATGACTGGTC AAGTAAACAA ACGTATATTG 720

CATTAGGCAA TATGATGACG ACAGCCGCAT TGTTAGGTAT TGATTCATGT CCGATGGAAG 780 

GTTTTAGTCT GGATACAGTG ACAGACATTT TAGCAAATAA AGGGATCTTA GATACTGAGC 840 

AATTTGGTTT ATCAGTGATG GTCGCATTTG GCTACAGACA ACAAGAGCCA CCGAAAAATA 900

AAACACGCCA AGCTTATGAA GATGTTATTG AATGGGTTGG ACCAAAAGAA TAAATAGAAT 960 

ACCGTATGTC TAAATATATA AAATTAAAAA GTTAGCAATA AAAAAGCCTG CGATTACATA 1020 

AATGAATCGC AGGcTTTTGC GTGAAAAAAT TGTATTAATA AAGTATGGAT GATTATTTTT 1080 

CTGGfiACAAG GTCAGTATTT GAATGAACTG TGATGTCAAA CCCTTCTGGT GCCGTAAATG 1140 

TATGTGTTGA GGCGTCGGGT TGATAAATAT CAACATGTGT TAATCCATAA CTTTGTGAAT 1200

TGTTTTGTCT TGCTTGATTG GATTGCCAAG TATTAGCAGC AATATGATGG TGATAATGAT 1260 

TCGTTGACAT AAATAGCGCA CGTGGAAAAT CAGACACATG TTGGAATCCT AATTGTTCAA 1320 

TGTAACATTG ATATGCTGCG TCTAAATCAT GTGTTTTTAA ATGTAAGTGT CCAATCATGC 1380

CTTTTGCTGG CATTCCTTGC CAACCTTCAT CAGTACGATG TGTTAATAAG GTTTGGCTAT 1440

CAACTTCTAA AGTATCCATT TTAACTTTGC CATTTTGCCA TTCCCATGAA GATGAAGGTC 1500

TATCGCGATA GACTTCAATA CCATTACCTT CGGGGTCGTT GAAATATAAA GCTTCACTTA 1560

CTAAATGATC ACCAGCGCCG ATGCCCATAT TTTTTTGTGC CACGAAATAT AAGAAGTTAG 1620
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aAGTCTGACG GcCGTCTTCT AATAAATGTA ACGTTAGAGT ATGGcCACCA GTCCCAACAG 1740 

ATAATACGGT TGTATTATCG TCAGAACTTT TAACGGATAG TCCTAAAATG TTTTTGTAAA 1800 

ATGTTGTCAT TAAGTCTAAG TCTCTTACGT TCAGTACAAT GTTTGTCACT TGTGTTGCTG 1860

TTTTATCGTG AAATGCCATT ATGCATCGCC TCTTTTTCTA TTTTTCTATA AGTTAGTATA 1920

AAAAGTATAC CAGAAAAGAA AATGAATTGA TAGCATAAAG TTTGAAATGC AAAATAACTA 1980 


GTCGTTTTGC AATTTTAtAT TGATGCGAAC AAAAAAGCGA TGGTACAGTT GCACCATCGC 2040 

AAAATTTATT TAACCAAGAT ATACATCTTG ATATGAATCT TCTTTTTCTA ACATATGTTT 2100 

GGCAAATGAA CATGAGGCAA TAATTTTCAA ATTATTTTCT CGAGCGTGTT CAACAACTGC 2160

TTTAAGTAGT TTTTTGCCAA CACCTTGACC ACCAAGTTCA TCAGATACGC CTGTATGATC 2220

AATGTTAATT TCATTATTAT CCACAAAACG GTATGTGATT TCAGCTAAAG CATTATTTTC 2280

ATCATCACCA ATATAGAATT TGTTCTCGCC TTGTTTGATT TCAAGGTTAC TCATACATAT 2340

CAACTCCTAT CATGATTGAT TATAGTATTT CCCTATTCTA TTTTAACTTA AACGAAGTCA 2400 

AAGGTGCATG ACAGTCATGT GACGACATTG CCACATCTAT GTAGTCGTTT TTATTAAGCA 2460 


CAGTTTGAAA TGAAGATGAA AACACGTATC TTGACATTAA ATCTATTCAG CTATATAATT 2520 

TATCTCGAAA TCGAAATAAA ATAAAAAAGT TGGTGATCAT ATGGATCGAA CGAAACAATC 2580 

TCTCAATGTT TTTGTCGGAA TGAATAGGGC GTTAGACACA TTAGAGCAAA TTACAAAAGA 2640 


AGACGTAAAG CGATATGGCT TAAATATTAC TGAATTTGCA GTGCTCGAGT TGCTTTATAA 2700 

TAAAGGTCCG CAACCAATTC AACGTATTAG AGACCGCGTA TTAATTGCAA GTAGCAGCAT 2760 

TTCATATGTT GTAAGTCAAT TAGAGGACAA AGGTTGGATT ACACGTGAAA AGGATAAAGA 2820

TGATAAACGT GTATATATGG CTTGTTTAAC TGAAAAAGGT CAAAGTCAAA TGGCAGATAT 2880 

TTTCCCTAAG CATGCTGAGA CATTAACAAA AGCGTTTGAT GTGTTAACAA AGGATGAATT 2940 

AACAATCTTA CAACAAGCGT TTAAGAAACT AAGTGCACAA TCTACAGAAG TGTAAGGCGT 3000

GCACTAAAAA TTTACATTAA AGTATCTCGA TTTCGAGATA AATGCACTAA AAATATAAAG 3060

AGGGTATATA AAATGATAAA TAATCATGAA TTACTAGGTA TTCACCATGT TACTGCAATG 3120 

ACAGATGATG CAGAACGTAA TTATAAATTT TTTACAGAAG TACTAGGCAT GCGTTTAGTT 3180

AAAAAGACAG TCAATCAAGA TGATATTTAT ACGTATCATA CTTTTTTTGC AGATGATGTA 3240

GGTTCGGCAG GTACAGACAT GACGTTCTTT GATTTTCCAA ATATTACAAA AGGGCAGGCA 3300

GGAACAAATT CCATTACAAG ACCGTCTTTT AGAGTGCCTA ACGATGACGC ATTAACATAT 3360

TATGAACAGC GCTTTGATGA GTTTGGTGTT AAACACGAAG GTATTCAAGA ATTATTTGGT 3420 
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5 



TTAAATGAAG GGGTAGCACC TGGTGTACCT TGGAAGAATG GACCGGTTCC AGTAGATAAA 3540

GCGATTTATG GATTAGGCCC CATTGAAATT AAAGTAAGTT ATTTTGACGA CTTTAAAAAT 3600

ATTTTAGAGA CTGTTTACGG TATGACAACT ATTGCGCATG AAGATAATGT CGCATTACTT 3660

GAAGTTGGCG AAGGAGGCAA TGGTGGCCAG GTAATCTTAA TAAAAGATGA TAAAGGGCCa 3720

GCaGCACGTC AAGGTTATGG tGAGGTACAT CATGTGTCAT TTCGTGTGAA AGATCATGAT 3780

GCAATAGAAG CGTGGGCAAC GAAATATAAA GAGGTAGGTA TTAATAACTC AGGCATCGTT 3840

AATCGTTTCT ATTTTGAAGC ATTATATGCA CGTGTGGGGC ATATTTTAAT AGAAATTTCA 3900

ACAGATGGAC CAGGATTTAT GGAAGATGAA CCTTATGAAA CATTAGGCGA AGGGTTATCC 3960

TTACCACCAT TTTTAGAAAA TAAAAGAGAA TATATTGAAT CGGAAGTTAG ACCTTTTAAT 4020

ACGAAGCGTC AACATGGTTA ATTGGAATGA GGAGGATTTG TGATGGAACA TATTTTTAGA 4080

GAAGGACAAA ATGGTGCGCC AACACTAATA TTATTGCATG GTACAGGTGG TGATGAGTTC 4140

GATTTATTAC CGTTAGGCGA AgcATTGAAT GAAAATTATC ACTTGTTAAG TATTAGAGGA 4200

CAAGTTTCAG AAAATGGGAT GAACCGTTAT TTCAAACGTC TTGGTGAAGG TGTTTATGAT 4260

GAAGAAGATT TGGCATTTCG TGGACAAGAA TTGTTGACGT TCATTAAAGA AGCTGCTGaA 4320

CGTTATGATT TTGaTATTGA AAAAGCAGTA CTTGTTGGAT TTTCAAATGG ATCAAATATA 4380

GCGATTAACT TAATGTTGCG TTCAGAAGCA CCATTTAAAA AAGCATTGTT ATATGCACCG 4440

TTATACCCAG TTGAAGTAAC GTCAACAAAG GATTTATCAG ATGTCAGTGT GTTGCTTTCT 4500

ATGGGGAAAC ATGATCCAAT TGTGCCATTA GCTGCAAGTG AACAAGTCAT TAACTTGTTT 4560

AATACACGTG GGGCACAAGT CGAAGAAGTT TGGGTGAAGG GCCATGAAAT TACAGAAACT 4620

GGATTAACGG CTGGTCAACA AATACTTGGG AAATAACAGT TCTATTAAGA AGCGGACAGA 4680

TGGAAAAGAT TTTTACTTTT CATCTGCCCG CTmTTGAT TTTGAAGTGC TGTACTAAAT 4740

TTTACAATAG TATAGATATT TTAATCGATA TGAGATTTGC CGGTAATACG CTTAATTAAA 4800

CCTTTATAGA GTACAGGTAT GAGTAAGATG AAACCGAACA ATCCCATAAT AGGGAATACT 4860

TTTCCAATTA ATGAAATGAa ACCGATAAAT GTACTAATAT AAGTGATGAC AGCCATTGTA 4920

AT2VATAATGA TGAAGTAACG TCTGCTGAAT GGAACGCTGA AACGTGACGC AAATGCATAC 4980

ATTAATCCAA CAACAGTATT GTAGATGACA AGTATCATAA TGACAGACAT AATAATACCA 5040

ATTGACGGAG ACATTTGTGT CGCTAATTTT AATGTAGGTA GATCTACGTG TTTAATTTTA 5100

TCGAATTGAG AAATTAAACC TAGATTAATC ATCATGAGTA AAAATGTAAT GATTAAACCG 5160

CCAATCAAGC CCCCGTATAA CGTTGAGTCA CGATATTTAA CTTTACTACC CATCACTGAT 5220
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5 



CCAGGTGATA ATGATTTCTG CTTATGAATC TGAGCATCAT TATTAGCGGC AGTAAAATCA 5340

AGATGACTTG TTGTGAAATA GTAGACCGCA ATCATAATGA CAATCGCAAT TAAAAATGGG 5400

GTAACACCGC CAAGCACAGC AATTAAACGA TCGAATTTTA GAAACAGTGT TGCTAAAATA 5460

AAGGCGACTA ATATGAGTGC GCTCAGCCAA TACGGTAAGT TGAAACTTTG ATGAATGGTT 5520

GACGCACCAC CTGCAGTCAT AATAATAGCT AAAGACAACA TAAACATTGT TAAAATAATA 5580

TCAAAACCTC TTGCAATAGA GGGGTATAAG AAATAGTTAA TTGAATCAGA ATGATTTCTG 5640

GACTTTAGAT GATGACCTGT ATGCATGACA ACCATTCCAC CTAAAGTAAT CAATAGTCCT 5700

GTTACAATAA TGCCTGAAAT GCTATATGCG CCATGACTTG TGAAAAACTG GAAAATTTCT 5760

TGACCAGTAG CAAAGCCGGC ACCAACGACA ACACCAACAA AGGCAAATGC CACAATAATG 5820

GACTCTTTTA AGATACGCAT GATTTAAAAA TGTCCCTTCG TAATTTTAAG TAATATAGAA 5880

AATGTAACAT ACATGTTAAT GAAAAATATA GTACTAATAT AGTATTTTGT TAAATTGGAG 5940

TAGAAGCGAG GGTGTCGGTC ATTTCATTAA TTTATTAGTT GATTTTGCAT TTTTTTGCTG 6000

TAAAGTTGTT ATAATACAGT TAACAGGAAT TAGCATAGAT ACACCAATCC CCTCACTACT 6060

CGCAATAGTG AGGGGATTTT TTTCGGTGTA GCTAGGTCGC CTATTTATCA TCGTGTTTGC 6120

GTAgCaATGC GTAAACACAG TACCACTAAA TAAGTGCACG ATACATGCAT CAAATGTCGT 6180

CTTTAGTCTA AGTAACGATC ATGCATTAAC ATTTTCAAAA TATCTATTTG AGCTTGAAGA 6240

TCTTTACCAA TATTGGTATC ACGAATCTTC TTACGTTGTA ATTCTTTATC TACGACGCGC 6300

TTTATAGAAA GTTCATCGAT ACCTTCGGAA AGTATTTTTn CTTTAGCGTT AAATTGTTGG 6360

TGTGCAACGA GTTGCATACC GAATGAATTA TACAATAGTG TATAGCCTGC AATGCCAGTn 6420

GTTGACTGAT AAGCTTTTGA AAAGCCACCA TCAATGACAA GCATCTTTCC ATCAGCCTTG 6480

AT 6482



(2) INFORMATION FOR SEQ ID NO: 53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16592 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 53: 

ATTTAAGGCG ATTGCTTGTG TATTTCTCTC TTTTGTAGGC AAACCTGCAC TCGTTCCAAA 60

AAATGTAACT TCCATATATG CCCCTCCTTT TCTTCAATTC ATTTTATCAT AAAATTTGTA 120 
AATTTTTCTA ACTTTAACGT AGACATAACT ATATAAATTT TGATAATTAC GTTATACTTA 240

TCATTAATAA GTATCACATT AAACATGATA CATGAATCGA TATTTCATTT AAGACACTGC 300 

ATACAGTCGA GCATATTGTA TGACCTACTG AATGGATTAT CTTATAATAA TAAATCATAT 360 

ATCTAATTAA GAATTGAGGT TTTAATCTTG AGTACTAAAA ACAAACACAT CCCATGTTTA 420 

ATCACAATCT TTGGTGCACT GCGTGACTTA AGCCATCGTA AGTnGTTTCC ATCAATATTC 480

CATCTCTACC AACAAGACAA TTTAGATGAA CATATTGCCA TcATCgGTAT TGGACGTCGT 540 

GACATkwnTA ATGATGATTT CCGTAATCAA GTAAAATCAT CAATTCAAAA GCACGTAAAA 600 

GATACAAACA AAATTGACGC GTTTATGGAA CATGTCTTCT ATCATAGACA TGATGTTAGT 660

AATGAAGAAA GCTATCAAGA ATTACTAGAT TTTAGTAATG AATTAGATAG CCAATTTGAA 720 

TTAAAAGGTA ATCGACTATT CTATTTAGCA ATGGCACCAC AATTCTTTGG CGTTATTTCT 780 

GATTATCTAA AATCTTCTGG TCTTACTGAT ACAAAAGGAT TTAAACGCCT TGTTATCGAA 840

AAACCATTCG GTAGTGATTT AAAATCAGCC GAAGCATTAA ACAATCAAAT TCGTAAATCA 900 

TTTAAAGAAG AAGAAATTTA TCGTATTGAC CACTATTTAG GAAAAGACAT GGTTCAAAAT 960 

ATCGAGGTAT TACGTTTTGC GAATGCGATG TTTGAACCAT TATGGAATAA CAAATATATT 1020 

TCAAACATCC AAGTTACATC TTCTGAAATA CTAGGTGTTG AAGATCGTGG TGGTTATTAT 1080 

GAATCAAGTG GCGCGCTAAA AGATATGGTG CAAAACCACA TGTTACAAAT GGTTGcATTA 1140 

TTAGCTATGG AAGCACCTAT TAGTTTAAAT AGTGAAGATA TCCGTGCTGA GAAAGTAAAA 1200 

GTACTTAAAT CACTGCGTCA TTTCCAATCT GAAGATGTTA AAAAGAACTT TGTTCGTGGT 1260 

CAATATGGCG AAGGCTATAT CGATGGTAAA CAAGTTAAAG CATACCGTGA TGAAGATCGC 1320

GTTGCAGATG ACTCTAACAC ACCTACCTTT GTTTCAGGTA AATTAACAAT TGATAACTTT 1380 

AGATGGGCTG GTGTACCATT CTATATTCGT ACTGGTAAAC GTATGAAATC TAAAACAATT 1440 

CAAGTTGTCG TTGAATTTAA AGAAGTACCA ATGAACTTAT ACTATGgAAA CTGaTAAACT 1500

GTTAGATTCA AACCTATTAG TAATCAATAT CCAACCTAAT GAAGGTGgTA TCTTTtACAT 1560 

CtAAATGcTA AGaAAAATAC ACAAGGTATC gAAACAGrAC CTGtCCraATT GtCTTACTCm 1620 

ATGaGCGcTC aAGaTAAAAT GaATACTGTA GATGCATATG AAAATCTATT ATTTGATTGT 1680 

CTTAAAGGTG ATGCCACTAA CTTCACGCAC TGGGAAGAAT TAAaATCAAC ATGGAAATTT 1740 

GTTGATGCAA TTCAAGATGA ATGGAATATG GTTGaTCCAG AATTCCCTAA CTATGAATCA 1800 

GGTACTAATG GTCCATTAGA AAGTGATTTA CTACTTGCTC GTGATGGTAA CCATTGGTGG 1860 

GGACGATATT CAATAATTGA ATTAAAACGC ACATGTTAAA CAAAAATAAA TGAGCGAATG 1920 
TATATTATGA AATTATATTT TACAATGCCC AAAACTATTT TAATAATCAT TGAACAAATG 2040

GGTGTATAAT TTATAGAAAT AATGTAGAAT AAAAATAAAT GATTGAATTA ATTGGAGTGA 2100

AAGTTTTGGA CGTTATCAAG CAAATACAAC AGGCAATTGT TTATATTGAA GATCGTTTAT 2160

TAGAGCCTTT CAATTTGCAA GAATTAAGTG ATTACGTTGG TCTTTCGCCA TACCATCTTG 2220

ATCAATCATT TAAAATGATT GTCGGCTTAT CTCCAGAAGC TTATGCACGC GCGCGTAAAA 2280

TGACACTCGC TGCAAATGAT GTGATTAATG GTGCTACACG ACTTGTAGAT ATCGCTAAAA 2340

AATATCACTA TGCAAATTCA AATGATTTTG CAAATGATTT TAGTGATTTT CACGGCGTAT 2400

CACCTATTCA AGCCTCTACT AAAAAAGATG AATTACAAA7 TCAAGAGCGA TTATATATCA 2460

AATTATCAAC TACTGAGAGA GCACCTTATC CATACAGATT AGAAGAGACA GATGATATTT 2520

CATTGGTTGG ATATGCACGA TTTATAGACA CTAAGTATTT GTCACATCCT TTTAATGTTC 2580

CGGATTTTTT AGAAGACTTG CTCATTGATG GTAAAATTAA AGAGTTACGA CGATATAATG 2640

ACGTTAGTCC ATTTGAACTA TTTGTTATTA GTTGTCCTCT TGAAAATGGT TTAGAAATAT 2700

TTGTAGGTGT ACCAAGTGAA CGTTATCCTG CACACTTAGA AAGTCGATTT TTACCTGGCA 2760

AACATTGTGC GAAATTCAAT TTACAAGGTG AAATTGATTA TGCAACTAAT GAAGCTTGGT 2820

ACTATATTGA ATCAAGTTTG CAGTTAACAT TGCCATATGA ACGAAATGAT TTATATGTTG 2880

AAGTGTACCC TCTCGATATT TCATTTAATG ACCCATTCAC TAAAATTCAG CTTTGGATTC 2940

CTGTTAAACA GAGTCCTTAT GACGAAGATT AAATAATAAA AAACAAAGAA GCCCCCTAAT 3000

ATATCTATAG GTCTACAAAT GGCCTTAGAT TCTATTAGGG GGCATATTAA TATGTTAATT 3060

TAGTTCGATA ACACATGCTT CATATGGACG TAACTGTTTT AAATTAACTT TGGCATCATA 3120

ATTAAATAGC TTTACTTCTC CATGGCTTAA ATCAAATGGT ACAGTTAATT CTGCTTCGTG 3180

GTTAGTAAGA TTACCTACAA TAAGAACTTG CTTTTCATTT AATGTTCTCG TGTACGCAAA 3240

AACTTGTGAA TTTTCAGCAT CTACTAAATC AAATTGACCA TATACGTATA CATCATTAGA 3300

CTTTCTTAAT TGAATTAAAT CTTTATAAAA TTGTAATACT GAATGCTCAT CTTCTAATTG 3360

TTGTGCAACA TTGATAGTTT TATAATTCGG ATTCACTGGG AACCACGGTT CACCATTTGT 3420

AAATCCTCCA TTTAACGTAT CATCCCATTG CATTGGTGTG CGAGAATTAT CTCGGTTCTC 3480

ATCTTTATAT TTCGCAAGTA AAGCGTCTAC ATCTCCACCT TGAGCTTTCA CTATTTGATA 3540

GTCATTTTTA ACAGCAACAT CGTTAAACGT TTCAATACTT TCAAATGGAT AATTCGTCAT 3600

ACCAATTTCT TGACCTTGAT AAATGAATGG CGTACCTTGT TGCAAGAAAT AAACAGCTGC 3660

ATGACTTGTT GCTGATTCAT ACCAATACTT GTCATCGTCA CCCCACGTCG ATACACGTCG 3720



CCATCTATTT AATACAGATT TATACGAATT TACATCAAAG TGAGAATCAC GACTATTCCA 3840

CAGTCCCAAA TGTTCAAATT GGAATATCAT ATTAAATTTA CCATTTTCTT CCCCGACCCA 3900

GTCATCAGCA TCATCAGGGC TTACACCATT CGCTTCACCA ACAGTCATAA TGTCATACTT 3960

ACTTAATGAG CGATCTTTCA TCTCTTGTAA CCAAGTTTGT ATACCTGGCT GATTCATATC 4020

TACATCAAAT GCTGGGGCAT ATGTTTTACC CTCAGGTACA GGTAAGTCAC CCGCTTCAAA 4080

CGTCTTCTTA ATATGCGTAA TTGCATCTAC TCTAAATCCA TCAATGCCTT TATCAAACCA 4140

CCAGTTCATC ATTTCAAATA CAGCATCTCT AACTTCCGGA TTACCCCAAT TCAAATCAGG 4200

TTGTTTTTTA CTGAATAAAT GGAAATAATA TTGCTCAGTA TTAGCATCAT ATTCCCATGT 4260

AGATCCATTA AATATACTTT CCCAGTTGTT AGGTTCAGAG CCATCTGGCT TTGGATCTTG 4320

CCAAATGTAC CAATCACGTT TGGGATTGTC TTTACTAGAT TTGGATTCTA TAAACCAAGG 4380

ATGTTCATCA GATGTATGAT TTACAACTAA ATCTAAAATA AGCTTCATGC CTCTATCATG 4440

AACACCTTTT AATAAACGAT CAAAGTCTTC CATCGTTCCA AATTCATCCA TAATCTCTTG 4500

GTAGTCACTA ATATCATAAC CATTGTCATC ATTAGGTGAT TTAAACATTG GACTGAGCCA 4560

AATGACATCG ATACCGAAAT CTTTTAAGTA GTCCAATTTA TCAATCATTC CAGGTAAATC 4620

CCCAATACCA TCGTGATTAC TATCATTAAA ACTTCTTGGA TATACTTGAT ATGCTACTGC 4680

TTCTTTCCAC CATTGCTTAT TCATTTTAAA ACTCCTTTGC TATCGCTGTG TTGATTTTCT 4740

TATTTTTAAT TCTGTATCTA TAATGACGAG TTCAATAACA TCCTGTGCTT TGTTTTTCAA 4800

TATATTTAAA ATTGCTGCAC CAGCCTGTTG ACCTAACATT CGAGGCTTGA TGTCAATACA 4860

GGTTTGTGGT GGTGACGCAA TTTCGGTTAA ATAAGAATCA TTGAACGTTG CTGTCATTAC 4920

ATCTTTCGGA ATTTCAATAT TAAGTTCATA TAGGACACTT AAAATCGCTA AATGTAACAT 4980

AGCATCTAAC GAAATGATTG CCTGTTTAAT ATTTGGGTCC TTCAAACGCG TATGTAGATT 5040

TTGCATGTAA TTAAAAATAA CTTCTCTTTC ATTACTAGTC TCAATAATTT GATAATTAAT 5100

TTTATTTTGA GAAGCTATCG TTTCAAATCC TTGAATTCTA TCTTTTGAAA CTTCAAAATT 5160

TCCTTTTTCT GTAATAAATA TTAATTCATC TACACCTTGT TCAATAACAT GTCGTGTCAA 5220

ATTTTCAGAA GCTAATATAT TATCATTATC TATATGTGTA AATTGATGAT CTATATCCGA 5280

TGTAGGCTTA CCAATCACAA TAAATGGCAT GCTTTCATCA ATTAACATTT GTTTAATCGG 5340

ATCATTTTCT TTTGAATAGA GCAGTATAAA CGCATCAACC ATTCGTTGTT TAATCATTTT 5400

ATAAACTTCA TCCATTAAAT CATTCATATT ATTTGAGACT GTCGTTTGTG TACCATAGCC 5460

ATGCTGGTTA CACGTTTCAG AAATTCCTAG CAATACATTG ATGTAGAATG GATTCAGTCG 5520
AGTTCTAGCA GCGGTATTAG GAAAATAATT CAATTCTTCC ATAACTTTCT TCACTTTTGA 5640

AATTGTCGCT TCGCTAATAC GTTGATTTCC TTTTATAACT CTTGAAACTG TCGAAGGAGA 5700 

AACACCGGCT TTTAGTGCAA CATCTTTAAT CGTAACCATT TAATCACCTC CTGTTAATTT 5760 

CTGCATCGGA AAACGCTTCC AACCACTGTA TAATACCAGT TTAGTCACAC TTTCTAAAAA 5820 

AGTCAAAAGA TTTGTGCAAA CGATTGCATA AAACGATAAA AATAAAACCT TCATACTGAA 5880 

ATTCAATCCG AAAATCAATA TAAAGGTTTG TATAAATATT AAAATCGATT GTTTAGTCAC 5940

TAACTGCAAA ATAGTTACCT TGGCCATCTT GAAAATTAAA TACACGTTGA CCATTCATTT 6000 
CTACTATATC ATGCCCAGTT AAACCTAAAT CATTTAATTT TGAGTATAAT GCATCAAAGT 6060

TTTTCTCTTT AAACATTAAA GATGGTGTTC CTAGGTTCAC TTCCGGGCTA TGCTTTTCAA 6120 

TAAATTCTTT TGCCATAATC GTCAATGACG TTTCAGCATC TTTGGTAGGT GATACTTCAA 6180 

CTGCAACATA GTCCTCAGCT AACGGTGTTT CACTTACAAC AACAAATTCT AAAGTTTCTG 6240

TCCAAAATGC TTTCGCTTTT TCGACATCAT CAACATATAA CATAACTTGA TTTAACTTTT 6300 

CCATAAAATA GTACCTCTAT TTCTCTATAG TACATGCTAT CATAACACAG TAAATATTTT 6360

ATTACTTCAC AAAATGCTTA AAAATATGGC GGGATGCTTT TAAGGTCAAG GATAATACTT 6420 

GTGTAATTTT TTATAGGTTG TAGCTACTCT ATCACACTCT CTTTTATATT TATCAAAAGA 6480

TATAAAAAAG GATAGTATCT TTCAACTATC CTTTAATCAA TATTATTCTT CAATCCATTG 6540 

TGTATGGAAT ACGCCtTCTT TATCTTTTCT TTCGTACGTA TGAGCACCGA AGTAGTCACG 6600 

TTGTGCTTGA ATTAAGTTTG CAGGTAAATC AGCAGCACGG TAACTATCAT AGTAATTAAT 6660 

ACTTGATGAG AAACCAGGTG TTGGTACACC ATTTTGAACA CCAGTTGCGA CAACATCACG 6720 

TAACGCATCT TGATATTCAG TAACGATGTT TTTAAAGTAA GGATCTAGCA ATAAGTTTTG 6780 

TAATCCTGGA TTATTATCGT AAGCATCTTT GATCTTTTGT AAGAATTGTG CACGGATAAT 6840 

GCAACCTTCT CTCCAAATCA TAGCTAAATC ACCAAGTTTT AAATTCCATT CATTATCTTC 6900

ACTTGCTTTA CGCATTTGcG CGAAACCTTG TGCATAAGAA CAAATTTTAC TCATATATAA 6960 

TGCTTTACGA ATTTTTTCTA AAAAGTCTTT CTTGTCACCA TCAAATGATG CTTTTGGACC 7020 

ATTTAATTCT TTAGAAGCAT TTACGCGCTC TTCTTTGaTT GAAGAGATAA AACGTGCAAA 7080 

TACAGATTCA GTAATGATTG TTAATGGAAT ACCTAATTCT AATGCGTTAA TTGAAGTCCA 7140 

TTTTCCTGTA CCTTTTTGaC CTGCAG7ATC AAGAATTTTT TCAACTAATG CTTCTTTATT 7200 

TTCATCTAAT TTCATGAAAA TATCACCAGT GATTTCAATT AAATAACTTT CTAATTCACC 7260 

AGCATTCCAG TCTTTGAACG TTTGAGCAAT GTCTTCATGA GACATGCCTA ATAATTCTTT 7320 
CATTTTCACA TAGTGTCCAG CACCATTAGG TCCAATATAA GTAACACATG AAGCACCGTC 7440

TTTTGCCTTT GCAGCAATTG CATCAAGAAT ATCTGCAACT TTGTTATAAG CTTCTTCTTG 7500 

TCCACCCGGC ATTAATGACG GACCAGTTAA CGCTCCAATT TCACCACCAG AAACGCCCAT 7560 

ACCAATAAAG TTGATTGCAC TTTGTGywAA TGCTTTATTA CGTCTGATAG TATCTTGATA 7620 

GTTTGTATTA CCACCATCAA TTAAAATATC TCCATCATCT AATAAAGGTA ACAAACTATC 7680

AATCGTTGCG TCCGTAGCTT TACCTGCTTG AACCATTAAT AAAATTTTAC GTGGTTTTTC 7740 

TAAAGAATTA ACAAATTCTT CCAATGAATA CGTTGGATGA ATATTTTTCC CTTTTGATTC 7800 

TTCAACCATT AAATCAGTTT TTTCACTTGA GCGGTTAAAT ACAGATACAC TATATCCGCG 7860 

TGATTCAATA TTCCAAGCTA GGTTTTTACC CATAACGGCT AAACCAATAA CTCCAATTTG 7920 

TTGTGTCATA TTACTTACCT CACTTGTTGA TTTTTCATTA GTATTGTATC ACAAAATAGA 7980 

CATACACTAC ACTAAATCAT TTCGAATGTC GCGCAACTAT TTTGATTATT TCTAACACTT 8040

GACTTGCAAG CAAGTTCAAT GATTTAATCG GCATTCTCTC ATTTGTTGTA TGGATTTTTT 8100 

CATAACCCAC TCCTAAAATG ACTGAAGGAA TACCAAATGT ATTAATAATA CTGCCGTCTG 8160 

AACCGCCACC AGAAATAATT GTATTTGCAG ATAATCCTAA ATTACGAGCA CTTTCTTGTG 8220

CAATTTTAAC AACCGCTTCA TTATCATTAA TTTTAAATCC TGGATAACTT TGCTCCACTG 8280 

TAACTACTGC TTTCCCACCT AATTCTGATG CAGTAGTTTC AAACACATCA GTCATATGTT 8340 

TGACTTGTGT TTTTATTCTT TCTGGATCGT GAGAACGTGC CTCTGCTTCT AAAATGACTT 8400

CATCTGCAAC AATATTCGTA GCTGAACCGC CATGAAACTT ACCAATATTG GCAGTAGTTA 8460 

TTTCATCAAC TTGTCCTAAT TTCATTCGAC TAATTGcTTT CGCCGCAATA TTAATAGCAC 8520 

TAACACCCTC TTTTGGCGTA CTTGCATGAG CCGTTTTGCC AAAAATTTTA GCTGAAATTA 8580 

ACATTTGCGT CGGTGCACCT ACAACCGTAG TACCGACATC AGCACTTGCA TCAATAGCAT 8640 

AACCAAAGTC CGCGTCCAAC AACTCTGAAT TTAATTCTTT AGCACCAATT AAACCTGATT 8700 

CTTCTCCAAC AGTAATCACA AATTGAATTT GTCCATGTGG GATTTGTTGT TCCTTTATCA 8760 

CTTGCAAAAC TTCAAGCATC GCTGATAATC CTGCTTTATC ATCTGCACCT AGAATAGTCG 8820

TACCATCAGA GTATATGTAG CCGTCATCTT TTACAATTGG CTTTACATTA ATTGCGGGTA 8880

CAACAGTATC CATATGGCTC GTCAAATATA ATTTAGGTAC TTCGCCTTCT TCGATAGTAC 8940 

TATTCATTGT ACACACTAGA TTATTGGCAC CTAATTTAGG ATGTTTAGCC GCTTCATCTT 9000 

CTTTAACATC TAACCCTAAT GCTATGAATT TTTCTTTTAA AATAGGTTGG ATTGTTGATT 9060

CATTCCCTGT CTCAGAATCG ATTTGTACAA GTTCAAAAAA CGTATTAAGT AATCTTTGCT 9120 
GATGAAATAA AATGTTACAG TAATTGACGT TACACAGATT TATCAGGTTT GTAAATTGTG 9240

TCATATTATT TTCAATTTAT TATATATAAT TATTGTAACT CAAACTAAGC TTTGTCAAAA 9300 

ATATATTGAT TGATTTTTCA AAGATATCGT ATAATGAGGA AAATGACATA AGCAAACTTA 9360 

CTCATGTTTT TTATTATATT CCTTTATGAT GATTGCTAGT TATATCGTCT CAAGTTAAAA 9420 

GTTTTATATC TTATGTCGTA ATTATTAATA CAAAGGTTAT TCATTTGGAG GCACACAAAA 9480

TGCAAAATAA AGTTTTAAGA ATTATCATTA TCGTTATGCT TGTATCAGTT GTATTAGCAT 9540 

TGTTATTAAC GAGTATCATT CCAATTTTAT AAACTATATC TCAACTACCT ATACAAAATC 9600 

ATACAATTAA AAATCCATCC ATTATAAACG CATGTATTAA TAAGTTATCG TATTGCAACG 9660 

ATTACTTTCA AACATGGGTC ATACGGATGG ATTATTTTTT AAGCTACTTC ACTATGCATT 9720 

TTCAATGAAC CAAATTGCGA TTTGATTTGT AAATATTCTT CTAATTCATT TAATATTTGA 9780

ATAATACTTG CTCTCGAGTT AAGCGCTTTG TGTGTTGTTG GCAATGGCAG TTCATCCAAT 9840

TTCAAACGCG TCTCATACAA ATTGTGTAAA CGCATTGCTG TATAGTCATT ACTATTCACA 9900 

TTTAGACCAA TTTCTTTCAG CAGTGACGCA ACATCATTTA AAAGCGGATC TTTATGACAG 9960 

ATACTTTCGA TGAGCGGTTT CATTCTCATT AACAATTCCA CTTGCTCTTC TCGCATATCA 10020 

AAATAATGAT AGTATGAATT TTCGTTTCTA ACAAAATGAT TTTTAACATC TCGGAACGCG 10080 

ATAGACTtCG CCTTTTTAAT ATTTAAAAGT AACACTTCAA ATTCAATCGC AATGGTATCT 10140 

TCATATTTTT CACAAATATA ACTATATTTA CTAAAAATAT CAGCAATTTG TTGCTCAATT 10200 

TTACATTTGT ATTCGTCtAG TTGTTTGTCT AAACTTGGCA TCATTAAATT CaTTGTAAAT 10260

GCAATGCTTA GTCCAATTAA CAGTAATAAT GTTTCATTAA CAATTAAATG TGCATCAATT 10320 

GATTTTGCAT TAAAAACATG AAGTAATATA ACGCAACTCG TAATGACACC TTCTTGTACT 10380 

TTTAATACGA CAGTTAATGG TATAAATAAC AATACGATAA TACCGAGTAC AATTGGACTC 10440 

TGACCTAATA AACTAAATAT TGCTGAACCT AAAAACAATA CTAAAAAACA TGATACTAAT 10500 

CTTGAAATAA TCGCTTGTAG CGAATGTACT TTTGTATGTT TAATACATAA TACGAGTAAT 10560 

ATGGCGCTTG AAGCATAATT ATCTAAACCT AACAGCTTAC TAATAATTAC ACCTAAAGTC 10620 

ATACCCACTG CTGTTTTTAT TGTTCTAAAT CCAATCTTGT AAGGATTTAA CTTTAACATG 10680 

GGTTAGCGCC TCTTATCTTT CTTCACAATA TTTATTGAAT AATGTTTGTA ATTGATTAAT 10740 

TACGTTCATC ACATCATGAC CTTCGATTTG ATGTCTTTCA ATCATTTCTG TAATCTTTCC 10800 

ATCTTTTACT AATGCAAATG ACGGACTTGA AGGCGCATAA CCTTCGAAGT ATTCACGCGC 10860 

TCTTTGTGTC GCTTCTTTAT CTTGTCCAGC AAATACTGTC ACTAGACGAT CAGGTAATAC 10920 
AGAATTGATC ATAACTAGTG TTGTACCATC TTGTTTAAGA ACTTTGTCAA CATCTTCTGC 11040

AGTAGTTAAT TGCTCATATC CCGCAGATTC AATTTCATTC CTTGCTTGTT CTACAACACC 11100

GTTCATGTAT AAATCGAAAT TCATGnCCAT AAGTTCAATC ACCTATCCCT TTATATTTAA 11160

ACTAtCCTCA TTCTACTAAT TAATAACATA TTGTTCAATA AACTAATCTG AATCACACCT 11220

ATATTTAGAC ACAATTTTAA CAATATACCA AACATTATTG TGCTTAAAAT CATGGTAACT 11280

AATTTGTTCA CATGTTTTCA TTAATATGTT TCAAGTATGA TGTCTTATTT TGACTTTACT 11340

GCAAAAATGC ATTCAACCAT GTTGATTATT GTTCTTTATC TTTTTTGAAT ATATTGCACA 11400

TATTTTAGTG CCAAAAAATA ATACATCCAT CGACAAGAAC AAGATAAAAC AAGTTGTCGA 11460

TAGATGCATC TATGTTATCA CTAATATATA TTTGTATTTT CTAAAGTATA CTGTTCGATA 11520

CGCTGTTTAA TATGATTCAT ArATTTACCT GTTTGTAAAC CATCTAAAAT ACGATGATCA 11580

ATTGAAATAC ATAAATTAAC CATGTTACGA ATTGCAATCA TATCATTAAT TACTACTGGC 11640

TTTTTAACGA TTGATTCTAC TTGTAAAATC GCTGCTTGTG GATGATTTAT AATACCCATT 11700

GATGATACTG AACCAAATGT ACCAGTATTA TTTACCGTAA ATGTACCGCC CTGCATATCT 11760

TCAGCTGTCA ATTGCTTATT ACGCGCTTTC GTTGCTAAAG TATTAATTTC TCTAGCTATA 11820

CCTTTGATTG ACTTTTOGTC TGCATGCTTA ATCACAGGTA CGTATAATTT ATTTTCATCA 11880

GCAACAGCAA TTGAAATATT AATGTCTTTA TGTAAGACAA TTTCATTTCC TTGCCAGCTA 11940

CTATTTAATA AAGGATATGC TTTTAAAGCA TCTGCTACAG CTTTTACAAA GAAAGCAAAG 12000

AACGTTAGAT TATATCCTTC TTTATTTTTA AAGCTGTTTT TATAATGATT TCTCGTATTC 12060

ACAAGATTTG TAGCATCTAC TTCAATCATC ATCCATGCAT GTGGAATCTC TGTTAGACTA 12120

TTAACCATAT TTTGCGCAAT TGCTTTACGC ACACCATTTA CTGGTATTGT GCTGTTTTCA 12180

CTATTGTCTT CAGATGATTG GTTACTTGAT GTATCTACTG ATGTTGATTT TGTTTGAACT 12240

TGTTTGTCAG ATTGAGCTGT GGTACCACCA TTTTCAATAA CTGACATTAT ATCCTTCTTA 12300

GTTACACGAC CTTCAAATCC ACTACCTACA ACTTGTGATA AATCAATGTC ATGCTCTGAA 12360

GCGAGTTTAA ATACAACAGG TGAAAAGCGA CCATTATTAC GTGGTTGATT TTGTTTAGCA 12420

GTAGATGTCT GTTCCACTGT TGCACTAGCT TTTTTAGTAG ATTTCTGAGT ATGCTCATCC 12480

ACTTTTGCTT GTATCTCTTC AGTTGTTTCA TTTGTCTTTT CATCAGCAGT TTCAATTTTA 12540

CAGATAATTG TATCAATAGC TACTGTCTGC CCCGCTTCAA CTAAAATTTC TGTAATTGTT 12600

CCTGATATCG TGGAAGGGAC TTCAGCTGTC ACTTTATCTG TAATAACTTC ACATAATGGT 12660

TCATATTCAT CAATATGATC ACCAACAGAA ACTAACCATT GTTCAATGGT GCCTTCATGA 12720
AATTCACGCA TTTTATTTAA GATTTTTTCT GGATTCATCA TAATTTCATT TTCTAATACA 12840

GGAGAAAATG GCATAGATGG TACAtCTGGA GCAGCTAAAC GCATGATTGG TGCATCTAAA 12900 

TCGAACAAGC AATGCTCTGC AATAATCGCT GACACTTCTG ACATAATACT ACCTTCTAAA 12960

TTATCTTCAG TTACAAGTAA AACTTTACCT GTATGTTTAG CACGATCAAT AATTGTTTCT 13020 

TTATCTAATG GATAAACAGT TCGTAAATCA ACGACTTCAA CATTGATACC GTCTGCAGCT 13080 

AAAATATCCG CTGCTTGTAA ACAATAATTG ACCATTAATC CATAACAAAA TACTGTTAAA 13140

TCTTCACCTT CACGTTTCAC ATCTGCTTTT CCTAAAGGTA CAGTGTAATA TTCTTCTGGC 13200 

ACTTCTTCCT TTAAGAAACG ATAAGCTTTT TTATGCTCAA AGTACAATAC TGGATCATTT 13260

GATTCGATAG ATGATAATAA AAGCCCTTTA GCATCATACG GTGTGGAAGG AATAACAATT 13320 

GTTAAACCTG GCGATGAAGC AAATATACTT TCAATACTTT GTGAATGATA TAGTCCTCCG 13380

TGAACACCGC CACCAAATGG TGCACGAATC GTTAATGGGC ATTGCCAATC ATTATTTGAA 13440

CGATAACGCA TTTTCGCAGC TTCACTAATA ATTTGATTTG TCGCAGGTAA AATAAAATCT 13500 

GCAAATTGAA TTTCTGCAAT TGGTCTTTTA CCTACCATAG CTGCACCAAT GGCAGTTCCA 13560

ACAATATTTG ACTCAGCTAA TGGCGTATCG ATAACTCTGT CTTCACCATA TTTTTGTTGC 13620 

AGTCCTTGAG TAGTACCAAA TACGCCACCT TTTCTACCAA CATCTTCACC AAGAATAAAC 13680

ACATCTTTAT TTTGTTGTAA TGCTAAGTCT TGTGCCtGcG TATCGCCTCT AAATAAGATA 13740

ATTTAGCCAT TAGTTAAGAC TCCCTTCTTC GTACACAAAT GCATAGGCTT CTTCGACACT 13800 

TGGATATGGC GCGTCTTCAG CAGCCTTTGT CGCTTTATTG ATGATGTCTT TnATgTCCGC 13860 

TTCTATTTCT GCCAACCAAG CATCATCGAT AATGCCAGCT GAAAGCAACT CTTTTTTGAA 13920

CTTTTCATTG CAGTCTGCTT TTTTAAGcGT TTCACGCTCT TCTTTCGTAC GATATTGGTC 13980 

GTOCTCATCT GATGAATGAG CTGTCATACG ACTTGTTACT GCTTCAATCA AAGTTGAACC 14040 

TTGACCAGAA ATAGCTCGAT CTCTTGCTTC TTTCATCGCT TTATACATTG CTAATGGATC 14100

ATTACCATCT ACTTGTTCAC CATGTATACC GTAACCAAGT GCTCTATCCG ATAATTTTTC 14160 

AGCTGCGTAT TGTAATGAAT CAGGTACTGA AATTGCATAT TTATTATTTA TAATGACACA 14220 

TACAAAAGGA AGTTTGTGTA CACCCGCGAA GTTTAAACCT TCATGGAAGT CACCTTGGTT 14280

TGAGCTACCT TCACCAACAG TTGCTGTTGC AATTTTCTTC TTACCATCCA TTTTTAAAGC 14340 

TAAAGCAGCA CCAACAGCAT GGGGTATTTG AGTTGCTACC GGTGAACTTT GAGACAAAAT 14400 

ATTCTTAGCT CTACTACTAA AGTGTGATGG CATTTGTTTT CCACCAGAGT TAACATCGTC 14460 

TTTCTTTCCA AACGCTGATA AAAACGTATC ATACGCTGAG ATACCCATAT AAGTAACGAA 14520 
AATC7GAGTT GCTTCTTGTC CTTGACCACT TACAACAAAT GGAATTTTAC CTGCACGGTT 14640

CAATAACCAC AGTCTTTCAT CTATTTTTCT ACCTAAATCC ATCCATTTAT ATATTACTTT 14700

TAGGTCTTCT TCGCTAAGGC CTAATGATTT ATAATCAATC ATGTTAAATC CTCCTATTTA 14760

TACGTGAATA GCTCTACTTT CTGCTTTCAA TCCTAATTCC ATCAACACTT CAGAGATGGA 14820

AGGATGTGCG TGTGTTGTTA GTCCTAATTC TAATGCCGAG CCATTCATGA ACTGTAACAG 14880

TGATGCCTCA TTAATCAATT CTGTTACATG TGGACCAATC ATATTAATAC CCACAATTTC 14940

TTCAGTTGAT TGATCAATCA CCATTTCGCT ATACCCTTCG TTTGTGTCAT GGCTATCAAT 15000

CACTGCTTTA CCAATTGCTT TAAATGGTAC TTTAAAACTT TTAACTTTCA TTCCCTCTGC 15060

CTTTGCTTGT TCAATGTTTA AACCGATAGA AGCAATTTCA GGTTGTGAAT AAATACACTT 15120

AGGCATCATG TTATAGTTTA CTGGGATTGG GTTCCCCTCA AACATATGAT CAACAGCCAC 15180

AACACCTTCT TTTGATCCAA CATGTGCCAA TTGTAATTTT CCTATACAAT CACCAGCTGC 15240

ATAAATATGT TTATCTTCAG TTTGTTGAAA TTCGTTCGTT AAAATATGTC CTGATGTTGa 15300

AAGtTTTATT TTAGTGTTGT TTAAACCAAT ATCTGATGTG TTAGGTTTTC TACCAATCGA 15360

TAGCAACACT TTATCTACTT TAATTATGTC TGAGGAAATT TCAAACGTAA CACCATCTTC 15420

GTTAACATTT ATATCATTTT CAGAAAGTTT TATTCCCTCA TAGAATTTAA CACCACGTGC 15480

TGACAATGAT TTTTTTAATA GTTGTGAAGC TTGTTTACTT TCAGTTGGTA AAATTCTTTC 15540

ACCTGCTTCT ATAACTGTTA CGTCAACACC TAAATCTATC ATCAATGATG CAAATTCCAT 15600

TCCGATAACA CCACCACCAA TAATACCAAT ACTTGATGGT AACGTCTTTA ATGATAATAT 15660

ATCATCGCTA GATAAAATTT TATCATGATC AAATGATAAG AATGGCAACT CTGCAGGCGA 15720

AGAACCAGTT GCAATTAATA CAAATTGGTT GGGTAATAAG TCTGATTCAC CATCTTCATA 15780

TTCGACAGAA ATTGTGCCAC TTTGAGGTGA AAATATAGAT GTACCTAGAA TACGTCCCGT 15840

GCCATTATAA ATGTCAATGT GATTGTGTTG CATTAAATGC TTTACACCTT GATACATTTG 15900

ATTAATAATG TCTTCTTTTC GTGCCAACAT ATTTTCAAAA TTAACATTAG CATCTTTGAC 15960

ATCAACGCCA AACATTGCTG CCTGTTTTAC TGTTTGAAAT ACTTCAGCAG ATTTAAGCAG 16020

CGATTTAGTA GGAATACAAC CTTTATGGAG ACAAGTACCT CCTAATAGTT GTCGTTCTAC 16080

TATTGCCACT TTTTTACCTA ATTGAGACGC ACGTATCGCA GCAACATATC CTGCAGTACC 16140

TCCACCGAGA ACGACTAAAT CATATTGTTT CTCTGACATG TTCTTACTCC TAACTAATGA 16200

TATATATCCA TTGAAAATTT ATTAATACAT AGTTTTCATG TCCATTAATT ACCTATTTTA 16260

CATGATTGTC TATTTAGTTT GAATGCACAT AAATAAATCC ATAAATGAGT ATTCAACACA 16320
TAAATCAGTA ACACTTGCAC CTGAAATCAT TCGTGCAATT TCATCTACTT TATCATCGCT 16440

AATTAACTCT TGAACTTGTG TTGTTGTACG ATCATCTTTT GATGATTTCG AAATTAATAA 16500

ATGATGGTCG CTCATCGATG CAACTTGTGG TAAGTGAGAG ATACAAATAA CTTGTATATA 16560

TTCTGCTaTA TCTCGCATTT TCTCTGCCAT TT 16592
(2) INFORMATION FOR SEQ ID NO: 54: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13794 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 54: 

CCAATACAAC GTAAAAAGAT TGCTTGTGTT ATTAATGAGT TAGATAAAAT AATTAAAGGA 60

TTTAATAAGG AAAGAGACTA CATAAAATAT CAATGGGCTC CAAAATATAG CAAAGAnTTT 120 

TTTATACTTT TTATGAACAT TATGTACTCA AAAGATTTTT TAAAATATCG ATTTAATTTA 180 

ACATTTCTTG ATTTATCTAT CTTATATGTA ATATCATCTC GAAAAAATGA GATACTAAAT 240 

TTAAAAGATT TGTTTGAAAG TATTAGATTT ATGTATCCTC AAATTGTTAG GTCAGTTAAT 300 

AGATTAAATA ATAAAGGTAT GCTAATCAAA GAACGATCCC TTGCAGATGA AAGGATTGTG 360 

TTAATCAAAA TAAATAAAAT ACAATATAAC ACTATTAAAA GCATATTCAC AGATACTTCC 420 

AAGATTCTCA AACCAAGAAA ATTTTTCTTT TAAATTTAAA CAGATTTACC TCTTGATAAA 480

ATAAATAAGC AATCATACTA CTTCTCAATT TAGTATAAAT AAAAATACAT AATTAACTTT 540

CTTTTGTTTT TATATTATTT CAATACCCTA CTATATATCA CAACACATAA ATTAAGCATG 600 

ACACTCATTC AATTTAGTTC ACCATTTCGT GTTCCAATTT TACTGAGTAT CATGCTTTTA 660 

ATGTTATAAA CCTAATGCTT TAATAAATCG TGTTAATTCT TCTCGCATAC TGTCATCTTT 720

CAATGCATAT TCTATGGTAG TTTTAACGAA GCCTAATTTT TCTCCAACGT CATAACGTTC 780 

GCCTTCGAAG TCATATGCAT ACACTTGGTT ATCATTATTC ATACGTTCAA TCGCATCTGT 840 

TAACTGAATT TCGTTACCTG CGCCTTCTTT TTGCGTTTTT AAATAATCGA AAATTTCAGG 900 

CGTTAATACA TAACGTCCCA TAATAGCTAG GTTTGATGGT GCCGTACCTT GTGCTGGCTT 960 

TTCAACAAAC TTTTTCACTT CATACTGACG TCCGTTTTTA GTTAATGGGT CAATAATTCC 1020 

ATAACGATGA GTATCTGCTT CCGGAACTTC TTGGACACCT ATAACTGAGT GCCCTGTTTC 1080

TTCATAAACG TCAATCAACT GTTTCACTGC TGGCACTTCA GATTCAACAA TATCGTCACC 1140 
TAAACCTTTT TGTTCTTTCT GCCTTACATA AAAAATATTC GCAAGTTCCG TTGAATACTG 1260

AACTTTCTCT AGTAATTCAG ATTTACCTTT TTCTTTTAAC ACCATTTCTA ATTCTTTTTG 1320 

ACTATCAAAA TGATCTTCAA TCGCGCGTTT GTGGCGACCT GTCACTATAA TAATATCTTC 1380

AATTCCAGCT CTTGCAGCTT CTTCAACGAT ATATTGTATT GTGGGTTTAT CTAAGATAGG 1440 

AAGCATTTCC TTTGGCATCG CTTTAGTTGC TGGTAAAAAT CTAGTCCCTA AACCAGCAGC 1500 

GGGAATGATT GCCTTTTTTA TTTTTTTCAA AGTTAATGTG CTCCTTTTCC TAAGTATTAA 1560 

ATCTATGTAT CAACGTCATT TTAACACTAA TTAGAACGCC TTCATAGTGT CATTGAGTAT 1620 

GTAATTATTT CTTGGGAAAT TTGTTTTAAT TTTAAAAAAC AGGCTTACTT CATATAATTT 1680 

ATGAAATAAA CCTGTCAATT TTGGATTGAT TATGCTTTGT GATTCTTTTT ATTTCTGCGT 1740 

AATAACGCTA AACCTAAAAT GCTAAATAAT CCGCCGAACA ACATGCCGTT GTTTGTTGAT 1800

TCTTCTCCAC CTGTTTCAGG TAGTTCAGAT TTCTTAGATT GTGCTTTTTT AGTTGGTACC 1860

ACTGCTTTAA CCTTTTCATT GATTTCAATA ACAGGTGTTA CTACTTTACC TTGTTCCACT 1920 

GGTTTAGAAG GTTTTTTAGG TTCTTCTTTA GCAGGTGGTA TTGGTTTACC AGGTTCAGTT 1980 

GGTACCTCTG GCGTTGGCGG TGTTGGTGTT TCCGGCTCGC TTGGTACTTC TGGTGTCGGT 2040

GGTGTTGGTG TTTCCGGCTC GCTTGGTACT TCTGGTGTCG GTGGCGTTGG TGGCACGATT 2100

GGAGGTGTTG TATCTTCTTC AATCGTTTGT TGACCTTCAT TATGACCACT TACTTGTGGA 2160 

AGTGTATCTT CTTCAAAGTC AACACTATTG TGTCCACCGA ATTGATAATT TGGTTTATCT 2220

TTATTTGTAT CTTCTTCAAT AATTTCAGTG TGCTTATTGA ATCCGTGAAT ATGTGGCACA 2280 

CTGTCGAAGT CGATATCAAT GATATTACCA CCTTGTTCAT ACTTAGGTTT GTCTTTCTCT 2340 

GTATCTTCTT CGAATGATTG GTTACCATTA TTTTGACCAT GAATTTGAGG TACACTATCG 2400 

AAAXCGATAT CTACGATATT GCCACCTTGT TCATATTTCG GTTTATCTTC TTCTGTGTCT 2460 

TCCTCAAATG ACTGATTACC GCTATTTTGG CCACCTTCGT AACCTAATTC ACTCTTAATA 2520 

TCCACGTGGC TATTTTCTTC GATTTCTTCA ATCACGCCAT AATTACCGTG ACCATTTTCA 2580 

GTTCCTAAAC CAGAATGAGA AATATGATGA TTGTTTTCAG TAATTTCCTC GATTGGTCCT 2640 

TGCGCTTGAC CATGTTCTTC AGGTAGTTCA TCTACTAGTT CAATCAGATT ACTTTCAGTC 2700

GTATATTCTT TCGTATCTTC AATTGTTGTA TGATCGCTAA CAGCACCAGT TACAATACCT 2760 

TTTGTAGAAT CTTCGTCAAA TTCAACTAGG TTAGACTCAG TAGTAACCTG ACCACCACCT 2820 

GGGTTTGTAT CTTCTTCATA TTCAACAACA TCAGCATGAT GTTTTGAATT TTCATGTGTC 2880

GATTCTTCAA AGTCTACATG AATAGAATCT TCTTCAGTTT CAATGGTACC TTCTGCATGA 2940 

TCTTCGATTG TACCAGTCAA TTCATGCTTC TCCACTGGCG GCTCTGATTT AAATTCAAGT 3060 

TCGATAGGAG TACTATGTTC TATAATAGGT TCCTTTAGTT TATCTTTGCC GTCGCCTTGA 3120 

GCGTTATTAG AGTAAAATGC AACGCCATTT TTCCaAGTTA AATTACTTGT ATAATAATAG 3180

TTATAATATC CAAAAAGGTG TGTTTGAAAT TCTAAGTTGC TAGCATTTGA ATCATAATAC 3240 

CCTTCATATT TTATTACATA ATTTTTACTT TGGTCTAAAT TATTAAAGTT TAAAGAATAA 3300 

CCACCATTAG TATCAAAATC TAAACTCATA TTATCAGTCA CATCTTCAAA TTTGCTGACA 3360

TCATCAAGCT TTGCATAnTn AgctTTCAGC TAAATCGTCT GAACCAATGT GTTTATATAC 3420 

CTTAACTGTT GGATTATTAA CCCCTGGTTT ATTTCCTTTA GTTACTTGAC CAGTTACTGT 3480 

CACAGAGCTT AACGACTGGT TGTTAGGTTT CATGTACGCA AAATGACTAA ATTTCCCATC 3540 

TACTTTATTT AAAGTATCAA TTCGACCATT AGCTGTTACT CCCCAATTAT CTCTAACTCC 3600 

ACCTAAATAT TGAATATTAA ATATTTTGCT AACCGTAGTC TCACCCAATT TAACTTCAAC 3660

ATTTTGGTTA CCTTTTTGCG TCACTGTTGT AGGATCAATA AATAGATTTA AAGATAATTC 3720

AGCAGTTAAA TCTTTCTTTT CTTGTACATA TTCTTTAAAC GTATATCTAA CTTTTCTTTC 3780 

TCCAATTATT TCTCCTGTCG CCATAACTTG ACCATCTGTA CTTTTTATCT CCGGAACTTT 3840

ACGCAGTGTT GAGATACCAT GAGTTTCAAC ATTATCGCTT AATGTGAAAT CAAAATAATC 3900 

TCCCGCCTTA ATTCCTTCTC CAAATTTCCA TTTATATTTC AAGGTTACTC TTTCTGCGTT 3960

ATGAGGATTT ACAACATTCG TATCTTGTTT ATGTCCTACA ATTTCACTAC CTTCTTCTAC 4020 

TTCCACTTTA TTTGTTACAT CTGTACCTGT CGCTTTAGTT TCTTCCACTA CTTCTTTCTC 4080 

TGCAACTGCT GTAACGTCAt TGatCTTTTC ATTCTTGGTT TAATTTCTGA GACGTTACTT 4140 

GGTTGAGCTA TGTCAACTTG AGTTCCTGTA GTTTCCTTAT CAGCAACTTT TTCCGATGGC 4200 

AAATCAACTC GCGAAgTTTC TACTTTTGGT GCTTGCAcAG TTTTCGGTGC TTCTTCTGTT 4260 

GTTACTTGTG TTGATTGTGA TGGTTGCTCA GTTGATGTCG CGCTGTATGA TTGTGTTTCA 4320

TCTATTGTAT TAACGTTATT TGTAGTTGTT TGTGTTTCGC TTGCTTTACT TTCAGTAGCT 4380

GAACTCCCAC TTTCCTCTAC TGTAGTATTG TTTTGTTCCG ATGCTGCAGC TTCTTTTTCT 4440 

TGTCCCATTC GAACAACGAT CATTGTTCCT AAGAATACTG AGGCCGCTCC CAATTTGTGT 4500

TTTCTTATGC CGTATCTAAG ATTGCTTTTC ACTATAATAT TCTCCCTTAA ATGCAAAATT 4560 

CATTTATTTT TAAAACTCAA TAAATGCAAT TCTATATTGT TCGGTTTTTA AAAGCAATGA 4620 

AAAAAAGCGA GTTAATAAAA AGTTAAGATT GTTGTTAACT TTATGTATAA TGAGTTTTTT 4680 

ATTATTTGAA ACTCACATAT ATATTGCATA CAAAGCTCTT GAACACCTTG ATATAACAGG 4740 
TACTAAACCA TACATAATAA TCGCCTGTAC AATGCATCAT TAACAAGTCA CTGAAACGCC 4860

TTTCATTGTA TTAATAACGT CACTATAATT TTTATATCGT TCGGTTTTTG TTTGATTTTA 4920

ATGATTATTT ATACAAAAAC AGCCGTATTT CAAGCCGACA TTTTAAATTT AACTAAATTT 4980

GCATCTAGTT AATAATTGCA TTTATCAAAT TTGTCTTATT GATCCAATCT AATTTGTACT 5040

CACAAACTAG TTTAAAATTC TAACTTTATC TCTCAGTTCG TTATCAATCA TCAGACATAA 5100 

ACCAATGAAG CAATCAGAAA ACACTCTAAT TTTCTATTAG AAATTTGATT TAATATAAAA 5160

AAACAGGCTT ACTTCATATA ATTTATGAAA TAAACCCGTC AATTTTTGTT TAATTATGCT 5220 

TTGTGATTCT TTTTATTTCT GCGTAATAAT GCTAAACCTA GAATGCTGAA TAATCCGCCG 5280

AACAACATAC CTTTGTTTGT TGATTCTTCT CCACCTGTTT CAGGTAGTTC AGATTTCTTA 5340 

GATTGTGGTT TTTTAGTTGG TGCCACTGCT TTAACCTTTT CATTGATTTC AATAACAGGT 5400 

GTTACTACTT TACCTTGTTC CACTGGTTTA GAAGGCTTTT TAGGTTCTTC TTTGGCAGGT 5460

GGTACTGGTT TACCAGGTTC AGCTGGTACC TCTGGTGTTG GCGGTGTTGG AGTTTCTGGC 5520

TCACTCGGCA CTTCTGGTGT CGGTGGTGTT GGTGTTTCCG GCTCACTTGG TACTTCTGGT 5580 

GTTGGTGGCG TTGGTGTTTC CGGCTCACTT GGTACTTCTG GTGTCGGTGG CGTTGGTGGC 5640

ACGATTGGAG GTGTTGTATC TTCTTCAATC GTTTGTTGAC CTTCATTTTG GCCGCTTACT 5700 

TTTGGAAGTG TATCTTCTTC AAAGTCAACA CTATTGTGTC CACCGAATTG ATAACTTGGT 5760 

TTATCTTTAT TTGTATCTTC TTCAATAATT TCAGTGTGCT TATTGAATCC GTGAATATGT 5820 

GGCACACTGT CGAAGTCGAT ATCAATGATG TTACCGCCAT GTTCATACTT AGGTTTGTCT 5880 

TTTTCTGTAT CTTCCTCGAA TGACTGATTA CCTTTATTTT GACCATGAAT TTGAGGTACA 5940 

CTATCAAAAT CGaTATCTAC GATATTGCCA CCTTGTTCAT ATTTAGGTTT GTCTTCTTCT 6000 

GTGTCTTCCT CGAATGACTG GTTACCGCTA TTTTGGCCAC CTTCATAACC TAATTCACTC 6060 

TTAATATCAA CGTGGCTATT TTCTTCGATT TCTTCAATCA CGTCATAATT CCCGTGACCA 6120

TTTTCAGTTC CTAAACCAGA ATGAGAAATA TGATGATTGT TTTTAGTAAT TTCCTCGACT 6180

GGTCCTTGTG CTTGACCATG CTCTTCAGGT AATTCATCCA CTAATTCAAT CAGATTACTT 6240 

tCAGTTGTAT ATTCTTTCGT ATCTTCAACT GTTGTATGAT CGCTCACtGC GCCAGTTACA 6300

ATACCTTTTG TAGACTCTTC GTCAAATTCA ACTAAGTTAG ACTCAGTAGT AACCTGACCA 6360

CCACCTGGGT TTGTATCTTC TTCATATTCA ACAACATCAG CGTGATGTTT TGAATTTTCA 6420 

TGTGTAGATT CTTCAAAGTC AATTGGATTT GATTCCTCAG AGGACTCAGT GTATCCTCCA 6480 

ACGTGACCTG CtTCGCTATC CACAGCAGTA TGGTAATCGA TATCAATAGC TGATGAATCC 6540 
TGGTAATCAA TGTCAAGAGT TGATGAATCA TATTCCTCTT CAACAGTAGT TACTAAATTC 6660

TTATCATATT GACCTGTAAG AGTTTCTTTA ATTGTATCTT CTTTATATTC AAATTTATTA 6720 

TTTTGAATAA TCGGACCATT TTTCTCATTT CCGTTCGCTT TATTACTGTA TAAAACTAAA 6780 

CCATTATCCC AAGTTAAGGT ATATCCTCTA TCATAATAAT ACTTATAAAG TTGCTCTGGA 6840 

TGTCCTACCA TTTGTGTTCT AAAATCAACT TCATCAGTAC CATTTAAATA CTCTCCATCA 6900 

TAGTGAACAA CATAAGTTTT ATCTAGATTT TCTATATTCA ATGAATAGCT TCCATTATTT 6960 

TGTAAATTCA AATTCCCACT CATATTACTT GTGACTTCTT TAAATTTAGA AGTATCTGTC 7020 

GTATTTGCAT ATACACTCTT CGCTATGTCT TCATTATTAC CCAAGTATTC AAATATCCTA 7080 

ACTTTTGGTT GATTTCCATT CTGATTACTA CCTTTCATTA AAGTTCCAGT AACAGTCACA 7140

CTTGTCGTTT TACCATTATT AGGTTTAATA AATGCAACAT GCGAAAATCT ATTATTCGCT 7200 

TTATTAAATG TCTCAATCGA TCCATTTAAA TTGGCATAAT AATTCCCAAT ACCATCTTTA 7260

TATTTAACAT CTAATTCCTT TGAAGTTTGT TCTTCATTTA GTGTTGAAGT TATAGTTTGA 7320 

TTTCCATTAG TTTGTACAGT TTTAGGATCA ATAAATAAAT TAATTTCTAG TTCAGCCGTT 7380

ACATCAACCT TATCTTCAAT ATCATTTGTA AATGTATATC TAATCTTTCC ACCTTCTAAA 744 0 

ACTTCACCTG TCGCCATTAC GACTGAACCA TTTTTAATTT CTGGTACTTT TCTAGCAGTT 7500 

GATACGCCAT GCGTATTTAC ATTATTTGAT AAAGTAAAGT CAAAGTAGTC ACCTTGATGT 7560 

AAACCATTCT CAAATTTCAA CTTATATTTT AGTACCGCTC GTTGTCCTGC ATGAGGTTCT 7620 

ACTTTATTTG TATTGTTATG CCCCTCAATA GAACCAATTT CTACTGTAAC TTTACTTGTT 7680

ACATCTGTAC CCGTTTCCAC TTTCGCGTTA CTAGCTTCCT TAGCTTCCGC TACATCTGCT 7740 

GATCTTGTCA CACGTGGCTT ACTTTCTGAT GCCGTTCTTG GCTGTGCCAC TTCAACTTGT 7800 

GTTTCTGCGA CTTGATTTTG TGTAGCCTTT TTAGGTGTTA AATCTACTTG TCTTTGATCT 7860

CCGCTATTGT CTTGAGATTG TGTTGTTTCC TTAACTTGAG GTTTCGCTTC TTCCTTAACT 7920

ACCTCTTCTT TAACTGTTTC TATATTTGCT GGTTGTGCAG TTTGTGGTGC TTGTACTGCT 7980 

TTTGGTGCTT CTTCAGTTGT TACTTGTGTT GCGTTTGACG GTTGTTCTGT TACTGTTGCG 8040 

TTATATGATT GAGTTTCTTC TATATGATTA ACGTTAGTTG CAGTTGTTTG TGTTTCACTT 8100

GTTTTATTAT CAGTAGCTGA ATTCCCATTT TCTTCTACTG TAGTTGTCTT TTGTTCTGAT 8160 

GCTGCAGCTT CTTTGTCTTG TCCCATCCCA ACAACGATCA TTGTTCCTAA GAATACTGAT 8220 


GCTGCTCCCA ATTTATGTTT TCTAATGCCG TACCTAAGAT TGTTTTTCAC TATAATATCT 8280 

CCCTTTAAAT GCAAAATTCA TTAATTTTTT AAACTTAATA AATGCAAGTC TATATTGTTC 8340 
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ATGTTAATTG ATAATTTTAT TATTTGAAAT ATACCTATAA ATTGTATTCA AGTCATCAGA 8460

AACCCTTGTC ACACAAGGCT TGTATTTTTT ATACTTATTT TTTAAATTAA ATTCATCATT 8520

ATCTAATTTA AAACAATATA CTAAACGTTT CATAATTATC GCCTGTACAA TACGCACAAA 8580

AACATGTCTT GAAACGCCTT TCATTACTCT AAAATACGCA ATATACTTTT TATATCGTTC 8640

GGATTCTGAG TATTTCAGAC GATTTTCTGC ATAAAAATAA ACGTGTTTCA AGGCAATATA 8700

TTGCAATTAC CTAAAAACAC GTTTACTTAA TATTTAGTTA AACAAATAAG CTAATGAATA 8760

AAATGAAGAT GATACCTGAA ACGGAAATAA TCGTTTCTAA TAATGACCAT GTTAAGAATG 8820

TTTCTTTTAC AGTTAAACCA AAATATTCTT TAAACATCCA AAATCCTGCG TCATTTAGAT 8880

GAGACAAAAT CACACTACCT GCACCTATCG CAAGTACAAC TAATGCAACA TTTACATCTG 8940

ATGATTGTAA TAATGGTAAG ACAATACCTG TAGTTGAAAT CGCAGCTACT GTAGCCGAAC 9000

CTAATGCGAT ACGTAGCACA GCTGCAACAA TCCATGCTAG TAAAATCGGA GACATCTCTG 9060

TACCTTCAAA CATTTTAGCA ATTGTATTTC CGACACCGCC GTCAATTAAT ACTTGTTTAA 9120

ATGTACCGCC ACCGCCAATA ATCAATAACA TCATTCCGAT TGGATAAATC GCATTCGTCA 9180

CTGATTCCAT AATATGATTC ATCTTACGCT TTCTCATTAA TCCCATCGTA ACGATTGCAA 9240

ATAATACTGC TATTAGCATG GCTGTCCCTG CTGTTCCTAT CATATAAATG ATAGATTCAA 9300

ATAGATTTGT AGGTTTGTCA TGCCCAGTTA CAAGTTGGGT TATCGTAGAC ACTAACATTA 9360

ATATGACTGG TAATGTTGCT GTTAATAAAC TCATACCAAA TCCTGGCATC TCTTGATCCG 9420

TAAATTCTTT TTGTGCACCT AACGCTGAAA TATCGCCTTC TCGTGTATAC GGAGACGGAA 9480

TCATTTTTTG TGCAcTTTGT TAAATATAGG CCCTGCAATG AGTGTAACTG GaATGGCAAT 9540

AATCATACCA TACAGTAATA CATCTCCAAC ATTTGCCTTT AATTCTTTTG CGATGACTAC 9600

CGGTCCTGGA TGTGGTGGTA AAAAGCCATG TGTCACTGAT AAAGCTGTTA CCATAGGTAG 9660

TCCTAGTTTT AACACTGAAA CATTTGCGCG TTTTGCTACT GTAAATACTA ATGGAATCAG 9720

TAAGACTAAA CCTACTTCAA AGAACAATGC AATACCGACG ATAAATGCTG CAACAAGCAT 9780

TGCCCATTGT ACATGTTTTT GACCAAATTT TTGAATCAAC GTGTCTGCGA TTCGAGTTGC 9840

ACCACCACCA TCAGCAAGCA ATTTCCCAAG TATGGCACCT AAACCGAATA TCAGTGCAAT 9900

GTGGCCGAGC GTACTGCCCA TTCCTTTCTC AATCGTCTCC ATAATTTTAG TCAATGGTAT 9960

ACCTAGCATT AACGCTGTAA TCATCGATGT GATAATTAAT GAAATAAATG TATTTAATTT 10020

AAACCCAATA ATTAATACTA ATAAAATAAC GATACCTAAA ACAACACTGA TTAACGGCCA 10080

TATTTCGTTA AACATGACAT TCCCCTCTTT CTCTTTTCAA TAGAATGTAA CACCGTCGTC 10140
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GAGTGACGTA TTTATTGTGT TTTATTTTCA GCGATATGTT GGCGTTGAAA ATCTGCAATT 10260 

TGTTCATAAT TCTCTGTTAA AGAACGACTT AAATTGATAA AAATGGATAC GATCTCTTGG 10320 

TAAACAGTGA CATTTTCTTC AATCGGCGTA TGATTGTTTG TGGCACCGAC CATCGATGAA 10380

ACGATTGAAA AATCTTCAAT GTCACCTACA GCTTTAAGTC CGAGCACGCA GGCACCTAAG 10440

CATGAACTTT CATAACTTTC AGGAACCACT AACTCTGTGT CAAATATATC TGACATCATT 10500

TGACGCCATA CTTCACTTTT CGCAAAACCA CCTGTTGCTT TTATCATCTT AGGTGTTTCA 10560 

TTCATTACTT CAATAAGCGC AAGATAGACG GTATACAAAT TGTAAAGAAC ACCTTCTAAT 10620 

GCAGCGCGAA TCATATGTTC TTTTTTATGA GATAAAGTTA AACCGAAGAA TGAACCTCTT 10680

GCATTTGCGT TCCAAAGCGG CGCACGTTCT CCTGCTAAAT AGGGATGGAA TATTAAACCA 10740 

TCTGCACCTG GTTTAACACG CTTTGCAATT TGAGTTAAGA CATCATAAGG ATCAACACCG 10800 

AGACGTTTCG CAGTTTCGAC TTCACTCGCT AGCAACTCGT CGCGCAACCA TCTCAATACG 10860

ACACCACCAT TATTTACAGG ACCTCCGATG ACGTAGTGGT CCTCTGTTAA GACATAACAA 10920 

AATATTCTAC CTTTGTAATC AGTACGCGGT TTATCTATCA CAGTACGAAT CGCCCCAGAT 10980 

GTACCGATTG TGACAGCAAC TTCTCCTTTA CCAACACTAT TGACACCTAA ATTAGAAAGG 11040

ACCCCATCAC TCGCACCAAT AACAAACGGT GTATCTTTAT TAAGCCCCAT TAATGTTGCA 11100 

TAACGTTCTT TCATACCTTT CAtCACATAC GTTGTTGGAA CTAATTCCGG CAACATTTCC 11160 

TTGGAAATAC CCAGCAGTTC TAATGCCTCA ACATCCCAAT CTAATGTTTC TAAATTAAAC 11220

ATCCCTGTTG CGGAAGCCAT TGAATAATCA ATGATATATG TATCAAATAA ATGATAGAAA 11280

ATGTATGTTT TAATATCTGC AAACTTAGCA GTACGTTGAA ATACATCTTG CCATTCATGT 11340

TTCATCCAAA AAATCTTCGC TAATGGCGAC ATAGGATGAA TCGGTGTGCC TGTTCGCTGG 11400 

TAAATCGCAT TGCCATCATG CACTTCATTT ATTACTGTTG CATATTTTGC AGCGCGGTTA 11460 

TCTGCCCAAG TAATATTATT TGTTAATCTT TGATGTTGCT GATCCATCGC AATCAAGCTA 11520

TGCATTTGCG CACTAAATGA CACAAACTTA ATGTCGTCTT TATTAACTTT GGATTCTCTC 11580 

ATAACATATT TAATAGTCAT TAGTACTGCA TCAAATAATT CATCTGGGTT TTCTTCTGAG 11640 

ACATCAACGT TTGGTGTGTG TAAATCATAG CCTATTTGAT GTTTCATGAT AAAAGTTCCA 11700

TTTTCATCAT ATAAGACTGA CTTGGTACTC GTCGTTCCAA TGTCGACACC AATCATATAT 11760 

TTCATGATAA ATCCTTCTTT CTTTCATTTT AATTCAACCA AAATCCTTCA ATATCTTTAC 11820 

CAACATCGTC GAAATTTAAA TGAAACGCTT CTTTCAAAAT TTGACTGTCG TATTGTTCCA 11880

CTGCATCAAT AAACACTTGA TGATTATGAT GTATGCGTTC AAAATCTTGC GGGTTCTGTT 11940
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AAAATGAGTT TAAATATTGA TGATTAGATG CTTTGATTAA TGTTTCATGA AATTCAAAGT 12060

CATGCTTCGT AAATGATTCT GCATCCTCAA ATTTTACTGC CACTTTCATC ATTTCAAGTT 12120

GTTTCTTCAT TTCTTTTACG ATAGGTAGTC GCTCTTGATT TTTAACTCTT GAAAATGCAA 12180

ATGACTCTAA CATCAGTCGC AAATCATACA TTTCTTTCTT TTCTTGTTCC CCAAACGGCA 12240

ACACATGTGC ACCCATTCTT TCTAATTGGA TGAGTTGATT TTGTTGCAAT AATTTAAATG 12300

CATCTCGAAT TGGCGAACGA CTCACATTAA ATTGCTTTGC CATTTGATTT TCAGTGAGTA 12360

ACGTACCTTC AGCTATGTGA CCATTCACAA TGCCTAAGCG TAATTCTGCC GCGATACCTT 12420

CTGCAGTTGT CATACCTTCC AACCATTTCT CTGGATATCC ATACATCATC AAAGTCACTC 12480

CTTCATTACA CGACATACTT GTATACAAGT ATGTTAATAT AGTTATTATG AGTTTGCAAG 12540

CGCTTTCTTT ACGAGCACTA AAATAGTGAC CACCCCTTTT CGATTTAAAT TTAAAGGAAA 12600

TGGTCACTAT CACACGAATG ATTTAATTGT TATGTTGTAT GTGGGATATT TCTAATTGTT 12660

CTGTACTCAT ATGCGCTTTA GGTACTTCAA TGCAATAATG CGTTTCATGA CAGTTTGGAC 12720

ATTCGAATCG ACGTGTTGTC GCTGTATGTT TCGCTTTGAT AACTGCCCAC AAAGATGGTG 12780

AGAATATATG CTGGCAGTTA GGACATAAAT AGGCAACCTT TTGTTGGTAA TAAAAAGTAA 12840

CACCAATGCC ATAACCAATC ATAAATGGTA AAGCAATTAA AAACGGCCAT TTATTTTTCA 12900

TCAAAATTGC ACTTATAATG CTAGAATATT GAATTATTCC TATAATACCA GCACTAATCC 12960

AAATGTTACG ACGAATACTT TTCATTTCAG CTGATTTACT CATGACATGC TCTATGTCTT 13020

TTAAGTGTGT GATTGGAGAC GTCGACGCTT CATTTACGTA ATATTGAACA TTTTTAATTT 13080

TGTTTAATAC CGCTTGTTGC TGTTTAACTT GTTGGTTAAT TTCTTGTTGT TTCATAGTTA 13140

GTAAAGTATT GAGCGTCTTC AAAGTACCTT CACCTTTTAG CAACATATCT ATATCGCTTA 13200

ACGC&CAACC TAAATCTTTA AGCAATAAGA TTAACTCTAA TGTTTGTCGC TGTTGTTCTG 13260

TATACACACG ACGCTTTCCT TCTGTAAATC CTTGTGGTTT CAAAATACCT TTGCGATCAT 13320

AATATTGAAT CGTTCGTGTT GTCACATTGC ATAATTTTGC GAGTTCTCCA GTCGAATAGT 13380

TAGACATAGA TTCCACCTCC TATAATTACC ATAGTTGATG ACCCGACGTC ACGAGCAAGT 13440

ACAATTTCCA CATTTTAAAG AAATTTATTA TACTAGGCGT CTTATTTTTA TGATTTCGTA 13500

CCATGTTGAT TTACAAACTC ACTCAAACTA AGTAACACAC CTACTAAACA TCTACTCTGT 13560

TATTTCAGAA TGAATTTGTT GTAATTTATC TTCAACTTCA GTAATCTCTG TCGCACATTC 13620

TTTCAGTAAA TCTCGATACT TTTCCGTCTC TGCATTGTTT TTATAACGTA TTTTATGTTC 13680

TAAACTTGcC CACATATCCA TACCTATCGT TCTAATTTGA ATTTCAACAG GCAATACCTC 13740
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(2) INFORMATION FOR SEQ ID NO: 55: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1059 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 55:
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GGATAAGTTC AGGTAAATTC ATTTCTTTTT CAATTTTGAT TTTCATTGTT TCCGCCCTTT 60

TAAAATAAAG TTAGTTGCTT CTGTTCCTCA TATTCCAAAT CACTTTGCTT TATATATGTT 120

TCAAGCTCTT CCGCTGTATC AAATGTCTTT TTCACACCTT GCCAACCTGG CACGATATGA 180

CCGTGAAAGT AATAAGTGCC ATTTACTACA TGGATATGTG CCACTCGTTC GTTATCCTGA 240

TACAGATATC TCTTAGATCC AAAGAATTGA TTTAGGTATT CTTTACGCGC GCTATCTGTC 300

ATGGTCATCA CTCCTTTTAA CAATTAGGCA GACCAAACGA CATGCATTCG TCGTATAGCT 360

CTTCATTACT TATGCTTGCC TTATAGTTTT CAATCACATT GCTAACTTCT TTATGACTCA 420

TTGCTTTAAC [illegible] CGCAGTCTTC TAATTCCAGT [illegible] 480

ATGACATCAC ATATTCAACT TGTCTTTGGG TTGCCATCGT TAACCCTCCC ACAAGTCAAA 540

AGCTCTTTGG ACGTAAAACT TCGCCTTTGC TAAATCCTCA TGACCATTCT TTAACGGTGC 600

TCTAGACATG TATTTGATTG CATTACCTAT TGCGAATGCT AGTTGAGGTG GATACTGTGC 660

CGTAACCTGT TCGATAAAAT CTATAATTTC AATGTCGCCG TATGTGTAGT GCGCTGGTTG 720

CTTAACATTG TCTTGCGCTT CGTTCATATC [illegible] TTACTGATTA CGCTCATTAT 780

GCTTCACTCC ATTTCTTGAA CATTTGGTTA TAAGTGACAT CGAACCAGTA CGGATCACGT 840

gaatSttttt GTGGCGTTCC ATCATAAAGC CATGGTCTTA ATCTTCTCTT TCTTTCCTGT 900

TCATATTCCG CTCTCACATT TCGTTGGTAT CGGTTCAAAA TCGCTTTTTT TCTGATTTTT 960

TCTCTCCCTT TTTCTTCATC TTTnATtTGA CTCTnCATAT ATTCAACTTC TTCTGTAGAT 1020

nTTGAGTCCT TTCTTCCACA CAATAATTCA nCGCCGCGC 1059



(2) INFORMATION FOR SEQ ID NO: 56:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30246 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GAAGTAAAAG AAGAATTAAA TTTAACATTA ACAATGGATG AAATTGAATA TGTCGGGACA 60 

ATTGTAGGTC CTGCATATCC ACAACAGGAT ATGTTAACTG AGTTAAATGG ATTTCGCGCA 120 


TTAACCAAAA TCGATTGGGA AAACGTAACT ATCAATAATG AAATTACGGA TATACGCTGG 180 

ATTGATAAAG ATAATGATGC GTTGATTGCG CCTGCTGTCA AAGTTTGGAT TGAAACTTAT 240 

GGTGGTAAAC ATGACAAATA ATGACACCAT CATGTTACGA CATTATGTCC CACAAGATTA 300 

TTCGATGTTA GAAGCTTTTC AATTAAGTGA AAGTGATTTG AAGTTTGTTA AAACGCCAGA 360

GGAAAATATT ACAGCTGCAA TGTCTGATAA TGAAAGGTAT CCCATCGTTG TAATGGATGG 420 

CAGGCAATGT GTGGCCTTTT TTACATTACA TCGTGGAAAA GGGGTCGCAC CATTTAGCGA 480

TAACCAAGAT GCAGTATTTT TCAGGTCATT TAGTGTTGAT CAACGTTATC GTAATAGAGG 540 

AATAGGTAAA GTGGTAATGG AAAAATTGGC GTCATTTATC ACTTCAACAT TTCAGGATAT 600 

TAATGAGATT GTGTTAACGG TTAATACTGA CAATCCACAT GCCATGGCAC TTTATCGCCA 660

ACAAGGATAT CAATATATGG GAGATAGTAT GTTCGTCGGA AGACCTGTTC ATATTATGGC 720 

GTTAACTATA AAATAAATTA AATTTAAAAG CATCTTTACT CATCGTCGAC CACAACAATT 780 

AATGATGAAT AAAGGTGCTT TTTGTTATAG ATCATCGGAC AATTTACTAT AGTAAAAAGC 840

GACCTAGTGA ACAATTGACA TATATCCACA GGTCGCTTAA CTTAAGTTAT ATTGCTAGTT 900 

GCGATTAATT GATAGACTCA TCATTTTTGC GCTGTCGAGA TGGTCTTTTT ATTAAAAATG 960 


CCGTAATCCA AGCCGTAATC GGAATACTGA TTGCAACGGC AATACCGCCT AAAATAATAG 1020 

AAATAAATTC TTGGGCAAAT ATTTTCGAGT TTATAATATG ACCAAATGAA TATTTAAGTT 1080 

TGAAAAACCA AATAAATAAA GCAAGTTGGC CACCAAAAAA GGCAAGGTAA ATCGTGTTCG 1140 


CAGATGTCGC TAAAATTTCT CTACCAACAC GCATGCCAGA TTGGAATAAT TCGTATTGCG 1200 

TAACGTTgGA TTCACTTGAT GCAATTCATA AATGGGTGAA CTAATGGTAA TTGTTAAATC 1260 

TATCACAGCT GCAATAACAG CAAGAATAAT AGTGAACACC ATAAATTGAA CCATATCAAT 1320 


GCCAATATTC ATTGAATACA CATATGTTTC ATCTTGTTGT TCGGTTGaAA AGCCTTGTAG 1380 

ATGACCGAAG TAGACCGATA AATAAATGAG TGTAATCAAC AATATTGTTG TAACGATAgT 1440 

GCtGgATAAA TGCaGCTTGT GTTTTAACAT TGTAACTATT GAGTACGAAT AAATTACAAG 1500

CGCCAATAAT AATGCAGAAA AAGAATGTGA CGACATAAAT CGGTACGCCA AAAATAATCA 1560 
ATACAATACT AATAATTAAA ATAGCGAAAT TTAAAAATAG GGTTAAATAA GAGATGAATC 1620
CCTTTTTACC TCCGAAAATT ATCATCAGAA AGAGGAGCAA TAACGCCAAT ATAAATACAG 1680

CATTCATTGT TTCGCCCTCC TTAATGTTTC AAATATTTCC ATAAACAATA TTGTGATAGG 1740 
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CATCGAAATA GTATAAGTCA CTGTATTGGC ATTTTTTAAA AAGATTAAAA ACATAGGTAG 1860 

TGCACCGGAT AAATATGAGA ATAATAAGAT GTTAGTCATT GTTCCCATAA TATCTTGGCC 1920 

GATGTTTCGC CCAGCAAGCG CCCATCTCCT CATTGAAATG TGTGGCGTAC GCTGTAAAAT 1980 

TTCATGCATA CCACTAGCAA TTGTAATTGC AACATCCATA ATAGCGCCAA GTGAACCTAT 2040 

TAACACTGAG GCTAGGAAGA TATCTTTCGG TGGTAATGAT AAAAAGTTCA TCGTTTCATA 2100 

TTTAATGCCT TTACCATCTG TCATATATAT GATTAATTCT GTTAAACCTA TACTCAAAAA 2160 

AGTTCCGATA ATTGTACTGG CTATGGTAAT GAGTGTACGC ATATGCCAGC CTGTAACGAG 2220 

CAATAAAGTG AGTATTGTTG AACAGATCAT GGCAATGGTC ATGAGTAAGA ATAAATTAAT 2280 

ATTGCTATGT TGAATATGAA TGTAAATTGC GATTAATATG GCAATAGAAT TCAAGATTAA 2340 

CGATAAAATC GATTGCAGTC CGACTTTGCG ACCAACCAAT AATACAGTTA ATAAGAACAA 2400 

ACCAGTGATG ATAACCGTTA AGGTATCACG CTTCTTTTCT ATAATATAAG CATCACTCGG 2460 

CTTGTTAGAA ATATGTAATA ATACTTTTTC GTGTGTGCGA AATGCCTCAG AATCTGCTTG 2520 

CGATTTGACG TACTGATGAT TAATCGTCGT CGTTTCTCCA GCAAATTGAC CATTTAATAT 2580

TTTGACTTTT AATTGATTTT TATATTTAAT ATCACGATTA TTTTGTGCAT CTTTTGTAGG 2640

TGTCGAAGAA ACATGTTTGA CATCTATAAT TTGACCAATT GGTTTGTTGT AAAAGTTCTC 2700

ATTATTGAAT GTAAATAAAA TAGCACCAAT GAATGCGATG CAGAACAAAC CTAAAATTAT 2760 

ATTAAATGGC TTTGTAAATA AATTTCTATA TTTCAAAAAC AAAACCCCAA TTCTATGAAT 2820

GAATTAATAT GGTGATTATA CGCCCTTAAT TTTTTATTTT CAAAGATATT ACTGCTAAGT 2880 

GTAAAACGAA AATCATCATT GATAGCATCG AATTACTTAA TGGAATGTAG ACGTTTTAGT 2940 

CATTAATTGC TGAATAAGTG TTAATAATAT GCCAATATCA CTCTTTGTAT AAGGCTCCTT 3000

TGT&ATAGCA CATATCGTTC TTTTTAATTC AGTATGATCT AATTTTATAT CTATCCATGA 3060 

TTTAGATTCT GGTAAATGTA TATTTTGTGA TGAAATGATG TAACCTTCTT TTTGACGAAG 3120

GAGATAcTGC GCAAGTGGTT GGCTACTGAT TGTGTATACA TCTGATTTAG TAATCTTGCG 3180 

CAATTGTTTT TTTACAGTTT CGGCAAATGG TGCCAAGCAA TAAATATGAC TATGCTCAAA 3240 

CTGAATTAAT GGTGGGTGTG TCGCCATCGT AATTGGATCG TCTGAAGGCG CATATAAATG 3300 

ATAGTGCTCT TCGAATAAAG GTAGCATATG TAATTGTTTG TGTTTACGTA TTTCTGGTGT 3360 

AAGTTCCGTG AAACCAATGT CTATATTCCC ATTTAATACG CTATTTATAA TTGTGTCATG 3420 

TTCTAATAAG CTCGGTATGA CATGTGTATC ATTTTGTAAA TGAAACGTTT GGATAAGTGG 3480

TAGTAACATG TGGGATACGT CACTCTCATC ATAGCCAATG TAGATACTTT TATTTTTAGT 3540 
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TTCATTAAAT AATAATTTCC CTTCAGATGT GAGCGTAATA TTGCGTCCTT GCTTTTTAAA 3660 

TAAAGACACA TTAAGTTCTT GTTCTAATAA TGTAATTTGA CGGCTTATCG CTGATTGAGC 3720 

AATGTTTAGT TCAAGTGCTG TTTCGGAGAT ATGTTCTCTT TTAGCGACCT CGATAAAATA 3780

TCTTAATTGT TTAATTTCCA TAGCGATATA GGCACCTCCA AAAATGAGTG TTTTGTAACT 3840

ATTATAGCAA TATTATTGAT AAATGTTCTA TTTTTTAGAT GAATATCTTC TATTTTATAT 3900

ATTGAACAGA TAAATTTTTT AGATTATAGT AATTATCATT AATAACTAAT ATCAGAATAT 3960 

TCTAAAAAAG GGGTGTGCAT CATGCACAAT GAGAAATTAA TTAAAGGCTT ATATGACTAT 4020 

CGTGAGGAAC ATGATGCGTG TGGTATTGGT TTTTATGCGA ATATGGATAA TAAAAGGTCT 4080

CACGACATCA TTGATAAATC GCTTGAAATG TTGCGACGCT TAGATCACAG GGGCGGGGTC 4140 

GGCGCAGATG GCATCACTGG TGATGGCGCA GGTATTATGA CTGAAATACC TTTTGCATTT 4200 

TTCAAACAAC ATGTAACGGA CTTTGATATC CCAGGTGAAG GTGAATATGC CGTGGGGTTA 4260

TTTTTTTCCA AAGAACGCAT TTTAGGTTCT GAACATGAAG TAGTTTTTAA AAAATATTTT 4320

GAAGGCGAAG GGTTATCAAT TCTTGGTTAT CGTAATGTAC CAGTTAATAA AGATGCCATT 4380 

GCTAAACATG TAGCAGATAC GATGCCAGTC ATTCAACAAG TGTTTATTGA TATTAGGGAC 4440

ATTGAAGATG TTGAAAAGCG TTTGTTTTTA GCGAGAAAAC AATTAGAGTT CTATTCGACT 4500 

CAGTGCGATT TAGAATTGTA TTTTACGAGC TTATCACGCA AAACAATTGT ATATAAAGGT 4560 

TGGTTACGAT CAGACCAAAT TAAAAAACTA TATACAGATT TATCGGATGA TTTATATCAA 4620

TCAAAGCTAG GGTTAGTGCA TTCGAGATTT AGTACGAATA CATTCCCGAG TTGGAAAAGG 4680

GCACATCCTA ACCGTATGTT AATGCATAAT GGTGAGATTA ACACGATTAA AGGTAATGTA 4740

AACTGGATGC GAGCACGCCA ACATAAATTA ATCGAAACAT TATTTGGCGA GGATCAACAT 4800 

AAAGTGTTTC AAATTGTCGA TGAGGATGGT AGTGACTCTG CCATTGTAGA TAATGCGCTA 4860

GAGTTCTTAT CGTTAGCCAT GGAGCCAGAA AAGGCAGCGA TGTTACTCAT ACCTGAACCT 4920

TGGTTATATA ATGAAGCGAA TGATGCAAAT GTACGTGCGT TTTATGAATT TTATAGTTAT 4980

TTAATGGAAC CGTGGGATGG TCCTACAATG ATTTCGTTCT GTAACGGTGA CAAACTTGGC 5040 

GCGCTTACAG ATAGAAATGG ATTACGTCCA GGTCGTTATA CGATTACTAA AGATAACTTT 5100 


ATTGTCTTTT CATCTGAAGT GGGTGTTGTG GACGTACCTG AAAGTAATGT TGCTTTTAAA 5160 

GGTCAATTGA ATCCTGGAAA GTTATTGCTT GTTGATTTTA AACAGAATAA AGTCATTGAA 5220 

AATAATGATT TAAAAGGTGC GATTGCTGGA GAATTACCAT ATAAAGCGTG GATTGATAAC 5280

CATAAAGTTG ACTTTGATTT TGAAAATATA CAATATCAAG ATTCGCAATG GAAAGATGAG 5340 
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CAGGAACTTG TAGAAGGTAA GAAGGATCCT ATCGGTGCAA TGGGATATGA TGCGCCAATT 5460

GCAGTGTTGA ACGAGCGACC AGAATCACTA [illegible] TTAAACAGCT GTTTGCACAA 5520

GTTACGAATC CACCAATTGA TGCGTATCGT GAAAAAATHG TAACGAGTGA [illegible] 5580

TTAGGTGGCG AAGGTAACTT ACTAGCACCT GACGAAACGG TTTTAGATCG TATTCAATTG 5640

AAAAGGCCGG TATTGAATGA ATCACACTTA GCAGCGATTG ATCAGGAACA TTTTAAATTA 5700

ACTTATTTAT CAACGGTATA TGAAGGGGAT TTGGAAGATG CGTTAGAAGC ATTAGGCCGA 5760

GAAGCAGTGA ATGCTGTAAA GCAAGGCGCT CAAATTCTAG TGTTAGATGA TAGTGGATTA 5820

GTTGATAGCA ATGGCTTTGC AATGCCGATG TTACTCGCAA TAAGTCATGT GCATCAATTA 5880

CTTATTAAAG CAGATTTACG TATGTCTACA AGTTTAGTCG CTAAATCTGG TGAGACACGA 5940

GAAGTGCATC ATGTTGCTTG TTTACTCGCA TATGGCGCGA ATGCAATTGT GCCATACCTA 6000

GCGCAACGTA CAGTTGAACA ACTGACATTG ACAGAAGGGT TACAAGGCAC CGTTGTCGAT 6060

AATGTTAAGA CATATACGGA TGTATTGTCA GAAGGTGTCA TTAAAGTAAT GGCTAAGATG 6120

GGAATTTCGA CAGTGCAAAG TTATCAAGGG GCACAAATAT TTGAAGCGAT TGGCTTGTCT 6180

CATGATGTGA TTGATCGTTA TTTTACTGGG ACACAGTCTA AGTTATCTGG TATTTCGATT 6240

GATCAAATTG ATGCTGAAAA TAAAGCACGT CAACAAAGTG ATGATAATTA TCTTGCATCA 6300

GGTAGTACAT TCCAATGGAG ACAACAAGGT CAACATCATG CTTTTAATCC GGAATCTATT 6360

TTCTTATTGC AGCACGCATG TAAAGAAAAT GACTATGCGC AATTTAAAGC ATACTCTGAA 6420

GCGGTGAACA AAAATAGAAC AGATCACATT AGACATTTAC TTGAATTTAA AGCATGTACA 6480

CCGATTGACA TCGACCAAGT TGAACCGGTA AGTGACATTG TCAAACGCTT TAATACAGGG 6540

GCGATGAGTT ATGGATCGAT TTCAGCGGAA GCACATGAAA CGTTAGCACA AGCCATGAAC 6600

CAAUAGGTG GAAAGAGTAA TAGTGGTGAA GGTGGCGAAG ATGCAAAACG TTATGAAGTA 6660

CAAGTTGATG GAAGCAACAA AGTAAGTGCG ATTAAACAAG TTGCTTCTGG GCGTTTTGGT 6720

GTAACTAGTG ATTATTTACA ACATGCCAAA GAAATTCAAA TTAAAGTTGC GCAAGGTGCA 6780

AAGCCTGGTG AAGGTGGTCA ATTACCTGGT ACTAAGGTAT ATCCGTGGAT TGCGAAGACA 6840

AGAGGGTCAA CGCCAGGTAT CGGTCTGATT TCACCACCGC CACATCATGA TATTTATTCA 6900

ATAGAAGATT TAGCGCAACT GATACATGAT TTGAAAAATG CGAATAAAGA TGCAGATATC 6960

GCGGTAAAAT TAGTTTCGAA AACAGGTGTT GGTACCATTG CATCTGGGGT GGCAAAAGCA 7020

TTTGCAGATA AAATTGTCAT CAGTGGTTAC GATGGTGGTA CAGGGGCTTC ACCCAAAACG 7080

AGTATTCAGC ATGCCGGTGT TCCTTGGGAG ATTGGTTTAG CAGAAACACA TCAAACATTA 7140
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50 



AAAGATGTAG CGTACGCATG TGCGCTTGGA GCGGAAGAAT TTGGATTTGC AACTGCACCA 7260

TTAGTGGTGT TGGGCTGTAT TATGATGCGT GTATGCCATA AAGATACATG TCCAGTAGGA 7320

GTTGCAACTC AAAACAAAGA TTTACGTGCT TTATATAGAG GTAAAGCACA TCATGTTGTT 7380

AATTTTATGC ATTTTATTGC ACAAGAATTA AGAGAAATTT TAGCATCTTT AGGTTTGAAA 7440

CGTGTAGAAG ACTTAGTTGG AAGAACTGAT TTATTACAAC GATCATCAAC ATTAAAAGCG 7500

AATAGCAAAG CGGCTAGTAT TGATGTTGAA AAACTGTTAT GTCCTTTCGA TGGGCCAAAC 7560

ACAAAAGAAA TTCAACAAAA TCATAATCTT GAGCATGGAT TTGATTTAAC AAATTTATAT 7620

GAAGTAACGA AGCCATATAT TGCTGAAGGG CGTCGCTATA CAGGTAGCTT TACAGTAAAT 7680

AATGAACAAC GTGATGTAGG GGTTATTACA GGTAGTGAGA TTTCGAAACA ATATGGAGAA 7740

GCAGGACTTC CTGAAAATAC AATTAATGTT TATACGAATG GTCATGCTGG TCAAAGTCTT 7800

GCAGCATATG CACCGAAAGG CTTAATGATT CATCATACTG GAGATGCGAA TGACTATGTT 7860

GGTAAAGGAT TATCTGGTGG TACGGTCATT GTCAAAGCAC CTTTTGAAGA ACGACAAAAT 7920

GAAATTATTG CTGGTAACGT CTCATTCTAT GGTGCGACAA GTGGTAAGGC ATTTATTAAC 7980

GGTAGTGCAG GAGAAAGATT CTGTATTAGA AATAGTGGTG TAGATGTTGT CGTTGAAGGT 8040

ATCGGCGACC ATGGATTAGA GTATATGACT GGTGGACATG TCATTAATTT AGGTGATGTA 8100

GGTAAGAACT TCGGTCAAGG TATGAGTGGT GGTATTGCTT ACGTTATCCC GTCTGATGTA 8160

GAAGCTTTTG TTGAAAATAA TCAACTAGAT ACGCTTTCGT TTACAAAGAT TAAACACCAA 8220

GAAGAAAAAG CATTCATTAA GCAAATGCTG GAAGAACATG TGTCACACAC GAATAGTACG 8280

AGAGCGATTC ATGTGTTAAA ACATTTTGAT CGCATTGAAG ATGTCGTCGT TAAAGTTATT 8340

CCTAAAGATT ATCAATTAAT GATGCAAAAA ATTCATTTGC ACAAATCATT ACATGACAAT 8400

GAAGATGAAG CGATGTTAGC TGCATTTTAC GATGACAGTA AAACAATCGA TGCTAAACAT 8460

AAACCAGCCG TTGTGTATTA AGGAAAGGGG GAGATACGAT GGGTGAATTT AAAGGATTTA 8520

TGAAGTATGA CAAACAGTAC TTAGGTGAAT TATCACTGGT AGACCGTTTG AAGCATCATA 8580

AAGCATATCA ACAACGATTT ACTAAAGAAG ATGCCTCTAT CCAAGGTGCA CGATGTATGG 8640

ATTGTGGAAC GCCGTTTTGT CAAACCGGAC AACAGTATGG [illegible] [illegible] 8700

CAATTGGAAA CTACATTCCT GAATGGAACG ACTTAGTGTA TCATCAAGAT TTTAAAACTG 8760

CTTATGAACG CTTAAGCGAA ACAAATAACT TTCCTGACTT TACAGGGCGT GTATGTCCTG 8820

CACCATGCGA AAGTGCTTGT GTGATGAAGA TTAATAGAGA ATCGATTGCG ATTAAAGGTA 8880

TTGAACGCAC AATTATTGAT GAAGCTTTTG AAAATGGTTG GGTAGCGCCG AAAGTTCCGA 8940
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w 



CTGAAGAACT TAATCTACTA GGATATCAAG TAACTATTTA TGAACGTGCT AGAGAATCAG 9060

GCGGTTTATT AATGTATGGT ATTCCGAATA TGAAACTTGA TAAAGATGTG GTTCGACGTC 9120

GTATTAAGTT AATGGAAGAA GCGGGCATTA CTTTCATTAA TGGTGTTGAA GTCGGTGTTG 9180

ATATTGATAA AGCAACGTTA GAATCTGAGT ATGATGCCAT TATATTATGT ACTGGTGCAC 9240

AAAAAGGTAG AGATTTACCT TTAGAAGGAC GCATGGGTGA TGGTATACAT TTCGCTATGG 9300

ATTATTTAAC TGAACAAACG CAGTTGTTAA ATGGAGAAAT TGATGATATA ACAATAACTG 9360

CAAAAGATAA GAATGTCATT ATCATTGGTG CTGGTGATAC AGGGGCAGAC TGTGTAGCGA 9420

CAGCATTAAG AGAAAATTGT AAATCGATTG TTCAATTTAA TAAATATACG AAATTGCCAG 9480

AAGCAATTAC ATTTACAGAA AATGCATCAT GGCCTTTAGC AATGCCGGTG TTTAAAATGG 9540

ACTATGCGCA CCAAGAGTAC GAAGCTAAGT TTGGTAAGGA ACCACGTGCA TATGGTGTTC 9600

AAACAATGCG TTACGATGTT GACGATAAAG GACACATACG TGGTTTGTAT ACTCAAATTT 9660

TAGAGCAAGG CGAAAATGGT ATGGTCATGA AAGAAGGACC TGAAAGATTT TGGCCTGCTG 9720

ACCTTGTATT ATTATCAATC GGCTTCGAAG GTACAGAACC AACAGTACCG AATGCTTTTA 9780

ACATTAAAAC OGATAGAAAT CGAATCGTGG CGGATGATAC AAACTATCAA ACTAATAATG 9840

AAAAGGTATT TGCTGCTGGA GATGCTAGAC GTGGTCAAAG TTTAGTTGTA TGGGCAATTA 9900

AAGAAGGTAG AGGCGTAGCG AAAGCAGTAG ATCAGTATTT AGCTAGTAAA GTTTGTGTAT 9960

AATCTTTGTA TGGAAATGGT GGTTACGTTG ACGTTGTGAC ATGCTGAATC GAGTTTGAAA 10020

AAATCTAGTA TCTATCAACG TCACATGCCA TCTTTGTAAC CTAAAAACAA AGGTTTGTAA 10080

GACAACAAAT AGATTAATTA TAAGTAGTGA TTTTTTACAT TCGTTTATAG GTCAACTGTA 10140

GTGGAAGACA ATGATTTGTG GTAATCATGT AATGCTTAAA AACAATATTG ACTTTTACAG 10200

AACGTTCATA TATGATAAAT ATTGTGTTTA GGAGGAATAC CCAAGTCCGG CTGAAGGGAT 10260

CGGTCTTGAA AACCGACAGG GGCTTAACGG CTCGCGGGGG TTCGAATCCC TCTTCCTCCG 10320

CCATCAATAT TTATATTAAA TTCTATATAT AATGAAGGTA AGTGCTCAAA TTTTGAGTAT 10380

TTACCTTTTT TATTTGTCTT TGAATGGCTC GTAATTTTTG ATAATAGAAA TGATAAGGCA 10440

TTGAGATTGG AAGGGCATTT GGCTTGTGCA ATATACATAG CTAAATGTCT TTTTTGTTTT 10500

GTGAAATATG ATGGATGGCT TGTGTGGACA AGTTTGCTAT TTATAGATAT GCATTTTTCA 10560

ATTTAGGAGT TGGCCATGCA TCTACACTTT ATAATGGTGA GAGCGTGGTG AGGTATTGTT 10620

AATAACGCAA TTGTAGCGAG GAGTTATTGC TACATATGTC GTTATGGCTC ATTGATTTTC 10680

TGAAATGGCT ACCCCAGATA ATTGTGACAA AATAAAAATA TTTTGTTGAA AGCCTTTACA 10740
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TAAAAAGAGA AGATGTAAAA GCCATCGTAA CCGCTATTGG GGGAAAAGAA AATCTTGAAG 10860 
CTGCAACGCA TTGTGTAACA CGATTACGTT TAGTGCTGAA GGATGAAAGT AAAGTTGATA 10920

AAGACGCATT AAGTAATAAC GCGTTGGTCA AGGGGCAGTT TAAAGCAGAC CATCAATATC 10980

AAATTGTCAT TGGTCCAGGA ACAGTCGATG AAGTGTATAA GCAGTTTATT GATGAAACAG 11040 

GTGCTCAAGA AGCTTCGAAA GATGAAGCGA AACAAGCAGC TGCACAAAAA GGGAATCCAG 11100 

TACAACGTTT GATCAAATTG TtGGGGGATA TTTTTATACC AATATTACCT GCGATTGTGA 11160

CAGCTGGTTT GTTAATGGGA ATCAATAATT TACTTACAAT GAAAGGTTTA TTTGGTCCAA 11220 

AAGCACTTAT TGAGATGTAT CCACAAATTG CTGATATTTC AAACATCATT AATGTGATTG 11280 


CGAGTACGGC ATTTATTTTC TTACCAGCAT TAATTGGTTG GAGTAGTATG CGTGTATTTG 11340 

GTGGTAGTCC GATTCTAGGC ATAGTCTTAG GTTTGATTTT AATGCATCCG CAATTAGTAT 11400 

CTCAGTATGA TTTGGCAAAA GGGAATATTC CGACGTGGAA CTTATTTGGC TTAGAGATTA 11460 


AGCAGTTGAA TTACCAAGGT CAAGTGTTGC CAGTtTTAAT TGCAGCTTAC GTTCTAGCTA 11520 

AAATTGAAAA AGGATTAAAT AAAGTCGTTC ACGATTCGAT AAAAATGTTG GTCGTTGGAC 11580

CCGTAGCGCT TTTAGTTACT GGATTTTTAG CATTTATTAT CATTGGACCA GTTGCGTTAT 11640 


TGaTTGGTAC AGGTATTACA TCTGGTGTTA CATTTATATT CCAACATGCA GGATGGCTTG 11700 

GCGGAGCAAT ATATGGATTG TTATATGCAC CACTTGTAAT TACAGGACTA CACCATATGT 11760 

TTTTAGCAGT AGATTTCCAA TTGATGGGTA GCAGCTTAGG CGGTACGTAT TTATGGCCAA 11820

TTGTTGCGAT TTCCAATATT TGTCAGGGCT CTGCAGCATT TGGAGCATGG TTTGTCTATA 11880 

AACGTCGTAA AATGGTTAAA GAAGAAGGCT TGGCATTAAC ATCTTGTATT TCTGGTATGT 11940 

TAGGTGTTAC TGAACCAGCC ATGTTCGGTG TGAACTTACC TCTGAAATAT CCATTTATCG 12000

CTGCGATATC AACGTCTTGT GTATTGGGGG CAATCGTTGG TATGAATAAC GTACTTGGAA 12060 

AAGTTGGTGT TGGTGGCGTG CCAGCATTCA TTTCAATTCA AAAAGAATTT TGGCCAGTAT 12120 

ATCTTATTGT GACAGCTATT GCTATTGTTG TACCATGTAT ACTAACAATT GTGATGTCTC 12180

ATTTTAGTAA ACAAAAAGCG AAAGAAATTG TTGAAGATTA ATAAAATAAA AAAGGGGCGT 12240 

TCGTTATTTG GACGTCCTTT ATTACGTTAT AAGGTGGTAA TTGTGTGTCG AAAGAAATAG 12300 


ATTGGAGAAA ATCCGTTGTA TATCAAATTT ATCCTAAGTC GTTTAATGAT ACGACGGGGA 12360 

ATGGTATAGG AGATATCAAT GGAATTATAG AAAAATTGGA TTATATCAAG TTXTTGGGTG 12420 

TTGATTATAT TTGGTTAACA CCAGTGTATG AATCACCGAT GAATGATAAT GGCTATGATA 12480 


TCAGCAATTA TTTAGAAATC aATGAAGACT TTGGAACGAT GGATGATTTT GaAAAGTTAA 12540 
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10 






CGACGGAGCA TGaATGGTTT AAAGAAGCCC GTAAATCTAA AGATAACCCy TATAGAGATT 12660 

ATTACTTTTT CAGATCATCT GAAGACGGGC CGCCAACAAA TTGGCATTCT AAATTCGGTG 12720

GTAATGCATG GAAGTATGAT TCTGAGACAG ATGAATATTA TTTACATTTA TTTGATGTCA 12780 

GTCAAGCTGA TTTAAATTGG GATAATCCGG AAGTACGTCA ATCGTTATAT CGCATAGTCA 12840

ATCATTGGAT AGACTTCGGC GTTGATGGTT TTCGATTTGA TGTCATTAAC TTAATTTCTA 12900 

AAGGTGAATT TAAGGACTCT GACAAAATAG GTAAAGAATT TTATACGGAT GGTCCTAGAG 12960 

TGCATGAGTT TCTGCATGAA TTAAATCGTC AAACGTTTGG TAACACTGAC ATGATGACTA 13020 

TAGGAGAAAT GTCTTCGACG ACGATTGAAA ATTGTATTAA GTATACACAA CCAGAACGCC 13080 

AAGAATTGAA TAGTGTTTTT AATTTTCATC ATCTAAAGGT TGATTATGTT GATGGTGAAA 13140 

AGTGGACAAA TGCGAgcTTG nATTTTCATA AGTTAAAGGA AATTCTGATG CAATGGCAAC 13200 

GAGGTATTTA TGACGGTGGC GGATGGAACG CGATTTTCTG GTGTAATCAT GATCAGCCAC 13260 

GGGTAGTGTC TAGATTTGGT GATGATACGT CGGAAGAGAT GAGGATACAA AGTGCTAAAA 13320

TGTTAGCTAT CGCACTGCAT ATGTTGCAAG GGACGCCATA TATTTACCAA GGTGAAGAAA 13380 

TTGGTATGAC GGACCCACAT TTTACATCAA TAGCACAATA TCGTGATGTT GAATCGATTA 13440 

ATGCCTACCA TCAGTTGTTA AGTGAAGGGC ATGCTGAAGC GGATGTGTTA GCGATTTTAG 13500

GACAGAAGTC ACGAGACAAT TCGAGAACGC CTATGCAATG GAGTGATGAT GTTAATGCTG 13560

GATTTACAGC TGGTAAnCCT TGGATTGATA TTTCGGAAAA TTATCATCAG GTCAACGTTA 13620 

GACAAGCACT TCAGAATAAA GAGTCTATTT TCTATACGTA TCAAAAATTA ATACAATTAA 13680 

GACATACGCA TGATATTATT ACGTATGGAG ACATTGTGCC ACGTTTTATG GATCATGATC 13740 

ATTTATTTGT TTATGAACGT CATTATAAGA ATCAACAATG GCTAGTAATT GCGAATTTCT 13800 

CAGCATCGGC TGTTGATTTG CCAGAAGGAT TGGCTAGAGA AGGTTGTGTT GTGATTCAAA 13860

CAGGCACAGT GGAAAATAAT ACGATAAGCG GGTTTGGTGC AATTGTAATC GAAACAAACG 13920

CGTAAAATAA ATTGAGTGGA TGCGTTTATA TGGCGAAACA AAAAAAGTTT ATGAAGATTT 13980

ATGAGGCGTT GAAAGAAGAT ATATTAAACG GGCAGATTCA ATATGGTGAA CAAATTCCGT 14040 

CTGAACATGA TTTGGTGCAA TTGTACCAGT CATCTCGAGA GACCGTGCGT AAGGCATTAG 14100 

ATTTGTTGGC ATTAGACGGC ATGATTCAAA AGATTCATGG TAAAGGGTCA CTTGTCATTT 14160 

ATCAGGAGGT TACAGAGTTT CCATTTTCTG AACTTGTTAG TTTTAAAGAA ATGCAAGAAG 14220

AAATGGGCGT CGCATATTTA ACTGAAGTTG TTGTGAATGA GGTTGTTGAA GCGCATGAAG 14280 

TTCCAGAAGT TCAACATGCT TTAAACATCA ATTCTAGTGA ATCACTCATT CATATTGTTA 14340 
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TTGTTTCAGA TATAGGTAAT GATGTTGCGA GTGATTCTAT TTATGATTAT TTGGAAAAGG 14460 

TATTAAATCT TAATATTAGT TATTCAAGTA AGTCTATTAC TTTTGAACCG TTTGATGAAC 14520 

AAGCATATCA ATTGTTTGGT GATGTATCGG TGGCTTATTC AGCAACAGTT CGAAGTATTG 14580

TGTATTTAGA AAATACAATG CCGTTTCAAT ATAATATTTC AAAACATCTT GCAAATGAAT 14640 

TTAAATTTAA TGACTTCTCA AGACGTCGTA TAAAGTAAAC AATGATATAA ATGATTTATA 14700 

CTTGCAATTA ACTATTAAAA TATAGTAATA TATATCTTGC CGTGCTAGGT GGGGAGGTAG 14760

CGGTTCCCTG TACTCGAAAT GCGCTTTATG CGAGGCTTAA TTCCTTTGTT GAGGCCGTAT 14820 

TTTTGCGAAG TCTGCCCAAA GCACGTAGTG TTTGAAGATT TCGGTCCTAT GCAATATGAA 14880 

CCCATGAACC ATGTCAGGTC CTGACGGAAG CAGCATTAAG TGGATCATCA TATGTGCCGT 14940

AGGgTAGCCG AGATTTAGCT AACGACTTTG GTTACGTTCG TGAATTACGT TCGATGCTTA 15000 

GGTGCACGGT TTTTTATTTT TTAAATATTA AACCGATTAT TAAGAGTTGA AAATATATAA 15060 

TTATAGAAGC TACTTTCTTG AAGACAATTC AGCGTATTAT ACGTGGAACA TGTTTGTGGG 15120

AAGTAGCTTT TTTATATGTG AAGTTTGATT CAAGTGAACT CGATGTGCAG TTTGAATGAT 15180 

TTTTGTGTCA ATGAAAAGTA AGAAGTTATA ATTTGATGAT AAAGAAATGA TGGTGAAATG 15240

AGGGGGAGTA TCTTACAATA GAATTATTAA TGAGATACGT TATGATTATT GACAATCAAA 15300 

TGCCTACGGA GGACATATGC AAATATATTT AAGTACTTTA ACAGAGTTAG ATTATGATAA 15360 

ATCTTTAAAT AGTATTGAAG AAAGTTTTGA TGATAATCCT GAAACGAGTT GGCAAGCACG 15420

TGCGAAAGTA AAACATTTAA GAAAATCTCC TTGCTATAAT TTTGAATTAG AAGTAATAGC 15480

GAAAAATGAA AATAACGATG TCGTTGGACA CGTTTTATTA ATTGAAGTAG AAATTAATAG 15540 

TGATGATAAG ACGTATTATG GTTTGGCGAT TGCCTCTTTA TCAGTTCATC CTGAATTACG 15600

TGGACAAAAA TTAGGTCGTG GCTTGGTTCA AGCAGTAGAA GAGCGTGCCA AAGCACAAGA 15660 

GTATAGTACG GTTGTTGTAG ACCATTGTTT TGACTACTTT GAAAAGTTGG GTTATCAAAA 15720 

TGCTGCTGAG CATGACATTA AATTAGAATC TGGTGATGCA CCGTTACTTG TAAAATATTT 15780

ATGGGATAAT TTGACGGATG CACCACACGG AATCGTAAAA TTTCCAGAAC ATTTTTATTA 15840

ATTGTTCAAT TAAGAAGTAA AGGTATTATC ATGCTATAAT GAGAGGTAAT TGTTTATGGA 15900 


GGTGCTAACT TGAATTATCA AGCCTTATAT CGTATGTACA GACCCCAAAG TTTCGAGGAT 15960 

GTCGTCGGAC AAGAACATGT CACGAAGACA TTGCGCAATG CGATTTCGAA AGAAAAACAG 16020 

TCGCATGCTT ATATTTTTAG TGGTCCGAGA GGTACGGGGA AAACGAGTAT TGCCAAAGTG 16080

TTTGcTAAAG CAATCAACTG TCTAAATAGC ACTGATGGAG AACCTTGTAA TGAATGTCAT 16140
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AATAATGGCG TTGATGAAAT AAGAAATATT AGAGACAAAG TTAAATATGC ACCAAGTGAA 16260

TCGAAATATA AAGTTTATAT TATAGATGAG GTGCACATGC TAACAACAGG TGCTTTTAAT 16320

GCCCTTTTAA AGACGTTAGA AGAACCTCCA GCACACGCTA TTTTTATATT GGCAACGACA 16380

GAACCACATA AAATCCCTCC AACAATCATT TCTAGGGCAC AACGTTTTGA TTTTAAAGCA 16440

ATTAGCCTAG ATCAAATTGT TGAACGTTTA AAATTTGTAG CAGATGCACA ACAAATTGAA 16500

TGTGAAGATG AAGCCTTGGC ATTTAtcgCT AAAGCGTCTG AAGGGGGTAT GCGTGATGCA 16560

TTAAGTATTA TGGATCAGGC TATTGCATTT GGTGATGGTA CGTTAACATT GCAAGATGCG 16620

TTGAATGTCA CAGGTAGCGT ACATGATGAA GCGTTGGATC ACTTGTTTGA TGATATTGTA 16680

CAAGGTGACG TACAAGCATC TTTTAAAAAA TACCATCAGT TTATAACAGA AGGTAAAGAA 16740

GTGAATCGCC TAATAAATGa TATGATTTAT TTTGTCaGAG ATACGATTAT GAATAAAACA 16800

TCTGAGAAAG ATACTGAGTA TCGAGCACTG ATGAACTTAG AATTAGATAT GTTATATCAA 16860

ATGATTGATC TTATTAATGA TACATTAGTG TCGATTCGTT TTAGTGTGAA TCAAAACGTT 16920

CATTTTGAAG TGTTGTTAGT AAAATTAGCT GAGCAGATTA AGGGTCAACC ACAAGTGATT 16980

GCGAATGTAG CTGAACCAGC ACAAATTGCT TCATCGCCAA ACACAGATGT ATTGTTGCAA 17040

CGTATGGAAC AGTTAGAGCA AGAACTAAAA ACACTAAAAG CACAAGGAGT GAGTGTCGCT 17100

CCTGTTCAAA AATCTTCGAA AAAGCCTGCG AGAGGCATAC AAAAATCTAA AAATGCATTT 17160

TCAATGCAAC AAATTGCAAA AGTGCTAGAT AAAGCGAATA AGGCAGATAT CAAATTGTTG 17220

AAAGATCATT GGCAAGAAGT GATTGATCAT GCCAAAAATA ATGATAAAAA ATCACTCGTT 17280

AGTTTATTGC AAAATTCGGA ACCTGTGGCG GCAAGTGAAG ATCACGTACT TGTGAAATTT 17340

GAGGAAGAGA TCCATTGTGA AATCGTCAAT AAAGACGACG AGAAACGTAG TAGTATAGAA 17400

AGTCTTGTAT GTAATATCGT TAATAAAAAC GTTAAAGTTG TTGGTGTACC ATCAGATCAA 17460

TGGCAAAGAG TTCGAACGGA ATATTTACAA AATCGTAAAA ACGAAGGCGA TGATATGCCA 17520

AAGCAACAAG CACAACAAAC AGATATTGCT CAAAAAGCAA AAGATCTTTT CGGTGAAGAA 17580

ACTGTACATG TGATAGATGA AGAGTGATAC ATGACAAGCG ATATAATCGT ATGTATAATG 17640

AAAGAAACAT CATTTTATTG ATAAATATTT ATTGATTTTC AAGGAGGAAA TGGAATATGC 17700

GCGGTGGCGG AAACATGCAA CAAATGATGA AACAAATGCA AAAAATGCAA AAGAAAATGG 17760

CTCAAGAACA AGAAAAACTT AAAGAAGAGC GTATTGTAGG AACAGCTGGC GGTGGCATGG 17820

TTGCAGTTAC TGTAACTGGT CATAAAGAAG TTGTCGACGT TGAAATCAAA GAAGAAGCTG 17880

TAGACCCAGA CGATATTGAA ATGCTACAAG ACTTAGTGTT AGCAGCTACT AATGAAGCGA 17940
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TCCCTGGaAT 


GTGATCATAG ATGCATTATC CAGAACCTAT ATCAAAACTT ATTGATAGCT 18060

TTATGAAATT GCCAGGCATT GGTCCAAAGA CAGCCCAACG TCTGGCTTTT CATACCTTAG 18120

ATATGAAAGA AGACGATGTT GTTCAGTTTG CCAAAGCATT AGTAGATGTT AAGAGAGAAT 18180

TAACATATTG TAGCGTATGT GGTCACATTA CTGAAAATGA TCCATGTTAT ATTTGTGAAG 18240

ATAAGCAAAG AGATCGTTCA GTTATTTGTG TTGTGGAAGA TGACAAAGAT GTCATAGCTA 18300

TGGAAAAAAT GAGAGAATAC AAAGGTTTAT ATCACGTTTT ACATGGGTCT ATTTCGCCTA 18360

TGGATGGCAT TGGACCAGAA GATATTAATA TTCCTTCATT GATTGAACGC TTGAAAAACG 18420

ATGAAGTTAG CGAATTAATC TTAGCTATGA ACCCGAACTT AGAGGGGGAA TCTACAGCCA 18480

TGTATATTTC TAGATTAGTT AAGCCTATAG GTATCAAAGT GACGAGATTA GCACAAGGGT 18540

TATCGGTAGG TGGCGATTTA GAGTATGCTG ACGAAGTAAC ATTATCTAAA GCAATCGCAG 18600

GTAGAACAGA AATGTAATJcT CTTCTATTAA ACATTTTTGA TTTTAATACT ATAGTAAGAA 18660

AAGTCACAGT GTAATCATTG TGGCTTTTTT TATGGTGTGG TGTGATGTAC TACTTTATTT 18720

GCGGTGTGGC GGTGGTATGG TTTACCTAGT TTTACTGAGG GATGGGTAAT CTTTAGGAAG 18780

CAAGCCGTTG GTTGTGATTT GTTACTTCTA ATAGTAATGA TGTGAATTGG ATTATCGAAT 18840

TAGATCTATG GTTATGGTGT GTTGGTGCTA TTAATTTGAT AAATGCGGTT AATGACTATG 18900

CAAATGAAAT TCTTTTGTAA TTGAAATGAT AGATGCTGGC TTAGTAAGTT GTACTTCTTT 18960

GGTCTAAAGC TTATTAAATC AGCCTGTATA GCGGTGTTTT GAGAGATTAT TTAAAACTTG 19020

TAAATTTATT TTTAATTTCT GGTAAAAAAA TAACGTTCTG TTTTGCGTTT TTTTTGATTG 19080

ATATGGTTAG AGAAAAATCT GTTTCTTGTT CTAAAAAACG TACTATTTAT AAGTGGGGAT 19140

TTTTTAAGTT CGATTTTTAG GATAAGGGCG TTCAGTACAG ATGACAAAGG TGTAATTTTT 19200

ACTGXTGTTA AGCAGTTTGA AAGCCTGTAT AGTATTTATT TGTTGAGGCA AACAAAACAA 19260

CTCAACTTAA GAAATAACTT GAATTACTAA CGAAAATTAA TTTTAAAAAG TTATTGACTT 19320

AAATGTTAAT AAAATGTATA ATTAATTCTT GTCGGTAAGA AAAATGAACA TTGAAAACTG 19380

AATGACAATA TGTCAACGTT AATTCCAAAA AACGTAACTA TAAGTTACAA ACATTATTTA 19440

GTATTTATGA GCTAATCAAA CATCATAATT TTTATGGAGA GTTTGATCCT GGCTCAGGAT 19500

GAACGCTGGC GGCGTGCCTA ATACATGCAA GTCGAGCGAA CGGACGAGAA GCTTGCTTCT 19560

CTGATGTTAG CGGCGGACGG GTGAGTAACA CGTGGATAAC CTACCTATAA GACTGGGATA 19620

ACTTCGGGAA ACCGkAGCTA ATACCGGATA ATATTTTGAA CCGCATGGTT CAAAAGTGAA 19680

AGACGGTCTT GCTGTCACTT ATAGATGGAT ccgcgctgca TTAGCTAGTT GGTAAGGTAA 19740
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GAGACACGGT CCAGACTCCT ACGGGAGGCA GCAGTAGGGA ATCTTCCGCA ATGGGCGAAA 19860 

gCtGaCGGAG CAACGCCGCG TGAGTGATGA AGGTCTTCGG ATCGTAAAAC TCTGTTATTA 19920 

GGGAAGAACA TATGTGTAAG TAACTGTGCA CATCTTGACG GTACCTAATC AGAAAGCCAC 19980

GGCTAACTAC GTGCCAGCAG CCGCGGTAAT ACGTAGGTGG CAAGCGTTAT CCGGAATTAT 20040 

TGGGCGTAAA GCGCGCGTAG GCGGTTTTTT AAGTCTGATG TGAAAGCCCA CGGCTCAACC 20100 

GTGGAGGGTC ATTGGAAACT GGAAAACTTG AGTGCAGAAG AGGAAAGTGG AATTCCATGT 20160

GTAGCGGTGA AATGCGCAGA GATATGGAGG AACACCAGTG GCGAAGGCGA CTTTCTGGTC 20220 

TGTAACTGAC GCTGATGTGC GAAAgCGTGG GGATCAAACA GGATTAGATA CCCTGGTAGT 20280 

CCACGCCGTA AACGATGAGT GCTAAGTGTT AGGGGGTTTC CGCCCCTTAG TGCTGCAGCT 20340 

AACGCATTAA GCACTCCGCC TGGGGAGTAC GACCGCAAGt TGAAACTCAA AGGAATTGAC 20400 

GGGGACCCGC ACAAGCGGTG GAGCATGTGG TTTAATTCGA AGCAACGCGA AGAACCTTAC 20460 

CAAATCTTGA CATCCTTTGA CAACTCTAGA GATAGAGCCT TCCCCTTCGG GGGACAAAGT 20520 

GACAGGTGGT GCATGGTTGT CGTCAGCTCG TGTCGTGAGA TGTTGGGTTA AGTCCCGCAA 20580

CGAGCGCAAC CCTTAAGCTT AGTTGCCATC ATTAAGTTGG GCACTCTAAG TTGACTGCCG 20640 

GTGACAAACC GGAGGAAGGT GGGGATGACG TCAAATCATC ATGCCCCTTA TGATTTGGGC 20700 

TACACACGTG CTACAATGGA CAATACAAAG GGCAGCGAAA CCGCGAGGTC AAGCAAATCC 20760 

CATAAAGTTG TTCTCAGTTC GGATTGTAGT CTGCAACTCG ACTACATGAA GCTGGAATCG 20820

CTAGTAATCG TAGATCAGCA TGCTACGGTG AATACGTTCC CGGGTCTTGT ACACACCGCC 20880 

CGTCACACCA CGAGAGTTTG TAACACCCGA AGCCGGTGGA GTAACCTTTT AGGAGCTAGC 20940 

CGTCGAAGGT GGGACAAATG ATTGGGGTGA AGTCGTAACA AGGTAGCCGT ATCGGAAGGT 21000

GCGQCTGGAT CACCTCCTTT CTAAGGATAT ATTCGGAACA TCTTCTTCAG AAGATGCGGA 21060 

ATAACGTGAC ATATTGTATT CAGTTTTGAA TGTTTATTTA ACATTCAAAT ATTTTTTGGT 21120 

TAAAGTGATA TTGCTTATGA AAATAAAGCA GTATGCGAGC GCTTGACTAA AAAGAAATTG 21180

TACATTGAAA ACTAGATAAG TAAGTAAAAT ATAGATTTTA CCAAGCAAAA CCGAGTGAAT 21240 

AAAGAGTTTT AAATAAGCTT GAATTCATAA GAAATAATCG CTAGTGTTCG AAAGAACACT 21300 

CACAAGATTA ATAACGCGTT TAAATCTTTT TATAAAAGAA CGTAACTTCA TGTTAACGTT 21360

TGACTTATAA AAATGGTGGA AACATAGATT AAGTTATTAA GGGCGCACGG TGGATGCCTT 21420 

GGCACTAGAA GCCGATGAAG GACGTTACTA ACGACGATAT GCTTTGGGGA GCTGTAAGTA 21480

AGCTTTGATC CAGAGATTTC CGAATGGGGA AACCCAGCAT GAGTTATGTC ATGTTATCGA 21540 
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GAGGAAGAGA AAGAAAATTC GATTCCCTTA GTAGCGGCGA GCGAAACGGG AAGAGCCCAA 2X660 

ACCAACAAGC TTGCTTGTTG GGGTTGTAGG ACACTCTATA CGGAGTTACA AAGGACGACA 21720 

TTAGACGAAT CATCTGGAAA GATGAATCAA AGAAGGTAAT AATCCTGTAG TCGAAAATGT 21780 

TGTCTCTCTT GAGTGGATCC TGAGTACGAC GGAGCACGTG AAATTCCGTC GGAATCTGGG 21840 

AGGACCATCT CCTAAGGCTA AATACTCTCT AGTGACCGAT AGTGAACCAG TACCGTGAGG 21900 

GAAAGGTGAA AAGCACCCCG GAAGGGGAGT GAAATAGAAC CTGAAACCGT GTGCTTACAA 21960 

GTAGTCAGAG CCCGTTAATG GGTGATGGCG TGCCTTTTGT AGAATGAACC GGCGAGTTAC 22020 

GATTTGATGC AAGGTTAAGC AGTAAATGTG GAGCCGTAGC GAAAGCGAGT CTGAATAGGG 22080 

CGTTTAGTAT TTGGTCGTAG ACCCGAAACC AGGTGATCTA CCCTTGGTCA GGTTGAAGTT 22140

CAGGTAACAC TGAATGGAGG ACCGAACCGA CTTACGTTGA AAAGTGAGCG GATGAACTGA 22200 

GGGTAGCGGA GAAATTCCAA TCGAACCTGG AGATAGCTGG TTCTCTCCGA AATAGCTTTA 22260 

GGGCTAGCCT CAAGTGATGA TTATTGGAGG TAGAGCACTG TTTGGACGAG GGGCCCCTCT 22320

CGGGTTACCG AATTCAGACA AACTCCGAAT GCCAATTAAT TTAACTTGGG AGTCAGAACA 22380 

TGGGTGATAA GGTCCGTGTT CGAAAGGGAA ACAGCCCAGA CCACCAGCTA AGGTCCCAAA 22440 

ATATATGTTA AGTGGAAAAG GATGTGGCGT TGCCCAGACA ACTAGGATGT TGGCTTAGAA 22500 

GCAGCCATCA TTTAAAGAGT GCGTAATAGC TCACTAGTCG AGTGACACTG CGCCGAAAAT 22560 

GTACCGGGGC TAAACATATT ACCGAAGCTG TGGATTGTCC TTTGGaCAAT GGtAGGAGAG 22620 

CGTTCTAAGG GCGTTGAAGC ATGATCGTAA GGACATGTGG AGCGCTTAGA AGTGAGAATG 22680

CCGGTGTGAG TAGCGAAAGA CGGGTGAGAA TCCCGTCCAC CGATTGACTA AGGTTTCCAG 22740 

AGGAAGGCTC GTCCGCTCTG GGTTAGTCGG GTCCTAAGCT GAGGCCGACA GcGTAGGCGA 22800 

TGGATAACAG GTTGATATTC CTGTACCACC TATAATCGTT TTAATCGATG GGGGGACGCA 22860 

tAGGATAGGC GAAgcGTGcG ATTGGATTGC ACGTCTAAGC AGTAAGGCTG AGTATTAGGC 22920 

AAATCCGGTA CTCGTTAAGG CTGAGCTGTG ATGGGGAGAA GACATTGTGT CTTCGAGTCG 22980 

TTGATTTCAC ACTGCCGAGA AAAGCCTCTA GATAGAAAAT AGGTGCCCGT ACCGCAAACC 23040

GACAGAGGTA GTCAAGATGA GAATTCTAAG GTGAGCGAGC GAACTCTCGT TAAGGAACTC 23100 

GGCAAAATGA CCCCGTAACT TCGGGAGAAG GGGTGCTCTT TAGGGTTAAC GCCCAGAAGA 23160 

GCCGCAGTGA ATAGGCCCAA GCGACTGTTT ATCAAAAACA CAGGTCTCTG CTAAACCGTA 23220 

AGGTGATGTA TagGGcTGAC GCCTGCCCGG TGCTGGAAGG TTAAGAGGAG TGGTTAGcTT 23280 

CTGCGAAgCT ACGAATCGAA GCCCCAGTAA ACGGCGGCCG TAACTATAAC GGTCCTAAGG 23340 
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TGTCTCAACG AGAGACTCGG TGAAATCATA GTACCTGTGA AGATGGAGGT TACCCGCGAC 23460 

AGGACGGAAA GACCCCGTGG AGCTTTACTG TAGCCTGATA TTGAAATTCG GCACAGCTTG 23520 

TACAGGATAG GTAGGAGCCT TTGAAACGTG AGCGCTAGCT TACGTGGAGG CGCTGGTGGG 23580

ATACTACCCT AGCTGTGTTG GCTTTCTAAC CCGCACCACT TATCGTGGTG GGAGACAGTG 23640

TCAGGCGGGC AGTTTGACTG GGGCGGTCGC CTCCTAAAAG GTAACGGAGG CGCTCAAAGG 23700 

TTCCCTCAGA ATGGTTGGAA ATCATTCATA GAGTGTAAAG GCATAAGGGA GCTTGACTGC 23760 

GAGACCTACA AGTCGAGCAG GGTCGAAAGA CGGACTTAGT GATCCGGTGG TTCCGCATGG 23820

AAGGGCCATC GCTCAACGGA TAAAAGCTAC CCCGGGGATA ACAGGCTTAT CTCCCCCAAG 23880 

AGTTCACATC GACGGGGAGG TTTGGCACCT CGATGTCGGC TCATCGCATC CTGGGGCTGT 23940 

AGTCGGTCCC AAGGGTTGGg CTGTTCGCCC ATTAAAGCGG TACGCGAGCT GGGTTCAGAA 24000 

CGTCGTGAGA CAGTTCGGTC CCTATCCGTC GTGGGCGTAG GAAATTTGAG AGGAGCTGTC 24060 

CTTAGTACGA GAGGACCGGG ATGGACATAC CTCTGGTGTA CCAGTTGTCG TGCCAACGGC 24120 

ATAGCTGGGT AGCTATGTGT GGACGGGATA AGTGCTGAAA GCATCTAAGC ATGAAGCCCC 24180 

CCTCAAGATG AGATTTCCCA ACTTCGGTTA TAAGATCCCT CAAAGATGAT GAGGTTAATA 24240

GGTTCGAGGT GGAAGCATGG TGACATGTGG AGCTGACGAA TACTAATCGA TCGAAGACTT 24300 

AATCAAAATA AATGTTTTGC GAAGCAAAAT CACTTTTACT TACTATCTAG TTTTGAATGT 24360

ATAAATTACA TTCATATGTC TGGTGACTAT AGCAAGGAGG TCACACCTGT TCCCATGCCG 24420

AACACAGAAG TTAAGCTCCT TAGCGTCGAT GGTAGTcGAA CTTACGTTCC GCTAGAGTAG 24480

AACGTTGCCA GGCAAAAAAT GGATGCGATG AGCCGCATTG AGACCGCAAG GTCT C TTTT T 24540

TTTATGTCTA AAACGTCAAA ATAAAAAGCA AACACAAAGA AAAATGGCTT GGCGAAGTGA 24600

AAACGTTTGA ATCTGACGAA ACGAGAAAAG ArCGCAACGA GTTTAGTAGA GCTAAATGAG 24660 

TAAGyGAGAG CCGAAGrAGA GGAAAGAAGC AAGCGATTGT CACAAGTCAA GAAAGGTTCT 24720 

TAGCGAsGAT GGTAGCCAAC TTACGTTCCG CTAGAGTAGA ACTGGAAATG ATAATTTAAT 24780

AATGTACACT TTCGATTGTC TAAGTATGTA CAACTTTAAT TTTGTGTTTA TATAAATTTA 24840

AAATGATATC ATCGAAAACA AAATATTGTA TAAATAGAGA AGAGCAGTAA GACGGTATCT 24900

AATTGAAAAT GATCTTACTG CTCTTTTATA TACTTTATTG AAATACAAAA AGGAAATTAA 24960

TTATTATACA ATAGACAAGC TATTGCATAA GTAACACTAA CTTTTATCAA AGAAGTGTTA 25020

CTTTATAATT AATGATTTTA TTAGAGCGTC TACATGCGGT TTTAAAGCAT CATCGTCTAT 25080 

ACCGCCAAAG CCTAATATAA ATTTAGGGGT TTTCTTATAG TCTTGATCAT CATCAAAATT 25140 
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TCCATTTTTT ACTGTAATTG TAAAATGCAT ACCCGTTTCA GCACCTTGAA TATCAAGCTG 25260 

CTCTTTGTAA GGTTTCAATC TTTTTAAAAT ATAGGTTAGT TTTCTACGAT AAATTCGTCT 25320 

CATTTTATTT AAATGCCTTT CAAAACCACC GGAAGATATA AACGTTGCAA TAAGGTTTTG 25380

CATATGAACA GGTACAGTGT TGCCTTCAAT GTGATTTTGA GAATGATATT TTTTCATTAT 25440

AGAATAGGGT AACACCATAT ATGCAACTCG ACAGCTAGGA AAAATAGACT TTGAAAATGT 25500 

ACTGATATAA ATCACTTTTT CTCCTCTTGA ATATAGACCT TGAATTGCTG GAATGGGTTT 25560 

GCCGAAATAT CTAAACTCGG AATCATAATC ATCTTCTATA ATAAATCGTT CTTCTTTTTC 25620

TTGAGCCCAT TGTATTAATT GAGTTCGTTT TTTTAAGTCC ATCACATATC CAGTTGGAAA 25680

TTGATGGGAA GGCGTTATAT ATACTATATT TTTTTGTGAT TTAATAACTT CATCTACGTT 25740 

TATTCCATTA TCTTCAACTT CAATTTGTTC ATATTCAACT TGTTTTTTAT CTAAAATATT 25800 

TTTGATTGGT GGATAACTAG GTTTTTCGAT AATAAATGTT GAAGTATAAA GTAAATCGAC 25860 

TAATTGATTT ACTAATTGTT CGGTAGATGA GCCAATTATA ATTTGATTAG GATCACAAAT 25920 

TACGCCACGA TTAGTAAATA AATAAAATGC CAGTTGAAAC CGCAAATGTA ATTCTCCTTG 25980 
AAAATGTCCT CTACGTAATT GATTTAAATG ATTTGTATCA TAAAGATCTT TGGAATACTT 26040
TCTGAAAAGT TCTATAGGGA AATGTTTCGT ATCTATTTCA TCCAAATTAA AAGCATAATC 26100

ATAAGCTTCA TCACTCGCTT TTGGTTTATA TGAATCATCA TCAAAAAGAG AGGGGATAGG 26160 

TTGATTGTTT AAAATTGTTA AAGATTCAAT TTCGGACACA AAATATCCAG AGCGAGGTCT 26220 

TGAATAAATG TAACCTTCGT CTAATAGAAG TTGATATGCA TGCTCTACGG TTGTTTGGCT 26280 
AATAGATAAA TGTTTGCTTA ATTGTCTTTT AGAATAAAAT TTATCGCCTT CTTTAAATTG 26340

ACCTTCAATT ATTTGTTTTT TTAATTTTTC ATAAAGTTGA TGGTATAAAG TGTTTTTCAA 26400 

TTTIJVTAACT GACCTCCTAA ATTTATCTTA TTTTGTACCT TTTTAAATAT CAGTTTATAC 26460 

ATTACAATGT ATTTAATCAA CTTGAAAAGG GGTTTTATGT ATAATGAGTA AAATTATTGG 26520 

ATCAGACAGA GTCAAAAGAG GTATGGCTGA AATGCAAAAA GGCGGCGTTA TTATGGATGT 26580 

CGTTAATGCT GAGCAAGCAA GAATTGCAGA AGAAGCTGGC GCGGTAgCAG TTATGGCATT 26640 

AGAACGAGTA CCTTCTGATA TTAGAGCTGC TGGTGGTGTT GCACGTATGG CAAACCCTAA 26700 

AATTGTAGAA GAAGTAATGA ATGCTGTTTC TATTCCAGTC ATGGCTAAAG CACGTATTGG 26760 

TCATATCACT GAAGCAAGAG TATTAGAGGC GATGGGTGTT GACTATATTG ATGAATCAGA 26820 

AGTGTTAACA CCAGCAGATG AGGAATATCA CTTAAGAAAA GATCAATTTA CAGTACCATT 26880 

TGTATGTGGA TGTCGTAATT TAGGTGAAgm TGCGCGTAGA A1TGGTGAAG GTGCTGCTAT 26940 
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ACAAGTTAAT TCAGAAGTTA GTCGATTGAC TGTAATGAAT GATGATGAGA TTATGACTTT 27060 

TGCGAAAGAT ATCGGTGCGC CTTATGAAAT TTTAAAACAA ATTAAAGACA ATGGTCGTTT 27120 

ACCGGTAGTT AACTTTGCAG CTGGTGGCGT TGCGACTCCT CAAGATGCTG CTTTAATGAT 27180 

GGAATTAGGT GCTGACGGTG TATTCGTTGG ATCAGGTATT TT7AAATCAG AAGATCCAGA 27240 

AAAATTTGCT AAAGCAATTG TTCAAGCAAC AACACATTAC CAAGACTATG AACTAATTGG 27300 

AAGATTAGCA AGTGAACTTG GCACTGCTAT GAAAGGTTTA GATATCAATC AATTATCATT 27360 

AGAAGAACGT ATGCAAGAGC GTGGTTGGTA AGATATGAAA ATAGGTGTAT TAGCATTACA 27420 

AGGTGCAGTA CGTGAACATA TTAGACATAT TGAATTAAGT GGTCATGAAG GTATTGCAGT 27480 

TAAAAAAGTT GAACAATTAG AAGAAATCGA GGGCTTAATA TTACCTGGTG GCGAGTCTAC 27540 

AACGTTACGT CGATTAATGA ATTTATATGG ATTTAAAGAG GCTTTACAAA ATTCAACTTT 27600 

ACCTATGTTT GGTACATGCG CAGGATTAAT AGTTCTAGCG CAAGATATAG TTGGTGAAGA 27660 

AGGATACCTT AACAAGTTGA ATATTACTGT ACAACGAAAC TCATTCGGTA GACAAGTTGA 27720 

CAGCTTTGAA ACAGAATTAG ATATTAAAGG TATCGCTACA GATATTGAAG GTGTCTTTAT 27780 

AAGAGCCCCA CATATTGAAA AAGTAGGTCA AGGCGTAGAT ATCCTATGTA AGGTTAATGA 27840

GAAAATTGTA GCTGTTCAGC AAGGTAAATA TTTAGGCGTA TCATTCCATC CTGAATTAAC 27900 

AGATGACTAT AGAGTAACTG ATTACTTTAT TAATCATATT GTAAAaAAAG CATAGCTTAA 27960 

TGTATGCTAA ATCAACGAAT TATTGATATT TATAGATTTG TTGAGAAGAA AATATCTCCT 28020

TCAAACTTAG CTTTGGAGGA GTTATTTTTT ATGTCAAAAT TAAAAATGAT AAAAAATAAA 28080 

GCTATACATA AGAAAAAAAC CCTTCAAAGA GACTGAGAAT AGTCAAAATT TTGAAGGGGT 28140 

TAATTCGATG TTGATGTATT TGTTAAATAA AGAATCcAGC GATTGCAGCT GAAATGAAAG 28200 

ATACTAGTGT tGCACCGAAT AATAATTTCA AACCAAAGCG GGCAACTGTA TCTCCTTTTT 28260

TGTCATTAAG TGATTTAATC GCACCTGAAA TAATACCGAT AGAGCTAAAG TTAGCAAATG 28320 

ATACTAAGAA TACAGATGTA ACACCTTTTG CGTGTTCAGA TAAATCACTA AGTTTACCAA 28380 

GTGCTTGCAT TGCTACAAAT TCGTTAGATA ATAGTTTTGT CGCCATAACT GAACCGGCTT 28440 

GAACTGCATC TTGCCATGGC ACACCGACTA AGAATGCAAA TGGTGCAAAG ACAAAACCAA 28500 

TTAATGTTTG GAAATCCCAA GAAATAGCGC CACCTGAAAC TGTACTAAAG ATATTGCTTA 28560 

CAATTCCATT TAATAGAGCG ATAATGGCAA TGTATCCGAT TAACATTGCG CCTACAATGA 28620 

CAGCTACTTT AAATCCATCT AAAATATATT CTCCTAGCAT TTCGAAGAAT GATTGTTGTC 28680

TTTCTTCAGT TTCTTCAACT AATAATTTGT CATCTTCTTC ATTAACTTTA TAAGGGTTAA 28740 
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TAGGTTCAAT TAAGGTAAAG TATGCACCGA TAATTGAAGC AGAAACAGTC GACATTGCTG 28860 

AAGCTGTTAA TGTGTATAAA CGTTGCTTAG GTATGTATGG TAATTGTTTT TTAATTGAAA 28920

TAAATACTTC AGATTGTCCC AAAATTGCTG CAGCAACTGC ATTGTATGAT TCTAAACGTC 28980

CCATACCATT AATTTTAGAA ATTAAGAATC CTAAAACATT AATGATTAAA GGTAAAATCT 29040 

TTGTGTATTG AAGGATACCG ATAATCGCTG AAATAAATAC GATAGGTAAT AATACACTGA 29100 

AGAAGAATGG TGGTTGCTTA GGATCGATAT ATTGAATACC ACCGAATACA AAGTTAACAC 29160 

CATCTGCTGC TTTTAATAAT AAGTAGTTAA AACCGTTTGA AATACCACCA ATAACCTTGA 29220 

TTCCCATTGT AGTTTTAAGC AAGATAAATG CAAAGATAAG CTGAATTGCA AGTAAAATTC 29280

CTACATATTT CCAGCGAATA TTTTTCCTGT CTGAGCTAAA TAGAAACGCA AGTGCTAAAA 29340 

AGAAGATAAT TCCGATAATC CCAATTAGAA TATGCATATA TTTCTCATTC CTTTAGTTTT 29400

TTCTACaATc TATCATACAA TAAAATGGAA GGGCTAACAT CATAAATTTT TGAAAATATA 29460 

AAAACAAATT AATTGAAAAA GGTCAAAATA GGTCATATAA TATAGTCAAA GAAGGTCAAA 29520 

AAGGGGTGAT ATACATGCAC AATATGTCTG ACATCATAGA ACAATAaTCA AACGTTTATT 29580 

TGAAGAGTCG AATGAAGATG TCGTTGAAAT TCAGAGAGCG AATATCGCAC AGCGTTTTGA 29640 

TTGCGTACCA TCACAATTAA ATTATGTAAT CAAAACACGA TTCACTAATG AACATGGTTA 29700 

TGAAATCGAA AGTAAACGTG GTGGTGGTGG TTACATCCGA ATCACTAAAA TTGAAAATAA 29760 

AGATGCAACA GGTTATATTA ATCATTTGCT TCAGCTGATT GGACCTTCTA TTTCTCAACA 29820 

ACAAGCTTAT TATATTATTG ATGGGCTTTT AGATAAAATG TTAATAAATG AACGTGAAGC 29880 

TAAAATGATT CAAGCAGTTA TTGATAGAGA AACGCTATCA ATGGATATGG TTTCTAGAGA 29940 

TATTATTAGA GCAAATATTT TAAAACGTTT GTTACCAGTT ATAAATTATT ACTAAATGAA 30000 

ATGAGGTGTT GAAGTGCTTT GTGAAAATTG TCAACTTAAT GAAGCGGAAT TAAAAGTTAA 30060 

AGTTACAAGT AAAAATAAAA CAGAAGAAAA AATGGTGTGT CAAACTTGTG CTGAGGGGCA 30120 

CCATCCGTGG AATCAAGCTA ATGAACAACC TGAaTATCAA GAACATCAAG ATAATTTCGA 30180

AGAAGCATTT GTTGTTAAGC AAATTTTACA ACATTTAGCT ACGAAACATG GAATTAATTT 30240 

TCAAGA 30246 
(2) INFORMATION FOR SEQ ID NO: 57: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 57: 

TATTCCCCCA TCGGTTTATT AAATCGTCCA TTTCAATACT GTTTTTCCCC AAGATGTCGA 60 
TAAATCCATT TCAAACGCTT GGACGATATC TTGCATCGTA CATACATTAA TTTCATGTCC 120

TTTTAATAAT GCTAACTTTT CAACTATGTC TGGGTACTTA CGATATAAAT CAACAACTTG 180 

CTCAAAATCT TTAGAGCCGC TTCGACTACT ACCAATCAAC GTTAATCCTT TTTCAAGTAC 240 

TAATCGTGTA TTCACTTCCA CGGGTAATTC ACTTACGCCT AACAAAGCAA TACTGCCTTC 300 

TGGTGAAATA TGTTCAACTA TTTGTTGAAG TGCAACTTGA CTTCCTTTAC CTCCAACACA 360 

TTCAAATGCA TGATCAATTT TAAGATCATC TGGTATTTGA TTTACTGTAA AGATGTCATC 420 

TACAAATGAA AAATGACTTA ATTTATAGTC TGTCTTACCA AATACATAAG TTTTAGCTTC 480

TGGGTACAAC TTACGTAGCA AAATAGCAGT AATATAACCT AAGTTACCAT CACCCCAAAT 540 

ACCAAAGCTG GTTTTCAAAG GTATAGATTT ACGTTCAAAT CGTTGTATAG CATGATAACT 600

TACTGACACT AACTCTGTGT ATGAAATCGT ACTCAAATCA ATGTCATTAG GCAGCGGAAC 660 

GATACGATCA TGTGCCATCA CAACGTAGTC TTGCATAAAA CCATCATAAC CACTAGATCT 720 

AAAATAACTA GAGGCTAAGT AATTCTCCGC AATAATATGA TGTTGCTCTG TAGGTGTATT 780 

CGGTACCATT ACTACTTTCG TACCTTTTTC AAATACCCCT TTACTATCAA ATACAACTTC 840

ACCAACAGCT TCATGAACTA ATGACATTGG TAATTTTTTG CGTAGTACAT TTTCATCTCT 900

TCGACCTGTG TAATACCTTT GATCAGCTGC ACAAATAGAC AAGTATAAAG GTCTTACGAT 960 

GACATGATTA CCATAAATAT CAACATTATT ATATGTGACG TCGAACTGTC TCGGTGCAAC 1020 

GAGTTGATAT ACTTGATTAA TCATCGGCAA TATCACCTTG AATAATGGCA TTTGCTACTT 1080 

TTAAATCATA CGGTGTTGTC ACTTTAATGT TGTATAGTTC TCCaCGTACC AATTTAACTG 1140 

CAT£TCCAGA TTCGACAATG ATTTTACATG CATCTGATAA GATTTCTTTT TGTTCACTAC 1200 

TTAAGGCGCG ATAACTATCT TGTAATAATT TAATATTAAA TGATTGTGGT GTTTGGCCTT 1260

GATACATTTC ATTCCTTACA GGGATACTGT GTATGTTCTG TTTATCTTTA GACATTACAA 1320 

TCGTATCAAT TGCTTCAATG ACTGTATCTA CTGCACCATA TTTTGCTGCT ACTTCAATGT 1380 

TCTCTTTAAT AATACGTTGA GTTAAAAATG GTCTTACGGC ATCATGAGTT ACAATCACAT 1440

CATCATTATT AATTCCATTT ACATTGCGAA TATGGTCGAT AATGTTCATA ATTGTTTCAT 1500 

TTCGATCCGT ACCACCTGCA ACTACTTTGA CACGTTGATC TGTAATGTTA TATTTTTTTA 1560 

AAATATCCTG TGTATGGGAA ATCCACTGTG CTGGCGTTGC GATAATAATC TCATTAAATT 1620 

CACTCACTAA AATGAACTTC TCAATTGTAT GGATTAAAAT CGGTTTATTA TCAATATCTA 1680 
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CTGCATAAAT CATGTTGTCC TCCATTCTGT CATTACATCA TTTCCATTTA TACATTACTG 1800 

ACCTATGCCC GCACATAAGC CTAACCTATT GCTCACTTGC CTCTTTTATT AATCCAAAGA 1860

TAGTTGTCAC AATAGTGTGA TAATTTTTTA TAAAAATGTA TTTTTGTAAC TGACCATTCT 1920 

AAGTTGTTTT GCCATGCAGT TAATCATTAA CTCTGACGAT ATTAAATTGT TAAAGGTATT 1980 

AATGTTTACT CTTTTTCAAA TTCATTATTA CTGCCATCAT TTTACCATAT ATTATAATAA 2040

ATTTATCTTA TTAAGTGGCT GTACTTGATT TTCACTTTAA AAATTATCAA ATATTGCCAT 2100 

CTCATTTTAA GTATACAAAA TGCAAAACAA CCGATTCACA AGCATATTTC ACACAAGTAA 2160 

ACCGGCTATT TATCAACGTA TATTCGAAGA TGAATTATTT CGATAGTATC TATAGACCAG 2220

ACGGCATTCG CACTTTCATA GCTATAACTA TACCAGCGTT TTCGTCCTCA AAGGTGCATA 2280 

CTAATAAATC GTAAACATGA CTTTATCAAA TCGTTCTTTC TTGTTAACTA ATTTATCAAA 2340 

TGTCTCCGGG CCTTTTTCTA ACGGTAAAAA ATGAGAAATA ATAGGCTTTA CATTAATATC 2400

TTTCGTCTTC ATATAATGTA AGGTTGCCGT CCACTCTTTG CCCGGAAAAT TACTGGACAA 2460 

ACAGTTCCAA GAGCCACATA CTGTCAACTC GTTACGCAGA ATTTTTTCAA AATGAACGCG 2520 

ATCAATCTCA ATATCATCAT ATGGTATTCC GAGTAATACC ACCTGGCCAC CTTTTTTAGG 2580 

TAGCGTCAAT ATTTGACCAA TCGTAACTTT AGCACCTGAT GATTCTATAG CTAAATCGAT 2640 

TTGATTGGCG TAATGATTTT CGATGAATTT CTCAAGATTT TCTTCTTTTG AATTGATTGT 2700 

TTGATGTGCG CCCAATGATG TTGCAATATC TAGTTTATGC GCATCTATAT CTATAGCGAT 2760 

GATATGTGCA GCACCAAATA TTCGTGCCCA TTGAATAGCT AACAAACCTA TACTGCCACA 2820 

CCCCATTACT GCAACAGTCA TACCAGGTTG TATATTCGAT TTATAAAACC CATGCGCAAC 2880 

AACGGCTGAT GGCTCAACCA TTGCTGCTTC AATGTAATCA ACATTGTCTG GAACCTTTAA 2940 

AACATTTTGC GCTGGCAATT tgacatattc CGCGAACGAT CCAGGTTCAT ATGAGCCAAT 3000 

GACGAATAAC TTTTCACATC GTGCATATTC ACCTTTTAAA CAATACTCGC ATTGATAACA 3060

AGGTATTGCT GGGCAACCTG TCACTTTGTC GCCCACATTA ACATGCGTAA CATCACTTCC 3120

AATGGCATCT ACTACACCTG AAAATTCATG ACCAAATGGC ATACCTTTAA TGTATGGCCC 3180

CATTTTTTTG TATCGTGACG TGTCTGAACC ACATATX5CCA GTCGCTCGTA CTTTAATAAT 3240

AACGTCATTC GCACTTTCAA TGACTGGCTT TTCATTATCC TCATACCGTA AATCTTCCAC 3300 

GCCATATAAT TTCAATGCTT TCACTTGTAA ATCACCTCAA ATTTGATTTA ATTCACAACT 3360 

TTTTTCTTTT TAAAAATACC TGTCGCAAAA TAACCTGCAA TGACAATGGA ATTACTTACG 3420 

AGTAAATGTT CCATATAAAA ATCAGTGATT TGTCTTAATG GCCCAAGCAT AAAAGTTAGC 3480 
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TGCTTTAATA CCTTCGCCGG ATTTTAAATG TTGATACGCC TCGTCCCATT TCGAAATATC 3600 

ATATATTTTT GTCACCAAAG CTTCAGCATT TACTAAACCA TCCGCCATAA GTTGCAATGA 3660 

AGGTTCCCAA TCTGCTGGCT TTTGACTTCT ACTACCAACA ACTGTTATTT CTTTTTGAAT 3720

CACTTTTTCC ATATCAAATG GAATTTCAGC ATCCTTAAAA ATACCTATTT GACTGTAGAA 3780 

ACCTTTTTTG CGTAAAATAT CCAAACCTTG TCGTGCTGCT GGAACTGCAC CTGAACATTC 3840 

AACAACAACA TCTGCACCGT AACCGTCTGT AATTCCATTG ATATACGTTT TTAAGTCTGT 3900 

TTGTTGTAAA TTGACTACAT AATCCATGTG CAATGCTTCT GCTTTATCTA ATCTGACTTT 3960 

GTCATTGTCC AATCCAGTTA CCACAACAGT TGCGCCTTTA CTTTTTAACA CTTGTGCTAC 4020 

AAGTAATCCG ATTGGCCCAG GTCCCATTAC AACTGCTACA TCGCCTGAAT TGACTTGAAT 4080 

CTTAGAAACG CCATGATGTG CACATGCTAA TGGTTCTGTC ATAGCTGCAG ACTGATACGA 4140 
TAtTCGTCTG GAATATGATG CAAACTTTCT TCACGTGCAA TGACATAATT AGTAAATGCG 4200

CCATCAACTT GTGTTCCAAT ACCTTTTCGA TGGTTGCATA AATTATAGTC TTTTGATTTA 4260 

CAGTATTCAC ACTCATTACA AACATAGAAT GTCGTTTCAG aTGtGACACG GTCACCAACT 4320 

TTAAAATCTT TAACGTCTGC TCCAACTTCA ACGATTTCAC CAGAAAATTC ATGACCTAAT 4380

GTCACTGGAA AATTAACTTT ATAATGACCT TCATAAGTAT GAATATCTGT GCCACAAATT 4440 

CCTGCATAAT GTACTTTAAT CTTTACTTTA TCATCTAGCG GTGTTGCAAC TTCTTTATCA 4500 

AGAAGTTCTA AGTTGCCATG TCCTTCTCTT GTTTTTACTA AAGCTTTCAC CACAAACACC 4560 

TCGATTTTTA ATTGAATAGA CTAAATAGTT TAAAGATAAG ATAGTTAACG ATATTACCAC 4620 

CTTGATCAAT ACTTGAAATT TCAGATGAAC CTTTTGGCAT TTGTACATTC GTACGTTTCG 4680 

CCATATCTGT GAAAATGGGT GCTACGTCTG TTGCAATATA TAGTGAAATT GCAATCATAA 4740 

TCGTACCCAC AATGACAGAA TGAATAATGT TTCCTCTTGC TGCACCAACA ATAAACGCGA 4800

CAACAAATGG TATCGTTGCT AAGTCACCAA AAGGTAGTAC TTGGTTTCCT GGTAAAATAA 4860 

CGGCTAATAA AACAGTGATA GGTACTAAAA TTAATGCTGT CGAAATAACT GCTGGATGAC 4920

CTAATGCTAC AGCCGCATCC AATCCAATAT AAATTTCACG TTCGCCAAAA CGTTTATTTA 4980

GCCATGTTCT TGCAGACTCT GAAACTGGCA TTAAACCTTC CATTAAGATT TTTACCATTC 5040

TAGGCATTAA TACCATTACT GCAGCCATTG ACATTCCTAA ATTAATGATG TCTCCAGGTT 5100 

TGTAACCTGC TAACACACCA ATACCTAAAC CTAAAATTAA GCCGACAAAT ATAGACTCTC 5160 

CAAATGCGCC AAAACGTTTT TGAATTGTTT CAGGATCAGC ATCTAACTTA TTCAGACCGG 5220 

GTACTTTTTG TAACAATTTA ACTAAGTAAA TACCTGGTGC ATAAGAAATT GTACTTCCTG 5280 
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CTACTTTCAA ACAGATAATT TGGAAAATAA CTGCTGCTAA TAACGCTTGC CAAATACTGC 5400 

CTGATACGGC ATAAACCATT GCTGCTGTAA ACGTATAATG CCAAAAATTC CAAATATCTA 5460 

CATTCATCGT CTTTGTCACT TTAGTTACTA GCAATACAAC GTTAACTATG ATTCCGAGTG 5520 

GAATAATAAA TGCTGCGACA GATGATGCCC AAGCGATAGA TGATGTTGCT GGCCAACCTA 5580 

CATCAATCAC ATTCAGACTG ACGCCTAAAT TTTTAACCAT CGCTTGTGCT GCTGGCCCTA 5640 

AATTTTTAAC TAATAAATCG ATGACTAAGA AAATCCCTAC AAAAGCCACA CCTATTGTTA 5700 

AACCAGACCT AAATGCCGCT CCAATTTTCT GCCTAAAGAA TAGGCCAAGC AAGAATATGA 5760 

CAACCGGTAA AATAACAGTt GCACCTAAAT CTAAAAATCC CCTTACAAAA TCAGTGAAGT 5820

AACTCATATT TAAACCCTCC CTGTTATATA TGCATTGTCA CGATACTTTC CGATTGTGAT 5880 

TACATTTGAC GTTACAGTCA TTTCAACGAC AACCCTTGCT AAATTCGACT GCAGTCCTTT 5940

TGAATTACAG tCACTGCGTT TCTATGTCAT CAACAATCAT TTGTCGTGAT AGTCATTTAT 6000

ATGCAATTTG CATATATTAA TATGTTATCG ACCCACGTTA CATATCAATT CCGTTATTTT 6060

TGTAACTCTG TTAAGATTTG TTGTTTTGTT TCTTCAATAC CAATACCAGT TAAGAAATTA 6120 

CGTGCGTTGA TAACTGGGAA TTTATATTCT TTTTTTGTCA TTGCAGTTGT AACTAATAAA 6180 

TCTGCAGTGT CTTCATAAGG TCCAACTTCT GTAATTTTGA TTTGTTTAAT ATCTACTTTA 6240

ATATTGTGTT CCTTTGCCAT TTCTTCAATT GCATTATTTA CTACTGTTGA CGTTGCAATA 6300 

CCTGCACCAC ACGCTACTAA TACTTGTTTC ATTTTCAATT CCTCCAATTA ATTTTTAGTT 6360 

ATATTCCAAA TAATCATTGA TTAGTGTTGC TAAAATTGTT TCATCTTTCG TTCGTAGAAT 6420

CTGCTCCAAT TTTTCTTCAC TTTGAAAAAT TTGCATCAAC TGTTGTAACA GCTTAAGTTG 6480 

ATCATCTACT TTATCCATTG CTAACATAAA AACGATTTTC ACTTCTGTCT GTTGATCAAG 6540 

TGTXCCCATT TCAATAAACG GCACTTCTTT TTCTAGAACA GCCACACCTA TCGTTCTATG 6600 

GTT/&TATGT TCGACATCTG TATGCGGTAT AGCGACCGAA CATAGATGCG TTGGTAAACC 6660

AGTAGCAAAT TCTTTTTCTC TGTCGATGAC TGCATCTTTA AACGTTGACT TCACGAACCC 6720 

ATTTTGAAAT AACACATCTG ACATTTGTGA CAATACGGAT TCTTTATCAG TTGCCGACAA 6780 

ATTGAGCATT ATATTTTCTT TATGCACTAA TTGCTGTCCC ATCCATTTTC CCTCGCTTCT 6840

TTATTTGAAT AATTTTTTAA AATCTCATTT ACATCAGAAT TTTTGCGACT TTGTATGATG 6900 

CGCTTAATTG CGTCATTGTC TTGCGCCACA TCTCTCAATT GTAGTAACGC TCTTAAGTGT 6960 

GTCACTTTAT CAACAGCAGC AATAGGTACA ATAATATGGA TTGCTGTGCC ATCTGACATG 7020 

TATATTGGTT CTTGTAATAT CAACATACTC ATCGCTGTTT TATGTACATG CTTTTCAGAG 7080 
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TGCATCTCAT GAATATATTT AATATCAATA AAATGATTAG CAACTAACAC ATCACTTGCT 7200 

TTAGCAATAG CTTCATCAAT ATTTTCAACA TGATGCATTC TTTTCACGTG CCTTGCCGGT 7260 

ATCAAGTCAG CTAAATCTAA TGyCTwATTT tGTGtGACaA TCGATCCATT AATGGTTGAA 7320 

ATTGAATTAT AATTGGCAAT AAAATCTTCT AAACCATCAC GTAGTcTGTA ATGTCATTAA 7380

CTGTCGTTGT GCGTTCAATT AATGCCATTA ACTTGTTTAT TTCCTTATCA ATGTCAGCCG 7440 

ATTCCTTATT AATGTACTTC ATCACTTCTT TACGTAACTT TCGTTGCTCA TTTTCAGATA 7500 

AAGCTACTTT TGTGATAAAT AATTTTTTAT GTGTTAGGAC AAACATTGGT GAAAAGACGA 7560 

TGTCATAATC TAATGTGTAA TTTTCAAATG TTCTAAGTGA AATCGCATCT AAGAAAATAA 7620

TTTCTGGAAA TAAGTTTCOC AACTCGTATA ACATCATTTG TGATACTGAC GTGCCTTGTG 7680 

TACACACGAT AATAGCTTTT ATCTTGCCAT CGAAGTTTTC ATCTTGACGT CTCAAACTAC 7740 

CTCCGAACAA CATGGTTAAA TATGCTATTT CATTATCAGG CAACGATTTT CCGAAATATT 7800

CAGTTAACGA TTGACATGAT TGTTTCACCA TATGAAATAA GGATTGATAA TTTCCTTGTA 7860

AAGGATTTAT TAATTCATCA CGATCCGTTA AGTTATATTT AATCCTATAA AAAGCAGGCG 7920

TTAAATGTAA CAAGAGTTGC TGTGATAATT TCTCCTTATC TTCAATGTTA ATAAAAGTGA 7980 

TTTGTTCAAA ATGGTGAATC ATTTGAGCGA TGGCCATCGT TAAATTCGAT ATGCTATCTG 8040 

ATTCTTGCAA ATCAGTCCAT TGCACACTTG TTGAAAGTAA GTGTAATGTC AAATATAACT 8100

TTTCCGCTTC TGGCAAATCC GGCTCATGTT GCGTCATAAT CTCCGTTGCT TGATATTCTT 8160 

TCGTATCCCT CAAATACTGA TAATTAATAT TTAATGGATT CATCACATGA CCACTTTGAA 8220 

TTCGTCTACG AATCACACAA AGGACATAAG GCAATGAACT AAGTGATTTG TCTATAAAGC 8280 

GACTCTTCAA AAATTGTTCT ACCTGTTTGA TCTTGTCTTT TTGATATGCG ATATCTTCGA 8340 

ATG*&AAGTT GAGCGCCTTT AAAACTTCAC TTTTAGTAAT ATCATGATTC AACCTTTGAT 8400

CAATCAACTT AATGAAGAAA CGGCGAACTT CAAATTCATC ACCAACAATT TCATAACCAT 8460

GTTTTCGAGA ATACTTAAGT GACAAACCAT GATTTTCCAA TTGCTCTTTC ACATGATTTA 8520 

TATCGTGAAT GACAGTATTT TTACTGACTT GTAAATCAAT TGAAAAATGG TTTAGAGACA 8580 

TTGCGTTTTC CTTACTAAAA AGCATGAGCA TTAAATAATA ACGACGTGTT TCTATGCTAA 8640

AAATGACATT GTTGCCGTTT AACATTTGCT GCTCCGATAC ATCTCGCTTG AATAACGTCA 8700 

TGATTTCAGA ACTTACAATA AAATTTCCTT GGCTTGTTCT TTCAAGTTTT GGATAACCCT 8760 

CTTGTTCAAG CCACAAATTG ATTTTTTGAA TGCGATATCC TAGTTGTCTA CGAGACAAAC 8820 

CAAATATCGA TTCAAGTTCT TTACCATGAA TAGTAGGATT CAATACAATT TCTCTGAGTA 8880 
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TCAATCGTCA CACCGATGTA CACACTTTGA ACACATATTT TCAAAATGAG CATGTACATC 9000 

ATTGTGATGT TTTAACAACA TTTCAATTAT ATCTATATTT TTTGTGATTT TAATCTTTTA 9060 

AAATAAAGCA ATTGAAATTT TTGCATATAT TTTTGTGTTT TGTGTTTTTT TGAAGCATTT 9120

TTAACATACA TATCTCAATC ATTATCAAAT TGTCATGACC ATTGTAACCC AATACAAAAA 9180 

CCCTAAGGAC GCTTATATCA GGCGCCTTAG GGTTAACTGT ATCTATTTAA TTAAGTATTA 9240 

TTATTCGTAT GTACGTAACT TATGGTCTAT CAAGTTCCAC ACTTCTTCAA CATCAACTGC 9300 
TGTAGCAAAA TAAGCATTGG CAGGCTTACC TGTAACATGA TTTAAATCGA CAGCCATAGT 9360 

GCCATAAGTT AGTGGACTTT GATGTTCAAT GTCGATATTA ACGGGTACCA TTGTAAACAA 9420
TTCTGGTTGT AACAAATACA AAATTGTACA AGCATCATGT ATTGGACCAC CATCCATATT 9480

AAAGTGAGTC TTGTATGTCT TCTTAAAGAA TTGCAATAAT TCTACGACGA ACTGTGCAAC 9540 

AGGATTATTG ATACTTTCAA AGCGTTCAAT CACGTGATCG TCGGCTAAAA CTTGATGTGT 9600

TACATCTAAA CCAAACACAT TTATAGTAAT CCCACTTTCA AAAACACGCT TCGCTGCTTC 9660 
AGCATCTACC CAAATATTGA ATTCTGCTGT AGGCGTCCAA TTTCCAAATG TACCACCACC 9720 

CATCAAAGTA ATAGATTCAA TATGCTCAGC GATTCTTGGC TCACGAATCA ATGCCGTTGC 9780

TACATTCGTA AGAGGACCTG TCGCTACAAT TGTTACAGGT GTATCACTCG TCATCACTTT 9840 
GTTTATAATC ACATCTGATG CTGGCATTGC AACTGCTTGA CGTGATGGTG TCGACGGTAG 9900 

TTTCGGACCA TCTAATCCAG ATTCCCCATG TATTTCAGAA GCAAAGGCAG CTGGTTTAAT 9960 

TAACGGCCTA TCCGCACCTT TCGCTACTGC TATATCTTGG CGTCCCATAA TATCCAATAC 10020 

GTTCAAGGCG TTTGTCGTAT TCTTGTCAAC TGATTGATTA CCTGCGACTG TTGTTACAGC 10080 

TAATATCTCT AGTGGACTGT CAATTGCCCC CGCTAAAATT AATGCTATTG CATCATCGTG 10140 

TCCTDGATCA CAATCCATAA TAATCTTTCT TTTCATTTAT ATATCCACCT TTCTTAAGTT 10200 

GTTATCGATA GCTTATGTAT ATTTATTTAT GTGGTGAATC ATGTTTATTT TGAAAAATAG 10260 

TTTTAACTTT CTCATATTTT TGGATACAAA CACTATTTAT CTATTTTATG GCTTATAAAT 10320 

TTATCCGATA TGCCTTATCA ACCTACCTCG CTAAAAATAG GATGTCTACA TATCTATACC 10380

GACTTTTGTC AACTCATTTT CACAACAATA TAAACAGCAA TTTATATGAT TGTTACATGA 10440

TTCAAACAAT TTTTATGAAA AATATTTTCA TACACAGAAT ATATATTGAT ATTAAATTTC 10500 

TCAAAAGCTA TATTGAGAAT AATTAGGAGG GATGTTGATG AAATCTTTAT TTGAAAAAGC 10560 

ACAGCAGTTC GGCAAGTCCT TTATGTTACC TATCGCAATC TTACCAGCTG CAGGTCTATT 10620

GTTGGGTATC GGTGGTGCAT TAAGTAATCC AAACACCGTT AAAGCATACC CTATTTTAGA 10680 
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AAATTTACCG GTCATCTTTG CAATTGGTGT CGCAATCGGA TTATCTAGAA GCGATAAAGG 10800 

TACTGCAGGT tTAGctGCGC TGCTCGGTTT CTTAATTATG AACGCAACTA TGAATGGCTT 10860 

ATTAACTATC ACGGGCACAT TGGCAAAAGA TCAGCTTGCA CAAAATGGAC AAGGCATGGT 10920 

GCTCGGTATA CAAACGGTTG AAACCGGTGT TTTTGGCGGG ATTATCACAG GTATTATGAC 10980 

CGCAATACTT CACAACAAAT ATCACAAAGT GGTATTACCA CCGTATTTAG GTTTCTTTGG 11040 

TGGCTCTAGA TTTGTCCCTA TTGTCACAGC ATTTGCCGCA ATCTTTTTAG GTGTATTGAT 11100 

GTTTTTCATT TGGCCAAGCA TACAAGCCGG CATTTATCAT GTTGGTGGAT TTGTAACGAA 11160 

AACAGGTGCC ATCGGTACTT TTGTTTATGG CTTCATCTTA AGATTGTTAG GTCCACTCGG 11220 

TTTACACCAT ATTTTTTACT TACCGTTTTG GCAGACGGCA CTTGGTGGTA CTTTAGAAGT 11280 

CAAAGGGCAC TTAGTTCAAG GTACGCAGAA CATCTTCTTT GCTCAACTTG GTGATCCAGA 11340 

TGTGACGAAG TATTATTCAG GTGTGTCACG CTTTATGTCA GGCCGTTTTA TTACGATGAT 11400 

GTTCGGCTTA TGTGGTGCCG CACTTGCAAT TTATCACACA GCTAAACCTG AACATAAAAA 11460 

AGTTGTCGGC GGTTTAATGT TATCCGCTGC ACTCACTTCA TTTTTAACAG GTATTACCGA 11520 

ACCTTTAGAG TTTAGTTTCT TGTTTGTCGC ACCTATTCTT TATGTAATCC ATGCCTTCTT 11580 

TGATGGATTA GCATTTATGA TGGCAGACAT TTTCAACATT ACAATTGGTC AAACCTTCAG 11640 

TGGAGGCTTT ATCGATTTCT TACTCTTTGG TGTGCTACAA GGTAATAGTA AAACAAACTA 11700 

CCTATACGTC ATACCTATTG GAATTGTGTG GTTCTGTTTG TATTACATCG TTTTCAGATT 11760 

CTTAATTACG AAATTTAATT TCAAAACACC TGGTCGAGAA GATAAAGCTG CAGCACAACA 11820 

AGTTGAGGCT ACTGAAAGAG CACAAACTAT TGTTGCTGGT TTGGGAGGCA AAGATAACAT 11880 

TGAAATCGTT GACTGTTGTG CAACGAGACT ACGCGTCACA CTTCATCAAA ATGACAAAGT 11940 

CGATAAAGTA TTACTCGAAA GTACTGGTGC CAAAGGTGTA ATCCAGCAAG GCACTGGTGT 12000

GCAAGTAATT TATGGGCCTC ACGTTACAGT TATCAAAAAT GAAATTGAAG AATTGCTCGG 12060 

GGATTAAGAC TAACCGAAAT ATCAACAGAA CTAATGGCAA CGATGTACGA AGTAAGAAGT 12120 

GACATCGTTG CTTTTATTTT TAATGTTACA TTTGAAGCAT TAAGTTCATC ATGCACTGTA 12180 

GTGAGCCCGC AAATCGCCTC TGCTAGACAA TCATCTTAAT GCTATGATTA AAGCTTAAGT 12240 

GCCAGATTTG AATTTAATTT CAACAACGAC TTTCACTACA TTAAAAATAG GGCCACTCGA 12300 

CACATATAGT TGTATCAAAT AGCCCTTTAT ACAATTTTTT GGGTAAGGTT TTACAATTTT 12360

TGGGATGGTA TAGATTTTAT AAAAAGTTAT TTAAGTTCTT CTGCTTCAGC CATAATATCT 12420 

TTTAATGTTT TAGCTGAATG TGCGAACTTG CTTTGTTCTT CGTCGTTTAA TGGGATTTCT 12480 
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TCCTCATATT CGCCTTCTAA TAATGCTGAT ACAGTCAATA CGGCATCTTC ATTTCTGAAA 12600 

ATCGCTTCAG TAATTCTAGC TAATCCCATT GCAACACCAT AATAAGTGGC ACCTTTAGCT 12660

TGAATAATGT CATATGCTGC ATCACGTGTT TGAACAAAAA TTTGTTCAAT TTGCGCTTTG 12720 

CCCTCAGGAC GTTCTTCAAG TAATGTCTTC AAAGGTTGAC CCGCAATATT AGCGTGTGAC 12780 

CATACTGGTA ATTCAGTGTC ACCATGTTCA CCAATAATTT GAGCATCGAC GCTACGTGGC 12840 

GCAACATCGn AcgyTcGCTT AACAATAATC TAAAGCGTGC AGAGTCTAAA ATTGTACCAG 12900 

AACCTATAAC ACGTTCTTTA GGTAAACCAG AGAATTTCCA TGTTGCATAC GCTAAAATAT 12960 

CAACAGGATT TGTAGCTACC AAGAAAATAC CATCAAATTT TGATGCCATT ACTTCACCAA 13020

CAATTGATTT GAATATTTTC AAGTTTTTAG ATACTAAATC TAAACGTGTT TCTCCAGGTT 13080 

TTTGTGCAGC ACCAGCACAG ATGACAACTA GATCCGCATC ATGACAATCA CTGTATTCGC 13140 

CAGCTTTCAC ACGAACTGTT GTTGGAGAAT ATGGTGTGGC ATGTTTTAAA TCCATAACAT 13200

CTCCTCGAAC TTTTTCAGTG TCTAAATCAA TGATGACTAA TTCATCAACA ATGCTTTGGT 13260 

TCACTAATGA AAATGCGTAG CTTGAACCTA CTGCACCATT ACCTATTAAT ACAACTTTGT 13320 

TCCCTTTAAA TTTGTTCATT ACAAAAACTC CCTTATGATT AATTCACTAA CATACATGTA 13380

GCTTCAAATA TGTTAGTTTA ATGCTGCTTA TTGACGATAC AAAAGCAAAT AAACATCTCT 13440 

TTTATTTTCA ACGCATAACT TAAAAGGTCA TGTGTCATCC GCTTTTAAGT TTGTGATTTA 13500 

TTTCACATAT AAAATGTAAC ATGCATTAAG TACTGGGTCA ATATTAAATT GTGATTTATT 13560 

TCACATTTTA TTTTAATTTT TACACCTTTT TAATTTGTAT mCGATTACAT CTTAGATGTC 13620

TTTAGTCTTC GTACTTCGCC AGTGATTATT TACACTTTCA CATTTTTATT ATCATGTTTA 13680 

CTTTTTTCTA GGAAAACAAC AATGTTTTTT GAATTAGTCA AATAAATGCG CTCAATCGTC 13740 

GGTGTGCAAA CAGACAATTG TACACAATGC TTATTGATAA GTATTTAAAA AATTAAAAAT 13800 

GTCATACAAT TATCAAATTT GCCATTTTAT TTATATTTTC TCAAACCAAT TAATTGAATA 13860

TCGAAATTTT TAGTAGAATA ATCAAAATAT ACAGATTAAA GGAGGAGTAT CATGCTTACA 13920 

GAACAAGAGA AAGACATTAT CAAACAAACG GTGCCTTTAC TTAAAGAGAA AGGGACAGAA 13980 

ATTACGTCAA TCTTTTATCC AAAAATGTTT AAAGCGCATC CTGAACTTTT AAACATGTTT 14040

AATCAAACGA ACCAAAAACG AGGCATGCAA TCTTCAGCAT TAGCACAAGC TGTAATGGCC 14100 

GCAGCGGTTA ATATCGATAA CTTAAGTGTT ATTAAACCAG TCATTATGCC AGTCGCATAT 14160 

AAACACTGCG CACTACAAGT TTATGCTGAA CATTATCCAA TTGTGGGGAA AAATTTATTA 14220 

AAAGCCATTC AAGACGTGAC AGGATTAGAA GAAAATGACC CTGTCATTCA AGCTTGGGCA 14280 

EP 0 786 519 A2 

(2) INFORMATION FOR SEQ ID NO: 58: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8779 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 58:
GGTATTTTnG GAnGGGTACC TAAAGCAATT CCGGCAAAGG GTnAATCCAG GTACCGAAAT 
GGACTTCCCG TTATCGATAA TACCGACATA TATTGTGACA AGTAGATTTT ATGGACATTT 
AGGCTTACTT TTACTTGTGA TAATTGCATG TATGTTTACT GGTATTTAtC CaTCaATACA 
TATCATTCAA TTATTGATAT ATGTACCGTT TTGTTTTTTC TTAACTGCCt CGGTGACGTT 
ATTAACATCA ACACTCGGTG TGTTAGTTAG AGATACACAA ATGTTAATGC AAGCAATATT 
AAGAATATTA TTTTACTTTT CACCAATTTT GTGGCTACCA AAGAACCATG GTATCAGTGG 
TTTAATTCAT GAAATGATGA AATATAATCC AGTTTACTTT ATTGCTGAAT CATACCGTGC 
AGCAATTTTA TATCACGAAT GGTATTTCAT GGATCATTGG AAATTAATGT TATACAATTT 
CGGTATTGTT GCCATTTTCT TTGCAATTGG TGCGTACTTA CACATGAAAT ATAGAGATCA 
ATTTGCAGAC TTCTTGTAAT ATATTTATAT GACGAAACCC CGCTAACCAT TAATAAATGG 
AAGTGGGGTT CATTTTTGTT TATAATTTAA GTAAATAACA TATTAAGTTG GTGTATTATG 
AACGTTTTAA TAAAGAAATT TTATCATTTG GTAGTTCGAA TACTTTCTAA AATGATTACG 
CCTCAAGTGA TTGATAAACC GCATATCGTA TTTATGATGA CTTTTCCAGA AGATATTAAG 
CCTATCATCA AAGCATTAAA TAATTCGTCG TATCAGAAAA CTGTTTTAAC AACACCAAAA 
CAAGCGCCTT ATTTATCTGA ACTTAGCGAC GATGTTGATG TGATAGAAAT GACTAATCGA 
ACATTGGTAA AACAAATTAA GGCTTTGAAA AGCGCGCAGA TGATTATTAT CGATAATTAT 
TACCTATTGC TAGGTGGATA TAATAAGACT TCTAATCAAC ACATTGTTCA AACGTGGCAT 
GCAAGTGGTG CATTAAAAAA CTTTGGCTTA ACAGATCATC AAGTCGATGT GTCTGACAAG 
GCAATGGTTC AGCAGTACCG TAAAGTTTAT CAAGCGACGG ATTTTTACTT AGTGGGTTGT 
GAACAAATGT CACAATGTTT TAAACAGTCT TTAGGTGCAA CAGAAGAGCA AATGCTGTAT 
TTTGGGCTTC CGAGAATTAA TAAATATTAC ACAGCTGATA GAGCAACGGT TAAGGCAGAG 
TTAAAGGATA AATATGGAAT TACAAATAAG TTGGTATTAT ATGTACCAAC ATATAGAGAA 
GATAAAGCAG ATAATAGGGC TATTGATAAA GCTTATTTTG AAAAATGTTT ACCAGGATAT 
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.ATCGACACGT CTACATTAAT GCTAATGTCA 


GATATAATTA TTAGCGACTA TAGTTCGCTG 1500

CCAATAGAAG CTAGCTTGTT AGATATTCCA ACTATATTTT ATGTGTATGA TGAAGGAACA 1560

TATGATCAGG TGAGAGGCCT GAATCAATTT TACAAAGCAA TACCGGATAG CTACAAAGTG 1620

TATACTGAAG AAGATTTAAT AATGACGATA CAAGAAAAAG AACATCTATT AAGTCCGTTA 1680

TTTAAAGATT GGCATAAGTA TAATACTGAT AAAAGTTTAC ATCAGCTCAC AGAATATATA 1740

GATAAGATGG TGACAAAATG AGGTTTACGA TAATCATACC TACATGTAAT AATGAGGCAA 1800

CAATTCGACA ATTGTTAATA TCTATTGAGA GTAAAGAACA CTATAGAATC CTTTGTATTG 1860

ATGGTGGTTC TACTGATCAA ACAATTCCTA TGATTGAACG GTTACAAAGA GAACTCAAGC 1920

ATATTTCATT AATACAATTA CAAAATGCTT CGATAGCTAC GTGTATTAAT AAAGGTTTGA 1980

TGGATATCAA AATGACAGAT CCACATGATA GTGACGCATT TATGGTCATA AAACCAACAT 2040

CAATCGTATT GCCAGGTAAA TTAGATAGGT TAACTGCTGC TTTCAAAAAT AATGATAATA 2100

TTGATATGGT AATAGGGCAG CGAGCTTACA ATTACCATGG TGAATGGAAA TTGAAAAGTG 2160

CTGATGAGTT TATTAAAGAC AATCGAATCG TTACATTAAC GGAACAACCA GATTTGTTAT 2220

CAATGATGTC TTTTGACGGA AAGTTATTCA GTGCTAAATT TGCTGAATTA CAGTGTGaCG 2280

AAACTTTAGC TAACaCATAC AATCACGCAA TACTTGTCAA GGCGATGCAA AAAGCTACGG 2340

ATATACATTT AGTTTCACAG ATGATTGTCG GAGATAACGA TATAGATACA CATGCTACAA 2400

GTAACGATGA AGATTTTAAT AGATATATCA CAGAAATTAT GAAAATAAGA CAACGAGTCA 2460

TGGAAATGTT ACTATTACCT GAACAAAGGC TATTATATAG TGATATGGTT GATCGTATTT 2520

TATTCAATAA TTCATTAAAA TATTATATGA ACGAACACCC AGCAGTAACG CACACGACAA 2580

TTCAACTCGT AAAAGACTAT ATTATGTCTA TGCAGCATTC TGATTATGTA TCGCAAAACA 2640

TGTTTGACAT TATAAATACA GTTGAATTTA TTGGTGAGAA TTGGGATAGA GAAATATACG 2700

AATTGTGGCG ACAAACATTA ATTCAAGTGG GCATTAATAG GCCGACTTAT AAAAAATTCT 2760

TGATACAACT TAAAGGGAGA AAGTTTGCAC ATCGAACAAA ATCAATGTTA AAACGATAAC 2820

GTGTACATTG ATGACCATAA ACTGCAATCC TATGATGTGA CAATATGAGG AGGATAACTT 2880

AATGAAACGT GTAATAACAT ATGGCACATA TGACTTACTT CACTATGGTC ATATCGAATT 2940

GCTTCGTCGT GCAAGAGAGA TGGGCGATTA TTTAATAGTA GCATTATCAA CAGATGAATT 3000

TAATCAAATT AAACATAAAA AATCTTATTA TGATTATGAA CAACGAAAAA TGATGCTTGa 3060

ATCAATACGC TATGTCGATT TAGTCATTCC AGAAAAGGGC TGGGGACAAA AAGAAGACGA 3120

TGTCGAAAAA TTTGATGTAG ATGTTTTTGT TATGGGACAT GACTGGGAAG GTGAATTCGA 3180
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TAAAATCAAA CAAGAATTAT ATGGTAAAGA TGCTAAATAA ATTATATAGA ACTATCGATA 3300 

CTAAACGATA AATTAACTTA GGTTATTATA AAATAAATAT AAAACGGACA AGTTTCGCAG 3360

CTTTATAATG TGCAACTTGT CCGTTTTTAG TATGTTTTAT TTTCTTTTTC TAAATAAACG 3420

ATTGATTATC ATATGAACAA TAAGTGCTAA TCCAGCGACA AGGCATGTAC CACCAATGAT 3480 

AGTGAATAAT GGATGTTCTT CCCACATACT TTTAGCAACA GTATTTGCCT TTTGAATAAT 3540

TGGCTGATGA ACTTCTACAG TTGGAGGTCC ATAATCTTTA TTAATAAATT CTCTTGGATA 3600 

GTCCGCGTGT ACTTTACCAT CTTCGACTAC AAGTTTATAA TCTTTTTTAC TAAAATCACT 3660 

TGGTAAAACA TCGTAAAGAT CATTTTCAAC ATAATATTTC TTACCATTTA TCCTTTGCTC 3720 

ACCTTTAGAC AATATTTTTA CATATTTATA CTGATCAAAT GAGCGTTCCA TTAATGCATT 3780 

CCCCATCATA TTACGTTGCT TCTCGCCACC AAGGTTTTTA TAGTCTCCTG CACCCATGAT 3840

AACTTGATTA ATTCTAAATT TACCTCGTTT GGTAGTAATC GTATGGTTGT AATTTGCTGT 3900

ATCACTTGAT CCAGTTTTTA AACCATCTGT ACCCGGCAAA CTCATTTTTG CACCTTCCAA 3960 

TGAAAAGTTG AATGTGTAAT ACGTAACTGC ATGCGTTGTT GGTGCTAACT GCTTTGTAAA 4020 

GTCTAATATT TTAGGTGTCT CTTTAATCAC GTGTAAATCT AAAATGGCAT AGTCTCTAGC 4080

AGTCGTTACA GTACGTTCTT GGTCTTTATA CTTTGTTGGT GCAAATGTAC GTAATCTTGA 4140 

ATTTTCAGCA CCCGTTGGAT TGACGAAATG TGTATTTTTC ATTCCGATAG CTTTAGCTTT 4200 

GTTATTCATT AAATCAACGA AATCGCTGGT GTTTTTTGAA ACCTTCTTAG CTAAAATTAA 4260

TGCCGCGGCA TTACTAGAAT TAGATACTGT AATTTGTAAT AGGTCTGCGA TTGTCCATAC 4320 

TTGTCCAGGA TATAGTTTCG TATTACTCAA CTCAGGTAGT GTAGACATAA TATATTCTTT 4380

GTTCGTCATT GTGACTGTGT CATCAAGTGA AAGCTGCCCC TTATTTACAG CTTCCAATGT 4440 

TAACTACATT GTCATTAATT TAGTCATAGA CGCTGGAtTC CACTTAGTAT CGATATTGTA 4500 

TTGAiTACAGT AATTGTCCAG TTTGACTTAC ATTAACAGCA CTCGTCGGTT CGTATGCAGC 4560

CGACAAACCT GCATAACCAT ATTGATTTGC TGCTTGTACA GGGGTTACGT CACTGTTAGT 4620 

AGCTTGTGCA TATGGTGTCA TAATACTTAA TGTTAAACAT AAAATGATGA TAATAGATAT 4680

TAAATTTTTC ATAAAGCGTT AATCTTCCCT TTTCCAATTC TTAAATATTC CCTAAAAGCA 4740

ATGGTTATTC CTACTTACGG AAATCATTGC TAATTCACTT CACCTTAATT AAATTGTTGA 4800

AAATAAAGTT TTCTGCAGTT AATTTGAAAA ATAATGCAAA TATATTACGT GTGTAGCTAA 4860

AGGTGTTATA ATGTTTGTAC GAAGAGCAAA CTTACTCAAA AGCGATTAAT TTTCATGTTT 4920

TAATATAAAG ACTTTGAGAA GTTATTACAA AAAATGCAAT AGAAATATTC TATCATATAA 4980 
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AAGTATATGA TAGAAATGCA TGTATCTATC TAAATGAATT AACTATAAAT TTCAAACAGA 5100 

AGAGGTAAAA CTATGAAACG AGAAAATCCA TTGTTTTTCT TATTTAAAAA ACTATCATGG 5160 

CCAGTGGGTC TTATCGTTGC AGCTATCACT ATTTCATCAC TAGGGAGCTT AAGTGGACTA 5220

TTAGTGCCAC TGTTTACTGG ACGAATTGTA GATAAATTTT CCgTGAGCCA TATCAATTGG 5280 

AATCtAATCG CATTATTTGG TGGTATCTTT GTCATCAATG CTTTATTAAG CGGATTAGGT 5340

TTATATTTAT TAAGTAAAAT TGGTGAAAAG ATTATTTATG CGATACGCTC AGTTTTATGG 5400

GAGCATATCA TACAATTAAA AATGCCATTC TTTGACAAAA ATGAAAGTGG TCAATTAATG 5460 

AGTCGATTAA CTGACGATAC GAAAGTGATA AATGAATTTA TTTCACAAAA GCTACCTmAC 5520 

TTATTACCAT CAATCGTTAC ATtAGTTGGG TCACTAATCA TGTTATTTAT TTTAGATTGG 5580 

AAAATGACAT TATTAACATT TATAACGATA CCGATATTCG TTTTaATTAT GATTCCTCTA 5640 

GGTCGTATTA TGCAAAAGAT ATCGACAAGT ACACAATCTG AAATTGCAAA CTTCAGTGGT 5700

TTGTTAGGGC GTGTCCTAAC TGAAATGCGT CTTGTTAAAA TATCAAATAC AGAGCGTCTT 5760 

GAATTAGATA ATGCACATAA AAATTTGAAT GAAATATATA AATTAGGTTT AAAACAGGCT 5820 

AAAATTGCGG CAGTTGTACA ACCAATTTCA GGTATAGTTA TGTTGCTAAC AATTGCAATT 5880

ATTTTAGGTT TTGGTGCATT AGAAATTGCG ACTGGTGCAA TCACTGCAGG TACATTAATT 5940

GCAATGATAT TTTATGTTAT TCAGTTATCT ATGCCTTTAA TCAATCTTTC CACGTTAGTT 6000 

ACAGATTATA AAAAGGCAGT CGGTGCAAGT AGTAGAATAT ACGAAATCAT GCAAGAACCT 6060 

ATTGAACCGA CAGAAGCTCT TGAAGATTCT GAAAATGTAT TAATTGATGA CGGTGTATTG 6120 

TCATTTGAAC ATGTAGACTT TAAATATGAT GTGAAGAAAA TATTAGATGA TGTGTCGTTC 6180 

CAAATCCCAC AAGGTCAAGT GAGTGCTTTT GTAGGCCCTT CTGGGTCTGG TAAAAGTACG 6240

ATATTTAATC TGATAGAACG TATGTATGAA ATTGAGTCAG GTGATATTAA ATATGGCCTT 6300 

GAAAGTGTCT ATGATATCCC GTTATCTAAG TGGCGACGCA AAATTGGATA TGTTATGCAA 6360

TCAAATTCGA TGATGAGTGG TACAATTAGA GACAATATTT TATACGGAAT TAATCGTCAT 6420 

GTTTCAGATG AAGAACTTAT TAATTATGCT AAATTAGCGA ACTGTCATGA TTTTATCATG 6480

CAATTTGATG AAGGATATGA CACGCTTGTA GGTGAACGAG GATTGAAACT GTCTGGCGGA 6540

CAACGTCAAC GTATTGATAT TGCTAGAAGT TTTGTTAAAA ATCCTGATAT TTTGTTACTT 6600 

GATGAAGCAA CAGCTAATCT CGATAGTGAA AGTGAATTGA AAATTCAAGA AGCTTTAGAA 6660 

ACATTGATGG AAGGTAGAAC AACGATTGTC ATTGCGCATC GTTTGTCTAC AATTAAAAAA 6720

GCCGGTCAAA TTATATTCTT AGACAAAGGA CAGGTAACAG GTAAAGGTAC GCATTCAGAA 6780 
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TTTTATATAT ATAAGTAAGC TTGGAGCAAA TACACATATA CCATCGAGGA AATTAAAGTG €900 

TGGCACATTG ATGGATATAG ATGTTAATAA ATTGCTTCAA GCTTTTGTCT ATTTTAAATC 6960

ATTTGAGAAG TTACGACATA ATAATTCTTA AATTAATGAA ATCGATATTT TAAGAAAAAA 7020

ATGCTCATGG TATAATACAA GTTATAAGCA AACATACATA TATTAAATAC TGTAGCCACG 7080 

AGTCATAATT CTTCATATTT TACATAGCAA TTTAACTGAT TTTAGAGTCC ACGGTACAGA 7140

AGTTTGATAT TTCAATGTTT CTAAATTTTT AAAAAATTAA ATCATAGGTG GGTGCCAAAT 7200 

GTTTTTATTA ATCAACATTA TTGGTCTAAT TGTATTTCTT GGTATTGCGG TATTATTTTC 7260 

AAGAGATCGC AAAAATATCC AATGGCAATC AATTGGGATC TTAGTTGTTT TAAACCTGTT 7320 

TTTAGCATGG TTCTTTATTT ATTTTGATTG GGGTCAAAAA GCAGTAAGAG GAGCAGCCAA 7380

TGGTATCGCT TGGGTAGTTC AGTCAGCGCA TGCTGGTACA GGTTTTGCAT TTGCAAGTTT 7440

GACAAATGTT AAAATGATGG ATATGGCTGT TGCAGCCTTA TTCCCAATAT TATTAATAGT 7500 

GCCATTATTT GATATCTTAA TGTACTTTAA TATTTTACCG AAAATTATTG GAGGTATTGG 7560 

TTGGTTACTA GCTAAAGTAA CAAGACAACC TAAATTCGAG TCATTCTTTG GGATAGAAAT 7620 

GATGTTCTTA GGAAATACTG AAGCATTAGC CGTATCAAGT GAGCAACTAA AACGTATGAA 7680

TGAAATGCGT GTATTAACAA TCGCAATGAT GTCAATGAGC TCTGTATCGG GAGCTATTGT 7740 

AGGTGCGTAT GTACAAATGG TACCAGGAGA ACTGGTACTA ACGGCAATTC CACTAAATAT 7800 

CGTTAACGCG ATTATTGTGT CATGCTTGTT GAATCCAGTA AGTGTTGAAG AGAAAGAAGA 7860

TATTATTTAC AGTCTTAAAA ACAATGAAGT TGAACGTCAA CCATTCTTCT CATTCCTTGG 7920 

AGATTCTGTA TTAGCAGCAG GTAAATTAGT ATTAATCATC ATCGCATTTG TTATTAGTTT 7980 

TGTAGCGTTA GCTGATCTAT TTGATCGTTT TATCAATTTG ATTACAGGAT TGATAGCAGG 8040

ATGGATAGGC ATAAAAGGTA GTTTCGGTTT AAACCAAATT TTAGGTGTGT TTATGTATCC 8100 

ATTTGCGCTA TTACTCGGTT TACCTTATGA TGAAGCGTGG TTGGTAGCAC AACAAATGGC 8160 

TAAGAAAATT GTTACAAATG AATTTGTTGT TATGGGTGAA ATTTCTAAAG ATATTGCATC 8220 

TTATACACCA CACCATCGTG CGGTTATTAC AACATTCTTA ATTTCATTTG CAAACTTCTC 8280 

AACGATTGGT ATGATTATCG GTACATTGAA AGGCATTGTT GATAAAAAGA CATCAGACTT 8340 

TGTATCTAAA TATGTACCTA TGATGCTATT ATCAGGTATC CTAGTTTCAT TATTAACAGC 8400 

AGCTTTCGTT GGTTTATTTG CATGGTAATA TGTCGAAGAG TGACTATGAT AATACATTTT 8460

AACTAATAAA TATGTCCAGG CATGTCGTCT ATTGATATAG GTGAGATGCT TGGACTTTTT 8520

TATTATTGAT ATAAAGGTAT nTAAATATTT TTAAAGTTAC CGAAATTGAA GCATTATAAA 8580 
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GACAGTAAGG ACTAGGTACA GTCATAGTAC TTCGAGCAAA ATTTGTTTTG TTATTATAAA 8700 

CAACACAAAG GAGATAACTT CTCTAnTGAA GAAGTTAAAA ACATTATAGC AGACAATGAA 8760 

ATGAAAGTAA ATTAAAAAT 8779

(2) INFORMATION FOR SEQ ID NO: 59: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 31096 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 59: 

GTTGCAGTAG TCAAAGAATT AAACAAGGTG AAGGcGTGTA GCTTGCACAC CCGAAAATGT 60 

GCGTAAGTTA aCGGATGCAG GACATAAAGT AATTGTTGAA AAAAATGCTG GCATTGGTTC 120

AGGATTTTCT AACGATATGT ATGAAAAAGA AGGCGCTAAG ATCGTAACTC ACGAACAAGC 180 

ATGGGAAGCT GATCTTGTTA TCAAAGTAAA AGAACCTCAT GAAAGCGAAT ATCAATATTT 240

CAAAAAGAAT CAAATTATCT GGGGATTTTT ACATCTAGCA TCTTCAAAAG AAATAGTAGA 300

AAAAATGCAA GAAGTTGGTG TAACTGCGAT TAGTGGTGAA ACCATTATAA AAAATGGAAA 360 

AGCAGAATTA TTAGCGCCAA TGAGTGCTAT AGCAGGTCAA CGCTCAGCAA TTATGGGAGC 420 

TTACTACTCT GAAGCACAAC ATGGTGGTCA AGGTACTTTA GTGACTGGTG TACATGAAAA 480

TGTGGATATA CCTGGTAGTA CATATGTGAT TTTCGGTGGT GGAGTAGCAG CAACAAATGC 540 

AGCAAATGTT GCCTTGGGAC TAAATGCTAA AGTAATCATT ATCGAGTTAA ACGATGACCG 600 

CATTAAATAT CTTGAAGATA TGTATGCAGA AAAAGATGTC ACAGTAGTCA AATCAACACC 660 

AGA^AATTTA GCAGAACAAA TTAAGAAAGC AGATGTATTT ATTTCTACAA TTTTAATTTC 720 

AGGTGCGAAA CCGCCAAAAT TGGTTACTCG TGAGATGGTT AAATCAATGA AAAAAGGTTC 780 

AGTATTAATC GATATAGCTA TTGACCAAGG TGGAACTATT GAAACAATTA GACCAACTAC 840 

AATTTCTGAT CCAGTGTATG AAGAAGAAGG TGTGATTCAT TATGGTGTAC CAAATCAACC 900

AGGAGCAGTC CCAAGAACTT CAACAATGGC ATTAGCACAA GGAAATATTG ATTATATATT 960

AGAAATTTGT GACAAAGGCT TAGAACAAGC AATTAAAGAT AATGAAGCCT TAAGTACTGG 1020 

TGTAAACATT TACCAAGGAC AAGTGACAAA TCAAGGATTA GCTTCATCAC ATGACCTAGA 1080 

TTATAAAGAA ATATTAAATG TTATCGAATA GATAGTAATT TAAATGAAAT TGAGTGAAAT 1140

GAATATTTTA AATATAGCAT TATAGTTTGG ACTAAAAATT TACAAAACGG AAGGATGTAA 1200 
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TCGAAGAAGC TAAAGCAAGC ATTAAACCAT TTATTCGTCG AACACCTCTA ATTAAATCAA 1320 

TGTATTTAAG CCAAAGTATA ACTAAAGGGA ATGTATTTCT AAAATTAGAA AATATGCAAT 1380 

TCACAGGATC TTTTAAATTT AGAGGCGCTA gCAATllAAAA TTAATCACTT AACAGATGAA 1440

CAAAAAGAAA AAGGCATTAT CGCAGCATCT GCTGGGgAAC CATGCACAAG GTGTTGCTTT 1500 

AACAGCTAAA TTATTAGGCA TTGATGCAAC GATTGTAATG CCTGAAACAG CACCACAAGC 1560

GAAACAACAA GCAACAAAAG GCTATGGGGC AAAGGTTATT TTAAAAGGTA AAAACTTTAA 1620

CGAAACTAGA CTTTATATGG AAGAATTAGC GAAAGAAAAT GGCATGACAA TCGTTCATCC 1680 

ATATGACGAT AAGTTTGTAA TGGCAGGCCA AGGAACAATT GGTTTAGAAA TTTTAGATGA 1740 

TATTTGGAAT GTGAATACAG TCATCGTACC AGTTGGCGGT GGAGGATTAA TTGCAGGTAT 1800 

TGCCACCGCA TTAAAATCAT TTAACCCTTC AATTCATATT ATCGGTGTTC AATCTGAGAA 1860 

TGTTCATGGT ATGGCTGAGT CTTTCTATAA GAGAGATTTA ACTGAACATC GAGTGGATAG 1920 

GACAATAGCA GATGGTTGTG ATGTAAAAGT TCCTGGTGAA CAAACATATG AAGTAGTTAA 1980 

ACATTTAGTA GATGAATTTA TTCTTGTTAC TGAAGAAGAA ATTGAACATG CTATGAAAGA 2040

TTTAATGCAG CGTGCCAAAA TTATTACTGA AGGTGCAGGC GCATTACCAA CAGCTGCAAT 2100

TTTAAGTGGA AAAATAAACA ATAAATGGCT TGAAGATAAA AATGTTGTTG CATTAGTTTC 2160 

AGGCGGGAAT GTTGACTTAA CTAGAGTTTC AGGTGTCATT GAACATGGAC TGAATATTGC 2220 

AGATACAAGC AAGGGTGTGG TAGGTTAAAA CATTTAATCT TAAAAATGAG GTGTAATTAT 2280

GTCAAATGGT AAAGAATTAC AAAAAAATAT AGGTTTCTTC TCAGCGTTTG CTATTGTTAT 2340

GGGGACAGTT ATTGGTTCAG GAGTATTCTT TAAAATATCA AACGTAACAG AAGTAACAGG 2400 

AACAGCAGGA ATGGCCTTGT TTGTATGGTT CCTAGGCGGC ATCATTACCA TTTGTGCGGG 2460 

GTTAACAGCA GCAGAACTTG CTGCTGCAAT CCCTGAAACA GGTGGCTTAA CGAAGTATAT 2520 

AGAATATACA TACGGTGATT TCTGGGGCTT CCTATCAGGT TGGGCGCAAT CATTTATTTA 2580 

TTTTCCAGCT AACGTAGCAG CATTGTCTAT CGTATTTGCG ACACAGCTAA TTAATTTATT 2640

CCATTTATCT ATAGGTTCGT TAATACCAAT AGCAATCGCA TCTGCGTTAT CTATTGTGTT 2700 

GATAAATTTC CTAGGTTCAA AAGCAGGCGG AATTTTACAA TCAGTTACTT TAGTAATTAA 2760 

ACTGATTCCA ATCATCGTTA TTGTAATTTT TGGTATTTTT CAATCTGGAG ATATCACTTT 2820 

TTCATTAATT CCAACTACAG GTAATTCaGG AAATGGCTTC TTTACAGCAA TTGGTAGTGG 2880 

TTTATTAGCA ACTATGTTTG CATATGATGG TTGGATTCAT GTAGGAAATG TTGCGGGGGA 2940

ACTTAAAAAT CCTAAACGCG ATTTACCTTT AGCGATTTCA GTTGGTATCG GTTGTATTAT 3000 
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TGGTAATTTA AATGCAGCTT CAGATACATC AAAAATATTA TTTGGTGAAA ATGGCGGTAA 3120 

GATTATTACA ATCGGTATAT TAATTTCTGT TTATGGTACG ATCAATGGCT ATACTATGAC 3180 

TGGTATGCGC GTACCATATG CAATGGCTGA AAGAAAATTA TTGCCATTTA GCCATTTATT 3240

CGCAAAATTA ACAAAATCTG GCGCACCATG GTTTGGCGCA ATTATACAAC TTATAATCGC 3300

TATCATCATG ATGTCAATGG GAGCATTTGA TACAATTACA AATATGTTAA TCTTTGTTAT 3360 

TTGGTTGTTC TATTGTATGT CATTTGTTGC GGTAATAATT TTAAGAAAAC GTGAACCAAA 3420 

TATGGAACGA CCATATAAAG TACCGTTATA TCCGATCATA CCTTTAATTG CTATTTTGGC 3480 

AGGATCATTT GTATTAATTA ATACACTGTT TACACAATTT ATATTAGCAA TCATTGGAAT 3540 

TCTAATAACA GCACTTGGTA TACCAGTTTA TTACTATAAA AAGAAACAAA AAGCAGCATA 3600 

AGGTAAGATA ACTAGCATTG AGAATAAATG GATGGACTAC TAATAAATTT AAAGTTTTAC 3660 

ACATTAAAAT CAAAAACCAT TCAATTATTC TATGGAACAG ACAAATTTCT GTTATGGAAT 3720

TTGTCTGTTT TTCAAAAGTA TAGGGAGGCA AATAGAGATG GAAAAGCCGT CAAGAGAGGC 3780 

ATTTGAAGGC AATAATAAGT TGTTAATAGG AATTGTTCTA AGTGTAATAA CGTTTTGGCT 3840 

ATTTGCACAA TCATTGGTTA ATGTTGTACC AATACTTGAA GATAGTTTCA ATACAGATAT 3900

TGGAACGGTT AATATCGCCG TTAGTATAAC TGCTTTATTT TCAGGAATGT TTGTAGTAGG 3960

AGCAGGTGGT CTTGCTGATA AATATGGCAG AATTAAACTC ACGAACATTG GTATTATCTT 4020 

AAATATATTA GGTTCATTAT TAATCATTAT TTCAAATATT CCTTTATTAC TTATTATAGG 4080

AAGATTAATT CAAGGACTTT CAGCAGCATG TATTATGCCT GCAACTTTGT CTATTATTAA 4140 

GTCATATTAC ATTGGGAAAG ATAGACAACG CGCTTTAAGT TATTGGTCAA TTGGCTCATG 4200 

GGGCGGCTCT GGTGTTTGTT CATTTTTTGG AGGTGCAGTT GCAACGCTTT TAGGTTGGCG 4260 

TTGGATTTTC ATCCTATCAA TTATAATTTC ATTAATTGCA CTGTTTCTTA TTAAAGGCAC 4320 

ACCTGAAACT AAATCTAAAT CGATTTCTCT AAATAAATTT GACATTAAAG GTCTGGTTCT 4380 

TTTAGTCATT ATGCTCCTCA GTTTAAATAT TTTAATTACT AAAGGATCAG AATTAGGTGT 4440 

AACCTCACTT CTTTTTATTA CTTTATTAGC TATTGCAATT GGATCTTTTA GTTTATTTAT 4500 

AGTTCTTGAA AAGCGTGCTA CAAATCCTTT AATCGATTTT AAATTATTTA AAAATAAAGC 4560

TTACACAGGT GCAACAGCTT CAAACTTTTT GTTAAATGGT GTTGCAGGAA CATTAATAGT 4620 

AGCCAACACA TTTGTTCAAA GAGGTTTAGG ATATTCTTCA TTGCAAGCAG GAAGTTTATC 4680 

AATCACTTAT TTAGTAATGG TACTAATTAT GATTCGTGTT GGTGAAAAGT TACTTCAAAC 4740

ACTCGGATGC AAGAAACCAA TGTTAATTGG AACAGGAGTT CTTATTGTCG GAGAATGTCT 4800 
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ATTCTTTGGT 


TTAGGACTAG GGATATATGC TACACCATCA ACAGATACAG CAATTGCAAA 4920

TGCACCGTTA GAAAAAGTAG GCGTTGCTGC AGGTATCTAT AAAATGGCTT CTGCATTAGG 4980

TGGAGCATTT GGCGTCGCAT TGAGTGGTGC AGTATATGCA ATCGTATCAA ATATGaCAAA 5040

CATTTATACA GGTGcAATGa TTGnCATTAT GGTTaAATGC AGGTATGGGa ATATTATCaT 5100

TCGTTATCAT TTTGtTACTT GTGcCTAAAC mAAACGACAC TCAATTATGA TAATTGAGAA 5160

TTAAATTGAA ATCATACAAG TCGCTACAAT ATTAAACAAA AATATAAACC GATTCTTATG 5220

TGTCATTATT TTAAATGAAC ATAGGGATTG GTTTTTTATT ACTCTTTTAC GCTACTTTAT 5280

TTATAATTAT TATAAATTGT CACAAATTCA ATTTACCTTA CAATATATTT TGTGTTATTA 5340

TATTCTGGAG CATAAATAAA TTGTTCAACA CATAGTTGTA ATGTGTTTCA ATACTTTTTG 5400

GATAGATTGC GAAATTGTAT TGAATCGTCA TCGTTTTAAA TTTTTAAATG AGAATGGAAT 5460

GAGCATTACA ATACACAAGC AATCAAAAGT AAATACATTC ACAACACAAC AGAGACATAA 5520

CAACAAGATA AGGAGTGAAC AATAGCTGTG AATTATCGTG ATAAAATTCA AAAGTTTAGT 5580

ATTCGTAAAT ATACAGTTGG TACATTTTCA ACTGTCATTG CGACATTGGT ATTTTTAGGA 5640

TTCAATACAT CACAAGCACA TGCTGCTGAA ACAAATCAAC CAGCAAGCGT GGTTAAACAG 5700

AAACAACAAA GTAATAATGA ACAGACTGAG AATCGAGAAT CTCAAGTACA AAATTCTCAA 5760

AATTCACAAA ATGGTCAATC ATTATCTGCT ACTCATGAAA ATGAGCAACC AAATATTAGT 5820

CAAGCTAATT TAGTAGATCA AAAAGTAGCG CAATCATCTA CTACTAATGA TGAACAACCA 5880

GCATCTCAAA ATGTAAATAC AAAGAAAGAT TCGGCAACGG CTGCGACAAG ACAACCAGAT 5940

AAAGAACAAA GTAAGCATAA ACAAAACGAA AGTCAATCTG CTAATAAAAA TGGAAACGAC 6000

AATAGAGCGG CTCATGTAGA AAATCATGAA GCAAATGTAG TAACAGCTTC AGATTCATCT 6060

GATAATGGTA ACGTACAACA TGACCGAAAT GAATTACAAG CGTTTTTTGA TGCAAATTAT 6120

CATGATTATC GCTTTATTGA CCGTGAAAAT GCAGATTCTG GCACATTTAA CTATGTAAAA 6180

GGCATTTTTG ATAAGATTAA TACGTTATTA GGCAGTAATG ATCCAATAAA CAATAAAGAC 6240

TTGCAACTTG CATACAAAGA ATTGGAACAA GCTGTTGCTT TAATTCGTAC AATGCCTCAA 6300

CGTCAACAGA CTAGCCGACG TTCAAATAGA ATTCAAACGC GTTCGGTTGA GTCAAGAGCT 6360

GCAGAGCCTA GATCAGTATC AGACTATCAA AATGCAAATT CATCATATTA TGTTGAAAAT 6420

GCTAATGATG GTTCGGGCTA TCCTGTTGGT ACATATATCa ATGCTTCTAG TAAAGGGGCG 6480

CCATATAATT TACCAACTAC ACCATGGAAT ACATTGAAGG CCTCTGACTC AAAGGAAATT 6540

GCTCTTATGA CAGCGAAACA AACTGGAGAC GGGTACCAAT GGGTTATTAA GTTTAATAAA 6600
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GTAGGAAGAA CTGACTTTGT AACAGTTAAT TCAGATGGAA CAAATGTACA ATGGAGTCAT 6720 

GGAGCAGGAG CAGGTGCAAA TAAACCACTT CAACAAATGT GGGAATATGG AGTAAATGAT 6780

CCTCATCGTT CACATGACTT TAAAATAAGA AATAGAAGTG GCCAAGTAAT ATATGACTGG 6840

CCAACTGTCC ATATTTATTC TTTAGAAGAT TTATCTAGAG CGAGTGATTA TTTTAGTGAA 6900 

GCTGGAGCGA CACCTGCTAC TAAAGCTTTT GGTAGACAAA ATTTTGAATA TATTAATGGT 6960 

CAAAAACCTG CTGAATCACC GGGTGTTCCT AAAGTTTATA CTTTCATCGG TCAAGGTGAT 7020 

GCAAGTTATA CAATTTCATT TAAAACACAA GGTCCAACTG TTAATAAATT GTACTATGCA 7080 

GCAGGTGGGC GTGCTTTAGA GTACAATCAA TTATTTATGT ACAGTCAACT ATACGTCGAA 7140 

TCAACGCAAG ACCATCAACA ACGTCTTAAT GGTTTAAGAC AAGTGGTTAA TCGTACATAT 7200 

CGCATAGGTA CAACTAAACG TGTAGAAGTG AGTCAAGGAA ATGTACAAAC GAAAAAGGTA 7260 

TTAGAAAGTA CAAACCTAAA TATAGATGAT TTTGTTGATG ATCCTTTAAG TTATGTTAAG 7320 

ACGCCGAGTA ATAAAGTGTT AGGATTTTAT TCGAATAATG CAAATACTAA TGCTTTTAGA 7380 

CCGGGTGGAG CCCAACAATT AAATGAATAT CAATTAAGTC AATTATTTAC TGATCAAAAA 7440 

TTACAAGAAG CAGCAAGAAC TAGAAACCCA ATAAGATTAA TGATTGGTTT CGACTATCCT 7500 

GATGCTTATG GTAATAGTGA AcTTTAGTTC CTGTTAACTT AACGGTATTA CCTGAAATCC 7560 

AACATAATAt TaAATTCTTT AAAAATGACG ATACTCAAAA TATTGCTGAA AAACCATTTT 7620 

CAAAACAAGC TGGGCATCCA GTTTTCTATG TATATGCAGG TAACCAAGGG AATGCTTCCG 7680

TGAATTTAGG TGGTAGCGTA ACATCTATTC AACCATTACG TATTAATTTA ACAAGTAATG 7740 

AGAATTTTAC AGATAAAGAT TGGCAAATTA CAGGTATTCC GCGTACATTA CACATTGAAA 7800 

ACTCGACAAA TAGACCTAAT AATGCCAGAG AACGCAATAT TGAACTTGTT GGTAACTTAT 7860 

TACC&GGGGA TTACTTTGGA ACGATACGTT TTGGACGTAA AGAACAATTA TTCGAAATTC 7920 

GTGTTAAACC ACATACACCA ACAATTACAA CGACAGCTGA GCAATTAAGA GGTACAGCAT 7980

TACAAAAAGT GCCTGTTAAT ATTTCGGGAA TACCGTTGGA TCCATCGGCA TTGGTTTATT 8040

TAGTTGCACC AACAAATCAA ACTACGAATG GTGGTAGTGA GGCAGATCAA ATACCATCTG 8100 

GTTATACGAT ACTTGCGACT GGTACACCTG ATGGGGTGCA TAATACAATT ACTATACGAC 8160 

CGCAAGATTA TGTTGTATTC ATACCACCTG TAGGTAAACA AATTAGAGCA GTAGTTTATT 8220 

ATAATAAAGT AGTTGCATCT AATATGAGTA ATGCTGTTAC TATTTTGCCA GATGACATTC 8280

CACCAACAAT CAATAATCCT GTTGGAATAA ATGCCAAATA CTATCGAGGC GACGAAkCAA 8340

CTTTACAATG GGTGTCTCTG ATAGACATTC TGGTATAAAA AATACAACTA TTACGACATT 8400 
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TACAGGTAGA GTGAGTATGA ATCAGGCATT TAACAGTGAT ATTACATTTA AAGTGTCAGC 8520 

GACAGaCAAT GTCAATAATA CGACAAATGA TAGTCAATCT AAACATGTTT CAATTCATGT 8580 

AGGTAAAATT AGTGAAGATG CTCATCCGAT TGTATTAGGA AATACTGAGA AAGTTGTAGT 8640

AGTCAATCCG ACTGCTGTAT CTAATGATGA AAAGCAAAGC ATAATTACTG CCTTTATGAA 8700 

TAAAAACCAA AATATAAGAG GATATTTAGC ATCAACTGAT CCAGTAACTG TCGATAATAA 8760 

TGGTAATGTC ACATTACATT ACCGTGATGG CTCATCGACA ACGCTTGATG CTACAAATGT 8820 

GATGACATAC GAACCAGTTG TGAAACCTGA ATACCAAACT GTCAATGCTG CTAAAACAGC 8880 

AACGGTAACG ATTGCTAAAG GACAATCATT TAGTATTGGT GATATTAAAC AATATTTTAC 8940

TTTAAGTAAT GGACAACCTA TTCCAAGTGG CACATTTACA AATATTACAT CTGATAGAAC 9000 

TATTCCAACT GCACAAGAAG TTAGTCAAAT GAACGCAGGC ACGCAGTTAT ACCATATAAC 9060 

TGCTACAAAT GCGTATCATA AAGATAGTGA AGACTTCTAT ATTAGTTTGA AAATCATCGA 9120

TGTGAAACAA CCAGAAGGCG ATCAACGTGT ATATCGTACA TCAACATATG ATTTAACTAC 9180 

TGATGAAATC TCAAAAGTAA AACAAGCATT TATTAATGCA AATAGAGATG TAATTACGCT 9240

TGCCGAAGGT GATATTTCAG TTACAAATAC ACCTAATGGT GCTAATGTAA GTACTATTAC 9300

AGTAAATATT AATAAAGGTC GATTAACGAA ATCATTCGCG TCAAACCTAG CTAATATGAA 9360

TTTCTTGCGT TGGGTTAATT TCCCACAAGA TTATACAGTG ACATGGACGA ATGCAAAAAT 9420 

TGCAAACAGA CCAACAGATG GTGGTTTATC ATGGTCTGAT GACCATAAAT CTTTAATTTA 9480

TCGTTATGAT GCTACATTAG GTACTCAAAT TACGACGAAT GATATTTTAA CAATGTTAAA 9540

AGCAACAACT ACAGTGCCTG GATTGCGAAA TAACATTACT GGTAATGAAA AATCACAAGC 9600 

AGAAGCTGGC GGAAGACCTA ACTTTAGAAC GACTGGTTAT TCACAATCAA ATGCGACAAC 9660 

TGATGGTCAA CGTCAATTTA CGTTGAATGG TCAAGTGATT CAAGTGTTAG ACATCATCAA 9720

CCCTTCAAAC GGTTATGGTG GGCAACCTGT TACAAATTCA AATACTCGTG CAAACCATAG 9780 

TAACTCAACT GTTGTTAACG TAAACGAACC GGCAGCTAAT GGTGcTGGCG CATTTACAAT 9840

TGACCACGTT GTAAAAAGTA ATTCTACACA TAATGCAAGT GATGCAGTTT ATAAAGCACA 9900 

GTTATACTTA ACGCCATATG GTCCAAAACA ATATGTTGAA CATTTAAATC AAAATACAGG 9960

AAATACTACT GACGCTATTA ACATTTATTT TGTACCAAGT GACTTAGTGA ATCCAACAAT 10020 

TTCAGTAGGT AATTACACTA ATCATCAAGT GTTCTCAGGT GAAACATTTA CAAATACTAT 10080 

TACAGCGAAT GATAACTTTG GTGTGCAATC TGTAACTGTA CCAAATACAT CACAAATTAC 10140

AGGTACTGTT GATAATAACC ATCAACATGT TTCTGCAACG GCACCAAATG TGACATCAGC 10200 

GTTCAATGTA ACAGTGAAAC CTTTGCGTGA TAAATATCGA GTTGGTACTT CATCAACGGC 10320 

TGCTAATCCT GTGAGAATTG CCAATATTTC GAATAATGCG ACAGTATCAC AAGCTGATCA 10380

AACGACAATT ATTAATTCGT TAACGTTTAC TGAAACAGTA CCAAATAGAA GTTATGCAAG 10440 

AGCAAGTGCG AATGAAATCA CTAGTAAAAC AGTTAGTAAT GTCAGTCGTA CTGGAAATAA 10500

TGCCAATGTg CACAGTAACT GTTACTTATC AAGATGGAAC AACATCAACA GTGACTGTAC 10560 

CTGTAAAGCA TGTCATTCCA GAAATCGTTG CACATTCGCA TTACACTGTA CAAGGCCAAG 10620 

ACTTCCCAGC AGGTAATGGT TCTAGTGCAT CAGATTACTT TAAGTTATCT AATGGTAGTG 10680 

ACATTGCAGA TGCAACTATT ACATGGGTAA GTGGACAAGC GCCAAATAAA GATAATACAC 10740 

GTATTGGTGA AGATATAACT GTAACTGCAC ATATCTTAAT TGATGGCGAA ACAACGCCGA 10800

TTACGAAAAC AGCAACATAT AAAGTAGTAA GAACTGTACC GAAACATGTC TTTGAAACAG 10860

CCAGAGGTGT TTTATACCCA GGTGTTTCAG ATATGTATGA TGCGAAACAA TATGTTAAGC 10920 

CAGTAAATAA TTCTTGGTCG ACAAATGCGC AACATATGAA TTTCCAATTT GTTGGAACAT 10980 

ATGGTCCTAA CAAAGATGTT GTAGGCATAT CTACTCGTCT TATTAGAGTG ACATATGATA 11040 

ATAGACAAAC AGAAGATTTA ACTATTTTAT CTAAAGTTAA ACCTGACCCA CCTAGAATTG 11100 

ACGCAAACTC TG7GACATAT AAAGCAGGTC TTACAAACCA AGAAATTAAA GTTAATAAGG 11160 

TATTAAATAA CTCGTCAGTA AAATTATTTA AAGCAGATAA TACACCATTA AATGTCACAA 11220 

ATATTACTCA TGGTAGCGGT TTTAGTTCGG TTGTGACAGT AAGTGACGCG TTACCAAATG 11280 

GCGGAATTAA AGCAAAATCT TCAATTTCAA TGAACAATGT GACGTATACG ACGCAAGACG 11340 

AACATGGTCA AGTTGTTACA GTAACAAGAA ATGAATCTGT TGATTCAAAT GACAGTGCAa 11400 

CAGTAACAGT GACACCACAA TTACAAGCAA CTACTGAAGG CGCTGTATTT ATTAAAGGTG 11460 

GCGACGGTTT TGATTTCGGA CACGTAGAAA GATTTATTCA AAACCCGCCA CATGGGGCAA 11520 

CGGTTGCATG GCATGATAGT CCAGATACAT GGAAGAATAC AGTCGGTAAC ACTCATAAAA 11580 

CTGCGGTTGT AACATTACCT AATGGTCAAG GTACGCGTAA TGTTGAAGTT CCAGTCAAAG 11640 

TTTATCCAGT TGCTAATGCA AAGGCGCCAT CACGTGATGT GAAAGGTCAA AATTTGACTA 11700

ATGGAACGGA TGCGATGAAC TACATTACAT TTGATCGAAA TACAAACACA AATGGTATCA 11760 

CTGCAGCATG GGCAAATAGA CAACAACCAA ATAACCAACA AGCAGGCGTG CAACATTTAA 11820 

ATGTCGATGT CACATATCCA GGTATTTCAG CTGCTAAACG AGTTCCTGTT ACTGTTAATG 11880 

TATATCAATT TGAATTCCCT CAAACTACTT ATACGACAAC GGTTGGAGGC ACTTTAGCAA 11940 

GTGGTACGCA AGCATCAGGA TATGCACATA TGCAAAATGC TACTGGTTTA CCAACAGATG 12000 
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TGAATAAACC GAATGTGGCT AAAGTCGTTA ACGCAAAATA TGACGTCATC TATAACGGAC ' 12120 

ATACTTTTGC AACATCTTTA CCAGCGAAAT TTGTAGTAAA AGATGTGCAA CCAGCGAAAC 12180

CAACTGTGAC TGAAACAGCG GCAGGAGCGA TTACAATTGC ACCTGGAGCA AACCAAACAG 12240

TGAATACACA TGCCGGTAAC GTAACGACAT ACGCTGATAA ATTAGTTATT AAACGTAATG 12300 

GTAACGTTGT GACGACATTT ACACGTCGCA ATAATACGAG TCCATGGGTG AAAGAAGCAT 12360 

CTGCAGCAAC TGTAGCAGGT ATTGCTGGAA CTAATAATGG TATTACTGTT GCAGCAGGTA 12420 

CTTTCAACCC TGCTGATACA ATTCAAGTTG TTGCAACGCA AGGAAGCGGA GAGACAGTGA 12480 

GTGATGAGCA ACGTAGTGAT GATTTCACAG TTGTCGCACC ACAACCGAAC CAAGCGACTA 12540 

CTAAGATTTG GCAAAATGGT CATATTGATA TCACGCCTAA TAATCCATCA GGACATTTAA 12600 

TTAATCCAAC TCAAGCAATG GATATTGCTT ACACTGAAAA AGTGGGTAAT GGTGCAGAAC 12660 

ATAGTAAGAC AATTAATGTT GTTCGTGGTC AAAATAATCA ATGGACAATT GCGAATAAGC 12720

CTGACTATGT AACGTTAGAT GCACAAACTG GTAAAGTGAC GTTCAATGCC AATACTATAA 12780 

AACCAAATTC ATCAATCACA ATTACTCCGA AAGCAGGTAC AGGTCACTCA GTAAGTAGTA 12840 

ATCCAAGTAC ATTAACTGCA CCGGCAGCTC ATACTGTCAA CACAACTGAA ATTGTGAAAG 12900

ATTATGGTTC AAATGTAACA GCAGCTGAAA TTAACAATGC AGTTCaAGTT GCTAATAAAC 12960 

GTACTGCAAC GATTAAAAAT GGCACAGCAA TGCCTACTAA TTTAGCTGGT GGTAGCACAA 13020 

CGACGATTCC TGTGACAGTA ACTTACAATG ATGGTAGTAC TGAAGAAGTA CAAGAGTCCA 13080

TTTTCACAAA AGCGGATAAA CGTGAGTTAA TCACAGCTAA AAATCATTTA GATGATCCAG 13140 

TAAGCACTGA AGGTAAAAAG CCAGGTACAA TTACGCAGTA CAATAATGCA ATGCATAATG 13200 

CGCAACAACA AATCAATACT GCGAAAACAG AAGCACAACA AGTGATTAAT AATGAGCGTG 13260 

CAACACCACA ACAAGTTTCT GACGCACTAA CTAAAGTTCG TGCAGCACAA ACTAAGATTG 13320 

ATCAAGCTAA AGCATTACTT CAAAATAAAG AAGATAATAG CCAATTAGTA ACGTCTAAAA 13380

ATAACTTACA AAGTTCTGTG AACCAAGTAC CATCAACTGC TGGTATGACG CAACAAAGTA 13440

TTGATAACTA TAATGCGAAG AAGCGTGAAG CAGAAACTGA AATAACTGCA GCTCAACGTG 13500 

TTATTGACAA TGGCGATGCA ACTGCACAAC AAATTTCAGA TGAAAAACAT CGTGTCGATA 13560

ACGCATTAAC AGCATTAAAC CAAGCGAAAC ATGATTTAAC TGCAGATACA CATGCCTTAG 13620 

AGCAAGCAGT GCAACAATTG AATCGCACAG GTACAACGAC TGGTAAGAAG CCGGCAAGTA 13680 

TTACTGCTTA CAATAATTCG ATTCGTGCAC TTCAAAGTGA CTTAACAAGT GCTAAAAATA 13740

GCGCTAATGC TATTATTCAA AAGCCAATAA GAACAGTACA AGAAGTGCAA TCTGCGTTAA 13800
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CTGATAATAG TGCTTTAAAA ACTGCTAAGA CGAAACTTGA TGAAGAAATC AATAAATCAG 13920 

TAACTACTGA TGGTATGACA CAATCATCAA TCCAAGCATA TGAAAATGCT AAACGTGCGG 13980 

GTCAAACAGA ATCAACAAAT GCACAAAATG TTATTAACAA TGGTGATGCG ACTGACCAAC 14040

AAATTGCCGC AGAAAAAACA AAAGTAGAAG AAAAATATAA TAGCTTAAAA CAAGCAATTG 14100 

CTGGATTAAC TCCAGACTTG GCACCATTAC AAACTGCAAA AACTCAGTTG CAAAATGATA 14160 

TTGATCAGCC AACGAGTACG ACTGGTATGA CAAGCGCATC TATTGCAGCA TTTAATGAAA 14220 

AACTTTCAGC AGCTAGAACT AAAATTCAAG AAATTGATCG TGTATTAGCC TCACATCCAG 14280 

ATGTTGCGAC AATACGTCAA AACGTGACAG CAGCGAATGC CGCTAAATCA GCACTTGATC 14340 

AAGCACGTAA TGGCTTAACA GTCGATAAAG CGCCTTTAGA AAATGCGAAA AATCAACTAC 14400 

AACATAGTAT TGACACGCAA ACAAGTACAA CTGGTATGAC ACAAGACTCT ATAAATGCAT 14460 

ACAATGCGAA GTTAACAGCT GCACGTAATA AGATTCAACA AATCAATCAA GTATTAGCAG 14520

GTTCACCGAC TGTAGAACAA ATTAATACAA ATACGTCTAC AGCAAATCAA GCTAAATCTG 14580

ATTTAGATCA TGCACGTCAA GCTTTAACAC CAGATAAAGC GCCGCTTCAA ACTGCGAAAA 14640

CGCAATTAGA ACAAAGCATT AATCAACCAA CGGATACAAC AGGTATGACG ACCGCTTCGT 14700

TAAATGCGTA CAACCAAAAA TTACAAGCAG CGCGTCAAAA GTTAACTGAA ATTAATCAAG 14760

TGTTGAATGG CAACCCAACT GTCCAAAATA TCAATGATAA AGTGACAGAG GCAAACCAAG 14820

CTAAGGATCA ATTAAATACA GCACGTCAAG GTTTAACATT AGATAGACAG CCAGCGTTAA 14880

CAACATTACA TGGTGCATCT AACTTAAACC AAGCACAACA AAATAATTTC ACGCAACAAA 14940 

TTAATGCTGC TCAAAATcAT GctGCGCTTG AAACAATTAA GTCTAACATT ACGGCTTTAA 15000 

ATACTGCGAT GACGAAATTA AAAGACAGTG TTGCGGATAA TAATACAATT AAATCAGATC 15060 

AAAATTACAC TGACGCAACA CCAGCTAATA AACAAGCGTA TGATAATGCA GTTAATGCGG 15120 

CTAAAGGTGT CATTGGAGAA ACGACTAATC CAACGATGGA TGTTAACACA GTGAACCAAA 15180 

AAGCAGCATC TGTTAAATCG ACGAAAGATG CTTTAGATGG TCAACAAAAC TTACAACGTG 15240 

CGAAAACAGA AGCAACAAAT GCGATTACGC ATGCAAGTGA TTTAAACCAA GCACAAAAGA 15300 

ATGCATTAAC ACAACAAGTG AATAGTGcAC AAAACGTGCA AGCAGTAAAT GATATTAAAC 15360

AAACGACTCA AAGCTTAAAT ACTGGTATGA CAGGTTTAAA ACGTGGCGTT GCTAATCATA 15420 

ACCAAGTCGT ACAAAGTGAT AATTATGTCA ACGCAGATAC TAATAAGAAA AATGATTACA 15480 

ACAATGCATA CAACCATGCG AATGACATTA TTAATGGTAA TGCACAACAT CCAGTTATAA 15540

CACCAAGTGA TGTTAACAAT GCTTTATCAA ATGTCACAAG TAAAGAACAT GCATTGAATG 15600
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ATTTAAATAA TGCACAACGT CAAAACTTAC AATCGCAAAT TAATGGTGCG CATCAAATTG 15720

ATGCAGTTAA TACAATTAAG CAAAATGCAA CAAACTTGAA TAGTGCAATG GGTAACTTAA 15780

GACAAGCTGT TGCAGATAAA GATCAAGTGA AACGTACAGA AGATTATGCG GATGCAGATA 15840

CAGCTAAACA AAATGCATAT AACAGTGCAG TTTCAAGTGC CGAAACAATC ATTAATCAAA 15900

CAACAAATCC AACGATGTCT GTTGATGATG TTAATCGTGC AACTTCAGCT GTTACTTCTA 15960

ATAAAAATGC ATTAAATGGT TATGAAAAAT TAGCACAATC TAAAACAGAT GCTGCAAGAG 16020

CAATTGATGC ATTACCACAT TTAAATAATG CACAAAAAGC AGATGTTAAA TCTAAAATTA 16080

ATGCTGCATC AAATATTGCT GGCGTAAATA CTGTTAAACA ACAAGGTACA GATTTAAATA 16140

CAkCGATGGg TAACTTGCAA GGTGCAATCA ATGATGAACA AACGACGCTT AATAGTCAAA 16200

ACTATCAAGA TGCGACACCT AGTAAGAAAA CAGCATACAC AAATGCGGTA CAAGCTGCGA 16260

AAGATATTTT AAATAAATCA AATGGTCAAA ATAAAACGAA AGATCAAGTT ACTGAAGCGA 16320

TGAATCAAGT GAATTCTGCT AAAAATAACT TAGATGGTAC GCX3TTTATTA GATCAAGCGA 16380

ilCAAaCAGCA AAACAGCAGT TAAATAATAT GACGCATTTA ACAACTGCAC AAAAAACGAA 16440

TTTAACAAAC CAAATTAATA GTGGTACTAC TGTCGCTGGT GTTCAAACGG TTCAATCAAA 16500

TGCCAATACA TTAGATCAAG CCATGAATAC GTTAAGACAA AGTATTGCCA ACAAAGATGC 16560

GACTAAAGCA AGTGAAGATT ACGTAGATGC TAATAATGAT AAGCAAACAG CATATAACAA 16620

CGCAGTAGCT GCTGCTGAAA CGATTATTAA TGCTAATAGT AATCCAGAAA TGAATCCAAG 16680

TACGATTACA CAAAAAGCAG AGCAAGTGAA TAGTTCTAAA ACGGCACTTA ACGGTGATGA 16740

AAACTTAGCT GCTGCAAAAC AAAATGCGAA AACGTACTTA AACACATTGA CAAGTATTAC 16800

AGATGCTCAA AAGAACAATT TGATTAGTCA AATTACTAGT GCGACAAGAG TGAGTGGTGT 16860

TGATACTGTA AAACAAAATG CGCAACATCT AGACCAAGCT ATGGCTAGCT TACAGAATGG 16920

TATTAACAAC GAATCTCAAG TGAAATCATC TGAGAAATAT CGTGATGCTG ATACAAATAA 16980

ACAACAAGAG TATGATAATG CTATTACTGC AGCGAAAGCG ATTTTAAATA AATCGACAGG 17040

TCCAAACACT GCGCAAAATG CAGTTGAAGC AGCATTACAA CGTGTTAATA ATGCGAAAGA 17100

TGCATTGAAT GGTGATGCAA AATTAATTGC AGCTCAAAAC GCAGCGAAAC AACATTTAGG 17160

TACTTTAACG CATATCACTA CAGCTCAACG TAATGATTTA ACAAATCAAA TTTCACAAGC 17220

TACAAACTTA GCTGGTGTTG AATCTGTTAA ACAAAATGCG AATAGTTTAG ATGGTGCTAT 17280

GGGTAACTTA CAAACGGCTA TCAACGATAA GTCAGGAACA TTAGCGAGCC AAAACTTCTT 17340

GGATGCTGAT GAGCAAAAAC GTAATGCATA CAATCAAGCT GTATCAGCAG CCGAAACCAT 17400
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TGTTAATAAT GCGAAACATG CATTAAATGG TACGCAAAAC TTAAACAATG CGAAACAAGC 17520 

AGCGATTACA GCAATCAATG GCGCATCTGA TTTAAATCAA AAACAAAAAG ATGCATTAAA 17580 
AGCACAAGCT AATGGTGCTC AACGCGTATC TAATGCACAA GATGTACAGC ACAATGCGAC 17640

TGAACTGAAC ACGGCAATGG GCACATTAAA ACATGCCATC GCAGATAAGA CGAATACGTT 17700 

AGCAAGCAGT AAATATGTTA ATGCCGATAG CACTAAACAA AATGCTTACA CAACTAAAGT 17760 

TACCAATGCT GAACATATTA TTAGCGGTAC GCCAACGGTT GTTACGACAC CTTCAGAAGT 17820 

AACAGCTGCA GCTAATCAAG TAAACAGCGC GAAACAAGAA TTAAATGGTG ACGAAAGATT 17880

ACGTGAAGCA AAACAAAACG CCAATACTGC TATTGATGCA TTAACACAAT TAAATACACC 17940 

TCAAAAAGCT AAATTAAAAG AACAAGTGGG ACAAGCCAAT AGATTAGAAG ACGTACAAAC 18000

TGTTCAAACA AATGGACAAG CATTGAACAA TGCAATGAAA GGCTTAAGAG ATAGTATTGC 18060 

TAACGAAACA ACAGTCAAAA CAAGTCAAAA CTATACAGAC GCAAGTCCGA ATAACCAATC 18120 

AACATATAAT AGCGCTGTGT CAAATGCGAA AGGTATCATT AATCAAACTA ACAATCCGAC 18180 

TATGGATACT AGTGCGATTA CCCAAGCTAC AACACAAGTG AATAATGCTA AAAATGGTTT 18240 

AAACGGTGCT GAAAACTTAA GAAATGCACA AAACACTGCT AAGCAAAACT TAAATACATT 18300

ATCACACTTA ACAAATAACC AAAAATCTGC CATCTCATCA CAAATTGATC GTGCAGGTCA 18360 

TGTGAGTGAG GTAACTGCTA CTAAAAATGC AGCAACTGAG TTGAATACGC AAATGGGTAA 18420 

CTTGGAACAA GCTATCCATG ATCAAAACAC AGTTAAACAA AGTGTTAAAT TTACTGATGC 18480

AGATAAAGCT AAACGTGATG CGTATACAAA TGCGGTAAGG AGAGCTGAAG CAATTCTGAA 18540 

TAAAACGCAA GGTGCAAATA CGTCTAAACA AGATGTTGAA GCGGCTATTC AAAATGTTTC 18600 

AAGTGCTAAA AATGCATTGA ATGGTGATCA AAACGTTACA AATGCGAAGA ATGCAGCTAA 18660 

AAATGCATTA AATAACTTAA CGTCAATTAA TAATGCACAA AAACGTGACT TAACAACTAA 18720 

AATTGATCAA GCAACAACTG TAGCTGGTGT TGAAGCTGTA TCTAATACGA GTACACAATT 18780

GAAtACAGCG ATGGCTAACT TGCAAAATGG TATTAATGAT AAAACAAATA CACTAGCAAG 18840

TGAAAACTAT CATGATGCTG ATTCAGATAA GAAAACTGCT TATACTCAAG CCGTTACGAA 18900

CGCAGAAAAT ATTTTAAATA AAAATAGTGG ATCAAATTTA GACAAAACTG CCGTTGAAAA 18960 

CGCGTTGTCA CAAGTTGCTA ATGCGAAAGG TGCCCTAAAT GGTAACCATA ATTTAGAGCA 19020

AGCTAAATCA AATGCAAACA CTACTATAAA CGGACTTCAA CATTTAACAA CTGCTCAAAA 19080 

AGATAAATTG AAACAACAAG TGCAACAAGC ACAAAATGTT GCAGGTGTAG ATACTGTTAA 19140

ATCAAGTGCC AACACATTAA ATGGTGCTAT GGGTACGTTA AGAAATAGCA TACAAGATAA 19200 
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TAACAATGCT GTTGATAGTG CTAATGGTGT CATTAATGCA ACAAGCAATC CAAATATGGA
TGCTAATGCA ATTAACCAAA TCGCTACACA AGTGACATCA ACGAAAAATG CATTAGATGG
TACACATAAT TTAACGCAAG CGAAACAAAC AGCAACAAAT GCCATCGATG GTGCTACTAA
CTTAAATAAA GCGCAAAAAG ATGCGTTAAA AGCACAAGTT ACAAGTGCGC AACGTGTTGC
AAATGTAACA AGTATCCAAC AAACTGCAAA TGAACTTAAT ACAGCTATGG GTCAATTACA
ACATGGTATT GATGATGAAA ATGCAACAAA ACAAACTCAA AAATATCGTG ACGCTGAACA
AAGTAAGAAA ACTGCTTATG ATCAAGCTGT AGCTGCTGCG AAAGCAATTT TAAATAAACA
AACAGGTTCA AATTCAGATA AAGCAGCAGT TGACCGTGCA TTACAACAAG TAACAAGTAC
GAAAGATGCA TTGAATGGTG ATGCAAAACT GGCAGAAGCG AAAGCGGCAG CTAAACAAAA
CTTAGGCACT TTAAACCATA TTACGAATGC ACAACGTACT GACTTAGAAG GCCAAATCAA
TCAAGCGACG ACTGTTGATG GCGTTAATAC TGTAAAAACA AATGCCAATA CATTAGACGG
CGCAATGAAT AGCTTACAAG GTTCAATCAA TGATAAAGAT GCGACATTAA GAAATCAAAA
TTATCTTGAT GCGGATGAAT CAAAACGAAA TGCATATACG CAAGCTGTCA CAGCGGCTGA
AGGCATTTTA AATAAACAAA CTGGTGGTAA CACATCTAAA GCAGACGTTG ATAATGCATT
AAATGCAGTT ACAAGAGCGA AAGcGgCTTT AAATGGTGCT GACAACTTAA GAAATGCGAA
AACTTCAGCA ACAAATACGA TTGATGGTTT ACCTAACTTA ACACAATTAC AAAAAGACAA
CTTGAAGCAT CAAGTTGAaC AAGCGCAAAA TGTAGCAGGT GTAAATGGTG TTAAAGATAA
AGGTAATACG TTAAATACTG CCATGGGTGC ATTACGTACA AGTATCCAAA ATGATAATAC
GACGAAAACA AGTCAAAATT ATCTTGATGC ATCTGACAGC AACAAAAATA ATTACAATAC
TGCTGTAAAT AATOCAAATG GTGTTATTAA TGCAACGAAC AATCCAAATA TGGATGCTAA
TGCGATTAAT GGCATGGCAA ATCAAGTCAA TACAACAAAA GCAGCGTTAA ATGGTGCACA
AAACTTAGCT CAAGCTAAAA CAAATGCGAC GAACACAATT AACAACGCAC ATGACTTAAA
CCAAAAACAA AAAGATGCAT TAAAAACACA AGTTAACAAT GCACAACGTG TATcTGATGC
AAATAACGTT CAACACACTG CAACTGAATT GAACAGTGCG ATGACAGCAC TTAAAGCAGC
TATTGCTGAT AAAGAAAGAA CAAAAGCAAG CGGTAATTAT GTCAATGCTG ATCAAGAAAA
ACGTCAAGCG TATGATTCAA AAGTGACTAA CGCTGAAAAT ATCATTAGTG GTACACCGAA
TGCGACATTA ACAGTCAATG ACGTAAATAG TGCGGCATCA CAAGTCAATG CGGCTAAAAC
AGCATTAAAT GGTGATAACA ACTTACGTGT AGCGAAAGAG CATGCCAACA ATACAATTGA
CGGCTTAGCA CAATTGAATA ATGCACAAAA AGCAAAATTA AAAGAACAAG TTCAAAGTGC
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GAAAGGCTTA AGAGATAGTA TTGCGAATGA AGCAACAATT AAAGCAGGTC AAAACTACAC 
TGACGCAAGT CCAAATAATC GTAACGAGTA CGACAGTGCA GTTACTGCAG CAAAAGCAAT 
CATTAATCAA ACATCGAACC CAACGATGGA ACCAAATACT ATTACGCAAG TAACATCACA 
AGTGACAACT AAAGAACAGG CATTAAATGG TGCGCGAAAC TTAGCTCAAG CTAAGACAAC 
TGCGAAAAAC AACTTGAATA ACTTAACATC AATTAACAAT GCACAAAAAG ATGCGTTAAC 
GCGTAgcATT GATGGTGCAA CAACAGTAGC TGGTGTAAAT CAAGAAACTG CAAAAGCAAC 
AGAATTAAAT AACGCAATGC ATAGTTTACA AAATGGTATC AATGATGAGA CACAAACAAA 
ACAAACTCAG AAATACCTAG ATGCAGAGCC AAGTAAGAAA TCAGCTTATG ATCAAGCAGT 
AAATGCAGCG AAAGCAATTT TAACAAAAGC TAGTGGTCAA AATGTAGACA AAGCAGCAGT 
TGAACAAGCA TTGCAAAATG TGAACAGTAC GAAGACGGCG TTGAACGGTG ATGCGAAATT 
AAATGAAGCT AAAGCAGCTG CGAAACAAAC GTTAGGTACA TTAACACACA TTAATAATGC 
ACAACGTACA GCGTTAGACA ATGAAATTAC ACAAGCAACA AATGTTGAAG GTGTTAATAC 
AGTTAAAGCC AAAGCGCAAC AATTAGATGG TGCTATGGGT CAATTAGAAA CATCAATTCG 
TGATAAAGAC ACGACGTTAC AAAGTCAAAA TTATCAAGAT GCTGATGATG CTAAACGAAC 
TGCTTATTCT CAAGCAGTAA ATGCAGCAGC AACTATTTTA AATAAAACAg CTGGCGGTAA 
TACACCTAAA GCAGATGTTG AAAGAGCAAT GCAAGCTGTT ACACAAGCAA ATACTGcATT 
AAACGGTATT CAtnAACTTAG ATCGTGCGAA ACArGCTGCT AACACAGCGA TTACAAATGC 
TTCGGACTTA AATACAAAAC mAAAAGAAGC ATTAAAAgCA CAAGTAACAA GTGCAGGACG 
TGTATCTGCA GCAAATGGTG TTGAACATAC TGCGACTGAA TTAAATACTG CGATGACAGC 
TTTAAAGCGT GCCATTGCTG ATAAAGCTGA GACAAAAGCT AGTGGTAACT ATGTCAATGC 
TGATCJCGAAT AAACGTCAAG CATATGATGA AAAAGTTACA GCTGCCGAAA ATATCGTTAG 
TGGTACACCA ACACCAACGT TAACACCAGC AGATGTTACA AATGCAGCAA CGCAAGTAAC 
GAATGCTAAG ACGCAGTTAA ACGGTAATCA TAATTTAGAA GTAGCGAAAC AAAATGCTAA 
CACTGCAATT GATGGTTTAA CTTCTTTAAA TGGTCCGCAA AAAGCAAAAC TTAAAGAACA 
AGTGGGTCAA GCGACGACGT TGCCAAATGT TCAAACTGTT CGTGATAATG CACAAACATT 
AAACACTGCA ATGAAAGGTC TACGAGATAG CATTGCGAAT GAAGCAACGA TTAAAGCAGG 
TCAAAACTAC ACAGATGCAA GTCAAAACAA ACAAACTGAC TACAACAGTG CAGTCACTGC 
AGCAAAAGGA ATCATTGGTC AAACAACTAG TCCATCAATG AATGCGCAAG AAATTAATCA 
AGCGAAAGAC CAAGTGACAG CTAAACAACA AGCGTTAAAC GGTCAAGAAA ACTTAAGAAC 
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AGATGCAGTG AAACGTCAAA TCGAAGGTGC AACGCATGTT AATGAAGTAA CACAAGCACA 22920 

AAATAATGCG GATGCaTTAA ATACAGCTAT GACGAACTTG AAAAATGGTA TTCAAGATCA 22980 

GAATACGATT AAGCAAGGTG TTAACTTCAC TGATGCCGAC GAAGCGAAAC GTAATGCATA 23040

TACAAATGCA GTGACGCAAG CTGAACAAAT TTTAAATAAA GCACAAGGTC CAAATACTTC 23100 

AAAAGACGGT GTCGAAACTG CGTTAGAaAA TGTACAACGT GCTAAAAACG AATTGAACGG 23160 

TAATCAAAAT GTTGCGAACG CTAAGACAAC TGCGAAAAAT GCATTGAATA ACCTAACATC 23220 

AATTAATAAT GCACAAAAAG AAGCATTGAA ATCACAAATT GAAGGTGCGA CAACAGTTGC 23280 

AGGTGTAAAT CAAGTGTCTA CAACGGCATC TGAATTAAAT ACAGCAATGA GCAACTTACA 23340 

AAATGGTATT AATGATGAAG CAGCTACAAA AGCAGCGCTT AATGGTACTC AAAACCTTGA 23400

AAAAGCTAAA CAACACGCAA ATACAGCAAT TGACGGTTTA AGCCATTTAA CAAATGCACA 23460 

AAAAGAGGCA TTAAAACAAT TGGTACAACA ATCGACTACT GTTGCAGAAG CACAAGGTAA 23520 

TGAGCAAAAA GCAAACAATG TTGATGCAGC AATGGACAAA TTACGTCAAA GTATTGCAGA 23580 

TAATGCGACA ACAAAACAAA ACCAAAATTA TACTGATGCA AGTCAGAATA AAAAGGATGC 23640

GTACAATAAT GCTGTCACAA CTGCACAAGG TATTATTGAT CAAACTACAA GTCCAACTTT 23700

AGATCCGACT GTTATCAATC AAGCTGCTGG ACAAGTAAGC ACAACTAAAA ATGCATTAAA 23760

TGGTAATGAA AACCTAGAGG CAGCGAAACA ACAAGCGTCA CAATCATTAG GTTCATTAGA 23820

TAACTTAAAT AATGCGCAAA AACAAACAGT TACTGATCAA ATTAATGGCG CGCATACTGT 23880 

TGATGAAGCA AATCAAATTA AGCAAAATGC GCAAAACTTA AATACAGCGA TGGGTAACTT 23940 

GAAACAAGCG ATAGcTGACA AAGATGCTAC GAAAGCGACA GTTAACTTCA CTGATGCAGA 24000

TCAAGCAAAA CAACAAGCAT ATAACaCTGC TGTTACAAAT GCTGAAAATA TCATTTCAAA 24060

AGCTAATGGC GGCAATGCAA CACAAGCTGA AGTTGAACAA GCAATCAAAC AAGTTAATGC 24120 

TGCAAAACAA GCATTAAATG GTAATGCCAA CGTTCAACAT GCAAAAGACG AAGCAACAGC 24180 

ATTAATTAAT AGCTCTAATG ACCTTAACCA AGCACAAAAA GACGCATTAA AACAACAAGT 24240

TCAAAATGCA ACTACTGTAG CTGGTGTAAA CAATGTTAAA CAAACAGCAC AAGAGTTAAA 24300 

CAATGCTATG ACACAATTAA AACAAGGCAT TGCAGATAAA GAACAAACAA AAGCTGATGG 24360 

TAACTTTGTC AATGCAGATC CTGATAAGCA AAATGCATAT AATCAAGCAG TAGCGAAAGC 24420 

TGAAGCATTA ATTAGTGctA CGCCTGATGT TGTCGTTACA CCTAGCGAAA TTACTGCAGC 24480 

GTTAAATAAA GTTACGCAAG CTAAAAATGA TTTAAATGGT AATACAAACT TAGCAACGGC 24540 

GAAACAAAAT GTTCAACATG CTATTGATCA ATTGCCAAAC TTAAACCAAG CGCAACGTGA 24600 
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AGCGGCGACA ACGCTTAATG ACGCGATGAC ACAATTGAAA CAAGGTATTG CGAATAAAGC 24720

ACAAATTAAA GGTAGCGAGA ACTATCACGA TGCTGATACT GACAAGCAAA CAGCATATGA 24780

TAATGCAGTA ACAAAAGCAG AAGAATTGTT AAAACAAACA ACAAATCCAA CAATGGATCC 24840

AAATACAATT CAACAAGCAT TAACTAAAGT GAATGACACA AATCAAGCAC TTAACGGTAA 24900

TCAAAAATTA GCTGATGCCA AACAAGATGC TAAGACAACA CTTGGTACAC TAGATCATTT 24960

AAATGATGCT CAAAAACAAG CGCTAACAAC TCAAGTTGAA CAAGCACCAG ATATTGCAAC 25020

AGTTAATAAT GTTAAGCAAA ATGCTCAAAA TCTGAATAAT GCTATGACTA ACTTAAACAA 25080

TGCATTACAA GATAAAACTG AGACATTAAA TAGCATTAAC TTTACTGATG CAGATCAAGC 25140

TAAGAAAGAT GCTTATACTA ATGCGGTTTC ACATGCAGAA GGTATTTTAT CTAAAGCAAA 25200

TGGCAGCAAT GCAAGTCAAA CTGAAGTGGA ACAAGCGATG CAACGTGTGA ACGAAGCGAA 25260

ACAAGCATTG AATGGTAATG ACAATGTACA ACGTGCAAAA GATGCAGCGA AACAAGTGAT 25320

TACAAATGCA AATGATTTAA ATCAAGCAAT GACACAATTG AAACAAGGTA TTGCAGATAA 25380

AGACCAAACT AAAGCAAATG GTAACTTTGT CAATGCTGAT ACTGATAAGC AAAATGCTTA 25440

CAACAATGCG GTAGCACATG CTGAACAAAT AATTAGTGGT ACACCAAATG CAAACGTGGA 25500

TCCACAACAA GTGGCTCAAG CGTTACAACA AGTGAATCaA GCTAAGGGTG ATTTAAACGG 25560

TAACCATAAC TTACAAGTTG CTAAAGACAA TGCAAATACA GCCATTGATC AGTTACCAAA 25620

CTTAAATCAA CCACAAAAAA CAGCATTAAA AGACCAAGTG TCGCATGCAG AACTTGTTAC 25680

AGGTGTTAAT GCTATTAAGC AAAATGCTGA TGCGTTAAAT AATGcAATGG GTACATTGAA 25740

ACAACAAATT CAAGCGAACA GTCAAGTACC ACAGTCAGTT GACTTTACAC AAGCGGATCA 25800

AGACAAACAA CAAGCATATA ACAATGCGGC TAACCAAGCG CAACAAATCG CAAATGGCAT 25860

ACCAACACCT GTATTGACGC CTGATACAGT AACACAAGCA GTGACAACTA TGAATCAAGC 25920

GAAAGATGCA TTAAACGGTG ATGAAAAATT AGCACAAGCG AAACAAGAAG CTTTAGCAAA 25980

TCTTGATACG TTACGCGATT TAAATCAACC ACAACGTGAT GCATTACGTA ACCAAATCAA 26040

TCAAGCACAA GCGTTAGCTA CAGTTGAACA AACTAAACAA AATGCACAAA ATGTGAATAC 26100

aGCaATGAGT AACTTGAAAC aAGGTATTGC aAACAAAGAT ACTGTCAAAG CAAGTGAGAA 26160

CTATCATGAT GCTGATGCCG ATAAGCAAAC AGCATATACA AATGCAGTGT CTCAAGCGGA 26220

AGGTATTATC AATCAAACGA CAAATCCAAC GCTTAACCCA GATGAAATAA CACGTGCATT 26280

AACTCAAGTG ACTGATGCTA AAAATGGCTT AAACGGTGAA GCTAAATTGG CAACTGAAAA 26340

GCAAAATGCT AAAGATGCCG TAAGTGGGAT GACGCATTTA AACGATGCTC AAAAACAAGC 26400
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AGCAACGAGC CTAGATCAAG CAATGGATCA ATTATCACAA GCTATTAATG ATAAAGCTCA 26520 

AACATTAGCG GACGGTAATT ACTTAAATGC AGATCCTGAC AAACAAAATG CGTATAAACA 26580 

GGCAGTAGCA AAAGCTGAAG CATTATTGAA TAAACAAAGT GGTACTAATG AAGTACAAGC 26640 

ACAAGTTGAA AGCATCACTA ATGAAGTGAA CGCAGCGAAA CAAGCATTAA ATGGTAATGA 26700 

CAATTTGGCA AATGCAAAAC AACAAGCAAA ACAACAATTG GCGAACTTAA CACACTTAAA 26760 

TGATGCACAA AAACAATCAT TTGAAAGTCA AATTACACAA GCGCCACTTG TTACAGATGT 26820 

CACTACGATT AATCAAAAAG CACAAACGTT AGATCATGCG ATGGAATTAT TAAGAAATAG 26880 

TGTTGCGGAT AATCAAACGA CATTAGCGTC TGAAGATTAT CATGATGCAA CTGCGCAAAG 26940 

ACAAAATGAC TATAACCAAG CTGTAACAGC TGCTAATAAT ATAATTAATC AAACTACATC 27000 

GCCTACGATG AATCCAGATG ATGTTAATGG TGCAACGACA CAAGTGAATA ATACGAAAGT 27060

TGCATTAGAT GGTGATGAAA ACCTTGCAGC AGCTAAACAA CAAGCAAACA ACAGACTTGA 27120 

TCAATTAGAT CATTTGAATA ATGCGCAAAA GCAACAGTTA CAATCACAAA TTACGCAATC 27180

ATCTGATATT GCTGCAGTTA ATGGTCACAA ACAAACAGCA GAATCTTTAA ATACTGCGAT 27240 

GGGTAACTTA ATTAATGCGA TTGCAGATCA TCAAGCCGTT GAACAACGTG GTAACTTCAT 27300 

CAATGCTGAT ACTGATAAAC AAACTGCTTA TAATACAGCG GTAAATGAAG CAGCAGCAAT 27360 

GATTAACAAA CAAACTGGTC AAAATGCGAA CCAAACAGAA GTAGAACAAG CTATTACTAA 27420 

AGTTCAAACA ACACTTCAAG CGTTAAATGG AGACCATAAT TTACAAGTTG CTAAAAGAAA 27480 

TGCGACGCAA GCAATTGATG CTTTAACAAG CTTAAATGAT CCTCAAAAAA CAGCATTAAA 27540 

AGACCAAGTT ACAGCTGCAA CTTTAGTAAC TGCAGTTCAT CAAATTGAAC AAAATGCGAA 27600 

TACGCTTAAC CAAGCAATGC ATGGTTTAAG ACAGAGCATT CAAGATAACG CAGCAACTAA 27660 

AGCAAATAGC AAATATATCA ACGAAGATCA ACCAGAGCAA CAAAACTATG ATCAAGCTGT 27720 

TCAAGCCGCA AATAATATTA TCAATGAACA AACTGCAACA TTAGATAATA ATGCGATTAA 27780 

TCAAGCAGCG ACAACTGTGA ATACAACGAA AGCAGCATTA CATGGTGATG TGAAGTTACA 27840

AAATGATAAA GATCATGCTA AGCAAACGGT TAGTCAATTA GCACATCTAA ACAATGCACA 27900 

AAAACATATG GAAGATACGT TAATTGATAG TGAAACAACT AGAACAGCAG TTAAGCAAGA 27960 

TTTGACTGAA GCACAAGCAT TAGATCAACT TATGGATGCA TTACAACAAA GTATTGCTGA 28020 

CAAAGATGCA ACACGTGCGA GCAGTGCATA TGTCAATGCA GAACCGAATA AAAAACAATC 28080 

CTATGATGAA GCAGTTCAAA ATGCTGAGTC TATCATTGCA GGATTAAATA ATCCAACTAT 28140

CAATAAAGGT AATGTATCAA GTGCGACTCA AGCAGTAATA TCATCTAAAA ATGCATTAGA 28200 
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TCAATTAACA CCAGCTCAAC AACAAGCGCT AGAAAATCAA ATTAATAATG CAACAACTCG 28320

TGATAAAGTG GCTGAAATCA TTGCACAAGC GCAAgCATtA AATGAAGCGA TGAAAGCATT 28380

AAAAGAAAGT ATTAAGGATC AACCACAAAC TGAAGCAAGT AGTAAATTTA TTAACGAGGA 28440

TCAAGCGCAA AAAGATGCTT ATACGCAAGC AGTACAACAC GCGAAAGATT TGATTAACAA 28500

AACAACTGAT CCTACATTAG CTAAATCAAT CATTGATCAA GCGACACAGG CAGTGACAGA 28560

TGCTAAAAAC AATTTACATG GTGATCAAAA ACTAGCTCAA GATAAGCAAC GTGCAACAGA 28620

AACGTTAAAT AACTTGTCTA ACTTGAATAC ACCACAACGT CAAGCACTTG AAAATCAAAT 28680

TAATAATGCA GCAACTCGTG GCGAAGTAGC ACAAAAATTA ACTGAAGCAC AAGCACTTAA 28740

CCAAGCAATG GAAGCTTTAC GTAATAGCAT TCAAGATCAA CAGCAAACGG AAGCGGGTAG 28800

CAAGTTTATC AATGAAGATA AaCCaCmAAA AGrTGCTTAC CAAGCAGCAG TTCAAAATGC 28860

AAAAGATTTA ATTAATCAAA CTAACAATCC AACGCTTGAT AAAGCACAAG TTGAACAATT 28920

GACACAAGCT GTTAACCAAG CTAAAGATAA CCTACACGGT GATCAAAAAC TTGCAGACGA 28980

TAAACAACAT GCGGTTACTG ATTTAAATCA ATTAAATGGT TTGAATAATC CGCAACGTCA 29040

AGCACTTGAA AGCCAAATAA ACAACGCAGC AACTCGTGGC GAAGTAGCAC AAAAATTAGC 29100

TGAAGCAAAA GCGCTTGATC AAGCAATGCA AGCATTACGT AATAGTATTC AAGATCAACA 29160

ACAAACAGAA TCTGGTAGCA AGTTTATCAA TGAAGATAAA CCGCAAAAAG ATGCTTACCA 29220

AGCAGCAGTT CAAAATGCAA AAGATTTAAT TAACCAAACA GGTAATCCAA CACTCGACAA 29280

ATCACAAGTA GAACAATTGA CACAAGCAGT AACAACTGCA AAAGATAATC TACATGGTGA 29340

TCAAAAACTT GCTCGTGATC AACAACAAGC AGTAACAACT GTAAATGCAT TGCCAAACTT 29400

AAATCATGCA CAACAACAAG CATTAACTGA TGCTATAAAT GCAGCGCCTA CAAGAACAGA 29460

GGTTSCACAA CATGTTCAAA CTGCTACTGA ACTTGATCAC GCGATGGAAA CATTGAAAAA 29520

TAAAGTTGAT CAAGTGAATA CAGATAAGGC TCAACCAAAT TACACTGAAG CGTCAACTGA 29580

TAAAAAAGAA GCAGTAGATC AAGCGTTACA AGCTGCAGAA AGCATTACAG ATCCAACTAA 29640

TGGTTCAAAT GCGAATAAAG ACGCTGTAGA CCAAGTATTA ACTAAGCTTC AAGAAAAAGA 29700

AAATGAGTTA AATGGTAATG AGAGAGTCGC TGAAGCTAAA ACACAAGCGA AACAAACTAT 29760

TGACCAATTA ACACATTTAA ATGCTGATCA AATTGCAACT GCTAAACAAA ACATTGATCA 29820

AGCGACGAAA CTTCAACCAA TTGCTGAATT AGTAGATCAA GCAACGCAAT TGAATCAATC 29880

TATGGATCAA TTACAACAAG CAGTTAATGA ACATGCTAAC GTTGAGCAAA CTGTAGATTA 29940

CACACAAGCA GATTCAGATA AACAAAATGC TTATAAACAA GCTATTGCTG ATGCTGAAAA 30000
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TGCAAAACAA GCATTAAATG GTGATGAACG TGTAGCACTT GCTAAAACAA ATGGTAAACA 30120 

TGACATCGAC CAATTGAATG CATTAAACAA TGCTCAACAA GATGGATTTA AAGGTCGCAT 30180 

CGATCAATCA AACGATTTAA ATCAAATCCA ACAAATTGTA GATGAGGCTA AGGCACTTAA 30240 

TCGTGCAATG GATCAATTGT CACAAGAAAT CACTGACAAT GAAGGACGCA CGAAAGGTAG 30300 

CACGAACTAT GTCAATGCAG ATACACAAGT CAAACAAGTA TATGATGAAA CGGTTGATAA 30360 

AGCGAAACAA GCACTTGATA AATCGACTGG TCAAAACTTA ACTGCAAAAC AAGTTATCAA 30420

ATTAAATGAT GCAGTCACTG CAGCTAAGAA AGCATTAAAT GGTGAAGAAA GACTTAATAA 30480

TCGTAAAGCT GAAGCATTAC AAAGATTGGA TCAATTAACA CATCTAAACA ATGCTCAAAG 30540 

ACAATTAGCA ATCCAACAAA TTAATAATGC TGAAACGCTA AATAAAGCAT CTCGAGCAAT 30600 

TAATAGAGCA ACTAAATTAG ATAATGCAAT GGGTTCAGTA CAACAATATA TTGACGAACA 30660

GCACCTTGGT GTTATCAGCA GCACAAATTA CATCAATGCA GATGACAATT TGAAAGCAAA 30720

TTATGATAAT GCAATTGCGA ATGCAGCACA TGAGTTAGAT AAAGTGCAAG GTAATGCAAT 30780

TGCaAAAGCT GAAGCAGAGC AATTGAAACA AAATATTATC GATGCTCAAA ATGCATTAAA 30840

TGGAGACCAA AACCTTGCAA ATGCCAAAGA TAAAGCAAAT GCGTTTGTTA ATTCGTTAAA 30900 

TGGATTAAAT CAACAGCAAC AAGATCTTGC ACATAAAGCA ATTAACAATG CCGATACTGT 30960 

ATCAGATGTA ACAGATATTG TTAATAATCA AATTGACTTA AATGATGCAA TGGAAACATT 31020 

GAAACATTTA GTTGACAATG AAATTCCAAA TGCAGAGCAA ACTGTCAATT ACCAAAACGC 31080 

TGACGATAAT GCTAAA 31096 
(2) INFORMATION FOR SEQ ID NO: 60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 60:

ATGACAGAAT GGGAGCGAGG ACTTAGAATG TTTCCTAAAT CAGGTTTATT AAATTTTGAG 60 

TTAGCGATAG mAAATCGTTC ATTAAATGAT GATGAAAAAG CATTAAAATA TGTGCGTAAA 120 

GCATTAAATG CAGACCCTAA AAATACAGAT TATATTAACT TAGAAAAAGA GTTGACTAAA 180

TCAAATGAGT CGAAAAATAA ATAACTTTTA TGATGTACAA CAGTTATTGA AAAGTTACGG 240

ATTTCTAATA TATTTTAAAA ATCCAGAAGA TATGTACGAA ATGATTCAAC AGGAGATTTC 300 
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TAATCAGAGA AGGAATGAAC AGAAATGACA AAAATTATTT TAGCAGCTGA TGTAGGCGGG 420 

ACGACTTGTA AATTAGGTAT TTTCACACCT GAATTAGAAC AATTACATAA ATGGTCTATT 480

CACACTGATA CATCTGATAG TACAGGATAT ACACTTTTGA AAGGAATTTA TGATTCGTTT 540

GTTGAAAAAG TAAATGAAAA TAATTATAAT TTTTCAAATG TACTTGGCGT AGGTATTGGT 600 

GTACCAGGTC CTGTTGACTT TGAAAAAGGT ACAGTAAATG GAGCAGTAAA CTTATATTGG 660 

CCAGAAAAAG TTAATGTACG TGAGATTTTT GAACAATTCG TTGATTGTCC AGTGTATGTA 720 

GATAATGATG CTAACATAGC TGCTTTAGGG GaGAAACACA AAGGTGCTGG TGAAGGTGCC 780 

GATGATGTTG TTGCCATCAC ACTTGGTACA GGTCTAGGTG GAGGAATTAT TTCCAAATGG 840 

TGAAATCGTA CATGGTCATA ATGGCTCtGG CGCAGAAATA GGTCATTTTA GAgCAGACTT 900 

CgATCAACGA TTTaAATGTA ATTGTGGTCG TTCTGGATGT ATTGAAACAG TTGCTTCaGC 960 

GACAGGCGTT GTTAACTTAG TTAACTTCtA CTATCCGAAG TTGACGTTTA GATCTTCTAT 1020

ATTAGAATTG ATTAAAGAAA ATAAGGTtAC aGCAAAAGCT GTTTTTGATG CGGCAAAAGC 1080 

TGGTGACCAA TTCTGTATTT TCATTACTGA AAAGGTTGCA AACTATATTG GATATTTATG 1140 

TAGTATTATT AGTGTTACAA GTAATCCGAA ATATATCGTT CTAGGTGGAG GAATGTCTAC 1200

TGCAGGACCT ATTTTAATTG AAAATATTAA AACAGAATAT CATAATTTAA CATTTGCACC 1260 

TGCTCAATTT GAAACTGAAA TTGTACAAGC GAAATTAGGT AATGATGCAG GTATTACAGG 1320 

AGCAGCAGGA TTAATCAAGA CCTATGTATT AGATAAAGAG GGGGTAAAAT AATGGCTATT 1380 

GTTGATGTGG TTGTTATTCC AGTTGGAACG GAAGGTCCGA GTGTTAGTAA ATATATTGCA 1440 

GATATTCAGA AAAAACTTCA AGAATATAAA GCAATGGGTA AAATTGATTT TCAATTAACA 1500 

CCAATGAATA CTCTAATTGA AGGTGAATTA AGCGATGTAT TAGAAGTTGT GCAAGTGATA 1560 

CATG$ATTAC CTTTTGATAA AGGTTTAAGT AGAGTTTGTA CAAATATCCG TATTGATGAC 1620 

CGACGAGACA AATCTAGAAA AATGAATGAT AAACTAACAT CAGTACAAAA ACATTTAGAA 1680 

AATAGTGGTG AAAACCTATG AGGATTTCAA GCTTAACTTT AGGCTTAGTT GATACTAATA 1740 

CGTATTTCAT CGAAAATGAC AAAGCTGTTA TTCTGATTGA CCCTTCAGGT GAAAGTGAAA 1800 

AAATTATTAA AAAATTAAAC CAAATAAATA AACCGTTAAA AGCTATTTTA TTAACACATG 1860

CACACTTTGA TCATATCGGA GCAGTCGATG ATATAGTTGA TCGATTCGAT GTCCCGGTTT 1920 

ATATGCATGA AGCAGAGTTT GATTTTCTAA AAGATCCCGT TAAAAATGGG GCAGATAAAT 1980 

TTAAGCAATA TGGATTACCA ATTATTACAA GTAAGGTAAC TCCTGAAAAG TTAAmCGAAG 2040

GTAGCACAGA AATAGAAGGA TTTAAGTTnT nAyrTGTaCA CACACCTGGA CATTCACCAG 2100 
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GAATCGGACG TACAGATTTA TATAAAGGTG ATTATGAAAC GCTAGTTGAT TCTATTCAAG 2220

ATAAAATATT TGAATTAGAA GGC 2243 
(2) INFORMATION FOR SEQ ID NO: 61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8009 base pairs 

(B) TYPE: nucleic acid 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 61:




TTGGnATCAT tyAcgGTAAA AAGAATAAaG CAAGATTtAT TTCATTAGTA CTAATTTGTG 60

CAATGTTTGC AATTTGTTGG GTTGCATATA TTCAATGGGA GTCTACAATC GCTTCATTTA 120

CACAATCTAT TAATATTTCa ATGGCACAAT ATAGTGTTTT ATGGACAATT AACGGAATAA 180

TGATTTTAGT AGCACAACCA TTAATTAAAC CGATTCTCTA TCTGTTAAAA GGAAACTTAA 240

AGAAGCAAAT GTTTGTCGGC ATCATCATTT TTATGTTGTC GTTCTTTGTC ACGAGTTTTG 300

CCGAAAACTT TACAATATTT GTTGTCGGTA TGATTATTTT AACTTTTGGA GAAATGTTTG 360

TATGGCCAGC AGTTCCAACT ATAGCCAATC AGTTAGCGCC AGATGGTAAG CAAGGACAGT 420

ACCAAGGTTT TGTGAATTCA GCTGCTACAG TAGGAAAAGC ATTTGGTCCA TTTCTTGGTG 480

GTGTATTAGT TGATGCGTTT AATATGCGCA TGATGTTTAT CGGTATGATG CTACTACTTG 540

TATTTGCATT AATATTATTA ATGGTTTTCA AGGAGAATAA TACGCAACCT AAAAAAATAG 600

ATGCATAATG AGTAAATAGA ATTAACGTTA TAGACTTGAA ATAAATGTCG TTATAACATA 660

ATATTAATTT GTATAATTTA ATTTCGTTTG GAGCTTTTCT ACAGAAAGCT AGTGATGCTG 720

AGAGCTAGTG TTAAGGACTA AATGTAAATC GTATTAATTT TAAATTGAAT GAATGACATC 780

TCTTACTATT AAAATGAGTG CACAATTTTT GTGAAATAGG GTGGTAACGC GGCAAATGTC 840

GTCCCTATGT AAATAGAATA GTTAGAGGTG TCTTTTTTAT TGAATAGGAG GAAATGTGTT 900

GAATTACAAC CACAATCAAA TTGAAAAGAA ATGGcAAGAC TATTGGGACG AAAATAAAAC 960

ATTTAAAACA AATGATAACT TAGGTCAAAA GAAATTTTAT GCTTTAGACA TGTTTCCATA 1020

TCCATCAGGT GCTGGTTTAC ATGTTGGACA TCCTGAGGGc TATACAGCAA CAGATATCAT 1080

TTCAAGATAT AAAAGAATGC AAGGATATAA TGTATTACAT CCGATGGGGT GGGATGCATT 1140

CGGATTACCA GCAGAGCAAT ATGCTTTAGA CACTGGCAAC GACCCACGTG AATTTACAAA 1200

GAAAAATATC CAAACTTTTA AACGACAAAT TAAAGAATTA GGGTTCAGTT ATGATTGGGA 1260
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GTTATATAAC AAAGGTTTAG CATACGTTGA TGAAGTTGCA GTTAACTGGT GTCCAGCATT 1380

AGGCACTGTT TTATCTAACG AAGAAGTGAT TGATGGTGTC TCTGAACGTG GTGGACATCC 1440

AGTTTATCGT AAGCCGATGA AACAATGGGT ACTTAAAATC ACAGAATATG CAGATCAATT 1500

ATTAGCAGAT TTAGATGATT TAGATTGGCC TGAGTCTTTA AAAGATATGC AGCGCAATTG 1560

GATTGGACGT TCTGAAGGGG CCAAAGTTTC ATTTGATGTA GATAATACGG AAGGAAAAGT 1620

AGAAGTATTT ACGACTAGAC CAGATACAAT CTATGGTGCA TCATTCTTAG TCTTAAGTCC 1680

TGAACATGCA TTAGTTAATT CAATTACAAC AGATGAATAT AAAGAAAAAG TAAAAGCTTA 1740

TCAAACAGAA GCTTCTAAAA AGTCAGATTT AGAACGTACA GATTTAGCAA AAGATAAATC 1800

AGGTGTATTT ACTGGTGCAT ATGCAACTAA TCCTTTATCT GGTGAAAAAG TACAAATTTG 1860

GATTGCTGAT TATGTATTAT CAACATATGG TACTGGAGCA ATTATGGCAG TACCAGCGCA 1920

TGATGACAGA GATTATGAAT TTGCTAAAAA GTTTGATTTG CCAATCATTG AAGTCATCGA 1980

AGGTGGAAAT GTTGAAGAAG CAGCATACAC TGGTGAAGGT AAACATATTA ATTCTGGTGA 2040

ACTTGATGGT TTAGAAAATG AAGCGGCAAT TACTAAAGCT ATTCAATTAT TAGAGCAAAA 2100

AGGTGCTGGC GAAAAGAAAG TTAATTACAA ATTAAGAGAT TGGTTATTCA GTCGTCAGCG 2160

TTATTGGGGC GAACCAATTC CTGTCATTCA TTGGGAAGAT GGAACAATGA CAACTGTTCC 2220

TGAAGAAGAG CTACCATTGT TGTTACCTGA AACAGATGAA ATCAAGCCAT CAGGGACTGG 2280

TGAGTCTCCA CTAGCTAATA TTGATTCATT TGTAAATGTT GTAGATGAAA AAACAGGTAT 2340

GAAAGGACGT CGTGAAACAA ATACAATGCC ACAATGGGCA GGTAGTTGTT GGTATTATTT 2400

ACGTTACATC GATCCTAAAA ATGAAAATAT GTTAGCAGAT CCTGAAAAAT TAAAACATTG 2460

GTTACCTGTT GATTTATATA TCGGTGGAGT AGAACATGCG GTTCTTCACT TATTATATGC 2520

AAGATTTTGG CATAAAGTCC TTTATGATTT GGCTATCGTA CCTACTAAAG AACCTTTCCA 2580

AAAATTATTT AACCAAGGTA TGATTTTAGG AGAAGGTAAT GAGAAGATGA GTAAATCTAA 2640

AGGAAATGTA ATCAATCCTG ATGATATAGT ACAGTCTCAT GGTGCAGATA CTTTGCGTCT 2700

TTACGAAATG TTTATGGGAC CTTTAGATGC TGCAATTGCA TGGAGTGAAA AAGGATTAGA 2760

TGGGTCTCGT CGATTCTTAG ATCGCGTATG GCGTTTAATG GTAAATGAAG ATGGGACATT 2820

GAGTTCAAAA ATTGTAACTA CAAATAATAA ATCTTTAGAT AAAGTTTATA ACCAAACTGT 2880

TAAAAAGGTA ACAGAAGACT TTGAAACATT AGGATTTAAT ACTGCTATTA GTCAATTAAT 2940

GGTATTTATT AATGAGTGTT ATAAAGTTGA TGAAGTTTAT AAACCTTACA TTGAAGGCTT 3000

CGTTAAAATG TTAGCACCTA TTGCACCACA TATCGGTGAA GAATTATGGT CAAAATTAGG 3060
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TGATGAAGTA GAAATCGTTG TTCAAGTGAA TGGTAAATTG AGAGCTAAAA TTAAAATTGC 3180 

TAAAGATACA TCAAAAGAAG AAATGCAAGA AATTGCCTTA TCTAATGACA ATGTTAAAGC 3240 

GAGTATTGAA GGTAAAGACA TCATGAAAGT CATCGCTGTT CCTCAAAAAT TAGTCAATAT 3300

TGTAGCTAAA TAATGITTTA AGGAGGACTT TGAAATGAAG TCAATTACTA CAGATGAATT 3360 

AAAAAATAAA CTTTTAGAAT CTAAACCAGT TCAAATTGTT GATGTTCGTA CTGATGAAGA 3420 

AACAGCAATG GGATATATTC CTAATGCAAA GTTAATTCCA ATGGATACCA TTCCGGATAA 3480

TTTAAATTCA TTTAATAAAA ATGAAATATA TTATATTGTA TGTGCTGGTG GAGTTCGAAG 3540

CGCTAAAGTT GTAGAATATT TAGAGGCAAA TGGCATTGAT GCCGTAAATG TCGAAGGCGG 3600 

CATGCACGCA TGGGGCGATG AAGGTTTGGA AATAAAAAGT ATTTAAAGTA GTGACATAAT 3660 

TTAAAATAAT ATTACATTTG TAATGACACC AAGTAACGTT TCGGTTGCTT GGTGTTTTTT 3720 

GGTATGAATT ACTTTCTGTT ACAAAACAAT CTAAAGCGTT CTTGTTATGT TTTATTAAGA 3780 

TTTTAATTAC AAAACGGAAA CTAAATTGTA ATAAAATAAA ACTTTATTTT ATAAAATGAT 3840 

GATGATAAAA TTGAGTGAAC TTAAAATATT GTACAAAATA ATATAGCTAT AAATATAATA 3900 

TAGCTATAAA TATAATATGA GGGAGCGTAT ATTTTTAGCA TAATTCTTAA CAACACAGCA 3960

GAGAACAGAC AACCAGGAGG AAAATGAAAT GAATTTGTTA AAGAAAAATA AATATAGTAT 4020

TAGGAAGTAT AAAGTAGGCA TATTCTCTAC TTTAATCGGA ACAGTTTTAT TACTTTCAAA 4080

CCCAAATGGT GCACAAGCCT TAACTACGGA TAATAATGTA CAAAGCGATA CTAATCAAGC 4140

AACACCTGTA AATTCACAAG ATAAAGATGT TGCTAATAAT AGAGGTTTAG CAAATAGTGC 4200

GCAGAATACA CCTAATCAAT CTGCAACAAC CAATCAAGCA ACGAATCAAG CATTGGTTAA 4260 

TCATAATAAT GGTAGTATAG TAAATCAAGC TACGCCAACA TCAGTGCAAT CAAGTACGCC 4320 

TTCAGCACAA AACAATAATC ATACAGATGG CAATACAACA GCAACTGAGA CAGTGTCAAA 4380

CGCTAATAAT AATGATGTAG TGTCGAATAA TACCGCATTA AATGTACCAA CTAAAACAAA 4440

TGAAAATGGT TCAGGAGGAC ATCTAACTTT AAAGGAAATT CAAGAAGATG TTCGTCATTC 4500

TTCAAATAAA CCAGAGCTAG TTGCAATTGC TGAACCAGCA TCTAATAGAC CGAAAAAGAG 4560

AAGTAGACGT GCGGCACCGG CAGATCCTAA TGCAACTCCA GCAGATCCAG CGGCTGCAGC 4620 

GGTAGGAAAC GGTGGTGCAC CAGTTGCAAT TACAGCGCCA TATACGCCAA CAACTGATCC 4680

TAATGCCAAT AATGCAGGAC AAAATGCACC TAACGAAGTG CTGTCATTTG ATGACAATGG 4740

TATTAGACCA AGTACCAACC GTTCTGTGCC AACAGTAAAC GTTGTTAATA ACTTGCCGGG 4800

CTTCACACTA ATCAATGGTG GCAAAGTAGG GGTGTTTAGT CATGCAATGG TAAGAACGAG 4860
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TCGTATACAT GGAACTGATA CGAATGACCA TGGCGATTTT AATGGTATCG AGAAAGCATT 4980 

AACAGTAAAT CCGAATTCTG AATTAATCTT TGAATTTAAT ACAATGACTA CTAAAAAOGG 5040

TCAAGGCGCA ACAAATGTTA TTATCAAAAA TGCTGATACT AATGATACGA TTGCTGAAAA 5100

GACTGTTGAA GGCGGTCCAA CTTTGCGTTT ATTTAAAGTA CCTGATAATG TGAGAAATCT 5160 

CAAAATTCAA TTTGTACCTA AAAATGACGC AATAACAGAT GCGCGTGGCA TTTATCAACT 5220 

AAAAGATGGT TACAAATACT ATAGCTTTGT TGACTCTATC GGACTTCATT CTGGGTCACA 5280

TGTTTTTGTT GAAAGACGAA CAATGGATCC AACAGCAACA AATAATAAAG AGTTTACTGT 5340

AACAACATCA TTAAAGAATA ATGGTAATTC TGGTGCTTCT CTAGATACAA ATGACTTTGT 5400

ATATCAAGTT CAATTACCTG AAGGTGTTGA ATATGTGAAC AATTCATTGA CTAAAGATTT 5460 

TCCAAGTAAC AATTCAGGCG TTGATGTTAA TGATATGAAT GTTACATATG ATGCAGCAAA 5520 

TCGTGTGATA ACAATTAAAA GTACTGGAGG AGGTACAGCA AACTCTCCGG CACGACTTAT 5580 


GCCTGATAAA ATACTCGATT TAAGATATAA ATTACGTGTA AATAATGTGC CGACACCAAG 5640 

AACAGTAACA TTTAACGAGA CATTAACGTA TAAAACATAT ACACAAGATT TCATTAATTC 5700 

AGCTGCAGAA AGTCATACTG TAAGTACAAA TCCATATACT ATCGATATCA TCATGAATAA 5760

AGATGCATTA CAAGCCGAAG TTGACAGACG TATTCAACAA GCTGATTATA CATTTGCGTC 5820 

ATTAGATATC TTTAATGGTC TGAAACGACG CGCACAAACG ATTTTAGATG AAAATCGTAA 5880 

CAATGTACCA TTAAATAAAA GAGTTTCTCA AGCATATATT GATTCATTAA CTAATCAAAT 5940

GCAACATACG TTAATTCGAA GTGTTGATGC TGAAAATGCA GTTAATAAAA AAGTTGACCA 6000 

AATGGAAGAT TTAGTTAATC AAAATGATGA ATTGACAGAT GAAGAAAAAC AAGCAGCAAT 6060 


ACAAGTTATC GAGGAACATA AAAATGAAAT AATTGGTAAT ATTGGTGACC AAACGACTGA 6120 

TGATGGCGTT ACTAGAATCA AAGATCAAGG TATACAGACC TTAAGTGGGG ATACTGCAAC 6180 

ACCGGTTGTT AAACCAAATG CTAAAAAAGC AATACGTGAT AAAGCAACGA AACAAAGGGA 6240 


AATTATCAAT GCAACACCAG ATGCTACTGA AGACGAGATT CAAGATGCAC TAAATCAATT 6300 

AGCTACGGAT GAAACAGATG CTATTGATAA TGTTACGAAT GCTACTACAA ATGCTGACGT 6360

TGAAACAGCT AAAAATAATG GCATCAATAC TATTGGAGCA GTTGTTCCTC AAGTAACTCA 6420

TAAAAAAGCT GCAAGAGATG CAATTAACCA AGCAACAGCA ACGAAAAGAC AACAAATAAA 6480 

TAGTAATAGA GAAGCAACTC AGGAAGAGAA AAATGCAGCA TTGAACGAAT TAACTCAAGC 6540

AACCAACCAT GCTTTAGAAC AAATCAATCA AGCAACAACA AATGCTAATG TTGATAACGC 6600

CAAAGGAGAT GGTCTAAATG CCATTAATCC AATTGCTCCT GTAACTGTTG TTAAGCAAGC 6660 
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TGATGCGACT CAAGAAGAAA GACAAGCAGC AATTGACAAA GTGAATGCTG CTGTAACTGC 6780 

AGCAAACACA AACATTTTAA ACGCTAATAC CAATGCTGAT GTTGAACAAG TAAAGACAAA 6840 

TGCGATTCAA GGAATACAAG CAATTACACC AGCTACAAAA GTAAAAACAG ATGCAAAAAA 6900

TGCCATCGAT AAAAGTGCGG AAACGCAACA TAATACGATA TTTAATAATA ATGATGCGAC 6960 

GCTCGAAGAA CAACAAGCAG CACAACAATT ACTTGATCAA GCTGTAGCCA CAGCGAAGCA 7020 

AAATATTAAT GCAGCAGATA CGAATCAAGA AGTTGCACAA GCAAAAGATC AGGGCACACA 7080

AAATATAGTA GTGATTCAAC CGGCAACACA AGTTAAAACG GATACTCGCA ATGTTGTAAA 7140

TGATAAAGCG CGAGAGGCGA TAACAAATAT CAATGCTACA ACTGGCGCGA CTCGAGAAGA 7200

GAAACAAGAA GCGATAAATC GTGTCAATAC ACTTAAAAAT AGAGCATTAA CTGATATTGG 7260 

TGTGACGTCT ACTACTGCGA TGGTCAATAG TATTAGAGAC GATGCAGTCA ATCAAATCGG 7320

CGCAGTTCAA CCGCATGTAA CGAAGAAACA AACTGCTACA GGTGTATTAA ATGATTTAGC 7380

AACTGCTAAA AAGCAAGAAA TTAATCAAAA CACAAATGCA ACAACTGAAG AAAAGCAAGT 7440 

GGCTTTAAAT CAAGTGGATC AAGAGTTAGC AACGGCAATT AATmATATAA ATCAAGCTGA 7500 

TACAAATGCG GAAGTAGATC AAGCGCAACA ATTAGGTACA AAAGCAATTA ATGCGATTCA 7560

GCCAAATATT GTTAAAAAAC CTGCAGCATT AGCACAAATC AATCAGCATT ATAATGCTAA 7620 

ATTAGCTGAA ATCAATGCTA CACCAGATGC AACGAATGAT GAGAAAAATG CTGCGATCAA 7680 

TACTTTAAAT CAAGACAGAC AACAAGCTAT TGAAAGTATT AAACAAGCTA ACACAAATGC 7740

AGAAGTAGAC CAAGCTGCGA CAGTAGCAGA GAATAATATC GATGCTGTTC AAGTTGATGT 7800

AGTAAAAAAA CAAGCAGCGC GAGATAAAAT CACTGCTGAA GTGGcGAacG TATTGaAGCG 7860

GTTAAACAAA CACCTAATGC AACTGACGAA GAAAAGCAGG CTGCTGTTAA TCAAATCCAA 7920 
TCAACTTTAA AGATTCAAGC AATTTAATCC AAATTTAATC CAAAACCCAA ACAAATGGAT 7980 
TCAGGGTAGG ACACCACTTA CAAATCCAA 8009 


(2) INFORMATION FOR SEQ ID NO: 62: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10953 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 62:

ACCCACCCCn TGGGGATAnT TTACCTGGTG GGGCCTTCGA TTGCCTTTAG GTGAAACCaG 60 
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AGATGAATGC TAACCATATT CATTCTGCTA AAGATGGTCG TGTTACTGCG ACAGCTGAAA 100 

TTATTCATCG AGGTAAGTCG ACACATGTAT GGGATATAAA AATTAAGAAT GACAAAGAAC 240 

AATTAATTAC AGTTATGCGT GGTACAGTTG CTATTAAACC TTTAAAATAA AAGAACTGCT 300

AGCTGAAATG TTATGAGATA TTCATAACTA CGGCTAGCAG TTTTTTTATG CGCTATATTG 360 

TTGTAGTTTT AGAAATGCTT GTTCAATGCG TTCGGCAGCT TTACGGCCAC CCATAACATT 420 

TCTACCAAAT GGTCCTAATT CTAAGTCTGC AAAGCATCCT GCGACAAATA GATTTGGTAT 480

CCATTCTAAT TTTTCGGAAA TAACAGGGTA ATTACATTCG TTGATAGGTG CATCATAATT 540

TTGTATTAAT TGCTTAATAA GTGGTTGTGA CATAAAATCT TGTTCAAAAC CAGTTGCAAC 600

CATAATCTGT TGATATGGAA CAGAATCATT TTCAGTGTTA ATTACACCAC CACTAATTTG 660 

AGTGATAGGT GTTTTATGCa CATTTATACG ACCATTTTTA ATATGTTTTT TAAGGCGTAA 720 

GTACAGTTCG TGAGGCATTG ATCCTTTATG ACGTTCGCGT TGTACAATGG CATTTCTTTC 780

AGGCATGCTT TTAGTACTTA AAAATGAAGA CATATTTTTC GGACCTAACC AACCAGGATC 840

AGCATCAAAG TCATGTATTT CAATATCTTT ATTTAGCCAT AAATGAATCT TTTTATCGTT 900 

ATCATGATTT AACAATTTAA GTGCAAGATG TGCAGCAGTa ATGCCGCTAC CAACGATATG 960

ATCGGTCTTA TCATATACTA CTTGATCAAG TTCTTTCTCG AAGATATGAT TTACATTCTG 1020 

TTTGTCTTTT AAAATGTCAG GCATAAACGG AATATTTGTA CTGCCTATTG CAATAACGAC 1080

GCAATCTGTA GTGATAATTT GTCCATCTTC TAACTTGATA TGCCATTTGT CTTCTTGTTT 1140

ATCTAAAGTT TGAACTAAAC CTTGAACCAA GCAATCCTCT AATTGATATT GTTTAGAAGC 1200

ATGTGCAATA TGATCCATAA ACATTGTCAA TTCAGGTCGT TGATAAGGAC CATAAAAAGC 1260 

ATTTGTATAT TGGTGCTGTT TAGCGAATTG TTTTAGATGG AACGGTTGTG GATGTACGTG 1320

ATGTACAATC GGTGATCTTA AATAAGGCAT TTCTATTCGA TTTGTATATG AGTTAAACCT 1380

TTGGCAAAAA GTTTCGTGTG GGTCAATGAT TGTTAATCGG TCTGTTGTTA ATCCGCTTGA 1440

TAATAGTTTT TGTGCGATTG CAGTTCCCTG TATGCCACCG CCGATAATTG TCCAATGCAT 1500 

AATAAAACCT CTCTCTTTTT AAAACGTAAT AGTTACGATT TATAATTATT ATTATCATAA 1560 

TACATAACGA CATGAAAGGC AATTAAATTA AAGAGATATA TGTAGATAGG GCGAATCTGT 1620

AGTCAAAGAA AAAATCATTG AAAAAGAGGT AACAATGTCA AAAGAwAACA GCAGTAAAAT 1680

CATTCCTAAT TTGGAATCAT CTTACTGCTG TTTGTTGTTG ATTTATATTC ATGATTTTGT 1740

TATATAATCT ACAATTTTGT GTCTTTTAAG TCTTCCGAAA TTTCATCGAC TTTAGTCTTT 1800

TTAGTATAAG GCGTTTTAAT ATTATATGCT GCTTTCATAA TCATATGACT TGAAAGAGGA 1860
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GCAATAAAAT ATAAAAACGT ACCAAATAGT AATGACATTG 


CACCTAATGT TGATGCTTTT 1980

CCGGCAGCAT GTGCACGTGA ATATACATCT TCAAGTCTCA ATAATCCTAT AGCTGCTAGG 2040

GCGCTAATTA AAGCACCGAT GATAACAAAG ATAAGTGCAA GACTAATCAG TATGATTTTG 2100

ATCATGTTCA ATCACCTTAC CTTTGTCCAT AAATTTAGAG AATACTGCAG TACCTAAAAA 2160

AGCTAATATA CCAATCATCA TAATAACGAC AATCATGTAT TTAATATTTA ATAAAATACT 2220

GAATAATGCT ATAACTGCCA TTAATTGAAG ACCAATCGCA TCTAATGCGA CAACACGATC 2280

GGCAAGTGAT GGGCCTAGCA CAACGCGAAT GAGCATAGCT AACATAGAAA TGACAACTAT 2340

GATTAATGCA A7AACGATAA TAACATTATG ATTCATTATA TTTCGCCCAC CTCTCTTACA 2400

ATTTTCTCTA ATGATGTTTT AATACTTTCT ACTTCTTGCT CTTTAGTTGA AAAATCTATG 2460

GCATGAATAT AAATTTTTGT ACGATCGTCA CTTACACCAA GCACTACAGT ACCAGGTGTT 2520

AATGTAATTA AATTAGACAG CAAGACAATT TGCCAATCTT TTTTTAAATC TGTGTGATAA 2580

ACAAAGAATC CTGGTTCATT TTTAATCGAA GGTTTAATAA TAATTTTCAA AACATCAAAA 2640

TTAGCTTTAA TCAGTTCGAT TAAGAAAATA ATAACTAATT TAATAATACG ATATAGCGTG 2700

ATGACATAAA ATCTACCTGG TAACACTCTG TGTAAGAGGT AAACAAGAAC TAGGCCAAAG 2760

ATGAAACCTA ACACAAAGTT ATTTGTTGTG TAACTATTTG TCACAAACAA CCAAAACACT 2820

GCGATAATAA AGTTTAATAC TAATTGTACA GCCATGTTAT TTACCTCCTA ATACAGCTTT 2880

AACGTAGGTT GATGGATTGT AGAATGTTTC TGCACCAGCT TTTACCATTG GATATAAGTA 2940

ATCTGCTGAC AATCCATATA AAACAGTTAT CACAACTGCA ACGATTGCAA TCGTAGTTAA 3000

ATATTTGACG TCGACTTTGT TATTAAGATC ATATCCTTTT GGTTGACCGA AAAAGCCTTG 3060

TAGGAATATG CGAATGACAG AATATAATAC GACTAAACTT GATAATAAGA CGATGACACC 3120

ACTTAAATAA AATCCTCTTT CAAATGTTGA TTGGACAATA AAAAATTTTC CATAAAAGCC 3180

ACTGAGTGGG GGAATGCCAG CTAAACTTAA TGCTGCGATA AAGAATGACC AACCAAGTAC 3240

AGGATATCGT TTAATTAAGC CACCAAATTG TCTTAAATCA GCAGTGCCTG TAATTTTAAT 3300

CATAATTCCG ATAAGCAAGA ATAATGCAAG TTTTACTAAC ATGTCGTGCA ATGTATAGTA 3360

AATAGCCCCA ATCATACCTG ACTCTGTCAT CATTGCAACG CCGACTAAGA TCACACCTAC 3420

AGCAATCATG ACATTGTATA GGATGATTTT TTTAATGTTG GCATATGCAA CAGCACCGAC 3480

ACAACCAAAG ATGATCGTTA ATAGTGCTAA GAATAAAATG ACATAATGTG AAAAGCTTAC 3540

ATTATCACTA AAGAATAGGC TCAATGTTCT AGCGATTGCA TAAACACCAA CTTTTGTTAA 3600

CAAAGCACCA AAGAATGCAA TGATTGGAAT TGGTGGgCAT AGTATGCACT AGGTAACCAA 3660
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ATATTGACTA AGCCACTGTC ATGCGCTGAA AGGTTAGCTA ATTTATTGCT TATATCTGCT 3780 

AGATTCAATG TTCCTACTAC TGAATATAAA ATCGCTACAC CCATTACGAA GAAGGATGAC 3840 

GATACAACGT TAACAAGAAC ATATTTTATT GTTTCTTGTA GTTGAATTTT TGTAGAACCA 3900

ATTACTAATA AGAAATAAGA TGACATTAAA AATACTTCGA AAAATACGAA TAGGTTGAAA 3960 

ATGTCACCAG TTGTGAATGC ACCAATGATA CCTATTAACA TAAATAGTAC TGAAAAATAA 4020

TAATAATATC TTTCACGTTC AATACCAATT GTTTGGTATG AATATAAAAT CACAATAGCT 4080 

GTAATAATAA TACTAGTAAT TATTAGTAGG GCACTGAATA TGTCTAATAC AAAGACAATA 4140 

CTGTATGGTG CTTTCCATGA ACCTAGCTCT ACGCGTATTG GTCCATGTTT AACAACATTT 4200 


GCTAAATTGA TAATTGCCGC GACCAAGGTT AATAATGTAC CGCCTAGTGC GACATAACGC 4260 

TTTATAATAG GACGCTTTCC AATAAAGACA AGTAATATGG CTGTAATTAC TGGAATAACT 4320 

AGCGTTAACA CAAGCATATT ACTTTCAATC ATCTTCTGGA ACTCCTTTCA TACTCTCAAC 4380

GTTATCTGTG CCTAATTCTT TATATGTTCT AAATGCTAAT ACTAAGAAAA AGGCTGTTGT 4440

CGCAAgGCGA TAACGATTGC TGTTAAAATA AGTGCTTGCG GGaTAGGaTC AACATAGCTT 4500 

TTTACGTTCG CTTCATAAAT TGGAACAGTA CCATGTTTAA GTCCGCCCAT AGTTATTAAA 4560

AATAAATTTG CTGCATGTGT TAATAGTGTA GTTCCCATAA CAATTCGTAT CAGACTTTTA 4620

GACAAAACGA GATAGACACT AATTGCTGTG AGAATACCAC TAACAAAAAT CATAATAATT 4680

TCCACTATTC GTTCTCTCCA ATCGAAATAA TAATTGTCAT GACAGTACCA ACTACTGCAC 4740

ATAAAACACC GAAATCAAAG AATACTGCTG TTGTCATATG AACAGGTTCT AATATAAATA 4800 

ACGGTATATC AAATGTGACA TGCGTAAAGA AATTTTTGCC TAAAAACCAA CTTGCGATAG 4860

GCGTCGCAAT ACAAAAAACT AATCCGATAC CTATCAAGAT TTTAAAATCT AATGGGAAAA 4920 

TTTTACGCAT TGTTTCTATA TCAAATGCAA TCGTAATGAT AACAAGTGAA CTTGCGAATA 4980 

ATAATCCGCC GACGAAACCG CCACCAGGTG TATAATGTCC TGCTAAGAAA AGTGAAAAAC 5040 


CAAAGACCAT TACCATGAAA AAGATAATAA CTGCAGCAAA TTGCAAAATT AGATCATTTT 5100 

GTTGTCTATT CATGATTTTT CACCTCGTTA CCTTGCGTTT GACGCTTTTT ACGTAATTTA 5160 

ATCATTGTAT ATACAGCTAA TCCTGCGATA CCAAGCACAG ATGACTCGAA TAAAGTATCC 5220 

ATACCACGGA AATCAACAAG TATGACGTTT ACCATGTTTT TACCGTGAGC tAAATCATAA 5280

ACGTGCTCTT GATAAAACTT AGATATCGAT TCAAAATGTC TATTTCCGTA TGCAATTAAA 5340

CCGATAATAA TGACGGACAA ACCAACACCA CCAGCAATTA AAGCATTAGT AAGCTGGAAT 5400

GAGCGCTTTT CATTATAACG ATTTAAATTT GGTAAGTGGT AGAAGCATAA TAAGAACAAT 5460 






ATAAACAATA CAGACACAGC ATATCCAACT GCACTTAACA TAATGATGCT AAATAATCTT 5580

GATTTAGCGA AAAGAATTAA AAAGGCAGCA CTTAATAATA AAATTACGAT ACAAACTTCG 5640

AAAATTCTAA TCGGACTAAC GTCTTTAAAA TTAATGTTGA AAGGTACTGA GAATATAGTG 5700

ACAAATGTTA ATAAAATTAA TGCACCAAAA ATGATAACTA AATTATTACG TGAATAATCG 5760

GTAACATAGC TATTCGTCAT CTTTTCAGAG TAGTTTGGAA TAACATTTGC ACTTCTGTTG 5820

TACCAATAAT TGAATGTTAG TTTACCAGGT TGTCGTTGCA ACAATTTCAC CCAATAACTA 5880

AATGTCACAA TTAGTAAGAT ACCTAAAATA TAAATCACTA ATGTTGATAA AAAGGCAGGC 5940

GTTAATCCAT GGAACATATG GAATTCAACA TCATCAATTA CCGTATGATT AATCGAAGag 6000

TnAGCTGGTT CAATAATCGA ATTAGTTAAA ATGCCAGGGA ATAAACCAAA TACAATTACT 6060

AATGTAGCTA AAATAGCTGG TGATAAAAGC ATTAATATTG ATACTTCGTG TGCTTTTTTA 6120

GGTAATTGTT CAGGTTTATA TTGTCCGAAA AATATATGCA TTATAAATTT AATTGAATAT 6180

ACAAATGTGA AGACACTGCC CACTATACCA ATGATTGGGA ATAGGTAGCC TAATGTATCA 6240

ACACTGAATA AATTTGCTTG GCTTGCTGTA AATGTTGTTT CTAAAAATGA TTCTTTTGAT 6300

AAGAAACCAT TGAACGGTGG TACACCAGCg CATACTTAAT GCTGTAATAA CAGTGATTGT 6360

AAATGAAATA GGCATAATTG TTAGTAAGCC ACCTAATTTC TTAACATCAC GTGTACCAGT 6420

AGAATGATCC ACTGCACCTG TAATCATAAA TAGGGCACCT TTAAATGTTG CATGGTTGAT 6480

TAAATGGAAT ATTGCAGCCG TAAATGCAGC AGCATATATT TTGCTATCAT CGCCTTGATA 6540

GTGATAACTA ATGGCACCGA TTCCAAGCAT CGCCATAATC ATACCTAATT GGGATACTGT 6600

TGAAAATGCC AGTATACCTT TCAAGTCTTG TTGTTTTGTT GCGTTTAGCG AAgCCCAGAA 6660

TAATGTAATT AAACCAACGA GTGTGACAGT CCATACCCAA CCTTGCGATG CTGCGAAGAT 6720

TGGTGTCATT CGAGCGATTA AATATAACCC TGCTTTAACC ATTGTTGCTG AATGAAGATA 6780

AGCACTGACT GGTGTAGGTG CTTCCATTGC ATCTGGTAGC CAAATATAAA ATGGAAACTG 6840

AGCAGATTTT GTAAAAGCAC CAATCATGAT TAAAATCATC GCAAAAATGA AGAATGGGCT 6900

ATTTTGAATT TCAGAAGCAT GTTGAATCAT GTACTGAATG CTAAATGATT GTGTTGGTAT 6960

AGCGAGTAAG ATGATACCAC CTAATAATGA TAGACCACCA AATACTGTGA TTATGAGCGA 7020

TTTTTGAGCA CCATATATAG ATGCTTGTCG TTCGCGCCAG AATGAAATAA GTAAAAAACT 7080

AGAAAATGAC GTTAGCTCCC AGAATAAATA TAGAATAATA ACATTATCTG AAAGTACGAC 7140

ACCTAACATT GCACCCATAA ATAGTAATAA ATAACAATAA AAATTCCCTA GTTGTTCTGA 7200

CTTACTTAAG TAGCCGATTG AATATAATAC TACTAAACTG CCGATTCCTG AAATAAGCAA 7260
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CCAATTTAAG GTTTTCATTA CAGTATTACC TGACATCGTC GTTTTAATTA ATGTAAGCAT 


7380 




ATAAATAAAT ATGACGATAG GGACAGGTAA TACGAACCAT CCTAAATGTA TACGTTTAAA 7440

AAATCTATAC AGGATAGGAA TAATGAGTGC GAATATTAAC GGTAATATCA CCGCAATATG 7500

TAACAAACTC ACTATGTTGT CCTCCTTTAA AAAATATTTA TGTTATTCAT TATACATGAA 7560

TGATATAGTT CTGAAAAACG TACACACTCC TTGTTGTGCT TTATTTTCAG AaGTATTTAA 7620

ATAAGAAGAA ACACGTCATT TTTTATTTAA AATTTTCTTT GTATTGAAGT GAATAATCTT 7680

CTTTTAAGCG TGCTAAACTA GCTAAAGACA TTTCAGCATG TTTTGTTTGC TGAGCTTTAA 7740

GTTTAGTTTC TAAATCTGTA ATTGCTTGTT GAAGTGAATC TTCATAGCGC AATACATCAA 7800

CATTGAAGTC GCGTAATTGT GAACGTTTCG TATAGCGTTT TTCAAAATGG CTTAATGCTT 7860

TGCGGTCATG GAAAAATACA CCTTCAGTTT CAGTAGGGTT ATGTAAATCA CCTTGTTTCG 7920

GGTGTTTGAT AACTTGTTCA ACTTTAACAA GGACATCGTC TCCATTTTCT TCAACAATCG 7980

TGACACCATA GCTACCTGTT TTGTGTGAAA ATCGATATAG CTTCATGCTA TTTTCCTCCC 8040

TTAAAAGTAT GTTAATATAT ATGTATCATA ACATGAATGG AGAATATAAA TGGCTAACTA 8100

TCCACAGTTA AACAAAGAAG TACAACAAGG TGAAATCAAA GTGGTTATGC ACACAAATAA 8160

AGGTGACATG ACATTCAAAT TATTTCCAAA TATTGCACCA AAAACAGTTG AAAATTTTGT 8220

GACACATGCA AAAAATGGTT ATTATGATGG AATCACATTC CACCGTGTCA TTAATGACTT 8280

CATGATTCAA GGTGGCGATC CAACAGCTAC TGGTATGGGT GGCGAAAGTA TTTATGGCGG 8340

TGCTTTTGAA GATGAATTTT CATTAAATGC ATTTAACTTA TATGGCGCAT TATCAATGGC 8400

TAACTCAGGA CCTAATACTA ATGGTTCACA ATTTTTCATT GTTCAAATGA AAGAAGTACC 8460

TCAAAATATG TTAAGTCAAC TTGCAGATGG TGGCTGGCCT CAACCAATCG TTGATGCATA 8520

TGGCJ3AAAAG GGTGGTACAC CATGGTTAGA TCAAAAACAT ACAGTATTCG GTCAAATCAT 8580

TGATGGTGAA aCTACATTAG AAGATATTGC AAATACAAAA GTGGGACCAC AAGATAAACC 8640

ACTTCATGAT GTTGTAATTG AATCTATTGA TGTTGAAGAA TAATATCTAA ACATAATTAA 8700

CTACCAACAT TTTAAACTCG GATAAAGCTA ATTTATGAAT GGATTAGTAT ATATTCCAAC 8760

gAAAATAAAT AAACTAATAT GATGAGCAAT CTCAATATAT TTATCaAGAA AGCACAGTTT 8820

TTAAATAGAT GTGTATTTTA AAGATAATAG TTGAGGTTGC TTTTTATGTT TTTACAGAGA 8880

ATTGCTATTC AAATAGTAAA TAAATTGAAA ACAAAGTAGC TGGATATCAT ATTGATTTAG 8940

ATAGGAATTT GTTGCTAATT TTATTTGTAA ATCCAAGTTT GTAGAATTCT TATTCATTTA 9000

TAAAATAATA TTCGTATGAT TTGATTTTTT AATTAGTCCA CCATTTCGAT TTGTGCTATG 9060
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AACATATCAA GGTGCGTGTA cTGGTATTGA ACCATACGGT GCGTTTGTTG AGACCCCTAA 9180 

TCATACTGAA GGACTGATTC ATATATCAGA AATTATGGAT GACTACGTTC ATAATTTGAA 9240

GAAATTTCTA TCAGAAGGCC AAATTGTTAA AGCTAAAATT TTGTCTATAG ATGATGAAGG 9300

AAAGCTTAAT CTATCATTAA AGGATAATGA TTACTTCAAA AATTATGAGC GTAAGAAGGA 9360 

AAAACAATCA GTATTAGATG AAATCAGAGA AACAGAAAAA TATGGGTTTC AAACACTTAA 9420 

AGAACGCTTA CCAATCTGGA TAAAACAGTC AAAGCGAGCA ATTCGAAACG ACTAAAGGAA 9480

CAGATAAATC GTACCGAAAA TCATACAAAG GGTCTGAAAT GAAAGTTTCT TAGACTATAA 9540

AAGAGATTAG TATCTATTAA ATTTTATTAG ATACTAATCT CTTTTTGTCT ACGATAACGT 9600

AATATGaTTG ATTCTATTTA CACGTACAAA TGGTTTAAGG TGACATATCC ATTATCTTTG 9660 

TT&GATAGAA TCGTTGATTT GCaATATTGT ATGTGGATTT GTTTTTTTTA TTTATTTTAG 9720 

AAATGAGAAC TACAACTTAA AGTATTAAAC GAATTGCAAC TATATAAACA GATAATTGGA 9780 

GAATGAAAAA ATTACATGTT ATAGTCAACT CAATAATTTT AAGGAGGAAT TAAGTAATGA 9840

AAAGTAAATA CGAACCATTG TTTGATAAAG TAGAATTACC AAATGGAGTA GAGTTGAGAA 9900 

ATCGATTTGT GTTAGCCCCT TTAACACATA TTTCTTCAAA TGATGATGGT ACTATTTCAG 9960

ATGTAGAACT TCCTTATATT GAAAAGCGTT CACAAGATGT TGGTATTACA ATTAATGCTG 10020 

CGAGTAATGT GAGTGATGTC GGAAAAGCAT TTCCAGGACA GCCATCAATC GCGCATGACA 10080 

GTAATATTGA AGGACTAAAA CGATTAGCTA CAGCAATGAA GAAAAACGGT GCCAAAGCAC 10140

TCGTACAAAT ACATCATGGC GGTGCACAAG CATTGCCTGA ATTAACACCT GATGGAGACG 10200 

TCGTAGCACC AAGTCCAATT TCTTTAAAAA GTTTTGGTCA GAAACAAGAA CATAGTGCTA 10260 

GAGAAATGAC GAATGAAGAG ATTGAACAAG CAATCAAGGA TTTTGGTGAA GCAACGCGAC 10320

GTGCAATTGA AGCAGGGTTT GATGGTGTTG AAATACATGG CGCGAATCAT TACTTAATTC 10380 

ATCAATTTGT ATCACCATAC TATAATAGAA GAAATGATGT ATGGGCAAAT CAATATAAAT 10440 


TCCCGGTCGC TGTGATTGAA GAAGTACTTA AAGCGAAAGA AGCGTATGGC AATAAAGACT 10500 

TTATAGTTGG ATACAGATTA TCTCCAGAGG AAGCGGAGTC TCCAGGAATC ACAATGGAAA 10560 

TTACAGAGGA ACTCGTTAAT AAAATTAGCC ATATGCCAAT CGACTATATT CATGTTTCAA 10620 

TGATGGATAC GCATGCAACG ACACGTGAAG GTAAATACGC TGGACAAGAA AGACTGCCTT 10680 

TAATTCACAA ATGGATAAAT GGTCGTATGC CACTTATCGG TATTGGTTCA ATTTTCACAG 10740 

CTGACGAAGC TTTAGATGCA GTTGAAAATG TTGGTGTTGA CTTAGTAGCC ATTGGTAGAG 10800

AGCTACTACT GGATTATCAA TTTGTTGAAA AAATTAAAGA TGGACGGGAA GATGAAATTA 10860 
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AATTTAATGA AGGGTTTTAT CCATTACCAC GTA 10953 
(2) INFORMATION FOR SEQ ID NO: 63:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8155 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 63:

TTTGATAnAA AACTGAATnA ATTAAATGTA TCGATTCAAC CTAATGAAGT GAATTTACAA 60 


GTTAAAGTAG AGCCTTTTAG CAnAAAGGTT AAAGTAAATG TTAAACAGAA AGGTAGTTTA 120 

GCAGATGATA AAGAGTTAAG TTCGATTGAT TTAGAAGATA AAGAAATTGA AATCTTCGGT 180 

AGTCGAGATG ACTTACAAAA TATAAGCGAA GTTGATGCAG AAGTAGATTT AGATGGTATT 240

TCAGAATCAA CTGAAAAGAC TGTAAAAATC AATTTwCCAG AACATGTCAC TAAAGCACAA 300 

CCAAGTGAAA CGmAGGCTTA TATAAATGTA AAATAAATAG CTAAATTAAA GGAGAGTAAA 360 

CAATGGGAAA ATATTTTGGT ACAGACGGAg TAAGAGGTGT CGCAAACCAA GAACTAACAC 420

CTGAATTGGC ATTTAAATTA GGAAGATACG GTGGCTATGT TCTAGCaCAT AATAAAGGTG 480 

AAAAACACCC ACGTGTACTT GTAGGTCGCG ATACTAGAGT TTCAGGTGAA ATGTTAGAAT 540

CAGCATTAAT AGCTGGTTTG ATTTCAATTG GTGCAGAAGT GATGCGATTA GGTATTATTT 600 

CAACACCAGG TGTTGCATAT TTAACACGCG ATATGGGTGC AGAGTTAGGT GTAATGATTT 660 

CAGCCTCTCA TAATCCAGTT GCAGATAATG GTATTAAATT CTTTGGATCA GATGGTTTTA 720 


AACTATCAGA TGAACAAGAA AATGAAATTG AAGCATTATT GGATCAAGAA AACCCAGAAT 780 

TACCAAGACC AGTTGGCAAT GATATTGTAC ATTATTCAOA TTACTTTGAA GGGGCACAAA 840 

AATATTTGAG CTATTTAAAA TCAACAGTAG ATGTTAACTT TGAAGGTTTG AAAATTGCTT 900 


TAGATGGTGC AAATGGTTCA ACATCATCAC TAGCGCCATT CTTATTTGGT GACTTAGAAG 960 

CAGATACTGA AACAATTGGA TGTAGTCCTG ATGGATATAA TATCAATGAG AAATGTGGCT 1020 

CTACACATCC TGAAAAATTA GCTGAAAAAG TAGTTGAAAC TGAAAGTGAT TTTGGGTTAG 1080

CATTTGACGG CGATGGAGAC AGAATCATAG CAGTAGATGA GAATGGTCAA ATCGTTGACG 1140 

GTGACCAAAT TATGTTTATT ATTGGTCAAG AAATGCATAA AAATCAAGAA TTGAATAATG 1200 

ACATGATTGT TTCTACTGTT ATGAGTAATT TAGGTTTTTA CAAAGCGCTT GAACAAGAAG 1260

GAATTAAATC TAATAAAACT AAAGTTGGCG ACAGATATGT AGTAGAAGAA ATGCGTCGCG 1320 
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CTGGTGATGG TTTATTAACT GGTATTCAAT 


TAGCTTCTGT AATAAAAATG ACTGGTAAAT 1440

CACTAAGTGA ATTAGCTGGA CAAATGAAAA AATATCCACA ATCATTAATT AACGTACGCG 1500

TAACAGATAA ATATCGTGTT GAAGAAAATG TTGACGTTAA AGAAGTTATG ACTAAAGTAG 1560

AAGTAGAAAT GAATGGAGAA GGTCGAATTT TAGTAAGACC TTCTGGAACA aACCATTAGT 1620

TCGTGTCATG GTTGAAGCAG CAACTGATGA AGATGCTGAA aGATTTGCAC AACAAATAGC 1680

TGATGTGGTT CAAGATAAAA TGGGATTAGA TAAATAAATA CTGTATTACA AATGAGCCGA 1740

TGCGTATGcA nTcgtTTTTT GTGTTTGTAG AAATAATTTA TAGTACAAAC GTAAAATGAT 1800

ATAAACAAAA TAAAAACAAA GTAATCAATA TGTAATATAA AATACACTGG TACTCAATAT 1860

ATAATGATGA TAAAATTAAT TTTAATTAGA TAGAGTTGCT TTGTGTTTTT AACGCAGATG 1920

CTACTACTTA TCTTAACAGT TGATTAAGTG AAATCATTTA ACAGCGAGAA TAATCAACCA 1980

GGAGGATGAC TTAATGAATT TATTCAGACA ACAAAAATTT AGTATCAGAA AATTTAATGT 2040

CGGTATTTTT TCAGCTTTAA TTGCCACTGT TACTTTTATA TCTACTAACC CGACAACAGC 2100

GTCTGCAGCA GAGCAAAATC AGCCTGCACA AAATCAACCA GCACAACCAG CTGATGCCAA 2160

TACACAGCCT AACGCAAATG CTGGTGCTCA AGCTAATCCT ACAGCACAGC CAGCTGCACC 2220

TGCCAACCAA GGACAACCAG CAGTACAACC AGCAAACCAA GGTGGACAGG CTAATCCAGC 2280

AGGAGGAGCA GCACAACCAA ATACACAACC AGCTGGACAA GGTGATCAAG CTGATCCGAA 2340

TAACGCTGCA CAAGCACAAC CTGGAAATCA AGCAACACCG GCAAACCAAG CAGGTCAAGG 2400

AAATAACCAA GCAACACCTA ATAATAATGC AACACCGGCA AATCAAACAC AGCCAGCGAA 2460

TGCTCCAGCA GCAGCGCAAC CAGCAGCACC TGTAGCAGCA AACGCACAAA CTCAAGATCC 2520

AAATGCTAGC AATACTGGTG AAGGCAGTAT TAATACGACA TTAACATTTG ATGATCCTGC 2580

CATATCAACA GATGAGAATA GACAGGATCC AACTGTAACT GTTACAGATA AAGTAAATGG 2640

TTATTCATTA ATTAACAACG GTAAGATTGG TTTCGTTAAC TCAGAATTAA GACGAAGCGA 2700

TATGTTTGAT AAGAATAACC CTCAAAACTA TCAAGCTAAA GGAAACGTGG CTGCATTAGG 2760

TCGTGTGAAT GCAAATGATT CTACAGATCA TGGTAACTTT AACGGTATTT CAAAAACTGT 2820

AAATGTAAAA GCAGATTCAG AATTAATTAT TAACTTTACT ACTATGCAAA CGAATAGTAA 2880

GCAAGGTGCA ACAAATTTAG TTATTAAAGA TGCTAAGAAA AATACTGAAT TAGCAACTGT 2940

AAATGTTGCT AAGACTGGTA CTGCACATTT ATTTAAAGTA CCAACTGATG CTGATCGTTT 3000

AGATTTACAA TTTATTCCTG ACAATACAGC AGTTGCTGAT GCTTCAAGAA TTACAACAAA 3060

TAAAGATGGT TATAAATACT ATTCATTCAT TGATAATGTA GGTCTATTCT CAGGATCACA 3120
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TAATACTGAA ATCGGTAACA ATGGTAATTT TGGTGCTTCA TTAAAAGCAG ATCAATTTAA 3240 

ATATGAAGTA ACATTACCAC AAGGTGTAAC TTACGTTAAT AATTCATTAA CTACAACATT 3300 

CCCTAATGGT AATGAAGACA GTACAGTATT GAAAAATATG ACTGTTAATT ATGATCAAAA 3360

TGCAAATAAA GTTACATTTA CAAGCCAAGG TGTGACAACG GCACGTGGTA CACACACTAA 3420 

AGAAGTTTTA TTCCCAGATA AATCTTTAAA ATTATCATAT AAAGTTAATG TTGCGAATAT 3480

CGATACACCT AAAAATATTG ATTTTAATGA AAAATTAACA TATCGTACTG CTTCAGATGT 3540 

TGTAATTAAT AATGCGCAAC CAGAAGTaCA CTAACTGCAG ATCCATTTTC AGTAGCGGTT 3600 

GAAATGAACA AAGATGCGTT GCAACAACAA GTAAACTCAC AAGTTGATAA TAGTCATTAC 3660 

ACAACAGCAT CAATTGGAGA ATACAATAAA CTTAAACAAC AAGCAGATAC TATTTTAAAT 3720 

GAAGATGCGA ATCATGTTAA AACTGCAAAT CGTGCATCTC AAGCGGATAT TGATGGTTTA 3780

GTAACTAAAT TACAAGCTGC ATTAATTGAT AATCAAGCAG CAATTGCTGA ATTAGATACT 3840

AAAGCTCAAG AAAAGGTTAC AGCAGCACAA CAAAGTAAAA AAGTTACGCA AGATGAAGTT 3900 

GCAGCACTTG TAACTAAAAT TAACAATGAT AAAAATAATG CAATCGCAGA AATTAATAAA 3960

CAAACTACAG CACAAGGTGT CACAACTGAA AAAGATAATG GTATCGCAGT GTTAGAACAA 4020

GATGTGATTA CACCAACAGT TAAACCTCAA GCGAAACAAG ATATTATCCA AGCAGTTACA 4080

ACTCGTAAAC AACAAATTAA AAAGTCAAAT GCATCATTAC AAGATGAAAA AGATGTAGCA 4140 

AATGATAAAA TTGGTAAAAT TGAAACAAAG GCAATTAAAG ATATTGATGC AGCAACAACA 4200

AATGCACAAG TAGAAGCCAT TAAAACAAAA GCAATCAATG ATATTAATCA AACTACACCT 4260

GCTACAACAG CTAAAGCAGC AGCTCTTGAA GAATTTGACG AAGTTGTTCA AGCACAAATT 4320 

GATCAAGCAC CTTTAAATCC TGATACAACA AATGAAGAAG TAGCGGAAgC TATTGAACGT 4380 

ATTAATGCAG CTAAAGTTTC TGGTGTTAAA GCAATTGAAG CGACAACGAC TGCACAAGAT 4440 

TTAGAAAGAG TTAAAAACGA AGAAATCTCA AAAATTGAAA ATATTACTGA CTCTACGCAA 4500 

ACAAAAATGG ATGCCTATAA TGAAGTTAAA CAAGCTGCAA CAGCTAGAAA AGCTCAAAAT 4560 

GCTACAGTTT CAAATGCAAC AAATGAAGAA GTAGCAGAAG CTGATGCAGC AGTAGATGCA 4620 

GCTCAAAAGC AAGGTTTACA TGACATCCAA GTTGTTAAAT CAAAACAGGA AGTTGCTGAT 4680 

ACAAAATCAA AAGTATTAGA TAAAATCAAT GCAATTCAAA CACAAGCAAA AGTTAAACCT 4740

GCAGCTGATA CGGAAGTAGA AAACGCATAT AATACACGTA AACAAGAAAT TCAAAATAGC 4800 

AATGCTTCAA CTACAGAAGA AAAACAAGCT GCATATACAG AATTAGATAC TAAAAAGCAA 4860

GAAGCAAGAA CAAATCTTGA TGCTGCAAAT ACAAACAGTG ATGTAACAAC AGCTAAAGAC 4920
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GCGGAAATCG 


CTCAAAAAGC AAGTGAACGT AAAACAGCAA TTGAAGCAAT GAATGATTCG 5040

ACTACTGAAG AACAACAAGC AGCGAAAGAC AAAGTGGATC AAGCAGTAGT TACTGCAAAC 5100

GCTGATATAG ATAATGCTGC AGCAAACAAT GATGTGGATA ATGCAAAAAC TACAAATGAA 5160

GCTACAATCG CAGCCATTAC ACCTGATGCA AATGTTAAAC CAGCAGCAAA ACAAGCAATT 5220

GCAGATAAAG TACAAGCTCA AGAAACAGCA ATTGATGGAA ATAACGGCTC AACAACTGAA 5280

GAAAAAGCAG CTGCTAAACA ACAAGTTCAA ACTGAAAAAA CAACAGCTGA TGCCGCAATA 5340

GATGCAGCAC ATACAAATGC GGAAGTTGAA GCGGCTAAAA AAGCAGCAAT TGCTAAAATT 5400

GAAGCGATTC AGCCAGCAAC AACAACTAAA GATAATGCGA AAGAAGCAAT TGCTACGAAA 5460

GCGAATGAAC GTAAAACAGC AATCGCTCAA ACGCAAGACA TTACTGCTGA AGAAATTGCA 5520

GCGGCTAATG CGGACGTAGA TAATGCTGTG ACACAAGCAA ATAGCAACAT TGAAGCTGCT 5580

AATAGTCAAA ATGATGTAGA CCAAGCGAAA ACGACAGGTG AAAATAGTAT TGATCAAGTA 5640

ACACCAACAG TTAATAAAAA AGCAACTGCA CGTAATGAAA TCAGAGCAAT TTTAAATAAC 5700

AAATTGCAAG AGATTCAAGc tACGCCAGAT GCAACAGATG AAGAAAAACA AGCAGCTGAT 5760

GCTGAAGCAA ATACTGAAAA TGGTAAAGCA AATCAAGCCA TTTCAGCAGC AACTACTAAC 5820

GCACAAGTTG ATGAAGCTAA AGCAAATGCA GAAGCAGCGA TTAATGCGGT AACACCAAAA 5880

GTTGTGAAGA AACAAGCGGC TAAAGATGAA ATTGATCAAT TACAAGCAAC GCAAACAAAT 5940

GTTATCAATA ATGATCAGAA CGCTACAACA GAAGAAAAAG AAGCAGCTAT TCAACAATTA 6000

GCAACAGCAG TTACAGACGC GAAAAATAAT ATTACAGCTG CAACTGATGA TAATGGTGTA 6060

GATCAGGCGA AAGACGCTGG AAAGAATTCA ATTCAAAGCA CGCAACCAGC AACAGCGGTT 6120

AAATCAAATG CTAAAAATGA TGTTGATCAA GCTGTGACAA CTCAAAATCA AGCAATTGAT 6180

aataeaactg GTGCTACAAC TGAAGAGAAA AATGCAGCAA AAGATTTAGT TTTAAAAGCT 6240

AAAGAAAAAG CGTATCAAGA TATCTTAAAT GCACAAACAA CTAATGATGT TACGCAAATT 6300

AAAGATCAAG CAGTTGCTGA TATTCAAGGT ATTACTGCAG ATACAACAAT TAAAGATGTT 6360

GCGAAAGATG AATTAGCAAC AAAAGCAAAC GAAGAAAAAG CGCTTATTGC ACAAACTGCA 6420

GATGCGACTA CTGAAGAAAA AGAACAAGCA AATCAACAAG TAGACGCACA ATTAACACAA 6480

GGTAATCAAA ATATTGAAAA TGCACAGTCA ATCGATGATG TAAACACTGC AAAAGATAAT 6540

GCAATTCAAG CAATTGACCC AATTCAAGCA TCAACAGATG TTAAAACGAA TGCAAGAGCG 6600

GAATTGCTAA CTGAAATGCA AAATAAAATA ACTGAAATAC TTAATAATAA TGAGACTACT 6660

AATGAAGAAA AAGGTAACGA TATTGGACCA GTTAGAGCAG CATATGAAGA AGGTTTAAAT 6720
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AAAGTTCAAC AACTTCATGC AAATCCTGTT AAGAAACCAG CAGGTAAAAA AGAATTAGAT 684 0 

CAAGCTGCAG CTGATAAGAA AACACAAATA GAACAAACAC CAAATGCATC ACAACAAGAA 6900 

ATTAATGATG CAAAACAAGA AGTTGATACT GAATTAAATC AAGCGAAAAC AAATGTCGAT 6960 

CAATCATCAA CAAATGAATA TGTTGATAAT GCAGTTAAAG AAGGAAAAGC TAAAATTAAT 7020 

GCAGTTAAAA CATTTAGTGA GTACAAAAAA GATGCTTTAG CTAAAATTGA AGATGCATAT 7080 

AATGCTAAAG TAAACGAAGC GGATAACTCT AACGCATCGA CTTCAAGTGA AATTGCTGAA 7140

GCGAAACAAA AACTTGCTGA ATTAAAACAA ACTGCGGATC AAAATGTTAA TCAAGCTACT 7200 

TCTAAAGATG ACATTGAAGT TCAAATTCAT AATGACTTAG ATAATATTAA CGATTACACA 7260

ATTCCAACAG GTAAAAAAGA ATCAGCTACA ACAGATTTAT ATGCTTATGC AGATCAGAAG 7320 

AAAAATAATA TTTCAGCTGA CACTAATGCA ACACAAGATG AAAAGCAACA AGCAATTAAG 7380

CAAGTTGACC AAAATGTTCA AACTGCATTA GAAAGCATTA ATAATGGTGT GGATAATGGT 7440 

GACGTTGATG ATGCATTAAC ACAAGGTAAA GCAGCAATTG ATGCTATTCA AGTAGATGCT 7500

ACTGTTAAAC CTAAAGCGAA CCAAGCTATT GAAGTTAAAG CAGAAGATAC GAAAGAATCT 7560 

ATTGATCAAA GTGACCAGTT AACTGCTGAA GAAAAAACTG AAGCATTAGC AATGATTAAA 7620 

CAAATTACAG ATCAAGCTAA ACAAGGTATT ACTGATGCAA CAACAACTGC TGAAGTTGAA 7680 

AAAGCGAAAg cTCaAGGACT TGAAGCATTT GATAACATTC AAATCGACTC AACAGAAAAA 7740

CAAAAAGCTA TCGAAGAATT AGAAACTGCA CTAGACCAGA TTGAAGCAGG TGTAAATGTC 7800

AACGCTGATG CTACAACTGA AGAAAAAGAA GCGTTTACGA ATGCTTTAGA AGACATTTTA 7860 

TCAAAAGCAA CTGaAGATAT TTCTGATCAA ACTACAAATG CAGAAATCGC TACTGTCAAA 7920 

AATAGTGCGC TTGAACAACT TAAAGCACAA CGTATTAATC CTGAAGTTAA GAAAAATGCT 7980 

TTGGAAGCAA TCAGAGAAGT GGTTAACAAG CAAATAGGAA tAATTAAAAA TGCAGATGCA 8040

GATGCATCGG CGGAAAGAnA TTGCACGTAC GGGATTTAGG TAGATATTTT GGACCGATTT 8100 

GCTGGATAAA TTTAGGGTnA AACCCCAACC AATGCCGAAG TTGCCTGAAT TACCA 8155 

(2) INFORMATION FOR SEQ ID NO: 64:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1630 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 64: 
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CTGTTTTATT TGCAGCACCC ATACTGGAAA TCACTTTAAT CCCTCGGTCA ■ AGACACTCTT 120 

TCATTAAGTG TACTTTGTAC ATTATTGTAT CACTTGCATC TACAAAATAA TCTATATCGT 180

AGTTATCGAA AATTTCTTCA TATGTCTCTT CTGTATAAAA CATATGTAAG GGCGTGACTT 240

TACAATCTGG ATTAATTAAT TTAATACGTT CTTCCATCAA AGAAACTTTA CTTTGTCCTA 300 

CCGTTGTAGT TAAAGCGTGT AATTGTCTGT TTACATTTGT AATATCAACA TCATCTTTAT 360

CTATTAATAT AATATGACCA ATATTCGTTC TTGCTAATGC TTCAGCAGCA AATGAACCAA 420 

CACCTCCAAC GCCAAGTATG ACAACAGTTT GTTGCTTCAA TAAATCTAAA CCTTGTTGTC 480

CAATCGCTAG TTCATTTCTT GAAAATTGAT GTTTCATTAT TTTACCTCTT TCACTGATTT 540 

ATACATAAGT ACATAGTAAC TTAAAATTTT ATATTTAGCA TTATCACTTT GATTATTTTC 600 

CCAAAATTCA ACGAGGAAAC ATTTATTAAA CGCTATAAAA CCCAACTAAT TCTTTATTAA 660 

AAACTTAAAG AAACGCATAA AAATACGCAA GACAAAGTCT TGCGTATCGA TAGAGTCCGT 720 

ATTGCCGTAG TTATAATAGC TTGATCATTC GGCCTGTTAT ATACAGGTGG GTGCCCTGTT 780

TCTTGTTTTG TACGTCCTTC ATATAAGGCG TGTACGCTGC AAGAAAACCC ATTGGGCTCC 840

CTTGATCAAA GAGTGTTAGG CCCAAATTAA AAAGCAAACT TACGAACAAC TCAGATGACT 900

ATCTTATGAT GTTATATTAC CACATAATTA AAATTAATGA AATTATAACA AACCAAAGTT 960 

TATTGATTTT TTAAAATTTA GTGACGAATT CGGAAAGAAA GTTCTTCTAA TTGTTTATCA 1020 

GAAACTTCAC TAGGCGCATT CGTTAATAAA CATGTAGCAG ATGCTGTTTT AGGGAATGCG 1080

ATTGTATCTC TCAAGTTTGT TCTATTAGTC AATAACATGA CTAATCGGTC tAATCCTAAT 1140 

GCAATACCGC CATGTGGTGG TGCACCATAT TTAAATGCAT CTAGTaAGAA GCCGAACTGT 1200

TCCTgTGCTT GTTCTTTAGT AAATCCAAGA ACTTCGAACA TTTTTTCTTG TAACTCACCA 1260 

TCATSAATTC TGATTGAACC GCCACCTAAT TCATAACCAT TTAATACTAT GTCATAAGCA 1320 

TTTGCCTCAG CTTCtTCTGG CGCAGTGCCA AGCTTAGCAA TATCAGCTTC TTTTGGAGAT 1380

GTAAATGGAT GATGTGCTGC AACGTAACGT TTCGCATCTT CATCATATTC TAATAATGGC 1440

CAATCTGTCA CCCATAAGAA GTTTAATTTT GTTTCATCGA TTAAACCTAA TTCTTTAGCT 1500 

AATTTGACAC GTAATGCACC TAAACTTTGT GCAACGACAT TTGGTttGTC TGCAACAAAC 1560 

ATTACTAAGT CACCAGCTTC AGCACCAGTT AATGTAAGTA ATGTTTCAAC ATTTTCTGTT 1620 

CAAAGAAACG 1630 

(2) INFORMATION FOR SEQ ID NO: 65:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 732 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65:
CAATTGGACA TCTTGTATGA AAAGGACAAC CTTGCGGCGG ATTACTTGGC GAAGGTAATT 
CTCCTTTTAA TATAATTCTA TTGTTATTAT GTTTATCAAT TTGTGGTATT GATGAAATCA 
ACGCTTTTGT ATATGGATGT TTGGGATTTT CATAAATTTC TTTATCAGAT GCGATTTCAA 
CTATATGACC TAAATACATA ACTCCAATGA CATCACTTAT ATGTTTTACT ACACTTAAAT 
CATGTGCGAT AAATAAATAG CTTAAGTTAA ATTGTTCTTG TAAATCTTTT AATAAATTCA 
GTACTTGAGA TTGAACAGAT ACATCTAATG CACTTACAGG CTCATCAGCA ACAATTAAAC 
TCGGACGCAA AGCCAATGCT CTTGCAATTC CCACTCTTTG TCTCTGTCCA CCTGAAAATT 
CATGTGCATA TTtATAATAT GCATCTTCAC TTAGGCCAAC ACATTTTAAT AAATATAGTA 
CTTCTTTTTT TATTTCTTCT TTTGGCAATT TTTTATAATT TAAAATAGGT TCTGAAATGA 
TATCTCCAAC CATTTGCATC GGATTCAATG ATGCATACGG ATCTTGAAAT ATCATCTGAT 
ATTGTTGTCG TGATTTTCTG AGTTTTTTAC CTTGTAATCT TGTTATATCT TCACCATTAA 
CAATTATTGA GCCTGAAGTT GCATCTTCAA GCCTGATAAT CACTTTACCT AACGTTGACT 
TACCACAACC CG 

(2) INFORMATION FOR SEQ ID NO: 66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 66:
AATATATTCA TATGTTTCAT CAACAATATT AGCTGCTTTT TGAATTAAAG CAATTTCGTC 
AGCATCTTTG ACGTCTCTAA TTTTATCTAC AGTATTAGAA ATGCTTATTA ATGATATACG 
GCTTTTATTT AATTCAAGGT ATGTATCATA ACTTACATGA TGCCCCTCAA AACCTACATT 
TTCAAAATTT TCTTGGTGTA GCAATTCTTT AATCTCACCA ATAATAGTAG ATTTACGATT 
AATAATTTCA TAATTTGGCG CCTGCTTAGT TGCTTGATCA ATATATCTAA AGTCTGTTAT 
CAAATATTGT TTATCTTTAG ATATGATAAG TGCTCCACTG GTACCAGTAA AACCTGATAA 
ATATCTTCTA TTGTAATCCG AAAGAATGaT AATCGCATCT AAATGTTTTT GTTCTAAAAT

CAACTTTATA CATTAAAATA ATATCATAAT AAGGATAAAA AATAATAGAT ATTGATTTTA 540

GGGAGATAGT AATGAAAAAA TTGGTTTCAA TTGTTGGCGC AACATTATTG TTAGCTGGAT 600

GTGGATCACA AAATTTAGCA CCATTAGAAG AnAAAACAAC AGATTTAAGA GAAGATAATC 660

ATCAACTCAA ACTAGATATT CAAGAACTTA ATCAACAAAT TAGTGATTCT AAATCTAAAA 720

TTAAAGGGCT TGAAAAGGAT AAAGAAAACA GTAAAAAAAC TGCATCTAAT AATACGAAAA 780

TTAAATTGAT GAATGTTACA TCAACATACT ACGACAAAGT TGCTAAAGCT TTGAAATCCT 840

ATAACGATAT TGAGAAAGAT GTAAGTAAAA ACAAAGGCGA TAAGAATGTT CAATCGAAAT 900

TAAATCAAAT TTCTAATGAT ATTCAAAGTG CTCACACTTC ATACAAAGAT GCTATCGATG 960

GTTTATCACT TAGTGATGAT GATAAAAAAA CGTCTAAAAA TATCGATAAA TTAAACTCTG 1020

ATTTGAATCA TGCA7TTGAT GATATTAAAA ATGGCTATCA AAATAAAGAT AAAAAACAAC 1080

TTACAAAAGG ACAACAAGCG TTGTCAAAAT TAAACTTAAA TGCAAAATCA TGATAGGAGT 1140

CTTTTAATGC GTAATATAAT ATTTTATCTT GTACTTATTA TTGCTGCGAT TGGATTAGTA 1200

ATGAATCTAG ATGCCTTTAT TTTTTCAATC GTCAGAATGT TAATCAGCTT TGcgTAaTAG 1260

CTGGTATTAT TTATCTGATT TATTATTTCT TCATCTTAAC TGAAGACCAA CGCAAATATC 1320

GCAAAGCAAT GCgTrAaGTA TAAAAGAAAT CAAAGAAGAA AATAGATAAA AAAACGGAAG 1380

CACTTGTAGG TAAAATAGTC TACGTGCTTC CATTTTTTAT TCTAAAAACT ACTTTCTAAA 1440

CATCCATTCA TCTGAACGAT ATTTTTCAGT TAATTCTTCC ACTTCTGCCA ATTGAGCTTC 1500

TGtTAATTCA AGTGGCTTTA ATTCTATATT TAAACCTTTC TTAAAACCTT TCTCGAAAGC 1560

TTCTTCCATT TGACTAATAG TAATGTGTTC ATCTGAAATA TCATTGATGG CAACTGCTTT 1620

TTCAACGAAT GCCTCTTTCA TTTTTAATTT TAATCTTTCA TTTTTATAAA TrAACATATC 1680

AAACAGTTCA TCAATATCAA TATCTTGTAA AATCGAACCG TGTTGGAGGA TTACGCCCTT 1740

TTGTCTCGTT TGAGCACTCC CAGCAATCTT ACGGCCTTCA ACAACTAGCT CATACCAACT 1800

TGGTGCATCA AAACACACTG AACTTCGAGG TTGTTTTAAT TTTTGACGCT CTTCAGGCGT 1860

TTTAGGTACC GCAAAATAAG TATCAAATCC TAAGTTTTTA AATCCTTCTA ATAATCCTTG 1920

TGAAATCACT CTGTACGCTT CTGTAACTGT AGAAGGCATA TTCGGATGCG ATTCAGGCAC 1980

AATCACACTG TAAGTTAACT CTTTATCATG TAGCACCCCA CGGCCACCAG TTTGACGCCT 2040

TACGAGACCA AAACCTTTCT CTTTAACCTT ATCAATATCA ATTTCTTTTT GTAGCCTTTG 2100

GAAATACCCT ATTGATAATG TTGCAGGATT CCATGTGTAA AAACGTATAA CTGGATCAAT 2160

TTCACCTCTA GAGACAAAAT TTAATAACGC TTCATCCATT GCCATATTAT AATATGGGTC 2220

AAATGTATAA TATTTGATTC GCTAATTAAT CAATTTAACT AAATGAATAA TAATTGCAAT 2340

TCTTTAGTGA AATATTTTGA TAATTTGACC TAACAGTCTT ATAATTATAT TATCGTTTAA 2400

TTAGGGAGGA TGCAAGATGA GTGCTAGTTT GTACATCGCA ATAATTTTAG TTATAGCAAT 2460

TATTGCTTAT ATGATTGTTC AACAAATTCT TAACAAGCGA GCTGTTAAAG AATTAGATCA 2520

AAATGAATTC CATAATGGGA TTAGAAAAGC TCAAGTCATC GATGTTAGAG AGAAAGTTGA 2580

CTATGACTAC GGTCACATTA ATGGGTCTCG CAATATTCCT ATGACAATGT TCAGGCAACG 2640

ATTCCAAGGA TTAAGAAAAG ATCAACCGGT ATACTTATGT GATGCCAATG GGATTGCTAG 2700

CTATAGAGCC GCTCGTATTT TGAAAAAGAA TGGATATACA GATATCTATA TGTTAAAAGG 2760

CGGCTATAAA AAATGGACTG GAAAAATAAA GTCTAAAAAA TAGTTTTTGT AAATTTAATA 2820

TACGATTTAA TAAAATCTGA GTGTTAATTG ATCATCAATA ACAATACTCA GATTTTAATT 2880

TTTTAACAAA GTCTGTTACT ATATTTCTCT AGCTTCACTG ATCATTAAAC TTAGTTTCAG 2940

CATAATAAAG AAAGTTCAGC TCATTTTCAA TACGATTCAA TTACCGCAAT CTAAAAAATG 3000

AAAAGACAAT TTCTATGAAA GAATAATACC AAACCCTAAG AGTTATTACT TCGGTTTAGT 3060

TTTCTTGTTT AAATAGAAAT TGTCTTTTTC AATTGATTTT GAAACCATTA TCCTTAAATC 3120

TTCATACAAA GTTAGAATAA TAATTCTCGG AATATGTGTT TAATACTTTA TTTTTCCTGT 3180

TTAAGATTTT CAAACTTTAA TATTGGTTTA CGAGCAGCTG TAGCTTCGTC TAATCGATCA 3240

ATCACAGTTG TATGTGGTGC TTCTAGCacT TTATCAGGAT CATTTTTAGC TTCTTCAGCA 3300

ATACTAATTA ATGTATCGAT AAAATAATCA AGTGTTTCTT TAGACTCTGT CTCAGTCGGT 3360

TCAATCATCA TACCTTCTTC AACATTTAAT GGGAAGTATA TTGTTGGTGG ATGTACACCG 3420

AAATCTAATA ATCGCTTAGC CATGTCTAAA GTACGTACAC CAAATTCTTT TTGACGCACA 3480

CCACTTAACA CAAACTCGTG TTTACAATAT TGTTTATAAG GTATTTCAAA GTGTTTAGAT 3540

AAACGTGCTT TAATATAATT CGCATTAAGA ACCGCTGCTT CAGAAACCTC TTTAAGTCCA 3600

GTTGCTCCCA TAGTTCGAAT ATACGTATAA GCTCTTAAGT AAATACCAAA GTTACCATAA 3660

AATGGTTTTA CACGTCCGAT AGAATTTTTA ATGTCATTAT CATATTTAAA TTTGTCGCCA 3720

TCTTTAATAA CCATTGGCTT TGGTAAGTAA CTTGCTAGTT CTTTTACTAC ACCGACTGGA 3780

CCTGAACCAG GACCGCCACC ACCATGTGGA CCAGTAAATG TTTTATGCAA GTTTAAATGA 3840

ACAGCATCAA ATCCCATATC TCCTGGGCGA ACTTTGTCCA TAATAGCGTT TAAATTCGCA 3900

CCATCATAAT ATAATAGACC ACCAGCATTA TGGACGATTT CACGGATTTC CATAATATTT 3960

TTTTCGAAAA TACCTAAAGT GTTTGGATTA GTTAACATAA TAGCTGCTGT ATTTTCATTT 4020

GATTTAAATC CTGCAAATCa AGCTGAGGCT GGaTTCGTAC CATGCGCAGA ATCTGGcACA 4140

ATGACTTCAT CACGATGACC TTCACCATTA TTCTCATGGT AAGCTTTAAA TATCATCAAT 4200

GCAGTCCATT CACCATGTGC GCCAGCAGCT GGTTGTAATG TCACCTCATC CATACCAGTA 4260

ATTTCTTTTA ATTCTTCTTG CAAACTATAA ATAATTTCTA ATGAACCTTG AACTTGATCT 4320

TCATCTTGTA ATGGATGTGA TTCACTAAAT CCTGGTATTC TAGCAACCTT TTCATTAATT 4380

TTAGGGTTAT ACTTCATCGT ACATGAACCC AATGGATAAA ATCCGTTGTC TACACCGAAA 4440

TTTTTATTTG AAAGTTCAGT ATAATGACGT ACTAAGTCTA GTTCAGCAAC TTCAGGAAAC 4500

TCCGCTTTGT TTTTACGAAT AAATTTATCA TCTAACAATG ACTCAACAGA ATTTGTTTTA 4560

ATATCACTTT TTGGTAATGA ATATGCATAT CTGCCTTCAC GAGATCTTTC AAAAATTAAT 4620

GGACTTGATT TACTAGTCAT TTAACTCACC AGCCTTTTCT ACAAATGTAT CGATTTCATC 4680

TTTTGTTCTT AATTCAGTTA CAGCTATTAA CATGTGATTT TTAAAGTCGT CTGAAACAAC 4740

ACCTAAATCA AAACCACCGA TAATATTGTA CTTCACTAAT TCCTCGTTAA CTTGTTGAAT 4800

TGGTTTGTCA AATTTGACTA CAAACTCATT GmnAAGllTGT ACCATCTAAT ACTTCAAAAC 4860

CTTTTTTAAT AAATTGTTGT TTAGCATAGT TAGCATGTTC TATATTTTGA ACTGCAATAT 4920

CATAGATACC TTGTTTACCA AGTGCTGACA TTGCAATTGA TGaCGcTAAA GCATTTAATG 4980

CTTGGTTAGA ACAAATATTA GATGTCGCTT TATCGCGTCG AATATGTTGT TCACGTGCTT 5040

GTAATGTTAA TACAAAGCCA CGATTACCTT CATCATCTTG TGTTTGACCG ACTAATCTAC 5100

CTGGCACTTT ACGCATTAAC TTTTTCGTCG TTGCAAAATA TCCACAATGT GGCCCACCGA 5160

ATTGAGCAGG AATTCCGAAT GGCTGAGTAT CACCTACAAC AATATCTGCA CCAAATGAAC 5220

CTGGAGGTGT AAGTAATCCC AATGCTAATG GATTTGCATA TACGATAAAT AATGCTTTTT 5280

TATCTTCAAT AAAGCTATGA ATCTTTTCAA GATCTTCAAT TGAACCGTAA AAGTTTGGAT 5340

ATTGTACTGC AACAGCTGCT GTTTCATCAT CCACTGCTGC TTCTAATTTT TTCAAATCTG 5400

TAACAGTGCC ATCTAAATCG ATTTCCACTA CTTCGAATTC CTTACGCGTC TTAGCATAAG 5460

TATGAAGTAC TTGTAATGCT TGATAATGTA AACCTTTTGA GACTACAATT TTATTTTTCT 5520

TTGTTTGACT AAATGCTAAG ATACATGCTT CAGCAAAGCT AGTCATCCCA TCATACATAG 5580

AAGAATTTGC TACATCCATA TCTGTTAATT CACAAATTAA AGTTTGGAAC TCAAAAATGG 5640

CTTGTAATTC ACCTTGAGAA ATTTCCGGTT GATATGGCGT ATATGCTGTG TAAAATTCTG 5700

ATCTTGAAAT CATAGCATCC ACAACTGATG GCGCGTAATG ATCATAAACA CCAGCACCCA 5760

rAAATGATGT ATGCGTTTCT TTAGTGATAT tCTTGCTkGC AATGGGGATT TAAACnTCTA 5820

(2) INFORMATION FOR SEQ ID NO: 67:
(i) SEQUENCE CHARACTERISTICS: 



(A) LENGTH: 18355 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 67:








ATnATAATTG GCTTTGCTAA TAATTACTTC CCTGAATTAC aAGTATTAGC AAACGAAATA 60

AAATCTGATA TGGCTAGTTC ATTAAAACAA TGATATTTTT ATTTAAATTT TTaAAGCTTT 120

GTACGAAATT GTACAAAGCT TTTTTGGTGC GTATTGTATG GGCAACAACT TGACGATGAA 180

AATCCGTTAC AGGATTGGTA ATAGGAAATG TTAGCGAAAG ACAAGGGTAT CCATTGTAGA 240

TTAACAAAAG GACGTTTCCA CAAGTGTGGG TTATTCTCAC TAAAGCAATA CGCAGAGACA 300

ACTTACGTAA AATTTTGAAC TGACTAGAAC GGAACTTCTA CTCAATTATT GATAAAAATT 360

TTCAAAAAGA CTTGAATGTG CTGAGAATAC GAAGTTTATG GAAGGATTAT CAAAATATAA 420

ATGTGCATTC ATTTACAACC TTTATTGACA ATGATTCTCA ACTAATATAG TATATAATCA 480

AATCGTAATA GTTACGATTT GTTTTCTGCA ACTTTTTTGA AGTTTTAGTT GAGGTGAAAA 540

CAATAAAAGC ATCTAAGTGA ATGTAGTTAA CGGACAACTG CATTCGCTTG TAGAGCCACA 600

AGAAGCAACT TTAAATAAGG TTTACGGTTG CATTTTGATA CAACAACCGA TTACTAAGTC 660

ATGCTTTCCA CTTTGCGGGT TAGCATGACT TACCTAATAG ATAGAGCTAT TAGGTTCAGC 720

TTCTAAAAAA TTACAGTTTT AGAGGAATAC AGTTGcTTGc tTCGCAACAA CTGCATAAGA 780

GCCATGGTTT TCGCTTTTGC GAATTAGCAT GACTTACCTA CTAGATAGAG CTATTAGGTT 840

CATCTTCTAA AAAATTACAG GTTTAGAGGA ATACAGTTGT TTGcTTCGCA ACAACTGCAT 900

AAGAGCCTCT AGTAATTAAA ATTACAGAGG CTCTAAAAAT ACATCTAAAG GAGTGTCGTA 960

TGAATCGGCA GGTTATAGAA TTTTCTAAGT ATAATCCTTC GGGGAATATG ACGATACTTG 1020

TTCATTCAAA ACATGATGCT AGTGAATATG CATCTATCGC CAATCAGTTG ATGGCCGCAA 1080

CACATGTATG CTGTGAACAG GTAGGCTTTA TAGrATCAAC ACAAAATGAT GATGGTAATG 1140

ATTTTCACTT AGTTATGAGC GGTAATGAAT TTTGCGGTAA TGCGACGATG TCATATATAC 1200

ATCATTTGCA GGAAAGTCAT TTGCTTAAAG ACCAACAGTT TAAGGTGAAG GTGTCTGGCT 1260

GTTCGGATTT AGTGCAATGC GCAATTCATG ATTGCCAATA CTATGAAGTT CAAATGCCAC 1320

AAGCCCATCG TGTTGTGCCA ACAACAATTA ATATGGGTAA TCATTCATGG AAAGCAATAG 1380

TTCAACATTT GGTTGAAGCG TTTGTGCGTG AgcAACAATG GAGTCACAAA TATAAAACAG 1500

TAGGTATGAT GCTTTTTGAT GAACAACGTC AATTTTTACA GCCATTAATC TATATACCAG 1560

AAATTCAAAG TTTAATTTGG GAAAATAGCT GTGGTTCTGG TACAgcATCA ATTGGGGTTT 1620

TTAATAATTA TCAACGTAAT GACGCATGCA AAGATTTTAC AGTACATCAG CCAGGGGGCA 1680

GTATTTTAGT GACATCAAAG CGATGTCATC AATTGGGATA TCAAACTTCA ATTAAAGGAC 1740

AGGTTACAAC TGTAGCTACA GGaAAAGCAT ATATAGAATA AGGAGCCTAC AATGAATAAC 1800

TTTAATAATG AAATCAAATT GATATTACAA CAATATTTAG AAAAGTTTGA AGCGCATTAC 1860

GAGCGTGTAT TACAAGACGA TCAATATATC GAAGCATTAG AAACATTGAT GGATGACTAT 1920

AGTGAATTTA TTTTAAATCC TATTTATGAA CAACAATTTA ATGCTTGGCG TGACGTTGAA 1980

GAAAAAGCAC AATTaATAAA ATCACTGCAA TATATTACAG CGCAGTGTGT TAAACAAGTG 2040

GAAGTCATTA GAGCGAGACG TCTATTAGAC GGACAGGCGT CTACCACAGG TTACTTTGAC 2100

AATATAGAAC ATTGTATTGA TGAAGAGTTT GGACAATGTA GTATAGCTAG CAATGACAAA 2160

TTATTGTTAG TTGGTTCAGG TGCATATCCA ATGACGTTAA TTCAAGTAGC AAAAGAAACA 2220

GGTGCTTCAG TTATCGGTAT TGATATTGAT CCACAAGCCG TTGACCTAGG GCGCAGAATC 2280

GTTAACGTCT TAGCACCAAA TGAAGATATA ACAATTACGG ATCAAAAGGT ATCTGAACTT 2340

AAAGATATCA AAGATGTGAC GCATATCATA TTCAGCTCGA CAATTCCTTT AAAGTACAGC 2400

ATTTTAGAAG AATTATATGA TTTAACAAAT GAAAATGTCG TAGTTGCAAT GCGCTTTGGT 2460

GATGGCATCA AAGCAATATT TAATTATCCG TCACAAGAAA CAGCGGAAGA TAAGTGGCAA 2520

TGTGTGAATA AACATATGAG ACCACAGCAA ATTTTTGATA TAGCACTTTA TAAAAAAGCA 2580

GCTATAAAGG TAGGTATTAC GGATGTCTAA ATTATTAATG ATAGGCACTG GTCCgGTCGC 2640

AATGCAATTA GCGAATATTT GCTATTTAAA ATCAGATTAT GAGATTGATA TGGTTGGACG 2700

TGCCTCAACA TCAGAAAAAT CAAAACGCTT ATATCAAGCG TATAAAAAAG AGAAACAATT 2760

TGAAGTCAAA ATACAAAACG AGGCGCATCA ACATCTGGAA GGTAAGTTTG AAATTAATCG 2820

TTTGTATAAA GATGTTAAAA ACGTTAAGGG TGAATACGAA ACGGTTGTCA TGGCATGCAC 2880

AGCAGATGCT TATTATGACA CACTACAGCA ATTGTCGTTA GAAACTTTGC AAAGTGTCAA 2940

ACATGTCATT TTAATATCAC CGACATTTGG TTCGCAAATG ATTGTCGAAC AATTTATGTC 3000

TAAATTTAAT AAAGATATCG AAGTGATTTC ATTCTCAACT TATCTTGGCG ATACACGTAT 3060

TGTTGATAAA GAAGCGCCTA ATCATGTGTT GACAACAGGT GTAAAAAAGA AATTGTACAT 3120

GGGATCGACA CATTCAAACT CAACAATGTG TCAACGAATC TCTGCTTTAG CTGAGCAATT 3180

TTATGTGCAC CCACCACTAT TTATGAATGA CTTTTCATTG AAAGCCATTT TCGAAGGAAC 3300

AGATGTACCG GTTTATGTGT ATAAGTTATT TCCTGAAGGA CCGATAACGA TGACACTAAT 3360

CCGTGAAATG CGTTTAATGT GGAAGGAAAT GATGGTTATT TTACAAGCAT TTAGAGTGCC 3420

GTCAGTCAAC CTGCTTCAAT TTATGGTGAA GGAAAATTAT CCAGTACGTC CTGAAACTTT 3480

GGATGAAGGT GATATTGAGC ATTTCGAAAT CTTGCCAGAT ATCTTACAAG AATATCTGCT 3540

TTATGTAAGA TATACCGCAA TCCTCATTGA TCCATTTTCA CAGCCAGACG AAAACGGACA 3600

TTACTTTGAT TTTTCAGCTG TACCATTTAA GCAAGTCTAT AAAAATGAAC AGGATGTTGT 3660

TCAAATTCCA AGAATGCCAA GTGAAGATTA TTACAGAACG GCGATGATTC AGCATATTGG 3720

GAAAATGCTA GGTATCAAAA CGCCAATGAT TGATCAGTTC CTAACTCGCT ATGAAGCAAG 3780

TTGCCAGGCG TACAAGGATA TGCATCAAGA TCAACACTTA TCTTCTCAAT TTAATACAAA 3840

TCTATTTGAA GGAGATAAAG CACTCGTCAC AAAATTTTTG GAAATCAATA GAACGCTTTC 3900

ATAATAAGGG TTTGAAGTTT TATAATAGAA AAAAATTATT GAATTATGTT TGACATTTAC 3960

ATAAAAATAA GCAAATAATT GAGAAAAATA ATCATTACGA TTTGATTAAG TAATGCAACT 4020

TATCAATTTA GAAAGAGGAA AAGCAAATGA GAAAACTAAC TAAAATGAGT GCAATGTTAC 4080

TTGCATCAGG GCTAATTTTA ACTGGTTGTG GCGGTAATAA AGGTTTAGAG GAGAAAAAAG 4140

AAAACAAGCA ATTAACGTAT ACGACGGTTA AAGATATCGG TGATATGAAT CCGCATGTTT 4200

ACGGTGGATC AATGTCTGCT GAAAGTATGA TATACGAGCC GCTTGTACGT AACACGAAAG 4260

ATGGTATTAA GCCTTTACTA GCTAAAAAGT GGGATGTGTC TGAAGATGGG AAGACATACA 4320

CGTTCCATTT GAGAGATGAC GTTAAATTCC ATGATGGTAC GCCATTTGca TGctGACGCA 4380

GTTAAGAAAA ATATTGACGC AgTTCAAGAA AACAAAAAAT TGCATTCTTG GTTAAAGATT 4440

TCGACATTAA TTGACAATGT TAAAGTTAAA GATAAGTACA CGGTTGAATT GAATTTGAAA 4500

GAAGCATATC AACCTGCATT GGCTGAATTA GCGATGCCTC GTCCATATGT ATTTGTGTCT 4560

CCAAAAGACT TTaAAAACGG TACAAcAAAA GATGGCGTTA AAAAGTTCGA TGGTACTGGT 4620

CCATTTAAAT TAGGTGAACA CAAAAAAGAT GAGTCTGCAG ACTTTAACAA AAATGATCAA 4680

TACTGGGGCG AAAAGTCTAA ACTTAACAAA GTACAAGCAA AAGTAATGCC TGCTGGTGAA 4740

ACAGCATTCC TATCAATGAA AAAAGGTGAA ACGAACTTTG CCTTCACAGA TGATAGAGGT 4800

ACAGATAGCT TAGACAAAGA CTCTTTAAAA CAATTGAAAG ATACAGGTGA CTATCAAGTT 4860

AAGCGTAGTC AACCTATGAA TACGAAAATG TTAGTTGTCA ATTCTGGTAA AAAAGATAAC 4920

GCTGTGAGTG ACAAAACAGT CAGACAAGCG ATTGGTCATA TGGTAAACAG AGATAAAATT 4980

ACAGACATTA ATTTCGATAT GCCAACACGT AAGTATGACC TTAAAAAAGC AGAATCATTA 5100

TTAGATGAAG CTGGTTGGAA GAAAGGTAAA GACAGCGATG TTCGTCAAAA AGATGGTAAA 5160

AACCTTGAAA TGGCAATGTA CTATGACAAA GGTTCTTCAA GTCAAAAAGA ACAAGCAGAA 5220

TACTTACAAG CAGAATTTAA GAAAATGGGT ATTAAGTTAA ACATCAATGG CGAAACATCA 5280

GATAAAATTG CTGAACGTCG TACTTCTGGT GATTATGACT TAATGTTCAA CCAAACTTGG 5340

GGATTATTGT ACGATCCACA AAGTACTATT GCAGCATTTA AAGAGAAAAA TGGTTATGAA 5400

AGTGCAACAT CAGGCATTGA GAACAAAGAT AAAATATACA ACAGCATTGA TGACGCATTT 5460

AAAATCCAAA ACGGTAAAGA GCGTTCAGAC GCTTATAAAA ACATTTTGAA ACAAATTGAT 5520

GATGAAGGTA TCTTTATCCC TATTTCACAC GGTAGTATGA CAGTTGTTGC ACCaAAAGAT 5580

TTAGAAAAAG TATCATTCAC ACAATCACAG TATGAATTAC CATT t CAATGA AATGCAGTAT 5640

AAATAAAGGA GCAATTAGAT GTTCAAATTT ATCTTAAAAC GTATTGCGCT CATGTTTCCA 5700

TTGATGATTG TAGTAAGTTT TATGACATTT CTATTGACGT ATATTACAAA TGAAAATCCA 5760

GCTGTGACAA TTTTACATGC ACAAGGGACG CCAAATGTAA CACCAGAGTT GATTGCAGAA 5820

ACGAATGAGA AGTACGGTTT CAATGATCCA TTATTAATTC AATATAAAAA TTGGTTACTT 5880

GAAGCGATGC AATTTAATTT TGGTACAAGC TACATTACAG GTGACCCAGT TGCTGAACGT 5940

ATTGGTCCAG CATTTATGAA TACATTGAAA TTAACAATAA TTTCAAGTGT TATGGTGATG 6000

ATTACATCAA TTATTTTAGG TGTAGTTAGT GCATTAAAAA GAGGAAAGTT CACTGATCGT 6060

GCGATACGTT CAGTGGCTTT CTTTCTAACT GCATTACCAT CATATTGGAT AGCTTCAATA 6120

CTTATTATTT ACGTTTCAGT GAAGTTAAAC ATATTGCCGA CTTCTGGATT AACAGGTCCA 6180

GAAAGTTACA TATTGCCAGT GATCGTTATT ACGATTGCCT ATGCTGGTAT TTACTTTAGA 6240

AATGTTAGAC GCTCGATGGT GGAACAATTA AATGAAGATT ATGTACTTTA TTTAAGAGCA 6300

AGCGGTGTGA AATCTATCAC ATTAATGTTG CATGTGTTGC GTAATGCTTT ACAAGTTGCG 6360

GTATCAATCT TTTGTATGTC TATACCAATG ATAATGGGTG GACTAGTTGT TATCGAGTAT 6420

ATCTTTGCAT GGCCTGGACT AGGTCAATTA AGTTTAAAAG CAATACTTGA ACACGATTTT 6480

CCAGTCATTC AAGCATATGT ATTAATTGTA GCGGTATTAT TTATTGTATT TAATACATTA 6540

GCAGATATCA TTAATGCGCT ATTAAATCCA AGATTAAGGG aGGGCGCACG ATGATAATTT 6600

TAAAmCGATT ATTmCArGwT AAAGGTGCAG TAATTGCTTT AGGCATTATT GTATTATATG 6660

TCTTTTTAGG ATTAGCAGCA CCACTTGTGA CATTTTATGA TCCTAACCAT ATCGATACAG 6720

CAAACAAATT TGCTGGCATG AGTTTTCAAC ATCTACTAGG TACTGACCAT TTAGGTAGAG 6780

TATTTGTTTC TGTACTTATT GGATCTATTT TAGGATTCTT ATCAGGATAT TTCCAAGGGT 6900

TTGTTGACGC CTTAATCATG CGTGCGTGTG ATGTTATGTT GGCATTCCCA AGTTATGTTG 6960

TAACGTTAGC ATTAATTGCA TTGTTTGGAA TGGGTGCCGA AAATATTATC ATGGCATTTA 7020

TTTTGACGCG TTGGGCATGG TTCTGTCGTG TTATACGTAC AAGTGTTATG CAGTACACTG 7080

CTTCTGACCA TGTAAGATTT GCTAAAACAA TCGGTATGAA TGATATGAAA ATTATTCACA 7140

AACATATTAT GCCATTAACA TTAGCAGATA TTGCTATCAT CTCTAGTAGC TCGATGTGTT 7200

CAATGATCTT GCAAATATCT GGCTTTTCAT TTTTAGGATT AGGTGTCAAA GCGCCTACTG 7260

CAGAGTGGGG CATGATGCTT AACGAaGCTA GAAAAGTGAT GTTTACACAT CCTGAAATGA 7320

TGTTTGCGCC AGGTATTGCC ATAGTGATTA TAGTGATGGC ATTTAACTTC TTATCCGATG 7380

CTTTACAAAT TGCTATTGAT CCCCGCATCT CTTCTAAAGA TAAACTTCGT TCTGTGAAAA 7440

AAGGAGTGGT GCAATCATGA CATTGTTAAC AGTTAAACAT TTGACGATTA CAGATACCTG 7500

GACAGATCAA CCACTCGTGA GTGATGTGAA TTTTACATTA ACTAAGGGTG AAaCTTTAGG 7560

CGTTATTGGA GAAAGTGGTA GTGGTAAATC AATCACTTGT AAATCGATTA TTGGTTTGAA 7620

TCCCGAACGA CTCGGGGTGA CAGGTGAAAT TATCTTTGAT GGTACAtCAA TGTTGTCATT 7680

ATCTGAATCG CAATTGAAAA AGTACCGTGG TAAAGACATT GCGATGGTCA TGCAACAAGG 7740

TAGTCGTGCC TTTGACCCAT CAACTACTGT CGGTAAACAA ATGTTTGAGA CTATGAAAGT 7800

ACATACGTCA ATGTCTACAC AAGAAATTGA AAAGACATTG ATTGAATATA TGGATTATTT 7860

AAGTTTGAAA GATCCTAAAC GTATATTAAA ATCATACCCT TACATGTTAT CAGGAGGAAT 7920

GTTACAGCGA TTGATGATTG CTTTAGCGTT AgcTTTgAAA CCAAAGTTAA TCATTGCTGA 7980

TGAGCCGACA ACGGCTTTAG ATACAATTAC ACAATATGAT GTACTGGAAG CATTTATAGA 8040

TATTJVAAAAA CACTTTGACT GTGCGATGAT TTTCATTTCA CATGATTTAA CGGTTATTAA 8100

CAAGATTGCA GACCGTGTTG TTGTGATGAA AAATGGTCAG CTTATTGAAC AAGGGACACG 8160

TGAATCAGTC TTGCATCATC CAGAACATGT TTATACGArt ATTkTATTAT CAACGAAGAA 8220

GAAGATTAAT GATCATTTTA AACATGTGAT GAGGGGTGAT GTACATGATT AAAATTAAAG 8280

ATGTTGAAAA GTCATATCAA AGCGCACATG TTTTTAAGCG TCGTCGAACA CCTATCGTGA 8340

AAGGTGTGTC ATTTGAGTGT CCAATCGGTG CGACGATTGC GATTATCGGA GAAAGTGGTA 8400

GCGGTAAATC GACGTTGAGT CktATGATAT TAGGTATTGA GAAACCGGAT AAAGGTTGTG 8460

TAACCTTAAA TGATCAACCG ATGCATAAGA AGAAAGTGAG ACGTCATCAA ATTGGTGCTG 8520

TATTTCAAGA TTATACGTCA TCATTACATC CATTTCAGAC TGTTAGAGAA ATCTTATTTG 8580

TGTTGGAAGA AGTCGGTCTA TCTAAGGCAT ACATGGATAA ATATCCTAAT ATGTTATCAG 8700




GTGGAGAGGC GCAACGTGTT GCGATTGCGC GTGCAATATG TATTAACCCT AAATATATTT 8760

TGTTTGATGA AGCCATTAGT TCACTCGACA TGTCAATTCA AACACAAATA TTAGATTTAT 8820

TGATTCATTT ACGTGAAACG CGTCAGTTGA GTTATATTTT TATCACACAT GATATTCAAG 8880

CTGCCACGTA TTTATGTGAT CAATTAATTA TTTTTAAAAA CGGAAAAATA GAAGAACAAA 8940

TTCCGACAAG CGCATTGCAT AAAAGTGACA ATGCTTATAC AAGAGAATTA ATAGAAAAAC 9000

AACTATCATT CTAAGGAGTG AGATAATGAA AGGTGCAATG GCTTGGCCCT TTTTGAGATT 9060

ATATATATTA ACATTGATGT TCTTTAGTGC CAATGCAATC TTAAACGTGT TTATACCTTT 9120

ACGAGGGCAT GATTTAGGCG CAACGAATAC GGTTATCGGT ATCGTTATGG GGGCATACAT 9180

GTTAACAGCA ATGGTATTTC GACCATGGGC AGGACAAATT ATTGCTCGTG TCGGTCCCAT 9240

TAAAGTATTA AGAATTATTT TGATTATCAA TGCCATAGCT TTAATTATTT ATGGTTTTAC 9300

TGGCTTAGAA GGTTATTTCG TAGCACGTGT TATGCAAGGT GTGTGTACGG CATTCTTTTC 9360

TATGTCTTTA CAGCTAGGTA TTATTGATGC ATTACCAGAG GAACATCGTT CTGAAGGTGT 9420

ATCATTGTAC TCGCTATTTT CAACGATTCC AAACTTAATC GGACCATTAG TTGCCX3TAGG 9480

TATTTGGAAT GCAAATAATA TTTCACTATT TGCAATTGTC ATTATCTTTA TCGCATTAAC 9540

AACAACATTC TTTGsTATCG CGTGACCTTT GCTGAACAGG AACCCGATAC GTCAGATAAG 9600

ATTGAAAAAA TGCCGTTTAA CGCTGTAACT GTTTTTGCGC AATTTTTCAA AAATAAAGAG 9660

TTGTTGAACA GTGGTATTAT CATGATTGTT GCATCGATTG TATTTGGTGC AGTTAGTACA 9720

TTTGTACCGT TATACACAGT GAGTTTAGGA TTCGCGAATG CGGGAATCTT TTTGACAATA 9780

CAGGCCATCG CAGTTGTTGC GGCAAGATTT TACTTAAGGA AATACATTCC GTCAGATGGT 9840

ATGTCGCATC CTAAATATAT GGTATCTGTA CTATCATTAT TAGTAATCGC GTCATTTGTA 9900

GTGGCATTTG GTCCGCAAGT AGGTGCAATT ATTTTCTATG GTAGTGCGAT ATTAATAGGA 9960

ATGACGCAAG CAATGGTGTA CCCAACATTA ACATCATACT TAAGCTTCGT CTTACCAAAA 10020

GTAGGTCGTA ATATGTTGTT AGGTTTATTT ATTGCCTGTG CAGACTTAGG TATATCGTTA 10080

GGTGGCGCAT TGATGGGACC TATTTCCGAT TTAGTAGGAT TTAAATGGAT GTATCTAATT 10140

TGTGGTATGT TAGTCATTGT AATAATGATT ATGAGTTTCT TGAAAAAGCC AACACCACGT 10200

CCAGCGAGTA GTCTTTAATG AAGTGAATTA AAGCATATTA AGTTAATGAA TATTTAAATT 10260

TTAAAAGGTA TATTGaGCAT GGCGATTCAT GTGCTTCATG CTAGGACATG AAACATTCTA 10320

TATGGCTCGT TTTTAGAACG ACAtATATCT AAATAAAGCA CGCTTArAAG TGAGTTTTGA 10380



TTACATGAAA ATATGCAAAA CGAGTATAAC TGCTAATTGA TAGAAATAGC TCACCATAAA 10500

ATTACGGTAT GATTTTAAAT ATAAGTAAGT CGCACTACCT GCTAGTATCA ATGCTGGAAT 10560

GAATTCCCAC CATGTATTAA TGTATGGATA GTAGAACAGA GTTTCAAGGA TAATGGACAA 10620

TACTATTGTA ATCTTTAAAG GTATTAATCT GCTTAATTCT TGAATTAAAA TATGACGGAA 10680

AATAAGTTGA CAAATCAAAG TATTTAATAT AATGGTTAAC GAAAATATAG CTATTAAACT 10740

GATGGAaCCA TACCCTTTAA TGAGCGGGTA AATGTCAAAG ACAGTAAAGG AATCTACATT 10800

TAGTGCGAAA ATATTGAAAT GATTTAAAAG TAAAAAGAGT ACGACACTTA GTGTAAATGA 10860

TATAAGAATA TGCCATTTAT ATTTAGCACT AGCAACGATT TGCGAACGTA TCATTGGAAT 10920

AAACGCATCT TCATGCATCA GACGAAAAAT AGCTAGTGAA ATAATAACTG CGAGTAAATA 10980

GCTAATGTTC ATTGAAATAG GAAAAGAGAA ACCCCACGGA GCTTGTTGAG TGAATACAGC 11040

TACTAACCCA AAAGTTAAAA AGACGATAAT GATCGGCAAG ATGTTAACCA AAAATATGTA 11100

AAGGAAAATA AATCCAATAT CACGTTTGAA AAAACGCGAT TGTTCGGTAG CGTATTCTTC 11160

TTCTATGTAA TGTTTATTTG TATTTGACAT AGTATACCTC TTAAATAGTT GTATTATATA 11220

GATACTTTAG CACATATTAC TTTGTATTGT ATGTTTTATA CATTAAAATT TAAAATGAAA 11280

AACATATCAT AAAATTGTTT TATAAAATGA AGCGCTTCCA TTGTGTTTTG TTTTGTAAGG 11340

TGTATCATAA ATATTGAATT GAAATTTTGG GGGGAGGTAT TGTAATGACG TTTCTTACAG 11400

TCATGCAATT TATAGTTAAC ATTATCGTTG TAGGATTCAT GCTTACGGTT ATTGTTATCG 11460

GGCTTATTTG GTTAATTAAA GATAAAAGAC AATCACAACA TAGTGTATTA AGGAATTATC 11520

CTTTACTAGC ACGTATTAGA TATATTTCAG AAAAAATGGG ACCGGAATTA CGTCAGTATT 11580

TATTTTCTGG GGATAATGAA GGGAAACCTT TTTCACGTAA TGATTATAAA AATATCGTTT 11640

TGGCfiGGAAA ATATAACTCT CGTATGACCA GCTTCGGTAC TACTAAAGAT TATCAAGACG 11700

GCTTTTACAT ACAGAACACA ATGTTTCCGA TGCAACGTAA TGAGATTTCA GTAGATAATA 11760

CAACATTGTT ATCAACATTC ATTTATAAAA TCGCGAATGA GCGTTTATTT AGTCGTGAAG 11820

AATATCGTGT GCCGACAAAG ATTGATCCGT ATTACTTAAG TGATGACCAT GCAATAAAAT 11880

TAGGTGAACA TTTAAAACAT CCATTTATTT TAAAACGTAT CGTAGGACAA TCTGGTATGA 11940

GTTATGGCGC TTTAGGAAAA AATGCCATTA CAGCTTTATC TAAAGGTCTA GCTAAAGCGG 12000

GCACTTGGAT GAATACAGGT GAAGGTGGCT TATCAGAATA TCATTTAAAA GGTAATGGGG 12060

ATATCATTTT CCAAATTGGT CCCGGTTTAT TTGGTGTTCG TGATAAAGAA GGTAATTTTA 12120

GTGAAGGTTT ATTTAAAGAG GTTGCACAGT TATCTAACGT ACGCGCATTT GAGCTGAAGT 12180

TTGCTAAAAT CCGAAATGTT GAACCTTATA AAACAATCAA TTCACCTAAC CGTTACGAAT 12300

TTATTCATAA TGCTGAAGAT TTGATTCGTT TCGTCGATCA GTTGCAGCAA TTAGGTCAAA 12360 

AACCAGTAGG ATTCAAAATT GTAGTAAGCA AAGTTTCAGA AATTGAAACA CTTGTACGTA 12420 

CGATGGTGGA ACTAGATAAG TATCCAAGCT TTATTACGAT TGATGGTGGT GAAGGTGGTA 12480 

CTGGTGCAAC ATTCCAAGAA TTACAAGATG GTGTTGGCTT ACCGCTATTT ACAGCTCTAC 12540 

CTATTGTGTC TGGCATGTTA GAAAAATATG GTATTCGAGA TAAAGTGAAA TTGGCGGCAT 12600 

CTGGTAAGTT AGTGACACCA GATAAAATTG CGATTGCACT AGGTTTAGGT GCAGATTTTG 12660 

TAAATATCGC ACGTGGGATG ATGATTAGTG TCGGTTGTAT AATGAGTCAA CAATGTCACA 12720

TGAATACGTG TCCTGTAGGT GTTGCAACGA CAGATGCGAA GAAAGAAAAA GCATTGATTG 12780 

TTGGAGAAAA GCAATATCGT GTCACAAACT ATGTAACAAG TTTGCATGAA GGCTTATTCA 12840 

ATATTGCAGC AGCTGTTGGC GTATCCAGTC CTACAGAAAT TACTGCTGAT CATATTGTAT 12900 

ATCGAAAAGT CGATGGTGAG TTACAAACGA TACATGATTA TAAATTAAAA CTCATTAGTT 12960 

AACTTAATTA TTTCGGGAAA TTGAAAGCAG CGGATTTTAG CGTTACTGCA AATAATTTTA 13020 

TATTAGTAGT GGATGCTGGT CACACAAGAA CTTCAAATAT TAAAGCCCTC AGAATATGAA 13080 

TTAAGGTTTG TAACCTTAGT CTTATCTGAG GGCATTTTTA AGTTATAAAC TATTTGTCGT 13140 

CCATTTTATC TTTTTCTTTT AAACCTCTGT GCTTTAATTG CTTTTCAAGT TTTTCAAAAC 13200 

TAATATCTTT ATTTTCTTTA GTCGAAACAC CAAGACGTTT ATTTAATTTT TTCATGTCAA 13260 

CTTCTGTGTA ATCTATGTCT AAGTGyTCAA TTGCTTTTTT ATCTTTATAG TCTACTTTGT 13320

ATTTTACGCC TTTAAGGTCT TTGAAAATAC TTTCAGATTT GGCGAATAAC TTTTTGGCTT 13380 

CGTCTTTATC CATACCTAGA TCGTCATATT TAATTGTGTT GATTGTAGAC TGTTTTAAAA 13440 

CTTfATCATC TTTATATGTG ATAGAAGTTA GTACATGTTT ACCACTAACA TCACCwTCAT 13500

ATGTTTTGGT TTGTTCTTTA CCACAAGCTG ATAATGCAAT GATACAAACT AATGCTACTA 13560 

CAATTAATGA ACATAATTTT TTCAAAGTCA GTCGCCTTCT TTCGATATTT GTATTATAAA 13620

GAAATTATAA CATTTACTAA AAAATGATGT TATTCAAAAA TTTAAATTTT GTCATTTTTT 13680 

TTGAAGATAT GAGTTTTTTT AAGCGGATTC CTCACAAAAT TTTAAAAATA TTTAAGCCTk 13740 

AAAATGATAA AGCGkTAGGG AACGTTTTTC TGAAAGTTAG TGATACAATA GTTTTAAGTT 13800 

GAAATACAGG AGGATGAATA ACATGAATCA GTCAGTCAAA TTACTTAAAC ATTTAACAGA 13860

TGTAAACGGC ATTGCTGGTT ATGAAATGCA AGTTAAAGAA GCAATGCGTa ACTATATAGA 13920 

GCCTGTCAGT GATCAAATTA TTGAAGATAA CTTGGGTGGC ATTTTTGGAA AGAAAAATGC 13980

AACAAAGATT GATAAACATG GTTTTATTTC ATTTACGCCA kTgGTGGATG GTGGAATCAA 14100

GTCATGCTAT CTCAAAAAGT AACGATTACA ACAGATTCGG GCAAAGAAAT TAGAGGTATC 14160

ATCGGTTCTA AACCGCCACA TGTCTTAACG CCTGAAGAAC GTAAAAAGCC AATGGAAATC 14220

AAAAATATGT TTATAGATAT TGGTGTTAGT AGCAAGGAAG AAGCTGAAGA AGCTGGCGTT 14280

GAAGTAGGCA ATATGGTTAC GCCATATAGT GAATTTGAAG TGCTTGCAAA TGATAAATAT 14340

TTAACTGCGA ArCATTTGAT AATCGCTATG GCTGTGCATT AGCTATTGAG GTATTAAAAC 14400

GTTTAAAAGA TGAAAATATT GGCATTAACT TATACAGTGG TGCCACAGTG CAAGAAGAAG 14460

TTGGTTTGCG TGGTGCGAAA GTGGCAGCGA ATACGATTAA ACCAGACTTG GCGATAgcTG 14520

TcGATGTAGG TATTGCTTAT GATACCCCAG GTATGTCAGG TCAAACGAGC GATAGTAAAC 14580

TAGGCGGTGG TCCAGTTGTC ATTATGATGG ATGCTACAAG TATTGCTCAC CAAGGTTTGC 14640

GAAAgcATaT TAAAGATGTA GCTAAGGAAC ATAACATCGA AGTACAATGG GATACGACAC 14700

CAGGTGGAGG TACAGATGCG GGAAGTATTC ATGTCGCAAA TGAAGGTATT CCAACGATGA 14760

CAATCGGTGT TACGCTGCGA TACATGCATT CTAATGTTTC AGTGCTCAAT GTAGATGATT 14820

ATGAAAATTC TATCCGTCTT GTTACTGAAA TTGTCCGTTC ATTGAATGAT GAAAGTTATA 14880

AAAATATCAT GTGGTAATCA AATCCATAAA TAATAAAGAA TCCTTTTAAT ATGGTAGGTT 14940

GTTAAACAAT TGTCTAATTT TAATTCTTAG TCATTAGACA GTATCCATGT TAATAGGATT 15000

TTTTGTTTTT AATTTAAATG CTGAAAATCA ATTATGCCTA AATTTTGATA TTACAAGAAA 15060

ATGATTTTTT CTTAAATGTA ATTGCACTAA AAACCAAAAA AACGGGAATA ATATACCTGA 15120

TATATTACAT GAGGAGCGGT GCAAATGTTG TTAGAAATTA AAGATTTAGT GTATAAAGCG 15180

AGCGATAGAA TCATACTAGA TCATATCAGT CTAAAAGTAG ATAAAGGCGA GAGTATTGCC 15240

ATTATAGGTC CATCAGGTAG TGGTAAAAGT ACATTTCAAA AGCAAATATG TAATTTGTTT 15300

AGTCCAACTA GTGGAGAACT TTATTTTAAA GGTAAACCCT ATAATGATTA TGACCCGGAA 15360

GAATTGCGTC AACGAATCAG TTATTTGATG CAGCAAAGTG ACTTGTTTGG TGAAACGATT 15420

GAAGATAACA TGATATTCCC ATCACTTGCA CGTAATGATA AATTTGATAG AAAACGTGCA 15480

AAGCAATTAA TTAAAGATGT CGGTTTGGGA CATTATCAAT TAAGTTCGGA AGTGGAAAAT 15540

ATGTCGGGTG GTGAGCGGCA AAGAATTGCT ATAGCGCGCC AACTGATGTA TACACCGGAT 15600

ATTCTTTTAT TAGATGAATC GACCAGTGCA TTAGACGTTA ATAATAAAGA AAAGATAGAA 15660

AATATCATTT TTAAATTAGC AGATCAAGGC GTGGCAATTA TGTGGATTAC CCACAGCGAT 15720

GACCAAAGTA TGCGACACTT TCAAAAGCGT ATAACAATTG TTGATGGTCA AATTTCTAAT 15780

CATTCCGATT ATCATTTCAT ATAAAGAAGG TTTACATATT ATTAAAGATT TAATTGTTGC 15900




GACATTACGA GCAGTTGTGC AATTAATCAT TTTGGGATTT TTGCTGCATT ATATTTTTAA 15960

AATAAACGAT AAATGGCTGC TTATTTTATG TGTATTGGTC ATTATTATTA ATGCATCATG 16020

GAATACAATT AGTCGAGCAT CACCAGTGAT GCATCATGTG TTTTGGATAT CATTTCTAGC 16080

TATCTTCATT GGAACGGCAT TACCGCTTGC AGGTACTATT GCGACAGGGG CCATTCAATT 16140

TACCGCAAAT GAAGTTATAC CTATCGGCGG CATGCTTGCA AATAATGGCT TGATTGCAAT 16200

TAATTTAGCT TACCAGAATT TAGATCGTGC ATTCGTACAA GATGGTACTA ATATTGAATC 16260

TAAATTATCA CTTGCAGCTA CACCTAAATT GGCTTCTAAA GGTGCAATAC GTGAAAGTAT 16320

TCGTTTAGCT ATAGTGCCAA CTATTGATTC GGTTAAAACA TATGGGCTTG TGTCGATTCC 16380

TGGTATGATG ACAGGCTTAA TTATTGGTGG CGTACCACCT TTACAAGCGA TTAAATTTCA 16440

ATTGTTAGTC GTGTTTATTC ATACAACTGC GACCATTATG TCTGCTTTGA TTGCGACATA 16500

TTTAAGCTAT GGTCAATTTT TCAATGCAAG ACATCAATTA GTAGCACGAA ATACTGATGT 16560

TAAGAGTGAA TCATGATAGA TTTTACTGCA TCAGATTTAG GCATTAGTTT TAATTGGAAA 16620

TGAAGTGACG CGCACATATA GTATCGCTAT TCATTAGCGC AGCGAAAATA TTCATAAAGG 16680

CACGCATACT TTGTAGTCAG TTATCTGTTC TGACATATAA AGCGTGCGTG CTTTTTTGGA 16740

GTTATTGTTG AAACTGAAGT AATTATACAT AATTATTAAA TGACATACTT GTGTTAATTT 16800

TTCAAATACT GAAAAACAAT TTCaATAATT TTCCaATTAA GCACAGAAAA TTAAAGCAAA 16860

ATATTATATA ATAGAACGGT TATATATaAA nATTngTgCA CACATTTTTT AATAAATCGT 16920

TATTCTAAGG GAAATGAATA TCGGAAATTT TGTTTGAAAG GAGTTTTAAA TTGTCAATCA 16980

TGCGACTATT TACATTCATT TTAAGTATTT TTATCGTAGG AATGGTTGAA ATGATGGTTG 17040

CAGGAATTAT GAACTTGATG AGTCAGGACT TACATGTATC AGAAGCTGTC GTTGGTCAAT 17100

TAGTGACAAT GTACGCTTTA ACATTTGCGA TATGTGGACC TATTCTGGTT AAATTAACGA 17160

ACCGTTTTTC ATCAAGGCCT GTATTATTAT GGACATTACT TATATTTATC ATTGGTAATG 17220

GCATTATTGC TGTAGCGCCA AATTTTTCaA TATTAGTAGT TGGTAGAATT ATCTCATCTG 17280

CAGCAGCAGC ACTAATTATC GTAAAAGTAT TAGCTATTAC AGCGATGTTA TCAGCACCTA 17340

AAAATCGTGG TAAAATGATT GGACTTGTCT ATACAGGGTT TAGTGGTGCT AATGTTTTTG 17400

GTGTACCAAT TGGAACGGTT ATCGGCGATT TAGTAGGTTG GCGCTATACA TTTCTATTCT 17460

TAATTATTGT GAGTATTATT GTTGGCTTCT TGATGATGAT CTATTTACCG AAGGATCAGG 17520

AAATACAACG AGGCCCTGTG AATCATGAGA CACCATCTCA TGAAAATCAT GTTACTTCGA 17580

CAAACTCAGT GACATTCGTC TTTATAAATC CACTTATTTT ATCTAATGGT CATGATATGT 17700

CATTCGTTTC ATTAGCACTT CTAGTAAATG GAATCGCTGG CGTTATTGGA ACATCATTAG 17760 

GTGGTATATT CTCCGATAAA ATTACAAGTA AGCGTTGGTT AATGATTTCT GTTTCTATTT 17820

TTATCGTCAT GATGTTACTT ATGAATTTAA TCTTACCTGG TTCAGGTCTA TTGTTAGCAG 17880

GACTATTTAT TTGGAATATC ATGCAATGGA GTACTAATCC AGCAGTGCAA AGCGGTGTGA 17940

TTCAACATGT TGAAGGCGAC ACAAGCCAAG TAATGAGTTG GAACATGTCT AGTTTAAACG 18000

CTGGTATTGG TGTTGGAGGC ATTATTGGAG GCTTGGTCAT GACACATGTT TCTGTTCAAG 18060

CTATCACATA TACGAGTGCC ATCATTGGCG CATTAGGATT AATCGTTGTT TTCACATTGA 18120

AAAATAATCA TTATGCTAAA ACATTTAAAT CATCATAATT CTCATATGAm AAGCACGCCT 18180

GCTATCAAAT TCAGGTGTGC TTTTTTAGAT GCGATAACGT TATTGATATG TGCGATAATA 18240

GCGACGTTCA TTATGATACA TCGGCCAAGG CATTTTACCG CTTTTAGCAA AATTAGCTAA 18300

ATCATTTTGC ATTTGTCGAC TTAAAAATTT AAGGTGaGCA GTTGTTGGaT ATgAT 18355 
(2) INFORMATION FOR SEQ ID NO: 68: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1192 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 68:
CGCAAAGAAG TACAAAAAAT GTTTTTACAA GAAGGTATTA AAACACCTCA ACCAATTATG 60 

ACTGCTTATA ATCATAGTGA AAACGgTGTT TAGTAGTTTA TAATACATGG AGGTCATATT 120

TAATGGCGTC AAAATATGGA ATAAATGATA TAGTAGAAAT GAAAAAACAA CATGCGTGTG 180

GAACAAACCG TTTTAAGATT ATTAGAATGG GTGCAGACAT AAGAATTAAA TGTGAAAATT 240

GTCAAAGAAG TATTATGATT CCACGTCAAA CGTTTGATAA AAAACTTAAA AAAATCATCG 300

AATCTCATGA TGATACACAA AGATAGGAGA ATGATTAATG GCTTTAACAG CAGGTATCGT 360

TGGATTGCCA AACGTTGGTA AATCAACATT ATTTAATGCA ATAACAAAAG CAGGTGCTTT 420

AGCAGCGAAC TATCCATTCG CTACGATTGA TCCTAATGTA GGGATAGTAG AAGTGCCAGA 480

TGCTAGATTA CTTAAATTAG AAGAAATGGT TCAACCTAAA AAGACATTGC CGACTACATT 540

TGAATTTACA GATATCGCTG GTATTGTGAA AGGTGCTTCA AAGGGAGAAG GGTTAGGTAA 600

TAAATTCTTA TCACATATTA GAGAAGTAGA TGCGATTTGT CAGGTCGTTC GTGCATTTGA 660 
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TAATATGGAA TTAGTACTAG CGGACTTAGA ATCTGTTGAG AAACGTTTGC CTAGAATTGA 780 

AAAATTAGCA CGTCAAAAAG ATAAGACTGC TGAAATGGAA GTACGTATTT TAACAACTAT 840

TAAAGAAGCT TTAGAAAATG GTAAACCCGC TCGTAGTATT GACTTTAATG AAGAAGATCA 900

AAAATGGGTG AATCAAGCGC AATTACTGAC TTCTAAAAAA ATGCTTTATA TCGCTAATGT 960

TGGTGAAGAT GAAATTGGTG ATGATGATAA TGATAAAGTA AAAGCGATTC GTGAATATGC 1020

AGCGCAAGAA GACTCTGAAG TGATTGTTAT TAGTGCAAAA ATTGAAGAAG AAATTGCTAC 1080

ATTAGATGAT GAAGATAAAG AAATGTTCTT AGAAGaTTTA GGTATCGaAG AACCAGGATT 1140

AGATCgrTTA ATTAGGAmCA ctTATGAATT ATTAGGnTTA TCCACCATAA TT 1192

(2) INFORMATION FOR SEQ ID NO: 69: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7494 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 69:



AATATAGCTG CAATAGCATC TCGTTTCATT TGTATAATCA ATTCCGGTTT AAATATCAGT 60

GTGAACGTAA GCACGACACA GATTAAAAAT AACACTGCCG GAATGAGTCG TTTCAATCGT 120

CGCTtCCAAA ACTCTAGCAA ATCGATTTTT TGCGTCCGAT AATACTCACT TATCAACAAA 180

CTTGTTATTA AATAACCTGA AATAACGAAG AATGTATCTA CTCCTAAAAA GCCCCCACTT 240

AACCATTGTG CATTCAAGTG ATAAATAATG ATTCCTATAA CTGCGAATGC CCTCAATCCA 300

TCTAATCCAG GTAAGTATCG CGGGGAATAC ATTTTTTCTA AACGTTTAAA GTCTTTTGTA 360

TCCAtGTTAA TAAACGCCCC ATTTATTTTT CTCTATTTTG TAGTATATCA CAATATTTTT 420

GAAAATAAAA TATTGCACTG aTTTTCATTA ATTGATTTAA CCCTTAATTA AGATAGTTTT 480

AAATTTTTTA TTAAGTAGAA AACAATTATT ACAGTTGATT TCATTACTGC AAACCACATA 540

TAAATTTGTC GATTTTACTA CATAACATAG ATTATCATAG ATTCTTGAAT TTTTAGCAAA 600

ATAACTGTTA TTTTCATTAT ATTTTTACAA AAAAAGGTTC GTTTTATATT TTATGCATCT 660

TACTGTAACA GAATCATTAA GATATGCTAT TCGAATATAC TTTTTCAAAA TTTATATAAT 720

GAATAAATTA ACATGTATTG AAAAAAAAGC GAAATGCAGC CTATCCTCTA ATGTAAACCA 780

AACGATATAT CTCGTCAGAC TTTATATTTA AACGCTATGT GTCACTTTTA AAATGAATAT 840

TACTAAGATT GTCATATCAA TTATTATTGC ATCGAATTAA TCTTTTAAAT TTCTGTAATA 900
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ACGGAAGTCA TTATTAGAAT AAAAATACTG TGCACTAATA AATTTATCAA TTGTTCCTAA 1020 

ATAAATACCA TCGATATTTT GTTCTTTACA TGTCATTATA ACTTTATCTA AAAGTTTTTT 1080 

ACCTATTTTT AAATTCCTAT AACCTTTATC AACAAACATT TTTTTAAGTG CAGACATATT 1140

ATTATCTAGT CTAATCAAAC CTATAGTACC AACAATATTT TGaTGATTGT TTATTGCAAG 1200

CCAAAATgCC CTCCATTATT CAAATAGTTA TGTTCGATGT TCTCCAAATC AGGTTGATCA 1260

TCTCTATCAA TTTTTATATa AATTCATTTT TTTGAATCGA TAAAATAAAC TCGATTAGCT 1320

CTTCCTTATA AGACCTATTA TATTCAATTA TGTTTATAGC CATTTTTATC TCCTTTTTCA 1380

TTTAATTTAA TTATAAAATG TGCGTTTAGT TTGTATCTAG TGTACTCAGT ACAGCCTCAA 1440

ATGAAGTTTC ATTCCACTTG GCACTTAATA AAGACAAGTA TTTTAGCAGT AATACAATAA 1500

AGTCCAATAA ATTTCCCTAA CTTCAATATC CACTTTTTAA AAAATGTATT TTTAATTAAT 1560

AAAAAAACTC TCCCCAATTT CTATGGGAAG AGCTATATAT TTAATGTCTA AACATTACTT 1620

TTATTTATTA TGAAGGAATT AGAATCCCCA AGCACCTAAA CCTTGTGCTT TGTATGCTTT 1680

AACAGCTGCG TTGATTTGTT GGTCAACAGT GTTTGTTGGA CCCCAACCTG GCATAGTTTG 1740

GAATAAACCT GAAGCACCTG ATGGGTTGTA AGCATTTACT TGACCATTTG ATTCACGAGC 1800

GATGATTGCA GCCCATGTAG AAGCTGAAAC ACCAGTACGT TGAGCCATGA TTTGAGCTGC 1860

TGATGAACCA GTAGCACCTG CAGTATTACC ATTGCTTAAT CTCACTGAAC TTGAAGTAGT 1920

TGAAGTGCTG TAGTTATGGT AAGTTGGAGC TGAAACAGCT TCAACGTtTG AGTTACTTGA 1980

TTGTCCATTG TAGCTTACTG ATTGTACATT TGAACCTTGG TTGTATGAAG TAGTGTAGTC 2040

TGCACCTGCA ACGTTTGAGA AACCAGCAGT TTGACCATTA GCTGCTTCAT AGCTCCATGA 2100

CCATGTAGTA CCATTTGAAG TGAAGTTATA TTGGAAACCA TCTTTTACAA AGTGGATGTC 2160

ATATGCACCA TCTTTGATTG GAGCTGCATT TAATTGATCT TGGTGATTAT GCGCTAAGTC 2220

AACTAAGTGT GCTTGATCAA CGTTTACTTC AGCAGCGTGT GCTTGATGTC CTGTACCTGC 2280

TGCGTAACCT GTTACACCTA ATGCCACTGC TAATGATGAT GCCATAATTG TCTTTTTCAT 2340

AGTAAAAAAT CCTCCAGTAA TAATTGTnAG TTTATGTTTT TAGTAATTAT AtTTTGaATT 2400

TGAATGTCGT AGTgCAAGTT TAAATTGTCT TTTATTTCTT TCaACGGTAC TCACTATATC 2460

ACAaAAAACC AGCCAGTAAA TTACACTTTC TTTACAAAAC ATTACAATAT CAAGTGTTAT 2520

TTGtAATGTT GAAATATGGC TGTTTTATAC TGTAATGTGA AATATGTGCC CTTTAGAATC 2580

CAATCAACCC TTGAAATAGT CTTTAACACA TAAGATTTTT ACTATATTTA GCTCAACTAT 2640

TACAGCTTTC GTAATATTAC AGATTGTATT TTTGTTACAT AGCTGTAATA TATCTGACAT 2700 
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10 



TACACATGTA TTGATTGCTA TTATTGTTGT ATATTCAAAG TTTTAAAACA CACATCTTTT 2820

GTGAATTGTC TTATCTTTTA TTAGCGCAAA TAAACTGCAG CTCAATTATA TTGTTCAACT 2880

TCATTCTCGC AATTCACAAT AACATTAAAT AATTTTTGGT CTCATATTTT CAAAAAACAT 2940

ACTGTTATTA TCCCATGAAT TTAAAAATAT CATTAGTATA TAAACGAAAC ACTTTACGAT 3000

AAATGATATC TGCAAGCCAA GCTGTTACAA ATGGTACAAC AAAGAACGCT ACTACAATTA 3060

GTAAGACACT CAACCAAGCA GAATCAACCT CCATAAATTT AAATGCATTA ATCGGTCCTA 3120

CCATTCCTAT AAAACCAAAT CCAGCTGACT CTTTCGTTCC ATGAATACCT ACTAATGCTG 3180

ATACCAAACC TGATACAATG GCTGTCGTTA ATATTGGTAA CATAAGAATT GGATATTTCA 3240

CCATATTAGG TATCATCATT TTAACGCCTC CAAAGAAGAC GGATAACGGC ACCCCTAAAC 3300

GATTCACTTT ACTTGTACCA ATTATCAATA CTGCTTCAGT CGCGGAGATA CCAATTGACG 3360

CTGATCCAGC TGCTAAACCT GTAATACCTA TCGCAAAGGC AATGGCCACA GTTGATAGTG 3420

GCGAAATAAT AATAAGACTA AATACCATTG AAATCAAAAT ACTCATGACA ATCGGTTGTA 3480

ATTCTGTAAA ACCATTAACC ATATTACCGA TGGCTGTTGT AATCATCTTC GTATACGGCA 3540

ATATTAAAAC ACCAATTGCA CCTGAAATAC CGCCAACAAC TGTTGGGAAT ACAATCAATG 3600

CCATACTACC TACGCGATGT TGAATAAGTA AAATGAATAA CACTGCAATC GCTGCTGTAA 3660

TCATTGTATT AATTAAATCA CCAATACCCG TAATCATCCA AGCACCATTT TTAAACTGCG 3720

CTGCACCGCT TCCTACATAT GCTGCACTTG CCACAACAGC AATTGCTAAT GGCGATAGGT 3780

CAAATTTCAT GGCAACCAAT GCACCAATCA AAGCAGGTAC TGTAAATTGA ATTGCAACGA 3840

CAACGCCTAA TAACGTTTTA AAAATCGGAT GATAATCCAT AAAGTATTTA AAAATTTCTC 3900

CAAGTATCGC ATTAGGAACT AAACCCGCAA CAATACCTAT GGCGACACCT GATAAAACTC 3960

TAAATATAAA ATCTTTGGGT GTAATTGTTT TAATTGATGT CATAATATCA TCCTTCCATT 4020

TATGTATATA CATCTGTATG CAAATAATAA AGAGCCTTAA GTTATAAGCT GCCACTAGCT 4080

TAAATTCTAA GATGTGCATG CCGATGTTGT TATATTTAGG CTAGCAGTAT CATCTATAAC 4140

TCAAGACTAT GAAAAATAGT ATATCACAAA ATTCTGAATT TTTAGATAAA TAAATTGGCA 4200

ATTTTTCAAA CATATTGTTA CAATACACTT TTATTTTATC TTCATTTTTA AAATCCATTA 4260

ATACAATAGA AGAAAGACAT TCAAATGCTT ACCAAAAAGG TACATTATTT GTTAGGAGCG 4320

TATCAGCaCT TACATATCAT CAACACAATT GACAATATAA TAGAAGATAC TGATAATAAG 4380

TGTTAAAACA ACAGATGTTA GGTAGTGAAC AAATGATGGA AAGTAAATCC ATAGATCCAA 4440

GAATCGTTAG AACCAAACAA TTGCTTGTCG ATGCTTTTCT TAAAATTTCT AGAGAAAAGA 4500
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TTTACGCTCA TTTCGCTGAT AAAGAAGACC TCCTAGACTA CACATTATCT GTAACCATTT 4620 

TAAAAGACTT GAATGATAAT TTGAGCATTT CTAATGTCAT TAATGAAAAG GTTCTGCGTA 4680 

ATATTTTCAT TTCAATTGCG AGTTATATCA AAGATGCTGC AAAGTCTTGC GAATTAAATA 4740

GTGAAGCATT TTGCAACAAA GCACATCAAC GTATTAATAA TGAATTAGAA GATATTTTTG 4800 

CGATTATGTT AGAAAACAGC TATCCGGAGC ATCAACGAGA TATCATTGTA AATAGTGCGA 4860 

GTTTTTTAGC AGCTGGTATC TCAGGCTTAG CATTACATTG GTTTAACACG AGTCAAGAGA 4920 

CAGCCGATGT GTTTATCGAT CGCAACCTTC CATTTTTAAT TCATCATATA GCACATTTTT 4980 

AATAAAACTT GGTATTTAGT CATGCATCTT GAAATCACTA TGTGACTTAG GTTCATACTT 5040

GTACACACAA TAAAATTTAA CGTATTACGA TTGATTAGCC GTGTCTAGGA CATAAATCAA 5100 

CGTCCTATAC TCTACAATGT CATATTAGCA GTCGTTAACT GAATGAAAAT AAGCTTGTCA 5160 

TTAAAACATA TAGATTTTAG TGACAAGCAT TTTTGTTTTT GCGTACTTAA ACAACACTTC 5220

AGGCAATATG TTGTTTAGGC AACAAATGAT ATGTGCGTGT TTATTGGCAA ACGTACGACA 5280 

TAGTAGTATA GTATGTCTAA ACAACATATG TTGCATAGTT GATATGCGTT GTTTAAATAC 5340 

TAAGATAGGA GGGATTGACG TGAGCGAGAC AGATGAACCT CAGGGGTTTG AACGCACGCA 5400

TAATATATTA AATATTAATC AGAGTAGTCT GGGTGTAGTG ACATACATTA CAAATAAATT 5460 

AAAGTCGACG TTGAAGCAAC ACATAATAAT TGCTCGTGGT AAAAAGCGAA TCGACTATCG 5520 

ACTGTCGTAT AACTTTTACA TACGTATTAT GATAATGTAG AAATCAAGAA AATCGACTGT 5580

GAATATACCT ATGCTATGCC CATTGCAATT TTAATAAGAC ACACGATGTC ATTCGACAAT 5640

GCTCATTTCT TTGCTCAGTT ACGTCATCCT GTCTTATAAA ACAACATTGC AGACATGTAT 5700 

ATCAAACGAC ACTTCAATAA CATCACTTTG CCcATCGTAC TACTAGTAAA ATCGTGTCTC 5760 

AAATCCCTTA TTTTAATTCC AAAAAtCTGC TGGTCAAAAG ACCGAGAAAC TAAAAACATT 5820 

ACTTAATGTG TTGATAAATT ACCATATAAA AATAATCTCA AAATATATCA ACACTTGATT 5880

CTAAGGAGGA TATGACAATA TGAAAATTTT AGATAGAATT AATGAACTTG CAAATAAAGA 5940

AAAAGTACAA CCACTTACTG TAGCTGAAAA ACAAGAACAA CATGCATTGC GTCAAGAcTA 6000 

CTTAAGcATG ATCCGAGGAC AAGTATTAAC AACATTTTCC ACAATAAAAG TGGTTGATCC 6060

AATCGGTcAG GATGTCACAC CAGATAAAGT TTATGATCTT CGCCAACAAT ACGGTTATAT 6120 

TCaAAATTAA tATTTGCTCA CGAGGTATTG CACTTAAGGT GCCAACTGAC CTCATAAACA 6180 

AAGCCCATAC TGATTGAAGA CACTAATGTG tCsaCCATGG TGCACATTAC GCTTCATCTC 6240

TGTATGGGCT TTTTATTTAT TCTTTTGAGA ATTTCATTTT AGCAGACCAA AAAATTAAAA 6300 
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TGAACGACTG 


TGCCACCCGC TTCTTTCACT TTATTCACCA ACTGGTCAAC TTCTTCATTT 6420

GTGTTCACAC CTAGAGAAAT CATCACTTCA TTTGGTTCAG TATTAAGGCT TTGCTGACTT 6480

ACATTTTGAA AATGCTTGTn TTCTATTAAA ATTACGGkTG tTTGACCTAT tTGAATGCCG 6540

ACCATTTTAT CTAACATTTG TGGGTTTCTA TTTATTTTAA ATCCTAACGC TTTATAAAAC 6600

TGTGCGCTCT TTTCTAAATC TTGCACATGC AAATTAAACC ACATTGATTG AATCATGATT 6660

GCACCCCATT CATTACTTAT TATAGTTTTG GACTTTAAGC CAATCACTTA ATGATAATCT 6720

TGTTGGATTT ATTTCAGCCA TTAATTCAAA GTCTACTTCA TAACCTTTTT CTTCCAACCA 6780

TTGCTTTTCT GCAACACCAC TAACAAATTC TCCTTCTATA ACAGTAGATT TACCTGTCAC 6840

TTCACTAAAA ATTGTTGCTG CTTCACTTAA TGTAACTTCA TCGGAACCAA TCTCTATTGA 6900

TTGATGCGTA AAGCTTTGTG GATGTGCAAA AATATACGAT GCAATTTTAG CTATATCAAT 6960

AGAAGAAATC ATTGTGAATT TTATATTCGG ATTAATAAAT TCTGGTAATG TAATACGTTC 7020

ATCTTCGACT TTAGCAATGC GTAAAAAATT ATCCATAAAG AATGATGGTT TGATAACTGT 7080

TGCATTTATA TTAGATTCCA TTAATCTATT TTCTATTTTT GCTAGTACTT CAAAGTGTGG 7140

GCCAGTTCGA TTTCGATTAA CCCCTCCCGC AGTACTATAC ACAATATGTT GAATATTTTC 7200

TTGCTCAGCT ATTTCAATTA TCTTCATACC TTGTCTTAAT TCTTCGCTAA CATCATCTTT 7260

AACGATTGGC TGAATACTGT ATAAGCCATA CTTACCTTTC ATCGCTGATT GCAAACTAAC 7320

ATTATCACTC AGATCACCTT CArCGATTGA TAAATGCGGA TGTCCTATGT CTGAAAGTTT 7380

ACGATTATnC TTATTTCTAG TTAATGCACT TACATACCAT CCATCCTCTA ACAACTGTTT 7440

TACAACTGCA TTACCTTGCT TCCCTGTTGC GCCTATTACn AAAATATCTT TCAT 7494

(2) INFORMATION FOR SEQ ID NO: 70: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11802 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 70:

AATTTATTTC QCCGTCCCAC CCCAACTTGC ATTGTCTGTA GAAATTGGGA ATCCAATTTC 60 

TCTTTGTTGG GGCCCcGCCC CAACTCGCAT TGCCTGTAGA ATTTCTTTTC GAAATTCTCT 120

GTGTTGGGGC CCCTGACTAG AATTGAAAAA AGCTTATTAC AAGCGCATTT TCGTTCAGTC 180

AATTACTGCC AATATAACTT CGTAGATCAT AGAACATTGA TTTATTTCCC AGCCTATTCT 240 
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AGCAAAGGTA ATAATGATAT TAATAATGTA CAAAAAATAT AAATCAAATC GACATCCTTA 360 

TAAAACATCA GAACCACTAA AAACAAAAAA GCACAAAATA AAATTAAATT TAAAATAAAC 420 

GACCACTTTT CAAAAAAATC TCtTTTCaTa TTTCCACCCC TAATTTTAAT AAGCATTATT 480 

TTATATTCTC TTTTAAGTTT ATTATTCAAA AGGAAAACAG AAATATCTTT CaATATTATT 540 

ATAAACATTT CAACTACTTT TAAAAACCAA CAAAAAAATA CTTATTTTAA GTAGATGAGC 600 

ATAAGTGAAC ATAGTTCTTT AGTTATAATA ATTAATTCAA CCAAAAGTCG ATTTGTTTTT 660 

GCAATTGGTT TTCATTTCCT CTTAAAGATA TTTTCATTAA ATCTGTCAAA TCAATAGACG 720 

CTATATTTTT CAACTTATCT CTATATTTAT TTTTAGTACG TCTTTCTAAA TTTCCCCATT 780

CCTCTTCTTC GTGAGTTAAT AAATGAAGCA TTGCTCGTTC TTGTATATTT TCAATCATTT 840 

TTAAATTCGG TTTTAAAATA TGCAAATCAT CAAAACAATC TTTCCAACAA TCAACCATAT 900 

CTCGTTTTAA TTCAATTTCC ACACGCCATA GAAATGTTGA ATCAATTTCA ACATCTGCAT 960 

TATCTTTACG TTCTTGTTTT TATTATAAAT CCGAATAAAC CTATCACTAT TACGCACACC 1020 

AAAATATTTT GTTTCTGGTT TTACATTACG TCCATAAAAT ATAGTTTTCT TTACCGACTT 1080 

ATCTGACAAT GCATAATAGT CATTTAAATC AAATTCAAAA TCAAAAGCCA AATCTAATCT 1140 

CGTAAAACTA ACATCGTCCA AATAACTGAT GATATTTTGT TTTAACCAAA GCACTTCATC 1200 

ATGCGAAAGC TTATTAGGAT TAAATTCAAC GCGCATAtAC GTCTATTCCA AAGAGTTGCT 1260 

TTTATTTTGT CATATTCAAT ATAAACTTTT TCTTTAAGAG CTTTAGCTTT AAAGTTTGTT 1320 

TGTAAAATAT CCCAAAGCCG AATTTCAGGA TTAGTACTCA TAAAATGTGA AAGTCTCTCT 1380 

GCGTTAGACA TGCTAAGATT CCCAACAATC GTTATAGCGT CAAAAGACAA TTTTGGAATA 1440

GCTAGTGACA TCCTATGTCG ATTTAACCGG CTATTACCGG ATATTAGAGT ATCCAGTTTT 1500 

ACAAATGGAT GAAACGAAAT TCAAAACACT AAAAAATATG TTCCACTAAC AGCAAAAAAA 1560 

TACCATTATG TTCCTACTAA AAAACyAAAA ATACTGGAGA ACAAATGTCA GGATATAACT 1620 

TAGGATACTA TGTAATAAAA ATTTACAATA AAAAAACAGG AAAACAAATT TCAAGTAAAA 1680

GmATACCCAT ACAAAGAGGA TAAAATAAAA AACCTCGAAC TGaAATGATG ATCTTTTCAG 1740

CTCGAGGTTT AAATATTGGT GCCTTATTTA TATAGATTCG TTATATTATA TTCTCTATTT 1800 

TCATTAACmT AATCCTTAAA GAGTTTTAAA TTAATACCTG CTAGATGATT CAAAAATGTT 1860 

TCATCAACTT TTAAATAATT CAATAATTTT TGTGGTGTCA GTAAATnTCT ATCAAAATAC 1920

AACTTTAATA AACTATTCAT TTTGACAGGA CGTGACATTT CAATCACGTC GTCTAAAGAT 1980 

AATACTTTCT CGCTTTAnAC AAAnACAAAA ACTTACCCGA TTAAAATCAA GTAAGTTTTA 2040 
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10 



TATTTGATAA AAAATCAATA AGTAATTGTG CGCCTTCAAC TTGAATATCT TTTACAACTG 2160 

GCGCGTCGAT ATACATATCA TACTGACCAC CGCCTACTGC ACGATAATTA TTTACACAAA 2220 

TTGTATATGT CTGCTTTAAA TCAACTGCGT GACCTTGAAT CATCATATTG CTCACACGTT 2280 

GTCCCTTTGG TCTTCCAACA TGAATGGTAT AACTTACGCC ACCATATATA TCATAATTAA 2340 

AGTGTTGTGG TTTGGGTTCA AGGAAGTCTG CGCTCACACT AACTTCATCA TTTTTCACGT 2400 

CAAAATATTC TGCTGATCGT TCAATGGCTT CTTTAAGTTT GGCACCACTT ACAGCTAAAA 2460 

CTTTAAATGT ATTTGGAAAT GGGTAATTGT TAATAACATC TCGCATCGTC ACGACTTGCT 2520 

TGAAACCACT AGCAGAATCA AACAAAGCTG TAGAGGCAAC ATCTGCGTCA CTTTTTTCTA 2580

ATAAAGCGTA ATTCATAAAA TTTGTAAAAG GATGCGGTGC CACACGTGCC TCAAATGCAT 2640 

GATTAATCGT CATATCATAT GGCAATGTAG TAATTTCGTA ATCTAACCAG TCCTCTAACT 2700 

GCTTTCGTAA ATGTTGGTCA TCTTCATCAA TAGTAAATGT GGAATCATCT ATAACAGGAA 2760 

GTAATTCACA TGATTCAACG GATAGATTTT CATATTCATC AGTACTCAAG ACTACTCTGC 2820

CTACAGTTGT ACCTCTCGTA CCAGGTTGAA TCACAGCCGT TTGCTTAAAC CTTTCAGCAA 2880 

TTTGTCGATG TTGGTGACCC GTAATAAAGA TATCTATATC TTTAGAAAAC GCTTCTAACA 2940 

TGGCATATCC TTCATTTTCA CCCGTTAATA CTTCGGTCGG CGTACCACTT TCTAAATCCT 3000

TTTCAAATCC ACCATGGTAA CAAACCACAA TGATATCTGC ATGTCGCTTC ATTTCAGGTA 3060

AGTATTGTTG AAGTATTTCA AAAGCACTAT GAAACGTArT GnCnTGAATA TGCTCTGGTT 3120

GTTCCCAATG GGGAATAAAT TGTGTCGTTA AACCTATCAC ACCAACAGTT TGATCTCCAA 3180

CCTGAAAATA CTTCACACCG TTATCAGTCA ATGTACTATC ATTTTCATAT ATATTAGCGC 3240

ACAAAACTGG ATAATTGAGT CTGCGTAAAG TGTCTTTTAA GTATGGTAAT CCATAATTAA 3300 

ATTCATGATT ACCAAGCGTA CCAAAGTCGA ATGCCATTCG ATTATAAAAA TCAACTAAAG 3360 

GCTGGCTACT GCCGCTATGC GCGATTAAGT AATTACAAAA TGGTGACCCT TGCAAAAAAT 3420

CACCATTATC TATTTTAAAA CTTTGGTCAT ACTGCCTTCT GTsTTGTTCT ATAACATGAT 3480

TCGCTAGTAA CAATCCCATA GGTTGATATT GATTTCTACT CGTAAAATCT GTTGGGAAAA 3540

TATAACCATG TACGTCACTC ACGACATAAA ATGCTATGTT TGACATCCTC ACTCACTCCT 3600 

TCAATCACAA ACATCTTTCT TATTTCTATT ATATATTTAT TTGAAGTCTG TTGTAATCAA 3660 

GGTTTTGTCA CCGAGTTTTA AACGAATCTT TGAACCTTCC ATACTTTCAA GTACTTTAGC 3720

ATTGACCTTA ATTGTGACAT TTCCGTTTTC ATCTGCTTTA ACTGTTGGCA AAGTACTGTA 3780

ACCTGGTGGG TTATAATCGT TATCTTTACT TGAAAATTGT CCGATTTGAC GTCCGCCTTC 3840
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TATTGTCATT TCAAATGGCT CATTTACAGA 
AACATTTTGC GGGATATCAA ATGTTACTTT 3960

TTCGTTCTGA TTTGGTGGTG TATGATCATC TGGTGTGTTT GGCTGAGGAT CTGCGCCTTT 4020

TTCGCTGCCA TAACTACCTG CTTTAAATGT TGTTGGATCA TACCATTTAT AACCACTCGG 4080

CGGTTGTGAC CATGGCTCTT TTTCAGGCTC AGTTGAACGC TCTGGTCGTT CAAAATCAAG 4140

CAACTTAGTC TTTGTATCTA ATGTTAGGCT ACTCGCCTTA AGTGATTTCC CATCATTATC 4200

TTTAGACATC CAAGCCGTTA TATTATTTAA TAGCTTACCG TTGTCTTGTT CTTTAAAACC 4260

ATCATATGTT TTCTTCTTTT CTCCATTATC TTCTCTTACA TATTTGGGCG AACTATCTTC 4320

CACAAGTGAT GAATCACCGA TAAATGCTGC TTTACCTTTT CCAACTTTAG AAATTGCTAC 4380

ATAGGGGCCT TCTGCTTTAC CGCCCCCATT ATAAATACCT TGATCTACAG CATGTGACCA 4440

TTTACTTTTC GCTGGCAATT GTTCTGGTGT ATACACAATA CCTTTTGCTT TCTCTGGATT 4500

AGTAATTGCT AATGTCGATC CGGCATGCAT AGAGACAGAT TTCACACCTT CAGTAATACC 4560

GAAACTTTCT TTTGAAGAAA CAATATTGCT CGTATTTAAA TCACCTAGTG CATTATATCG 4620

AAAACGTACG CCAAAGTTTG TAGATAACCA ATCTGAACTT TTCACACCTT GCATTGCAGT 4680

AGAACTTTTT TCTTCTGCAT TCATACCTTT CGACATATCT TCATATGCTC CACGTCGATA 4740

ACCATTCATT GCCTCCGATG AATCAATACG ATTTAAATTT CGGTCAGCAT TGTAATGATC 4800

TGAAATAAAG ACAACATTGC CACCTTGTTt CACATATTTA ACAATTGCTG CCTGTTCTGA 4860

TTCTTTGAAA GGAATGTTAG CCTCAGGAAT TACAAATATT TTGGAACTTT TCAAACTTGC 4920

TTCTGTTATG TTCGAATGAC CATCAATAGC TTTAACGTCA TAACCTTGTT TTTGTATTGA 4980

ATCCGCATAA TCTGAAAATG CACCATCACT AACCCAATCT GCAGCACCAG CTGTTTGACC 5040

ATGAGAACGA TCGAATAATA CCGTTCGCTG TTGCTTTGTA GGTTGCGATT CATGCGTTAT 5100

AGCTAAAGAT TGCGGTAAAG CACTTAATGA TACCGTTGCA ACAATTGCAG AGACAGTTAA 5160

TGACTTATAT ATTTTTTTCA TTTTGTGAGG CTCCTTTTAA AATAAATTTG TTCTTGAATT 5220

ATAGGATAAA AATTCGTTGC ATATGAGCAA TTTAACGAAA AATTTACAAA ATCTTATCAA 5280

ACTCTTAAAG AAAGTTATTA AAATTCATTT TTATAAAATA CTTTTTAACA TTTAAATGTG 5340

GTACGCTATA AGTGTAATTT CATTGCATAC ATATTACACG ATTAAGAATG TGAAGGGGAC 5400

AGTTATCAAA TGAAAAATTT TAAGTGTTTA TTTGTATTAA TGTTAGCAGT CATTGTTTTT 5460

GCAGCAGCAT GTGGAAACTC AAGTTCTTTA GATAATCAAA AGAACGCTAG TAATGATTCG 5520

GATTCTAAAT CAGGAGGATA CAAACCTAAA GAATTAACCG TTCAATTTGT ACCTTCGCAA 5580

AATGCTGGAA CATTAGAAGC TAAAGCAAAA CCATTAGAAA AATTACTATC TAAAGAATTA 5640






EP0 786 519 A2 



TCTAAAAAAG TTGATGTTGG TTTCTTACCA CCAACGGCAT ACACATTAGC ACATGATCAA 5760 

AAAGCAGCTG ATTTATTATT ACAAGCACAA CGTTTCGGTG TAAAAGAAGA TGGTTCAGCA 5820 

AGTAAAGAAC TTGTAGATAG TTATAAATCA GAAATTCTTG TTAAAAAAGA CTCAAAAATT 5880 

AAAAGCTTGA AAGATTTAAA AGGTAAGAAA ATTGCCTTAC AAGATGTAAC ATCAACTGCT 5940

GGATATACAT TCCCACTTGC GATGTTAAAA AACGAAGCAG GTATTAATGC AACTAAAGAT 6000 

ATGAAAATTG TGAATGTTAA AGGTCATGAC CAAGCAGTTA TCTCATTATT AAATGGAGAt 6060 

GTAGATGCTG CGGCTGTATT TAACGATGCA CGTAATACTG TGAAAAAAGA CCAACCAAAT 6120 

GTATTTAAAG ACACACGAAT TTTAAAATTA ACACAAGCTA TTCCGAATGA CACAATTTCT 6180

GTAAGACCAG ATATGGATAA AGATTTTCAA GAAAAATTGA AAAAAGCTTT TATAGACATT 6240

GCTAAATCAA AAGAAGGTCA CAAAATTATT AGCGAAGTTT ATTCACATGA AGGATACACA 6300 

GAAACGAAAG ATTCAAATTT CGACATTGTA AGAGAGTACG AAAAATTAGT TAAAGATATG 6360 

AAATAATCAT TATTTAACAA ATGAATCATT AGCGAATTTG GTATTAAAAG CTTTCGTTCA 6420

ATAGATATAT TCTAGATTAA TATTGAAAAG CTAGGCGCTA AACTGAAACA GATATAGAAA 6480

GGTGTCGCTG TACATTTGAA ACCATTTGTA CACAGAAACC CAATGTCTAT GATATTTCAG 6540

TTTACCTTGG CTTTTCTTTA TTAAAGAAAG GTGTCAAACA TGAGTCAAAT CGAATTTAAA 6600

AACGTCAGTA AAGTCTATCC TAACGGTCAT GTAGGCTTGA AAAATATTAA CTTAAATATT 6660

GAAAAAGGTG AATTTGCAGT TATTGTCGGA CTATCTGGTG CTGGGAAATC CACGTTATTA 6720 

AGATCTGTAA ATCGTTTGCA TGATATCACG TCAGGTGAAA TTTTCATCCA AGGTAAATCA 6780 

ATCACTAAAG CCCATGGTAA AGCATTATTA GAAATGCGCC GAAATATAGG TATGATTTTC 6840

CAACATTTTA ATTTAGTTAA ACGGTCAAGT GTATTACGAA ATGTACTAAG TGGACGTGTA 6900 

GGTTATCACC CTACTTGGAA AATGGTATTA GGTTTATTCC CAAAAGAAGA CAAAATTAAG 6960 

GCAATGGATG CACTAGAACG CGTCAATATC TTAGATAAAT ATAATCAACG CTCTGATGAA 7020 

TTATCAGGTG GCCAACAACA ACGTATATCT ATTGCACGTG CGCTATGCCA AGAATCTGAA 7080 

ATTATTCTTG CAGATGAACC AGTTGCTTCA TTAGACCCAT TAACTACGAA ACAGGTTATG 7140 

GATGATTTAA GAAAAATCAA CCAAGAATTA GGCATCACAA TTTTAATTAA TTTACATTTT 7200 

GTTGACTTGG CAAAAGAATA TGGCACACGC ATCATTGGTT TACGTGATGG TGAAGTTGTC 7260 

TATGATGGTC CTGCATCTGA AGCAACAGAT GACGTATTTA GTGAAATATA TGGACGTACA 7320

ATTAAAGAAG ATGAAAAGCT AGGAGTGAAC TAACATGCCT TTAGAAATAC CTACAAAGTA 7380 

TGACTCCCTT TTAAAGAAAA AGGTTTCTTT AAAAACGAGT TTTACCTTCA TGTTAATCAT 7440 
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AATACCTCAA ATAGGTGATC TATTCAAACA AATGATTCCA CCTGATTTCG AGTATTTACA 7560 

ACAAATTACA ACGCCAATGT TAGATACCAT TCGAATGGcT ATCGTAAGTA CAGTATTAGG 7620 

TAGCATCGTT TCAATACCAA TTGCGTTATT ATGTGCTAGC AATATCGTTC ATCAAAAGTG 7680

GATTTCAATA CCCTCGCGCT TTATTTTAAA TATAGTTCGT ACTATTCCAG ATTTGTTATT 7740 

AGCAGCAATC TTTGTGGCTG TATTTGGAAT CGGTCAAATT CCAGGGATAT TAGCACTGTT 7800 

TATTTTAACT ATCTGTATTA TTGGAAAATT ATTATATGAA TCATTGGAAA CGATAGATCC 7860 

AGGTCCAATG GAAGCAATGA CGGCTGTTGG CGCTAATAAA ATAAAATGGA TTGTTTTCGG 7920 

TGTTGTACCA CAAGCCATAT CGTCATTTAT GTCATACGTA TTATATGCAT TTGAAGTAAA 7980 

TATACGTGCT TCAGCTGTGC TTGGATTAGT CGGCGCTGGC GGTATTGGAT TGTTTTATGA 8040

TCAAACACTT GGTTTATTTC AATATCCAAA AACAGCAACG ATTATTTTAT TTACTTTAGT 8100 

TATCGTCGTC GTCATTGATT ACATCAGTAC GAAAGTGAGG GCACATCTCG CATGACACAG 8160 

GAAATAGCAA AATATAATGT TCACACAAAA GCACACAAAC GAAAATTGAT TAAAAGATGG 8220 

CTTATTGCAA TTGTCGTCTT AGCTATTATC ATCTGGGCAT TTGCAGGTGT ACCAAGTTTA 8280

GAACTTAAAA GTAAATCATT AGAAATCTTA AAATCCATAT TCAGCGGATT ATTCCATCCT 8340

GATATCAGCT ATATCTATAT ACCAGATGGC GAAGACTTAT TACGTGGTTT ACTTGAAACC 8400 

TTTGCGATAG CCGTTGTAGG TACTTTCATC GCCGCAATTA TCTGTATTCC ATTAGCATTT 8460 

CTAGGTGCAA ATAATATGGT AAAGCTACGC CCAGTTTCAG GTGTTAGCAA ATTTATTTTA 8520 

AGTGTTATAC GTGTCTTCCC AGAAATTGTA ATGGCACTTA TATTTATCAA AGCTGTTGGC 8580 

CCAGGTTCAT TTTCAGGTGT ATTAGCTTTA GGTATCCATT CCGTAGtATG CTTGGGAAAC 8640 

TTTTAGCTGA AGATATTGAA GGTCTAGATT TCAGTGCTGT AGAATCATTA AAGGCCAGTG 8700 

GTGCBAATAA GATTAAAACA CTCGTATTTG CAGTCATACC ACAAATTATG CCTGCCTTTC 8760 

TATCACTCAT ACTTTATCGC TTTGAACTAA ACTTACGTTC AGCTTCTATA CTGGGGCTAA 8820 

TTGGGGCTGG TGGTATCGGG ACACCACTCA TATTTGCCAT TCAAACACGT TCTTGGGACC 8880

GTGTAGGTAT TATATTAATC GGTTTAGTAC TAATGGTCGC AATTGTCGAT TTAATTTCCG 8940

GTTCAATCCG AAAACGTATT GTTTAACATT AAATCAGGAT ACTCCTAAAT AAGAAGTCCT 9000

ACCGTCTTAC GTTTCTCTAT TATAATAAAA ACAGCAGTGA AGAAAACTAT TGTTATAGTT 9060 

AACTTCACTG CTGTTTTTAT AATATCTAAA TTTATTCTAT TTCAATTCCT TTAAATAACT 9120 

TTTACCGAAC TCTGGTAATG TTACGTTGAA ATTATCTGCT ATAGTTGCAC CGATAGAACT 9180 

GAATGTAGTA TCACTTTCTA GTGCATGACC ACCTTTAAAT TTCGGACTGT ACATAATTAC 9240
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TGTAATAATT ACTAAATCGT CTTCTTTTAA 
GTTGCTAAAC AGTTCTGGCA AGCGATCATC 9360

GAAATCTTTA ATTGCTTGTG CATAACCTGG TTTATCACGA CGATGACCGT ATAATGCATC 9420

AAAGTCTACT AAGTTTAAGA AGCTAATACC TGTGaAATCT TTCTTAACAA TTTTCATCAA 9480

TTGATCGATA CCGTCCATGT TACTCTTCGT ACGAACCGCT TCTGTTACAC CTTCACCATC 9540

ATAAATGTCA TTAATTTTAC CGATGGCAAT AACATCATAA CCACCGTCTT TCAAATGATC 9600

TAAGACAGTT TTACCAAAAG GTTTTAACGC ATAGTCATGT CGATTAGATG TACGTGTAAA 9660

GTTTCCTGGT TCACCAACAT ATGGACGTGC GATAATACGA CCAATTAAAT ATTTAGGGTC 9720

TTTTGTCAAC TCACGAACCT TTTCACAAAT ATCATATAAC TCTTCTAATG GGATAATGTC 9780

TTCATGTGCA GCAATTTGCA ATACTGGGTC TGCACTTGTA TAAACAATTA AGTCACCAGT 9840

TTTCATTTGG TGCTCGCCCC ACTCATCGAT AATTTGCGTA CCCGATGCCG GTTTGTTAGC 9900

AACAACTTTA CGACCTGTCA TTTCTTCAAT TTGTTGAATT AACTCTTCAG GGAATCCATT 9960

AGGGTATACT TTAAAAGGTT GCATAATATT TAATCCCATA ATTTCCCAGT GACCAGTCAT 10020

TGTATCTTTA CCAACTGAAG CTTCACTCAA TTTAGTATAG TATGCTTCTG GTTGTTCAAC 10080

TGCATTTACT ACTGGTAATT TATCGATGTT CCCTAGACCT AACTTTTCAA GGTTTGGTAA 10140

AGTTTGATCG AAACCTTCTA AGGTATGTCT TAAAGTATGT GAACCTTCAT CTTTAAAATC 10200

AGCTGCGTCT GGCGCTTCAC CAATACCTAC TGAATCCATT ACGATTAAAT GTACACGATT 10260

AAATGGTCTT GTCATAGCTA TCACTCCCAA AATTTATATA TATTAGTAAT CTGAATCTGC 10320

TTCTAAACCT TGCATAATTT GAACACCTGC GCTCGCACCA ATACGTGTCG CACCTGCTTC 10380

AACCATTTTA TTGAAATCTT CTAAATTACG TACGCCACCT GATGCTTTTA CTTCTACATC 10440

AGCACCTACT GTATCTTTCA TTAATTTAAC GTCTTCTGCA GTCGCACCGC CACCTGCAAA 10500

ACCTGTTGAA GTTTTAACGA AGTCCGCACC AGCCGCTTTT GTTAATTCAC TCGCTTTTAC 10560

AATTTCGTCA TGGTCCAACA ATACCGTCTC AATAATCACT TTTACTGTGT GACCTTTCGC 10620

AGCTTTAACC ACTGCTTCAA TGTCTTGTTG TACATCATCA AAACGTCCAT CTTTTAATGC 10680

GCCGATGTTG ATGACCATGT CAATTTCATC TGCACCATTT TGAATTGCAT CTTCTGTTTC 10740

AAATGCTTTC GTTGCAGTTG TCGACGCACC TAATGGGAAT CCTATTACCG TACAAACGAG 10800

CACCTCTGAA TCAGCTAGTC GCTCTGCTGC ATATTTAACA TGTGTTGGAT TCACACATAC 10860

AGATTTAAAA TTGTATGctT TCGCTTCATC GATGATTTGA TCGATTTGCG TACGTGTTGA 10920

CTCAGGCTTC AATAAAGTGT GATCTATATA TTTCTCAAAT TTCATACTTA CTACTCCTCG 10980

TGTTATATAA TCTCTTTATT TAATTTTACT ATAAATACGA ATATATCTCG CGAATTTATA 11040
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ATACTCATTA AACCTAAAAT AATTAAAATA ATACCGAAAT GTGAACTTAA TGCATCATTG 11160 
CCTGGGAAAT TTAATGCTTT AAAATCGATT AGAGCCGCAG CAATCGCAAT ACCTACAGAT 11220

ACCGCCACAT TAATAATTAA ATTATAAAAA CCAATAGCCA CACCTGTCAT ATTAAGATCT 11280 

ATTGTTTTAA 7GGCTTCGTT AAGTAAAGGT GCATACATTA AAGCAAAGCT ACCTGCAAAG 11340

AATATCATAG AAATGACGAA GATTGAAATG TGATTACCTA CTGCAAATGC AGGTAAAATC 11400 

AAGCTCAGTG CTATTAAAAT AATTGCTGTG ATAATCGCTT GTTTTGAATT CAGATATTCG 11460 

CCGATTTTAC CACTTAGTGC ACCAACAATG ACTGCTACTA TATAACCCGG TACTAATAAC 11520 

AGTGATGTTG TGTCTAGTTG CAGATGATAA ATTTGCTCCA TTATGAATGG GAACGTAAAA 11580 

ATATAACCCA ATTGGATAGC ATACATTACA AATACTATAA ATAAAAATGA AGCATAACGT 11640 

TTATTTTGGA AAAATGATTT ATTTACTAAT GGACGTTGCG CATTTTTAAT ATATAGCGCA 11700 

AAAACGATAA TCGCAATTAA GGCACCAATC ATATATAACC AATTAAAGTT CGTAATAAAC 11760 

AGCATGACTG TTGTAGCAGG GGATCCTCTA GAGTCGAnCC TG 11802 
(2) INFORMATION FOR SEQ ID NO: 71: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1196 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 71:

CTAAAGAAGA TGCGAAACAA GATGTTGATA AACAAGTTCA AGCTTTAATT GACGAAATCG 60 

ATCAAAATCC AAATCTAACA GATAAGGAAA AACAAGCACT TAAAGATCGT ATTAATCAAA 120 

TACTTCAACA AGGTCATAAC GACATTAACA ATGCGATGAC AAAAGAAGCA ATTGAACAAG 180

CAAAAGAACG TTTAGCGCAA gCATTGCAAG ACATCAAAGA TTTAGTGAAA GCTAAAGAAG 240

ATGCGAAAAA TGATATTGAT AAACGTGTAC AAGCTTTAAT TGACGAAATC GATCAAAATC 300

CAAATCTAAC AGATAAGGAA AAACAAGCAC TTAAAGATCG AATTAATCAA ATACTTCAAC 360 

AAGGTCATAA CGACATTAAC AATGCGCTGA CTAAAGAAGA AATTGAGCAG GCAAAAGCAC 420 

AACTTGCACA AGCATTGCAA GACATCAAAG ATTTAGTGAA AGCTAAAGAA GATGCGAAAA 480 

ATGCAATAAA AGCCTTAGCT AATGCGAAgc GTGATCAAAT CAATTCAAAT CCAGATTTAA 540

CACCTGAGCA AAAAGCAAAA GCGCTCAAAG AAATTGACGA AGCTGAAAAA CGAGCACTAC 600 

AAAACGTTGA GAATGCTCAA ACTATAGATC AATTAAATCG AGGATTAAAC TTAGGTTTAG 660 
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TTGAAGCAAC ACCTGAGCAA ATCCTAGTTA ATGGTGAACT 


CATTGTACAT CGTGATGACA 780

TCATTACAGA ACAAGATATT CTTGCACACA TAAACTTAAT TGATCAGCTT TCAGCAGAAG 840

TCATCGATAC ACCATCAACT GCAACGATTT CTGATAGCTT AACAGCAAAA GTTGAAGTTA 900

CATTGCTTGA TGGATGAAAA GTGATTGTTA ATGTTCCTGT AAAAGTTGTA GAAAAAGAAT 960

TGTCAGTAGT CAAACAACAG GCAATTGAaT CAATCGAAAA TGCGGCACAA CAAAAGATTA 1020

ATGAAATCAA TAATAGTGTG ACATTAACAC TGGAACAAAA AGAAGCTGCA ATTGCGnAAG 1080

TTAATAAGCT TAAACAACAA GCAATTGGAT CATGTTnAAC AATGGCACCT GGATGTTCCA 1140

TTCAGTTGAA GGAAATTTCA ACAACAAGGA ACAAGCGCCn GATTGGAACA ATTTGA 1196



(2) INFORMATION FOR SEQ ID NO: 72:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1519 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 72:

CAATCGTTTC AACGCTATTA TCTTTAGACA ACAATTGTAA GCGTGTATGT GCAGTTTCTA 60

AACAGTCTAT AATTCGAGTT CTTAATTCAG CTGGATCATC TTTAAAAATA AAATCCATCG 120

CTGCAACTTT GTAGACAAAT GTTAAATAGG TAAGTTCACT GTGACTCGTA ACGAAAATAA 180

TGTTACCAAC TGGGTCATGC TTACGAATTT CACTGCCTAA TTTGATACCA TTAATATCAG 240

TTGAAAGTTG AATATCTAAA AAGTAACAGC CTATGTCATT CATATTTTTA GCTTGCTCAA 300

GCACCTCATA AGGATTATCA GTTGCGAGGG CAATTTCCAT AGGCTTTTCT TCTATCATTA 360

TATAATTTTT AATAATGGTA ACCATGTTTT CTCTTTGTTT TGGATCGTCT TCGCAAATGA 420

AAATTTTCAT ACATTCACAT CCTTATGGCT AGTTGTTAAT AATTTCAACT TTTTGAATAA 480

AGAAACCATT TTCGATAATT GTATCTAATA AGACATTGTC TGCATTATCA GCAATTTCTT 540

TTAAAGTTGA TAGACCTAAA CCACGACCTT CACCTTTAGT AGAAAAACTT TCTTGGAACA 600

ATTCATGAAT GCGTGGTATA TCATCAGCGC ATTTATTCAT AACAATAAAC GTTACTGAAT 660

TTTCACTTTC AATAAATGCA ACGCGAATGA TAGGGTCATC AATTTCAGTT GATGCCTCAA 720

TTGCATTATC AAGAATAATA CCAATACTGC GACTTAAATC GATCATATTC AAGTTAATGC 780

TACTTACTTC ATCGGGTATT TCGATACTAA TCGGAATATT CATTTCTTGT GCACGTAAAA 840

TTTTCGCAGT AATTAAGCCT TTAATTTCAC GTACTTTAAG ATTCTCGATA CCATTTAATT 900
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GTAGGCCAGG CATGTCATCT TCTCGAATGT ATTCTGAAAG TGTCGTTAAG ATATTGACAT 1020 

AATCATGACG GAACTTGCGC ATTTCGTTGT TGATAGCTTC AATCTTCAAT GTATATTCAT 1080 

AATAGGTTTC AATTTCTTCT TGATTACGTT TATATTTCAT CTCTTTAAGG AGAAATTGAG 1140 

AAATAACAAA TGTTAATATA CTTAAAAATA TAGTGATACC AATAAAAATA AAAGAATACT 1200 

GCCTTATTAC TTTAGCTTCA TCCGAGTTTA TTTGTGAATA AAAGAAAAAT AATGAAAAAG 1260 

TAAGCAGTAA GATAGTCGAA ATAACTATTA AAAATCCTTT GTTTAGTATT AGATATGGTG 1320 

TGCTAATTTT TTTGAGAACT CTATTTATTA TATATGAGAA TAGTATACTA ATAGTCACAT 1380 

AAACTACAAA AAAGCTAGGG AATATTACAA ATATACTATC AGAAATTTTG GTGGATATAT 1440

GCATATATAA CTATATACCT GTAGTTAGCA CnGTnATAGG AATAATCnGG CGAGGTCCAT 1500 

AATCCACCAA AATAGAATA 1519 
(2) INFORMATION FOR SEQ ID NO: 73:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 73:

GTAGGAATCT CTTTGTCTTT TTGGGAGGAC ATTTAATATG AATGTATATT TAGCAGAATT 60 

CCTAGGAACT GCAATCTTAA TCCTTTTTGG TGGTGGCGTT TGTGCCAATG TCAATTTAAA 120 

GAGAAGTGCT GCGAATGGTG CTGATTGGAT TGTCATCACA GCTGGATGGG GATTAGCGGT 180 

TACAATGGGT GTGTTTGCTG TCGGTCAATT CTCAGGTGCA CATTTAAACC CAGCGGTGTC 240 

TTTAGCTCTT GCATTAGACG GAAGTTTTGA TTGGTCATTA GTTCCTGGTT ATATTGTTGC 300 

TCAAATGTTA GGTGCAATTG TCGGAGCAAC AATTGTATGG TTAATGTACT TGCCACATTG 360 

GAAAGCGACA GAAGAAGCTG GCGCGAAATT AGGTGTTTTC TCTACAGCAC CGGCTATTAA 420 

GAATTACTTT GCCAACTTTT TAAGTGAGAT TATCGGAACA ATGGCATTAA CTTTAGGTAT 480 

TTTATTTATC GGTGTAAACA AAATTGCCGA TGGTTTAAAT CCTTTAATTG TCGGAGCATT 540

AATTGTTGCA ATCGGATTAA GTTTAGGCGG TGCTACTGGT TATGCAATCA ACCCAGCACG 600 

TGATTTAGGT CCGAGAATTG CACATGCGAT TTTACCAATA GCTGGTAAAG GTGGTTCAAA 660 

TTGGTCATAT GCAATCGTTC CTATCTTAGG ACCAATTGCC GGTGGTTTAT TAGGTGCAGT 720

GGTATACGCT GTATTTTATA AACATACATT TAATATTGGT TGTGCAATTG CrATTGTTGT 780 
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CGAATCAATT TACTAAAATA AAAAGAAACG TAAATAGCAT AATTTAACAT GTTTGATTCA 900 

TGGATTATGC TATTTTTTCG CCAAAATTTA ACAGATTTTG TACAATGGGT TAGCGATTAT 960

TTTTTAATAA AGGAGATACT ACTAATGGAA AAATATATTT TATCTATAGA CCAAGGAACA 1020 

ACAAGCTCAA GAGCGATTTT ATTCAATCAA AAAGGGGAAA TTGCAGGGGT AGCACAACGT 1080 

GAGTTTAAGC AATATTTTCC ACAATCAGGT TGGGTTGAAC ATGATGCAAA TGAAATTTGG 1140 

ACATCTGTGT TAGCTGTAAT GACGGAAGTA ATTAATGAAA ATGATGTTAG AGCTGATCAA 1200 

ATTGCAGGTA TCGGTATTAC AAACCAACGT GAAACAACGG TTGTTTGGGA CAAaCATACT 1260 

GGCCGCCCAA TTTATCACGC AATTGTTTGG CAATCACGTC AAACACAATC AATTTGTTCA 1320

GAATTAAAAC AACAAGGATA TGAACAAACA TTTAGAGATA AGACAGGATT ACTTTTAGAT 1380 

CCGTATTTTG CAGGTACAAA AGTTAAATGG ATTCTAGACA ATGTTGAAGG TGCACGAGAA 1440 

AAAGCAGAAA ATGGCGATCT ATTATTTGGA ACGATTGATA CTTGGTTAGT ATGGAAATTA 1500

TCaGGaAAAg CtGCGCATAT TACTGATTAT TCaAATGCGA GTCGTACATT AATGTTTAAT 1560 

ATCCATGATT TAGAATGGGA CGATGAGTTA TTAGAACTAt TACAGTACCT AAAAATATGT 1620 

TGCCAGAAGT TAAAGCTTCG AGTGAAGTAT ATGGTAAGAC AATTGATTAC CACTTCTATG 1680

GTCAAGAAGT ACCAATCGCT GGAGTAGCTG GTGATCAACA AGCAGCATTA TTTGGACAAG 1740 

CTTGCTTCGA ACGTGGTGAC GTGAAAAACA CATATGGAAC TGGTGGCTTC ATGTTAATGA 1800 

ATACAGGTGA CAAAGCGGTT AAATCTGAAA GTGGTTTATT AACAACAATT GCTTATGGTA 1860 

TTGATGGAAA AGTAAATTAT GCGCTTGAAG GTTCCATCTT TGTTTCGGGT TCAGCAATCC 1920 

AATGGTTACG TGATGGATTA AGAATGATTA ATTCAGCACC ACAATCAGAA AGTTATGCGA 1980 

CACGAGTTGA CTCTACTGAG GGTGTTTATG TTGTTCCAGC TTTTGTAGGT TTAGGAACAC 2040 

CATATTGGGA TTCTGAAGCA CGTGGTGCGA TTTTCGGTTT AACACGTGGA ACTGAAAAAG 2100 

AGCACTTTAT CCGTGCAACT TTAGAATCAC TATGTTACCA AACTCGTGAC GTTATGGAAG 2160 

CAATGTCAAA AGACTCTGGT ATTGATGTCC AAAGTTTACG TGTCGATGGT GGTGCAGTTA 2220 

AAAATAACTT TATTATGCAG TTCCAAGCAG ACATTGTTAA TACTTCTGTT GAAAGACCTG 2280

AAATTCAAGA AACTACAGCT TTAGGTGCTG CATTTTTGGC AGGTTTAGCA GTTGGATTCT 2340

GGGAGAGTAA AGATGATATC GCTAAAAACT GGAAATTAGA AGAAAAATTC GATCCGAAAA 2400

TGGATGAAGG CGAAAGAGAA AAATTATATA GAGGTTGGAA AAAAGCTGTT GAAGCAACAC 2460

AAGTTTTTAA AACAGAATAA ACTTGTAGAT TAGACTTTTG TATAAACATT GTGATACAAT 2520 

CAATTTAAGT TAATATTTGA ATCGAGAAGC GAGAGATTTG TTCGAACATG TACAATTGAA 2580 
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GCATTGTCTA CTTTTAAGAG AGAACATATT AAAAAGAATT TAAGAAATGA TGAATATGAT 2700

TTAGTAATTA TTGGTGGCGG TATTACAGGT GCAGGTATTG CACTAGACGC GAGTGAAAGA 2760

GGAATGAAAG TTGCATTAGT TGAAATGCAA GACTTTGCAC AAGGAACAAG CTCAAGATCT 2820

ACAAAATTAG TCCATGGTGG TTTGCGTTAC TTAAAACAAT TCCAAATTGG AGTAGTTGCC 2880

GAAACTGGTA AAGAACGTGC GATTGTTTAT GAAAATGGGC CTCATGTTAC GACTCCAGAG 2940

TGGATGCTTT TACCAATGCA TAAAGGTGGA ACATTTGGTA AATTCTCAAC ATCAATTGGT 3000

TTAGGAATGT ATGATCGTTT AGCAGGTGTT AAGAAGTCTG AACGTAAAAA AATGTTATCT 3060

AAAAAAGAAA CTTTAGCTAA AGAACCATTA GTTAAAAAAG AAGGTCTAAA AGGCGGCGGT 3120

TACTATGTTG AATATCGTAC TGACGATGCG CGTTTAACTA TTGAAGTTAT GAAGCGTGCT 3180

GCTGAAAAAG GCGCAGAAAT TATCAACTAT ACTAAATCTG AACACTTCAC TTATGATAAA 3240

AATCAACAAG TAAATGGTGT TAAAGTTATA GATAAATTAA CTAATGAAAA TTATACAATT 3300

AAGGCTAAAA AAGTGGTTAA TGCAGCAGGT CCATGGGTTG ATGATGTTAG AAGTGGTGAT 3360

TATGCACGCA ATAATAAAAA ATTACGTTTA ACTAAAGGTG TACATGTTGT TATTGATCAA 3420

TCAAAATTCC CATTAGGTCA AGCAGTATAC TTTGATACTG AAAAAGATGG AAGAATGATT 3480

TTTGCAATTC CACGTGAAGG AAAAGCGTAT GTAGGTACTA CAGATACATT CTATGACAAT 3540

ATCAAATCTT CACCATTAAC TACACAAGAA GACAGAGACT ATTTAATCGA TGCGATTAAT 3600

TACATGTTCC CTAGTGTTAA TGTTACAGAT GAAGATATTG AATCAACATG GGCAGGAATT 3660

AGACCATTAA TTTACGAAGA AGGCAAAGAC CCTTCTGAAA TCTCTCGTAA GGATGAAATT 3720

TGGGAAGGTA AATCAGGTTT ATTAACTATT GCAGGTGGTA AATTAACAGG CTATCGTCAC 3780

ATGGCTCAAG ACATTGTTGA TTTAGTATCT AAACGCTTGA AAAAAGACTA CGGTTTAACA 3840

TTTAGTCCAT GTAATACAAA AGGTCTGGCA ATTTCAGGTG GCGATGTAGG TGGTAGCAAG 3900

AACTTTGATG CGTTTGTAGA GCAAAAAGTA GATGTAGCTA AAGGATTCGG CATTGATGAA 3960

GATGTTGCAA GACGTTTAGC ATCTAAATAT GGTTCAAATG TTGATGAATT GTTCAACATT 4020

GCGCAAACAT CTCAATACCA TGATAGCAAG TTACCATTAG AAATTTATGT AGAACTTGTT 4080

TATAGTATTC AACAAGAAAT GGTATACAAA CCTAACGATT TCTTAGTTCG TCGTTCTGGT 4140

AAAATGTATT TCAATATTAA AGATGTATTA GATTATAAAG ATGCTGTCAT CGATATTATG 4200

GCAGATATGC TTGATTACTC TCCAGCTCAA ATTGAAGCAT ATACTGAAGA AGTTGAGCAA 4260

GCAATTAAAG AAGCGCAACA TGGaAATAAT CAACCAGCAG TTAAAGAATA AtTAATTTGT 4320

ACAATCATAA ACTGGTGTCC TGTTTTAAGG GCATCAGTTT TTTTATACGA GATACATTAG 4380
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GTTATTAAAG GTGTGAGATG ATGACTGAAA AACAATTTAA ATTAACTGTA CAAGATAATA 4500

CGAATATTGA AGTTAAAGTG AATTTTACAG ATGTAGATTC AAAAGGAATT ATTCATATAT 4560

TTCATGGTAT GGCTGAACAT ATGGAACGTT ACGATAAATT AGCACATGCA CTTTCAAAGC 4620

ATGGCTTCGA TGTGATACGT CATAATCATC GAGGACATGG TATTAATATT GATGAATCAA 4680

CAAGAGGGCA TTACGATGAT ATGAAACGAG TTATCGGTGA TGCCTTTGAA GTAGCGCAAA 4740

CAGTGAGAGG CAATGTTGAT AAACCATACA TTATAATCGG ACATTCAATG GGATCCGTTA 4800

TAGCTAGATT GTTTGTAGAA ACATATCCGC AATATGTTGA TGGTCTAATT TTAAGTGGTA 4860

CTGGTATGTA TTCATTATGG AAAGGTTTAC CAACCGTTAA AGTGTTACAA CTGATTACAA 4920

AAATTTATGG TGCTGAGAAA CGAGTTGAAT GGGTTAACCA GTTAGTATCA AATAGTTTTA 4980

ATAAAAnnAT ACGTCCATTA CGTACACAAA GTGATTGGAT TTCTAGTAAT CCAATTGAAG 5040

TAGATAaCTT TATTAAAGAT CCATATAGTG GaTTTAATGT GTCAAATCAA TTATTATATC 5100

AAACAGCCTA TTATATGCTA CATACATCAC AATTAAAAAA TATGAAAATG TTAAaTCATG 5160

CCATGCCTAT ATTATTAGTT TCAGGATATG ACGATCCTTT AGGTGATTAT GGTAAAGGGA 5220

TTTTAAAATT GGCGAATATA TATAGAAACG CTGGCATnAA AAATGTTAAA GTGAATCTTT 5280

ATCATCATAA ACGTCATGAA GTGTTATTTG AAAAnGATCA TGAQlAAATT TGGGAAGACT 5340

TGTTTAAATG GTTGAATCAA TTTTATAAAA AATAAAGAAA GTGGAATTAA ATATGAATAA 5400

AAATAAGCCT TTTATTGTAG TAATTGTGGG GCCAACTGCT TGCAG 5445



(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2569 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 74: 
TGGCTTGAAC TACGCCAATA AGTCCCCCTA GTACAAGAAT GAATACCATG ATATCGACCG 60

CTTCTATCGT ACCTTCAACC ATGCTACTTG TTATTTGTTC TGGTCCAGCT GGATGTTGCT 120

TTAATCTTTC ATAAGTATTC GGAATTGATA CCGGCTTATT AATTGCACCT GATTTAAATT 180

GTTCAATCTT AATTTTAACC CCCATTTTGT CTAGTTCCTG TTGCGTACCC GGAACCTTTT 240

TCACTTGGTT ATGAGGGTTA ACTATCTTTA GTTCTTGGGA TGAAGGTTCG TAAGAAAGTT 300

TAGAATATGC ACCAGCAGGA ATAACCCATG TTGCTATAAC TGCAACAACC GTTAAAATGA 360 
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TAATTGTATT TTCCACGGTT TCATCTCCTT CGACATTTAA CCTAGCATTT CTACCTTAAA 480

GATTTTATAA ATATAAATTA AGAAAGTGCA CCCCGCATCA AAATAGAGGC ATTATTTTCA 540

GGGGGTGCAC ATAAATAATA AAAATCATGC ATTTGACATA TAGTAATTGA AAAGCGTTTC 600 

AATTCAATTA CTTTTTAATC ACAGTACCTA CTTTACCCTC TAAGGCAGCA TCTAATTCAT 660 

TTAATGATGT TATAAGCACA CTTCCTTTTG GATTGTTTTC AATAAATGAT ATGGCTGCTT 720 

CAATTTTTGG TAACATACTT CCTTTTGCAA ATTGATTTTC GTCTATATAT CGTTTTAATT 780

CATCAACATT TGTTGTTTTC AAAGGCTGTT GGTTTTCAGT GTTAAAATTA ATATATACAT 840

AATCAATTGC TGTTAAAATA ATCAATTGAT CGCATTGAAT ATTAGCACCC AACAACGCAC 900

TTGTTTTATG TTTGTCTATA ACTGCATCAA TACCTTTAAA ACCATCATGT TGCTCTCTAA 960 

TTACTGGTAT ACCTCCACCA CCAGCAGCAA TAACGAGTGT ATCATTTTTA ATAAGTGTTT 1020 

TAATACTCTC TAATTCAATA ATAGAGATGG GTTGTGGTGA AGGAACAACG CGTCTATATC 1080

CTCTTCCAGC ATCTTCAACA AATATAAATC CTTTTTCTTT TTGAATTTGT TCAGCTTCTT 1140 

CTTTGTTGTA AAATAACCCA ATTGGTTTTG AAGGATTGTT AAATGCCGGA TCATTTTCAT 1200 

CAACTTCAAC TTGTGTCACT AGTGTTACCA CTTGTTTATC CATTCCAATA GAATGCAATT 1260 

CATTTTGTAA GCTTTCTTGT AATTGATAGC CGATGTAAGC TTGACTCATT GCGCCACATT 1320 

CAGCAAATGG AAATGCCGGA CCTTGGTTAT GTTCTGCAGC ATAGTTAAGT CCCAAATTAA 1380 

TGCTTCCAAC CTGTGGTCCA TTACCATGAC TAATAACAAT CTCATGTCCT TTTGTnATTA 1440 

AyCCTACTAA TGATTtCGCA GTATTTTTAA CAAGCTCGAG TtGgTyCTTG aGGTGATTTh 1500 

CCTAAAGCAT TACCACCTAA TGCTACTACT ATTTTCGCCA TCATATTCAC TTCCTTATAT 1560 

CATTTAAAAT TCACCCAATG TAGCAACCAT GaCTGCTTTG ATTGTATGCA TTCTGTTCTC 1620 

AGCTTCTTGG AATACAACTG AAGCTTTACT TTCGAATACT TCATCTGTAA CTTCCATTTC 1680 

TCGAATACCA TATTTTTCAA AAATTTGTTG ACCTATTTTC GTATCAGCAT TATGGAAAGA 1740

TGGTAAGCAA TGCTCAAAAA TAACATTTGG ATTACCAGTT TTATCCATTA TTTCTTTATT 1800 

TACTTGATAT GGTTTCAATA ATTCAAGTCG TTCTTTCCAT ACTTCATCAG GTTCACCCAT 1860 

TGATACCCAA ACATCAGTGT AAATTACATC CGAACCTTTT ACaCCTTGGT CaATATCATC 1920 

TGTGATTAAT ATGTTGCCaC CATTTTCaGC GGCAATATTT TTACAGCGAT TTAATAATTC 1980 

ATCTGTTGGA TTTAATTCTT TTGGACAAAC TAAATGGAAG TTCATACCCA TAATGGCAGC 2040 

ACCTTGCATT AATGCATTTG CAACGTTATT ACGACCATCT CCAACATATG TAAAGTTAAT 2100 

ATCTGCATAA TCTTTTTTTA AGACTTCTTT TGCTGTTAAG AAATCAGCAA GAACTTGAGT 2160
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TTCTACTGTT CTTTGTGAAA AACCACGGTA TTCAATGCCA TCATACATTC CACCAAGCAC 2280 

ACGTGCAGTA TCTTTAGTTG TTTCTTTTTT ACCCATTTGT GATCCAGTTG GGCCTAAATA 2340 

AGTTACATTT GCACCTTGAT CATGCGCTGC AACTTCAAAT GCACATCGCG TTCTTGTAGA 2400

ATCTTTTTCA AATAACAGTG CAATATTTTT ATTTTTTAAC ATAGGCTTTT CAGTGCCAAT 2460 

ATATTTAGGA CGTTTTAAAT CCTCGGAGAG TGTTAATAAG GTTCTACCTC TTGTCGTGAA 2520 

AAGTCTAATA AAGTTAAAAA ACTTCTGTTT CGTAnATTTT TCATTAAnA 2569 

(2) INFORMATION FOR SEQ ID NO: 75: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1273 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 75:
CCTGGAACCA TCCaATCGtG CaAATCtTGa AAGaGAATAC GCAACAACAA TTAAATGTAT 60

TGGAACACTA TATTCCAAAT GACCATCCAG CACTCGTTGA ATTAAAAATA TGGGAACGTT 120

GGTTACATAA ACAAGGTTAC AAAGACATCC ATTTAGATAT TACTGCGCAC CACCTAGATC 180

CTATTACACA GGTTTATTTA TTCAATGTCA TTTTGCTGAA AATGAATCTC GAGTTTTAAC 240

AGGTGGTTAT TACAAAGGAA GCATCGAAGG GTTTGGATTA GGATTAACAC TTTAAGTAAG 300

GGAGTATGCA CAATGTTAAG AATCGCCATA GCCAAAGGAC GTCTAATGGA TAGTTTAATT 360

AACTATTTAG ATGTAATTGA ATATACGACA TTATCAGAAA CATTAAAAAA TAGAGAACGC 420

CAATTATTAT TAAGTGTAGA TAATATTGAA TGCATTTTAG TAAAAGGAAG TGACGTGCCA 480

ATCTATGTGG AACAAGGAAT GGCAGACATA GGCATTGTTG GTAGCGACAT ATTAGATGAG 540

CGCCAATATA ATGTTAATAA TTTGTTGAAT ATGCCTTTTG GAGCATGTCA TTTTGCGGTT 600

GCAGCGAAAC CTGAAACGAC CAATTATCGT AAAATCGCAA CGAGTTATGT TCATACTGCT 660

GAAACATATT TTAAATCAAA AGGTATTGAT GTCGAATTGA TTAAATTGAA TGGCTCTGTT 720

GAATTGGCCT GTGTTGTAGA TATGGTAGAC GGAATTGTCG ACATCGTTCA AACAGGTACT 780

ACGCTAAAAG CGAACGGACT GGTTGAAAAG CAACATATTA GTGATATCAA TGCAAGATTA 840

ATAACTAATA AAGCAGCTTA TTTTAAAAAA TCACAATTAA TAGAGCAATT TATTCGCTCT 900

TTGGAGGTGT CTATTGCCAA TGCTTAATGC ACAACAATTT TTAAATCAAT TTTCATTAGA 960

AGCACCATTA GATGAGTCAT TGTATCCaAT TATTCGCGAT ATTTGTCAGG AAGTTAAAGT 1020
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TTTAGaAATT AGTCATGAmC AAATTAAAGC AGCATTTGAC ACATTAGATG AAAAAACAAA 1140

ACAAGCATTA CAACAAAGTT ATGAAAGAAT TAnAGCATAT CAaGAAaGTA TtaAACAGaC 1200 

GaATCAACAG TTAGAAGaAT CAGTGGaGTG tTrTGaAATA TACCATCCmC taGaAAGTGT 1260 

CGGTATTTAT GTG 1273 

(2) INFORMATION FOR SEQ ID NO: 76: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1308 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 76: 

GTTGATAAAT TAAAAATGTT TTTATCAGAT ATTCAAAGTT ACCAACAATA TAGTAAAGAT 60

CATCCGGTGT ATCAGTTAAT TGATAAATTT TATAATGATC ATTATGTTAT TCAATACTTT 120 

AGTGGACTTA TTGGTGGACG TGGACGACGT GCAAATCTTT ATGGTTTATT TAATAAAGCT 180 

ATCGAGTTTG AGAATTCAAG TTTTAGAGGT TTATATCAAT TTATTCGTTT TATCGATGAA 240

TTGATTGAAA GAGGCAAAGA TTTTGGTGAG GAAAATGTAG TTGGTCCAAA CGATAATGTC 300 

GTTAGAATGA TGACAATTCA TAGTAGTAAA GGTCTAGAGT TTCCATTTGT CATTTATTCT 360 

GGATTGTCAA AAGATTTTAA TAAACGTGAT TTGAAACAAC CAGTTATTTT AAATCAGCAA 420 

TTTGGTCTCG GAATGGATTA TTTTGATGTG GATAAAGAAA TGGCATTTCC ATCTTTAGCT 480 

TCGGTTGCAT ATAGAGCTGT TGCCGArAAA GAACTTGTGT CAGAAGAAAT GCGATTAGTC 540

TATGTAGCAT TAACAAGAGC GAAAGAACAA CTTTATTTAA TTGGTAGAGT GAAAAATGAT 600 

AAATCATTAC TAGAACTAGA GCAATTGTCT ATTTCTGGTG AGCACATTGC TGTCAATGAA 660 

CGATTAACTT CACCAAATCC GTTCCATCTT ATTTATAGTA TTTTATCTAA ACATCAATCT 720 

GCGTCAATTC CAGATGATTT AAAATTTGAA AAAGATATAG CACAAATTGA AGATAGTAGT 780 

CGTCCGAATG TAAATATTTC AATTGTGTAC TTTGAAGATG TGTCTACAGA AACCATTTTA 840 

GATAATGATG AATATCGTTC GGTTAATCAA TTAGAAACTA TGCAAAATGG TAATGAAGAT 900

GTTAAAGCAC AAATTAAACA CCAACTTGAT TATCGATATC CATATGTAAA TGATACTAAA 960

AAGCCCTCAA AACAATCTGT TTCTGAATTG AAAAGACAAT ATGAAACAGA AGAAAGTGGC 1020 

ACAAGTTACG AACGAGTAAG GCAATATCGT ATCGGTTTTT CAACGTATGA ACGACCTAAA 1080

TTTCTAAGTG AACAAGGTAA ACGAAAAGCG AATGAAATTG GTACGTTAAT GCATACAGTG 1140
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GATGGATTAA TCGATAAACA TATTATCGAA GCAGATGCGA AAAAAGATAT CCGTATGGAT 1260

GAAATAATGA CATTTATCAA TAGTGATTAT ATTCGATATT GCTGAAGC 1308

(2) INFORMATION FOR SEQ ID NO: 77: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 1431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 77: 
GATGCCATTn ATnnGTATGC AAGAAGTTGT TCCGGGTTCA GGTGGATTaC CAGTTGGTAC 60
TGGTGGTAAG ACGTTACTAA TGCTTTCAGG CGGTATAGAC TCACCAGTTG CTGGGATGGA 120
AGTGATGAGA CGTGGCGTAA CAATTGAAGC GATTCATTTC CATAGTCCAC CATTTACAAG 180
TGATCAAGCA AAAGAAAAAG TTATTGAATT GACACGTATT TTAGCTGAAC GTGTTGGACC 240
AATTAAATTG CATATTGTAC CATTTACAGA ATTGCAAAAA CAGGTAAATA AAGTTGTACA 300
TCCAAGATAT ACAATGACTT CAACGAGACG TATGATGATG CGTGTTGCTG ATAAATTAGT 360
ACATCAAATA GGGGCTTTAG CTATTGTAAA TGGTGAAAAC CTAGGGCAGG TAGCCAGTCA 420
AACACTTCAT AGCATGTATG CAATTAATAA TGTAACTTCT ACTCCTGTAT TACGTCCTTT 480
ATTAACTTAC GATAAAGAAG AAATTATTAT TAAATCGAAA GAAATTGGTA CATTTGAAAC 540
ATCTATTCAA CCATTTGAAG ATTGTTGTAC AATTTTCACC CCTAAAAATC CAGTAACCGA 600
ACCAAACTTT GATAAGGTAG TCCAATATGA AAGTGTCTTT GATTTTGAAG AGATGATTAA 660
TCGTGCTGTT GAAAATATTG AAACACTTGA AATAACTAGT GATTATAAAA CTATTAAAGA 720
ACAQCAAACA AACCAATTAA TAAACGACTT TTTATAAATA AAATCCTAGA GTAAATTTAA 780
ACATAAGGGG ATGTTAAACT ATGGATTTGA ACTTAACGAT GATTATAATC ATAATTTTAT 840
TTGGTTTTAT CGCGGCGTTT ATAGATTCGG TTGTAGGGGG TGGCGGTTTA ATTTCTACGC 900
CAGCATTATT AGCAATCGGT CTACCACCAT CTGTGGCTTT AGGTACAAAT AAATTGGCAA 960
GTTCGTTTGG TTCTTTAACT AGTACGATAA AGTTTATAAG GTCCGGTAAA GTGGACTTAT 1020
ATGTTGTTGC CAAATTATTT GGTTTTGTAT TTTTGGCATC TGCATGTGGC GCATATATTG 1080
CAACGATGGT TCCGTCACAA ATATTGAAAC CTTTAATCAT CATTGCACTT TCGTCGGTGT 1140
TTATATTCAC ATTACTTAAA AAAGATTGGG GCAATACACG CACGTTTACT CAATTTACAT 1200
TTAA6AAAGC CATAATATTT GCAGCACTTT TTATATTAAT CGGCTTTTAT GATGGATTTG 1260
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TAAGTGCAGC AGGAAATGCT AAAGTTTTGA ACTTTGCTTC TAATATAGGT GCGCTTGTAT 1380 

TATTTATGGT ATTAGGACAA GTAGATTATG TAATAGGTTT AATTATGGCT A 1431 

(2) INFORMATION FOR SEQ ID NO: 78: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4403 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78:

AATATTATTT TAAATTCAAT ATTTATTGGT GCATTTATTT TAAACTTATT ATTCOCCTTT 60 

ACCATTATTT TCATGGAAAG ACGTTCTGCC AATTCTATCT GGGCTTGGTT ACTAGTCTTA 120 

GTTTTCTTGC CTTTATTCGG CTTCATTTTA TACTTACTAT TAGGACGACA AATTCAACGT 180

GACCAAATTT TCAAAATTGA TAAGGAAGAT AAAAAAGGAT TAGAGTTAAT CGTTGATGAG 240

CAATTAGCTG CTTTAAAAAA TGAAAACTTT TCAAATTCCA ATTATCAAAT TGTAAAATTT 300 

AAAGAAATGA TTCAAATGTT GTTATATAAT AACGCAGCAT TTTTAACAAC AGACAACGAT 360

TTArrrrtAT ACACAGACGG CCAAGAAAAA TTTGATGACC TAATACAAGA CATCCGTAAT 420 

GCTACTGATT ATATTCATTT TCAGTACTAT ATTATTCAAA ATGATGAATT AGGTCGTACC 480

ATTTTAAATG AACTTGGTAA AAAAGCGGAA CAAGGTGTAG AAGTTAAAAT TCTTTATGAT 540 

GACATGGGTT CTCGTGGACT GCGTAAAAAA GGCTTACGCC CGTTTCGCAA TAAAGGTGGA 600 

CATGCTGAAG CATTTTTCCC ATCAAAATTA CCTTTAATTA ACTTGCGTAT GAACAATCGA 660 

AACCATCGAA AAATTGTTGT AATAGATGGG CAAATTGGAT ATGTTGGTGG TTTTAATGTT 720

GGTC&TGAGT ACTTAGGTAA ATCAAAAAAA TTCGGCTATT GGCGAGATAC GCATTTACGA 780 

ATTGTCGGGG ATGCAGTGAA TGCATTGCAA TTACGATTTA TTCTAGATTG GAATTCACAA 840

GCCACACGTG ACCACATCTC CTATGATGAT CGTTATTTCC CAGATGTAAA TTCTGGTGGA 900 

ACAATTGGCG TTCAAATAGC TTCTAGTGGT CCTGACGAAG AATGGGAACA GATTAAATAC 960 

GGCTATTTGA AAATGATTTC ATCTGCTAAA AAATCGATTT ATATTCAATC TCCCTATTTC 1020 

ATACCTGATC AAGCCTTTTT AGATTCTATT AAAATTGCGG CATTAGGTGG TGTTGATGTC 1080 

AATATCATGA TTCCTAATAA ACCTGACCAT CCGTTTGTTT TTTGGGCTAC TTTAAAAAAT 1140

GCAGCATCCT TATTAGATGC CGGTGTTAAA GTATTTCACT ACGACAATGG CTTTTTACAC 1200 

TCAAAAACAC TTGTTATAGA TGATGAAATT GCAAGTGTGG GAACAGCTAA TATGGACCAT 1260 
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AAATTAAAAC AAGCTTTTAT AGATGATTTA GCAGTATCTT CTGAATTAAC AAAAGCACGT 1380

TATGCTAAGC GAAGTCTTTG GATTAAATTT AAAGAAGGTA TTTCACAATT ATTGTCACCT 1440

ATCTTATAAA ATAGAAATAT GAGGAGTGTA aCTTTAATGC AACAATCAGA CGTCATTAGT 1500

GCTGCCAAAA AATATATGGA ATCTATTCAT CAAAATGATT ATACAGGCCA TGATATTGCG 1560

CATGTATATC GTGTCACTGC TTTAGCTAAA TCAATCGCTG AAAATGAAGG TGTTAATGAT 1620

ACTTTAGTCA TTGAACTCGC ATGTTTGCTT CATGATACCG TTGACGAAAA AGTTGTAGAT 1680

GCTAACAAAC AATATGTTGA ATTGAAGTCA TTTTTATCTT CTTTATCACT ATCAACCGAA 1740

GATCAAGAGC ACATTTTATT TATTATTAAT AATATGAGCT ATCGCAATGG CAAAAATGAT 1800

CATGTCACTT TATCTTTAGA AGGTCAAATT GTCAGGGATG CAGATCGTCT TGATGCTATA 1860

GGCGCTATAG GTGTTGCACG AACATTTCAA TTTGCAGGAC ACTTTGGTGA ACCTATGTGG 1920

ACAGAACATA TGTCACTAGA TAAGATTAAT GATGATTTAG TTGAACAGTT GCCACCATCT 1980

GCAATTAAAC ATTTCTTTGA AAAATTACTT AAGTTAGAAT CTTTAATGCA TACAGATACG 2040

GCGAAGATGA TTGCTAAAGA ACGTCACGAC TTTATGATGA TGTACTTGAA ACAGTTTTTT 2100

ACGGAATGGA ATTGTCACGA CTAGACATTG AAGTTGTAGT ATGATGATGC GATGTAATGG 2160

CGTGTTGTTG TGGAAGCTTG GTGTCATGCC ATGTTACTTT GATGTGTTGT TGTGGGAGCT 2220

TGGTGACATG TCATGCTACT TTGATGTGCT GGTACCACGA TGCGTCTTGA TGTAGTGCTA 2280

TGATGTGGCA TTGCGGTGTT ATGGTGTTAT AGACAGGTTT GGCGTTGATG CCATGTTACT 2340

TTGATGTGCT GGTACCACGA TGCGACTTGA TGTAGTGCTA TGATGTGGCA TTGCGGTGTT 2400

ATGGTGTTAT AGACCGGTTT GATGTTGATG CCATGTTACT TTGATGTGCT GGTGCTACGA 2460

TGCGACTTGA TGTAGTGCTA TGATGTGGCG TTGCGCTGTT ATGGTGTTAT AGCCAGGTTT 2520

GGTGTTGATG TCATGCCGTT ACGATTCTAT GATATGTTGT TGGGACGTTG CAATGTGTAT 2580

TATGCCGTTG TGACGTTATT ATTTCACACT GTTACATGTA TAAGTGAATT GCTGTGGAAA 2640

TTTGCGACAT ATACTGCTAC ACTGATGAAT CATTGTGTCA AGATGACATT GCGATGAAGA 2700

ATGACAACTC TGTTATTAAC CACTTTTTAC ATACTGAAAA CTCGTTAATA TTATTTCAAA 2760

TAAAAACAGC AGTAGGATGA CTTTCACATT TGAAATCATC TTACTGCTGT TTCTATTTAT 2820

CACATATTGT ATAATGTGAC ACTAAGTTTC GCTATTGAAG CGAAAAATAA TGTGCGCCCT 2880

ATAAAGTTAA AATTATCTTC AACTTTTAGG GTGCACATTA TTTGGACTTG CTAAGGTTAT 2940

TTCTTTTTCT TTTTAGACAC AACTTGTGTG TTTTTGCCTT TTTTATTGCt GCCGCCGTTG 3000

TGCTCTCTTT CATACGCTTC AATGAAAGGT TGTACTTCTT TTTTAGCGAC TTTTTCATAA 3060
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10 



CCAAGTGCTG ATGCTGAGCT TAATGAAATC CAGATAATCA TAATTGGTGA AATGACCATC 3180 

ATCATGTAAC CCATTTGACG TTGTTCGTCT GGCATCGTTT TACTTGATAC ATATGCTTGG 3240 

ATAAAGTATA AAACACCGGC AATAATTGTA ATCCAAATAT CAGGACGTCC TAAATCGAAC 3300 

CATAAGAAGT GTGGATATTT AAACAAACCA TCTACAAGTT GGTCTTTAAG TACAAAGTAT 3360 

AATCCCATGA TGATTGGTAA TTGGATTAGC ATTGGTAAAC AACCCAACAT ACTCTTAATC 3420 

GGGTTCATGT CATACTTTTT ATATACTTGC ATTAATTCTT GGTTTGCAGC CATTTTTTCT 3480

TCTTGTGTAC GCGnCaCGTT CACTTTTTCT TGAATTTTTT CAACTTCTGG CTTTGCAACT 3540 

TTCATTTTTT GACGCATCAT ATGACTATTT TTATAGTTTG ACAACATGAA TGGTAATAAA 3600 

ATAATACGAA TTACCAATAC AAGGATAATA ATAGCTAAAC CATAATTGTC GTTTAATAAG 3660

TTATTTCCCA ACCAATCCAA TACATTTTTC ATTGGATCTA CGAATGTATT GTAGAAAAAy 3720 

CwCtACGTTT TTCAGGTTTA GAATAGTCAC AACCAGCCAA AAAGACCATA ATACCTAAAA 3780

ATAATGGTAG TAACGCTTTT TTCTTCATTT TTCCACCTCT ATCATTATAT TCACATAGGA 3840

TTTATTCTAT CACATTAATG AGTACGTATG AAACAATAAG TGGAAAAATT TAACTAATTA 3900

TTAAAAAAAT CTTTGAATCG ATTAACAGTC TTTTCAATAT TTTCACTTTT AGAAATGGCT 3960

GAAATGACTG AAATTCCATT GGCACCTGCT TCTACAATCG GCGCCACATT ATTAGTATTG 4020

ATACCGCCAA TAGCTACAAT CGGTAGTTGC GGATTCATTT CTTTAAACGT TGCAATCATT 4080 

TCTGGACCTA CTGGTATATG CGCGTCATGC TTCGACGGCG TAGGATAGAT TGGTCCAACA 4140 

CCTATATAAT CmACATGAGT TAAATCAGAT TTTGCATACT CATCTAAATC ACTAATACTA 4200 

AGTCCAATAA TTTTATCAGT GAAATATTGT GCTATCTCTT TGACTTTCGC ATCATCTTGA 4260 

CCGACATGTA TACCATCCGC GTTAATTTCT TTTGCCAAGG ATACATCATC ATTAACGATA 4320

AAAGGCACAT CATATTGATG ACAGAGATGC TGTAATTCTT TAGCTAATAC AAGTTTATCG 4380 

TTTCCTTTTA AAGCTGATTC ACC 4403

(2) INFORMATION FOR SEQ ID NO: 79: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1808 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 79: 
TGGAnCCAAT ATTAGAAATG ATTAAAACAT TAACAGGTAT TAATAGTCCT TCAGGAGnCA 60 
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TAACAAATAA AGGTGCGTTA TTAATAACAG TGCCAGGCAA AAATGATGAA GTACAACGCT 180 

GTATTACTGC TCATGTTGAT ACTTTAGGTG CaATGGTTAA AGAAATTAAA GAAGATGGTC 240

GCTTaGCAAT AGAATTAATT GGAGGATTCA CGTATAACGC GATTGAGGGT GAATATTGCC 300 

AAATTAAAAC TGATGCTGGT CAAATATATA CAGGAACAAT TTGTCTGCAT GAAACAAGTG 360 

TTCATGTATA TAGAAATAAT CATGAAATAC CTAGAGATCA AAAGCATATG GAAATAAGAA 420 

TTGATGAAGT AACTACATCA GAAGAAGATA CAAAGAGTTT AGGTATTTCA GTAGGTGATT 480 

TTGTTAGCTT TGATCCACGT ACAGTTATCA CGTCATCAGG TTTTATTAAA TCTCGTCATT 540 

TAGATGATAA AGCTAGCGTA CGgTtGATAC TACAATTACT AAAGAAATTA AAAGAAGAGC 600

AAATAATATT ACCACATACA ACGCAATTTT ATATTTCTAA TAACGAAGAA ATAGGTTACG 660 

GTGCAAATGC ATCAATTGAT TCGAAAATCA AAGAATATAT TGCATTAGAT ATGGGCGCGT 720 

TGGGAGACGG TCAAGCATCG GATGAATATA CAGTTTCTAT TTGTGCCAAA GATGCTTCAG 780

GTCCATATCA TAAGCAATTG AAATCGCACC TAGTTAATCT TTGCAAAATA AATAACATTC 840

CATATAAAGT AGACATATAT CCATATTATG GTTCAGATGC TTCAGCAGCT TTACATGCTG 900 

GTGCGGATAT CAGACATGGT TTATTTGGCG CTGGCATTGA ATCATCTCAT GCAATGGAAC 960 

GAACACATAT TGATTCTATT AAAGCGACAG AGAAATTACT ATATGCATAT TGCTTATCAC 1020 

CAATTGAGTA AACAATTAGT GTTGACAAAT GTGaACGACC TATGTAATAT AATGAACTAT 1080 

AAAAATAATT AGAATTTTCT AAAGAAATAG TAGCAGATAT GAAACGTAGC AAATAGAAAG 1140 

CTAATGGGTG ATGGGAATTA GCACGCCATA TCTTGTGAAT TGGACTTTGG AAAACAATTG 1200 

AATGAGTTTT GAAAGTGAAC ATGAATTATG TTAACTAAGG TGGCACCACG GTAACGCGTC 1260

CTTACAGGTA TATGCGTTAT GTGGTGTCTT TTTATTTAGA CAAAATGTAG TAGTTAATTA 1320 

AAGCTAGCAA CAGAAAGTTA GTGGATGATG TGAACTAACA CCGAGATTAA TGAAATTGGG 1380 

TTTTGTCTGC AACAGAAAAA TTATATATAG TAAAGAGTGA ACTATGAATA TTTCGAATAT 1440

TCGGTTAATT TAGGTGGTAC CACGCGTCAc nTCCTTTATA TTGATAAGGA TGCTGGCGCT 1500 

TTTTTGAAAG GAGCGTATAG AATGGATATA TTTTATAAAA AAATAAAAGC AAATGTAACG 1560 

CCCGAAGTTT TAGCACAACT TCATTCCAAG AAGaTCATTT TGGAAAGTAC AAATCAACAA 1620 

CAAACTAAAG GTCGCTATTC AGTTGTTATT TTTGATATTT ATGGCACTTT AACTTTAGAT 1680 

AATGATGTAT TATCAGTAAG TACTTTAAAA GAATCGTATC AAATCACTGA AAGACCGTAC 1740 

CATTATTTAA CGACTAAnAT AAATGAAGAC TACCATAATA TTCCAAGATG AGGCAACTTA 1800 

AGTCATTA 1808 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 80:

TGGTCGTCAA TTTCTTGATT ATATCTATAA TCCTCATTTT CAATATTAGA GTCTGTAGAA 60 

TCATCGATAT TATTATCATT CGCATGACTA GAAGCAGAAT CATTATTTTT ATCATTGCTT 120 

TCTTCTTTTT TGAAGTCTTT ATTTATCAAG TAAATTTCTT CATCAAAATC AGCTTGTTGA 180

GATGTATCAT CTTTATTTTG ATTAGAAAAA TGTGTAGCCT TTGATCTTTT TCTTTGCCGT 240

CTTTTCTTAG ATGTATTCCT CGTAAATAAT TCTAATTCAT CTTTATCTTC ATTTGATTCT 300 

TGTTGATCGT TCTTCGTTTT ATCATCCATC AATACTCACA CCCTTTAATA AGATGGTAAA 360

TGGGCACGGA ATCTTTCAAT AAATTTCTCT CCACGCTCTT CAAAAGTACT ATATTGATCC 420 

CAACTCGCAC AAGCAGGTGA CAATAATACA ACATCATTTG GTTCTATAAT ATCTTGTACT 480

TTATCAACAG CGTCTTCGAC ATTGTTCGCT TCAATGACCG ATTTCCCTTG ACTATTACCT 540

AGTTTAGCAA ACTTAGCTTT CGTTTGTCCG AATACAACCA TCGCGCGAAC ATTTTCCATA 600

TAAGGAATGA GTTCGTCAAA TTCATTCCCT CGATCCAAAC CACCACATAA CCAAATGATT 660 

GGTTGATTAA ATGAATTTAA GGCAAACTGT GTTGCTAGCG TGTTTGTTGC TTTGGAATCA 720 

TTATAATATT TATTAGTTCT ATTAGTACCA ACATATTGCA ATCTATGCTC TATTCCTGAA 780 

AATGTAGTTA AACTATCAAT AATTGCtTTA ATAGGTACAC CAGCanAATA CAAGCAAGCA 840

CAGCTGCTAA TATATTTcTA AATTATGTTC ACCAGGCAAT ACTAGAtCTT CAGTGTTAAT 900 

AATaCSAACA CCTTTATaAA CGATAAAACC ATCTTtAATA TAAaTACCAT CArCTtCTTG 960 

TTGAGTTGAG AAATACAATG TCTTAGCTTT TAATTCTTCC GACTCTATCA CTTGTCTTTG 1020

ATGATAATTA CAAATCAAAT AATCCTCTTC CGTTTGATTT TTATATATTT GCTTTTTAGC 1080

ATTTTGATAG TTTTCTAAAT TTTCATGGTA ATCTAGATGC GCCGAATAAA TGTTAGTAAT 1140 

TATAGCAATG TGTGGTTTAT ACTTTTCGAT TCCAAGTAAC TGGAATGACG ACAACTCTGT 1200 

AACTAAATAA TCTGTAGGCT TTACTTCTTG TGCTACTTTA GATGCAACAT AACCAATATT 1260 

GCCGGATAAT CTTCCAGTTA AGCGACTTTT TTTAAACATA TCTCCAATTA GAGAAGTAAC 1320 

(2) INFORMATION FOR SEQ ID NO: 81: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4280 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 81:

TTTACACCAA TCAAAAAATC GAACTGATAT AAATAAGTAC AAAGCTTATC TATCAATCCG 60

ATTTAGTTAT AAAACAAAAA AAGCCACAGT AATGTGGCTT TTTGTTATAT TCAGTATCAA 120

AATGGTATCA ATAGCCATTT TCGGAAGTCA AGAATGGCTT AACAACGCGG TTTAAAGCTA 180 

TCCAATACTA CCTTCCATTT CGAACTTGAT TAAACGGTTC ATTTCGACCG CGTATTCCAT 240 

TGGAAGTTCT TTTGTAAATG GTTCGATGAA TCCCATAACA ATCATTTCTG TCGCTTCTTC 300

TTCAGAAATA CCACGACTCA TTAGATAGAA TAATTGTTCT TCAGAAACTT TTGAAACCTT 360 

GGCTTCATGT TCTAATGATA TTTGATCGTT GAATACTTCG TTATATGGAA TTGTATCTGA 420 

TGTTGATTCG TTATCTAAGA TTAATGTATC ACATTCAATA TTTGAACGAG CACCTTTTGC 480

TTTACGTCCA AAATGAACAA TACCGCGATA AATAACTTTA CCACCATTTT TAGAAATAGA 540 

TTTAGAAACA ATTGTAGAAG ATGTATTAGG TGCTTTATGA ATCATTTTAG CACCGGCATC 600 

TTGAACTTGT CCTTTACCAG CAAATGCAAT AGATAATGTA CTACCTTTTG CACCTTCACC 660

TAAAAGAACA CAGTTTGGAT ATTTCATCGT TAACTTAGAA CCTAAGTTAC CATCTACCCA 720 

TTCCATATTT CCGTTTTCAT AAACAAAAGT ACGTTTTGTA ACTAAATTGT ATACATTGTT 780 

CGCCCAGTTT TGAATCGTAG TATAACGAAC GTGCGCATCT TTATGCACAA TGATTTCCAC 840

AACAGCAGAG TGTAAAGAAC TAGTTGTATA AACTGGTGCA GTACAACCTT CTACGTAATG 900 

TACAGAAGCA CCTTCATCAG CAATGATTAA TGTACGTTCA AATTGACCCA TGTTCTCAGA 960

GTTAATACGG AAATAAGCTT GTAGTGGCGT ATCTAGTTTG ATATTTTTAG GTACATAAAT 1020 

GAAGGAACCA CCTGACCATA CTGCTGAGTT TAACGCCGCA AATTTGTTAT CTGCTGCAGG 1080 

TACTACAGAA GCAAAGTATT TTTTGAATAA TTCTTCATTT TCTTGTAAAG CACTATCTGT 1140

ATCTTTAAAG ATAATACCTT TTTCTTCAAG TTCTTTTTCC ATATTATGGT AAACAACTTC 1200 

AGATTCATAT TGAGCAGAAA CACCAGCTAA ATATTTTTGT TCAGCTTCAG GAATTCCTAA 1260 

TTTATCGAAA GTTCTTTTAA TTTCTTCTGG CACTTCATCC CATGAACGTT CAGCTTGTTC 1320 

TGAAGGCTTT ACATAGTAAG TAATGTCATC GAAATTCAAT TCTGATAAGT CGCCACCCCA 1380 

TTGAGGCATT GGCATTTTAT AAAACAATTT TAATGATTTA AGACGGAAAT CTAACATCCA 1440

TTCCGGCTCA TTTTTCATGT TAGAAATTTC TCTAACGATA TTCTCAGTTA AACCACGTTC 1500 

TGATCTGAAA ATGGACACAT CATCGTCGTG GAATCCATAT TTATAATCCC CAACATCAGG 1560 
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TTTAATTCAT GATGTAAACC ATATTATAAC AATGACATGA CATCTTATAA AAATTTTTAT 1680 

ACTTTTATAT GTCTAATATC AAAATTATCT ATGATTAACA GCATTCTATT CTTCTTCAGT 1740 

CGTACCTTCT GCTTTACCTT CTTTAGCAAC AGTACCTTTT TCCAATGCTT TCCAAGCTAA 1800 

TGTGGCACAT TTAATACGAG CTGGGAATTG AGATACACCT TGCAATGCTT CAATATCTCC 1860 

CATTTCTTCT GTAATCACAT AGTCTTCACC AAGCATCATT TTCGTAAATT CTTGGCTCAT 1920 

TTGCATTGCT TCTCCAAGTG AATGACCTTT AACAGCTTGT GTCATCATCG ATGCACTTGC 1980 

CATTGAAATC GAACAACCTT CACCTTCAAA CTTAGCATCT TTTATAATGC CGTCTTCTAT 2040 

ATCAAATGTT AGTCGTATAC GGTCACCGCA TGTCGGGTTA TTCATATCTA CTGTCATAGA 2100 

CCCGTTATCT AATACACCTT TATTTCTAGG ATTTTTATAA TGATCCATAA TGACAGATCT 2160 



ATATAATTGA TCTAGATTAT TAAAATTCAT AAGAGAAAAA CTCCTTCGTT TGTTTCAAGG 2220 

CATTTATTAA CTGATCAACG TCTTCTTTOG TGTTGTATAT ATAAAAACTC GCTCTAGCTG 2280

TTGAAGACAC ATTTAACCAT TTCATTAACG GTTGCGCACA ATGATGCCCA GCTCTAACCG 2340 

CTACACCTTC TGTATCTACG GCTGTAGCAA CATCGTGTGG ATGTACATCT TGTAAATTAA 2400 

ACGTTATTAC ACCTGCACGA CGATCCTTTG GCGGGCCATA AATTTCAATT CCTTCAATTG 2460

CAGACATTTG CTCATAAGCA TATATCGTTA ATTCTTGTTC ATATTTATGA ATTGCATCAA 2520

AACCTATGCG TTCTAAATAG CGAATAGCTT CTGCAAGCCC AATTGCTTGA GCAATTAATG 2580

GAGTACCCGC CTCAAATTTA GTAGGTAAAT CAGCCCATGT TGCATCATAC TTACTTACAA 2640

AATCAATCAT GTCGCCACCG AACTCAATCG GTTCCATTTT TTGTAGTAAC TCACGTTTAC 2700 

CAAATAATAC GCCAATACCT GTTGGTCCAA GCATTTTATG ACCACTAAAA CTATAAAAAT 2760 

CAGCATTCAT TTCTTGCATA TCAAGTTTCA TATGTGGTGC TGctTGCGCC CCATCAACAC 2820

TGATSATTGC ACCATGTTGA TGAGCTATTT CTGCAATGGT TTTAACATCA TTAATTGTAC 2880 

CGAGCACATT AGATATATGT GCAATAGCAA CGATCTTTGT TTTATCATTA ATCGTTTGCT 2940 

TAATATCCTC GATGTTTAAT TCACCGTCAG CTGTCATTGG TATAAATTTC AATGTCGCAT 3000

TTTTACGCTT TGCTAACTGT TGCCAAGGAA CAATATTGGC ATGATGTTCC ATTTCAGTGA 3060

CAACAATTTC ATCGCCCTCT TCAACATTTG CATCACCATA GCTATGTGCT ACAAGGTTAA 3120

TCGACGCAGT TGTTCCGCGT GTAAAAATGA TTTCTTCAAA ATACTTCGCA TTAATAAAAC 3180 

GACGAACGGT TTCACGGGCA TTTTCATAAC CATCAGTTGC CAATGATCCT AATGTATGAA 3240

CACCACGATG AACGTTTGAA TTATAACGCT TGTAGTAATC TTCTAAAACA TTTAACACTT 3300

GCACAGGCGT TTGACTTGTC GCTGTTGAAT CAAGATATGC TAAACGTTTG CCATTGACTT 3360 
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CTTCATTCAC GACCTTTCTT AAATAAAAAT CCTAATCATT TAAATACTGA CGTTGTATTA 3480

GTCTTATACC AATATCGACA GTCTATATCT ATTACAAACT TTTATTTTCA AAATATTATT 3540

TAGAAACTTT GCGTTCAATT ACTTCTCTCA ATTGACGTTT AACGTCTTCG ATAGGTAATT 3600

CACGTACTAC TGGATCTAAG AAACCATGTA TAACAAGACG TTCCGCTTCT CTTTGAGAAA 3660

TACCACGACT CATTAAATAG TAAAGTTGAT CTGGATCAAC ACGACCTACT GATGCAGCAT 3720

GACCAGCTTG TACATCATCT TCATCAATTA ATAAAATAGG ATTCGCGTCA CCACGAGCAT 3780

GTTCAGATAA CATTAATACA CGTGATTCCT GATTAGCAAT TGATTTAGTT CCACCATGCT 3840

TAATGTAGCC GATACCATTA AATACAGACG ATGCATGTTC TTTCATAACA CCATGTTTAA 3900

GGATATAACC ATCTGTTTCT TTACCATATT GTACGATTTT AGATGTTAGA TTAATTTTTT 3960

GTTCGCCTGT ACCTACAACT ACTGATTTAA GTGAACTTGT TGAACGATCA CCAAATAAAT 4020

TTGTTGTATT ATCAATAATT TGGCTACCCT CATTCATTAA ACCTAGTGCC CAATTAATTG 4080

AGGCATCCGC TTCAGTAATA CCACGTCGAA TGATATGACC TGTAAAGCCT TTATCCATAT 4140

AGTCCACTGA GCCATATGTG ATATTTGAAT TTGCACCAGC AATCACTTCA GAAATAATAT 4200

TtAATTGATT TCCTTCACCA GATGCATTTG TtlTAAGTAATT TTCAACATAT GTGACTTCGG 4260

CGCTTTCTTC AGTAACGATG 4280




(2) INFORMATION FOR SEQ ID NO: 82: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15598 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 82:

TCnGACTCGA ACGGTGmAAC TAttCCGTTG TaATTCCgGA GgAAsCAAGG TATGCCCATC 60

TGCaAAGAAA gaATGsAATG AACTTTTTGG AAATGTAGAA GTGGTAAATA AAGATAAAGG 120

ATATTACATT CTGAGAAGTA TAAAAGCTTG AAATGAAATG GATATTCTGT TATAGTTATA 180

TAATGTAAAA ATTTATGTTC AATAAGTGTG TACTTTTACG TTAAATAGAT AAGTTAATTA 240

AGAATAAATA TAGAATCGAA AATGGTGTCA TCATTAGTGT TGCCGTTTTC TTTTTGTCTT 300

TTTATTAATA TGCTTATGGT ATTTAGCTAA AAGCGGATCA CATAATTTTT GAGGGGTGAA 360

AGGTCAAGTT GTCCAATATG GAAGACATCG TAAACGTAGA AACTACGCGA 420

GAATTTCAGA AGTATTAGAA TTACCAAACT TAATAGAAAT TCAAACTAAA TCTTACGAGT 480
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70 






CTGGTAATTT GTCATTAGAG TTTGTGGATT ACCGTTTAGG AGAACCAAAA TATGATTTAG 600 

AAGAATCTAA AAACCGTGAC GCTACTTATG CTGCACCTCT TCGTGTAAAA GTGCGTCTAA 660 

TCATTAAAGA AACAGGAGAA GTTAAAGAAC AAGAAGTCTT TATGGGTGAT TTCCCATTAA 720 

TGACTGATAC AGGTACGTTC GTTATCAATG GTGCAGAACG TGTAATCGTA TCTCAATTAG 780

TTCGTTCACC ATCCGTTTAT TTCAATGAAA AAATCGACAA AAATGGTCGT GAAAACTATG 840 

ATGCAACAAT TATTCCAAAC CGTGGTGCAT GGTTAGAATA TGAAACAGAT GCTAAAGATG 900 

TTGTATACGT ACGTATTGAT AGAACACGTA AACTACCATT AACAGTATTG TTACGTGCAT 960 

TAGGTTTCTC AAGCGACCAA GAAATTGTTG ACCTTTTAGG TGACAATGAA TATTTACGTA 1020 

ATACTTTAGA GAAAGACGGC ACTGAAAACA CTGAACAAGC GTTATTAGAA ATCTATGAAC 1080 

GTTTACGTCC AGGTGAACCA CCAACTGTTG AAAATGCTAA AAGTCTATTG TATTCACGTT 1140 

TCTTTGATCC AAAACGCTAT GACTTAGCAA GCGTGGGTCG TTATAAAACA AACAAAAAAT 1200

TACATTTAAA ACATCGTTTA TTTAATCAAA AATTAGCTGA GCCAATTGTA AATACTGAAA 1260 

CTGGTGAAAT TGTAGTTGAA GAAGGTACAG TGCTTGATCG TCGTAAAATC GACGAAATCA 1320 

TGGATGTACT TGAATCAAAT GCAAACAGCG AAGTGTTTGA ATTGCATGGT AGCGTTATAG 1380

ACGAGCCAGT AGAAATTCAA TCAATTAAAG TATATGTTCC TAACGATGAT GAAGGTCGTA 1440

CGACAACTGT AATTGGTAAT GCTTTCCCTG ACTCAGAAGT TAAATGCATT ACACCAGCAG 1500 

ATATCATTGC TTCAATGAGT TACTTCTTTA ACTTATTAAG CGGTATTGGA TATACAGATG 1560 

ATATTGACCA TTTAGGTAAC CGTCGTTTAC GTTCTGTAGG TGAATTACTA CAAAACCAAT 1620 

TCCGTATCGG TTTATCAAGA ATGGAAAGAG TTGTACGTGA AAGAATGTCA ATTCAAGATA 1680 

CTGAGTCTAT CACACCTCAA CAATTAATTA ATATTCGACC TGTTATTGCA TCTATTAAAG 1740

AATttTTTGG TAGCTCTCAA TTATCACAAT TCATGGACCA AGCAAACCCA TTAGCTGAGT 1800 

TAACGCATAA ACGTCGTCTA TCAGCATTAG GACCTGGTGG TTTAACACGT GAACGTGCTC 1860 

AAATGGAAGT ACGTGACGTT CACTACTCTC ACTATGGCCG TATGTGTCCA ATTGAAACAC 1920 

CTGAGGGACC AAACATTGGA TTGATTAACT CATTATCAAG TTATGCACGT GTAAATGAAT 1980 

TCGGCTTTAT TGAAACACCA TATCGTAAAG TTGATTTAGA TACACATGCT ATCACTGATC 2040

AAATTGACTA TTTAACAGCT GACGAAGAAG ATAGCTATGT TGTAGCACAA GCAAACTCTA 2100 

AATTAGATGA AAATGGTCGT TTCATGGATG ATGAAGTTGT ATGTCGTTTC CGTGGTAACA 2160 

ATACAGTTAT GGCTAAAGAA AAAATGGATT ATATGGATGT ATCGCCGAAG CAAGTTGTTT 2220

CAGCAGCGAC AgcATGTATT CCATTCTTAG AAAATGATGA CTCAAACCGT GCATTGATGG 2280
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35 



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CAGGTATGGA ACACGTTGCA GCACGTGATT CTGGTGCGGC TATTACAGCT AAGCACAGAG 24 00 

GTCGTGTTGA ACATGTTGAA TCTAATGAAA TTCTTGTTCG TCGTCTAGTT GAAGAGAACG 2460 

GCGTTGAGCA TGAAGGTGAA TTAGATCGCT ATCCATTAGC TAAATTTAAA CGTTCAAACT 2520 

CAGGTACATG TTACAACCAA CGTCCAATCG TTGCAGTTGG AGATGTTGTT GAGTATAACG 2580 

AGATTTTAGC AGATGGACCA TCTATGGAAT TAGGAGAAAT GGCATTAGGT AGAAACGTAG 2640 

TAGTTGGTTT CATGACTTGG GACGGTTACA ACTATGAGGA TGCCGTTATC ATGAGTGAAA 2700 

GACTTGTGAA AGATGACGTG TATACTTCTA TTCATATTGA AGAGTATGAA TCAGAAGCAC 2760 

GTGATACTAA GTTAGGACCT GAAGAAATCA CAAGAGATAT TCCTAATGTT TCTGAAAGTG 2820

CACTTAAGAA CTTAGACGAT CGTGGTATCG TTTATATTGG TGCAGAAGTA AAAGATGGAG 2880 

ATATTTTAGT TGGTAAAGTA ACGCCTAAAG GTGTAACTGA GTTAACTCCC GAAGAAAGAT 2940 

TGTTACATGC AATCTTTGGT GAAAAAGCAC GTGAAGTTAG AGATACTTCA TTACGTGTAC 3000

CTCACGGCGC TGGCGGTATC GTTCTTGATG TAAAAGTATT CAATCGTGAA GAAGGCGACG 3060 

ATACATTATC ACCTGGTGTA AACCAATTAG TACGTGTATA TATCGTTCAA AAACGTAAAA 3120 

TTCATGTTGG TGATAAGATG TGTGGTCGAC ATGGTAACAA AGGTGTCATT TCTAAGATTG 3180

TTCCTGAAGA AGATATGCCT TACTTACCAG ATGGACGTCC GATCGATATC ATGTTAAATC 3240

CTCTTGGTGT ACCATCTCGT ATGAACATCG GACAAGTATT AGAGCTACAC TTAGGTATGG 3300 

CTGCTAAAAA TCTTGGTATT CACGTTGCAT CACCAGTATT TGACGGTGCA AACGATGACG 3360

ATGTATGGTC AACAATTGAA GAAGCTGGTA TGGCTCGTGA TGGTAAAACT GTACTTTATG 3420 

ATGGACGTAC AGGTGAACCA TTCGATAACC GTATTTCAGT AGGTGTAATG TACATGTTGA 3480

AACTTGCGCA CATGGTTGAT GATAAATTAC ATGCGCGTTC AACAGGACCA TATTCACTTG 3540

tTACACAACA ACCACTTGGC GGTAAAGCGC AATTCGGTGG ACAACGTTTT GGTGAGATGG 3600

AGGTATGGGC ACTTGAAGCA TATGGTGCTG CATACACATT ACAAGAAATC TTAACTTACA 3660 

AATCCGATGA TACAGTAGGA CGTGTGAAAA CATACGAGGC TATTGTTAAA GGTGAAAACA 3720 

TCTCTAGACC AAGTGTTCCA GAATCATTGC GAGTATTGAT GAAAGAATTA CAAAGTTTAG 3780 

GTTTAGATGT AAAAGTTATG GATGAGCAAG ATAATGAAAT CGAAATGACA GACGTTGATG 3840

ACGATGATGT TGTAGAACGC AAAGTAGATT TACAACAAAA TGATGCTCCT GAAACACAAA 3900 

AAGAAGTTAC TGATTAATAC GCAATTTACA AAACAGGCAA AAAGATACTA AGCTGAATTT 3960

TATTGATGAT TCAGTTTAGT ACTTTAAGCC ATTTTAAATA AATGCAAATC AATCAAATAG 4020

CACAGCTAAT CTAAATTGAA GGAGGTAGGC TCCTTGATTG ATGTAAATAA TTTCCATTAT 4080
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AAACCTGAAA CAATCAACTA CCGTACATTA AAACCTGAAA AAGATGGTCT ATTCTGTGAA 4200 

AGAATTTTCG GACCTACAAA AGACTGGGAA TGTAGTTGTG GTAAATACAA ACGTGTTCGC 4260 

TACAAAGGCA TGGTCTGTGA CAGATGTGGA GTTGAAGTAA CTAAATCTAA AGTACGTCGT 4320 

GAAAGAATGG GTCACATTGA ACTTGCTGCT CCAGTTTCTC ACATTTGGTA TTTCAAAGGT 4380 

ATACCAAGTC GTATGGGATT ATTACTTGAC ATGTCACCAA GAGCATTAGA AGAAGTTATT 4440

TACTTTGCTT CTTATGTTGT TGTAGATCCA GGTCCAACTG GTTTAGAAAA GAAAACTTTA 4500 

TTATCTGAAG CTGAATTCAG AGATTATTAT GATAAATACC CAGGTCAATT CGTTGCAAAA 4560 

ATGGGTGCAG AAGGTATTAA AGATTTACTT GAAGAGATTG ATCTTGACGA AGAACTTAAA 4620 

TTGTTACGCG ATGAGTTGGA ATCAGCTACT GGTCAAAGAC TTACTCGTGC AATTAAACGT 4680 

TTAGAAGTTG TTGAATCATT CCGTAATTCA GGTAACAAAC CTTCATGGAT GATTTTAGAT 4740 

GTACTTCCAA TCATCCCACC AGAAATTCGT CCAATGGTTC AATTAGATGG TGGACGATTT 4800

GCAACAAGTG ACTTAAACGA CTTATACCGT CGTGTAATTA ATCGAAATAA TCGTTTGAAA 4860 

CGTTTATTAG ATTTAGGTGC ACCTGGTATC ATCGTTCAAA ACGAAAAACG TATGTTACAA 4920 

GAAGCCGTTG ACGCTTTAAT TGATAATGGT CGTCGTGGTC GTCCAGTTAC TGGCCCAGGT 4980

AACCGTCCAT TAAAATCTTT ATCTCATATG TTAAAAGGTA AACAAGGTCG TTTCCGTCAA 5040 

AACTTACTTG GTAAACGTGT TGACTATTCA GGACGTTCAG TTATTGCAGT AGGTCCAAGC 5100 

TTGAAAATGT ACCAATGTGG TTTAGCAAAA GAAATGGCAC TTGAACTATT TAAACCATTC 5160 

GTAATGAAAG AATTAGTTCA ACGTGAAATT GCAACTAACA TTAAAAATGC GAAGAGTAAA 5220 

ATCGAACGTA TGGATGATGA AGTTTGGGAC GTATTGGAAG AAGTAATTAG AGAACATCCT 5280 

GTATTACTTA ACCGTGCACC AACACTTCAT AGACTTGGTA TTCAAGCATT TGAACCAACT 5340

TTAGTTGAAG GTCGTGCGAT TCGTCTACAT CCACTTGTAA CAACAGCTTA TAACGCTGAC 5400

TTTGACGGTG ACCAAATGGC GGTTCACGTT CCTTTATCAA AAGAGGCACA AGCTGAAGCA 5460 

AGAATGTTGA TGTTAGCAGC ACAAAACATC TTGAACCCTA AAGATGGTAA ACCTGTAGTT 5520 

ACACCATCAC AAGATATGGT ACTTGGTAAC TATTACCTTA CTTTAGAAAG AAAAGATGCA 5580 

GTAAATACAG GCGCAATCTT TAATAATACA AATGAAGTAT TAAAAGCATA TGCAAATGGC 5640

TTTGTACATT TACACACTAG AATTGGTGTA CATGCAAGTT CGTTCAATAA TCCAACATTT 5700 

ACTGAAGAAC AAAACAAAAA GATTCTTGCT ACGTCAGTAG GTAAAATTAT ATTCAATGAA 5760 

ATCATTCCAG ATTCATTTGC TTATATTAAT GAACCTACGC AAGAAAACTT AGAAAGAAAG 5820

ACACCAAACA GATATTTCAT CGATCCTACA ACTTTAGGTG AAGGTGGATT AAAAGAATAC 5880
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GAAGTATTCA ACAGATTTAG CATCACTGAT ACATCAATGA TGTTAGACCG TATGAAAGAC 6000 

TTAGGATTCA AATTCTCATC TAAAGCTGGT ATTACAGTAG GTGTTGCTGA TATCGTAGTA 6060 

TTACCTGATA AGCAACAAAT ACTTGATGAG CATGAAAAAT TAGTCGACAG AATTACAAAA 6120

CAATTCAACC GTGGTTTAAT CACTGAAGAA GAAAGATATA ATGCAGTTGT TGAAATTTGG 6180 

ACAGATGCAA AAGATCAAAT TCAAGGTGAA TTGATGCAAT CACTTGATAA AACTAACCCA 6240 

ATCTTCATGA TGAGTGATTC AGGTGCCCGT GGTAACGCAT CTAACTTTAC ACAGTTAGCA 6300 

GGTATGCGTG GATTGATGGC CGCACCATCT GGTAAGATTA TCGAATTACC AATCACATCT 6360 

TCATTCCGTG AAGGTTTAAC AGTACTTGAA TACTTCATCT CAACTCACGG TGCACGTAAA 6420 

GGTCTTGCCG ATACAGCACT TAAAACAGCT GACTCAGGAT ATCTTACTCG TCGTCTTGTT 6480 

GACGTGGCAC AAGATGTTAT TGTTCGTGAA GAAGACTGTG GTACTGATAG AGGTTTATTA 6540 

GTTTCTGATA TTAAAGAAGG TACAGAAATG ATTGAACCAT TTATCGAACG TATTGAAGGT 6600

CGTTATTCTA AAGAAACAAT TCGTCATCCT GAAACTGATG AAATAATCAT TCGTCCTGAT 6660 

GAATTAATTA CACCTGAAAT TGCTAAGAAA ATTACAGATG CTGGTATTGA ACAAATGTAT 6720 

ATTCGCTCAG CATTTACTTG TAACGCACGA CATGGTGTTT GTGAAAAATG TTACGGTAAA 6780

AACCTTGCTA CTGGTGAAAA AGTTGAAGTT GGTGAAGCAG TTGGTACAAT TGCAGCCCAA 6840 

TCTATCGGTG AACCAGGTAC ACAGCTTACA ATGCGTACAT TCCATACAGG TGGGGTAGCA 6900 

GGTAGCGATA TCACACAAGG TCTTCCTCGT ATTCAAGAC5A TTTTCGAAGC ACGTAACCcT 6960 

AAAGGTCAAG CGGTAATTAC GGAAATCGAA GGTGTCGTAG AAGATATTAA ATTAGCAAAA 7020 

GATAGACAAC AAGAAATTGT TGTTAAAGGT GCTAATGAAA CAAGATCATA CCTTGCTTCA 7080 

GGTACTTCAA GAATTATTGT AGAAATCGGT CAACCAGTTC AACGTGGTGA AGTATTAACT 7140 

GAAGGTTCTA TTGAACCTAA GAATTACTTA TCTGTTGCTG GATTAAACGC GACTGAAAGC 7200 

TACTTATTAA AAGAAGTACA AAAAGTTTAC CGTATGCAAG GTGTAGAAAT CGACGATAAA 7260 

CACGTTGAGG TTATGGTTCG ACAAATGTTA CGTAAAGTTA GAATTATCGA AGCAGGTGAT 7320

ACGAAGTTAT TACCAGGTTC ATTAGTTGAT ATTCATAACT TTACAGATGC AAATAGAGAA 7380

GCATTTAAAC ACCGTAAGCG TCCTGCAACA GCTAAACCAG TATTACTTGG TATTACTAAA 7440

GCATCACTTG AAACAGAAAG TTTCTTATCT GCAGCATCAT TCCAAGAAAC AACAAGAGTT 7500 

CTTACAGATG CAGCAATTAA AGGTAAGCGT GATGACTTAT TAGGTCTTAA AGAAAACGTA 7560 

ATTATTGGTA AGTTAATTCC AGCTGGTACT GGTATGAGAC GTTATAGCGA CGTAAAATAC 7620

GAAAAAACAG CTAAACCAGT TGCAGAAGTT GAATCTCAAA CTGAAGTAAC GGAATAACAA 7680
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15 



ATGTTGACGA ATTCTCTTGT TCAATGTTAA TATATTAAAG GTTGATGCAA GCAGAACTTT 7800

GGAGGATAAA TTATTGTCTA AGGAAAAAGT tGCACGCTTT AACAAACAAC ATTTTGTAGT 7860 

TGGTCTTAAA GAAACGCTTA AAGCGTTAAA GAAAGATCAA GTTACATCTT TGATTATTGC 7920 

TGAAGACGTT GAAGTATATT TAATGACTCG CGTGTTAAGC CAAATCAATC AGAAAAATAT 7980 

ACCTGTATCT TTTTTCAAAA GCAAACATGC TTTGGGTAAA CATGTAGGTA TTAACGTCAA 8040 

TGCGACAATA GTAGCATTGA TTAAATGAGA ATTAGTAAGT GTTTTACTTA CTAAATTTTA 8100 
TTTAACCTAA AAATGAACCA CCTGGATGTG TGGGATTAAA AAGTGAAGAG AGGAGGACAT 8160

ATCACATGCC AACTATTAAC CAATTAGTAC GTAAACCAAG ACAAAGCAAA ATCAAAAAAT 8220 

CAGATTCTCC AGCTTTAAAT AAAGGTTTCA ACAGTAAAAA GAAAAAATTT ACTGACTTAA 8280 

ACTCACCACA AAAACGTGGT GTATGTACTC GTGTAGGTAC AATGACACCT AAAAAACCTA 8340 

ACTCAGCGTT ACGTAAATAT GCACGTGTGc gTtTATCAAA CAACATCGAA ATTAACGCAT 8400

ACATCCCTGG TATCGGACAT AACTTACAAG AACACAGTGT TGTACTTGTA CGTGGTGGAC 8460

GTGTAAAAGA CTTACCAGGT GTGCGTTACC ATATTGTACG TGGAGCACTT GATACTTCAG 8520 

GTGTTGACGG ACGTAGACAA GGTCGTTCAT TATACGGAAC TAAGAAACCT AAAAACTAAG 8580

AATTTAGTTT TTAATTAAAT CTTAAACTTA AAATATTTAA TATAAGGAAG GGAGGATTTA 8640 

CATTATGCCT CGTAAAGGAT CAGTACCTAA AAGAGACGTA TTACCAGATC CAATTCATAA 8700 

CTCTAAGTTA GTAACTAAAT TAATTAACAA AATTATGTTA GATGGTAAAC GTGGAACAGC 8760 

ACAAAGAATT CTTTATTCAG CATTCGACCT AGTTGAACAA CGCAGgtTCG TGATGCATTA 8820

GAAGTATTCG AAGAAGCAAT CAACAACATT ATGCCAGTAT TAGAAGTTAA AGCTCGTCGC 8880

GTAGGTGGTT CTAACTATCA AGTACCAGTA GAAGTTCGTC CAGAGCGTCG TACTACTTTA 8940 

GGTTTACGTT GGTTAGTTAA CTATGCACGT CTTCGTGGTG AAAAAACGAT GGAAGATCGT 9000 

TTAGCTAACG AAATTTTAGA TGCAGCAAAT AATACAGGTG GTGCCGTTAA GAAACGTGAG 9060 

GACACTCACA AAATGGCTGA AGCAAACAAA GCATTTGCTC ACTACCGTTG GTAAGATAAA 9120 

AGCTTTTACC CTGAGTGTGT TCTATATTAA TGAATTTTCA TTAAGCGTTC ATGCTTAGGG 9180 

CATCGCCATA TCTATCGTAT TTATTCAGTA ATATAAACTG GAAGGAGAAA AAATACATGG 9240

CTAGAGAATT TTCATTAGAA AAAACTCGTA ATATCGGTAT CATGGCTCAC ATTGATGCTG 9300 

GTAAAACGAC TACGACTGAA CGTATTCTTT ATTACACTGG CCGTATCCAC AArGknGGTG 9360

AAaCACACGA AGGTGCTTCA CAAATGGACT GGATGGAGCA AGAACAAGAC CGTGGTATTA 9420

CTATCACATC TGCTGCAACA ACAGCAGCTT GGGAAGGTCA CCGTGTAAAC ATTATCGATA 9480 
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CAGTTACAGT ACTTGATGCA CAATCAGGTG 
TTGAACCTCA AACTGAAACA GTTTGGCGTC 9600

AGGCTACAAC TTATGGTGTT CCACGTATCG TATTTGTAAA CAAAATGGAC AAATTAGGTG 9660

CTAACTTCGA ATACTCTGTA AGTACATTAC ATGATCGTTT ACAAgCTAAC GCTGCTCCAA 9720

TCCAATTACC AATTGGTGCG GAAGACGAAT TCGAAGCAAT CATTGACTTA GTTGAAATGA 9780

AATGTTTCAA ATATACAAAT GATTTAGGTA CTGAAATTGA AGAAATTGAA ATTCCTGAAG 9840

ACCACTTAGA TAGAGCTGAA GAAGCTCGTG CTAGCTTAAT CGAAGCAGTT GCAGAAACTA 9900

GCGACGAATT AATGGAAAAA TATCTTGGTG ACGAAGAAAT TTCAGTTTCT GAATTAAAAG 9960

AAGCTATCCG CCAAGCTaCt AcTAACGTAG AATTCTACCC AGTACTTTGT GGTACAGCTT 10020

TCAAAAACAA AGGTGTTCAA TTAATGCTTG ACGCTGTAAT TGATTACTTA CCTTCACCAC 10080

TAGACGTTAA ACCAATTATT GGTCACCGTG CTAGCAACCC TGAAGAAGAA GTAATCGCGA 10140

AAGCAGACGA TTCAGCTGAA TTCGCTGCAT TAGCGTTCAA AGTTATGACT GACCCTTATG 10200

TTGGTAAATT AACATTCTTC CGTGTGTATT CAGGTACAAT GACATCTGGT TCATACGTTA 10260

AGAACTCTAC TAAAGGTAAA CGTGAACGTG TAGGTCGTTT ATTACAAATG CACGCTAACT 10320

CACGTCAAGA AATCGATACT GTATACTCTG GAGATATCGC TGCTGCGGTA GGTCTTAAAG 10380

ATACAGGTAC TGGTGATACT TTATGTGGTG AGAAAAATGA CATTATCTTG GAATCAATGG 10440

AATTCCCAGA GCCAGTTATT CACTTATCAG TAGAGCCAAA ATCTAAAGCT GACCAAGATA 10500

AAATGACTCA AGCTTTAGTT AAATTACAAG AAGAAGACCC AACATTCCAT GCACACACTG 10560

ACGAAGAAAC TGGACAAGTT ATCATCGGTG GTATGGGTGA GCTTCACTTA GACATCTTAG 10620

TAGACCGTAT GAAGAAAGAA TTCAACGTTG AATGTAACGT AGGTGCTCCA ATGGTTTCAT 10680

ATCGTGAAAC ATTCAAATCA TCTGCACAAG TTCAAGGTAA ATTCTCTCGT CAATCTGGTG 10740

GTCGTGGTCA ATACGGTGAT GTTCACATTG AATTCACACC AAACGAAACA GGCGCAGGTT 10800

TCGAATTCGA AAACGCTATC GTTGGTGGTG TAGTTCCTCG TGAATACATT CCATCAGTAG 10860

AAGCTGGTCT TAAAGATGCT ATGGAAAATG GTGTTTTAGC AGGTTATCCT TTAATTGATG 10920

TTAAAGCTAA ATTATATGAT GGTTCATACC ATGATGTCGA TTCATCTGAA ATGGCCTTCA 10980

AAATTGCTGC ATCATTAGCA CTTAAAGAAG CTGCTAAAAA ATGTGATCCT GTAATCTTAG 11040

AACCAATGAT GAAAGTAACT ATTGAAATGC CTGAAGAGTA CATGGGTGAT ATCATGGGTG 11100

ACGTAACATC TCGTCGTGGA CGTGTTGATG GTATGGAACC TCGTGGTAAT GCACAAGTTG 11160

TTAATGCTTA TGTACCACTT TCAGAAATGT TCGGTTATGC AACATCATTA CGTTCAAACA 11220

CTCAAGGTCG CGGTACTTAC ACTATGTACT TCGATCACtA TGCTGAAGTT CCaAAATCaA 11280
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GCCTAGGTTA 
AAATACAAGG TGAGCTTAAA TGTAAGCTAT CATCTTTATA GTTTGATTTT 11400

TTGGGGTGAA TGCATTATAA AAGAATTGTA AAATTCTTTT TGCATCGCTA TAAATAATTT 11460

CTCATGATGG TGAGAAACTA TCATGAGAGA TAAATTTAAA TATTATTTTT AATTAGAATA 11520

GGAGAGATTT TATAATGGCA AAAGAAAAAT TCGATCGTTC TAAAGAACAT GCCAATATCG 11580

GTACTATCGG TCACGTTGAC CATGGTAAAA CAACATTAAC AGCAGCAATC GCTACTGTAT 11640

TAGCAAAAAA TGGTGACTCA GTTGCACAAT CATATGACAT GATTGACAAC GCTCCAGAAG 11700

AAAAAGAACG TGGTATCACA ATCAATACTT CTCACATTGA GTACCAAACT GACAAACGTC 11760

ACTACGCTCA CGTTGACTGC CCAGGACACG CTGACTACGT TAAAAACATC ATCACTGGTG 11820

CTGCTCAAAT GGACGGCGGT ATCTTAGTAG TATCTGCTGC TGACGGTCCA ATGCCACAAA 11880

CTCGTGAACA CATTCTTTTA TCACGTAACG TTGGTGTACC AGCATTAGTA GTATTCTTAA 11940

ACAAAGTTGA CATGGTTGAC GATGAAGAAT TATTAGAATT AGTAGAAATG GAAGTTCGTG 12000

ACTTATTAAG CGAATATGAC TTCCCAGGTG ACGATGTACC TGTAATCGCT GGTTCAGCAT 12060

TAAAAGCTTT AGAAGGCGAT GCTCAATACG AAGAAAAAAT CTTAGAATTA ATGGAAGCTG 12120

TAGATACTTA CATTCCAACT CCAGAACGTG ATTGTGACAA ACCATTCATG ATGCCAGTTG 12180

AGGACGTATT CTCAATCACT GGTCGTGGTA CTGTTGCTAC AGGCCGTGTT GAACGTGGTC 12240

AAATCAAAGT TGGTGAAGAA GTTGAAATCA TCGGTTTACA TGACACATCT AAAACAACTG 12300

TTACAGGTGT TGAAATGTTC CGTAAATTAT TAGACTACGC TGAAGCTGGT GACAACATTG 12360

GTGCATTATT ACGTGGTGTT GCTCGTGAAG ACGTACAACG TGGTCAAGTA TTAGCTGCTC 12420

CTGGTTCAAT TACACCACAT ACTGAATTCA AAGCAGAAGT ATACGTATTA TCAAAAGACG 12480

AAGGTGGACG TCACACTCCA TTCTTCTCAA ACTATCGTCC ACAATTCTAT TTCCGTACTA 12540

CTGAGGTAAC TGGTGTTGTT CACTTACCAG AAGGTACTGA AATGGTAATG CCTGGTGATA 12600

ACGTTGAAAT GACAGTAGAA TTAATCGCTC CAATCGCGAT TGAAGACGGT ACTCGTTTCT 12660

CAATCCGTGA AGGTGGACGT ACTGTAGGAT CAGGCGTTGT TACTGAAATC ATTAAATAAT 12720

TTCTAATTTC TTAGATTTTA TATAAAAAGA AGATCCCTCA ATCGAGGGGt CTTTTTTTAA 12780

TGTGTAAATT TTGTAATGGC TATTCGATTT AGAAGAACAA TAATTGATGA AAGACTGACT 12840

AATAAAACTT ATAACTGATA ATACTGTTTA AATAAAATTG TTGAGTCTTG GACATTGTAA 12900

AATGCTCCCT TCAAAGTTTT CATTTTTTCa ATGTCTACTT TGAAGGGAGC ATTTCATTAG 12960

TTTATGTCTC AGATTCATAT CTTTCAATTA ATTTAAATGC TTAATTTGTT TTAAATACTT 13020

GCTCTAATTC TATGATTTTT AAAAATACAG CTACAGCGTA TTTTAATGAT TTTTCATCAA 13080
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TCAGAAAGAA TGCACCTGGT CGTACTTTCA AATAATGTGA AAAATCTTCT CCAATCATCA 13200 

TTAAATCTGA TTCATTAAAG CGTACATGTA AGTCATTTGT TGCTTCTTTA ATAACTTGAT 13260 

ATGCTTTCTC GTTATTATGG ACAGGCAAAT ACCCTTTAAT ATAATTCAAA TCATAGTTAA 13320

TATCATTTGC TATTGCTAAA CCTTGTAGAA GCTTATCCAT TTTGTCCATT ACATGATTCT 13380 

GTATATCTGA ATCGAAAGTT CTAACTGTAC CTTTACAAAA TGCTTGATCA GGAATAACGC 13440

TATCTGTGGT GCCTGCTTGA ATCATTCCAA ATGAAAGTAC AGCTTGTTTA ACTGGATCGA 13500

TCGTACGTGA AATTATTTTT TGTGCACTTA AAATGAACTC TGCCATGATT ACTATTGGGT 13560 

CAATGGTTTC ATGAGGTTTG GCACCATGAC CACCACGACC TTTAAATGTG ACGCTAAATT 13620 

CATCTGGAGA GGCCATGATT GCCCCCGCAC GTGAATGAAT AGTTCCAGTA GGATAACCAC 13680 

TCCATAAATG TGTACCGTAA ATTCTATCTA CATTTTCCAG ACATCCAGCA TCTATCATTT 13740 

CTTGAGAACC ACCTGGCATG ATTTCTTCAC CGTACTGGAA TATTAATACA ACATTACCTT 13800 

CTAATAAATG TTTATGTTCA TCTAAAATCT CTGCTACAGT AAGTAAAATT GCTGTATGAC 13860 

CATCATGCCC ACACGCATGC ATACATCCTG GATTTTTAGA CTTATAAGGC ACATCGTTTA 13920 

ATTCCTCGAC AGGTAACGCA TCAAAGTCAG CTCTTAATGC AATGGTAGGT CCTGTGCCCA 13980

AGCCTTTAAA TGTGGCTTTG ATACCATTGC GGCCGATAGG AGTTTCAATA TCACAAGATA 14040 

ACTGGCTTAA TTGGTTAACA ATATAATCAT GTGTTTGAAA TTCTTCAAAA GATAACTCAG 14100 

GATATTGGTG TAAATAACGT CTGAGTTGAA TTGTTTTATT TTCTTTATTA TTTGCTAGTT 14160

GGAACCAATC TAACACCCTT ATCACTACTT TCTAAAATAA TGTTTATAGT ATAACATTTT 14220 

ATGAAATTAT CGTACTAAAT GATTGCTTTG AGATATTTTA TCTATGAATG ATAAGGCTTT 14280 

CAAGTTATGT AGAATTACTG TATGATAAAG GTATTACCAA ACAATACTTA AGGGGGATTA 14340

TATACTGTGG TTCAATCATT ACATGAGTTT TTAGAGGAAA ATATAAATTA TCTAAAAGAA 14400 

AATGGTTTGT ATAATGAAAT AGATACAATT GAAGGTGCAA ACGGACCAGA AATCAAAATC 14460 

AATGGGAAAT CATACATTAA CTTATCTTCA AATAATTATT TAGGACTAGC AACAAATGAA 14520 

GATTTGAAAT CaGctGCAAA AGCAGCTATT GATACACATG GTGTAGGTGC AGGCGCTGTT 14580 

CGTACAATCA ATGGTACATT AGATTTACAC GACGAATTAG AAGAAACACT AGCAAAATTT 14640 

AAAGGAACAG AAGCTGCAAT AGCTTATCAA TCAGGATTTA ATTGTAATAT GGCTGCTATT 14700 

TCAGCTGTCA TGAATAAAAA TGATGCTATT TTATCAGATG AGCTTAATCA TGCATCAATT 14760 

ATTGATGGAT GTCGCTTATC TAAAGCTAAA ATTATTCGAG TTAACCATTC AGACATGGAT 14820

GATTTACGTG CGAAAGCAAA AGAAGCAGTT GAATCAGGTC AATACAATAA AGTGATGTAT 14880 
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20 






ATTGCAGAAG AATTTGGTTT ATTAACTTAT GTTGACGACG CTCATGGTTC AGGTGTTATG 15000 

GGTAAAGGCG CTGGTACGGT TAAACATTTT GGTTTACAAG ATAAAATCGA TTTCCAAATA 15060 

GGTACGCTTT CTAAAGCAAT TGGTGTCGTT GGCGGTTATG TAGCAGGTAC AAAAGAGTTA 15120 

ATAGATTGGT TAAAAGCACA ATCACGACCA TTCTTATTCT CTACATCATT AGCACCTGGG 15180 

GATACCAAAG CAATAACTGA AGCAGTTAAA AAGTTAATGG ATTCAACTGA ATTACATGAT 15240 

AAATTATGGA ACAATGCACA ATATTTAAAA AATGGATTGT CAAAATTAGG ATATGATACA 15300 

GGTGAGTCAG AAACTCCAAT TACACCAGTA ATTATTGGTG ATGAAAAAAC AACTCAAGAA 15360 

TTTAGTAAGC GTTTAAAAGA CGAAGGTGTC TATGTGAAAT CTATCGTTTT CCCAACAGTA 15420 

CCAAGAGGTA CAGGACGTGT AAGAAATATG CCTACAGCTG CACATACAAA AGACATGTTA 15480 

GATGAAGCAA TTGCGGCTTA TGAAAAAGTA GGAAAAGAAA TGAAGTTGAT TTAATATTTA 15540 

TTTATTGCCA CGGCAAATAT TGTCGTGGGC TTTTTTTAAT GTTTAGTTTA TTAACAGT 15598 
(2) INFORMATION FOR SEQ ID NO: 83: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 661 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 83:

AAGTAAATCA ACTTACTGGG ATAAGAATAA AGGCGATTAT AGTAACAAGT TGATTTTATT 60 

CGAAAAACAT TTTGAACCGG TTCTGGGTAT CAAGATGCAA CATAGTGGAG GTCATAGCTT 120 

TGGCCACACG ATTATTACGA TTGAAAGTCA AGGAGATAAA GCAGTTCATA TGGGTGATAT 180 

ATTCCCAACT ACTGCACATA AAAATCCTCT ATGGGTAACG GCATATGATG ATTATCCTAT 240

GCAATCGATT CGTGAAAAAG AACGCATGAT ACCATATTTT ATTCAGCAAC AATATTGGTT 300 

CTTGTTTTAT CATGATGAAA ACTACTTTGC TGTAAAATAC AGCGATAATG GTGAAAACAT 360 

AGATGCATAT ATTTTACGTG AAACATTAGT TGATAATAAC TAAAATAAAG ATGTATTACT 420 

AAACAAATTT TCAAAAATAA AAAATTGAGC CACATCCAAT CTTACTAATT AGGGTGTGGC 480 

TCATTTTTAA GTTTTACgAT CCAAATCAAA TATGGaTAAA ATTCgTATTA ACGCTCTACa 540 

ATGtTAATGA CTTCACCAGT ATATGCATCT GCATAAAAAT CATAATGAAT ATTTTGACCA 600 

TTTTTAATAG TTGTAATTCC ACCTTGATAA ACTAAACGGT ATTTATCAGT TTCAGGATGA 660 

A 661 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 84:

GCAGACGGTA CAGCAGTTAA AGTCGCACCA AaACTGTAGT GAATcTAATC GGTGcATTCT 60 

TTTTAGGATT AGTTGTCGCG CTTATATATA TCTTCTTCAA AGTAATTTTC GATAAGCGAA 120 

TTAAAGATGA AGAAGATGTA GAGAAAGAAT TAGGATTGCC TGTATTGGGT TCAATTCAAA 180 

AATTTAATTA AGGATGGTTG CTACTTATGT CAAAAAAGGA AAATACGACA ACAACACTAT 240 

TTGTATATGA AAAACCAAAA TCAACAATTA GTGAAAAGTT TCGAGGTATA CGTTCAAACA 300 

TCATGTTTTC AAAAGCAAAT GGTGAAGTAA AGCGCTTATT GGTTACTTCT GAAAAGCCTG 360

GTGCAGGTAA AAGTACAGTT GTATCSAATG TAGCGATTAC TTATGCACAA GCAGGCTATA 420 

AGACATTAGT TATTGATGGC GATATGCGTA AgcCAACACA AAACTATATT TTTAATGAGC 480

AAAAXAATAA TGGACTATCA AGCTTAATCA TTGGTCGAAC GACTATGTCA GAAGCAATTA 540

CGTCGACAGA AATTGAAAAT TTAGATTTGC TAACAGCTGG CCCTGTACCT CCAAATCCAT 600 

CTGAGTTAAT TGGGTCTGAA AGGTTCAAAG AATTAGTTGA TCTGTTTAAT AAACGTTACG 660 

ACATTATTAT TGTCGATACA CCGCCAGTTA ATACTGTGAC TGATGCACAA CTATATGCGC 720

GTGCTATTAA AGATAGTCTG TTAGTAATTG ATAGTGAAAA AAATGATAAr AATGAAGTTA 780 

AAAAAGCAAA AGCACTTATG GAAAAAGCAG GCAGTAACAT TCTAGGTGTC ATTTTGAACA 840

AGACAAAGGT CGATAAATCT TCTAGTTATT ATCACTATTA TGGAGATGAA TAAGTATGAT 900 

TGATATTCAT AACCATATAT TGCCTAATAT CGATGACGGT CCGACAAATG AAACAGAGAT 960 

GATGGATCTT TTAAAACAAG CGACAACACA AGGTGTTACA GAAATCATTG TAACATCACA 1020 

TCACTTACAT CCTCGATATA CCACACCTAT AGAAAAAGTG AAATCATGTT TAAACCATAT 1080

TGAAAGCTTA GAGGAAGTAC AAGCACTAAA TCTAAAGTTT TATTATGGTC AGGAAATAAG 1140 

AATTACCGAT CAAATCCTTA ATGATATTGA TCGAAAAGTT ATTAACGGTA TTAATGATTC 1200

ACGCTATTTA CTAATAGAAT TTCCATCAAA TGAAGTTCCA CACTATACTG ATCAATTATt 1260 

TTTCGAATtA CAGAGTAAAG GCTTTGTACC GATTATTGCA CATCCAGAGC GGAATAAAGC 1320 

AATAAGTCAA AACCTTGACA TACTATACGA TTTAATTAAC AAAGGTGCTT TAAGTCAAGT 1380

GACAACGGcG TCATTAGCGG GTATTTCCGG TAAAAAAATT AGAAAATTAG CAATTCAAAT 1440
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GTTCTTAATG AAAGACTTAT TTAATGATAA 


GAAATTACGT GATTATTATG AAGATATGAA 1560

CGGATTTATT AGTAATGCGA AGTTAGTTGT TGATGATAAA AAAATTCCTA AACGAATGCC 1620

ACAACAAGAT TATAAACAGA AAAGATGGTT TGGGTTATAA ACAGCAAATG AGGGGTTTTA 1680

TGGCACATTT ATCTGTGAAA TTGCGGCTTT TAATACTAGC ATTAATCGAT TCACTGATAG 1740

TGACATTTTC AGTATTCGTA AGTTATTACA TTTTAGAACC GTATTTCAAA ACATATTCTG 1800

TCAAATTATT AATATTGGCA GCTATATCAC TATTCATATC GCATCATATT TCaGCATTTA 1860

TTTTTAATAT GTATCATCGA GCGTGGGAAT ATGCCAGTGT GAGTGAATTG ATTTTAATTG 1920

TTAAAGCTGT GACGACATCT ATCGTTATTA CGATGGTGGT CGTGACAATT GTTACAGGCA 1980

ATAGACCGTT TTTTAGATTG TATTTAATTA CTTGGATGAT GCACTTGATT TTAATAGGTG 2040

GCTCAAGGTT ATTTTGGCGT ATTTATCGGA AATACCTTGG AGGTAAGTCA TTTAATAAGA 2100

AGCCAACTTT AGTTGTTGGT GCTGGTCAAG CAGGTTCAAT GCTGATTAGA CAAATGTTGA 2160

AAAGTGACGA AATGAAACTT GAACCGGTAT TAGCAGTCGA TGATGACGAA CATAAACGCA 2220

ATATCACAAT TACTGAGGGT GTAAAAGTCC AAGGTAAAAT TGCGGATATT CCAGAACTAG 2280

TGAGGAAATA TAAGATTAAA AAAATCATCA TTGCAATTCC AACTATTGGT CAAGAGCGTT 2340

TGAAAGAAAT TAATAATATT TGCCATATGG ATGGCGTTGA GTTATTGAAA ATGCCAAATA 2400

TAGAAGACGT CATGTCTGGT GAGTTAGAAG TGAACCAACT TAAAAAAGTT GAAGTAGAAG 2460

ATTTACTAGG CAGAGATCCT GTTGAATTAG ATATGGATAT GATATCAAAT GAATTGACGA 2520

ATAAAACTAT TTTAGTTACG GGTGCAGGTG GTTCAATAGG ATCAGAAATT TGTAGACAAG 2580

TTTGTAATTT CTATCCAGAA CGTATTATTC TACTTGGCCA TGGTGAAAAC AGTATTTATT 2640

TAATCAATCG TGAATTGCGA AATCGCTTCG GwAAAAATGT TGATATCGTT CCTATTATAG 2700

CGGATGTGCA AAATAGAGCG CGTATGTTTG AAATTATGGA AACGTATAAA CCATACGCAG 2760

TTTATCATGC AGCAGCACAC AAGCACGTGC CGTTAATGGA AGACAACCCT GAAGAAGCAG 2820

TACGTAATAA TATTTTAGGT ACGAAAAATA CTGCTGAAGC TGCTAAAAAT GCAGAGGTAA 2880

AGAAATTCGT TATGATTTCT ACGGATAAAG CCGTTAATCC GCCTAATGTC ATGGGAGCTT 2940

CAAAGCGAAT TGCAGAAATG ATTATTCAAA GTTTAAATGA TGAAACGCAT CGAACAAATT 3000

TTGTTGCAGT GAGATTTGGT AATGTACTTG GATCGAGAGG ATCTGTGATT CCACTTTTCA 3060

AAAGTCAAAT TGAAGAAGGT GGGCCAGTTA CTGTGACACA TCCTGAAATG ACACGTTACT 3120

TTATGACAAT TCCTGAAGCT TCTAGACTAG TTTTGCAGGC AGGGGCATTA GCAGAAGGTG 3180

GCGAAGTATT TGTGCTAGAT ATGGGAGAAC CAGTGAAAAT TGTAGATTTG GCACGTAATT 3240
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CCGGCGAAAA AATGTTTGAA GAGCTTATGA 
ATAAAGATGA GGTTCATCCT GAACAAGTAT 3360

TTGAAAAAAT TTATCGTGGC AAAGTACAAC ATATGAAATG TAATGAAGTT GAAGCGATTA 3420

TTCAAGACAT CGTCAATGAC TTTAGTAAAG AAAAAATTAT TAACTATGCC AATGGCAAAA 3480

AGGGAGATAA TTATGTTCGA TGACAAAATT TTATTAATTA CTGGGGGCAC AGGATCATTC 3540

GGTAATGCTG TTATGAAACA GTTTTTAGAT TCTAATATTA AAGAAATTCG TATTTTTTCA 3600

CGCGATGAGA AAAAACAAGA TGACATTCGA AAAAAATATA ATAATTCAAA ATTAAAGTTC 3660

TACATTGGTG ATGTGCGTGA TAGTCAAAGT GTAGAAACAG CAATGCGAGA TGTTGATTAC 3720

GTATTCCATG CAGCAGCTTT AAAACAAGTG CCGTCATGTG AATTCTTTCC AGTTGAGGCA 3780

GTGAAGACAA ATATTATTGG TACAGAAAAT GTCTTACAAA GTGCTATTCA TCAAAATGTT 3840

AAAAAAGTCA TATGTTTATC TACAGATAAG GCAGCGTATC CTATTAATGC TAGGGGTATT 3900

TCAAAAGCAA TGATGGAAAA AGTATTCGTA GCCAAATCAA GAAATATTCG TAGTGAACAA 3960

ACGCTTATTT GTGGTACAAG ATACGGTAAT GTGATGGCTT CAAGAGGATC AGTAATACCT 4020

TTGTTTATCG ACAAAATCAA AGCTGGAGAA CCTTTAACGA TTACAGATCC TGATATGACA 4080

AGATTTTTAA TGAGCTTAGA AGATGCGGTA GAACTAGTTG TTCATGCATT TAAGCATGCA 4140

GAGACAGGAG ATATTATGGT TCAAAAAGCA CCAAGCTCAA CGGTAGGGGA TCTTGCGACC 4200

GCATTATTAG AATTGTTTGA AGCTGATAAT GCAATTGAAA TCATTGGTAC GCGACATGGA 4260

GAGAAAAAAG CAGAAACATT GTTGACGAGA GAAGAATACG CACAATGTGA AGATATGGGT 4320

GATTATTTTA GAGTGCCGGC AGACTCCAGA GATTTAAATT ATAGTAATTA TGTTGAAACC 4380

GGTAACGAAA AGATTACGCA ATCTTATGAA TATAACTCCG ATAATACACA TATTTTAACG 4440

GTGGAAGAGA TAAAAGAAAA ACTTTTAACA CTAGAATATG TTAGAAACGA ATTGAATGAT 4500

TATAAAGCTT CAATGAGATA GGAGAGATTG ACGTTGAATA TTGTAATTAC AGGAGCAAAA 4560

GGTTTTGTAG GAAAAAACTT GAAAGCAGAT TTAACTTCAA CGACAGATCA TCATATTTTC 4620

GAAGTACATC GACAAACTAA AGAGGAAGAA TTAGAGTCAG CATTGTTGAA AGCAGACTTT 4680

GTCGTGCATT TAGCGGGTGT TAATCGACCT GAACATGACA AAGAATTCAG CTTAGGAAAC 4740

GTGAGTTATT TAGATCATGT ACTTGATATA TTAACTAGAA ATACGAAAAA GCCAGCGATA 4800

TTATTATCGT CTTCAATACA AGCAACACAA GATAATCCTT ATGGTGAGAG TAAGTTGCAA 4860

GGGGAACAGC TATTAAGAGA GTATGCCGAA GAGTATGGCA ATACGGTTTA TATTTATCGC 4920

TGGCCAAATT TATTCGGCAA GTGGTGTAAG CCGAATTATA ACTCAGTGAT AGCAACATTT 4980

TGTTACAAAA TTGCACGTAA CGAAGAGATT CAAGTTAATG ATCGGAATGT TGAACTAACG 5040
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ATTGAAAATG GTGfTACCTAC AGTACCAAAC GTATTTAAAG TGACATTGGG AGAAATTGTA 5160 

GATTTATTAT ACAAGTTCAA ACAGTCACGT CTCGATCGAA CATTGCCGAA ATTAGATAAC 5220 

TTGTTTGAAA AAGATTTGTA TAGTACGTAT TTAAGCTATC TACCTAGTAC aGACTTTAGT 5280 

TAyCCCTTAC TTATGAATGT GGATGATAGG GGTTCTTTTA CAGAATTTAT AAAAACACCG 5340 

GATCGTGGTC AAGTTTCTGT AAATATTTCT AAACCAGGTA TTACTAAAGG TAATCACTGG 5400 

CATCATACTA AAAACGAAAA ATTTCTAGTC GTATCAGGTA AAGGGGTAAT TCGTTTTAGA 5460 

CATGTTAATG ATGATGAAAT CATTGAATAT TATGTTTCTG GCGACAAATT AGAAGTTGTA 5520 

GACATACCAG TAGGATACAC ACATAATATT GAAAATTTAG GCGACACAGA TATGGTAACT 5580

ATTATGTGGG TGAATGAAAT GTTTGATCCA AATCAGCCAG ATACGTATTT CTTGGAGGTA 5640 

TAGCGCATGG aAAAACTGAA rTTAATGACA ATAGTTGGTA CAAGGCCTGA AATCATTCGT 5700 

TTATCATCAA CGATTAAAGC ATGTGATCAA TATtTTAA 5738

(2) INFORMATION FOR SEQ ID NO: 85: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9062 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 85: 
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ATCATCAACA 


AGAATGATAT TTTTCCCATC TACTATATCT TTTACCGCAG ATAACTTCAC 60

TCTCACACCT TGCTCACGTA ATTCTTGAGT TGGTTGAATA AATGTTCTTG CAACATATTG 120

ATTTTTAACT AGTCCCATTT CATATGGCAA ACCTATTTCT TCAGCATAAC CACTCGCAGC 180

TGATAGCGAT gAATTGGGTA CACCGATGAC CATATCAGCA TTTACAGGGC TTTCTTGGGC 240

TAATTTTTTA CCAGAAGCTT TACGTACTGC ATGGACATTT TTACCAGCTA TTGTTGAGTC 300

TGGTCTAGCA AAATAAATAT ATTCCATCGC AGAAATTGCA GTTGTCGTAT GATGTGTATA 360

AGATTTAACT GTAATACCTT TATCGTTAAT CACGACATAT TCACCTGCAT GAATATCTTG 420

AACAAATTCT GCACCTAACA CATCTATTGC ACATGTTTCA CTTGCAAGGA TGTATGTCCC 480

ATCTTTCATT TTACCTACAA CAAGTGGTCT GATAGCATTT GGATCTACTG CGCCATATAA 540

CGCATCTTTA GTTAAAATCG CAAATGTAAA ACCGCCTTTA ACTTTTCGCA AACTTTCTTT 600

CAACGCTTCC TCAAAAGTAG GAGCTTTACT TCGACGTATC AAATGCATAA TGACTTCAGT 660

ATCAGAAGAC GAATGGAAGA TAGCACCTTG TTTTTCTAAA TTCTGACGCA ATGATTTAGC 720
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CGGTTGAATA 


TTTTCAATAC CTTTATTACC TGAAGTAGCA TAACGGACGT GACCAATTGC 840

ATGTTGATAT CCTTTTAATC GTTCCATTTG ATCATCTTTA ATCGCTTCAG TTAGTAAGCC 900

TAATCCTCGC TCGCCTTTTA ATTCATTTTG ATCAGAAACA ACTATACCTG cACCTTCTTG 960

ACCACGATGT TGCAAACTAT GAAGTCCCAT ATAtGTTAGT TGCGCTGCtT CaGGATGATT 1020

CCAAATACCA AACACGCCAC ATTCTTCGTT TAATCCTGAG TAGTTAAACA TTGaGCAATT 1080

GCCCCtTCCC ATATTTGTTT AATATCTGAA ACATTTTCAC TAATCTCTGT aTATGGTGTT 1140

GTTACCTTGr aATTATCACT ATCTGTTAAA AGTCCAATTT CTATTGCATT ATCAATATTT 1200

AAAGTTTTAC CTGATTTAAC AGAAACAACA TATCGGCCTT GCGTCTCACT AAACAATTGT 1260

GCATTTGTTA TATCTATTGA AGATTTTAAT CCTAAACCGT AATGCGCACT TAGTTTAGCT 1320

AAGGTAATCA GTAAGCCACC TTTACCAACT GTTTGAACAT GTGATAATAG TCCTTCACGA 1380

ATAGCGGTCT TGATTGATTC ACCTTTTTCA ACTTCTGAAC TCAAATCTAA TGACTCAAAT 1440

TCATGATTAA CTTTGCCATA AATTAACTTT TCAAGTTGAC TACCACCAAA GTCGTCCTTA 1500

GTATCACCGA TTAAATATAA TTTATCTCCA ACTTGAGGTT CAAAATCATT TAAATAATTT 1560

ACATTTTCAA TCAAACCTAC CATTCCAACA ACTGGTGTTG GGAAAATAGA AGTACCTTTC 1620

GTTTCGTTAT ATAAAGATAC ATTACCAGAA ACTACTGGTG TCTTAAGAAT GTCGCATGCT 1680

TCTGCCATAC CTTTCGTTGA ATCTATCAAC TGTTGATAGA TTTCTTTCTT TTCAGGAGAA 1740

CCATAATTTA AACAATCTGT CATTGCTAAT GGTGTTGCAC CCACGGCAAT TAAATTTCGA 1800

TAAGCTTCAG CTACTACCAT CTTTCCACCT TCATATGGAT TGTTATATAC ATAACGCGCT 1860

TCACCATCAA TTGTTGAAGC AATTGCCTTA TTTGTGCCTT CCACACGTAC TACCGATGCT 1920

TGAAGTCCTG GCTTAATTAT CGTATTGGCA CCAACTTGTT GGTCGTATTG ATCATATAAA 1980

TAGTGTTTAG ATGCTATAGT CGGATGCTTA AGTAATTTAA AGAAAGTATC TTTAACATCG 2040

ATGTGTGTAT AATCATTTTT AGAAGTATTA TAATCTTTTT CTTCTCCTTC TAAAATATAT 2100

ACAGGTGCTT CATCAGCTAG TGGTTCAACT GGAATGTCAG CATAAACTTC GTCATCATAT 2160

GTTAAAACAA AACGATTTGT ATCTGTAACT TCACCTATAA CAGCACTATC CAATTCGTGC 2220

TTATCAAATA AATCTAAGAA TTTTTGTTCA gtaccttttt CAACAACTAG TAACATACGT 2280

TCTTGAGTTT CTGAAAGCAT CATTTCATAA GGAGAAATAC CTGGCTCACG TGTTGGCACT 2340

TGTTCTAATC TCAAATGTAA CCCACTACCA CCTTTTGCCG CCATTTCAGA CGATGAAGAT 2400

GTTAAACCAG CAGCACCCAT ATCTTGAATA CCAACTAATT CATCAAATGT AATTGCTTCA 2460

AGTGTTGCTT CCATTAATTT TTTACCTACA AATGGATCAC CGATTTGTAC AGAAGGTCGT 2520
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CGACCAGTTT TCAAACCAAC ATAAATGACC GAATTACCTA CACCTTTTGC TGTGCCTTTT 264 0 

TGAATCATGT CGTGATTGaT AACACCAACA CACATTGCAT TAACAAGTGG ATTGCCATCA 2700 

TAACGTTCAT CAAATTCGAT TTCACCAGCA GTTGTTGGaA TACCAATGCA GTTACCATAA 2760

CCTCCGATAC CCTTTACAAC ACCTTTAAGT AATCTTTGGT TTTGTTTATT ATCTAATTCT 2820

CCAAATCTAA GACTGTTTAA CAAATTAATA GGTCTAGCCC CAATAGAGAC AATGTCACGA 2880

ATGATTCCAC CAACGCCTGT AGCAGCCCCT TGATATGGTT CAATTGCTGA TGGATGATTG 2940

TGAGACTCTA CTTTAAATAC TACGGCTTGA TTATCACCTA TATCGACTAC CCCTGCACCT 3000 

TCACCAGGCC CCATAAGCAC ATGGTcACCT GACGTAGGAA ATTGCTTTAA AAACGGTTTA 3060 

GAATGTTTAT AAGAGCAATG TTCACTCCAC ATAACAGAAA AGATACCTGT TTCTGTAAAG 3120 

TTAGGTTGTC TGCCTAAAAT ATCGCAAACT TTTTCATATT CTTGATCaCT TAATCCCATA 3180 

TCTTGATATA CTTTTTCAAG TTTAATTTCT TCAACGCTTG GTTCGATAAA TTTAGACATG 3240

TTGTTCCCTC CAACTTTTTA CCATCGCTTC AAATAATTTC ACACCACTAT CAGTACCTAA 3300 

CAACGTTTCT AAAGCTCTTT CagGATGtGG CATCATGCCA CATACATTGC CTTTTTCGTT 3360 

AACAATTCCT GCAATATCAT CATATGAACC GTTCGGATTA TTCACATATT TCAGAATAAT 3420

TTGATTGTTA GCTTTTAATT GTTGATATAT TTCATCAGTA CAATAATAAT GACCTTCACC 3480

GTGAGCTACA GGATATATAA CTTTTTCACC TTGTTCATAA AGATTTGTAA ATGCCGTTTG 3540

ATTATTCACT ATTTCTAACT CTTCATTTCT ACTAATAAAT AAATGTGAAT CGTTATGCAA 3600

TAATGCACCA GGTAATAAGC CTATTTCAGT TAAAATTTGA AACCCATTAC AAACACCTAA 3660 

TACTGGCTTA CCTTCAGCTG CAAGACGTTT AACTTCCGAA ATAATCGGsG CTACACTAGC 3720 

CATTGCCCCA GATCTTAAGT AATCCCCGAA TGAAAATCCA CCAGGAATAA GTACGCCATC 3780 

AAAT€CACTT AGTGATGTTT CTCTATAATC TACATATTCC GCTTCAACAC CACTTTTAAT 3840 

AGCAGCATTA AACATGTCTC TATCACAATT CGAACCTGGA AAAACAAGAA CCGCAAATTT 3900 

CATTTTATGC ATTCTCCTTT TCATCATCTA ACACTTTATA GCTATATTCT TCAATCACTG 3960

TATTTGCAAA CAATTTTTCA CTTAGAGTTG TAATAATGTT GTGTACCTTT TCATCACTAA 4020

CCTCATCCAC TGTCATATAT AATACTTTTC CTACACGAAT ATCATTCACT TGTGCATAAC 4080

CTAAGTCATG TACAGCTCGA GTAAGCGTTT GTCCTTGCGT ATCTAATACT TGTGGTTGTA 4140

ATGTGATATG TAGTTCAATT GTTTTCATTA TTTTAAATCC TCCAATTTGT TTAAAAATAT 4200 

TTGATATGTT TCAATCAGTG ATCCAGTGTT ATTTCTATAT ACATCTTTAT CAAAGTTTGC 4260

ATTGGTAGCT TTATCCCAAA TTCGACATGT ATCTGGAGAT ATTTCATCCG CTAAGAAAAT 4320 
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ATCCATTAAT 


TGTTTCAACA 


CATTATTAAT 


CTTTAATGCT 


TTGGATTTTA 


GTATTTCAAT 


4440 




ATCTTCATCT GATGCTATAT TGAGCAATTT AACATGGTCA TCCGTTATCA ACGGATCATT 4500

TAACGCATCA TTTTTATAGA AAAATTCTAC AAGTGGTTCT CTAAAAACTT CACCATTTTC 4560

AAAACCTAAA CGCTTTGTAA TAGATCCACT AGCAATATTA CGAACAACTA CTTCTAATGG 4620

AATTATTTTC ACAGGCTTAA CTAATTGTTC TGTTTCAGAT AATTGTTTAA TAAAGTGACT 4680

TTCTATTCCA TTTTCTTGTA AATATTTAAA TATAATAGAA GTAATTTGAT TATTTAATCG 4740

CCCCTTACCT GCCATTGTGT CTTTCTTAGC CCCGTTTCCA GCAGTAACTT CATCTTTATA 4800

TTCAACTCTT AATTCATTTT CTTGATTTGT TGAGAAAATG CGcTTCGCTT TTCCTTCATA 4860

TAATAATGTC ATGCTTTAAT TACTCCCCTC AAATTTAGCG TACATATCTT GTTCAGTTTG 4920

GTTTACATCA TTCGTTAGTA CAGTCATATG CCCCATTTTT CTGCTATCTT TACGCTCAGA 4980

CTTACCATAA ATATGTAAGT GCCACTCTGG ATGTTCATTA AATTCATTTT CCAATAAATC 5040

TAAATCTTTA CCTAGTAAGT TCATCATGAC TGCTGGCTTT AATAATTCAA TTGAATTTGG 5100

TAATGATTGT CCGGTAACTG CTAAAATATG AGTATCAAAT TGTGAATAAT CACATGCTTC 5160

AATTGAATAA TGTCCGGAAT TGTGAGGCCT TGGTGCTATC TCGTTCACAT ACAATTGGTT 5220

GTTACTATCT ATAAAAAATT CAACTGTAAA TGTTCCAATG AAATGAATCG ATTGGATAAT 5280

TTTATTAACT TGCTCTTTCG CCTCAGCTGT TTTATCTATT CTCGCTGGAA CAATTGTTTT 5340

GAAAAGTATT TGATTTCTAT GCTCATTTTC TTGTAATGGG AAAAAAGTGA TTTGATTGTT 5400

GTTTCCTCTT GTAACAGTAA GAGATACTTC TTTCTTGATA TTCAAATATT TTTCAGCTAC 5460

GCATTCACTA GTTTCAATTA ATTTAAAACC TTCTTGTAAG TCTTTTTCGT TGTTAATTAA 5520

AACTTGACCT TTGCCATCGT AGCCACCAAA TCTAGTTTTT ACAATAAAAG GATATCCTAA 5580

TGTl^CAATT GCTTTGTCAA TATCTGTAGA TTCTTTTACT GAAATGAACG GGACAACTTT 5640

GGTACCAGCA CTTTTTAATG TTTCTTTTTC AGTTAAGCGA TCTTGTAATA ACTGTATAGC 5700

TTGGTAACCT TGCGGAATAT TGTACTTTTC ACATAATAGT TTTAATTGTT GGGCTGAAAT 5760

GTTTTCAAAT TCATAAGTAA TCACATCACA TTTTTGTCCT AATTGATTGA GTGCCTTTTC 5820

ATCGTCATAC TTGGCTTGTA TAAATTCGTG TGCAACGTAT CTACATGGAC AATCTTCAGA 5880

AGGATCCAAT ACAACCACTT TATAACCCAT TTTTTGAGCT GATTGTGCCA TCATCTTTCC 5940

AAGCTGACCA CCACCAATAA TGCCAATAGT CGCACCAAAC TTTAATTTAT TGAAGTTCAT 6000

TTTGCATGTC CTCCACTTTT TGAATTAACG AAGATTGATA CTGATTTAGT TTTTCAACTA 6060

AAGAAGGATT TTGAATACTT AACATTCTTG CTGCAAGTAT ACCTGCGTTT TTAGCACCTG 6120
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AAGAATCTAT ACCCTTTAAA CTTTTTGTTT CAATCGGCAC TCCAATAACT GGTAGCGTCG 624 0 

TTAATGATGC AACCATACCT GGTAAATGTG CCGCACCGCC AGCGCCTGCA ATGATAATGT 6300 

TTATACCTCT TTCTCTCGCT TCAGAAGCAA ATTGAACCAT CATTTTTGGC GTACGATGTG 6360

CGGATACTAC TTGTTTTTCG TACGGAATTT CAAAATAATC CAACATGTTA CAACTCTCTT 6420 

GCATAATTTT CCAATCGGAA GAACTGCCCA TAATGACTGC TACTTTCACT TTGTACACCC 6480 

TTTCAAAAGT TTGAATTGTG AATTACTTTA GTTGTATATT ATAGATATAG CATAACAAGC 6540

AATTTCTGCT TTTTCAATCA AAAATCGAAC TTTATTTTGA TTTTTTATTT GAATTTACGT 6600 

CTTTTGCTAT GTAAATTAGT TTTATAAACT AACAAAGTTA GGATATTGAC AATAGGAGGA 6660 

GAAGTTTTTA TGGTTGCTAA AATTTTAGAT GGTAAACAAA TTGCCAAAGA CTACAGACAG 6720

GGGTTACAAG ATCAAGTTGA AGCGCTAAAA GAAAAGGGTT TTACACCTAA ATTATCCGTT 6780

ATATTAGTTG GTAATGATGG CGCTAGTCAA AGTTATGTTA GATCAAAAAA GAAAGCAGCT 6840

GAAAAAATTG GTATGATTTc AGAAATCGTA CATTTGGAAG AAACAGCTAC TGAAGAAGAA 6900

GTATTAAACG AACTAAATAG ACTAAATAAT GATGATTCTG TAAGTGGTAT TTTGGTACAA 6960

GTACCATTAC CAAAACAAGT TAGCGAACAG AAAATATTAG AAGCAATCAA TCCTGAAAAA 7020

GATGTGGACG GTTTTCATCC AATAAATATA GGGAAATTAT ATATCGATGA ACAAACTTTT 7080 

GTACCTTGCA CACCGCTCGG CATCATGGAA ATATTAAAAC ATGCTGATAT TGATTTAGAA 7140 

GGTAAAAATG CAGTTGTAAT TGGACGAAGT CATATTGTCG GACAACCAGT TTCTAAGTTA 7200

CTACTTCAAA AAAATGCATC AGTAACAATC TTACATTCTC GTTCAAAAGA TATGGCATCA 7260 

TATTTAAAAG ATGCTGATGT CATTGTCAGT GCAGTTGGTA AGCCTGGTTT AGTAACAAAA 7320 

GATGTGGTCA AAGAAGGAGC AGTAATTATC GATGTTGGCA ATACGCCAGA TGAAAATGGC 7380

AAATTAAAAG GTGACGTTGA TTATGATGCG GTTAAAGAAA TTGCTGGAGC TATTACACCA 7440

GTTCCTGGTG GCGTTGGTCC ATTAACAATT ACTATGGTAT TAAATAATAC TTTGCTTGCA 7500 

GAAAAAATGC GTCGAGGTAT TGATTCGTAA AGAGCCTGAG ACATAAATCA ATGTTCTATG 7560 

CTCTACAAAG TTATAATGGC AGTAGTTGAC TGAACGAAAA TTCGCTTGTA ACAAGCTTTT 7620 

TTCAATTCTA GTCAACCTTG CCGGGGTGGG ACGACGAAAT AAATTTTACG AAAATATCAT 7680

TTCTGTCCCA CTCCCTAATA ACTGAGTTTT AATGAAGTCT TTTAACCCAC ATTAAATATT 7740 

ATTTTGCAAT TGCAATGAAT AACAAGAAAA ATCTGGGACA TTAATCGATC AAATGCTCCC 7800 

TTCAAAGTAG ACATTGAATA AATGAAGGCT TTGAAGGGAG CATTTCACTT TGTACTTGGC 7860

TCAACAATTT TATATAGACA GTAGTTAATT GAATGAAAAT AAGCTTGTAA CAAGTTTTCA 7920 
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15 



GTTGGGGATG GGCCCCAACA CAGAAGCTGT GACTATGATA AAGTACTACT ACATAGTTAA 8040 

TCATTAGTGG TTCTTTATCA TTTTCGCCTC CCTTTTCTTA TTGTTTTGAT ACACAAAAAT 8100

TTAAGTTCAA ACTGTCGAAT AAAGTTATAT TTGATTTCAA ATTATCCCTA AATTATTAAT 8160 

TkTACAATTG TGGCAGATTT TCAAAATAAT AATTATTTCC TCATTATTTA TAAATTTATA 8220 

TTTAAATTTC ATTCTTTATA GGGTAAGATT AGGACTATAG TATGATGTGT ArATAATATA 8280 

AATTAAGGTA TAGTAAAGCT AACTCAGAAA TGACTTATCA TTCGGAGGTT ACATTATGAA 8340 

TAAACTATTA CAGTCATTAT CAGCCCTCGG TGTTTCTGCT ACACTAGTAA CACCAAATTT 8400

AAATGCAGAT GCAACGACGA ATACTACACC ACAAATTAAA GGCGCTAATG ATATCGTTAT 8460 

TAAGAAAGGT CAAGATTATA ACCTTCTAAA CGGCATAAGT GCATTTGATA AAGAAGATGG 8520 

AGATTTAACC GATAAAATTA AAGTCGATGG CCAAATTGAT ACATCTAAAT CTGGTAAATA 8580 

TCAAATTAAA TATCATGTCA CTGATTCAGA TGGTGCAATT AAAATTTCCA CTAGGTATAT 8640

TGAGGTTAAA TAGCCCTCAT CACTATACTG CAAATAAAAT GGTAGCAAAC GAACATGTTT 8700 

TGCTACCATT TTATTTGTTA TTCTAACTTC ATCTGCAACT TTAACCCAAA TATTGTATTT 8760 

TTTCTGTATA CCAAAGGACT ACCTATCAAA TTATTAAAAC TTAACTGCTC TTTTTAAAAA 8820

AATGTTTTGA TTTTGAACAA ACAAATTTCC ACTTTTCATT GTTTAACGAT AAATTACTTT 8880 

TGGCAAATTC CTTATTAAAA TGTTTGCGCT TCCTTTCAAT CAACTAGCCA TCATTTTCAA 8940 

TTTATTAGAC AATTTCAAAC TTTTTTTATT TTCATTCAAT TAACCTTTAA TTGAAAGCTA 9000

TTCTCAACTT TCCTTTTAAA TATGAAGCAA TTTTTTCAAA AACGCTATTA GTCACAAAAT 9060 

GT 9062 
(2) INFORMATION FOR SEQ ID NO: 86: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2738 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 86:

AAATATTTTT TCAAAACTAT GTGAAAATGG aCCATGTCtA aATCATGTAA TAATGCAGyA 60 

CATAATGCCA ACGGTCTmTC TTTATTGTCC CATGCATCAT GACCAATAAA TGACTCATCA 120 

ATTAATCGTC TAACTATTTC ATACACACCT AAAGAATGTC CAAAGCGACT ATGTTCTGCT 180

GTGTGAAAAG ATAGGTACAG TGTTCCTAGT TGTCTAATTC GACGTAACCT TTGGAATTCC 240 
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TCTTTAAAAA CTTTTTCTTC TACTAATTTT AAATCTACAT ATGCGTTAGT CATTATTCCC 360 

CTCCTTTTCG TTTAATATAA TATTTAATTT ACTTAAAATG CTTTGTACAT AAGTGCTAAG 420 

TCTAACTTTT CGCCATACAT TTCTGGCTCA TAAGAGCGTA AGATTGTAAA ACCTTGCTCT 480

TTATAGTAAG CTACTGCTTC TTCATTTTTA TTATCTACTT CTAAGTAAAC ACCTTCAAAT 540

TTATCTTCAA AACGTGATAA TCCTTCATTT AACAATGCTG TACCATAACC TGTATGTTGC 600

GATTCTGGTT TAACATAATG AGCTGATAAA TATAATTCTT CACCGTAAAT AAAGTTAGCA 660

AAGCCAACGA TGTCATTACC TTCTTCAACG ACTAAGAATA ATTGTTCTTG AAGTCTTTTC 720 

TTTAAATGAT GTTCATTATA TGAAGCTtCT AACAAGTGAT TAACTGTTGT CGCAGCGTAT 780 

ATATTTAAGT ATGTATTAAA CCAAGCTTTA GTTGCGACAT CTCTAATTTG AACAACATCT 840

TTTTCAGTTG CTTGTCTTAC CTTGAACATG ACTTTCTCCC CTTATTAACA AGTTTTAATA 900 

ACGGCATTAT ACCACAACTT GCTCAATACT TAATAAACAA TGATTGTCTA TTCAATTTAT 960

ATATtTATAT TTTCCGTTAA AATTAAAAAT AAAAAATAAC GAAGCAAAAA AtCACTTCGT 1020 

TTAGTATGAG GTATGTCTTA TTGCAATATA CTATTCCACT CAGTTGCACG TGCTAAGGCA 1080

TAGTTGTCTT TCATGATGTC ACCAGGCTTT TCAGCAGTTC CAATAATATA ACCATTTAAA 1140

GTGGCACCTA rAAAGTCTAA ACTATATTTC ATTTGCGTAA TTGCTGGTTC GCTTTTATTT 1200

TTGGACAATC TCCACCAACT AAAATAACTC TAAAATCCTT TTCGGCCATT TGTGCCTTAA 1260

AATTAGGATA TCGTTTATCT TGTAATGTTT CTGACCAATG TTCGATAAAT GCTTTCAATG 1320

GTGCTGAAAT GCTATACCAA TACACTGGTG ATGCAAAAAT AATTGTATCA CTAGCCAATA 1380 

TTTTATCTAG AATCGGCAAA TAGTCATCGT CATATGAAGT AATAGTCTCT GCTGTATGTC 1440 

TCACGTCACG TATCGGTTTA AACTGATGTT GTGTCACGTC AATCCATTGA TACTCTAAAT 1500 

CTTGCAAAGC GAATTTTGTT AATTGTGCAG TATTACCGTT TGGTCTACTC CCACCAAACA 1560 

AAACAGTAAT CATTTTAGCC TAACCTCACT TTTGATTAAT AAATATCTGT GTTTTTCGTT 1620 

ACCTAATTAT ACTATCATAA GCTTTGCCTA CCGAATAGTA AAACGCTTAC AACTTTTATA 1680

TAAATTTGAC GAAATTTCGT CATGCCTTAT ATAACGTCGT TTGTGATACG GGGCTAATTC 1740 

ATGATGAAAT TAGATACATA TATCACCATT AAATACAATT CATTTAGTCT TCAATCGGAA 1800 

ACAGTTCATC GATATATTGA ATCTCATCAT CTGATAAAAC GATATCTGCA GCTTTAATAT 1860 

TTTCAACGAC TTGTTCTGCA CGTTTTGCAC CAGGAATAAT CACATCGATA GCTGGTCTCG 1920 

TTAAATAAAA TGCTAATACA ATGTTCGCAA TTGAAGTTTG ATGTGCTGCA GCTATGCTTT 1980

CCAAAGCTTT TACGCGACGC ACATTTTCTT CAAATACACC TGGTTTAAAA TCACGACGTG 2040
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GCTAATGGGA AATATGGAAT AAATGTGATT TGGTGATCAA CACAATATTG TAATACTGCC 2160 

TCATTTTCGC GATGCAATAA ATTATATTCT AACTGTACAA CATCAACGTA ACCATCTTTA 2220 

TTTGCTTCTT TAAGTTGATC TAATGTGAAA TTTGATACAC CAATTGCTTT AATCTTCCCT 2280

TGTTCCTTAA GCTCTTGTAA TGCTGCAACT GCTTGATCTT TCGGAGTGTT GTTATCCGGA 2340

AAATGAATAT AATATAAATC GATATAATCA GTTTGTAGAC GTTTCAAACT ATTCTCAACT 2400 

TGTTGTTTTA AATATTCCGG TTGATTGTTC TGATGTACTT CTTGATTTTC ATCAAATTCA 2460 

TGAGACCCTT TCGTAGCAAT TTTAATTTGC TCTCGCGGAT ATTCTTTAAC AACTTCTCCA 2520 

ACCAATTCTT CTGATCGTTC TGGCCCATAA ATATATGCCG TATCTAATAA ATTAATACCA 2580

TGATTAATGG CTTGACGAAC AACATCTTTT CCTTGTTCTT CATCTAAGTT CGGATATAAA 2640

TTATGCCCAa CCTAtGCGTT CGTCCCAAGT GCGATTGGAA ACACTTCAAC ATCAGATTTA 2700 

CCTAAGTTTA CAAATTGCTn CATTAGACCC AGCnCCTT 2738 
(2) INFORMATION FOR SEQ ID NO: 87: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9425 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 87:

GATTAGATGA TATTTAACGA AAATTAaGrT GmAATACTtG AATGTArGAa GTCTGATGTC 60 

GAAAATAGCT ATTAAAATAG AGTAGACGTA ATGtAAATGA AAGCACCTAA AATAGAAAAA 120 

TTTCAAAAAT AGCGTAATTA TTATAATAAA TAGACTGCCA ATAAAATGCA ATTTTTCACT 180 

TATAACATTC TTCAAAAAAT AATAGCAAAA TTATGTAAAA AATATCTTGT CATGGCAAGA 240 

TTGGCTGTGC TATAATCTAT CTTGTGCTTA AGAACGGCTC CTTGGTCAAG CGGTTAAGAC 300

ACCGCCCTTT CACGGCGGTA ACACGGGTTC GAGTCCCGTA GGAGTCACCA TTTTTTAGGT 360 

CTCGTAGTGT AGCGGTTAAC ACGCCTGCCT GTCACGCAGG AGATCGCGGG TTCGATTCCC 420 

GTCGAGACCG TACAAATGCC TATCCAAGAG GATAGGCATT TTTTTGCGTT TAATATTATA 480

TTAATAAAAG ATATATGGAC GAATGATAAT CATATTGATT TATCTGTTCG TCCATTTTCT 540 

TTAAAATGTA TGAACCTCAA GTAACTTAGT GGTTGGATAT GAAAGATAAA CGTAGACAAT 600 

AAAATCTTTA TTAGACGTAC AAACATATGC TACTGTCAAC ATATTTCTTC GTTGTGATAT 660

GCCACCAGTC CTCCATAACA TCAATTGTTA AAGTAACGAA TAACGAATAA TGATATTTAT 720 
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GACCTCATCA TTGTGTTAAA TATCATTGTC ACAATCCGCC GTGAGAAACT AATAAAAAAT B40 

AGTAATATAT AAGTTTATAT TGGAAAATAG AATTAATAGC TTATAAATGG TAAATTATAT 900 

AATAGGTTAC TATACGTTAT AAGACGGAAA ATGCGCACAA TAACAAAAAT AGTAAGCGAC 960

ATCCTGTGAT TTTTTACACA AACATAAACG ATAAAGAACA AAAAATGATA AAATAATATT 1020 

AATGATTTAA GAAAAGAGGT TTATGCAAAT GGCTAGAAAA GTTGTTGTAG TTGATGATGA 1080 

AAAACCGATT GCTGATATTT TAGAATTTAA CTTAAAAAAA GAAGGATACG ATGTGTACTG 1140

TGCATACGAT GGTAATGATG CAGTCGACTT AATTTATGAA GAAGAACCAG ACATCGTATT 1200 

ACTAGATATC ATGTTACCTG GTCGTGATGG TATGGAAGTA TGTCGTGAAG TGCGCAAAAA 1260 

ATACGAAATG CCAATAATAA TGCTTACTGC TAAAGATTCA GAAATTGATA AAGTGCTTGG 1320 

TTTAGAACTA GGTGCAGATG ACTATGTAAC GAAACCGTTT AGTACGCGTG AATTAATCGC 1380 

ACGTGTGAAA GCGAACTTAC GTCGTCATTA CTCACAACCA GCACAAGACA CTGGAAATGT 1440

AACGAATGAA ATCACAATTA AAGATATTGT GATTTATCCA GACGCATATT CTATTAAAAA 1500 

ACGTGGCGAA GATATTGAAT TAACACATCG TGAATTTGAA TTGTTCCATT ATTTATCAAA 1560 

ACATATGGGA CAAGTAATGA CACGTGAACA TTTATTACAA ACAGTATGGG GCTATGATTA 1620

CTTTGGCGAT GTACGTACGG TCGATGTAAC GATTCGTCGT TTACGTGAAA AGATTGAAGA 1680 

TGATCCGTCA CATCCTGAAT ATATTGTGAC GCGTAGAGGC GTTGGATATT TCCTCCAACA 1740 

ACATGAGTAG AGGTCGAAAC GAATGAAGTG GCTAAAACAA CTACAATCCC TTCATACTAA 1800

ATTTGTAATT GTTTATGTAT TACTGATTAT CATTGGTATG CAAATTATCG GGTTATATTT 1860 

TACAAATAAC CTTGAAAAAG AGCTGCTTGA TAATTTTAAG AAGAATATTA CGCAGTACGC 1920 

GAAACAATTA GAAATTAGTA TTGAAAAAGT ATATGACGAA AAGGGCTCCG TAAATGCACA 1980

AAAAGATATT CAAAATTTAT TAAGTGAGTA TGCCAACCGT CAAGAAATTG GAGAAATTCG 2040

TTTTATAGAT AAAGACCAAA TTATTATTGC GACGACGAAG CAGTCTAACC GTAGTCTAAT 2100 

CAATCAAAAA GCGAATGATA GTTCTGTCCA AAAAGCACTA TCACTAGGAC AATCAAACGA 2160 

TCATTTAATT TTAAAAGATT ATGGCGGTGG TAAGGACCGT GTCTGGGTAT ATAATATCCC 2220 

AGTTAAAGTC GATAAAAAGG TAATTGGTAA TATTTATATC GAATCAAAAA TTAATGACGT 2280

TTATAACCAA TTAAATAATA TAAATCAAAT ATTCATTGTT GGTACAGCTA TTTCATTATT 2340

AATgCACAGT CATCCTAGGA TTCTTTATAG CGCGAACGAT TACCAAACCA ATCACCGATA 2400

TGCGTAACCA GACGGTCGAA ATGTCCaGAG GTAACTATAC GCAACGTGTG AAGATTTATG 2460

GTAATGATGA AATTGGCGAA TTAGCTTTAG CATTTAATAA CTTGTCTAAA CGTGTACAAG 2520 
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GTGATGGTAT 


TATTGCAACA 


GACCGCCGTG 


GACGTATTCG 


TATCGTCAAT 


GATATGGCAC 


2640 




TCAAGATGCT TGGTATGGCG AAAGAAGACA TCATCGGATA TTACATGTTA AGTGTATTAA 2700

GTCTTGAAGA TGAATTTAAA CTGGAAGAAA TTCAAGAGAA TAATGATAGT TTCTTATTAG 2760

ATTTAAATGA AGAAGAAGGT CTAATCGCAC GTGTTAACTT TAGTACGATT GTGCAGGAAA 2820

CAGGATTTGT AACTGGTTAT ATCGCTGTGT TACATGACGT AACTGAACAA CAACAAGTTG 2880

AACGTGAGCG TCGTGAATTT GTTGCCAATG TATCACATGA GTTACGTACA CCTTTAACTT 2940

CTATGAATAG TTACATTGAA GCACTTGAAG AAGGTGCATG GAAAGATGAG GAACTTGCGC 3000

CACAATTTTT ATCTGTTACC CGTGAAGAAA CAGAACGAAT GATTCGACTG GTCAATGACT 3060

TGCTACAGTT ATCTAAAATG GATAATGAGT CTGATCAAAT CAACAAAGAA ATTATCGACT 3120

TTAACATGTT CATTAATAAA ATTATTAATC GACATGAAAT GTCTGCGAAA GATACAACAT 3180

TTATTCGAGA TATTCCGAAA AAGACGATTT TCACAGAATT TGATCCTGAT AAAATGACGC 3240

AAGTATTTGA TAATGTCATT ACAAATGCGA TGAAATATTC TAGAGGCGAT AAACGTGTCG 3300

AGTTCCACGT GAAACAAAAT CCACTTTATA ATCGAATGAC GATTCGTATT AAAGATAATG 3360

GCATTGGTAT TCCTATCAAT AAAGTCGATA AGATATTCGA CCGATTCTAT CGTGTAGATA 3420

AGGCACGTAC GCGTAAAATG GGTGGTACTG GATTAGGACT AGCCATTTCG AAAGAGATTG 3480

TGGAAGCGCA CAATGGTCGT ATTTGGGCAA ACAGTGTAGA AGGTCAAGGT ACATCTATCT 3540

TTATCACACT TCCATGTGAA GTCATTGAAG ACGGTGATTG GGATGAATAA TAAGGAGCAT 3600

ATTAAATCTG TCATTTTAGC ACTACTCGTC TTGATGAGTG TCGTATTGAC ATATATGGTA 3660

TGGAACTTTT CTCCTGATAT TGCAAATGTC GACAATACAG ATAGTAAGAA GAGTGAAACG 3720

rAACCTTTAA CGACACCTAT GACAGCCAAA ATGGATACAA CTATi'ACGCC ATTTCAGATT 3780

ATTCATTCGA AAAATGATCA TCCAGAAGGA ACGATTGCGA CGGTATCTAA TGTGAATAAA 3840

CTGACGAAAC CTTTGAAAAA TAAAGAAGTG AAGTCCGTGG AACATGTTCG TCGTGATCAT 3900

AACTTGATGA TTCCTGATTT GAACAGTGAT TTTATATTAT TCGATTTTAC GTATGATTTA 3960

CCGTTATCAA CATATCTTGG TCAAGTACTG AACATGAATG CGAAAGTACC AAATCATTTC 4020

AATTTCAATC GTTTGGTCAT AGATCATGAT GCTGATGATA ATATCGTGCT TTATGCTATA 4080

AGCAAAGATC GCCACGATTA CGTAAAATTA ACAACTACAA CGAAAAATGA TCATTTTTTA 4140

GATGCATTAG CAGCAGTGAA AAAAGATATG CAACCATACA CAGATATCAT CACAAACAAA 4200

GATACAATTG ATCGTACGAC GCATGTTTTT GCACCAAGTA AACCTGAAAA GTTAAAAACA 4260

TATCGCATGG TATTTAACAC GATTAGTGTT GAGAAAATGA ATGCTATACT ATTTGACGAT 4320
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GCAAACTATA ACGATAAAAA TGAAAAATAT CATTATAAAA ACCTGTCCGA 


AGATGAAGCG 


4440 




AGTTCCAGCA AAATGGAAGA AACGATTCCA GGAACCTTTG ATTTTATTAA TGGTCATGGT 4500

GGTTTCTTAA ACGAAGACTT TAGATTGTTT AGTACGAATA ATCAGTCAGG CGAGTTAACA 4560

TATCaACGTT TCCtTAATGG TTATCCAACG TTTAATAAAG AAGGTTCTAA TCAAATTCAA 4620

GTCACTTGGG GTGAAAAAGG CGTCTTTGAC TATCGTCGTT CGTTATTACG CACCGACGTT 4680

GTTTTAAATA GTGAGGATAA TAAATCGTTG CCGAAATTAG AGTCTGTACG TTCAAGCTTA 4740

GCGAACAATA GTGATATTAA TTTTGAAAAA GTAACAAACA TCGCTATCGG TTACGAAATC 4800

CAGGATAATT CAGATCATAA TCACATTGAA GTGCAGATTA ACAGTGAACT CGTACCGCGT 4860

TGGTATGTAG AATATGATGG CGAATGGTAT GTTTATAACG ATGGGaGGCT TGaATAAATG 4920

AACTGGaAAC TGACAAAGAC ACTTTTCATT TTCGTGTTTA TTCTTGTCAA CATCGTGTTA 4980

GTATCGATTT ATGTTAATAA AGTCAATCGC TCACACATTA ATGAAGTCGA GAGTAACAAT 5040

GAAGTTAATT TTCAGCAAGA AGAAATTAAA GTACCGACTA GTATATTGAA TAAATCAGTT 5100

AAAGGTATAA AATTAGAGCA AATTACAGGG CGATCAAAAG ACTTTAGTTC TAAAGCTAAA 5160

GGCGATTCGG ATTTGACCAC ATCAGATGGT GGAAAATTAT TGAATGCGAA CATTAGTCAA 5220

TCGGTAAAGG TCAGTGACAA TAACTTAAAA GATTTGAAAG ATTATGTTAA CAAGCGCGTA 5280

TTTAAAGGTG CTGAATATCA ATTAAGCGAG ATTAGTTCAG ATTCTGTAAA ATATGAACAA 5340

ACGTATGATG ATTTTCCGAT TTTAAATAAC AGTAAAGCGA TGTTAAACTT TAATATAGAA 5400

GATAACAAAG CGACTAGTTA TAAACAATCA ATGATGGATG ACATTAAGCC CACAGATGGT 5460

GCAGATAAGA AGCATCAAGT GATTGGTGTG AGAAAAGCAA TCGAGGCATT ATATTATAAT 5520

CGTTACTTGA AAAAAGGTGA TGAAGTCATT AATGCTAGAC TCGGTTACTA CTCAGTCGTG 5580

AATG&AACGA ATGTTCAATT GTTACAACCA AACTGGGAAA TTAAAGTGAA GCATGACGGT 5640

AAGGATAAAA CGAATACTTA CTATGTCGAA GCGACAAATA ATAACCCTAA AATTATTAAT 5700

CATTAATATG AATCGTAATA AGCTAGCATT GCAAGCTCAT CATATGTGAG AAGCGGTGCT 5760

AGCTTTTTTG CTGGTACGGT TTATTATGGC TGATGTTTTT GCGTCTCCAA CGTGCGCATT 5820


45 


TATTPATTATT TT , AlVf^ f F tk.CZ AA C , C^CZr , x\ r V f T*CZ r Vh. AAA r T w T T A(2'TY2T t^Jir yr T % n r WTi r V*T > 
xt\x x <~*£\x £\ x x x Xnnv? xf\\Jt\Jt\ ^uvjvjv x x uia nnn x xjtvj x vj x nA^ loi Xr\ X x 


' I " I' A 1\ TV J\ ?W " | " 1 ■ 


coon 

JOOU 




TAGTATTTGT CTAATCATTG TTATAATAAT TAAGAAATTC ATTGCACGTG ATTATCAAAA 5940

TTTAAATATA AGAAACCGGT CGATGAACTA AAGTTACATA ATAGGAAAGG TATACAAAAC 6000

AGCTAATATA CTGATAGTTT CTGTAGGGAA AATCGTATAT TTGCACTGAT GTATATTGCA 6060

GTCATATAGA GAGATTGACT GTTTAAAGAG AAAGGATGAG CCGCTTGATA CGCATGAGTG 6120
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TAGTTGATGT TGGTTTGACT GGAAAGAAAA TGGAAGAATT GTTTAGTCAA ATTGACCGTA 6240 

ATATTCAAGA TTTAAATGGT ATTTTAGTAA CCCATGAACA TATTGATCAT ATTAAAGGAT 6300 

TAGGTGTTTT GGCGCGTAAA TATCAATTGC CAATTTATGC GAATGAAAAA ACTTGGCAGG 6360

CAATTGAAAA GAAAGATAGT CGCATCCCTA TGGATCAGAA ATTCATTTTT AATCCTTATG 6420

AAACAAAATC TATTGCAGGT TTCGATGTTG AATCGTTTAA CGTGTCACAT GATGCAATAG 6480

ATCCGCAATT TTATATTTTC CATAATAACT ATAAGAAGTT TACGATTTTA ACGGATACGG 6540 

GTTACGTGTC TGATCGTATG AAAGGTATGA TACGTGGCAG CGATGCGTTT ATTTTTGAGA 6600 

GTAATCATGA CGTCGATATG TTGAGAATGT GTCGTTATCC ATGGAAGACG AAACAACGTA 6660 

TTTTAGGCGA TATGGGTCAT GTATCTAATG AGGATGCGGC TCATGCAATG ACAGACGTGA 6720 

TTACAGGTAA CACGAAACGT ATTTACCTAT CGCATTTATC ACAAGACAAT AACATGAAAG 6780 

ATTTGGCGCG TATGAGTGTT GGCCAAGTAT TGAACGAACA CGATATTGAT ACGGAAAAAG 6840

AAGTATTGCT ATGTGATACG GATAAAGCTA TTCCAACGCC AATATATACA ATATAAATGA 6900 

GAGTCATCCG ATAAAGTTCC GCATTGCTGT GAGACGACTT TATCGGGTGC TTTTTTATGT 6960 

TGTTGGTGGG AAATGGCTGT TGTTGAGTTG AATCGGCTTG ATTGAAATGT GTAAAATAAT 7020

TCGATATTAA ATGTAATTTA TAAATAATTT ACATAAAATC AATCATTTTA ATATAAGGAT 7080 

TATGATAATA TATTGGTGTA TGACAGTTAA TGGAGGGAAC GAAATGAAAG CTTTATTACT 7140 

TAAAACAAGT GTATGGCTCG TTTTGCTTTT TAGTGTAATG GGATTATGGC AAGTCTCGAA 7200

CGCGGCTGAG CAGCATACAC CAATGAAAGC ACATGCAGTA ACAACGATAG ACAAAGCAAC 7260 

AACAGATAAG CAACAAGTAC CGCCAACAAA GGAAGCGGCT CATCATTCTG GCAAAGAAGC 7320 

GGCAACCAAC GTATCAGCAT CAGCGCAGGG AACAGCTGAT GATACAAACA GCAAAGTAAC 7380

ATCGAACGCA CCATCTAACA AACCATCTAC AGTAGTTTCA ACAAAAGTAA ACGAAACACG 7440

CGACGTAGAT ACACAACAAG CCTCAACACA AAAACCAACT CACACAGCAA CGTTCAAATT 7500 

ATCAAATGCT AAAACAGCAT CACTTTCACC ACGAATGTTT GCTGCTAATG CACCACAAAC 7560 

AACAACACAT AAAATATTAC ATACAAATGA TATCCATGGC CGACTAGCCG AAGAAAAAGG 7620 

GCGTGTCATC GGTATGGCTA AATTAAAAAC AGTAAAAGAA CAAGAAAAGC CTGATTTAAT 7680 

GTTAGACGCA GGAGACGCCT TCCAAGGTTT ACCACTTTCA AACCAGTCTA AAGGTGAAGA 7740

AATGGCTAAA GCAATGAATG CAGTAGGTTA TGATGCTATG GCAGTCGGTA ACCATGAATT 7800

TGACTTTGGA TACGATCAGT TGAAAAAGTT AGAGGGTATG TTAGACTTCC CGATGCTAAG 7860

TAcTAACGTT TATAAAGATG GAAAACGCGC GTTTAAGCCT TCAACGATTG TAACAAAAAA 7920 
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TGAAGGCATT AAAGGCGTTG AATTTAGAGA TCCATTACAA AGTGTGACAG CGGAAATGAT 804 0 

GCGTATTTAT AAAGACGTAG ATACATTTGT TGTTATATCA CATTTAGGAA TTGATCCTTC 8100 

AACACAAGAA ACATGGCGTG GTGATTACTT AGTGAAACAA TTAAGTCAAA ATCCACAATT 8160 

GAAGAAACGT ATTACAGTTA TTGATGGTCA TTCACATACA GTACTTCAAA ATGGTCAAAT 8220 

TTATAACAAT GATGCATTGG CACAAACAGG TACAGCACTT GCGAATATCG GTAAGATTAC 8280

ATTTAATTAT CGCAATGGAG AGGTATCGAA TATTAAACCG TCATTGATTA ATGTTAAAGA 8340

CGTTGAAAAT GTAACACCGA ACAAAGCATT AGCTGAACAA ATTAATCAAG CTGATCAAAC 8400 

ATTTAGAGCA CAAACTGCAG AGGTAATTAT TCCAAACAAT ACCATTGATT TCAAAGGAGA 8460

AAGAGATGAC GTTAGAACGC GTGAAACAAA TTTAGGAAAC GCGATTGCAG ATGCTATGGA 8520 

AGCGTATGGC GTTAAGAATT TCTCTAAAAA GACTGACTTT GCCGTGACAA ATGGTGGAGG 8580 

TATTCGTGCC TCTATCGCAA AAGGTAAGGT GACACGCTAT GATTTAATCT CAGTATTACC 8640

ATTTGGAAAT ACGATTGCGC AAATTGATGT AAAAGGTTCA GACGTCTGGA CGGCTTTCGA 8700 

ACATAGTTTA GGCGCACCAA CAACACAAAA GGACGGTAAG ACAGTGTTAA CAGCGAATGG 8760

CGGTTTACTA CATATCTCTG ATTCAATCCG TGTTTACTAT GATATAAATA AACCGTCTGG 8820

CAAACGAATT AATGCTATTC AAATTTTAAA TAAAGAGACA GGTAAGTTTG AAAATATTGA 8880 

TTTAAAACGT GTATATCACG TAACGATGAA TGACTTCACA GCATCAGGTG GCGACGGATA 8940 

TAGTATGTTC GGTGGTCCTA GAGAAGAAGG TATTTCATTA GATCAAGTAC TAGCAAGTTA 9000 

TTTAAAAACA GCTAACTTAG CTAAGTATGA TACGACAGAA CCACAACGTA TGTTATTAGG 9060 

TAAACCAGCA GTAAGTGAAC AACCAGCTAA AGGACAACAA GGTAGCAAAG GTAGTAAGTC 9120 

TGGTAAAGAT ACACAACCAA TTGGTGACGA CAAAGTGATG GATCCAGCGA AAAAACCAGC 9180

TCCAGGTAAA GTTGTATTGT TgtAGCGCAT AGAGGAACTG TTAGTAGCGG TACAGAAGGT 9240 

TCTGGTCGCA CAATAGAAGG AGCTACTGTA TCAAGCAAGA GTGGGAAACA ATTGGCTAGA 9300 

ATGTCAGTGC CTAAAGGTAG CGCGCATGAG AAACAGTTAT TTCATAATCA ACAGTCATTG 9360

ACGTAGCTAA GTAATGATAA ATAATCATAA ATAAAATTAC AGATATTGAC AAAAAATAGT 9420 

AAATA 9425

(2) INFORMATION FOR SEQ ID NO: 88: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3886 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 88: 

AGTTGTAATG TCACATTTCC AGAGTCTGAA ATTATCTTTA TCACGTTACA TTTACTAGGC 60 

TCTAAAATGA CTGAACATAC AGCATCTTCA ATTACCTTTG AATACCATGA TTTATCGCAA 120 

AATATACATG AATTGATCAC TTGTGTTAGC CAAGAATTAG GCATTGATAT GTCAAAAGAC 180 

AACAAGTTAC ATACCAGTCT GATCACACAT ATCAAACCAG CTATACATCG TATTAAATAC 240 

GATATGCTAC AACCTAATCC TTTGAGGCAA GAAGTTATGC GTCGCTATCC TCAAATCATT 300 

GAAGCCGTTA GCAAGCATAT TAGTCCAATT GAACAAGATG CTGCTATTCG CTTCAACGAA 360 

GATGAATTAA CATACATTAC AATTCACTTC GCATCAAGTA TAGAGCGTGT TGCAACACAT 420 

AAACAATCAA TGATTAAGGT TGTCTTACTA TCTGGTTCTG GTATAGGCAC GTCACAACTT 480

TTAAAATCAA AACTAAATCA CCTGTATCCT GaGTTnCACA TTTGGGAtGc CTATTcCATT 540

TaTcAATTGG aAGaAAGTCG ATTATTGCAA GATAACATTG ATTATGTCAT TTCAACAGTA 600 

CCTTGTGAAA TATCAGCTGT ACCAGTTATT CATGTCGATC CATTTATCAA TCAACAATCT 660 

CGTCAAAAAT TGAATCAAAT TATCAATGAC TCAAGAGAAC AACGAGTCAT GAAAATGGCA 720 

ACTGATGGCA AGTCACTCGC AGATTTATTG CCTGAACATC GCATCATTAT AAATAAACAA 780

CCATTATCAA TTGAATCCGC AATTGCAGTG GCTGTGCAAC CTTTAATCAA TGATGGCATT 840

GTCTATTCAA ATTATACAGC TGCAATTTTA AAACAATTTG AACAATTCGG GTCATATATG 900

GTCATTAGTC CACATATTGC ACTTATTCAC GCTGGTACTG ATTATGTACA GAATGGTGTA 960

GGTTTCGCAC TAACATATTT CACTGAAGGG ATTATCTTTG GTAGTAAAGC TAACGATCCC 1020 

GTTCACC7TG TAATTACATT AGCAACGGAC CACCCCAATG CACATTTAAA GGCATTGGGA 1080 

CAGTTAAGCG AATGCTTAAG CAACGACTTA TATCGACAAG ATTTCTTAGA TGGGAATATT 1140 

TTT^AATTA AACAACACAT TGCTTTAACT ATGACAAAGG AGGCTTAATA ACGTGTCATT 1200 

AGACATTTTG TCAACAACAC GCATCATTGT AAAAGAACAA GTAAATGATT GGACTGAAGC 1260 

TATAACTATA GCTTCTCAGC CATTACTACA AGAACAAATT ATTGAACAAG GCTATGTTCA 1320 

AGCAATGATT GATAGCGTTA ATGAACTTGG ACCTTATATC GTTATCGCAC CTGAAATTGC 1380 

AATTGCACAT GCAAGACCGA ACAATGACGT ACATCAAGTT GGTTTAAGTC TATTAAAGTT 1440

GAATCAACAT GTGGCATTTT GTGATGAAGA TCACTACGCA TCTCTCATTT TTGTATTGAG 1500 

TGCCATCGAC AATCATTCAC ACTTATCTGT ATTACAAAAT TTAGCAACCG TACTGGGCGA 1560

TAACCAAACA GTCCAGCAAC TATTAACTGC AACAAATGCA CAAGACATTA AAAACATTTT 1620

AAAGGAGCAT GATTAATATG AAAATTTTAG TAGTATGTGG CCACGGTTTA GGAAGTAGTT 1680 
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AAGTTGAACA TAGTGACATT ATGACAGCAA GTCCAGAGAT GGCTGACTTG TTTATTTGTG IB 00 

GTAGAGATTT AGCTGAAAAT GCCGAACGTC TAGGGGATGT CTTAGTTCTT GATAATATTT 1860

TAGATAAAGC TGAATTACAA CAAAAGCTCT CAGAAAAATT ACAACAACTT AACATGATTT 1920

AAAGGAGGTA CGACCTATGC AAGCAATCCT TAATTTTATA GTCGATATTT TAAGTCAACC 1980

AGCCATTCTT GTTGCACTGA TTGCCTTTAT AGGTTTAATC GTTCAGAAAA AACCTGCCGC 2040 

AACGATCACT TCAGGAACCA TTAAAACGAT ATTAGGCTTC TTAATTTTAA GTGGAGGTGC 2100 

TGATGTCGTC GTTCGATCTC TTGAACCATT CGGCAAAATA TTCCAACACG CATTTGGTGT 2160 

GCAAGGTATC GTACCTAACA ACGAAGCTAT CGTCTCACTA GCCTTAAAAG ATTTTGGAAC 2220

AACAGCTGCA CTCATCATGG TCTGTGGCAT GATTGTTAAT ATTTTAATTG CCCGCTTCAC 2280

TAATTTAAAA TATATCTTTT TAACAGGTCA TCATACATTT TACATGGCTG CGTTTTTAGC 2340

AATCATTTTA ACAGTCAGTC ATATTAAAGG CTGGCTAACG ATTGTTATCG GCGCACTCGT 2400

ATTAGGATTA ATCATGGCAG TATTACCTGC ATTACTCCAA CCTACGATGC GAAAAATTAC 2460 

AGGGAATGAC CAAGTAGCTT TAGGTCATTT TGGCTCAATC AGTTACTTTG CCGCAGTGCT 2520 

GTAGGTCAAT TATTCAAAGG TAAGTCTAAA TCAACGGAAG AGATTAAATT TCCAAAAGGC 2580

TTAAGTTTCT TACGAGAAAG TACAATTAGT ATCTCGATTA CGATGGCATT ACTTTACTTC 2640 

ATCGCATGCT TATTTGCGGG CGTTAGTTAT GTACACGAAT CTATTAGTGA TGGTCAAAAC 2700 

TTTATTGTCT TTTCATTAAT TCAAGGTGTG ACATTTGCTG CTGGTGTATT TATTATTTTA 2760 

ACGGGCGTTC GTTTAATCTT AGCTGAAATC GTCCCAGCAT TTAAAGGAAT TTCTGAAAAG 2820 

CTTGTACCAA ATTCTAAACC TGCATTAGAC TGCCCTATTG TGTTCCCTTA TGCACAAAAT 2880 

GCAGTATTAA TTGGATTCTT TGTCAGCTTT ATTACAGGTG TCATCGGTAT GTTTATCTTA 2940

TTCTTATTTG GTGGCGTCGT CATTTTACCT GGCGTAGTTG CACACTTCTT CTTAGGTGCA 3000

ACGGCTGCTG TATTCGGTAA TGCAAGAGGC GGTATTAAAG GTGCTATTGc TGGCGCCGCT 3060

CTAAATGGTA TCCTAATCAC GTTTTTACCA TTATTATTCT TGCCATTTTT AGGCGAATTA 3120 

GGTGGTGCTG CAACAACATT CTCAGATACA GACTTTTTAG CTGTCGGTAT CGTGTTCGGT 3180 

AACGCAGTAA AATATATGGG ATTATTTGGT GCGATTCTAT TTATTATTAT CGTAGGTGCG 3240

ACAACAATTT TATTAAAAGG CCGTCAAAAA GAACAGCAAT AGTGTTAACG TAGAAATATA 3300

AAACACCGTC ACATATTGAG TGAATGCCCC TTTtATCAAG AGGAAAGCCA CTTACTTATG 3360

GACGGTGTTT TGTATTATAT TAAATGATAC TTAGCCATAC TATCGACAGC TGCTAAAATT 3420

GCTTCTTCTT GTGTCGCAAT CGGTTCCCAA CCAAGTAATG TTTTTgCACG TTCGTTACTT 3480 
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CCTAGACTCA AAATAAAGTC TGGTAATTTT TTAGTAGAAA CTTTTTGAGC TATTTCAGGT 3600 

CTCTTTTCTT TAATTAATTT TGCAATTTCC AACAAATTAA TTTGTCCATC AGCCGTCGCA 3660 

ATAAATCGCT TGCCATTAGC TTGTTCATTT GTCATTGCCA AAATGTGCAG TTCAGCTACG 3720

TCTCTCACAT CAACAACATT TAACGGAATT TGCGGTACAC GTTTCATTGA ACCATTCAAT 3780 

AAATTTTCTA ATAAATGAAA GCTTCCTGAA ACGTGTGCAT CTAATGATGG CCCAAAAATT 3840 

GCAACTGGAT TGATTGTGGC AAATTCTACT GTTGTATTTT CATTCT 3886 

(2) INFORMATION FOR SEQ ID NO: 89:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 89: 





GTCATCTATC AAAAATTTGG TATACAGACC GACAATTATT AATTAATAAT TTAATTTCCC 60

AGGCAATACC AGTGATTAAA TATCCACAAA TACAACATAA AGAACAACCA TTAGAATCTA 120

TTTCACAACT TATATTGTCT AAGATGACAT CTAATCAATA GTGTTTAAAT TTCTCAGTGG 180

CTGTGAATGA GGTTTAAAAG TACTATAAAA CGTAAACTTT GATACTTTAA AATACGCAAA 240

AAACGGTAAA CCCTAATTCA TATTATAGAG TTTACCGTTT TATTTTTTAA CTTGCATCAT 300

AGTTATATTA ACATTATTGT TGGTAGTTTG GATCAGTAAC CATTGCTTGT CCAGTATAAT 360

CAACCGTTAC AATTGAATAT TTTCCaTTTG CATTTGGGTC TTTAAAACTA AACACATACT 420

TATAGTTGCC ATTATGTTCT TCAATAGAAT AATCATTATA CACTTTATTA TTACTACCAA 480

ATT'ilATTTGC TTCATTATTA GCCGCATTTA AAGCTGTTTG GAAATTTGGC AATTGCTGTA 540

AAGCTTGATT TTTATTTCCA TTAAACGGAT AAATTTGACG TGCAACCGGC GCGGCATTTT 600

GnCCATAATA TGGTGCAACG TAACTTGATT TTTGATTATT ATTCGCTTGG TTATTACTTG 660

ATTGGTTATT ATTTGTTTGG TTTTGGTCAT TGTTTGTTGC ATTTGAATTA GATTGrTGCT 720

GGTTATCGTT TGCACTATTA TCTTTATTAT CTTTGTTTAC GTCTTTACTA TCATCTTTAT 780

TATCTTTCTT ATCTTTAGAT GAATCATTTG ■rrmTi'ATC TTGTTGTTCA GTTTTCGCTT 840

TATCATCTTT TTCTTTATTA CCGTCTTTTT GTTGGTCACT ATCTTGACCA CATGCAGCTA 900

AAAATAATGA TAATGCTAGT AACCCTGTAA CTAATCTTTT CATACATATC TCCTCCTATA 960

ATTCGATATT CATTGAATAA TCTTGAAATA CATATCTACC ATGTGTATCT TTTCATGGCT 1020
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TAAGGTTCTT TTTATTATAC CCTAATTTTT GTTCATTATT ATTTAATTTT TGTGAATTTT 114 0 

ATGtTTkCTA TAAATTTAAT TATTTTACTT TAACAATTCA TTACGCATTT AGCATTTCAA 1200 

GGTATACACA ATATTTATTA CTATGATTTC ATTTTATCTG CTGCAAAAAC AATCATTATA 1260 

ACTCTTTTTC CATAATTAAA TCTGTATCCG TTACATCACC TGTTTGAAAA TGATGTTCAC 1320 

CAACCACTTT AAATCCATGA CGTTTATAAA ATGCTTGAGC ACGAGGATTA TGCTCCCAAA 1380

CTCCTAGCCA AATTTTATGT TTATTATGTT CTTGAGCAAT TTTTTCGGCC AATTCTATCA 1440 

ATTGTGAACC TCTTCCGCCA CCTTGAAAGT CTTTCAAAAA ATATATGCGC TGCACTTCTA 1500 

AATAGGTCTC CCCCATTTCT TCAGTTTGAG CACTATTAAT ATTCATCTTT ATATAACCAA 1560 

CATTCGCACC ATCTTCTTGa TAAAAATAAT GAAATGAATC TACATGGTTA ATCTCTTGTG 1620 

TAAATTTCTC TACAGTATAA TTGTCTTTAA AAAATTGATC AAAATCTTTG TCATCATAGT 1680

AAGAACCAAA CGTGTCATAA AATGTTCTAG TTGCTAATTC AACTAATTCA CTAGCATTTT 1740 

GTTCTGAAAT TTCTTTGATT ATCCCAGCCA TATAAATCCT CCAATAAACA GTGATCGAAT 1800 

CAAAATATTA CTTATGTTAT TTTTCAGCCA AAACTATTTA AAAATACATT AACACAAATC 1860 
AATTACAAAT TGTATTGATT GTGTGTAACA TCAATAAATG ATACATTTAT TCCAGTAAAA 1920

TGGCCGTATT TTCAAAAGAG AAAAAGAGAG GATGTATCGT TGTGATAGAA ACATTTAAAG 1980 

CGTTTGTAAT TGATAAAGAT GAGAGTGGTA AAGTGACACC AACTTTCAAA CAATTATCGC 2040

CTACTGATTT ACCTAAAGGA GATGTGCTGA TTAAAGTACA TTACTCTGGT ATAAATTATA 2100 

AAGATGCTTT AGCGACTCAA GATCATAATG CAGTCGTAAA ATCGTATCCT ATGATTCCAG 2160 

GAATAGATTT AGCTGGAACA ATTGTTGAAT CCGAAGCACC AGGCTTTGAa AAAGGAGAAC 2220 

AAGTAATTGT AACGAGTTAT GACCTAGGTG TCAGCCATTA TGGCGGTTTT AGTGAATATG 2280 

CGCGTGTAAA ATCAGAATGG ATTATCAAGC TTCCTGATAC TTTAACATTA GAAGAATCAA 2340

TGATATATGG CACAGCTGGT TATACTGCCG GTTTAGCAAT TGAAAGACTT GAAAAAGTTG 2400 

GAATGAATAT TGAAGATGGT CCTGTACTCG TTCGCGGTGC TTCAGGTGGT GTCGGTACTT 2460 

TAGCAGTACT CATGCTTAAT GAACTTGGTT ATAAAGTTAT CGCAAGTACA GGTAAACAAG 2520 

ATGTTAGCGA TCAATTACTT GAACTTGGTG CCAAAGAAGT TATCGATCGA CTTCCTGTTG 2580 

AAGATGATCA TAAAAAGCCA CTCGCATCAT CAACTTGGCA AGCTTGTGTA GACCCTGTTG 2640 

GTGGCGAAGG TATTAATTAT GTTACAAAGC GTTTAAATCA TAGTGGGTCA ATTACAGTTA 2700 

TTGGTATGAC TGCCGGTAAT ACTTATACTA ATTCTGTATT CCCTCACATT TTAAGAGGTG 2760 

TAAACATTTT AGGAATTGAC TCGGTATTTA CTGCTATGAA ATTAAGACAG CGCGTTTGGC 2820 
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TTGATGAACT TCCAGAACAA CTTAACAAAG TAATTAAACA TGAAAATAAA GGGCGCATTG 2940 

TTATCGATTT CGGTGTAGAT AAATAGTATT CATGAAAAAG ACATCCCGTT ATGCGAGATG 3000 

TCTTTTTTAA TTTAGTATTT GATATACATA CCGCCTGAAT CTGGTTCGGT AGGTATAAAT 3060 

CCAAATTTTG TATATAATTT ATCCGCTGGG TAGTCTGCAA TCAGAcTAAC GTATGTACTC 3120

TCAACAGCCA CACCTTTAAT ATATTGCATA ATATGCTCCA TAATTAGACT GCCGTAACCT 3180

TGACCTTGGT AACTTTTCAA AACTGCAATA TCAACAATTT GAAAAACAGT TCCGCCATCG 3240 

CCAATCACTC TACCCATACC AATTAACCGA TCTTTATCAT ACAAGGTTAC TGTAAATAAG 3300 

GCATTAGGTA ATCCTTTTTC aGCTGTTCGC GCGTCTTTGG ACTCATACCT GCGTTAATCC 3360

TTAATGCGCA ATAATCCTCG CAAGTCGGAA TATCATATGT CACTTTAACC ATTATTTACC 3420 

CCACTTTTCA TCACACAATA TATCAACCTA GTATAAATGT TTATTTACAA TAGTCTTATT 3480

CGCTTCTTTA AACACTTCAT GATGACTTGA AACATAACCC TCTGCATTCG CATCTGGTTG 3540 

GATATATGTT TTAGCAAGGT TCGCTGCATT TGCACCATCA CTAAATGCAC TTGCAATTAG 3600 

ATGTGATTTT GCATCATGAT AAACAATATC TCCACACGCA TAGATACCAG GTATACTAGT 3660

TGTCGTATTA CCAAATCCTT TAACACGACA ATCATCATGC ATATCTAGCT TTGAAGATGT 3720

TtCACTCAAT AATGTATTAC AACGATCAAA CCCATGACTA ATAATGACAT CGTCAAATTT 3780 

AACTGTATGC CTATCGCCAC TTTCAACATG TTCCAAAACA ACTTCACTTA TATGCGTTTC 3840 

ATCATCATTG CCGACCAAGT ATTTAATACG TGTTTTTGGG CATAGTTTCA CATTTAAATC 3900

TGTCACCAAC GTTTTCATCG CTTCATGACC ACTTACATCT TCTTTTCGAT AAACAACTGT 3960 

CACGCTTTTA GCAATCTTGG CAATATCATG CGCCCAATCT AATGCTGTAT TTCCTCCACC 4020 

TGATATTAAT ACATCTTTAT CTTTGAAACG TCTGTAACTT TGTACAACAT AATGTAAATT 4080 

AGTT5ATTGA TATCTCTCTA CACCTTTAAC ATCTAATTGT TTTGGATTAA TAATACCCGC 4140 

ACCAATTGCA ATGATAACTG CTTTCGATGT ATATATTTCT CCCGCTTCTG TTTCAACTTC 4200 

GAAATGACGT TCTGCCTTTT TCCTAATATC TACCACACGT TCATTCAAAT GAACTTCCGG 4260 

TTTAAAATAT AATCCTTGCT TAATTGTATC TTTTAAAATT TCATGACAAG GTTTTGGCGC 4320 

AATGCCGCCA ATATCCCAAA TAATTTTTTC AGGGTAAATT CTCATCTTAC CCCCTAATTC 4380

AGATTGAACA TCTATCAATC TTACAGACAT ATCTCGCAAT CCAGCATAAA AGCTTGCATA 4440 

CAAACCAGAC GGACCGCCAC CAATGATTGT AACATCTTTC ATTATGTGCC TCCTATGACT 4500

CTCTATATTC ATTTCTTTCA TTAACGTGCT CAAATTGATA ATTATTATCA TTTAAAGCCA 4560

TTATACTATT AATATTTATA TTGTTAAAAT AAATCGCATA GTTAGCCATG AATTATCAAT 4620 
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GAAAGATGTG 


TATATTTTTT AGTTCTAGTT ATATTATTTT TTAAAAGACT CATCACGTGG 4740

TTCTTTAAGA ATTGCTTGTC TTAAAAGGAA AAATAGCAAC AATAAACCTG CAAGCATACC 4800

TGTGTGCCCA ATACCTGCAA AGCCTGCnAA TGCTTCTGGA GAGTATGATT TACCAGTGAC 4860

TTGGAAGAAT CCTTTTGTC 4879


(2) INFORMATION FOR SEQ ID NO: 90:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1560 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 90:






ATAATGTCTT AGaTTGATTG GGAGTTTTTT TAATTTTTTT GAAATTAAAT TAATCTGTAS 60

yTAATAAAAA ATTTGAATAA CTGACACAyT TTTTTGATCA TAGCTAyATA CTTTGTGAAT 120

TAATTCACAT TATAATAAGA GTGAAGATAA GAGTATTATA AATnATCTTT AAATAAATAT 180

ATGTGAAGTA AAAATTACAC GTTAGCATAT CGATTATGgT CATTTCkTTT AACATATTAA 240

CTgGGGaACG TTAAAAGTTA ACGGkTGATA TCyAACtAAA AACAAGGTCA CAGTAGTATG 300

TTTTAATCTG GCGTCTATTA CAAATAAAAA TTACATCTAT AATTATTCGT TTTCTTTTTT 360

GAAAGTAATA GCCAATTAAT ATCATACATA CTGGAGTGAC TATAAGGAGG ACATTATTAT 420

GAGAGCAGCA GTTGTAACGA AAGATCACAA AGTAAGTATT GAGGACAAAA AGTTAAGAGC 480

TTTAAAACCT GGTGAAGCGT TGGTACAAAC GGAATATTGT GGCGTTTGTC ATACCGATTT 540

ACATGTTAAG AATGCTGATT TTGGTGATGT TACAGGCGTT ACTTTAGGTC ATGAAGGTAT 600

TGGTAAAGTC ATCGAAGTTG CGGAAGATGT AGAATCATTA AAAATTGGAG ACCGTGTGTC 660

TATCGCTTGG ATGTTCGAAA GCTGTGGAAG ATGTGAATAT TGTACAACAG GTCGTGAAAC 720

ACTTTGCCGT AGTGTGAAAA ATGCTGGTTA TACAGTAGAT GGTGCAATGG CTGAACAAGT 780

TATTGTTACT GCAGACTATG CTGTGAAAGT ACCTGAAAAA TTAGATCCAG CAGCAGCGTC 840

TTCTATTACA TGCGCAGGTG TGACAACTTA TAAAGCTGTA AAAGTAAGTA ATGTAAAACC 900

TGGACAATGG TTAGGTGTTT TTGGTATAGG TGGTTTAGGT AACCTAGCTT TACAATATGC 960

TAAAAACGTT ATGGGGGCTA AAATTGTTGC CTTCGACATC AATGATGATA AATTAGCATT 1020

CGCGAAAGAA TTAGGTGCTG ATGCTATTAT TAATTCTAAA GATGTTGATC CAGTTGCAGA 1080

AGTTATGAAA TTAACTGATA ACAAAGGATT AGATGCAACA GTGGTAACTT CAGTTGCTAA 1140
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TTTACCTGTT GATAAAATGA ACTTAGATAT CCCAAGATtA GTGCTTGATG GTATTGAAGT 1260 

AGTAGGTTCA CTTGTTGGTA CAAGACAAGA CTTACGTGAA GCGTTTGAAT TTGCTGCTGA 1320 

AAATAAAGTA ACACCTAAAG TTCAATTAAG AAAATTAGAA GAAATCAATG ATATTTTTGA 1380

AGAAATGGAA AATGGTACTA TAACTGGTAG AATGGTTATT AAATTTTAAA AATATCAACT 1440

GACTATATAG ATAAAGAAGG TAGTGCTCTG AACACTATCA TTATTAATCA AACCCCGAGG 1500

TTTTCCTGAA AAGATAGTGG nAAATCCCCG TGTTTTTTGG GTTTGAGGnG GTTGTnTGTA 1560
(2) INFORMATION FOR SEQ ID NO: 91:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11014 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 91:



GTCCTGTnGC TGCAATGAAT ACGCCTAAAA ATCCAGGGAT GTAATGGATA CTTTGTGGTA 60

GTACTAATGA TAGAAATGAT AAAAATGAAA TCACAAAGGC TACGCTCGCA AAAGCTTGAC 120

ATGTACGCTT ATCGCCATAA TCTAACCCTG TACGTATATG TAATAAATAC TGTAATCCGA 180

TACTTAAATA CATAATTGCC ACGCATAAGA AGAATGGGAA GAATGTCTTT TCAAAGTCCG 240

GATATAGGCT GTTAGATAGG AAGACCATGA TGAACATATT AAACATCATA AACGAGACGT 300

CTTTGAATGT AACTTGACCA AATCGATTTG TAAAAAATGT TTGATGAGAC CACATTAACC 360

ATAAGAACAA ACTCATGACG ATGTATTTGA AAAATAAATC AGCTGAAATG GAACCGTTTT 420

GTGTTGTTAA AATCACATGT GCAATTTTTT GAATGGCATA GACGAAAATT AAATCAAAGA 480

ACAACTCATG GAATCCTGCA CGCTTTTCAG CTAAATGTTT TGGTGTTAAT GCATTAACCA 540

TAAAATTTTA ACTCCTTTAA GATGTGTAAT TAATTTACTA AGTATACTAT TTATTTTTTC 600

TAGTGAATAG GGGCAGATTT GGCGATGAAG TGGAAGGAGA GGTGACTGCA AGGTAATTGC 660

GGAATTAACA ATCATCAGCG ATTTAATATT TGACTGGAGA CGTCATGGTA ATAAAAAATT 720

GATGAGAAAT TGATGGTGAA ACCAGCTGTG AATAsCGaTG cAATGATrsA TAGaATTTAA 780

TTAGAGTCAT TACGCGaAAT GATTAATGAT AATTTGTGGT AAATCAAAGC aTAATTTTGT 840

ACTATAGATG AGGATGATAG AGCATATTTA AGAGGGTGAA ATGTTAAAGT GAAACCGTTT 900

ACGTTTCCGA TTGCCCAAAC AAATTACATC ATTGTATAAT ATGATTTGTT AAATGCATAA 960

CAAGAATGAA AATGTAACAT ACGTAGCAAT TGGTTTCATA AATTGGATGT TAGTGGCGTA 1020
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TGACGAGAGT CGTATTAGCA GCAGCATACA GGACACCTAT TGGCGTTTTT GGAGGTGCGT 1140 

TTAAAGACGT GCCAGCCTAT GATTTAGGTG CGACTTTAAT AGAACATATT ATTAAAGAGA 1200 

CGGGTTTGAA TCCAAGTGAG ATTGATGAAG TTATCATCGG TAACGTACTA CAAGCAGGAC 1260

AAGGACAAAA TCCAGCACGA ATTGCTGCTA TGAAAGGTGG CTTGCCAGAm ACAGTACCTG 1320

CATTTACGGT GaATAAAGTA TGTGGTTCTG GGTTAAAGTC GATTCAATTA GCATATCAAT 1380

CTATTGTGAC TGGTGAAAAT GACATCGTGC TAGCTGGCGG TATGGAGAAT ATGTCTCAAT 1440

CACCAATGCT TGTCAACAAC AGTCGCTTTG GTTTTAAAAT GGGACATCAA TCAATGGTTG 1500 

ATAGCATGGT ATATGATGGT TTAACAGATG TATTTAATCA ATATCATATG GGTATTACTG 1560 

CTGAAAATTT AGTAGAGCAA TATGGTATTT CAAGAGAAGA ACAAGATACA TTTGCTGTAA 1620 

ACTCACAACA AAAAGCAGTA CGTGCACAGC AAAATGGTGA ATTTGATAGT GAAATAGTTC 1680 

CAGTATCGAT TCCTCAACGT AAAGGTGAAC CAATCGTAGT CACTAAGGAT GAAGGTGTAC 1740

GTGAAAATGT ATCAGTCGAA AAATTAAGTC GATTAAGACC AGCTTTCAAA AAAGACGGTA 1800 

CAGTTACAGC AGGTAATGCA TCAGGAATCA ATGATGGTGC TGCGATGATG TTAGTCATGT 1860

CAGAAGACAA AGCTAAAGAA TTAAATATCG AACCATTGGC AGTGCTTGAT GGCTTTGGAA 1920

GTCATGGTGT AGATCCTTCT ATTATGGGTA TTGCACCAGT TGGCGCTGTA GAAAAGGCTT 1980 

TGAAACGTAG TAAAAAAGAA TTAAGCGATA TTGATGTATT TGAATTAAAT GAAGCATTTG 2040

CAGCACAATC ATTAGCTGTT GATCgTGAAT TAAAATTACC TCCTGAAAAG GTGAATGTTA 2100 

AAGGTGGCGC TATTGCATTA GGACATCCTA TTGGTGCATC TGGTGCTAGA GTATTAGTGA 2160

CATTATTGCA TCAACTGAAT GATGAAGTTG AAACTGGTTT AACATCATTG TGTATTGGTG 2220 

GCGGTCnAAC TATCGCTGCA GTTGTATCAA AGTATAAATA ATAAGAAAAC AGGTTATCAC 2280

AACA5TATTA ATtACATGTT GGCATAAGCT GTTTTTATTT GTTTATGGAT TTATTGGGTA 2340

ATATTAGTCA TTTGATGGTT TAATTGCAAA TGCTCTAACA GGGAACCCAG GTGCATCTTT 2400

TGGTTTAGGG CTGATAGCGT AAATGATGGC GCCACGAGTT GGTAATTGAT CTAAATTAGT 2460 

TAATAACTCG ACTTGGTATT TATCCTGACC AAGAATATAA CGTTCGCCAA CTAAATCACC 2520 

ATTTTTTACA ACGTCCACAG ATGCATCGGT ATCGAATGTT TCATGACCAA CAGCTTCAAC 2580

ACGACGTTCT TCAATTAAGT ACTTCAAAGC ATCTAATCCC CAACCCGGTG CATGTTGTTG 2640

TCCGTTCGCA TCTTTGTTTT CAAACTTTTC AATATTAGGC CAACGTTTTG ACCAATCGGT 2700 

ACGAAGTGCA ACAAAAGTGC CAGGTTCAAT AGTACCATGC TCTTTTTCCC ATGCTTCTAT 2760

ATGCGCACGT GTTAGGATGA AATCATTGTT GTTCGCTACT TCTGTTGAAA AGTCTAATAC 2820 

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AAAGTGAATT GGTGCATCAA TGTGAGTACC 
ATATTGCGTT ACAATATTCC AACGTTGCAC 2940

ATAGAAACCA TGATCTTTAA CCGTGAATAA AGTTGAAACT TCGCCTTTTT CAAACTCACT 3000

AAAACGTGGT ATTTCCGGAT CAAATGTATG CGTTAAATCA ACCCAAGTTG CTTGTTTTAA 3060

AGTATTTAAT TGTTGCCATA AAGGATATTG TGTCATAAAA TCACCCGTTT TTAGTTTATT 3120

ATATGATAAA TGCTGCGATT ATTCTTGGCG TTTAGCTTTA ACAGCATTCA CAAGCACAGT 3180

CAATGCATCT TTAACTTCTT CTTCTTTTCG CGTTTTTAAA CCACAGTCAG GGTTTACCCA 3240

GAATAATGAG CGGTCGATTT GTTGTAGTGA ACGATTGATT GCTGTAGTAA TTTCTTCTTT 3300

TGTTGGAATA CGTGGACTAT GAATATCATA TACACCTAGA CCAATACCTA AATCATAATT 3360

AATATCTTCA AAGTCTTTAA TTAAATCACC ATGGCTACGA GATGTTTCAA TTGAAATAAC 3420

ATCAGCATCT AAGTCATGAA TAGCATGAAT GATTTGACCG AATTGAGAAT AACACATATG 3480

TGTATGGATT TGAGTTTCAT CACGAACTGA AGACGTTGCA AGTTTAAATG ATAAAACAGC 3540

ATCTTTAAGA TATTGTTCGT GATATTCAGA GCGTAATGGT AAGCCTTCAC GTAATGCAGG 3600

TTCGTCAACT TGGATAACTT TGATTCCTGC AGCTTCAAGT GCTAATACTT CTTCGTTGAT 3660

TGCTAAAGCA ATTTGATCTT GAACGACTTT ACGTGGTAAA TCAACACGTT CAAATGACCA 3720

GTTTAGAATT GTTACAGGTC CAGTTAACAT ACCTTTAACT GGTTTATCTG TTAAGCTTTG 3780

TGCATAAACT GTTTCATCAA CAGTTAAAGG CGCTGTCCAT TTTACATCAC CATAAATGAT 3840

TGGTGGTTTT ACGGCACGTG AACCATATGA TTGCACCCAA CCGAATTTAG TTACTAAGAA 3900

ACCTTGTAAT TTTTCTCCGA AGAATTCAAC CATGTCATTA CGTTCAAATT CACCGTGAAC 3960

TAATAGATCT AAGCCAATGT CTTCTTGAAT TTTAATCCAT CGAGCAATTT CATTTTTTAA 4020

GAATGTTTCA TATGCTTCGT CTGTAATGCG TTTGTTCTTC CAATCTGCAC GGTATTTTCG 4080

AACTTCTCGG CTtTGTGGGA ATGATCCAAT AGTTGTTGTT GGTAAATCCG GTAAGTTCAA 4140

AO GTTTT TGT TGTTGTTCAA TACGTTGCGC GAATGGTGAT TGTCTTGAAG TACGCACGCT 4200

TTCGAAATCA TAATCTAAGT TTTTGAATGA TTGATTTTGG AAACGCTCAT AACGTGCTTT 4260

TAATTTATCA TATTTAACAC TATCGTTTTG ATTAAATAGG CGACGCAATG CATCTAATTC 4320

GTCTAATTTT TCAGTTGCAA AGCTTAAGCC TTCGCCAACA CTTGTATCTA ATGTTTCATC 4380

ATCTAAAGAT ACTGGAACAT GTAATAATGA AGATGATGGT TGAATGACAA GTTCATTAGT 4440

GTGTGCTAAC AATTTATCGA TTAAGACTTT TTTAGCTTCA ATGTCACTTG CCCATACATT 4500

ACGACCATCA ATAATTCCAG CGTATAATGT TTTTGATTTA TCAAAATCTC CAGCTTCAAT 4560

TTGTTTAAGG TTATAGCCAT TATCATGGAC AAAGTCTAAA CCTATACCAC CAACAGGTAA 4620
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AACACCAGCT 


TTTTCGAAAT AGTCATAAGC TTCACGTGTA ATATTTTCAT AGCTTTCGCT 4740

GTCGTCTGTA ACTAAGATTG GCTCATCAAC TTGAATGTAC TCAGCACCTG CATCAATTAA 4800

TGATTCAAAC ACTTCTTTAT AAAGTGGTAA TAACGTTTTA ACTTTTTCTT CAAAAGTTTG 4860

GTGACCGCCT TTTGATAATT TAACAAAAGT AATCGGACCA ACAATGACAG GGTGAGCGTT 4920

AACGTTTAAA GATTGGGCAT ATTTAAAGCG ATCTAATAAT ACATTGCGAC TCACTTTAGG 4980

CTCAACATTG TCCCATTCAG GTACGATGTA ATGATAGTTA GTGTTAAACC ATTTTATAAG 5040

TGCACTTGCA ACATGGTCTT TATTACCGCG AGCAATATCA AATAATAAAT CATCATCAAT 5100

AGTTCTTCCT TGGAAACGTT CAGGGATGAT GTTGAATAAT AATGACGTAT CTAATATATG 5160

GTCATATAAA GAGAAATCAC CAACTGGGAT GCTATCTAAG TGATAGTACT TTTGtAATAA 5220

TAAATTTyCT TTATGTAGAT CAGTTAATGT TTGATCTAAT TCTTCTTTAG AAATCTTCTT 5280

TGCCCAATAA CTTTCGATGG CTTTTTTCCA TTCTCTTTTT CTACCTAATC TTGGGAATCC 5340

TAAGTTTGAT GTTTTAATTG TTGTCATAAT ATTGCCTCCT TGTGAGCAGT x riwi x xxx 5400

GAGTATGCTG CAAGTTCTAA TGAATCTTCG ACATTTTGAA ACGGTGTGAT AATGTATAAA 5460

CCATTAAAAT ATTCATGAAC AGTATCGATT AAATCCTTTG AAAGCTTAAG ACTTAGTTCT 5520

CGTGTTTTGG CTTTATCATC TTTAACTGCT TCAAATTGTT GTAAAATTTC ATCTGACATC 5580

TTGATTCCTG GCACTTCATT ATGCAAAAAG AGTGCGTTTT TGTAACTTGC GATAGGCATA 5640

ATGCCTATGA AAAATGGTTT GTTCAAGTGC TTAGTGGCAT GGTAAATTTC AATGATTTTC 5700

TCTTTGCTGT ACACGGGTTG TGTTATAAAA TAAGACATTC V^VJV* X X X \- Xf\X CTTTTTCTCT 5760

AATCTTTTGA CGGCACCATA TAATTTACGA ACATTAGGGT TAAAGGCGCC AgcGATGTTG 5820

AAGTGTGTAC GTTTCTTCAG CGCATCACCG TCAGTGTTAA TACCTTGATT AAATCTTAGA 5880

GCGSGTTCAG ttaatccttt agaattaaca TCATAGACAT TGGTTGCACC TGGTAAGTGA 5940

CCAACTTTTG AAGGATCACC AGTTATGGCT AATATTTGGT TAACGCCAAT GAGCGATAAT 6000

CCAAGTAAAT GGGACTGCAA GCCGATTAAG TTTCGGTCTC GACATGTAAT ATGTACGAGT 6060

GGTTCAATAT TGTAATATTG CTTAATTAAG CTAGCAGCAG CAATATTGCT AATTCTGACA 6120

GTTGCCAATG AATTATCTGC GAGTGTTACC GCATCTACAT TAGCTTTATC AAGTTTAGCG 6180

ATATTTTCAA AAAATCTATC CGTGTCTAAA TGTTTCGGTG TATCCAATTC GATAATAACG 6240

GTTGGACGTT CTTGAACCTT AGATGTTAAT GATTGTCTAA CTTTATTTTG AGATGGATTG 6300

AAAAGTGCTT TCGTTGGTAT CGGAATCACT TTTTTGTCAT TAACAGGTTT AAGTGTCTGA 6360

ATAGATTCTT TAATAAATTT GATGTGCTCT GGCGTTGTAC CACAGCAACC ACCAATTAAA 6420



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TACTTAAATT CACTATTTTC AATATCTAAT AAGCTGGCAT TTGGATAACA AGATAAGAAT 6540 

GCGTGCTCTG GTAATTCAAT ATGTGTGAAA GACTCTTGCA TATGGTGCGG GCCATGATGA 6600 

CAATTGAGTC CCACGATGTT TGCACCACAT TGAACGAGTT GTTTTAATCC TTCATTGATT 6660

GCCTGACCAT TAACTAAGTA ATTTGTGTTT GAAGCGGTTA ATTGAGCAAT GATTGGAATG 6720 

TCGTATTTCT TTCTCGTTCG TGAAATGACA TTTGTTAACT CTTCTAGGTC GTAATACGTT 6780 

TCGAAAAGTA GCGCGTCAAC GCCTTCTTCA ATTAAGGTGT CTATTTGAAT TTCAGTATGA 6840

TAAAGAATAG TTTGTAAGCT GATATCCTCT TGTTTGATAC CTCTAAACCC ACCAACTGTG 6900

CCTAATATAT ACGTATCTTT ATTTGCTGCT TTTTTTGCGA TGCGAACGGC GGCTTGATGT 6960

ATTGCTTTAA CTTTATCTTC AAGACCGAAT CGTTTTAACT TTTCAAAATT TGCACCATAA 7020

GTATTGGTTT GAATGACATC AGCACCGGCT TCAATATATG AACGATGGAT GCGTTCAACT 7080

TTATCTGGAT GGCTAAGATT ATATGCTTCT GGACAGGTGT CTAATCCTTC AGAGTATAAA 7140

ATGGTTCCTA TAGCGCCATC AGCTACTAAA ACATTATCTT TCAATTGTGT GAGGAATTGA 7200

CTCATTGAAT GCCTCCTTTA ATGCGTATTT GATGTCTGCA ATGAGTTCAT CAGGATCTTC 7260 

GAGACCAACA CTTAATCGGA ATAGACCGAA AGTGATACCA CGTTCTTGTC TCACTTCTTC 7320

AGGTAGTGCA GCGTGAGACA TTGTTGCTGG ATGTGAAAGG ATCGTTTCAA CACCGCCCAG 7380 

ACTCACTGAA ACGAGTGGTA ATGTCAGTGC ATCGACAAAT TGTTGTGCTT TAGACTCATC 7440 

AGCTAAACGA AAGCCAATAA CGGCACCGCC ATTTTTAGCT TGTTCTAAAT GAGCAGTAGT 7500

GAGTCCCGGA TAATAAACTT CTGAAATTTG ATCTTGCTTT ATTAAAAATG ACACGATTTT 7560 

TTGAGCGTTT TCGACAGATT GTTTAAATCT GATTGGAAAA GTTTTTAAAT GTTTAGCAAG 7620 

TGTCCAGCTA TCCTGAGCAG ATAACATATT GCCTGTACCA TTTTGTATTA AATAAAGAGC 7680

GTCftCTAATT GCCTCATTAT TAGTTATGAC AGCACCAGCA ATTAAATCGC TATGTCCACT 7740

TAAAAATTTT GTAGCACTAT GAATGACAAT ATCAGCGCCA AGTAATAAAG GTGATTGACc 7800 

TAACGGTGTC ATAAATGTAT TGTCCACAGC TACCAGTAGT TCATGCTTTT CGGCTATTTT 7860 

AGAAACAGCT TTGATATCAG TAATTTTAAA ACAGGGATTC GATGGTGTTT CGATATAAAT 7920 

TAATTTTGTG TTTGATTGAA TGGCACCCTC GATTTGTTCG AGCTTTGTAG TATCTACGGT 7980 

TGTAAATTCA ATATTAAATC GATTCAAAAT TTGCTCAGTG AGGCGAAAAG TACCGCCATA 8040 

TACATCATCG GGTAAGATGA CATGATCACC AGATTTGAAA GTCAAAAGTA CTGCTGAAAT 8100 

AGCAGCAATA CCTGATGCAA AAGCAAAAGC GAATTTTCCC TGTTCTAATC GTGCTAACTT 8160

CTCTTCTAAA AGTTCACGGT TAGGGTTGCC CTTCGTGCAT AATCATATTT AACATCGCCA 8220 
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TCCACACCTC TACGCCAATC GAATATCACT TCTGTCTCTT TTGAAAGTGT CATACAATCT 834 0 

CTCCAATCTG AGCTTTATCT AATGCTTGGA TGATATCGCG TTCGATGTCT TCATAATTTT 8400 

CAACACCTAG TGATAAGCGG ATTAAATACT CATCAATGCC ACGTTTATCT TTTTCAGCAT 8460 

CTGGCATATC AACATGTGTT TGGGTGTAAG GGAAGGTCAC TAATGTTTCA GTACCTCCTA 8520 

AACTTTCTGC AAAAATGCAA ATGTCTAAAT TTTCTAATAA TTTAGCGACG CTATAGGCCT 8580 

TGTTAAGTCT TAAACTAAGC ATGCCAGTTT GCCCGCTATA TAGTACTTCG TCAATTGCTT 8640 

GAAGTGACTG ACATTTTTTA GCAAGTTTTC TAGCGTTTGA TTGCGCACGC TCAATGCGTA 8700 

AATGCAAAGT TTTAAGTCCA CGTAACAACA AATAACTATC TATTGGTGAA AGTGTTGCGC 8760

CAGTCATGTT GTGAAAATCA AACAACTGTT GCGCGAGTGA TTCATCTTTG ACGGTTACGA 8820 

CACCTGCTAG TACATCGTTA TGTCCGCCAA TATATTTCGT GGCTGAATGT AAGACTATAT 8880 

CAGCACCTTC TGCTAGTGGT GTTGAAAGAT AAGGTGTTAA AAAAGTATTG TCGATAATTG 8940

ACAATAAGCC TTTAGCTTTA CAAAGTTGAT AGTATGGCTT TACATCAATA GCAATCATTT 9000 

GTGGGTTAGA TATTGGTTCA ATGAATAATG CAACTGTTTT ATCAGTGATT TCTTTTTCAA 9060 

CTTGTTCATA ATCTGTAAAA TCAACGTACT TAAATTTGAT ATCGTATTGT TGCTCGTAAA 9120

ATTCAAATAA TCTAAATGTG CCACCATATA AATCGAATGA AACTAAAATT TCATCATGAG 9180 

GTTTAAATAG ATTACATATT AATTGAATGG CTGACATTCC ACTTGATGTA GCGAATGATG 9240

CAATACCATG CTCAAGTTTG GCAAAACAGG TTTCAAATGT TGAGCGTGTA GGATTTTTAG 9300

TACGTGTATA ATCAAAACCT GTCGATTGTC CTAGTTTTGG ATGCTTGTAG GCAGTAGATA 9360 

AATGGATTGG ATTCGCTATA GCACCGGTTG AATCATCGGT TAATGTGATT TGGGCTAACT 9420 

GTGTATCCTT CATATTAAGA CCCTCCTATA AGAAAAAATA AAAAAAGCTT CCGTCCTTCG 9480 

TACCCGAATG AATCGGATAA AAAGGACGAA AGCTTATGTT TCGCGGTACC ACCTTTATTT 9540 

GTTATTCCAT CGCTGAAATA ACCTTATTCA GTACGCATTA AAAGTAAATA TGCTTACTGA 9600 

ACAATTATCA CAATTAAAGT CAGTAAGTAA GGATATAGTA ATGTGCTATC CCATACTTAT 9660 

TAACAAAAAA TCGTGCGTAA AGAATCCAGT ACGCCATTTA ACATCAATGT TAATACTGTA 9720 

TCGCTATAAC GGGCGAACCC GTAGACACCT CATATTGGCA TCAACACTCC AAGGCCATTT 9780

TCAAACACGC TTTCAAAATC TTCTCTCAGC TACTAAAGAC TCTCTGTATA AGCAGGGTGT 9840 

GTTTTACTTy CCTCTTTATT GTGTTTACGT TTCATTAAAC TGTTATAAGA TATTAATTAG 9900 

CTTACAGAGT AAAAAAAGAT TTGTCAACAA TTATTCAGAA AATTTTGATT TAAAAGTTAA 9960

TTTGTTTGTG AAATTGTAAT TGGTATCTTG AAGTTGAAAA ATGAATTATT TTTTAAATAA 10020 
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20 
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TCAAATAAAA AGTGATGTGA GTGAATTGTC AAAAAGTGAA 
GATCAACGTA TTACTAAAAC 10140

AAAAGATGAA CAAATTAAGC AAATAGATAT ATCGGATATC AAACCGAATC CGTATCAGCC 10200

CCGAAAAACT TTCGATGAAA ATCATTTAAA TGATTTGGCA GATTCAATTA AGCAATATGG 10260

AATTTTGCAA CCAATTGTGC TTAGAAAAAC AGTTCAAGGT TATTACATTG TAGTTGGTGA 10320

AAGAAGGTTT AGAGCTTCGA AAATTGCTGG TCTAAAATAC GTATCAGCGA TTATCAAAGA 10380

TTTAACAGAT GAAGATATGA TGGAACTGGC GGTCATCGAA AATTTACAAC GAGAAGACTT 10440

AAATGCGATT GAAGAAGCTG AAAGTTATCA ACGTTTGATG ACAGATTTGA AAATTACACA 10500

ACAAGAAGTA GCGAAACGAT TGAGTAAGTC GCGCCCGTAT ATAGCGAATA TGTTGAGGTT 10560

ATTACATTTG CCGAAAAAGA TTGCTGACAT GGTAAAAGAT GGGCGACTGA CAAGTGCACA 10620

TGGACGAACG TTATTGGCAA TTAAAGATGA ACAACAAATG CTTAGGTTAG CGAAACGGGT 10680

TGTTAAAGAA AAGTGGAGTG TCAGATATTT AGAAAACCAT GTTAATGAAT TAAAAAATGT 10740

TTCGTCAAAG TCGGAAACAG ACAAAGTAGA TATAACTAAG CCTAAATTTA TAAAGCAGCA 10800

AGAACGACAG TTGCGAGAAC AGTATGGTAC CAAAGTAGAT ATATCAATAA AAAAATCGGT 10860

TGGTAAAATC TCATTTGAGT TTGATTCACA AGAAGATTTT GTGAGAATAA TTGAACAATT 10920

AAATCGTAGG TATGGTAAAT AGTTACACAA TTTTATATAA TAACTCTTTG TGCAAGTGTA 10980

AATAAATTGT AATCAGTGAC ATTTGATTCT AGAT 11014

(2) INFORMATION FOR SEQ ID NO: 92:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6022 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 92:

TCCCCTTATG GAATTTCACA TTCTAGTTTA CATAATATAT ATTATAGGAA GTTATATGTG 60

TGTAACGCAA AAgGTACCCT ACATCATAAT CATTATCTAA TATCGTCACA TAACTTACTT 120

ATGCTATAAT CATGGTATTA TATTGTTTGG AGTGATTTGA TGAGATTTGT CTTTGATATT 180

GATGGTACGC TTTGTTTTGA CGGCCGATTA ATTGACCAGA CTATTATTGA TACATTGTTA 240

CAATTACAAC ATGATGGTCA TGAACTTATA TTTGCATCAG CACGTCCGAT TCGTGATTTG 300

TTGCCAGTTT TACCATCAGT ATTTCATCAG CACACATTAA TTGGCGCAAA TGGTGCTATG 360

ATTTCACAGC AATCAAAGAT TTCTGTTATC AAACCAATTC ATACTGATAC ATATCATCAT 420
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GCTGCACAAC TTGACGCTGn AGAACGCGAT TTTTGAGCGT TTAGATCCAC ATAAGCTGGC 54 0 

CAGTTGTATT GATGTTGCAA ATATCGACAC GCCAATCAAG AkTATTTTAT TAAATATAGA 600 

CCCGGCACAA ATTACAACTA TATTAGACGA GCTAGATAAA TACCATCAAG AATTGGAAAT 660

GATTCACCAT TCAAATGAGT ATAACATTGA TATAACAGCG CAAAATATTA ACAAATATAC 720 

TGCATTACAA TATATATTTG ATGCAGATGT TAAATATATA GCATTTGGTA ATGACCACAA 780 

TGATATTGTC ATGTTACAAC ATGCTAGTAG TGGCTATATT ATAGGACCAT CAGAAGCATA 840 

CACACACGCA ATATTGAAAC TTGATAAAAT CAAACACATC AATAATAATG CACAAGCTAT 900 

TTGCAAAGTC TTAAAATCAT ATAAATAAAA ACACCCCTAT CAAATGATAA TCATTATCAA 960 

TCGATAGGGG CTATTTTAAT AAAATTCGTC CTCGAACATT TCTTCCTCTT CATCTAATCC 1020 

AAATAATTCT GCCATTTCTC CATGTTCAAT TAACATGTTT AAATATGCAT CGCGGAGTTC 1080 

TTCTTCACTC ATATCATTAA TCATTTCTTT AAGACTATCA ATCCACATAT TTCTGCGTAA 1140 

TTGATAGTCT TCTTCAACTT CGTTTAACAT CATTATATGT TTATTTGCTG CTTCTGGACT 1200 

AGCTGTAAAG AGTAATGCAA TCATATGTTT ACATATCACT CGTCTTCCAT CAGCATGAGG 1260 

ACAATTACAT ATGGATTTTC TAGGATGTTC CATATCAATA TAACAACGAT ATACTTTGTT 1320

GCCACTGCCC TTTACTTCAG CCTCATGCTG CGTTTCTGAA AATGATTTTA AGTTAATGAC 1380 

GCATTCACTT TGATAATAAT TAAAGCCTCT TTCTATAGAA CGAATACTTG CAATATCAAG 1440

TAATCCCATT AATGaTACTC CTTTTTATTA TTATTTTTAA ATAAAGAaAA TAAAATAGAT 1500

AAGTGTCTAG ATTAAAATAC TTGATTTATC TATATTTTAT AACAAGTCTA GAATTATCGC 1560

ATTCTTAAAT AACTAATATG AAAATGcTTG CACTAATTCt TTTGTATAAG GGTGTCTATC 1620

AACATTAAAT AATTCCtCTA TTGCAAAATC ATCGACTATC ATGCCATCCT TAAGAACGAT 1680

AATTCTATTA ACTAAGCGTT GTAACACGGA TAAATCATGA GAAATAACGA TAAAATGATT 1740 

TAAGTTCGTA ATCGTTTGCG CTTTTAATAT ATTGATTACA TTTTGTTCAG CTATAACATC 1800 

TAAATTTGAA GTTATCTCAT CACATATTAA AACGCGAGGC TGTGCTAATA ACGAACGCAT 1860 

GACATTAAAT CTTTGTAATT GTCCGCCACT CACTTCGCTT GGTAATTTAG TCAATAATTG 1920 

CGCGTTTAAC TCAAAAGTAG ATAAATGTTG TAATAATAAT TGATCCTGAG CAGTATTATC 1980

AGTTAGACCT CTGTAATAAT ATAACGCTTC TTTTAATGAG GTCTCAATCG TCCAATCAGG 2040

GTTAAAGCTA GTTAAAGGGT GTTGGAAAAT CGGTAACACA GCATTGTCAC TTAAGTAAAT 2100 

CTCTCCTTTA ACAGGTTTAA ACAAGCCAAG AACCAATGAA GCGAGCGTAC TTTTACCACA 2160 

GCGACTTTCG CCTAAAATAC CAACATTTTC TCCATCAGGT ATAGTAATAT TGATATCTTG 2220
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CCCTCTTTAA TTGTGTTCTA TATTTAATTA GACGTTCAGT ATACGGATGC AAATGCTCAT 234 0 

ACTTGAAATG ATTAATATTA CCTCGTTCAA TGATTTGACC TTCTTTTAAA ACATAAATGT 2400 

ACTGACAATA TTTCAATACA TGACTTAAGT TATGTGTGAT AATAAATAAT GTTTGACCAT 2460

GTTCTAATAC AATATGCTGT AATAAATCCA TCACTTGATT ACCGTTCAAA GCATCCAATG 2520 

ATGCAACTGG TTCGTCTGCA ATGATTAATT TAGGCTCCAA CATGAGAACG CTTGCTATGT 2580

ATACGCGTTC AAGTTGGCCC CCAGAAAGTT GGAAACTATA TTTATTTAAT ATATCTTTGC 2640 

TTTGTAAATT AACCCACGAC AAAGCCTTAT CAACTTTGGA CAAAGCCTCT TCTTTACTAC 2700 

CTTTATAATG CTTACGATAA ATCGCAGTTA ACTGTTTACC TAATTTAGTA TGGTCGTTAA 2760 

AACTTTCTGC ATAATTTTGA GAAATATAGC CAATTGTATG ACCATAATAT TGACTCAATC 2820

TACTAACATT TTCCCCATCA AATTGGTACG AATCATACGT GCAGCTTAAA TCAAATGGTA 2880

AATATTCAAG TAAAGCTTTA GCAATCAAAC TTTTTCCAGC GCCGCTCTCT CCAATCAAGG 2940

CATTAATCTG TTGACTAAAA ATTTTCAAAT CAATCCCTTT AATAAGAGAT TTCTCACTAG 3000 

TATTCTTTAT TGTTAAATTT TGTATATCAA TGAGACTCAT CATATTCACC CCGTTGTTTC 3060 

AGCAATCTAT CTCTTAGTGC ATCACCGGTT AAATTAAAAA TTAAAATAGT TATAGCAATG 3120 

ACTGAAGCAG GTGCAATCAA CATAATTGGA TGAGACGAAA TAAAATCACG ACCTTGTTGC 3180

AACATAGCGC CCCaCTCTGG TGTTGGCGGT TGTGCACCTA ACCCAATAAA TGATAGTGAA 3240

CTTATATATA GAATGATTTT ACCGAAATCA ACGACCATCA AAACGATAAT AGCCGGTATA 3300 

ATTTTAGGTG TTAAATGACG TATTAATATT GTTCTTGTTG GTACATGAAA TAATTGTGCC 3360

ATTTTTATAT AAGGCTTATT CATTTCGCTA TTAACTATAC TTCTAGTCAA CCTTGTGTAA 3420 

TTCATCCATT TTATTAATGT AATTGAGATA ACTAAATTCC ATAAAGATGG TTGAAAAAAA 3480

CTTGcTAAAG CAATCATGAT GATAAATTCT GGAATACTTA GACCAACATC AATAAACCTT 3540

AACACTAATC GTTCAATCCA CCCTTTTTTG TATCCGGCAA ATAGACCTAG TGTAACACCT 3600

ATGACAACGA TAGCTATTAA TGTTAAAACA GTAACAAACA ATGTTGAACG TGCACCGATA 3660 

ATAATTCGGG TAAATAAATC TCTCCCATAA TCATCAGTTC CTAATAAATG CAACCAACTA 3720 

ATAGGTTCAA AAGTTTGTGA TAAATTGACT TTGGTTGCAT TTTCACTACT GACAAAGAAT 3780 

TGCAGTACAA TTACCACAAA AATAAATGCA ACGAATACAA AAAATATCAG GTTATTCTTT 3840

GAAAATATTT TATGCATGAC GGTCACTACT TTCTGATATC AATGGTGTAT TGGTTTTGAT 3900 

TTTTGGATTT CCTAATTGTA AACGCTGCTT CGGATCAAGT AATAACGTTA ATAAATCAGC 3960 

AATCGTATTG ATAATAACAA CGAAGAAGCC AATAAATAAC ACGCATCCTT GAATAACAGG 4020 
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ATTTTCAATC ACTACAGTAC CACCTATTAG ACTGCCAAGT GAAATCCCTA GTAATGGGAT 4140 

AATCGGCAAA ATTGTTGGTT TTAGTAAATC ATGAATTAAA ATATAACGTT CATTCATACC 4200 

GCGTAATCTT GATGCTTGTA CGATATTACT TTGCAATAAC ATCAATAAAT TAGAACGCAC 4260 

TAAACGAATG ATGTATGCAC ACATACCTAA AGATAGCGTG ATTACAGGTA ATATAAACTG 4320 

ACTTAGTATA ACGCTATCTA TATTCATTAA ATTTGTGACA ATAAATAATA AAATAATACC 4380 

GATAAAGAAC GCTGGTAAAC TAATCGATAG TGTTGAGATC ACTCTAATCA CTTTATCCGT 4440

CCACTTATGA AATCGTTTGG CTGCTATAAT GCCGAGCGGT ATAGATATGC ATAACGACAC 4500

TACTAATGTT GAAAATGATA TGAGTAATGT TATGGGTGCA TAGTTGAATA ATATCTGTGT 4560

TACCGGTTCT TTTGATTCAA AACTTTTTCC TAAATTAAAA TGTAATAAAT GATTCATCCA 4620 

ATGCCACCAC TGTACCAATA AAGAATCATT TAATCCCAAT TTATCTTTGG TTGCATTTAT 4680 

TTGTTCCGTC GACACTTGTG CTACATCAAG ATGTAATATT TTATCAACAG GATTGCCTGG 4740

TGATAATTTC ATTAAAATGA ATGTAAGTGT AGAAATAACA AATAAAACAA CTATCATTTG 4800

CATCAGTCTA TACAACATAG ACTTTATTAT GAACATAATA GTCCCCCTCC TTGTGTAAGT 4860

TACTAACACT TTCTTTTTAC ATGAGAATGG CGCATGTATA TGCAACTTAC ATATTAAGAA 4920

CTAACGTTCA TTATAGTATT ATCCATAAAG AAATTGAAGT ATATTTAATT TTTTAACAAA 4980 

ATCATTATAA AATATAATAT TTTGAATCAA GTCAACCATG TAAAATATAA AAAAGTCAAA 5040 

ACAAAAACAA CTATAGCACT GTATTCCATC TCTTTCGAAA TAATTGTTAC TGCAGTGTAA 5100

CTTAAAAGTC GATGATTTTG TGCATATAGT TGTCGAATAT TATTTTTTAT CTTTACGGCG 5160 

AAGTTCAGCG CCCTCATAGC CGTATTTTTC AATTTGCTTT TCTAATTTAC GCGCTTTTCT 5220 

TTCTTTACGC CAATTTCTAG TAAAATACCA TAATAGAAAA CTAATTAATA AACTCATAAT 5280

CGCTAAAAAT GCAGCGTATC CTAATAATGG TTGATATTTT ATATCTTGAA AATTTGGAAT 5340 

AAAAAATGCA AGCACACCTA ATATAACAAA TGTAATTACT GCAGATACAA ACCATTTATT 5400 

TAAAACTAAG CAACAGAATA TTGTTAATAA AATCATTATT AATGTTGTGA TCCATAAATA 5460

ATTAGGCATA TCGAATAATG TCATATTCAT TCTCCTTTTA TTTCATTACT TTCCTTGTAT 5520

ACATTTTATT ATAAATTTTT AAAAACTTAA ACAATAGCAG TCAGTTTCAA GCAATATTCT 5580

ATCTACTAAT AGAAAAATCA TTGTTCCTTG CGACATGGAA ATCGTAACAT TATCGTTTAG 5640

GAGACAAAAT TATGTATAAT GAATGTATTA TACCAAAGGA GTGATTATAT GTCTCAAGGT 5700

TTACCTTTAA GAGAAGATGT TCCTGTTTCA GAAACATGGG ATTTAGTAGA CTTATTTAAA 5760 

GATGATCAAC AATATTATGA AAGTATTGAC GCTCTAGTAC AnCAAGCAAA TCAATTTCAT 5820 
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GAAAATATTT TAATTGCCTT AGATCGCTTA AGTAATTATG CAGAACTACG TTTAAGTGTA 5940 

GATACTAGTA ATATCGAGGC ACAAGTATTG AGCGCTAAAT TATCTACTAC ATACGGTAAA 6000 

ATTGTTAAGC CAATTATCCT TT 6022 
(2) INFORMATION FOR SEQ ID NO: 93: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 476 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 93:

CCATCAATAA TGTATACATG ATTGGCATCA TATTCCCCTT TAATTAGAGA GCTACGTACA 60 

GTTTGTyTTA TTAAAGTAGA ACTAATAAAT AACCATCTCT TATGTGCACA AACACTTCCC 120

GCAACAATTG ATTCAGTTTT ACCAACCCGT GGCATACCTC TAATGCCAAT CAACTTATGA 180 

CCTTCTTCTT TGAACAATTC AGCTAAAAAG TCTACTAACA AGCCTAAATC TTCACGCTCA 240

AATCGAAAGG TTTTCTTATC TTTTGCATCT TGCTCAATAT ATCTTCCATG TCTTACTGCA 300 

AGACGGTCTC TTAATTCTGG TTTTTTAAGC TTTGTTATTT CAATTTCATT TATACCACGA 360 

GCTATTTGCT CAAAACGTTC AACTTTTTCA AGATTGTCTG TTTTAATTAA AAGGCCTCGT 420 

TTACCTTGAT CAACACCATT AATTGTAACA ATACTTATAC CTAACATACC TAATAA 476

(2) INFORMATION FOR SEQ ID NO: 94: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3633 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 94: 

AGAAATACAA CGAAGCATAT AAATATAACC GATCTTTTTT CTAATTGAAT ATTAAGTAAG 60 

TGTATGTACT TTCTGGAAGT AGCACCTAGT rGGATTGTtC CTCCTACAAC AGGCCAAAAA 120 

TTTTTATTTT TAACTGGCTT AACAGTGTTC AGTTTTTCAT ACTCTTCTCT ACTAATTTTG 180

GCGCACCTTT TTGGAATGAA CCAATTAATA AATGGAAAAA AGTATACAAG CCAAGTTCTT 240 

ATTACATCGA CCATTAAATA CTCATCATCA TACTTAATAA CTCTGTATTT CGGATTTTTA 300 

TTGATAATTT CGGTTTCACA AAGCAATAAT TATCACTTCC TATTAATAAC AAATTCACAC 360 
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TTATATGACC TTAAATATAT AACATGAATC TTTTTGTCTA TTATTGAAGA CATATTTATA 480

AAGAAAAATA GCATTGTCAT AATAACCCAA GCAATAAATA CTATAATATT TTGGATAGAT 540 

AAACTAATCA rrACATCTAA GAACATGATT gATAATCCAC CACAGAAAAA ATAAGAAAAT 600 

AGTACAAAGC AAAGATTCTT GAATGATGGA AAAATCATAA TTTTTCCATT GCTACTCCGA 660 

TCATTATAGA TAGATAACTT TACTTTCTGA TTTAAATATA TATAAAACAC TAGAATACTT 720 

AATAATAAAA CCGAACAAAT GATAATAACG CAATTTTTTT CTAAATGAGA ATCAGGTATA 780 
TATATTTTAT CTCTAAACAT AGTGCCAAAT AAAAGTATGC TACCTATAGC TGGCCATAAA 840

GCTTTaTTTT TAACTGGTTT GACAATATTT AAATTATCAA AATCTTCTCT GCTGATTTGG 900
ACATATTTTT TTGGTATTAA CCAATTAATA AACGGAAAGA ACAAAACTAA CCAGGTGCTT 960 

ACTAAATCAA TCATCAGATA GTCGTTTTTA TATTTAATAA TTCTATATCT GGGATTTTTG 1020 

TTTACAACTC TAACCTCGCA AAGCAATATC TCCACTTCCG TCTCGTTGGT TTTATATCTA 1080 

ATACACTTTC AGATACTTTA TAAGTGTTTT GTATTTTAGT AACATACTAT TTTCCTGTTT 1140 

ATTACTTAAC TTACGAACTA CAATCTAAGT TTAGTAATTT CTATTGCTTT TTAAGTTTGG 1200 

CATAAACCTT TTTATTACTA ATTGAGCCCA TGCTTATTAG AAAGAAAAAA ATTGTAATAA 1260 

TAATCCACAT AATAAATACC AGTAGATTTT GAGGTTTTAT AGTCATTAGC CATATTAAAA 1320 

ATAATATAGA ACAACCTCCT AATAATAGAT ATGTGAAAAC TATAAAACTT CCATCTTTAA 1380

AAGTAGGCAC TAATATAACC CTATTTTCAT TATCTAGATT ATCATCATAT ATCTTTAGTT 1440 

TAAGCTTTTT ATTTAAGTAA ATGTAAAATG CTGCAATACC TATAAATCCT ATAAAACATA 1500

AAGATATTAA AATCTTATTA TCTAATTGAA CTTCAAACGT ATGTACATAT TTCCGTAAAA 1560

TAACTACAAA TAAAAACGAA CTACCAGTAA CTGGCCAGAA AATATTATTT TTATTTTGTT 1620 

TATCAACATT TAAATTTTCA AGTTCCTTCT CACTAAGTTT TGCATACCTT TTGGGAATGA 1680 

ACCAATTAAT AAAAGGAAAA AAGTATACAA GCCAAGTGCT TACTAAATCA ATTAACAAAT 1740 

ACTCATCATT ATATTGAACG ACTTTATATC TCGGATTTTT ATTAATAACC TTAATATTAA 1800 

AAAGCAAAAC TCACCACGCC CATTTCATTG GATTTATATG ATTGCTAATA ATATTTTTAG 1860 

CTTCACTAAC AGCATTCCCA ACACTATCCA TGGATTTTTC TGTAGTTTTT TTAACAACAT 1920 

CTATACTATT ATCGATTTTA TGCCCTACCC AGTCTACTTT ATCTTTTAAT CCAAAAATAT 1980 

TATTTTGATA AATTAAATCT GTTCCTAATG CAAATACTGT ACTCATAGCC AAACCTGCTA 2040

AAATCACCCA TCCTACTGGA TTACTTCCTA AAACAAAAGT CGCTAATCCA GCTCCAACTG 2100 

CTGTCCCTGC AGATCCAGCT GCAAGCGTgC ATACCATTAT GCGACAACGC CTCTCCAAAT 2160 
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CCTTTACCTA GGTATTTTCC GCCTTTTGCA AATTTACTAC CATTTTCTAT AAACACATTA 2280

CCTGATGTAC GTTTGACTTC CACAAATGAA TTTGGACCTG CTGGGCCTTT CACTCCACCT 2340

GCTGTATTGa TAAATACACC GAATTTACTT GcATTTATAC CGTCTTGCTC TAAAAGTGTT 2400

GACGTAATAT CTAATCCTAT ATCTCTTTTA ATACTGTCTT TATTGTCATT TATATATTTC 2460

AATATACTTT TCGGGATATC GTCTTCTGGA TGTTCTTTGG CATATGCCTT TATAACAGCA 2520

AAGTCTGCTT TATTTAAAGT TTCTTTCTCT GCTTTATGTT CAATTTTCCC CATAGCAACT 2580

TTCAAATATT TTTCATGACT TGCTTTGGCC CAATCAAGTT CTTTACCTGA AGGAATATTA 2640

AATTGATTTG TTGAAAAGTT CCAAAAATTC TGCGCTTGGG TAAGTCCTTG TTGGACAATT 2700

TTTTGAAATT CTTCAACTTC TTTAAATATT TCTGGTGATT TTTGATTAAA CTCACGCAAT 2760

TTGCGTAGCT TCTCTTCTAA TTCATGTTTT TGTTGACCTA ATGTTCGTAT TATTTGTTGG 2820

TTCGATGAAA TGGCTTGCTG ATTATCGGAA GCATGCTTTT TCAAATTGTT ATTCAAATTT 2880

TCATATCGCG TAATTTGTTG ACTTAATGAT CTGATATCTT CTTCAAGCTC TGATTCTTTT 2940

AAAGATATGC TATCAACCTC ACTCGTATAA CGTGACACAA AATTaTCGCA AGCTTGCTTC 3000

GTTAAATCAC TCAATGTTTT CATACTTGTT GATAATGGAA TTAACACCGT ACTAAAAAAT 3060

TGCTTAGCTG ACGTATACGC TTTCCCTTTA AGCGCATCAT CATTAATAAA TTGAGTAATT 3120

GCTTTTTCCA ACGCATCATA ATTTGAATTC ATTGTTTGAC TCAAATTCCC CACACTTGAA 3180

GCTTGGTTTC GAGATCTGTC TAAATACATG TCAATACTCA TCGGCATGCT CCTTTTTCAA 3240

AAATATATGA TTTTCAAACT ATTTAAAATC AAATGCTTTT TACATCTACA AAGTTGTAAA 3300

ATTTTAAAAC TCGGCGATGA TTATTTCTTA TGTAAAGGAG TCTAGATGCA GGTAAATTGA 3360

GATAACATGT CGCCTTTTTT CTTATTTTAG CATATGGATA TAATGGTGTC TTTGTATATT 3420

CGCAffTTAAT CAATAAAAAT TATCTTTCAA TATTTTAATT TTATTGCGAC AACATCCTTA 3480

ACATTAAATA TATTAATATC TCAAAATATA TTCACTATTA AAATATGTCA TCAGTTGTTA 3540

AAAGTATTTC CTCATCATGC GAAATATCAA AACGTATCTA AAATACGAAT AAGTTTATAC 3600

AATCACACAA CATCATCATT CAAAATTTTA TTG 3633

(2) INFORMATION FOR SEQ ID NO: 95: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2365 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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TGATACGAAt GCATTACAAT TCATATGCAA CATACAATTC CTTCTACAGC AAATGAAGTG 60 

AAACAAATAG TTGATGTGAC ATCTGTAGCA GAAAATGATA CGCATTAGTC ATAAAATTAA 120 

ATGGAAATGT CGATGAAGTG TATCAGCAAT TACAGCGATT AATTAAGAAT GCTAATGTCG 180 

AAGAGAGTGA GAATACTGAC AATATTAATA GTCAAGATAC AAGTTATACA CCTCAAGTAA 240 

AAGTAACAAC ACCAATTTTA GTGAAAGCAC CAATCGCTGG TCGTCGTATT TTACTTAAAG 300 

AAGTAAGAGA TTCAATTTTT AGAGAGAAAA TGGTAGGTGA AGGCTTAGCA ATCAAAGCTC 360 

ATGAAGAATC CAAAGTAATC GCACCGTTCA ATGGTTTAAT ATCTATGATT GTACCAACTA 420 

AGCATGCAGT TGGTATTCAA TCAGAAGACG GTGTGGACAT AGTCATTCAT ATTGGCGTGA 480

ATACAGTTGA CTTGGAAGGT AAAGGGTTCA AGTGCTTTGT AAAGCAAAAT GATCATGTTG 540 

AAGCAGGGCA AACGTTGTTG CAATTCGACC AGCAATATAT ACAACAACAA GGCTACAATG 600 

CTGACGTTAT TGTCGTTATT AGCAACTCTG CCGATTTAGG AAAAGTAGAA CTGACAATGA 660

ATGAAATCAT TACGACTGAA GATGTTATTT TTAAAATATT TAAAAACTAG GAGTGTGTTG 720 

TAATAATGAC AAAATTACCG CAAAATTTCA TGTGGGGTGG CGCTCTTGCC GCAAATCAAT 780 

TTGAAGGTGG ATATGATAAA GGTGGTAAAG GGTTAAGTGT AATTGATGTT ATGACGAGTG 840

GTGCACATGG CAAAGCACGT CAGATTACAG AATCTATAGA TCCCAATCAC TATTATCCAA 900 

ATCATGAAGG TATTGATTTT TATCATCGTT ATAAGGAAGA TATTGCCTTG TTTAAAGAAA 960 

TGGGATTGAA ATGTTTACGT ACGTCGATTG CGTGGACACG TATCTTTCCG AATGGGGATG 1020 

AAGATGTGCC AAATGAAGAA GGACTCGCCT TTTATGATCG TATCTTTGAT GAATTAATTG 1080 
CACAAGGTAT TGAACCTGTT GTGACGTTAT CACATTTTGA GATGCCACTT CATTTAGCGA 1140

AACATTATGG TGGATTTAGA AATAGAGAAG TTGTCGATTA TTTTGTGCAT TTTGCGCGTG 1200 

TTGTATTTGA AAGATATAAA GATAAAGTTA CATATTGGAT GACGTTTAAT GAAATTAATA 1260 

ATCAGATGGA CACATCAAAT CCTATCTTTT TATGGACGAA TTCTGGGGTA GCATTGACAG 1320

AAAATGATAA TCCTGAAGAA GTCyTGTATC AAGTAGCACA TCATGAACTT TTAGCCAGTG 1380 

CyTTAGCAGT TCGTCTTGGT AAAGaGATtA ATCCgAaGTT TAAGATTGGr ACmATGATTt 1440 

CAmaTGTACC CmTTTATCCa TAwTCGTGTC ATCCGAAAGA TATGATGGAA GCACAAATTG 1500 
CGAATCGCTT ACGTTTCTTT TTCCCGGATG TCCAAGTGAG AGGTTATTAT CCAAGCTATG 1560

CTAAAAAAAT GTTGGCACGA AAAGGATATG ATGTTGGATG GCAAGAAGGG GACGACAGTA 1620 

TTTTACAGCA GGGCACGGTT GATTATATTG GCTTTAGTTA TTACATGTCT ACGGCTGTAA 1680 

AACATGATGT TGATACTACA GTTGAAAACA ACATCGTCAA CGGTGGTTTG AATCATTCTG 1740
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GATATACATT GAATGTGTTA TATGATCGTT ATCAGTTACC ACTTTTTATT GTGGAAAATG 1860

GTTTTGGTGC AGTTGATGAA GTGGTAGATG GACATATTCa TGATGATTAT CGCATTGAAT 1920 

ATTTAAAAGC ACATATTACA GCAGCGATAG AAGCAGTTGA TCAAGATGGT GTAGATTTAA 1980

TCGGTTATAC ACCGTGGGGA ATCATTGATA TTGTTTCATT TACAACCGGT GAAATGAAGA 2040

AACGCTATGG TTTAATATAT GTTGATCGAG ATAATGATGG TCATGGCACG ATGGAACGCT 2100 

TGAAAAAAGA TTCGTTCTAT TGGTATCAAC AAGTGATAGC ATCAAATGGA GATAAATTAT 2160 

AAAGGTATAT TATAAGTATT TTAGGGTTAG AGCCCGAGAC ATAAATTAAT ATAGTAGGAC 2220 

CTACAGTGTT ATAATGGCGG gCCCCCAACA CAAAGAATTT CGAAAAGAAA TTCtAcAGGT 2280 

aATGCaAGtT GGCGGGGcCC AACACAGAGA AATTCGAAAA GAAATTCTAc AGGTAATGCA 2340

AGTTGGGGAA GGACAGAAAT AAATT 2365 
(2) INFORMATION FOR SEQ ID NO: 96:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11050 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 96: 

CTGCGATACG ATTTGTTGAA AGTGGGGAAA ACAAAAAAGT TATCATTACC AATTTAGAGC 60 

AGGCATACGA AGCTTTGATT GGTAATAAAG GTACACACAT TCACATGTAG CACTTTATCA 120 

CGCGACAAAA CATTAAATAT GTTTCTCCGT TGATTCAAAT GAAAAAGTTG TCTGCTGACA 180 

CTTTGCAAGG TTTGAAGGAG TTTAACTTAT GACAGAAAAC TTTATTTTGG GTAGAAATAA 240 

TAAATTAGAA CATGAACTAA AGGCATTAGC AGATTACATT AATATACCAT ATAGTATATT 300

ACAACCATAT CAAAGTGAAT GTTTTGTCAG ACATTATACG AAAGGCCAAG TTATTTATTT 360

TTCGCCACAA GAAAGTAGCA ATATTTACTT TTTAATTGAA GGTAACATTA TTAGAGAACA 420 

TTACAATCAA AATGGAGATG TATATCGTTA TTTTAATAAA GAGCAAGTAT TATTTCCAAT 480 

CAGTAACTTA TTTCATCCGA AAGAGGTTAA CGAATTGTGT ACAGCATTAA CCGATTGTAC 540

AGTTCTTGGA TTGCCTAGAG AATTGATGGC CTTTTTGTGC AAAGCTAATG ATGATATATT 600 

TTTGACACTT TTTGCATTAA TAAATGATAA TGAGCAGCAA CACATGAACT ATAACATGGC 660 

ATTAACAAGT AAATTTGCTA AAGATCGAAT TATCAAATTG ATATGCCATC TATGTCAGAC 720 

AGTAGGATAC GATCAAGATG AATTTTATGA AATCAAACAG TTTTTAACTA TTCAACtCAT 780 
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TGAAAAACTT GTTGTTAAAG ATCATAAAAA TTGGTTAGTA AGCAAACATT TATTCAATGA 900 

TGTATGTGTT TAATATACAA TGTAAAATGA ATAAGTTGAA GATGAGGTCT AACGTACATT 960 

TATACGTTAG GCCTTTTTTG CTAGCATGAT GAATAATTTA AAATGTTAGT TAAATTTGAT 1020

TGTTGAAATT ACAGTAAAAT TTAAGGTGAT GAAAAATTTA GAACTTCTAA GTTTTTGAAA 1080 

AGTAAAAAAT TTGTAATAGT GTAAAAATAG TATATTGATT TTTGCTAGTT AACAGAaAAT 1140 

TTTAAGTTAT ATAAATAGGA AGAAAACAAA TTTTACGTAA TTTTTTTCGA AAAGCAATTG 1200 

ATATAATTCT TATTTCATTA TACAATTTAG ACTAATCTAG AAATTGAAAT GGAGTAATAT 1260 

TTTTGAAAAA AAGAATTGAT TATTTGTCGA ATAAGCAGAA TAAGTATTCG ATTAGACGTT 1320 

TTACAGTAGG TACCACATCA GTAATAGTAG GGGCAACTAT ACTATTTGGG ATAGGCAATC 1380 

ATCAAGCACA AGCTTCAGAA CAATCGAACG ATACAACGCA ATCTTCGAAA AATAATGCAA 1440 

GTGCAGATTC CGAAAAAAAC AATATGATAG AAACACCTCA ATTAAATACA ACGGCTAATG 1500

ATACATCTGA TATTAGTGCA AACACAAACA GTGCGAATGT AGATAGCACA ACAAAACCAA 1560 

TGTCTACACA AACGAGCAAT ACCACTACAA CAGAGCCAGC TTCAACAAAT GAAACACCTC 1620 

AACCGACGGC AATTAAAAAT CAAGCAACTG CTGCAAAAAT GCAAGATCAA ACTGTTCCTC 1680

AAGAAGCAAA TTCTCAAGTA GATAATAAAA CAACGAATGA TGCTAATAGC ATAGCAACAA 1740 

ACAGTGAGCT TAAAAATTCT CAAACATTAG ATTTACCACA ATCATCACCA CAAACGATTT 1800 

CCAATGCGCA AGGAACTAGT AAACCAAGTG TTAGAACGAG AGCTGTACGT AGTTTAGCTG 1860 

TTGCTGAACC GGTAGTAAAT GCTGCTGATG CTAAAGGTAC AAATGTAAAT GATAAAGTTA 1920 

CGGCAAGTAA TTTCAAGTTA GAAAAGACTA CATTTGACCC TAATCAAAGT GGTAACACAT 1980 

TTATGGCGGC AAATTTTACA GTGACAGATA AAGTGAAATC AGGGGATTAT TTTACAGCGA 2040 

aGTTACCAGA TAGTTTAACT GGTAATGGAG ACGTGGATTA TTCTAATTCA AATAATACGA 2100 

TGCCAATTGC AGACATTAAA AGTACGAATG GCGATGTTGT AGCTAAAGCA ACATATGATA 2160

TCTTGACTAA GACGTATACA TTTGTCTTTA CAGATTATGT AAATAATAAA GAAAATATTA 2220 

ACGGACAATT TTCATTACCT TTATTTACAG ACCGAGCAAA GGCACCTAAA TCAGGAACAT 2280

ATGATGCGAA TATTAATATT GCGGATGAAA TGTTTAATAA TAAAATTACT TATAACTATA 2340

GTTCGCCAAT TGCAGGAATT GATAAACCAA ATGGCGCGAA CATTTCTTCT CAAATTATTG 2400

GTGTAGATAC AGCTTCAGGT CAAAACACAT ACAAGCAAAC AGTATTTGTT AACCCTAAGC 2460

AACGAGTTTT AGGTAATACG TGGGTGTATA TTAAAGGCTA CCAAGATAAA ATCGAAGAAA 2520 

GTAGCGGTAA AGTAAGTGCT ACAGATACAA AACTGAGAAT TTTTGAAGTG AATGATACAT 2580 
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ACCAATTTAA AAATAGAATC TATTATGAGC ATCCAAATGT AGCTAGTATT AAATTTGGTG 2700 

ATATTACTAA AACATATGTA GTATTAGTAG AAGGGCATTA CGACAATACA GGTAAGAACT 2760 

TAAAAACTCA GGTTATTCAA GAAAATGTTG ATCCTGTAAC AAATAGAGAC TACAGTATTT 2820 

TCGGTTGGAA TAATGAGAAT GTTGTACGTT ATGGTGGTGG AAGTGCTGAT GGTGATTCAG 2880 

CAGTAAATCC GAAAGACCCA ACTCCAGGGC CGCCGGTTGA CCCAGAACCA AGTCCAGACC 2940

CAGAACCAGA ACCAACGCCA GATCCAGAAC CAAGTCCAGA CCCAGAACCG GAACCAAGCC 3000 

CAGACCCGGA TCCGGATTCG GATTCAGACA GTGACTCAGG CTCAGACAGC GACTCAGGTT 3060 

CAGATAGCGA CTCAGAATCA GATAGCGATT CGGATTCAGA CAGTGATTCA GATTCAGACA 3120 

GCGACTCAGA ATCAGATAGC GACTCAGAAT CAGATAGTGA GTCAGATTCA GACAGTGACT 3180

CGGACTCAGA CAGTGATTCA GACTCAGATA GCGATTCAGA CTCAGATAGC GATTCAGACT 3240 

CAGACAGCGA TTCAGATTCA GACAGCGACT CAGATTCAGA CAGCGACTCA GACTCAGATA 3300

GCGACTCAGA CTCAGACAGC GACTCAGATT CAGATAGCGA TTCAGACTCA GACAGCGACT 3360 

CAGACTCAGA CAGCGACTCA GACTCAGATA GCGACTCAGA TTCAGATAGC GATTCAGACT 3420 

CAGACAGCGA CTCAGATTCA GATAGCGATT CGGACTCAGA CAGCGATTCA GATTCAGACA 3480

GCGACTCAGA CTCGGATAGC GATTCAGATT CAGATAGCGA TTCGGATTCA GACAGTGATT 3540

CAGATTCAGA CAGCGACTCA GACTCGGATA GCGACTCAGA CTCAGACAGC GATTCAGACT 3600 

CAGATAGCGA CTCAGACTCG GATAGCGACT CGGATTCAGA TAGCGACTCA GACTCAGATA 3660 

GTGACTCCGA TTCAAGAGTT ACACCACCAA ATAATGAACA GAAAGCACCA TCAAATCCTA 3720 

AAGGTGAAGT AAACCATTCT AATAAGGTAT CAAAACAACA CAAAACTGAT GCTTTACCAG 3780 

AAACAGGAGA TAAGAGCGAA AACACAAATG CAACTTTATT TGGTGCAATG ATGGCATTAT 3840

TAGGATCATT ACTATTGTTT AGAAAACGCA AGCAAGATCA TAAAGAAAAA GCGTAAATAC 3900

TTTTTTAGGC CGAATACATT TGTATTCGGT TTTTTTGTTG AAAATGATTT TAAAGTGAAT 3960

TGATTAAGCG TAAAATGTTG ATAAAGTAGA ATTAGAAAGG GGTCATGACG TATGGCTTAT 4020

ATTTCATTAA ACTATCATTC ACCAACAATT GGTATGCATC AAAATTTGAC AGTCATTTTA 4080

CCGGAAGATC AAAGCTTCTT TAATAGCGAT ACAACTGTTA AACCATTAAA AACTTTAATG 4140

TTGTTACATG GATTATCAAG TGATGAAACG ACATATATGA GATATACAAG CATAGAAAGG 4200

TATGCGAATG AACACAAATT AGCTGTGATT ATGCCCAATG TGGATCATAG CGCATATGCT 4260 

AACATGGCAT ATGGTCATAG CTATTATGAT TATATTTTGG AAGTGTATGA TTATGTTCAT 4320

CAAATATTTC CACTTTCCAA AAAGCGTGAT GACAATTTTA TAGCAGGTCA CTCTATGGGA 4380

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TTATCTGCTG TGTTTGAAGC GCAAAATTTA ATGGATCTAG AGTGGAATGA TTTTTCAAAA 4500

GAGGCCATAA TTGGCAATCT TTCAAGTGTT AAAGGAACTG AACATGATCC GTATTACTTG 4560

CTAGACAAAG CTGTAGCTGA AGATAAACAA ATTCCAAAAT TGCTCATTAT GTGTGGTAAA 4620

CAAGACTTTT TATATCAAGA CAACTTAGAT TTTATCGATT ATTTATCACG CATAAATGTT 4680

CCTTATCAAT TTGAAGATGG ACCAGGAGAT CATGATTATG CATATTGGGA TCAAGCGATT 4740

AAGCGTGCTA TAACATGGAT GGTGAATGAT TAATTATTTC TTGGAAAATA TGTGGCTGCA 4800

TTAAATACAC AGAGTGAGAG ATACAAACTA TTTACGCACG ACTAACATTT CTAAGTGTTT 4860

AAATTATTTT TGTATTAATA TGATTGGCGC AATTTGCTGA TACACAAAAA TGTTTCTCGT 4920

GAAACTTAGA TTTAGCTTAT AGTTTTATCA TCATTTGTAT GACTTACATT ATAAATTTTA 4980

TTATAATGAG GTTAACGCTT TGAAAGGAGT CATCATCATG TCGACCAATA AAAACGATTA 5040

TGAGCATATG TTGTTTTATT TTGCATATAA AACCTTTATT ACTACCGCTG ATGAAATTAT 5100

AGAGAAGTAT GGTATGAGTC GTCAGCATCA TCGTTTTTTG TTTTTTATCA ATAAATTACC 5160

TGGTATTACT ATTAAATCAT TACTAGAAAT ATTAGAAATT TCTAAmCAAG GATCACATGC 5220

AACACTTCAA AAATTAAAAG AGCAAGGTCT CATTATTGAA AAAGTTTTAG AGACTGATCG 5280

ACGTGTCAAA AAATTATATT CGACGGATAA AGGCGATCAA CTCATTGCTG AATTGAACAA 5340

GGCGCAAGAT GAATTATTGC AAAATATATA TCAACAAGTC GGTTCGGATT GGTATGATGT 5400

GATGGAAGCA TTGGCTAAAG GgCGACCTGG CTTTGATTTT ATTAAGCATT TGAAAGATGA 5460

AAAAGAAAGC TAGCATCAGA AATGTTAAAA ATCTTCGCAT TCTTAAATTT AAAAAATATG 5520

TCAAAAAGTG TATAATAAAA ACATATAATT TAATTGAACT CAGTTTCAAC ACATCTTAGA 5580

AAGGAGTTTG AATGATGAAA AAATTAGCAG TTATTTTAAC ATTAGTTGGC GGTTTATACT 5640

TCGCATTTAA AAAATACCAA GAACGTGTTA ACCAAGCACC TAACATTGAG TACTAAATTA 5700

AACCATAAAA AATTCCCGAA CACCTTGTTA TAGTGCTCGG GAATTTTTTT ATGCTTTACT 5760

TGAATATATC AAATATTATT TTTGCGCTTT CTGTATTTTC GATATTACCA CTAAATGATT 5820

CTGATCTAGG TCCGTAAGCG TAgGTATTAA CATCCTCGCC TGTATGTCCA TCGGAAGTCC 5880

ACCCTGTATA AGATTTATCA TTTACTGGCT TCTGAATAGC GTGTTGTAGG GCTTTTGTTT 5940

GCGTTTCTAC TTCTGCGGAT TTTTCGTCTT TTTCTTTTTT AAGTAGTCTT TTTAGCTTTT 6000

TATTCTCTTT TTTAACCTTT TTCATATCAT CTTGTGAAAA TTCAAATCCA TAACCTTCAT 6060

TAATAACTTT TTCAGGGTCT TCACCTTTAG CCATTTTTTC TGTCATATAT GATCCAGAGT 6120

GTTTCATAGA TTTAATCGGT TGAGGATTCC ATTCGTATCC TTTATCTTTA CCAATTGTTA 6180



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ATTGAATGGC GTCATCGAAT GCTTTTTCAA AACCTTCCAT TTCAGACATA ACGCCTGTAA 6300

TATCGTTGGA ATGCGCTGAT TTATCTATAG AAGCACCTTC GACCATTAAA AAGAATCCTT 6360

TTTTATTGCG CTCAAGCTTA CTAAGTGCAC TTTGTTGCAT ATCAGCTAAT GATGGTTCGT 6420

CTTTAGAAGC ATCTATTGCA AGTGGCATAT TTTTATCTGC AAACAAACCA AGAACTTTAT 6480

CTTTATCAGA TTTTGATAAC TCCTTACTGT TCGTGGCAAG GTCGTAACCA TCTTTTTTGA 6540

ATTTTTTATC TAAATTGCCA TTACTTTTAC CGAAATATTT AGCGCCGCCG CCTAATAAAA 6600

CATCAACTTT ATGCTTTCCG TTGATTTTAT CTTTATAAAA TTGTTTAGCG ATTTCGTTTT 6660

TATCATCTCT AGAAGTCACG TGTGCAGCAT ATGCTGCTGG TGTTGCATCT GTTAATTCAG 6720

CTGTTGAAAC AAGACCAGTC GACTTACCTT TTTCTTTTGC ACGTTCAAGC ACCGTCTTTA 6780

CTTTCTGCTT GTTACTGTCA ACACCGATGG CACCATTATA TGTCTTATGA CCAGAACTAA 6840

AGGCTGTTCC GCCAGCTGCA GAATCAGTAA TATTCTGTTT TGGGTCATTT GAATATGTAC 6900

GATTTGTGCC TTTTAAATAT GAATCAAAAG CAGTAGGGGT CATTTCTTTA GCATGCGGAT 6960

CATTTTTATA ATAACGATAA GCTGTGTTAA ATGATGGACC CATGCCATCG CCAACTAAAA 7020

AGATAACATT TTTTGGATTT TTAGTATTAC CAACCGCGAA ACTTTCATCT TTAGAACTTT 7080

TATCGGATTG CGCAATTGCA GGTGTGACAG AACTAAAAAC CGTTGACACG ATAATAAGGT 7140

TAGCAACTGC AAATTTTGTG GCTTTTTTAA CTGATAACAT AAGACATCCT CCTGAGTATA 7200

TGACTATGTC TTCAGTGTAA AAGAGGAATT TtGAGCAATT ATGTAGTTTT AGTTAnAAAT 7260

ATGTAAACAG AGTGATTTAG AATAACAAAA aATGAATATA TATGACAATT TGTTATAGAA 7320

AGCGTTAGAA TAGAAGCGTG TGAAAATATA GAATTAAATA TAATTTGAGG TGGAAAAATG 7380

ATACTAGTAA TGTTATCTCC ATTATTAATC ATATTCTTTA TAGTGTTGTC TATTTTAGAA 7440

GAGCGTAAAC GTACGAAGAA AAAGCAACTC GAGAAAGAAA AAGCAAATAC ACTAAATCAA 7500

AATACAAATG ACACGGAAAG TTCAAATCAA GAGCCGTCAT TGCAGCAGGA TAAAGAACAA 7560

AAAGATAACA AAGGATAATT CAATTGAAGG AAGAAGATTA TAGATGAAAA TATTAATTGT 7620

TGAAGATGAT TTTGTTATAG CAGAGAGTTT AGCATCTGAA CTTAAAAAAT GGAATTACGG 7680

TGTTATTGTC GTTGAACAAT TTGATGATAT ACTGTCTATC TTTAACCAAA ATCAACCTCA 7740

GCTTGTATTG CTAGATATTA ATTTGCCAAC GTTAAATGGT TTTCATTGGT GTCAAGAAAT 7800

CCGAAAAACA TCTAATGTGC CAATTATATT TATTAGTTCC CGTATTGATA ATATGGACCA 7860

AATTATGGCA ATACAAATGG GGGGAGATGA TTTTATCGAA AAGCCATTTA ACTTGTCATT 7920

AACGATTGCC AAAATTCAAG CATTATTGAG ACGAACTTAT GACTTGTCAG TAGCTAATGA 7980
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ACAAAACATA CAGCTATCTT TGACTGAATT ACAAATATTA AAGTTATTAT TTCAAAATGA 8100 

AGaTAAATAT GTAAGTAGrA CTGCTTTAAT TGaAAAATGT TGGGaATCAG AAAACtTCAT 8160 

AGATGATAAC ACATTAGCTG TTAACATGAC GCGCCTGCTG AAAAAATTAA ATACTATTGG 8220 

CGTTAATGAT TTTATCATTA CAAAGAAAAA TGTCGGATAT AAAGTATAGG GTGAATGCAA 8280 

TGACCTTTCT TAAAAGTATT ACTCAGGAAA TAGCAATAGT CATAGTTATT TTTGCTTTGT 8340

TTGGCTTAAT GTTTTACCTG TATCATTTGC CATTAGAAGC ATATTTACTA GCACTTGGCG 8400

TTATTTTATT ATTATTACTC ATATTCATAG GTATTAAATA TTTAAGTTTT GTAAAAACTA 8460

TAAGCCAACA ACAACAAATT GAAAACTTAG AAAATGCGTT GTATCAGCTT AAAAATGAAC 8520 
AAATTGAATA TAAAAATGAT GTAGAGAGCT ACTTTTTAAC ATGGGTACAT CAAATGAAAA 8580

CACCCATTAC TGCAGCACAA CTGTTACTTG AAAGAGATGA GCCTAATGTT GTTAATCGTG 8640

TTCGTCAAGA GGTTATTCAA ATTGaTAACT ATACAAGTTT AGCACTTAGT TATTTAAAGT 8700

TATTAAATGA AACTTCTGaT ATTTCTGTCA CTAAAATTTC GATTAATAAT ATCATTCGCC 8760 

CAATTATTAT GAAATATTCA ATACAGTTTA TTGATCAAAA AACAAAAATC CATTATGAAC 8820 

CTTGTCATCA CGAAGTATTA ACTGACGTTA GATGGACCTC TTTAATGATA GAACAATTAA 8880

TAAATAATGC ACTTAAGTAT GCGAGAGGTA AAGATATATG GATTGAATTT GATGAGCAAT 8940

CCAATCAATT ACACGTAAAA GATAATGGTA TCGGTATTAG TGAAGCGrAC TTGCCTAAAA 9000 

TATTTGATAA GGGCTATTCA GGTTATAATG GCCAGCGCCA AAGTAACTCA AGTGGGaTTG 9060 

GTTTATTTAT CGTAAAACAA ATTTCAACAC ACACAAACCA TCCTGTTTCA GTCGTATCTA 9120 

AACAAAATGA GGGTACAACA TTTACGATTC AATTTCCAGA TGAATAAAAA CTTTCAATAT 9180 

TGTAAGTATA CTAGTAACAT TTTTTTACTA ATTTAAATGT TATTAGTATT TTTTTGTTTT 9240 

AATATAGAAC TAACAAAGAA ATGAGGTGCA TGCCATGTTG CTAGAAGTGn AACATGTAAA 9300

AAAGGTTTAT GGTAAAGGTT TGAATGCTAC GACAGCACTT AATCAAATGA ATTTATCAGT 9360

TGGAGCTGGT GaATTTGTTG CaATTATGGG TGAGTCTGGG tCAGGGAAGT CTACACTACT 9420 

AAATTTAATT GCtTCTTTTG ATGGACTAAC TGAAGGTGAC ATTATTGTGG ATGGCGCACA 9480

TTTAAATAAT ATGAAAAATA AAAGTAAAGC ATTGTATCGT CaACAAATGG TAGGTTTTGT 9540 

TTTTcAAGAT TTTAATCTTT TACCAACAAT GACGAATAAA GAAAATATAA TGATGCCATT 9600 

AATTTTAGCT GGTGCTAAAC GAAAAGATAT AGAACAAAGG GTACATCAGT TGGCAGTACA 9660 

ATTACATTTA GAGGGATTCT TAAACAAGTA TCCTTCTGAA ATCTCTGGGG GTCAGAAGCA 9720 

ACGCATTGCC ATTGCACGTG CATTAGTTAC TAAGCCGACG ATTTTACTAG CCGATGAACC 9780 
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TCAATTGGAA CAGACAATTT TAATGGTAAC TCATTCAAAT ATCGATGCGT CTTATGCAGA 9900 

GCGAGTCATT TTTATTAAAG ATGGGCGTCT ATATCATGAA ATATATCGTG GTGAAGAAAG 9960 

TCAATTAGCT TTTCAACAAC GAATAACAGA TAGCTTAGCA CTTGTGAATG GAGGAAGTGT 10020 

CAATATATGA AGTTAAGATT GTTATGnACA TAGTGCGACG TCAATTTATT ACGCAGCGAC 10080 

TTGTAATCAT TCCATTCATT TTAGCGGTAA GTGTACTATT CATGATTGAA TATACGCTTG 10140

TGTCAATTGG GTTAAATAGC TACATAAAAC AGAAGAATGA CTTCCTAGTA CCATTTATTA 10200 

TCATAGCTAA TTTTTTTATG GCGCTTTTAA CTTTTATTTT TATTTTCTAT GCAAATCACT 10260 

TTATGATGTC ACAAAGACGA AAAGAGTTTA GCATTTTTAT GACATTGGGC ATGACCAAGA 10320 

AAAGTATGCG TTTAATTGTA GTGATGGAAA CTATCTTACA ATTTGTGATA ATTTCAGTCG 10380 

TTAGTATTGC CGGCGGATAC TTACTTGGTG CGATATTTTT CTTGTTTATA CAGAAAATAA 10440 

TGGGCAGTGA AGTTGCGACG TTAAGGTATT ATCCATTTGA CTCTGTAGCG ATGTTTATTA 10500 

CTTTGATTAT CATTGCTGTA TTAATGGGCA TGCTACTTAT ATTCAACTTG TTTAGTATTA 10560 

ATTTTCAACG GCCGATAACT TATCAACATC GTTCCGATTC TAGTGTCATA TCACGATGGT 10620 

TGCGTTACGT TTTAATTGTT ATAGGAAGCG CAnACTATAT TTAGGTTACT TTATTGCATT 10680 

ACAACAAGAT ACGACGTTTG GTGCCTTTTT TAAAATATGG ATTGTCATAG GATTAGTTAT 10740 

TATCGGTACT TATGCATTTT TTGTAGGTAT AAGTGAAATA ATTATTAGTA TATTGCAGCA 10800

GGTATCAAAA GTTTACTATC ATCCACGGTA TTTTTTTGTG GTAGTTGGGA TGCGTGTACG 10860 

TCTTAAAATG AATGCAGTCA GTCTTGCAAC AATCACTTTG CTGTGTACAT TTTTGATTGT 10920 

AACGCTCACA ATGACATTAA CAACCTATCG TGATATGAAT CATACCATTA CGAAATTGAT 10980 

TACGAATGAT TakGATTTGT CATTTAGCGA CAATTCTAAG TCACAAaTAG AACGTCAACA 11040 

AACAATTGAG 11050 
(2) INFORMATION FOR SEQ ID NO: 97: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 983 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 97:
CGACATAACG AGGCAAGGGT ACATGATACT TTAGCCTCGT TTTTGATATG TATTTTTCTG 60 
AATATAAGGG CAATAGATGG TATTTTATAw TTTTTTTAAG GTAGTGATTA ACATAGATAT 120 
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TCAAGCGGAA CAGCATTATG CACCAGTATT AACGCATTTT TTAGATCCAA GAGGGCAATA 240 

TATATTGGAA GTGATTTGTG GCAGTTATGA AGATTTAAAC GTATCTTTTT ATGGTGGACC 300 

TAATGCTGAA AGAAAAAGAG CAATCATTTC GCCGAACTAT TATGAACCTA AAGAAAGCGA 360 

CTTTGAATTA ACTTTAATGG AAATAGATTA TCCTGAAAAA TTCGTCACTT TAAAACATCA 420 

ACATATTTTA GGGACATTAA TGTCTTTAGG TATCGAACGC GAACAAGTTG GAGATATAAT 480 

TGTGaATGAA CGAATTCAAT TTGTTTTGAC AAGTAGATTG GAATCATTTA TTATGTTAGA 540

ATTACAACGT ATTAAAGGCG CATCAGTTAA ACTTTATACT ATTCCAGTAA CAGATATGAT 600 

ACAATCTAAT GAGAATTGGA AAAATGAAAG TGCaCAGTTA GTTCTTTAAG GTTAGATGTT 660 

GTTATTAAAG AAATGATACG TAAATCACGT ACGATTGCGA AACAACTAAT CGAAAAAAAA 720 

CGTGTTAAAG TGAATCACAC TATTGTTGAT TCAGCAGATT TTCAATTACA AGCAAATGAT 780 

TTAATATCCA TCCAAGGTTT TGGTAGAGCA CACATTACTG ACTTAGGTGG TAAAACTAAA 840

AAAGATAAAA CGCACATTAC CTATAGAACA TTATTCAAAT AGTAATGATT TAAGGAGGAT 900 

AACAAATGCC TTTTACACCA AATGAaATTA AGAATAAAGA GTTTTCACGT GTaAAGAATG 960 

GTTTTAGAAC CTACTGnAGT TGG 983

(2) INFORMATION FOR SEQ ID NO: 98: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 10322 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 98:

TTTTGCAAAG CTTATTTTAT GTCAAACAGA TAGTCAATGT GAAACAAAGG TTAGTACATA 60

TAATCATCCA GACTTTATGT ATATATCAAC AACTGAGAAT GCAATTAAGA AAGAACAAGT 120

TGAACAACTT GTGCGTCATA TGAATCAACT TCCTATAGAA AGCACAAATA AAGTGTACAT 180 

CATTGAAGAC TTTGAAAAGT TAACTGTTCA AGGGGAAAAC AGTATCTTGA AATTTCTTGA 240

AGAACCACCG GACAATACGA TTGCTATTTT ATTGTCTACA AAACCTGAGC AAATTTTAGA 300 

CACAATCCAT TCAAGGTGTC AGCATGTATA TTTCAAGCCT ATTGATAAAG AAAAGTTTAT 360 

AAATAGATTA GTTGAACAAA ACATGTCTAA GCCAGTAGCT GAAATGATTA GTACTTATAC 420 

TACGCAAATA GATAATGCAA TGGCTTTAAA TGAAGAATTT GATTTATTAG CATTAAGGAA 480

ATCAGTTATA CGTTGGTGTG AATTGTTGCT TACTAATAAG CCAATGGCAC TTATAGGTAT 540
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GAATGGTTTC TTCGAAGATA TCATACATAC AAAGGTAAAT GTAGAGGATA AACAAATATA 660
TAGTGATTTA AAAAATGATA TTGATCAATA TGCGCAAAAG TTGTCGTTTA ATCAATTAAT 720 
TTTGATGTTT GATCAACTGA CGGAAGCACA TAAGAAATTG AmTCAAAATG TAAATCCAAC 780 
GCTTGTATTT GAACAAATCG TAATTAAGGG TGTGAGTTAG ATGCCAAATG TAATAGGTGT 840 
TCAGTTTCAA AAAGCGGGAA AATTAGAATA TTATACACCT AATGATATAC AAGTAGATAT 900 
AGAAGACTGG GTAGTTGTCG AATCTAAAAG AGGCATAGAG ATAGGTATTG TTAAAAATCC 960 

ATTAATGGAT ATTGCTGAAG AGGATGTTGT GTTACCTCTT AAAAATATTA TTCGCATTGC 1020 

TGATGACAAA GATATTGATA AATTTAATTG TAATGAACGA GATGCTGAAA ATGCATTAAT 1080

ACTATGTAAA GACATTGTAA GAGAACAAGG TTTGGACATG CGTTTAGTCA ATTGCGAATA 1140

TACATTAGAT AAATCGAAAG TTATTTTTAA TTTTACGGCG GATGATCGTA TTGATTTTAG 1200 

AAAATTAGTA AAAATATTAG CGCAACATTT AAAAACACGT ATCGAGTTGA GACAAATTGG 1260

TGTAAGGGAT GAAGCCAAAT TGCTTGGCGG TATCGGACCT TGTGGTAGGT CGTTATGTTG 1320 

TTCTACATTT TTAGGGGATT TTGAACCAGT ATCGATTAAG ATGGCTAAGG ATCAAAATTT 1380

ATCATTAAAT CCAACTAAAA TTTCTGGTGC ATGTGGTCGT TTGATGTGTT GTTTAAAATA 1440

TGAAAATGAC TATTATGAGG AAGTACGTGC ACAATTACCT GATATTGGTG AAGCAATTGA 1500 

AACGCCTGAT GGTAACGGGA AAGTAGTTGC TTTAAATATA TTAGACATTT CTATGCAGGT 1560 

GAAGCTTGAG GGACATGAAC AGCCACTTGA ATATAAATTA GAAGAAATAG AAACTATGCA 1620 

TTAAGGAGGC ATTATTACAT TTGGATCGCA ATGAAATATT TGAAAAAATA ATGCGTTTAG 1680 
AAATGAATGT CAATCAACTT TCAAAGGAAA CTTCAGAATT AAAGGCACTT GCAGTTGAAT 1740

TAGTAGAAGA AAATGTAGCG CTTCAACTTG AAAATGATAA TTTGAAAAAG GTGTTGGGCA 1800 

ATGATCAACC AACTACTATT GATACTGCGA ATTCAAAACC AGCAAAAGCT GTGAAAAAGC 1860

CATTACCAAG TAAAGATAAT TTGGCTATAT TGTATGGAGA AGGATTTCAT ATTTGTAAAG 1920

GCGAATTATT TGGAAAACAT CGACATGGTG AAGATTGTCT GTTCTGTTTA GAAGTTTTAA 1980

GTGATTAATC AAGCACACTC AAATAGTGTT ATAATTATAA ATGAATATGG TTTGGATAAG 2040

TCTGAGACAA TGCATGTTTC AGGCTTTAAT TGTGTATAAA GTTTTGGTGA TTGCATAAGA 2100 

GATGGCGGTA CTAAATGTTA TTATTAAGTG TGCACGCAgT ATCaTTAGTT ATAAAATGTA 2160 

GCTGTTAAAA q^CAAAAATA CATCGAATGT AGTTAGGCAT ATAATATAAA AAGAGTTTTC 2220

AATTACTCAA TAGAAAAAGG TTGTCTTCAT AGGAGTTAAA AATGTTAAAA GAGAATGAAC 2280 

GATTTGATCA ACTAATCAAA GAAGATTTTA GTATTATTCA AAATGATGAT GTTTTTTCAT 2340 
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TGGACTTATG TTCAGGCAAT GGGGTGATAC CCTTGTTATT GTTTGCGAAA CATCCACGAC 2460

ATATAGAAGG TGTTGAGATT CAAAAAACAC TTGTCGATAT GGCGCGACGC ACATTTCAAT 2520

TCAATGATGT TGATGAATAT TTAACAATGC ATCACATGGA TTTGAAAAAC GTTACTAAAG 2580

TATTTAAACC TTCACAATAT ACTTTAGTAA CGTGTAATCC GCCTTATTTT AAAGAGAATC 2640

AGCAACACCA ACATCAAAAA GAAGCACATA AGATAGCGAG ACATGAGATT ATGTGTACAC 2700

TTGAAGATTG CATGATTGCA GCCCGTCATT TATTAAAAGA AGGTGGCAGG CTAAACATGG 2760

TACATCGTGC AGAGAGACTA ATGGATGTCT TGTTTGAAAT GAGAAAAGTG AATATTGAAC 2820

CTAAGAAAGT CGTTTTTATA TATAGTAAAG TAGGGAAATC AGCACAAACG ATAGTAGTAG 2880

AAGGTCGAAA AGGTGGAAAT CAAGGTTTAG AAATCATGCC CCCATTTTAT ATTTATAATG 2940

AAGATGGTAA TTATAGCGAA GAAATGAAGG AAGTATATTA TGGATAGTCA TTTTGTATAT 3000

ATTGTAAAAT GTAGTGATGG AAGTTTATAT ACAGGATACG CTAAAGACGT TAATGCACGT 3060

GTTGAAAAAC ATAACCGAGG TCAAGGAGCC AAATATACGA AAGTAAGACG TCCGGTGCAT 3120

TTAGTTTATC AAGAAATGTA TGAGACAAAG TCTGAAGCAT TGAAGCGTGA ATATGAAATT 3180

AAAACTTATA CCAGACAAAA GAAATTGCGA TTAATTAAGG AGCGATAGTA TGGCTGTATT 3240

ATATTTAGTG GGCACACCAA TTGGTAATTT AGCAGATATT ACTTATAGAG CAGTTGATGT 3300

ATTGAAACGT GTTGATATGA TTGCTTGTGA AGACACTAGA GTAACTAGTA AACTGTGTAA 3360

TCATTATGAT ATTCCAACTC CATTAAAGTC ATATCACGAA CATAACAAGG ATAAGCAGAC 3420

TGCTTTTATC ATTGAACAGT TAGAATTAGG TCTTGACGTT GCGCTCGTAT CTGATGCTGG 3480

ATTGCCCTTA ATTAGTGATC CTGGATACGA ATTAGTAGTG GCAGCCaGAG AAGCTAATAT 3540

TAAAGTAGAG ACTGTGCCTG GACCTAATGC TGGGCTGACG GCTTTGATGG CTAGTGGATT 3600

ACCTTCATAT GTATATACAT TTTTAGGATT TTTGCCACGA AAAGAGAAAG AAAAAAGTGC 3660

TGTATTAGAG CAACGTATGC ATGAAAATAG CACATTAATT ATATACGAAT CACCGCATCG 3720

TGTGACAGAT ACATTAAAAA CAATTGCAAA GATAGATGCA ACACGACAAG TATCACTAGG 3780

GCGTGAATTA ACTAAGAAGT TCGAACAAAT TGTAACTGAT GATGTAACAC AATTACAAGC 3840

ATTGATTCAG CAAGGCGATG TACCATTGAA AGGCGAATTC GTTATCTTAA TTGAAGGTGC 3900

TAAAGCGAAC AATGAGATAT CGTGGTTTGA TGATTTATCT ATCAATGAGC ATGTTGATCA 3960

TTATATTCAA ACTTCACAGA TGAAACCAAA ACAAGCTATT AAAAAAGTTG CTGAAGAACG 4020

ACAACTTAAA ACGAATGAAG TATATAATAT TTATCATCAA ATAAGTTAAT CACTTTATCG 4080

ATTaTATGAA ATTTTAAACG ATTTTATAAA CGCAAGCTGT AATTTTAAAT GGTAAGTTAT 4140
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GTTTTTTAAT GTAAAATAAA TACATTGAAA GTAATAAATA CCTTAACATT GAATAAGATG 4260 

AAAATGAGAT GACGAGATAA ATGTTCGCGT CCGTTGAAAT GCATAGAAAT CTTAGATATT 4320 

ATTTGAAGTG AGACATTACG AGGAGGAACA GTTATGGCTA AAGAAACATT TTATATAACA 4380

ACCCCAATAT ACTATCCTAG TGGGAATTTA CATATAGGAC ATGCATATTC TACAGTGGCT 4440 

GGAGATGTTA TTGCAAGATA TAAGAGAATG CAAGGATATG ATGTTCGCTA TTTGACTGGA 4500

ACGGATGAAC ACGGTCAAAA AATTCAAGAA AAAGCTCAAA AAGCTGGTAA GACAGAAATT 4560

GAATATTTGG ATGAGATGAT TGCTGGAATT AAACAATTGT GGGCTAAGCT TGAAATTTCA 4620 

AATGATGATT TTATCAGAAC AACTGAAGAA CGTCATAAAC ATGTCGTTGA GCAAGTGTTT 4680

GAACGTTTAT TAAAGCAAGG TGATATCTAT TTAGGTGAAT ATGAAGGTTG GTATTCTGTT 4740

CCGGATGAAA CATACTATAC AGAGTCACAA TTAGTAGACC CACAATACGA AAACGGTAAA 4800 

ATTATTGGTG GCAAAAGTCC AGATTCTGGA CACGAAGTTG AACTAGTTAA AGAAGAAAGT 4860

TATTTCTTTA ATATTAGTAA ATATACAGAC CGTTTATTAG AGTTCTATGA CCAAAATCCA 4920 

GATTTTATAC AACCACCATC AAGAAAAAAT GAAATGATTA ACAACTTCAT TAAACCAGGA 4980 

CTTGCTGATT TAGCTGTTTC TCGTACATCA TTTAACTGGG GTGTCCATGT TCCGTCTAAT 5040 

CCAAAACATG TTGTTTATGT TTGGATTGAT GCGTTAGTTA ACTATATTTC AGCATTAGGC 5100 

TATTTATCAG ATGATGAGTC ACTATTTAAC AAATACTGGC CAGCAGATAT TCATTTAATG 5160 

GCTAAGGAAA TTGTGCGATT CCACTCAATT ATTTGGCCTA TTTTATTGAT GGCATTAGAC 5220

TTACCGTTAC CTAAAAAAGT CTTTGCACAT GGTTGGATTT TGATGAAAGA TGGAAAAATG 5280 

AGTAAATCTA AAGGTAATGT CGTAGACCCT AATATTTTAA TTGATCGCTA TGGTTTAGAT 5340

GCTACACGTT ATTATCTAAT GCGTGAATTA CCATTTGGTT CAGATGGCGT ATTTACACCT 5400 

GAAGCATTTG TTGAGCGTAC AAATTTCGAT CTAGCAAATG ACTTAGGTAA CTTAGTAAAC 5460 

CGTACGATTT CTATGGTTAA TAAGTACTTT GATGGCGAAT TACCAGCGTA TCAAGGTCCA 5520

CTTCATGAAT TAGATGAAGA AATGGAAGCT ATGGCTTTAG AAACAGTGAA AAGCTACACT 5580 

GAAAGCATGG AAAGTTTGCA ATTTTCTGTG GCATTATCTA CGGTATGGAA GTTTATTAGT 5640 

AGAACGAATA AGTATATTGA CGAAACAACG CCTTGGGTAT TAGCTAAGGA CGATAGCCAA 5700 

AAAGATATGT TAGGCAATGT AATGGCTCAC TTAGTTGAAA ATATTCGTTA TGCAGCTGTA 5760 

TTATTACGTC CATTCTTAAC ACATGCGCCG AAAGAGATTT TTGAACAATT GAACATTAAC 5820 

AATCCTCAAT TTATGGAATT TAGTAGTTTA GAGCAATATG GTGTGCTTAA TGAGTCAATT 5880 

ATGGTTACTG GGCAACCTAA ACCTATTTTC CCAAGATTGG ATAGCGAcGG AnAATTGCAT 5940
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AACCTCAAAT TGATATTAAA GACTTTGATA AAGTTGAAAT TAAGGCAGCA ACGATTATTG 6060 
ATGCTGAACA TGTTAAGAAG TCAGATAAGC TTTTAAAAAT TCAAGTAGAC TTAGATTCTG 6120
AACAAAGACA AATTGTATCA GGAATTGCCA AATTCTATAC ACCAGATGAT ATTATTGGTA 6180 

AAAAAGTAGC AGTTGTTACT AACCTGAAAC CAGCTAAATT AATGGGACAA AAATCTGAAG 6240 

GTATGATATT ATCTGCTGAA AAAGATGGTG TATTAACCTT AGTAAGTTTA CCAAGTGCAA 6300 

TTCCAAATGG TGCAGTGATT AAATAACTGT ATTTTTAAAA ATTAGGAGAG ATAATTATGT 6360 

TAATCGATAC ACATGTCCAT TTAAATGATG AGCAATACGA TGATGATTTG AGTGAAGTGA 6420 

TTACACGTGc TAGAGAAGCA GGTGTTGATC GTATGTTTGT AGTTGGTTTT AACAAATCGA 6480 

CAATTGAACG CGCGATGAAA TTAATCGATG AGTATGATTT TTTATATGGC ATTATCGGTT 6540 

GGCATCCAGT TGACGCAATT GATTTTACAG AAGAACACTT GGAATGGATT GAATCTTTAG 6600 

CTCAGCATCC AAAAGTGATT GGTATTGGTG AAATGGGATT AGATTATCAC TGGGATAAAT 6660

CTCCTGCAGA TGTTCAAAAG GAAGTTTTTA GAAAGCAAAT TGCTTTAGCT AAGCGTTTGA 6720 

AGTTACCAAT TATCATTCAT AACCGTGAAG CAACTCAAGA CTGTATCGAT ATCTTATTGG 6780 

AGGAGCATGC TGAAGAGGTA GGCGGGATTA TGCATAGCTT TAGTGGTTCT CCAGAAATTG 6840 

CAGATATTGT AACTAATAAG CTGAATTTTT ATATTTCATT AGGTGGACCT GTGACATTTA 6900 

AAAATGCTAA ACAGCCTAAA GAAGTTGCTA AGCATGTGTC AATGGAGCGT TTGCTAGTTG 6960

AAACCGATGC ACCGTATCTT TCGCCACATC CGTATAGAGG GAAGCGAAAT GAACCGGCGA 7020 

GAGTAACTTT AGTAGCTGAA CAAATTGCTG AATTAAAAGG CTTATCTTAT GAAGAAGTGT 7080 

GCGAACAAAC AACTAAAAAT GCAGAGAAAT TGTTTAATTT AAATTCATAA AGTTAAAAGT 7140

GAGAAAGATC ACCGCCATAA ATGTAAACGA TGCTATATTC GTTTAATATG CTATGGTTCT 7200 

TTCTCACTTT TTTAAATTAA AATATCGTGC ATGTGGAATA CGTGCGATAG AGATGGTTAG 7260 

AGCTTTGAAA TTAAGAATTG TAGGAAGGCG TTTTAAATGA AAATCAATGA GTTTATAGTT 7320

GTAGAAGGAC GAGATGATAC TGAGCGTGTT AAACGAGCTG TTGAATGTGA TACGATTGAA 7380 

ACGAATGGTA GTGCCATCAA CGAACAAACT TTAGAAGTAA TTAGAAATGC TCAACAAAGT 7440 

CGAGGCGTTA TTGTATTAAC AGATCCAGAT TTCCCAGGAG ATAAAATTAG AAGTACAATT 7500 

ACTGAACATG TCAAAGGTGT TAAACATGCG TATATTGATA GAGAAAAAGC TAAAAATAAA 7560 

AAAGGGAAAA TTGGTGTTGA ACATGCCGAC TTAATTGATA TTAAAGAAGC GTTAATGCAT 7620 

GTTAGTTCAC CCTTTGATGA AGCTTATGAA TCAATTGATA AATCTGTGCT AATAGAGTTG 7680 

GGGTTAATTG TTGGGAAAGA TGCAAGGCGC CGTAGAGAAA TTTTAAGTAG AAAATTGCGA 7740 
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GCGGATGTAA GGCAAGCTTT AGAAGATGAA TGAGGAAGTG AAAATGTTGG ATAATAAAGA 7860 

TATTGCAACA CCATCAAGAA CGCOAGCGTT GTTAGATAAA TATGGCTTTA ATTTTAAAAA 7920 

AAGTTTAGGA CAGAACTTTT TGATAGATGT GAATATCATT AATAATATCA TTGATGCAAG 7980 

TGATATTGAT GCACAAACTG GGGTGATTGA AATTGGTCCA GGCATGGGGT CATTGACAGA 8040

ACAATTGGCC AGACATGCTA AAAGAGTATT GGCATTTGAA ATTGATCAAC GTTTAATACC 8100 

TGTATTAAAT GATACACTAT CACCTTATGA TAATGTGACG GTGATTAATG AAGATATTTT 8160 

AAAAGCGAAT ATTAAAGAAG CTGTTGAAAA TCATTTACAA GATTGTGAAA AAATAATGGT 8220 

TGTTGCAAAC CTGCCGTACT ATATTACGAC GCCAATTTTA TTAAATTTGA TGCAACAAGA 8280

TATACCAATT GATGGCTACG TGGTGATGAT GCAAAAAGAA GTGGGCGAAC GCTTAAATGC 8340

TGAAGTAGGT TCAAAAGCAT ATGGTTCGTT ATCAATTGTC GTACAATACT ATACAGAGAC 8400 

TAGTAAAGTA TTAACGGTAC CTAAATCTGT ATTTATGCCA CCACCTAATG TTGATTCAAT 8460

AGTTGTAAAA CTGATGCAGA GAACTGAACC GTTAGTAACA GTAGATAACG AGGAAGCATT 8520 

CTTTAAGTTA GCAAAAGCAG CATTTGCACA AAGAAGAAAG ACAATTAACA ATAACTATCA 8580 

AAATTATTTT AAAGATGGTA AACAACACAA AGAAGTGATT TTACAATGGT TGGAACAAGC 8640

AGGTATTGAT CCAAGACGTC GCGGTGAAAC GCTATCTATT CAAGATTTTG CTAAATTGTA 8700 

TGAAGAAAAG AAAAAATTCC CTCAATTAGA AAATTAAATG ATTGACAAAG CAAAGCACTA 8760 

TTGTTAAAAT TTAAATTTTG TTTGACGAAA ACGTTGCAAA TATGGTATTA TGTAACTTGT 8820 

AGCGAGGTGG AGCAATATGC CAAAATCAAT TTTGGACATC AAAAATTCTA TTGATTGTCA 8880 

TGTAGGAAAT CGTATTGTAC TGAAaGCCAA TGGAGGCCGT AAGAaAACAA TAAAACGTTC 8940

TGGAATTTTA AAAGAAACAT ATCCGTCAGT TTTCATTGTT GAGTTAGATC AAGACAAACA 9000 

CAACJTTGAG AGAGTATCTT ATACATACAC TGATGTGTTA ACTGaAAATG TTCAAGTTTC 9060 

ATTTGAAGAG GATAATCATC ACGAATCAAT TGCACACTAA ATAAGACATA TAGAGATGTT 9120

AGACGTTTCT TAGTATAAGA AGTAAATATT ATGATAATTA TTTGAGTGTT GGGcATTATG 9180 

TTCAATACTC TTTTTATTTA CAAAATGTTT AACACTGATG TTTCGCTTAT AGATTTTTCA 9240

GTAAATGGAT AATTGTATTT ATAAACACAA ATACAAGTAA ATACTAAGTA ATTAGATGGA 9300

GAAAATTACT TTTTTATTAA AAAAACACTA AAAAACAAAT TAAAATGTCA AATATTAATT 9360 

CTCTTTATGT TAAAATCATC ATATTAAGAT AACGAAAAGA GGGCGGAAAA TGATATATGA 9420

AACGGCACCA GCCAAAATTA ATTTTACGCT CGATACACTT TTTAAAAGAA ATGATGGCTA 9480

TCATGAGATT GAAATGATAA TGACAACAGT TGATTTAAAT GATCGTTTAA CTTTTCATAA 9540
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AAATCTCGCA TATCGTGCAG CGCAACTATT TATTGAGCAA TATCAACTAA AGCAAGGTGT 9660
AACAATTTCT ATCGATAAAG AAATACCTGT TTCTGCTGGC TTAGCTGGAG GTTCGGCTGA 9720
TGCAGCAGCA ACGTTAAGAG GATTGAATCG ACTTTTTGAT ATAGGGGCGA GTTTGGAAGA 9780
ATTGGCTCTA CTAGGCAGTA AAATCGGGAC AGATATTGCG TTTTGTATTT ATAATAAAAC 9840
TGCACTATGT ACTGGAAGAG GAGAGAAAAT CGAGTTTTTA AATAAACCAC CTTCAGCTTG 9900
GGTGATTCTT GCTAAACCAA ACTTAGGCAT ATCATCACCA GATATATTTA AGTTGATTAA 9960
TTTAGATAAG CGTTACGACG TACATACGAA AATGTGTTAT GAGGCCTTAG AAAATCGAGA 10020
TTATCAACAA TTATGTCAAA GTTTGTCTAA TCGATTAGAG CCAATTTCTG TTTCAAAACA 10080
CCCACAAATC GATAAATTAA AAAATAATAT GTTGAAAAGT GGTGCAGATG GTGCGTTAAT 10140
GAGTGGAAGC GGACCTACTG TGTATGGGCT AGCACGAAAA GAAAGCCAAG CAAAAAATAT 10200
TTATAATGCA GTTAACGGTT GTTGTAATGA AGTGTACTTA GTTAGACTAT TAGGATAGAA 10260
GGGTTGAAAA GATGAGATAT AAACGAAGCG AGAGAATTGT TTTTATGACG CAATATTTGA 10320
TG 10322

(2) INFORMATION FOR SEQ ID NO: 99:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5614 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 99: 

GATTGATTAA ATGTTTTAAT CCACTTCAAT GCCTTCGATA AACTCTACAA TCGCGCTATT 60 

CATATAATTA TTCGATTTCA TTTGTTCAGC ATATGTCTCA TTAAATCCAG ACATAACTTT 120 

TTTAAAwGCG AAAATTGAAA TTGGTATCGT TACTAATAAG GCACTAGCCA TACGCCAATC 180 

AATGAGCATT ATGTATAAAA AGATAGCAGC TGACAAAAGT AAGTTTCCTA TAACTTCAGG 240

AATCATATGT GCTAAAGGTA ATTCTATTGT TTCAACCTTA TCGACAAATA TATTTTTTAA 300 

TTCACCTATT TTCTTAGATT CCaCTACGCC TAAAGGGAGA CGCATTAATT TTTGAGCTAA 360 

TTTTTTACGA ATTTCAGATA AAATTTCATA TGCCGTAATA TGTGATAGCA TCGTTGACGC 420 

TCCAAAACAA CACACTTGTG AAATATAAGC GATTAAAGCA ATAAAGATAT AAACCATAAT 480 

CGAATTAATC GTATATGTAT TGTTAATCAT CATTAAAATA ATTTTAAATA CTGCCCAATA 540 

AGGAACTAAT CCAGAAAAGA CACTGATGAT AGACAACAAA ATTGATAACA TAATTTTCCA 600 
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ATATGTAACT CCTkTCAATT AATAATCTAA ATTAAGCCGC TTATATTATT TATTTCACTG 720 

GATGATATAC ATAATATAAA TTTGTTATTT GTTAAAAATT AATACTTATT ACAAGTACAT 780 

CATATATTAG TTGATAACGA TTATCAATGT CGCGTGGATT TGTGACACAT TTCTTTTAAA 840

AATTCACAAG GTTATGGGGC AGAAATGATA AAGAGCCACT AATGATTTAT TATGTAGTGG 900 

TTCTGGGAGT GGGACAGAAA TGATATTTTC ACAAAATTTA TTTCGTCGTC CCACCCCAAC 960 

TTGCATTGTC TCTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGTCCCT GAATATAGCC 1020 

TTGTAGAGTC TAGTACATTG ATTTGTATCC CAATGTCCCT ATAATTGATT ATTCGCTTTA 1080

TCTAATGATC CTATGACTCA ACTATTAAAT CATTTTTCGA AATACTTAAT TCTAATATAA 1140 

TTAAATTCAT TTATTGTAAT ATTGCAAAAA TACATTGCAC ACCTTGTTCA TCAATGCTAT 1200 

AATTAATTAC ATAATAAATT GAACATCTAA ATACACCAAA TCCCCTCACT ACTGCCATAG 1260

TGAGGGGATT TATTTAGGTG TTGGTTATTT GTCACCTTTT TTATTGTTGC GCGTTCGTAA 1320

CCAATGTGCA AAAAACGCAA CAAGACAGCC GCTTATAGCT GAAGTCATGA TGTTAATTAA 1380 

TAAATTGAAC ATCCGTCATA CACCTCCTCT CTGCGTTAAA GTAACGCCCG AGATGTTAGG 1440 

CGACCATCAT ATTATATCAT TTATTTATTA TATTTCACGC AATATTAAGG CTTAAGTAAA 1500 

GTTTTTTTTA GTGGTTTACG CTACTTTAAT TGCTATCTTT TAAAATCCAT TTAGATAATA 1560 

TAAATGTGAT GGGTATCGTA ATAATTAAAC CAGCAAATGG TGCAATTTCT GCTGGCAAAT 1620 

TTAGCCAGGA TACAAATACA TATAATAAAA CTGTTTGTAA GCTTACGTTG ACAATCTGCG 1680 

TAATTGGAAA ACTAATGAAT TTTCTCCAAG TAGGTTTTAC CCTGTAAACA AAATAACAAT 1740

TCAAATAATA TGAAATCACA AAAGCGACTA GAAATCCGGT AATATGACTA ATCATATATT 1800

CAATGTGTAA TAATTTTAAC AGCAATAAAT AGACAACATA ATAATTTAAC GTATTAATGC 1860 

CGCCAACAAT GATAAATTTT AAAATTTCAG CATGCGTTTG TGTTAGTTTC ATATGTGTAC 1920

TCCTCAACAT CAAAATATAT GCATAACTAC GTTCTCGAAC ATACTCGAAT ATGCGAGCCA 1980

ATCCGCTTCA CTTCAAATAT GCTTATTTCA ATCTTTATAC CCTTTCACAG CAAATTTAGT 2040

CTCTTTCCCC TCATCCTTAT ACGCCATTAT AATGTAACTG ATTTATCGCG TGACTCATTA 2100 

GCACTATAGA GATTACTTTA GTTCAGTAGT AATTTTATAT ACAATAAGAG CGACAACAGT 2160 

AATGAGAGGA TGTCTACTAT GCAATTACAA AAAATTGTCA TCGCTCCTGA CTCATTTAAG 2220 

GAAAGTATGA CCGCACAGCA AGTTGGCAAT ATTATAAAAC AGGCTTTTAC TAATGTTTAT 2280 

GGGAATACCC TTCATTATGA TATCATTCCG ATGGCTGATG GTGGTGAAGG TACCACAGAT 2340

GCTTTAATGC ATGCAACAGG TGCCACTAAG TATACAGTCA TCGTTAATGA CCCTTTAATG 2400
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GCGGCAGCGT CAGGTTTGGA TTTATTAGAA AAAGAGGAAC GTAATCCTTT ATACACATCA 2520 

TCATATGGTA CCGGTGAACT AATTAAAGAT GCATTAAATC ATGGTGCTAA GACCATTATT 2580 

TTAGGGATTG GTGGCAGTGC AACAAATGAT GGTGGTACAG GTATGCTAAG TGCACTAGGC 2640

GTAAAGTTTA CTGATGTAAA CGGGGACTTA TTACAAATGA ATGGTGCTAA TCTTGCTCAC 2700 

ATTGCACAAA TCGATATAAC CAATCTAGAT TCGCGATTAA AAGAGGTGAC CTTTAAAGTG 2760 

GCCTGTGATG TTTCAAATCC TTTATTGGGT GAAAATGGTG CTACCTATAT TTATGGTCCT 2820

CAAAAAGGCG CTGATGCAAA GATGATACCA AAGTTGGATT TCGCAATGTC GCATTATCAT 2880 

GATAAGATAA AAATGTGCAC AGGAAAGTCC GTTAATCAAA TACCAGGTTC TGGTGCAGCT 2940

GGCGGTATGG GCGCAGCATT ATTAGCGTTT TGTGAGACAA CTTTAACAAA AGGTATTGAT 3000 

GTCGTCTTTG ACATTACAGA TTTTCATCAA AGAATTAAAG ATGCAGACCT CGTTATTACT 3060 

GGAGAAGGAC GCATGGATTA TCAGACCATC TTTGGTAAAA CACCCGTAGG CGTTGCGTTA 3120

GCTGCAAAAC AATATCATAT TCCTGTCATC GCGATTTGTG GCAGTCTAGG CGAAAATTAT 3180 

CAACATGTTT ACGATTTCGG TATTGATAGT GCCTATTCTA TAATCTCTTC ACCTAGCACT 3240

TTAGAAGATG TCCTACAAAA TAGCGAACAA AATTTATTAA ACACTGCAAC TGACATTGCT 3300

CGTATTCTGA AATTACAATA ATGTCAAAGT AAATCATCAG CTTTATTATT TGCAGTTAAA 3360 

ACTTGAATGA GGTGAAACCC ATGAAAAGAA CTGATAAATA CCGTGATTCA TATCAATACG 3420 

ACAATCAAAA CCAAAATCAT CGTCGTCAAT CTGAAGACGC ATCGTATAGA CAACAATATG 3480

CTAAAGGCGA TCCTGAAGAA CACCCGGAAC GATACTATAA TGGTAGAGAT TATCGAAGAG 3540

AACAAATTCT TGAAGAAGAA AACGAGAAAT CCCGCCGTTC AAAAAAATGG TTATATATCA 3600

TTATTGCCAT TCTCTTAATT ATTGTCGCTA TTTTTGTCAC ACGCGCCTTA CTTAACAATG 3660 

ATAGCGATAA AGTTAGTAAT GACCCTAAAG TCTCTCAAAA TTATAAAAAA CAAGTTGAAA 3720 

ATCAAGACGG CCAAATTAAC CAGCAAGTAG ATAATGCTAA AGAAAATATT AAAAACAACC 3780

AAAAAACTGA TGACATTATT AAAAATTTAC AAAATCAAAT CGACAACTTG AAGCAGCAAG 3840

AACAAAACAA AGCTGATTCT AAGCTAACTC AATTTTATCA AGACCAAATC AACAAATTGA 3900

CAGAGGCAAA TAATGCACTT AAAAACAATG CAAGCCAAGG TAAAATTGAA AGCATGTTAA 3960

ATGATATTAA TACAAAATTC GACAGTATTA AATCTAAATT AGAAAGCTTA TTTAAAGATG 4020

ACAATGGTGG CGCTAATTAA TTATTACACC TGCTTTGATG ATAAACATTA ATTCCCTATA 4080 

CTTTATCTGT ATCACTACGT TATTCGTGAT GATGCATTAA GAGTATAGGG ATTTTTTATA 4140

TAAACTTGTA TTCTAACTAC ATACAAATAC ACACAAAACG TATATAATTT ATATAATTAT 4200
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TTATTGCTAA TTACGTTAGG CGTCATGACC GCTTTTGGCC CACTAACTAT AGATATGTAC 4320 

GTACCATCAT TACCTAAAGT GCAAGGTGAT TTTGGTTCTA CTACATCAGA AATTCAATTA 4380

ACATTATCAT TCACAATGAT TGGTCTTGCA CTAGGCCAAT TTATCTTTGG ACCTTTATCC 4440

GATGCTTTTG GTCGCAAACG GA7TGCTGTA TCCATTTTGA TCATTTTCAT TTTGGTATCA 4500 

GGTTTGTCTA TGTTTGTTGA TCAATTGCCA TTATTCTTAA CTTTACGATT TATTCAAGGT 4560 

TTAACTGGTG GTGGCGTCAT CGTGATTGCA AAAGCCTCTG CTGGTGATAA ATTTAGTGGC 4620 

AACGCACTCG CTAAATTTTT AGCATCTTTA ATGGTAGTTA ATGGCATCAT CACTATTCTT 4680

GCACCATTAG CCGGTGGATT AGCTTTATCC GTAGCAACAT GGCGTTCTAT TTTCACAATT 4740 

TTAACTA1TG TGGCACTCAT CATTTTAATT GGCGTCGCTT CTCAATTACC TAAAACATCT 4800 

AAAGATGAAT TAAAGCAGGT GAATTTTAGT AGCGTCATTA AAGATTTTGG AAGTCTTTTG 4860 

AAAAAACCAG CATTTATTAT TCCAATGCTA TTACAAGGwT TAACTTATGT AATGCTATTT 4920

AGTTATTCAT CTGCATCGCC ATTTATTACT CAAAAATTGT ATAATATGAC ACCCCAACAA 4980 

TTTAGTATCA TGTTTGCTGT TAACGGTGTA GGTTTAATCA TTGTCAGTCA AGTCGTTGCT 5040

TTATTAGTAG AAAAATTACA TCGCCACATA TTATTAATCA TTTTAACTAT TATACAAGTG 5100

GTAGGTGTTG CTTTAATTAT CCTGACACTT ACATTCCATT TACCACTTTG GGTCTTACTC 5160 

ATCGCATTCT TCTTAAATGT GTGTCCTGTG ACGTCAATTG GACCGCTTGG TTTCACAATG 5220 

GCTATGGAAG AACGAACAGG TGGCAGTGGT AACGCATCAA GTTTACTTGG CTTATTCCAA 5280
TTTATCTTAG GTGGCGCTGT TGCACCATTA GTTGGCTTAA AAGGCGAATT TAATACATCA 5340 
CCATATATGA TTATTATC7T CATTACAGCC ATTCTATTAG TCAGTCTACA AATCATTTAC 5400 

TTTAAAATGA TTAAAAAGCA ACATGTCGCA TAACACTTCA ACATAATTAG AACCCTAGCA 5460 
AAGATATCTA TCTTTGTCAG GGTTCTTCTT TATGAATTAT GAGATCGAAT CTTCAACTAA 5520 
AATTACGCCT TCATAGCAAG GACATTTCTA TTCAATCACC CTTTAACAGG CATCCAAATT 5580 

TcTGTAATAT ATTTTTCACT TGTAGTATCA CCAT 5614 

(2) INFORMATION FOR SEQ ID NO: 100:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9179 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 100:
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AAAGACAATG ATATGAAGTA TATGGATATC ACAGAaAAAG TGCCAATGTC GGAATCTGAA 12 0 

GTTAACCAAT TGCTAAAAGG TAAGGGGATT TTAGAAAATC GAGGGAAAGT TTTTCTAGAA 180 

GCTCAAGAAA AATATGAGGT TAATGTCATT TATCTTGTTA GCCATGCATT AGTAGAAACA 240 

GGTAACGGCA AATCAGAATT AGCAAAAGGC ATTAAAGATG GGAAAAAACG CTATTACAAC 300 

TTTTTTGGTA TAGGAGCATT CGATAGTAGT GCTGTTCGTA GTGGGAAAAG TTATGCTGAA 360 

AAGGAACAAT GGACATCACC AGATAAGGCG ATTATTGGTG GTGCAAAGTT CATTCGTAAT 420 

GAATATTTTG AAAACAATCA ACTGAATTTA TATCAAATGC GATGGAATCC AGAAAATCCT 480

GCGCAACATC AATATGCGAG TGACATTCGC TGGGCAGATA AAATTGCCAA ATTAATGGAT 540 

AAATCCTATA AGCAGTTTGG TATAAAGAAA GATGATATTA GACAAACATA TTATAAATAA 600 

GACATCGGTG CTTAAAGGAG CTGGAACAAT TTATTGTTTC GAGCTCCTTT AGCGCATTCT 660 

GAGTGTGTTA GTTAAATGGA TTTTAACCTA ACAAAAAACG CTATATAGCA TCAAATATGC 720

TATATCCCAC ATCATTGTTA CAAATGTACA TGATGTAAAT GAATATTGCT GTCTAAATGT 780 

GCATGTAATA TACAATGGTG CAGATAATAC ACTTAAGTCC TTAAAAATGA AACGTTAgTT 840 

CCAAGAGTCA TTTTTAAACA ATAGTGCATG TGATAAAATA GAAAAGAATG AAAAATATAG 900

AGGTGACAAT ATGAAGATAG CAATTATAGG TGCAGGCATC GGTGGATTAA CAGCTGCTGC 960 

ATTATTACAA GAACAAGGTC ATACTATTAA AGTCTTTGAA AAAAATGAGT CAGTTAAAGA 1020 

AATTGGCGCT GGGATTGGTA TCGGAGATAA TGTGCTTAAA AAACTAGGTA ATCATGACTT 1080 

AGCTAAAGGT ATTAAAAATG CTGGGCAAAT CTTATCTACA ATGACAGTGT TAGATGACAA 1140

AGATCGCCTG TTAACTACTG TTAAATTAAA AAGTAATACA TTGAATGTGA CGTTACCACG 1200 

CCAAACATTA ATTGACATTA TTAAATCTTA TGTAAAAGAT GACGCAATAT TTACAAATCA 1260 

TGAAGTCACG CATATAGATA ATGAGACAGA TAAAGTTACC ATACATTTCG CGGAACAAGA 1320 

AAGTGAAGCA TTTGATTTAT GTATTGGTGC TGATGGAATT CATTCTAAAG TGAGACAATC 1380

TGTAAATGCT GACAGTAAAG TATTATATCA AGGGTATACA TGCTTTAGAG GTTTAATTGA 1440 

TGATATTGAT TTAAAGCATC CGGaTTGTGC AAAAGAATAC TGGGGaAGAA AAGGaAGAGT 1500 

AGGTATTGTT CCGTTATTAA ATAATCAAGC ATATTGGTTC ATTACAATTA ACTCGAAGGA 1560

AAACAATCAT AAATATAGTT CGTTTGGTAA ACCTCATTTG CAAGCATACT TTAATCACTA 1620 

TCCAAATGAA GTTAGAGAGA TCTTAGACAA ACAAAGTGAA ACAGGTATCT TATTGCATAA 1680

TATTTATGAT TTGAAACCAC TCAAATCTTT TGTTTATGGT CGTACTATTT TACTAGGAGA 1740

TGCAGCACAT GCGACAACGC CTAATATGGG GCAAGGTGCT GGACAAGCAA TGGAAGATGC 1800 
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TAAAATACGT GTCAAACATA CTGCAAAAGT AATTAAGCGT TCTAGAAAAA TCGGTAAAAT 1920 

TGCCCAATAT CGTAGTCGTT TATTTGTTGC AGTTAGAAAT CGTATTATGA AAATGATGCC 1980 

AAATGCATTA GCAGCTGGAC AAACTAAATT CTTATATAAA TCGAAAGAAA AATAATACAA 2040

CAATATGAAA ACCCCCGTAT GTTGAAACGA GAGCTCAACA TATGGGGGTT CTTGTTTTTA 2100 

TAATGTTATT ATAATAAATT CAATTATTAG TTAACGACAA ATTGTGGTTT CTCACCTTGA 2160 

ACGGCACTAA TTGCAGCATT AGCAACAATT TTAGACATCA TGTCACGTGC TTCAAATGTA 2220 

GCATTACCAA TATGCGGTGT TAATACTACA TTATTAAGTG ATTTTAAGTC ATCGGTAATA 2280 

TCTGGTTCAA ATTCATATAC ATCAAGTGCA GCACCTTCAA TTTCATTATC TTTCAATGCT 2340

TGCACTAGTG CTTGTTCGTG CACGATTGGA CCACGAGAGG CATTGATTAA ATACGCCGTA 2400 

GATTTCATCA TTTTAAATTG TTCTGTATCA ATTAAATGAT GCATTTTAGG ATTATAAGCA 2460 

GCGTTGATAG TGATAAAATC TGCATTCTTT AATAGTGTAT CTAAATCTAC ATATTTTGCA 2520

CCGATTTCTC GTTCTTTTTC TTCTTTGCGA TTAGGTCCAG TGTATAGCAC ATCCATGTCA 2580 

AATGCTCTTG CACGACGAGC TACTGCACTA CCAATTTCAC CTAAACCGAT AATGCCGATT 2640

GTTTTCCCAG ATACTTCTCT ACCTCTGAAA AATAAAGGTG CCCATCCATC AAATCCAGTT 2700

GTACGTGATA ATTGGTCCCC TTCAACAATA CGACGCGCTA CTGCAAGTAC TAATCCAATT 2760 

GTTAAATCAG CAGTCGCGTT TGTTGATGCT TTAGGTGTGT TTGTAACATC TATACTTTTT 2820 

TCTCGGGCAT ACTCGATATC AATATTATTA AAACCAGCGC CATAGTTGGC AATGATTTTT 2880 

AAGTCTTTAC CAGCATCGAT AACATCTTTA TCAACGTTTG TAGATAATAA ACTAATTAAG 2940 

GCAGTCGCGT TTTTAACACC TTTAATTAAA GTGTCTTTAT CGACTAATCC TTTACCTTCA 3000 

TACATTTCAA CTTCAAAATG TTCTTGTAAA AGTTTTAAAC CTACTTCTGG TATtGCACCA 3060 

gCAAGATAAm CTTTTtCCAT AAAAGAtCAC TCCTTTTATC TTAGTATAGT AGAAGATTAG 3120

ACAGTATACA ACTATGTCAT GATGTCTTGT GTATCAATGA TGTAAGCGCG TACTTTTGAT 3180

GGAGGCGATA TAACTTAGGC ACTGTAGAAC TATGAATATT GTAATGTGGA AAAACTGGAT 3240 

CAATTAAATT AGATAACGTA GTTTTAAAGT TAATAGTATT AGAAAAAATT AATATTTTGA 3300 

ATATGGGAGG AAATATAAAT AAGTAGGTGG CAACGAAAAA TAGCAAAAAA AGAGCTTCTC 3360

CTATAAAGGA AAGCTCAAAG TTTTTTGATG ACATATGTAC TAGAATTAAG TTTCAAGACA 3420 

ATATGTATCA TCGTGTTTAT ATTAAATATG GATGTAGTTG TAGTTACCTG CTTCACTTGC 3480

AGAAATAGTT CTAGAACTTA CTGAGAAAGG TCCGCCACTA TAATTCATTT CTGAAATTGT 3540

AACTGAACCA TCACTGTTTA CACTTTCTAC ATATGCAACG TGACCAAATG GTCCTTCAGA 3600 
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AGCAGCAGCC CAATTATTAG CATTTCCCCA AGTAGAACCG ATTTCTCCGC CAACTTTATC 3720

ATATACATAC CAAGTACATT GTCCTGCAGT GTATAAGTTA CCAGAATGTG AAATTGATGA 3780

TGTAGTTGTC GTAGTTGTCG TAGTCGTTGT AGTTTGAGTC GTGTTGTAGT TATAGTTGTT 3840

GTAATTTGTA TAATTTTCAG CAGCATCTGC ATGATGTGCT TGACCTACTA ATGCTGTGCC 3900

GATTCCTGCT GTTAACGTAG TTGCTGTTAC TAATTTTTTC ATGAATAAAG TCCTCCAAAG 3960

TTCTATATCT TTTTTTATAA ATAAAACGTA GCGACTGTTT TATTCTCACA TCTCGAATTG 4020

ATGACAATAG TTACTTTAAC AAAATtAATG CTTCTTGTGG GGAATGTTAT TGATTTGTAA 4080

AAGAATAAAA AAACTTTGAC TAATTTTGTA ATAAAAATTA GTCAAAGTTA CAATGAGATT 4140

AACAGATAAT TAATAGGAAA TATTTATTTG TAATATGTTT AAATAAATCG AATTGTTAAA 4200

GGTATTATAT ATTCTTGGCC ATTATAATAT TTGACACACG CAATAATTGT GAATACAAAA 4260

GATAATATTG AGAAAGCGAA TATGGATAAA ATACCGATAA ACGTAATGAT GAAACCTATA 4320

ATAATAATGA AATCAATATC TGTAGCAATT AGGAAAACGC CTATTAAAGT GATAACGACT 4380

AAAACGATAG ACCAAATAAT ATAAGAAATC GTATAGTTAA GATAATTTTT TCCAGCACGA 4440

TCAACTAGTT TCGATTCATC TTTTTTCAAT AACCATATTA TCAGTGGACC AATAATAGAT 4500

GTGAATAAAC TTAATAAATA GATAAGCATC GCCATAATGT TCTCATCATT GGATTTGCGA 4560

TTCGGTTGAT GATTTGTTAC GTCGTTCATT TCAGTTGTCA TATTAGACAC TCCTTTGAAA 4620

ATTGTAATAT TATCTTTAAC TATAACAAAA TATAATCAAA AATAAACATG TTTATTAAAC 4680

AATTATTAAA AATAAAAATA ATTGGTGGAC GTCGGCGTTT AAATAGGTTA ATTTAAGGTT 4740

ATATATACTT AACATTTATA ATGATGCGTA ATGAATTCGC ATCATTTTTA TATTGTCTTA 4800

CGTATAATTT GTTTTTAATT TTAACCAAAG ATAGAAAGAG GGTTGTTTAT GAAAATAGCA 4860

ATTGTAGGAT CAGGAAATGG CGCAGTTACG GCAGCAGTAG ATATGGTGAG CAAAGGCCAC 4920

GATGTTAAAT TATATTGTCG TAATCAATCT ATAAGTAAGT TTCAAAACGC AATCGAAAAG 4980

GGCGGATTTG ATTTTAATAA TGAAGGTGAT GAACGTTTCG TAAAATTCAC TGATATTAGT 5040

GATGATATGG AATATGTTTT AAAAGATGCT GAAATTGTTC AAGTGATTAT TCCATCTTCA 5100

TACATAGAGT ATTATGCTGA TGTAATGGCA GAGCATGTAA CTGATAATCA GTTGATATTC 5160

TTCAACATGG CTGCAGCAAT GGGGTCAATT CGTTTTATGA ATGTTTTAGA AGATAGACAT 5220

ATTGAAACAA AACCACAACT AGCGGAAgcT AATACGTTGA CGTATGGTAC GCGTGTCGAT 5280

TTTGAAAATG CAGCAGTTGA TTTATCTCTA AATGTACGTC GTATCTTCTT TTCAACATAT 5340

GATAGAAGCT GTCTAAATGA TTGTTATGAC AAAGTTTCAA GTATTTATGA TCATTTAGTA 5400
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CCAACATTAT TGAATGTCGG TCGCATTGAT TATGCTGGCG AGTTCGCTTT ATATAAAGAA 5520 
GGAATTACTA AACATACAGT TAGATTACTT CATGCAATCG AATTAGAACG TTTGAATTTA 5580

GGCCGTAGAT TAGGTTTTGA ATTATCAACA GCTAAAGAAT CACGTATTGA ACGTGGTTAT 5640

TTAGAACGTG ATAAAGAAGA TGAACCATTA AATCGTTTGT TTAATACAAG CCCAGTATTT 5700 

TCACAAATTC CAGGACCAAA TCATGTAGAA AGCAGATATT TAACTGAAGA TATTGCATAT 5760 

GGTTTAGTAC TATGGTCAAG CTTAGGTCGT GTTATTGATG TACCGACACC AAATATAGAT 5820

GCAGTAATTG TAATTGCATC AACCATTTTA GAGAGAGACT TCTTTGAGGA AGGCTTAACA 5880

GTTGAAGAAA TTGGTTTAGA TAAGCTTGAT TTAGAAAAAT ATTTAAAATA AATGATGGCT 5940 

TGAAGATAGA AAAGGATATA GCATTATGCA AAAGCAATAA ATTGAAGAAA AGAGGTTTCT 6000 

CATCAATAAG CGnAGGGGAC GATAGATGAT GAAAAGAAAA CCCACCTTTT TAGAATCAAT 6060 

TTCGACAATG ATTGTAATGG TTATTGTTGT TGTAACAGGC TTTGTGTTTT TTGATATTCC 6120

AATTCAAGTA TTATTAATTA TTGCCTCAGC ATATGCCACA TGGATTGCAA AACGTGTAGG 6180 

CTTAACATGG CAAGATTTAG AAAAAGGCAT TGCAGAACGT TTAAATACTG CAATGCCTGC 6240 

AATTTTAATT ATACTAGCGG TAGGAATTAT AGTAGGCAGT TGGATGTTTT CTGGCACAGT 6300

GCCAGCCTTG ATTTATTATG GCTTAGATTT ATTGAATCCA AGCTATTTTT TAATATCAGC 6360 

CTTTTTTATA AGTGCTGTTA CATCTGTAGC AACTGGTACA GCATGGGGCT CTGCATCAAC 6420 

TGCAGGGATT GCACTTATTT CTATTGGTAA TCAATTGGGG ATTCCTCCAG GGATGGCAGC 6480 

GGGTGCTATT ATAGCAGGGG CTGTGTTTGG CGATAAAATG TCACCATTAT CAGATACAAC 6540 

TAATTTAGCG GCGCTTGTTA CTAAAGTTAA TATATTTAAA CATATACATT CGATGATGTG 6600

GACGACGATA CCTGCATCAA TCATAGGTTT ATTAGTATGG TTTATTGCTG GATTTCAATT 6660 

TAAAGGGCAT TCAAATGATA AACAGATTCA AACTTTGTTA TCAGAGCTTG CACAGATTTA 6720 

TCAAATTAAC ATATGGGTCT GGGTTCCCTT AATTGTGATC ATTGTTTGTT TGCTATTTAA 6780 

AATGGCTACA GTGCCAGCTA TGCTAATATC AAGCTTTTCT GCCATTATAG TGGGGACTTT 6840 

TAATCATCAT TTCAAAATGA CAGATGGTTT CAAAGCAACA TTTAGTGGTT TTAACGAATC 6900 

AATGATACAT CAGTCTCATA TTTCATCCAG TGTGAAAAGC TTGTTAGAAC AGGGTGGTAT 6960

GATGAGTATG ACCCAAATAT TAGTAACGAT ATTTTGCGGA TATGCATTTG CAGGTATTGT 7020 

AGAAAAAGCA GGATGTTTAG AAGTCTTATT AACTACTATT TCTAAAGGCA TCCATTCTGT 7080 

AGGAAGTTTA ATATGTATTA CTGTTATTTG TTGTATTGCG CTTGTATTCG CTGCAGGTGT 7140

TGCTTCGATT GTAATTATTA TGGTCGGTGT GTTAATGAAA GATTTGTTCG AAAAATACCA 7200 
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AATACCATGG GGAACATCAG GTATTTACTA TACGAATCAA CTTCATGTCT CTGTTGAAGA 7320 

ATTTTTCATA TGGACAGTAC CATGTTATTT ATGCGCAATT ATAGCAATTA TCTATGGTTT 7380 

TACAGGGATA GGTATTAAAA AGTCATCGAA TTCACGTTTA ACTTAATGTG AGCGTGGAAT 7440 

ATATATAATA TGTTGAAACA CTTTAATCAT TTATAATTGT AGCGGTTATA ATTTGAAAAG 7500 

GTTTTAACTT AGAATAAATA TCCTCTATGC ATATACTGAA TATGTTTTGT AGCGGAACAT 7560 

GTTGATATAT GTAATGTAAG TTTTATGTCA TGATTTGTAA TGACTAAATT AATTGAGAAT 7620 

TTGAAGGCAA GTATATTTGT AAGTACTTTA ACTAAAAATT TATCAATGTA TAGCCGATTT 7680 

GACATGCCTA AATTTGGGTG TGTCAATGGC TGTATGTTGT TTATTCTTTA TTACAGAGTG 7740

AATCGGATTG GTGAAAATCG AAATTTTGAG ATTTTTACCA ATTCGATTTT TTTCATAGAA 7800 

ATTAAAAAAG CCAACAAGGC TCTTGAAACC TTGTTGGCGT AAACATAGCC ATCACTAATT 7860 

AGTGAATGAA GTTATAACCA GCAGCTTGGC TAGCTGAGAT TGTACGTGAA GTTACAACAC 7920

CTGGGCCATA ACCATAGTTC ATTTCTGAAA CTCTTACTGA ACCATTGCTG TTAACACTTT 7980 

CAACGTATGC AACGTGACCG TATGCACCTT GAGTTGTTTG CATAATTGCA CCAGCTTTTG 8040 

GTGTATTGTT CACTGTGTAA CCAGCTCTTG CAGCTGCGTT AGCCCAGTTA CTTGCATTGC 8100

CCCAAGTTGA ACCGATTTTA CCACCTACAC GATCAAATAC GTAGTATGTA CATTGACCAG 8160 

AAGTGTATAA GTTACGTCCT GAAGTATAAC CACTTGAGAT TGAACGGCCA TTTGATGATG 8220

GAGCCATAGT TGTAGTTACT TGAACATTGT TGCTTGAAGT GCTGTAGCTT GCACCTAAAC 8280

CACCAGTACG GTAGCTGTTT GTGTTGTAAC TATTATAGTT ATTGTAGTTA TATGATTGAT 8340

TATTATTTGA GTAGTTGTTG TAACGGCTGT AGTTATTGTA GCTATAACCG TTGTTGTAAT 8400

TGTTATAGTT ATTGTAACCA TTGTAGTAGT AATAGCTGTA GTAGCCATTA TCTTGGTTTA 8460 

ATTGACTTGG ATGCCAGTTA CCTTTCCATG TGTAATGGTA GTTACCTTGT GCATCAATAG 8520 

TGTAAGTATA GCTATATGAT GTTGGGTCGT TTGGATTATA ACCGTAGTTA TCTTGCTCAG 8580 

AAGCATGAGC TTGATTTCCT GATGCAATTG CGATTGTAGC GAATCCTGCA GTTGCGATAG 8640 

TAGCTGTAGC GATTTTCTTC ATTTTAAAAA TATCCTCCTA AAAATTTTAA ATCTAAAATA 8700 

TTTTCGTAAT GTCCGTGTGA CAAAATTAAT GTTATAAGTT ATCTCTCGTA ATTAAACGAC 8760

AAGAAAGACT ATAACAGAAA TTAGCGTCCT TGTGTGCTTT GTTAACGTTT TGTAATTTTT 8820 

TGCTAATATC TTGACACAAT AGAATTTTAA AAGTATAGAA ATTTGCATTT TGCAAAACTT 8880 

ATAACTACGG CATTCTTTGT GAAAACTGAA TGTTTCGAAA ATAAGTCTGT TACAAATTTG 8940

TAATATTACT GAAAATTCTA AATGTATATT TTGTGCATAA TATAGGACTT TTAATCAGAA 9000 
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GGATGAAAAT GTATATTTAA TGGATAAAAT ATCCTAATTT AGCATAAAAA AATGTTTTAA 9120 
TAAAAGTATT ATTTGATATA ATCGATTTAT GTTTTGTTAC TGCTAAAAAA CATGTGGCG 9179 

(2) INFORMATION FOR SEQ ID NO: 101:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1868 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 101: 

CCTTCAGCCA TTTGACTTCG ACATGAGTTG CCTGTACATA TAAAATAAAT TGTTTTTTTA 60

GTCATAACAA TCTCCTAATT AATTAAAATA TGATAAGTGT TAGATACAAC CCTATGAGGG 120 

TTATAAATAG TACTGGAATT GTAATGATGA TACCAGTTTT AAAGTATGTG CCCCAAGAAA 180

TCTTAACATC TTTTTGtGTT AAGACGTGTA ACCACAGTAA TGTAGCTAAA GAGCCTATCG 240

GTGTAATTTT TGGACCTAAA TCAGAACCGA TAACATTCGC ATAAATTAGG CCTTCTTTTA 300

ACATGCCATG GACATTTGAT TGACCAATAG CAATCGCATC TATTAAAACT GTAGGCATAT 360

TATTCATTAT TGATGATAAA AACGCTGAAA TGAAGCCCAT TCCCAAAATA GTGCTAAATA 420 

GACCGTAATT GGAAATATAT TCTAATATTT TAGCCAATAT TAAAGTAATG CCAGCATTTC 480 

TTAAGCCGAA TACGACGATA TACATACCAA TTGAAAATAA TACTATATTC CAAGGTGCGC 540

CCTTAATGAC TTGCTTAATA TTTACAGCAT TTGATTTACG AGCCAACATT AGAAAAATAA 600 

AAGCAATGAT TCCAGTGAAA ATTGATACCG GAATTTTAGT AAATTTACTG ATTAGATAGC 660

CGAAAAGTAA TATAACTAGA ACAATCCaTG AAATTTTAAA TAGCTTTAAA TCATTAATGG 720 

CATCFTTAGG ATGCTTTATA TTATTATCAT CAAACGTTTT AGGTATCGCT TTTCTAAAAT 780 

ATAACCACAA TACTATAATA CTTGCTAAAA GCGAGAATAA ATTAGGTATA ATCATTCTAC 840 

TAAAATATCG AACGAATCCT ACATGAAAAT AATCAGCAGA TATAATATTC ACTAGATTGC 900 

TCACGATTAA AGGTAAAGAA GTTGTGTCAG CTATAAAACC ACTCGCAATA ATnAAAGGGA 960 

ATATGGCCCG CTTACTAAAA CCTATATTTT TAACCATCGC TAATACAATA GGCGTTAAGA 1020

TTAAcGTGCG CCATCATTTG CGAAAAATGC AGCAACAATG GCACCCAATA ATATGATATA 1080 

AACGAACATT TTTAAACCAT TGCCTTTTGA AGCATGAAGC ATGTGAATAG CTGACCATTC 1140 

GAATAATCCA ACTTTATCTA ATATTAATGA AATAAGAATG ACTGAGACAA AAGTCAAAGT 1200

AGCATTCCAA ACAATACCTG TTACTTCGAA AACATCGGAA AAACTTACAA CACCAGTAAT 1260 
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TAATACAAAT AATAAAGTTA CTAGAAAAAT GAGTGTCGCT AAAGTTGTCA TCATTAGCAT 1380
TCACCAGTCT TAAGGTTATG ACAAATACAT CGTTGGTTAG AGGTATGAAC CTTAGACAAG 1440
TTATTAATTA CGGACTCAAA AATATTATGA TTgAGCTGGT ATAAATGTTT ATTTCCGATT 1500
TTTCGTGTCG TAACTAAGTT GGTTTTTACT AATGCTTTCA TATGrTAGCT AAGTGTAGGT 1560
TGAGAGAATT GAAAATGTGC TAACAAATCA CAAGCGCATA ACTCTCCACA AGAAAGTAAA 1620
TCTAGTATTT CTAATCTGCT TGAATCTGAT AAAACTTTTA AAAATGTTGC TAGTTCTTTA 1680
TACGTCATAA CATACCTCCT AGACGTTAAA TAGATTATCA TCTATATAGA TGAATGTCTA 1740
TGTTCCTTTG GTATATTACA CGATATGACT ATGTAATTTA AATTTGGTTT TAGTATTAAA 1800
AGGGTATTAA AGATAAATTA TAGATATTGA TTTTGCAAAA TATACTCTTT GTTCTGCATT 1860
GAAAAAGG 1868

(2) INFORMATION FOR SEQ ID NO: 102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 102: 
ATTTATGAAA TCCATAGCnA TAAACATTAT TCTTGCATCG GCTATACAAA CAGTTACCGC 60
AAGCAAATTT GTATATCAAC CTGGAATTGT GTTCACGTCA ATGGCaAATG CCGATGATGT 120
GTTATCAGGC GATAGTTATT TTATGGCTGA ATTAAAATCT ATTAAGCGTA TTGTTGAAAT 180
TCCAGATAAT CAAAAAATAT ACTGCTTTAT AGATGAAATT TTTAAAGGTA CCAACACAAC 240
TGAACGAATT GCCGCTTCAG AATCAGTACT ATCATTTTTA CATGAAAAAT CTAACTTTAG 300
AGTTATTGCA GCAACACATG ATATTGAGTT AGCTGAACTC TTAAAACAAC GTTATGAAAA 360
TTACCATTTC AATGAGGTAA TAGAAAATAA TAACATACAT TTTGATTACA AAATTAAGCC 420
TGGCAAAGCA AATACACGTA ATGCCATCGA ATTATTAAAA ATCACTTCAT TTCCAGCAAA 480
AATATATGAA CGAGCAAAAG ATAATGTCCC GAAAATTTAG CATTTAACTT TAAACATAAA 540
AACGTCAGCT ATCACATGAC AGAAGACTAT GAACAGTTTC AATAATGTTC ATAGTAATCA 600
TGTTAATAAC TGACGTTTAT TTTATTCTGC AGAATACTCT TCTAAATCTA TATTGCTGTG 660
CCCATTTAAT GCTAAATCAG CAAATCGACC TTGCTGATAC AAATAGTGGC CGGCAACGCC 720
TATCATTGCA GCATTATCTG TGCATAATTT AGGACTTGGG ATAGTTAATT GAATGTCATT 780
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15 



AACAATTAAT CGCTGAACAC CATATTCTTT ACAAGCTTGA ATAGCTTTAA ACGTGAGCAC 900 

CTCTACAACA CTGTTTTGAA AGCTCGTTGC TACGTTAGCT TCAATGATTG GaATATTTTT 960 

TTGTCGTTGA TTGTGAAGTT GATTGATTAC GGCACTTTTC AACCCACTAA AACTAAAATC 1020 

ATAACTATCT TTATCCAACC AAACACGAGG GAATGAATAA GTATCTTCAC CTTCAGCAGC 1080 

CAACCGATCA ACTTGTGGAC CACCTGGATA ATTTAAACCA ATTGTTCGTG CCACTTTATC 1140 

ATAAGCCTCA CCTACTGCGT CATCTCGTGT TTCACCAATG ACTTCAAATG ATAAATGATC 1200 

CTTCATATAA ACTAATTCAG TATGTCCACC TGAAACAATA AGTGCAATTA GCGGGAATGT 1260 

7AATGGCTCT TCTATGTGAT TAGCATATAT ATGTCCTGCA ATATGATGAA CAGGAATAAG 1320

TGGCTnATCG TAAGCAAATG CCAATGCTTT GGCTGCATTA ACACCTATTA GTAACGCACC 1380 

AATTAGTCCA GGGCCTTCTG TAACCGCTAT GGCATCAATA TCTTCTATTG ATACATCGGC 1440

ATCCCCTAGA GCCTCGTTTA TTGTTGCTGT TATACCTTCA ACGTGATGTC TACTTGCCAC 1500

TTCGGGAACG ACACCGCCAA ATCGTTTATG ACTTTCAATC TGACTTAAAA CTGTATTTGA 1560

TAAAATATCT CTGCCATTTT TTATAACACT AACGCTTGTT TCATCACAAC TTGTTTcAAC 1620

AGCTAGTATT AATATATCTT TAGTCATTTA AATTCACCCA CATAACCATT GCGTCCTCAC 1680 

CTTCACCATA ATAATTTTTA CGTTTACCAC CATATTGAAA TCCTAAATTT TCATATACAT 1740 

GTTGTGCCAC TTTATTATTA ACTCTTACTT CTAAACTCAT CACATCACAA GTGTGACTTG 1800 

CATAGTTTAT TCCGTATTTT AAAAGCATTT GACCTAAACC ATAGCCTCTA TAATTATCAT 1860 

CGATTGCAAC TGTTGTAATT TGAGCTTGAT CGATAACAAT CCATAAACCT AAATAACCAA 1920 

TAATTTGTTG TTCAAATTCt AAGACAAAAT ATTTCGCAAA GTTATTTTGC TCTATTTCAT 1980 

GATAAAATGC GTCAATTGTC CAAGAACTGT CATTGAAACT CCGACGCTCA AGATCAAAGA 2040 

CTTQTGGCAC ATCTTCTTTA GTCATCTCTC TAATGTTTAA TTGTTCTTTT GACTGTTGAT 2100 

CCAATTTCGT TCCGCCTCAG CTAATTTATG GTATTTAGGA GTAAATGTAT GTACGTCTGA 2160

AGGTTTATCT AGCAATTGAT ACATGACTGA TGCATTTGGT AGctGCGCAA TCACTTCACC 2220 

TTGTAATTCA TCTTGTAATT TTACAGTATC TTTCCCAATA TAAATAAATG GTTGGTTTAA 2280 

ATCTTCTAAA AAAGCTCGCA ATGCCTCTAT CGACATATAT TGATCTTCTA AAATAGTCAC 2340

TAATTGACCA TTTTGCCACT GGAATATGCC TGTATAAACT GCTTGTCGTC TTGCATCAAA 2400 

CACAGGAACC AATAATTTAT CAGTATGATC GATTGTTGCT GCCAATGCCT TTAATGATGA 2460 

AACACCATAT AATTTAACAT CTAACGCATA CGCTAATGTT TTAGCAACAG TAACACCGAT 2520 

ACGTAAGCCA GTATATGAAC CAGGACCTTC AGCAACAATA ATCGCATCTA ATTGCTGTTT 2580 
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TTGTTTAGAA TCCGTAGTTA TTTCAGCTAA AACTTCATCG TTTTGCATCA ATGCTACTGA 2700 

TAATGGTTGA TTCGATGTAT CAATGAGCAG CGAATTCATG GATAATTGCC TCCTTAATTT 2760 

GTTCATAATG TTCTCCTTGC GCGAACAACT CAATTTGTCT TGTATTTTCA GATATTGTTG 2820 

AAATGTTAAT AGATAAATGC GTCGCTGGAA GTAAATCTTT TATAAATTGA CTCCATTCAA 2880

TAACAGTAAT TGCCTGATCT TCGAAAAATT CATCAAATCC TAAATCTTCA TCAGAATCTT 2940 

CTAAGCGATA ACAATCCATA TGATGCAATT TTAAATTTTT ACCCCTATAT GATTTAATGA 3000 

TGTTAAATGT CGGGGAATTA ATCGTACGTC TTACACCAAG AGCTTTTCCT ATAAATTGCG 3060

TTAACGTTGT TTTACCTGCT CCTAAATCTC CGTTAAGTAA AATCAAATCA CCACTTTTCA 3120 

ATTGCTCAAC TAAAAATATA GCAAATTGAT TCATTTCATC TAAATTATTT ATCTTTATCA 3180 

ATGTTGATTC TCCTATATTA TGCTTTTCAT TCATAAAAAT GATTATCCAT TGTTCAATCG 3240

TATCTAACTT TATATTTAAC CTTTATATTG TAACAAATTT CAACTTAAAT TTCTTATCTT 3300

TGAAACAGAT TATCTATTCA AAGTTAATTG TAAGAAAATT TAAAATATTT GTTGACATAC 3360 

TAAAGCAGAT ATAGTAAATT AAATTTATCA AATTTTTAGA CAATTCTAAC TATTAAAGTG 3420 

ATATATACCA TTCACGGAAG GAGTATAATA AAATGCTTAA TCAATATACT GAACATCAAC 3480

CGACAACTTC AAATATTATT ATTTTATTAT ACTCTTTAGG ACTCGAACGT TAgTAAATAT 3540

TTACTAAACG CTTTAAGTCC TATTTCTGTT TGAATGGGAC TTGTAAACGT CCCAATAATA 3600

TTGGGACGTT TTTTTATGTT TTATCTTTCA ATTACTTATT TTTATTACTA TAAAACATGA 3660

TTAATCATTA AAATTTACGG GGGAATTTAC TATGCGAaCG AgcATGATCA AAAAAGGAGA 3720 

TCACCAAGCA CCAGCAAGAA GTCTTTTACA TGCCACGGGC GCGCTAAAAA GTCCAACTGA 3780

TATGAACAAA CCATTTGTAG CTATTTGTAA CTCTTATATT GATATTGTTC CTGGACATGT 3840

TCACTTGAGA GAGCTTGCAG ATATAGCTAA AGAAGCAATT AGAGAAGCCG GTGCCATTCC 3900

ATTTGAATTC AATACAATTG GTGTTGATGA TGGAATAGCT ATGGGACATA TCGGAATGCG 3960

ATATTCTCTA CCATCACGTG AAATTATTGC AGATGCAGCT GAAACTGTAA TTAACGCTCA 4020 

TTGGTTTGAC GGCGTATTTT ACATTCCTAA TTGTGACAAG ATTACACCCG GTATGATTTT 4080 

AGCAGCCATG AGGACAAACG TACCAGCTAT CTTTTGCTCT GGTGGACCAA TGAAAGCTGG 4140

CTTATCTGCA CATGGAAAAG CATTAACACT TTCATCAATG TTTGAAGCAG TCGGCGCATT 4200 

TAAAGAAGGA TCGATTTCTA AAGAAGAATT TTTAGATATG GAACAAAATG CCTGCCCTAC 4260 

TTGTGGTTCA TGTGCTGGGA TGTTTACTGC AAATTCAATG AACTGTTTGA TGGAAGTTTT 4320 

AGGTCTAGCA TTACCATACA ACGGTACTGC ACTTGCAGTC AGTGATCAGC GACGAGAAAT 4380 
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TATCGTTACT CGCGAAgCAA TTGATGATGC ATTTGCACTT GATATGGCTA TGGGTGGTTC 4500 

AACAAACACG GTACTGCATA CGTTAGCCAT TGCCAATGAA GCTGGTATTG ATTATGACTT 4560 

AGAGCGCATT AATGCTATTG CCAAACGCAC GCCATATTTA TCAAAAATAG CACCTAGTTC 4620 

ATCGTATTCA ATGCATGATG TGCATGAAGC TGGTGGCGTC CCAGCAATTA TTAATGAATT 4680 

GATGAAGAAA GATGGCACGT TACACCCAGA TAGAATCACA GTTACTGGCA AAACGTTACG 4740
TGAAAATAAC GAAGGCAAAG AAATTAAGAA CTTTGATGTC ATTCACCCTC TTGATGCACC 4800

ATATGATGCA CAAGGCGGTT TATCTATCTT ATTTGGTAAT ATCGCCCCTA AAGGCGCAGT 4860 

TATTAAAGTT GGCGGCGTTG ATCCATCTAT CAAAACATTT ACTGGGAAAG CAATTTGTTT 4920

CAATTCGCAT GATGAAGCTG TTGAAGCAAT AGACAATCGT ACCGTTCGTG CAGGCCACGT 4980

CGTTGTCATT AGATATGAAG GACCTAAAGG TGGACCAGGT ATGCCTGAAA TGTTAGCACC 5040 

TACTTCCTCT ATTGTTGGTC GCGGCTTAGG TAAAGATGTT GCATTAATTA CTGATGGGCG 5100

TTTTTCOGGT GCCACAAGAG GTATTGCAGT TGGTCATATT TCCCCTGAAG CTGCATCTGG 5160

TGGACCAATT GCCTTAATTG AAGATGGTGA TGAGATTACT ATTGATTTAA CAAATCGTAC 5220

ATTAAACGTA AACCAGCCTG AAGATGTTCT AGCGCGTCGC CGAGAATCTT TAACACCATT 5280

TAAAGCGAAA GTAAAAACAG GTTATCTAGC TCGTTATACT GCCCTAGTAA CTAGCGCAAA 5340

TACAGGTGGC GTCATGCAAG TCCCTGAGAA TTTAATTTAA TTTATTTTTA TATTGGAGAT 5400

GGTTAAAATG TCTAAAACTC AACATGAAGT AAACCAAAAT ATTGACCCTT TAAAAATGGC 5460 

TGAATCACTT GAACCTGAAC AACTAAATGA AAAAACTTTA AATGATATGC GTTCAGGATC 5520 

AGAAGTGCTA GTAGAAGCTC TACTTAAAGA AAATGTGGAT TATTTATTCG GTTATCCTGG 5580

TGGTGCCGTA CTACCTTTAT ATGACACGTT TTATGATGGT AAAATCAAAC ATATTTTAGC 5640

AAG^CACGAA CAAGGTGCTG TTCATGCTGC AGAAGGTTAT GCACGTGTAT CTGGTAAamT 5700 

GGCGTCGTTG TAGTTACAAG CGGTCCaGGT GCAACTAATG TAATGACAGG TATTACGGAT 5760

GCACATTGCG ACTCTTTACC TCTAGTTGTA TTCACTGGAC AAGTTGCTAC ACCAGGCATT 5820 

GGTAAAGATG CATTCCAAGA AGCGGATATT CTATCTATGA CTTCACCAAT TACAAAACAA 5880 

AATTATCAAG TGAAACGTGT TGAAGATATC CCTAAAATCG TACACGAAGC TTTCCATGTA 5940

GCTAATTCTG GACGCAAAGG TCCTGTAGTG ATTGATTTTC CAAAAGATAT GGGTGTTTTA 6000

GCTACAAATG TGGATTTATG CGACGAAATC AATATTCCAG GTTATGAAGT TGTTACAGAA 6060

CCAGAAAATA AAGACATTGA CACTTTCATC TCACTTTTAA AAGAAGCGAA AAAGCCTGTC 6120 

GTATTAGCCG GCGCAGGTAT TAATCAATCA AAATCAAATC AATTATTAAC ACAGTTTGTT 6180 
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GATACACTAT TTTTAGGTAT GGGAGGAATG 
CATGGTTCTT ATGCTAGTAA CATGGCATTA 6300

ACTGAGTGTG ATTTACTCAT TAATTTAGGT AGCCGCTTCG ATGATAGATT AGCAAGCAAA 6360

CCTGATGCCT TTGCACCTAA CGCCAAAATT GTACATGTAG ATATTGATCC TTCAGAAATC 6420

AATAAAGTTA TTCATGTAGA TTTAGGTATT ATTGCAGACT GTAAAAGATT TTTAGAATGT 6480

TTAAATGATA AAAATGTTGA GACTATAGAA CACAGTGACT GGGTTAAACA TTGTCAAAAT 6540

AATAAGCAGA AACACCCATT TAAACTTGGT GAAQAAGATC AAGTATTTTG TAAGCCACAA 6600

CAAACAATCG AATATATCGG CAAAATTACA AATGGTGAAG CAATTGTTAC TACAGACGTG 6660

GGACAACATC AAATGTGGGC AGCTCAATTT TATCCATTTA AAAATCACGG ACAATGGGTT 6720

ACAAGCGGTG GTTTAGGAAC AATGGGATTC GGTATTCCTT CGTCAATTGG TGCCAAATTA 6780

GCTAATCCTG ATAAAACAGT CGTATGTTTC GTCGGTGACG GTGGTTTCCA AATGACAAAC 6840

CAAGAAATGG CACTTTTACC CGAATATGGT TTAGATGTCA AAATCGTACT AATCAATAAT 6900

GGAACATTAG GTATGGTTAA ACAATGGCAA GATAAGTTCT TTAATCAACG CTTCTCACAC 6960

TCAGTATTTA ATGGTCAACC TGATTTTATG AAAATGGCAG AAGCATATGG CGTCAAAGGT 7020

TTCTTAATCG ATAAGCCAGA ACAACTGGAA GAACAATTAG ATGCAGCGTT TGCTTATCAA 7080

GGACCAGCTT TAATTGAGGT TCGTATTTCC CCTACTGAAG CTGTAACCCC AATGGTTCCG 7140

AGTGGCAAAT CAAATCATGA AATGGAGGGC TTATAATGAC AAGAATTCTT AAATTACAAG 7200

TTGCGGATCA AGTCAGCACG CTAAATCGAA TTACAAGTGC TTTTGTTCGC CTACAATATA 7260

ATATCGATAC ATTACATGTt ACACATTCTG AACAACCTGG GATTTCTAAC ATGGAAATTC 7320

AAGTCGATAT TCAAGATGAT ACATCACTTC ATATATTAAT TAAAAAATTA AAACAACAAA 7380

TTAATGTTTT AACGGTTGAA TGCTACGACC TTGTTGATAA CGAAGCTTAA TTTTAAGACA 7440

aaggCaatga TGCGCTAATT AGTTATAGAT ATATCATAGG CTGCTAGTTA ACATCTGCCA 7500

CTATTACAAA GTTATATTTC AGAATTTTCG AAACACAAAA TATTTAATTA TTTGGAGGAA 7560

TTTATTATGA CAACAGTTTA TTATGATCAA GATGTAAAAA CGGACGCTTT ACAAGGCAAA 7620

AAAATTGCAG TAGTAGGTTA TGGATCACAA GGTCACGCGC ATGCACAAAA CTTAAAAGAC 7680

AATGGATATG ATGTAGTCAT CGGCATTCGC CCAGGTCGTT CTTTTGACAA AGCTAAAGAA 7740

GATGGATTTG ATGTGTTCCC TGTTGCAGAA GCAGTTAAGC AAGCTGATGT AATTATGGTG 7800

CTATTACCTG ATGAAATTCA AGGTGATGTA TACAAAAACG AAATTGAACC AAATTTAGAA 7860

AAACATAATG CGCTTGCATT TGCTCATGGC TTTAACATTC ATTTTGGTGT TATTCAACCA 7920

CCAGCTGATG TTGATGTATT TTTAGTAGCT CCTAAAGGAC CGGGTCATTT AGTTAGACGT 7980





EP0 786 519 A2 



CAAGCACGTA ATATTGCTTT AAGTTATGCA AAAGGTATTG GTGCAaCTCG TGCAGGTGTT 8100 

ATTGAAACAA CATTTAAAGA AGAAACTGAG ACAGATTTAT TTGGTGAACA AGCAGTACTT 8160

TGCGGTGGTG TATCGAAATT AATTCAAAGT GGCTTTGAAA CATTAGTAGA AGCGGGTTAT 8220

CAACCAGAAT TAGCTTATTT TGAAGTATTA CATGAAATGA AATTAATCGT TGATTTGATG 8280 

TATGAAGGCG GTATGGAAAA TGTACGTTAC TCAATTTCAA ATACTGCTGA ATTTGGTGAC 8340 

TATGTTTCAG GACCACGTGT TATCACACCA GATGTTAAAG AAAATATGAA AGCTGTATTA 8400

ACTGATATCC AAAATGGTAA CTTCAGTAAT CGCTTTATCG AAGACAATAA AAATGGATTC 8460 

AAAGAATTTT ATAAATTACG CGAAGAACAA CATGGTCATC AAATTGAAAA AGTTGGTCGT 8520 

GAATTACGCG AAATGATGCC TTTTATTAAA TCTAAAAGCA TTGAAAAATA AGATAGACCT 8580 

ACAATGAGGA GTTGTTAAAT ATGAGTAGTC ATATTCAAAT TTTTGATACG ACACTAAGAG 8640 

ACGGTGaACA AACACCAGGA GTGAATTTTA CTTTTGATGA ACGCTTGCGT ATTGCATTGC 8700

AATTAGAAAA ATGGGGTGTA GATGTTATTG AAGCTGGATT TCCTGCTTCA AGTACAGGTA 8760

GCTTTAAATC TGTTCAAGCA ATTGCACAAA CATTAACAAC AACGGCTGTA TGTGGTTTAG 8820

CTAGATGTAA AAAATCTGAC ATCGATGCTG TATATGAAGC AACAAAAGAT GCAGCGAAgC 8880

CGGTcGTGCA TGTTTTTATA GCAACATCAC CTATTCATCT TGAACATAAA CTTAAAATGT 8940

CTCAAGAAGA CGTTTTAGCA TCTATTAAAG AACATGTCAC ATACGCGAAA CAATTATTTG 9000

ACGTTGTTCA ATTTTCACCT GAAGATGCAA CGCGTACTGA ATTACCATTC TTAGTGAAAT 9060 

GTGTACAAAC TGCCGTTGAC GCTGGAGCTA CAGTTATTAA TATTCCTGAT ACAGTCGGCT 9120 

ACAGTTACCA TGATGAATAT GCACATATTT TCAAAACCTT AACAGAATCT GTAACATCTT 9180 

CAAATGAAAT TATTTATAGT GCTCATTGCC ATGACGATTT AGGAATGGCT GTTTCAAATA 9240

GTTTAGCTGC AATTGAAGGC GGTGCGAGAC GAATTGAAGG CACTGTAAAT GGTATTGGTG 9300 

AACGAGCAGG TAATGCAGCA CTTGAAGAAG TCGCGCTTGC ACTATACGTT CGAAATGATC 9360

ATTATGGTGC TCAAACTGCT CTTAATCTCG AAGAAACTAA AAAAACATCG GATTTAATTT 9420

CAAGATATGC AGGTATTCGA GTGCCTAGAA ATAAAGCAAT TGTTGGCCAA AATGCATTTA 9480

GTCATGAATC AGGTATTCAC CAAGATGGCG TATTAAAACA TCGTGAAACA TATGAAATTA 9540

TGACACCTCA ACTTGTTGGT GTAAGCACGA CTGAACTTCC ATTAGGAAAA TTATCTGGTA 9600

AACACGCCTT CTCAGAGAAG TTAAAAGCAT TAGGTTATGA CATTGATAAA GAAGCGCAAA 9660

TAGATTTATT TAAACAATTC AAGGCCATTG CGGACAAAAA GAAATCTGTT TCAGATAGAG 9720 

ATATTCATGC GATTATTCAA GGTTCTGAGC ATGAGCATCA AGCACTTTAT AAATTGGAAA 9780 
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25 



AAGAGGGTCA TATTTACCAG GATTCAAGTA TTGGTACTGG TTCAATCGTA GCAATTTACA 9900

ATGCAGTTGA TCGTATTTTC CAGAAAGAAA CAGAATTAAT TGATTATCGT ATTAATTCTG 9960

TCACTGAAGG TACTGATGCC CAAGCAGAAG TACATGTAAA TTTATTGATT GAAGGTAAGA 10020

CTGTCAATGG CTTTGGTATT GATCATGATA TTTTACAAGC CTCTTGTAAA GCATACGTAG 10080

AAGCACATGC TAAATTTGCA GCTGAAAATG TTGAGAAGGT AGGTAATTAA TTATGACTTA 10140

TAACATTGTT GCCCTACCTG GTGATGGAAT CGGTCCAGAA ATTTTGAACG GATCTCTATC 10200

ATTGCTTGAA ATTATAAGTA ATAAATATAA CTTTAATTAT CAAATAGAGC ACCACGAATT 10260

TGGTGGTGCC TCTATTGATA CATTCGGCGA GCCTTTAACT GAGAAAACCT TAAATGCGTG 10320

TAAAAGAGCA GATGCTATTT TACTGGGTGC AATCGGTGGA CCTAAATGGA CAGATCCTAA 10380

CAATCGACCA GAACAAGGAT TATTAAAATT GGGTAAATCC TTAAATTTAT TTGTAAATAT 10440

ACGCCCCACT ACCGTTGTCA AAGGCGCTAG TTCTTTATCA CCTTTAAAGG AAGAACGCGT 10500

TGAAGGCACA GATTTAGTTA TAGTCCGTGA ATTGACAAGT GGTATTTATT TTGGAGAACC 10560

TAGACATTTT AATAATCACG AGGCCTTAGA TTCTCTTACT TATACAAGAG AAGAAATAGA 10620

ACGCATTGTT CACGTAGCAT TTAAATTGGC CGCTTCAAGA CGAGGAAAAC TAACATCAGT 10680

TGATAAAGAA AATGTATTAG CTTCTAGTAA ATTGTGGCGC AAAGTCGTAA ATGAAGTAAG 10740

TCAATTATAT CCAGAAGTAA CAGTAAATCA CTTATTTGTT GATGCTTGTA GTATGCATTT 10800

AATCACAAAT CCAAAACAAT TTGACGTCAT CGTATGTGAA AACTTATTTG GCGATATTTT 10860

AAGTGATGAA GCTTCAGTGA TTCCTGGTTC ACTTGGTTTA TCACCTTCTG CTAGTTTTAG 10920

TAACGATGGT CCAAGATTGT ATGAGCCTAT TCATGGATCA GCACCAGATA TTGCAGGTAA 10980

AAACGTTGCC AATCCATTTG GAATGATTCT ATCTTTAGCG ATGTGTTTAC GTGAAAGCTT 11040

AAATCAACCA GATGCTGCAG ATGAATTAGA ACAACATATT TATAGCATGA TTGAACATGG 11100

GCAAACGACA GCAGATTTAG GCGGCAAATT GAATACTACT GATATTTTCG AAATTCTATC 11160

TCAAAAATTG AATCACTAAG GGGGAGATGT AAATGGGTCA AACATTATTT GACAAGGTGT 11220

GGAACAGACA TGTGTTATAC GGGAAATTGG GCGAACCGCA ACTATTATAC ATTGATTTAC 11280

ACCTTATACA TGAAGTTACT TCTCCTCAAG CATTTGAAGG ACTTAGGCTT CAAAACAGAA 11340

AATTAAGACG CCCAGATTTA ACATTTGCAA CACTCGATCA CAATGTTCCT ACTATTGATA 11400

TATTCAATAT TAAAGATGAA ATTGCAAACA AACAAATCAC AACATTACAA AAAAACGCCA 11460

TAGATTTTGG GGTGCATATT TTTGATATGG GTTCTGATGA ACAAGGTATT GTTCACATGG 11520

TAGGACCTGA GACAGGACTT ACACAGCCTG GCAAGACAAT CGTTTGTGGT GACTCTCACA 11580
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ATGTTTTCGC AACTCAAACG 


CTATGGCAAA CAAAACCCAA AAACTTAAAA ATCGATATTA 11700

ATGGTACCTT ACCAACAGGC GTCTATGCTA AGGACATTAT TCTGCATTTA ATTAAAACGT 11760

ATGGTGTTGA CTTTGGTACA GGCTATGCTT TGGAATTTAC TGGCGAAACA ATTAAAAACC 11820

TTTCAATGGA TGGTCGAATG ACTATTTGTA ACATGGCTAT CGAAGGTGGT GCCAAATACG 11880

GCATAATCCA ACCTGATGAT ATAACATTTG AATATGTTAA AGGGAGACCA TTTGCCGATA 11940

ACTtCGCTAA ATCAGTTGAT AAGTGGCGTG AgCTATATTC TGATGACGAC GCGATATTTG 12000

ATCGTGTAAT TGAACTTGAT GTTTCAACAT TAGAACCACA AGTGACATGG GGAACTAATC 12060

CTGAAATGGG TGTTAATTTC AGTGAACCAT TCCCTGAAAT CAATGATATC AACGATCAAC 12120

GTGCGTATGA TTATATGGGG TTAGAACCAG GTCAAAAAGC TGAAGACATC GACTTAGGGT 12180

ATGTTTTTCT CGGTTCATGT ACAAATGCTA GACTATCAGA TTTGATTGAA GCTAGTCATA 12240

TTGTTAAAGG AAATAAAGTT CATCCAAATA TTACAGCTAT TGTCGTACCA GGTTCTCX3TA 12300

CAGTAAAAAA AGAAGCAGAA AAATTAGGTC TAGATACTAT CTTTAAAAAT GCAGGATTTG 12360

AATGGCGTGA ACCAGGATGT TCAATGTGTT TAGGCATGAA TCCTGACCAA GTACCTGAGG 12420

GCGTACATTG TGCATCTACA AGTAATCGAA ACTTTGAAGG ACGACAAGGC AAAGGTGCAA 12480

GAACACATTT AGTATCCCcT GCTATGGCAG CAGCAGCAGC TATTCATGGT AAATTTGTGG 12540

ACGTAAGAAA GGTGGTTGTT TAAATGGCAG CAATCAAACC TATTACAACA TATAAAGGTA 12600

AAATAGTCCC TCTCTTCAAC GACAATATCG ATACAGACCA AATCATTCCT AAGGTACACT 12660

TAAAGCGTAT TTCAAAAAGT GGCTTTGGTC CATTTGCTTT TGATGAATGG CGGTACTTAC 12720

CTGATGGTTC AGATAATCCT GATTTCAATC CTAACAAACC ACAATATAAA GGGGCTTCTA 12780

TTTTAATTAC TGGAGATAAT TTTGGATGTG GTTCAAGTCG TGAACATGCT GCTTGGGCTC 12840

TTAAGGACTA TGGTTTTCAT ATTATTATTG CAGGAAGTTT CAGTGACATA TTTTATATGA 12900

ATTGCACTAA AAATGCGATG TTGCCTATCG TTTTAGAAAA AAGTGCCCGT GAACATCTTG 12960

CACAATATGT TGAAATTGAG GTCGATTTAC CAAATCAAAC TGTGTCATCA CCAGACAAGC 13020

GTTTCCATTT TGAAATTGAT GAAACTTGGA AGAATAAACT TGTAAATGGC TTAGATGACA 13080

TTGCAATCAC CCTACAATAT GAATCATTAA TAGAAAAATA TGAAAAATCa CTTTAAGGGA 13140

GTTGAATATT ATGACAGTCA AAACAACAGT TTCTACGAAA GATATCGATG AGGCATTTTT 13200

AAGACTTAAA GATATTGTCA AAGAAACACC TTTACAATTA GACCATTACT TATCTCAAAA 13260

GTATGATTGT AAAGTCTATT TAAAACGAGA AGATTTACAA TGGGTACGTT CTTTTAAATT 13320

AAGAGGTGCT TACAACGCTA TTTCTGTTTT ATCAGATGAA GCTAAAAGTA AAGGTATTAC 13380
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SO 



AAACGCTGTT ATCTTTATGC CAGTCACTAC ACCTTTACAA AAGGTAAATC AAGTAAAGTT 13500

CTTTGGAAAT AGTAACGTTG AAGTTGTACT CACTGGTGAT ACATTTGATC ACTGTTTAGC 13560 

TGAAGCTTTA ACTTATACAA GTGAACATCA AATGAACTTT ATAGATCCAT TCAATAATGT 13620 

TCATACAATT TCTGGACAAG GTACGCTTGC TAAAGAAATG CTAGAACAAG CAAAGTCTGA 13680 

CAATGTTAAC TTTGATTATC TATTTGCCGC AATTGGTGGT GGCGGTTTAA TTTCAGGTAT 13740 

TAGTACTTAC TTTAAAACCT ATTCACCTAC CACGAAAATT ATAGGTGTTG AACCTTCAGG 13800

TGCAAGTAGT ATGTATGAAT CTGTTGTGGT AAATAATCAG GTAGTCACAT TGCCTAATAT 13860 

CGATAAATTT GTGGACGGTG CATCTGTAGC TAGAGTTGGC GATATTACAT TTGAAATTGC 13920

AAAAGAAAAT GTAGATGATT ACGTTCAAGT AGATGAAGGT GCAGTTTGTT CTACGATTTT 13980 

AGATATGTAT TCAAAACAAG CAATTGTAGC AGAACCTGCT GGCGCATTAA GTGTAAGTGC 14040 

GCTTGAAAAC TATAAAGATC ATATTAAAGG TAAAACAGTG GTTTGTGTCA TTAGTGGTGG 14100 

TAATAATGAT ATTAATCGAA TGAAAGAAAT TGAAGAACGT TCATTACTAT ACGAAGAAAT 14160 

GAAGCATTAC TTTATCTTAA ATTTCCCTCA ACGTCCAGGT GCATTGAGAG AATTTGTAAA 14220 

TGACGTATTA GGACCTCAAG ACGATATTAC TAAATTTGAA TACTTAAAAA AATCTTCTCA 14280

AAATACAGGT ACTGTCATTA TTGGTATTCA ACTTAAAGAT CATGATGATT TAATACAACT 14340

CAAACAACGT GTAAAtCATT TCGATCCTTC CAATATTTAT ATTAATGAAA ATAAGATGTT 14400 

ATATTCATTG TTAATTTAAC ACATAGTAAG AAAAACAGTC ATAAATTGAT TTCTAATTGA 14460 

AATCATCTTA TGACTGCTTT TTATTATACT TTACATTTCT CGTTTCGTCA GATTCAAACG 14520 

TTTTCACTTC GCCAAGCCAT CTTTCTTTGT GTTTGCTTTT aTTTTGACGT TTTAGACATA 14580

AAAAAaGAGA CCTTGGGGTC TCAATGCGGC TCATCGCATC CACTTTTTGC CTGGCAACGT 14640 

TCTACTCTAG CGGAACGTAA GTTCGaCTAC CATCGACGCT AAGGAGCTTA ACTTCTGTGT 14700 

TCGGCATGGG AACAGGTGTG ACCTCCTTGC TATAGTCACC AGACATATGA ATGTAATTTA 14760 

TACATTCAAA ACTAGATAGT AAGTAAAAGT GATTTTGCTT CGCAAAACAT TTATTTTGAT 14820

TAAGTCTTCG ATCGATTAGT ATTCGTCAGC TCCACATGTC ACCATGCTTC CACCTCGAAC 14880

CTATTAACCT CATCATCTTT GAGGGATCTT ATAACCGAAG TTGGGAAATC TCATCTTGAG 14940 

GGGGGCTTCA TGCTTAGATG CTTTCAGCAC TTATCCCGTC CACACATAGC TACCCAGCTA 15000 

TGCCGTTGGC ACGACAACTG GTACACCAGA GGTATGTCCA TCCCGGTCCT CTCGTACTAA 15060 

GGACAGCTCC TCTGAAATTT CCTACGCCCA CGACGGATAG GGACCGAACT GTCTCACGAC 15120 

GTTCTGAACC CAGCTCGCGT ACCGCTTTaA TGGGCGAACA GCCCAACCCT TGGGACCGAC 15180
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GTGGAACTT 15249 
(2) INFORMATION FOR SEQ ID NO: 103: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 14051 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 103: 

GTGGCAATAT TTCTAGTTCT CGTTTTGATA AGATTTTAAA AGGATCTGTT GTGTTTGCAG 60 

TGTCCTGATT TGAATTAGAT ACAAATTCAT TCACTAAAGA TGTTGTAAGT TTCATATCTA 120 

CATATGTTTC ACCTTTATAT ACAGTTCGAA TAGCTAACAA TAATTGTTCA TCAGGTGCAT 180 

TTTTCAATAT GTAACCTTTC GCACCATTAC GCAACACATG GAACAAATAC TCCTCATCAT 240

CAAACATTGT TAATATTAGT ATTTTAGTTT CAGGAAAACT GTCAGCAATT TTACTCGTAG 300 

CGATAAGACC TGACTCACCT GGTGGcATAC TTAAATCCAT TAGTAACACA TCAGGTTTAt 360 

ATTCCATTAC TTTTTGGTAA GCTTCGACGC CATCTGCAGC CGTTGCAACA ACTTCCATAT 420

CATTTTGATA ATTTAAAATC ATAGAGAACC CCGTACGGAC AACAGCGTGA TCATCGGCAA 480

TGACTATTTT CAATTTTATT CCCCCAATGT ATGTTTCAAA TTGGAATGTT CAATGTAACA 540 

TTGGTACCCT CACCAATTTT CGTTTCAATA TTGACGCTAC CGCTGACTAA CTCAGCTCGC 600 

TCATTCATTC CATATAAACC GAGTCCAGAA CCTTTAGGCT TAGAACTTGG ATCAAAACCA 660 

TTTCCCGCAT CTATCACTTC TGCTACCAAA TGGCGCCCAG TTTGACGGAT ACCTACATTT 720 

ATTTCATTTA CATCAGCGTA TTTCAACGCA TTTAAAATAG CTTCTTGCAC TACTCGATAA 780 

ACAACCGTTT CAATATCACT ATCAAAGCGA GTATTTTTAA TATTTGATGT ATATATGATT 840 

TTTATTCCAT AATTTTCTTC AAACTGTTTA AAATATGATT TAAAAGCTGC TTCAAGGCCT 900 

AGATCATCCA AAGAAGCGGG TCTTAATTCA ACCGACATAT TACGTATATC ATCAATTAAT 960 

TTAGCGACAA TATATTCAAT ATTTTCTGCG TCTTCCAAAA GCTTAGTTGT ATCTTCTTGA 1020 

TATTTTAATA ATCTCAATTG AACATCTACA TTGAGCATTT CTTGAATCAC ACTATCATGT 1080

AACTCTCTAG AAATTCGCTT TCTTTCATTT TCTTGGGCTG AGATTGTTTT ACGCATCATA 1140 

CGTTGTTGAT GCAATTTCTC TTGCTGTTCA ATTTGTGATG AAACATTTTG AAGCGTAAAT 1200 

GCATGAATTC CCCTGTCTTG ATCAATCAAC TGATATGTTG CTGTAAATGG CATCACTTTT 1260 

TGATCTTTCG TCTTCATAAA TACTTGGAAA TTCGTAGCTT GTACTTGCAT CGATTCTAAG 1320 
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ATCGCATTCG CCACAGCACT 


GTAATTATCT TCTTCAGATA ATATATCTTT AGCAGCATCA 1440

TTCATTGCAA TAATTTTACC GTTATCATCA GCAAAAACTA TCTTTTCGAT TGAATGCTCA 1500

TAATATTTTT TCAATAAAGT ATCTAACTGT ATACTGTCCT CATTAATCAT GACTTACACC 1560

CTAATTCATC TCATTATTTA TCATCATTGA AAATACCAAA CTTACGTTGA ATATCATCAT 1620

TATCAAATAT TTTTGGTAAA GGACGACCAT CTCTTTGACC AAATAATAGT ACGCCATACA 1680

CTTGATTCTT ATACCAAAGC GGCACTGCTA AAACTGCTGT TAATGATTCG CTCAATAAAA 1740

TTGGATAGTC AATCTTTTCT TCAGGCCCTA AAGCTAAACC AACATTGGCT ATTACCATAC 1800

GCTTTCCTGT TTTCATAACA GTTCCAGCTA ATCCACGACC TTTTCTTAAA ATAATCAATT 1860

TAAATCGATT ATTTTTATTA CCTGAAACAT AGTGCCATTT TATTGGAGAT GATGGTTTGT 1920

TAGATTCATA GAAAGCGATT GCCGCAAAAT CATAACCCTC TTCTTTGCGT ATTTTATCTA 1980

ATGTCTCTTG AAATCTACGA TCTTCAATTA TTGCTTCTGG TGTCAAATCC TTTCACCTCT 2040

TATGCTTACA CTTTATTCTT ACGGTAAATA ATATATCTGC GATTTATATA TGTCAAAGGT 2100

ACACTCCAAA CATGCACCAA ACGTGTAAAT GGCCAACAAG CCATAATAGT GAAACCTAAC 2160

AATATATGCA TTTTAAATGC AATCGGCACA CCACTCATCA ATGACGCATC TGGTTTTAAC 2220

ATAAATAATT GTCTAAACCA AATTGATAAT GAAGTTCTGT AGTTAAAGTC TGGATGTTGT 2280

ATATTTGTTA CTAATGTTGC GTAACATCCC ATAAATACGA TAAGTAATAA TAAGAAATTT 2340

ACAAATATAT CCGACGCTGA ACTTAATCTT CGAATACTTT TCGTAGTAAC ACGTCTCGCT 2400

GTTAATAAAA ACATCCCTAT CAAAGTTATT ATACCAAAGA TGCTACCAAT ATAAACAGCG 2460

CCTATATGAT ATAAATGCTC AGACACACCC ACTGCATCCA TCCATGGTTT CGGTATTAAC 2520

AATCCAACTA CGTGTCCAAA AAACACTGGA ATAATACCTA AGTGAAATAA TAAACTTCCC 2580

CACATCAACC TTTTTCTTTC TATTAATTCA CTAGATTTAG CTGTCCAAGA AAATTTATCA 2640

TAACGATAAC GTGCAATATG ACCTGCGACA AAGACAACTA AACATAAATA CGGAAATATA 2700

ACCCATAAAA ACTGATTAAG CATGATGTTT CACTCCTTTT GGTGATGTCA AACATAATTT 2760

CAATGTTTTT CTAAGTGCTT GAATCACATA GGCATATGGA TTGTTATCTT CACCAAGTGC 2820

ATTCGCCATC ACATATGTTC CATCCTCAAT AATCATAATG ATTAATTGAA TATTCTCTTC 2880

AGCTCTTGGA TCATTTCGCC ATTCTGCCAC TTGCAAAAAT TGAAGCATCA ACGGTAGATA 2940

ATCAGAAAGT TCATTATCTA CCATTTCTAG TCCAAACATT TCATATAATA CCTTTAATTT 3000

AGCTAACATT TGCCCACGTT CTTTTTGCGT ATCAAATTTG TTATACGTCA TATATAATGG 3060

TGCTTTTTTC GTAAAATCAA ATGTATCTGT ATAAATCGCT TTGATTTCTG ATAATGAAAA 3120
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TGTTTCTTCA AAAGTTTTTG GATGAAAAGT TAATTTTTCT GGAAAACATA ACTGTTGTGC 3240 

CATATATCCA AAACTTTCTT GATATTTTTT AAAATTATCG AAATTAATCA CGGAAAATCC 3300 

CTCCATAGAA ATTCTCATTA TAAATTTCTT GACCAGTTTT CCCTGAACCT ACTGCAACGC 3360 

CACAGCCTTC ACAGTTATCT CCAAAATGCT CGCCGCCGTA ATTGTATCCT GTACTACCTT 3420 

GTGCGTGATA CGTATCTAAA TAGGTTTCTT TGTGTGATGT TGGAATAACA AATCGATCTT 3480 

CATATTTGGC TAGTCCTAAT AAACGATACA TGTCTTTAGT TTGGCGCTCG GTTATACCTA 3540 

ATCGCTCTAA TCGAGACGTG TCAAATGGCT GTTGAGTAAC TTGAGATCTC ATATAACTTC 3600 

TCATCATTGC CATACGTTGT AGGGCTCCTT TTACTGGCTC TGTATCTCCT GCAGTGAAAA 3660 

TATTAGCTAA GTATTCAATA GGTAAACGCA TTTCTTCAAT GGCTGGGAAA ATCGCATCTG 3720 

GATTTTGAGT TGTATTTTTA CCTTCAAAAT AGCTCATAAT TGGGCTAAGT GGTGGGCAAT 3780

ACCAAACCAT CGGCATCGTT CTAAATTCAG GATGTAACGG AAATGCAAGT TTATATTCAA 3840

TTGCTAACTT ATAAATTGGA GAGTTTTGTG CAGCTTCAAT CCAATCGTAA CCAATACCAT 3900 

CTTTTTCAGC TTGAGCAATG ACTTCTTCGT CAAATGGGTT TAAGAATATA TCTAATTGTT 3960 

TTTCATATAA ATCTTTCTCG TCTACTGCTG AAGCTGCTTC ATGAACTCGA TCTGCATCAT 4020 

ATAATAAAAC ACCTAAGTAA CGCATACGTC CTGTACAAGT TTCAGAGCAT ACCGTAGGCA 4080 

TACCCGCCTC GATTCTCGGG AAACAGAAAG TACACTTTTC AGCTTTGTTC GTTTTCCAAT 4140 

TGAAGTAAAC TTTCTTATAT GGACAACCTG TCATACAGTA ACGCCATCCA CGACATGCGT 4200

CTTGGTCAAC TAATACAATG CCATCTTCAT CACGTTTATA CATAGCACCT GAAGGACACG 4260

ATGCAACGCA ACTTGGATTC AAGCAATGTT CACATAAACG TGGTAAATAC ATCATAAAAG 4320

TTTCGTCAAA TTGGAATTTA ATATCTTCTT CTATTTTTTG GATGTTAGGA TCTTTTGGAC 4380

CTGfAACATG ACCACCTGCT AAGTCATCTT CCCAGTTAGG TCCCCATTCA ATTTCAATGT 4440

TATCCCCCGT AATTTCTGAA TACGCTCTAG CAACTGGCGA ATGCTTCCCT GATTTCGCAG 4500

TTGTTAAATG TTCATAATTA TAGTTCCATG GCTCATAATA ATCTTTAATT AATGGCATAT 4560 

CTGGGTTATA AAAAATTTTA CCTAAAGCAA TTTTTGAAAT TCTACTTCCA GATTTTAATT 4620 

CAAGTTTCCC TTTACGATTT AGTACCCAAC CACCTTTGTA GTGTTCTTGG TCTTCCCAAC 4680

GTTTCGGATA CCCTACACCT GGCtTCGTTT CTACGTTGTT GAACCACATG TACTCAGCAC 4740

CTGGACGATT TGTCCaAGTG TTTTTACATG TCACACTACA CGTATGGCAT CCTATGCATT 4800

TATCTAAATT TAATACCATC GCAAcTTGCG CTTTAATCTT CAAGCCAATT AACCTCCTTC 4860

ATCTTTCTAA CTGCTACATA TAAATCCCTT TGGTTCCCAA TTGGTCCATA ATAATTAAAG 4920
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GGCGCGTTGT GTGAACCACC ACGTGTATCT GTAATTTCTG ACCCAGGCGT TTGAATATGT 5040 

TTATCTTGTG CATGATACAT AAACATTGTA CCTTTAGGCA TACGATGCGA AATAACTGCT 5100 

CTTGCCGTTA CAACACCATT ACGGTTATAC ACTTCTAGCC AATCATTATC TTGGATATCG 5160

TGTTTTTCAG CATCTTCATT TGATATCCAA ACCGTTGGAC CACCTCTAAA TAGTGTCAAC 5220 

ATATGCTTAT TATCTTGATA CATTGAGTGT ATATTCCATT TTCCATGAGG CGTTAAATAA 5280 

CGCAgTACCA AAGCATCTGT ACCACCTTTA ATTTTCTTAT CTCTATTCCC AAATACCATT 5340 

GGCGGCAATG TCGGTTTATA TACTGGTAAG CTCTCCCCAA ATTGTTGGAA AACTTCGTGA 5400 

TCCACATAAT AACTTTGACG TCCTGTTAAT GTTCTAAAAG GTACTAGACG TTCTATATTC 5460 

GTTGTAAATG GTGAATATCG TCGACCTTGT TTATTTGAAC CTGGGAATAC TGCTGTCGGT 5520 

ATTACTTCTC GTGGTTGTGA AGTTATATTT AAAAACGAAA TTTTCTCAGC AGCGCGTTCG 5580 

CTAGAAATAT CTTTTAACGG CATTCCAGTT TGTTCTTCGA GATCTTCATA TGATTTTTGT 5640

GATAATTTAC CATTCGTAGC AGATGAAATA CTTAGTATTG CATCAGCTAC ATTACGTGCT 5700 

GTATCAATAC GTGGACGATT CGCTCTCACA GAATCATCAT TTGTATCACT CCACGTACCT 5760 

AACATACTTT TTAATTCTTC ATATTGTTCA CTGACACCGA AACTTACACC ATGTGCTCCA 5820

ACTTTCCCTT TTTCAAGTAC AGGACCAAGC GTGACATATT TGTCGTAAAT TTTAGTGTAG 5880 

TCGCGTTCTA CAATTGCAAA GTTAGGCATT GTACGTCCAG GTACCGCTTC AATTTCACCC 5940 

TTCGACCAAT CTTTCACTAC GCCGTATGGT GTTGAAATTT CTTGCTTTGT ATCATGACTA 6000

AGTGGAGTTG TCACAACATC TTTAAACGTT CCAGGTAAAT AGTCTTTTGC CATTTCTGAA 6060

AATGCTTTTG CCAACGTTTT ATAAATATCC CAGTCTGAAC GCGATTCCCA TAACGGATCA 6120

ATGGCAGGAT TGAAAGGATG TACATATGGA TGCATATCCG TTGATGATAA ATCATGTTTT 6180 

TCATACCAAG TCGCTGCCGG CAAAACAATG TCAGAATATA ACGGTGTTGC CGTCATTCTG 6240 

AAGTCTAAAG AGACCACTAA ATCTAACTTA CCTGTTGTTT CTTCACGCCA CGTAATTTCT 6300 

TCTGGCTTTT CATCTTCATT TGGTGTAGCT AATAACCCTG ATTTTGTGCC AAGTAAATGC 6360 

TTCATAAAGT ATTCTTGACC TTTTGCAGAA CTTGAAATTA AGTTTGAACG CCATATAAAT 6420 

AATGATTTTG GATGATTCTT TTTCAAATCA GGATCTTCTA TTGCAAATTG TGTTTGTTTT 6480

GATTTCACTT CATCAATTGC ACGTTGCAAA ATCGCTTCAT TTGAATCTAT ACCTTCATCT 6540

TTAGCTTCTT CTGCAAACAA CAAACTATTT TTATTAAATT GTGGATATGA TGGTAACCAA 6600

CCAAGTCTAG CTGCTAAAAC ATTATAATCA GCTGGATGTT GATGCTTTAA CTCCTCTGTT 6660

TTAGCTAATG GAGATTTTAA ACGATCTACA TTTGACTCTT CATATTTCCA TTGGTCTGTT 6720 
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AATGCGACAG TACTCCATCC TTCAATCGGA CGACATTTTT CTTGTCCCAC ATAGTGAGCC 6840 

CAACCGCCAC CATTCACACC TTGACAGCCA CATAACATAA CTAAGTTTAA GATTGAACGA 6900 

TAAATCGTAT CTGAGTTAAA CCAATGGTTA ATACCCGCAC CCATGATAAT CATTGAACGC 6960 

CCTTCAGTAT CGATAGCGTT TTGCGCAAAT TCTTTCGCTA CTTGAATGAC AACACTTTGT 7020 

TTTACGCCTG AAATGGCTTC TTGCCAAGCA GGTGTATATT TTGATTCTGC ATCGTCGTAT 7080 

CCTTTTGATT CTAATTTATG ATCAAAACGA CGCACGCCAT ATTGACTTGC CATTAAGTCA 7140

AAAATTGTAG CAATACGGAC TTTGTCACCA TTTGCTAAAG TGACTTGTCG AGTTGGAATT 7200 

GGACGATTGA ATATCCCATC TCCATCACTA TCAAAGTATG GGAATTGAAT TGTTTCTAAT 7260 

TCGTATCCAC CTTCTGTCAT TGATAATGTA GGGTTAATTT TAGAACCATC TTCTGTTTCT 7320 

AGTTTTAAGT TCCACTTCTT ACCTTCTTCC CAACGTTGAC CCATTGTGCC ATTAGGTACT 7380 

ACTAAACTAT CGCTGATTGC ATCATGAATA ACTGGCTTCC ATTCGCCTTG CTCTGTTGTT 7440

TGACCTAAGT CACTCGCTCT TAAAAATCGA CCCGCTTTAT ATCCATTTTC ATCTTCATCC 7500 

AGCATGATAA GAAACGGCAT ATCTGTATAT TGTTTAGCGT AATTTATAAA GCGTTCATTA 7560 

GGTTGATTAA CATAATGTTC TTGTAAAATA ACATGCGTCA TTGCTTGTGC AATTGCAGCA 7620

TCTGAACCAG GATTCGGTGC TAGCCAGTTA TCTGCAAATT TCACATTTTC TGCGTAATCT 7680

GGTGCTACTG AAATGACTTT TGTACCTTTA TAGCGGACTT CAGTCATAAA ATGTGCATCC 7740

GGAGTACGTG TTAAAGGTAC ATTAGAGCCC CACATAATAA TGTATGATGC GTTATACCAG 7800 

TCACTTGATT CAGGCACATC TGTTTGCTCT CCCCAAATTT GTGGAGAGGC AGGTGGTAAA 7860 

TCTGCATACC AGTCATAAAA ACTAAGCATT TCACCACCAA GCAAATTGAT GAATCGAGCA 7920 

CCTGCTGCAT AACTAATCAT TGACATCGCT GGAATAGGTG TAAATCCTGC GATTCGATCT 7980 

GGACCATATT TTTTTATTGT ATACAGTAAT TGTGCTGCGA TTATCTCTGT AACGTCTTTC 8040 

CAATTTGAAC GCACGTGCCC TCCCATACCT CGGGCTTGCT TATATTGTTT GGCTTTGTCT 8100

TCATTTTCAA CAATAGACGC CCATGGAGCA ACGCGATTAC CATTGTTTTC TTCTAATGCT 8160 

TCAGTCCATA AATCCCAGAG TTTTCCACGA ATATATGGAT ATTTGATTCG AAGCGGACTG 8220 

TATTCATACC AAGAGAATGA CGCACCTCGT GGACATCCTC TCGGTTCATA TTCAGGCATA 8280

TCCGGACCAC AACTTGGATA GTCAGTTTGT TGATTTTCCC AGGTAATCAC ACCATTTTTC 8340

ACAAATACTT TCCAAGAACA TGAGCCTGTA CAGTTAACAC CATGTGTTGT TCTTACTTCT 8400 

TTATCGTGGC TCCAACGTTC TCTGTACATT TTTTCCCATT CTCTACTTTT ACTTTCTAGG 8460 

ATCGACCAAT TCCCATTAAA TTTTTCTGTT GGCTTAAAGA AATTCAATCC AAATTTTCCC 8520 
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TAAAATGCCC 


AAGACTATTG CTTTAATTAG ATTGTACATT TTTTCACAAA CATAAAATAT 8640

TAGGGAATCA CCTAATTACT TAAGGAATTT CCCTATCAAT AACGGGATTT CATTGAAATA 8700

ATACACAATC ATGTATGGTC ATGCTTATTG CCAATCTAAA TCGTTCAAAT TTGGCACAAC 8760

GACAAATAAG GCTTCAACAC GAATATATTC TCTCGGTTGA AACCTTACTT ATTCATTTAT 8820

TTTTTATAAA TTAGTGACAT AACACTGTAT TAGCATCTGC ACGATCGGTT GAAATATATG 8880

TTACATTTTC TTGCTGCTTA ATAAATGCAT CATAGTAATC ATATTGCGAC GAATGATATG 8940

TGCCATTCGA TGTATCATTT GGGTTTAGCA AACAGCCATA ACCTTCGTCA TATAAATGTT 9000

CACAGAGCAT AAGGGCGTCA TGTTTAGAAC CACTTACTAC ATAAAATTGC TTCATAGGAT 9060

CATATGATTT AGGAGTGTTT TCAGTATAAT CAACAACTTC CCCTATAATA CATATACCTG 9120

GTTTCGCCTC AATTGAATAG TGTTGCAATT TTGAAATAAT ATTACTTAAA CGCCCCTTAA 9180

CAACAAACTC GTTAAAACAC GATGCTTGAA AGACAATCGC TATCGGGTAA TCAATATCTG 9240

TGTATTGTTG TATCTGTGTG ATAATTTTCC CTAAACGTTT TACCCCCATA TAAATTGCTA 9300

ACGTGCCACC ATTCACTAAG GAATTGACAT CCACTTCATT TTCTTCTGAA TCTTTAAAGT 9360

GACCTGTAGA AAATGTCACA CTTTTAGCAA CTGTACGCAT TGTCAAACCT GTCTGCATAG 9420

TAGCAACTGc tGCGCTCGCT GATGTCACCC CTGGTACAAT TTCAAACGCA ATATGATGTT 9480

CATTTAGTAT GTCGACTTCT TCTTGCACAC GACCAAATAT CGCTGGATCG CCACCTTTAA 9540

GTCTAACAAC CTTGTTATAT CGACGCGCTG CTTCCACGAT ACAGTCATTT ATTTTTTCTT 9600

GCTGAATATG TTTTGCATAC GGCTTTTTAC CAACATCGAT AATTTCAGTA GTCAAATTCG 9660

CATATTGTAA AATTAACGGA TTCACTAATC GATCATATAG AATGACATCC gCTTCACGTA 9720

TTAAACGCTC AGCCTTTTTC GTCAAATAAT TCGGATTACC TGGACCCGCA CCTATCAAGT 9780

AAACCTTGCC ATATTCCTCT ACAGACATAT ATATACGTTC CCGTCTGTAA CTTCTACCTC 9840

ATAAACATCT ACACAACCTT CATCAGGTTC TTGAACAATA CCTGTATTTA AATCAATTTT 9900

TTGATCGTGG AGCGGGCAAA ATACATATTC CCCACTCACT GTCCCTTCAG ACAATGGTCC 9960

TTGTTTGTGT GGACAGATAT TGTGAATCGC ATGAATTTTG CCACTTTCTG TTAAAAACAA 10020

CCCTACCTCT TTGCCTTTGA CAATAACCTT TTTTCCAATT AGGGGTGTTA ATTCATCTAT 10080

AGTTGTCACT TTAATTTTTT CTTTTGTTTC CATGTATTAC ACCTTCTCCA CTTCAAAAAT 10140

TCTACGTGCT TGAGCATTGC TAGTTATTGC TTCCCAAGGT TCAGCTTCGA CTGCTTTTTT 10200

AGCATCCATA ATGCGTTCAA ATAGTTCATT TTGTCTTTCT GGGTCAAGTA AGACTTCTTT 10260

TACATTTTCA AATCCAAGTC TTCTTAACCA TGGCGCTGTT CTTTCAGCAT ATATACCTGT 10320

AGTTGTTAAA AATTCAGCTT TTTCAACTTC TGTACCACCA TTACCACCGA TATAGATTTG 10440 

GAATCCATTT TCAACTGAGA TAATACCAAA ATCTTTAACA CCTGATTCAA CACAACTTCT 10500 

TGGGCAGCCT GATACACCCA TTTTGAATTT ATGAGGTGTA TCGATGTATT CAAATGTTTT 10560 

TTCTAAACGA ATGCCAAGTC GTGTCGTGTA TTGCGTACCA AATCGACAAA ACTCTTTACC 10620 

AACACAGCTT TTAACTGAGC GTGTTTTCTT ACCATAAGCT GATGcTGAAC GCATACCTAG 10680

GTCTTCCCAT ATATTTGGTA ATTCTTCTTT TTTAACTCCA TACAAACCAA CACGTTGTGA 10740 

ACCTGTCACT TTAACTAGTG GCACATGATA TTTCTTAGCC ACTTCTCCTA GACGAATCAG 10800 

TTGGTCTGCA TCTGTAACAC CCCCACGCAT TTGAGGTATA ACAGAAAATG TACCATCATT 10860 

TTGAATATTC GCATGGTAAC GTTCGTTAGC AAATCTTGAT TCTCTTTCAT CTTCATGATC 10920 

ATGTGGATAA ACCATGTTTA AATAATAGTT GATTGCTGGT CGACATTTTG GACATCCACC 10980 

TTTATTTTTA AAGTTTAAAA CATGTCGAAC TTCTTTAGAT GTTTTTAAAC CTTTCGCTCT 11040 

TATTTGCGTT ACTATTTGAT CGCGTGTCAA ATCAGTACAA CCACATATAC CAGCAGGTTT 11100 

TGCGGCAACA AAGTCATCTC CTAAGGTGTG CTGCAATATT TGAGCAATTT GCGGTTTACA 11160 

TTTACCACAT GAATTCCCCG CTTTTGTTTT AGCCGTTACT TCTTCAACTG TTGTAAAGCC 11220 

ATTTTCCGTA ATCGCATTTA CTATAGTACC TTTATCAACA CCATTACAAC CACAAATTGT 11280 

TTCATCATCA GCCATATCAG CAATTGATAG CGATGCCTCT TCTCCACCTT TAGTAAGCAA 11340 

TGATACAAGT GTGTAATCTT CAGTGGATTC ACCTTTTTTC ATCATGTTAT AAAAGCGTGA 11400 

ACCATCATCG ATATCACCAT ATAGTACTGC ACCAACTACA TTACCGTCTT TTAAAAAGAT 11460 

TTTTTTATAG TTATTATCAA CACTATTAAA TATTTCAATA CCTTTAATTT CTGCATTTTC 11520 

TACAATTTGA CCAGCACTAT ACAAGTCACA CCCAGAAACT TTTAATGACG TAAATGTTGT 11580 

TGATCCCTTG TATCCGTTCG TTTCTTTATT TGTTAAATGA TCAGCTAATA CTTTACCTTG 11640

TTCATATAGT GGTGCAACGA GTCCATAAAC TTTGCCGTTA TGTTCTGCAC ATTCACCAAC 11700 

TGCATATACA TTGCTATCAC TTGTTTGCAT CACATCATTG ACAACAATAC CACGATTAAC 11760 

ATCTAGACCT GATTCTTTGG CTACTTCTGT GTATGGTCGT ATACCTACTG CCATAACAAC 11820

TAAGTCTGCC GGAATCTCGC GTCCATCAGC CAATTTAACA CCCTCAACAT CATCTTCTCC 11880 

TAAGATTTCA GTTGTGTTGG CTTGCATTTC AAACTTCATA CCTTGCTTTT CTAGATCTGC 11940 

TTTAAGCATA TTTCCAGCTT TACGGTCTAG TTGCATTTCC ATCAACCATT CAGCTAAATG 12000 

TAACACCGTT ACTTCCATAC CTTGATCTAA TAAACCACGT GCACACTCTA AACCTAGTAA 12060

TCCTCCACCA ATTACAATTG CTTTCTTTTT AGTCTTAGCA ATGTTCATCA TTTGTTCAGT 12120 
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GAATGCTTTA GAACCTGTCG CAAAAATCAA TTTATCGTAT GATACTTCAA TACCATTTGC 12240 

AGTAGTAACT GATTGATTTG CTCTATCTAC TTCAATTACA GGATCATTTG TAATTAACTC 12300

GATACCATGT TCCTCATACC ACTCATATGG ATTCATAATT GTTTCTTCAA CTGTCATTTT 12360 

ATTTTGTAAA ATATTTGAAA GCATGATGCG GTTATAGTTT GGATAAGGTT CTTTACCTAT 12420 

TACCGTAATA TCATATAAAT CGTTGGCGCG CTCTAATATT TCTTCGATTG TTCGAATGCC 12480

CGCCATACCG TTACCAATCA TTACTAGTTT TTGCTTTGCC ATAAAATATG CCCCTTTACT 12540 

CCATAATATT TATTTCAAAA AAAGGTATTA ATTTTTCGTT AGTGCTTTTA TATTTTCATT 12600

GGAATCATTA AGCTTTCTAA TCTATCGTTA ATGATTTGCT TTAAAATTGG GTCGAAGTTA 12660

ATTGAAGGTG TGAAGTGTAT ATCTGTATTA ATAACCATGT CATTCATTTG CTGCTTCACT 12720 

TTGTTAACAA GTCTTCCGTC ATATAAAAAT AATGGTACGA CAATCAATTT TTGATACCGT 12780 

TTCGAGATGC TTTCTAAATC ATGTGTAAAA CTAATCTCTC CATATAGCGT TCTCGCATAT 12840

GTCGGCTTGC TAATTTGCAA ATTTTGAGCG CATATTTGTA ACTCTTCGTG TGCCTTAGTA 12900

AACTTTCCAT TAATATTGCC GTGTGCAACA ACCATAACTC CAACTTGTTG TTCGTCACCT 12960

GCTAATGCGT CACAAATACG TTGTTCAATT AATCGTCTCA TTAAAGGATG TGTGCCAAGT 13020

GGCTCGCTTA CTTCTACCTT TATGTCTGGA TACCGTCGTT TCATTTCATG AACGATATTC 13080

GGTATATCCT TGAGATAATG CATTGCACTA AAGATTAGCA ATGGTACAAT TTTAAAATGG 13140

TCAACCCCAC TTTGAATCaA CGTCGTCaTT ACCGTCTCTA AATCCtGATG CTCACTTTCt 13200 

AAAAACGCAA TATCATAGTG ATGTATATCA TCTTTTACTA ATTCAGAAAT AAATGCTTCT 13260 

AACGCTTGaT TCTGTCGTCC GTGCCTCATG CCATGTGCAA CAATGATATT CCCATTCACA 13320 

TTTACCAACC CTTTCACACG TATTGTATAC CAAATCATTT TGTTTTTGTG AAAAGAATCA 13380 

CATTATAATG TAAAATCAGG GAATTCCCTG ATGCCTGTAG TCATGCATAT TCCTTATACA 13440 

TTTTCCCTTT TTGTTAAATC AAAAAAAGCG ACCGATATAT GAATCCCTAC TCAACATTTA 13500

TTTGAGCAAG CATTAATATA TCGGTCGCTT GTAGTGTATA TTATTATCTT AAAATGGTGG 13560 

TTGGCCTAAT ATTGTTTCGT CAAAGCGCTC GGGTATCAAT ACTTTGCGCA TGATCACACC 13620 

TAAATCGCCA TCATCATTTT CATGTTCGCT GTATATTTCA TAACCTCTTT TTTCATAAAT 13680

TTTAAGTAAC CACGGATGCA ATCTTGCAGA TGTACCTAAA GTAACTGCCG CTGACTTTAA 13740

CGTATCTCGC AAAAATGCTT CTTCAACATA AGTAAGTAAT TGGCTACCAT AGCCTTTCCC 13800

TTCATACTCA GGATTTGTCG CAAACCACCA GACAAAAGGA TAACCCGAAA TACTTTTCAC 13860

ACTTCCCCAA GGATATCTAA CCGTAATCGT AGATATAATT TCATCATCAA TTGTCATGAC 13920
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CCAATCAATA GCTAGTTCTC TTAGAgGCGT AAATGCTTCA TGCATGAGTT CTTGCAATTT 14 040 
TTCTGCATCT T 14051

(2) INFORMATION FOR SEQ ID NO: 104: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1885 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 104: 

TAATCCTCAA CTTnGATTAT ATGGCTTGGG CGCATATGAA CTGCTTAGTT TAGTGTATGA 60

CATTCATACA GTTCGCATGA CTATCATACA ACCTCGAATA GATAACTTTT CTACTGAAGA 120

GTTACCAATC TCAAGATTAC TTCAATGGGG AACCGATTTT GTTAAACCCT TAGCCAGACT 180

TGCTTATAAC GGTGAAGGTG AGTTTAAAGC AGGTAGTCAT TGTAGATTCT GTAAGATAAA 240

GCATTCATGT AGAACACGTG CAGAATACAT GCAAAATGTG CCTCAAAAGC CACCACATTT 300

GTTGAGTGAT GAAGAGATTG CAGAACTTTT ATATAAACTG CCTGATATCA AAAAATGGGC 360

TGATGAAGTA GAGAAATATG CGTTAGAACA AGCGAAAGAG AATGATAAAA CGTATCCAGG 420

TTGGAAGCTA GTCACGGGAC GTTCAAGGAG AGTGATAACT GATACAAAAG CAGTCCGAGA 480

CAGGTTAGTT GAAGCGGGTT ATAAACCTGA AGATATTACA GAAACGAAGT TACTTAGCAT 540

TACGAATTTA GAAAAATTAA TCGGCAAAAA AGCATTTTCT AAAATTGCAG AAGGCTTTAT 600

AGAAAAGCCG CAAGGTAAAT TAACACTTGC TACCGAGTCT GATAAACGAC CAGCTATAAA 660

GCAATCTGCT GAAGATGATT TTGACAAACT ATAAAAATTA AAAAGGACGG TATATAAACA 720

TGA^AGCAAA AGTATTAAAT AAAACTAAAG TGATTACAGG AAAAGTAAGA GCATCATATG 780

CACaTATTTT TGaACCTCAC AGTATGCAAG AAGGGCAAGA AGCAAAGTAT TCAATCAGTT 840

TAATCATTCC TaAATCAGAT ACAAGTACGA TAAAAGCCAT TGAACAAGCT ATAGAAGCTG 900

CTAAAGAAGA AGGAAAAGTT AGTAAGTTTG GAGGCAAAGT TCCTGCAAAT CTGAAACTTC 960

CATTACGTGA TGGAGATACT GAAAGAGAAG ATGATGTGAA TTATCAAGAC GCTTATTTTA 1020

TTAACGCATC AAGCAAACAA GCACCTGGTA TTATTGACCA AAACAAAATT AGATTAACGG 1080

ATTCTGGAAC TATTGTAAGT GGTGACTATA TTAGAGCTTC AATCAATTTA TTTCCATTCA 1140

ACACAAATGG TAATAAGGGT ATCGCAGTTG GATTGAACAA CATTCAACTT GTAGAAAAAG 1200

GCGAACCTCT TGGCGGTGCA AGTGCAGCAG AAGATGATTT TGATGAATTA GACACTGATG 1260
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TTGAGGTGTC AAGAATTTGA AATTTATGAA TATAGATATT GAAACATACA GCAGTAACGA 1380 

TATTTCGAAA TGTGGTGCCT ATAAATACAC AGAAGCTGAA GATTTCGAAA TTTTAATTAT 1440 

AGCTTATTCG ATAGATGGTG GAGCGATTAG TGCGATTGAC ATGACTAAAG TAGATAATGA 1500

GCCTTTCCAC GCTGATTATG AGACGTTTAA AATTGCTCTA TTTGACCCTG CTGTAAAAAA 1560 

GTATGCATTC AATGCTAATT TCGAAAGAAC TTGTCTTGCT AAACATTTTA ATAAACAGAT 1620 

GCCACCTGAA GAATGGATTT GCACAATGGT TAATTCAATG CGTATTGGCT TACCTGCTTC 1680

GCTTGATAAA GTTGGAGAAG TTTTAAGACT ACAAAGCCAA AAAGATAAAG CAGGTAAAAA 1740 

TTTAATTCGT TATTTCTCTA TACCTTGTAA ACCAACAAAA GTTAATGGAG GAAGAACrAG 1800 

AAACCTACCT GAACATGATC TTGAAAAAtG GCAACAATTT ATAGATTaCT GTATTCGAGA 1860

TGTAGAAGTA GAAATGGCGA TTGCT 1885 
(2) INFORMATION FOR SEQ ID NO: 105:
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2656 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 105: 



TAATCCTTAG TTCACTGnCA AATTTCAAAA CACCAGTTCC CTCTATCTGC ATCCATAGAA 60

ACTGnATGTT TGTGTCAATA ACCGGATTAT ATTGTGATGn TGTTTGTAAC TCGATTAAGT 120

TATCATCTTT CGAAAAATTA TCTACTACCA TTATTCAACC ACCTTTCCTT CGAATAAACT 180

CCATTTACCA ACkCCACCAG TACCAAAGTT TCTAACTAAA AATTGATGTG CAGACGGGAA 240

GTTSTTACGT CTTAATACTT GTGTTGTATT ACCTGGTGTA TTCGATTTTA CTTCTAATAT 300

CCAACCTGCA ATACCTTTAA AGTCTTTAGG AAAATCAGTA AATCGGTTTG ATTCTTCAGT 360

AGTGATATAG AAATCTAAAC CAACGATTTT TAAATCTGAT AATTTTGTAA TACTCTTAGG 420

GATATGTTCC CAATAACCGG CGTTTTGCGG GCAGAAATTC CATGCTCCGT 480

ATTGAAAATG TCAATGACAC GTTCGAATTT AAGCATATTT CTACCTGTGC TGTTTCTGGt 540

AAGTACTTGT CTTAGAGCAC CATTATAGTG TCCAGGCAGT ACATCCAAGA ACCACCCTGC 600

ATCTCTAAAC GCTTTCGGTA ACGGGAAATC TAATGCATTT TGTGTGTCTT GaCGTATAGA 660

TATAGTAATG ACCAACTTCC GTAATATCAC TTAGATATGC TGGGTTCTGT ATTGGTAACG 720

GTTTAACACG TCCGCCTGAA TCAGTCATTG ATACTTGAGG TGCGATGTTT TTCAAGAATT 780
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TAGTTACCCC GATTAGAAGT 


GCTTTACGTC CTGTTTCTAG ATCGTAATAC ATATCTAGAC 900

CCTCAGCCTC TTGGAAATCT CCTTTAAAGT TGTTATTCAC ACCGCCTATA TCGATGCGAC 960

GTTTAAATAA CAATTCTTTC GTTTTGATAT CGAAGCCTTG TAAGTAGTTA GGGTTGGCTG 1020

TATTCGAATC ACCTGTATAC CAATATAAGA TACCTGCATC ATAAGTGATA CCTTGCATAG 1080

GTTGTGTATC TGAAGTGTAT TCCATAGGTA TATCCATTTG ATACAATACT TTGTCTATAC 1140

CTTTATCAAT ATCGTCAGCA CTTCTAACCT CAACAAAGTT CAACGAATTC TTAAGTTGTC 1200

TTTCAGTGGG TTTATATTCA CGTCTAAAAA TCATTAAATT TTCTACCGGA TTATAAATCG 1260

CTGACGTATA TCTGTCGTTA AATATATTCG GCATGACATC TTGCATTTCA TTACCATAAG 1320

TTATTTCTCC AGTTCTATAT TGGAAACGTA CAAACTTGTT GTTTTTGTTA CTGTCCAATA 1380

CAGCTGAATA AATCCATAAT TCTCCATCAA TGTATCTATA CGCATTGTGT GTACCGTGAC 1440

CGCCGTTTTT AACAAGCAAT CTATCAATAA ATTGTCCGTT GGGCTTCAAT CTAGATAACA 1500

TGTAATGATT ACCTGGACGA GCTTGCGTCA TATAAATAAT GGGTCTACCC 1560

AAAATGATTG CATTACTGCA TTTGTATATG GCGATAAATC AGTGATAAAT TCCGGTTCTT 1620

GCTCTTTTGG TTCGAATCGG TATTCTGTCG CTCGATATTC TTTATAGTGT TCATCTACAG 1680

CTTTCTCAAC CTTTTTAGTG AAAACATCTA GTGTTGAATA ATCATGATAC AAACGATCTT 1740

GCAATGTCTT ATGACCATAA CCTGTATTAT CAACGCGCGC GTCTTTTAcT TCGTTGATAC 1800

CGTCGCCGTT ATGACCTAGT ACCATGTTGC TAAATCGACC GTTTAAATAT GTTAAAAAGT 1860

CAGAGACGTT ACTTGTAACA TTTAAATGTT CATACTTTAT TTGTTCTCCA TCATGTGCGA 1920

ATACCTCTTT ATTTCTGTGG TATTCAAGAG AGAAATTAAA ATCCGTCAGC ATGTCTGAAA 1980

TAAGTTTAAA GTTATACTCA TTTTCATCTA CATATCTGTA GTCAAAGACT CTACTTAAAT 2040

CTGTAATTAG TTTATTACTC ATGTTTTCCT CCTTTACTAT CCATAAAACT GATmATAATT 2100

TTTAATAAGC TCATACATAA TAACTTCATG ACCTCTTTCA TTAGGATGTA ATCCATCAGG 2160

CATGCTAGAT TTTCTAAATG CTGGATTATA TGGTTTGAAA TAATCTGTGT GATAAGCATC 2220

ATATACTGGT ACATCCAATT CACTACAAGC CAATATCTGA GCATTGACAT AATCCTCTAA 2280

AGTTAACCCT AGTTTGTTTT TGTCCGTATC TTTACGGCGT ATCGTTGTAC CACTCATAGG 2340

GCATTGCCTA GTAGCTGTCA TTACAAGTAT TTTTGAAGCT GGATTATTTT TCCTGATAAC 2400

TTCAATTGCA GAACAAAAGG CGCCGTAAAA CGTTTTAGTG TCGGTTTTAT CAGTGCCTAT 2460

CGGTACGCCT GCCCAATAAC CATGTAACCA GTCATCATCT GTACCTTGTA ATATGATTAG 2520

GTCTCCTCTT ATTTGCTCTG CTTGTCTaTA AATGCTGTTT TCTaCCGCTT CTTTACCTAT 2580
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CTTGCCTAAC ATTTCT 2656 
(2) INFORMATION FOR SEQ ID NO: 106: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4854 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 106: 

AAAATGAGGG TTCTAGCGGA AATTACCAAA AGCGTGGTTC ATACTATGGG CAGCGTAATC 60

GTATTTCAAA AGAAAAAACA CCTAAATGGT TAGaAAATAG AGATAAACCT AGTGAAGAAG 120

ATTCGGCTAA AGATAATAGC GTAGATGATC AACAATTAGA GCAAGATCGA CAAGCATTTC 180

TAGATAAATT ATCTAAAAAA TGGGAGGAGG ACAGTCAATA ATGAAGCAAT TTAAAAGTAT 240

AATTAACACG TCGCAGGACT TTGAAAAAAG AATAGAAAAG ATAAAnCAGA AGTAATCAAT 300

GACCCAGATG TTAAGCAATT TTTGGAAGCG CATCGAGCTG AATTmACGAA TGCTATGATT 360

GATGAAGACT TAAATGTGTT ACAAGAGTAT AAAGATCAAC AAAAACATTA TGACGGTCAT 420

AAATTTGCTG ATTGTCCAAA TTTCGTAAAG GGGCATGTGC CTGAGTTATA TGTTGATAAT 480

AACCGAATTA AAATACGCTA TTTACAATGC CCATGTAAAA TCAAGTACGA CGAAGAACGC 540

TTTGAAGCTG AGCTAATTAC ATCTCATCAT ATGCAACGAG ATACTTTAAA TGCCAAATTG 600

AAAGATATTT ATATGAATCA TCGAGACCGT CTTGATGTAG CTATGGCAGC AGATGATATT 660

TGTACAGCAA TAACTAATGG GGAACAAGTG AAAGGCCTTT ACCTTTATGG TCCATTTGGG 720

ACAGGTAAAT CTTTTATTCT AGGTGCAATT GCGAATCAGC TCAAATCTAA GAAGGTACGT 780

TCGACAATTA TTTATTTACC GGAATTTATT AGAACATTAA AAGGTGGCTT TAAAGATGGT 840

TCTTTTGAAA AGAAATTACA TCGCGTAAGA GAAGCAAACA TTTTAATGCT TGATGATATT 900

GGGGCTGAAG AAGTGACTCC ATGGGTGAGA GATGAGGTAA TTGGACCTTT GCTACATTAT 960

CGAATGGTTC ATGAATTACC AACATTCTTT AGTTCTAATT TTGACTATAG TGAATTGGAA 1020

CATCATTTAG CGATGACTCG TGATGGTGAA GAGAAGACTA AAGCAGCACG TATTATTGAA 1080

CGTGTCAAAT CTTTGTCAAC ACCATACTTT TTATCAGGAG AAAATTTCAG AAACAATTGA 1140

ATTTTAAAAT GATTGGTGTA TAATGAATAC AAATCTAAAT CGTTTAAATG ATTGAAGACA 1200

AGATGATCTA ATCAATATTA CACAGAAAGC CATTGTTTGA TGAGAATATG GTTAATAAAT 1260

TAGATGATTA CTACTTCATT TATGGTATTT GTAATGAATA CCCGGATCAA GACCGTTATC 1320
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CTCGTCCCTT GTATAGGGGC GGGATTTTTT GTTTTTTTCA GACATAAATG TTTGTTGGTG 144 0 

TCATAAATTC CCTGTTTATT GTTAATAGGT TTAATGTTAA AACGATGATT GTTGTTCAAT 1500

TTTTTAACGA GGTCAGATAA AAGTATTTAT AAAGCAAATA GGAGGGTTTA ACATGGAACA 1560

AATTAATAIT CAATTTCCAG ATGGTAATAA AAAGGCGTTT GATAAAGGTA CTACTACTGA 1620

AGATATAGCA CAATCAATTA GTCCTGGATT ACGTAAAAAA GCTGTTGCCG GCAAATTTAA 1680

CGGGCAACTT GTAGATTTAA CTAAACCGCT TGAAACTGAT GGATCAATTG AAATTGTGAC 1740

ACCAGGTAGT GAAGAagcGT TAGAGGTATT ACGTCATTCT ACTGCACATT TAATGGCACA 1800

CGCGATTAAA AGGTTATATG GTAATGTTAA ATTTGGTGTA GGTCCTGTAA TAGAAGGTGG 1860

ATTCTACTAT GACTTCGACA TTGACCAAAA CATCTCATCT GATGACTTTG AACAAATTGA 1920

AAAAACAATG AAACAAATCG TTAACGAAAA TATGAAAATC GAACGAAAAG TGGTTTCACG 1980

AGATGAAGTG AAAGAGTTAT TCAGCAATGA TGAATACAAA TTAGAATTAA TCGACGCGAT 2040

TCCTGAAGAT GAAAATGTAA CATTATATAG TCAAGGTGAT TTTACTGATT TATGTCGTGG 2100

AGTTCACGTT CCATCAACAG CTAAAATTAA AGAGTTTAAA CTATTATCTA CAGCAGGTGC 2160

ATACTGGCGT GGAGATAGTA ACAACAAAAT GTTACAACGT ATATACGGTA CTGCTTTCTT 2220

TGATAAAAAA GAATTGAAAG CACATTTACA AATGTTAGAA GAGCGTAAAG AACGTGATCA 2280

XCGTAAAATT GGTAAAGAGT TAGAACTATT CACAAATAGC CAATTAGTTG GTGCTGGTTT 2340

GCCATTATGG TTACCTAACG GTGCAACAAT TAGACGTGAA ATTGAACGTT ACATTGTTGA 2400

TAAAGAAGTT AGCATGGGAT ATGACCACGT TTATACACCA GTACTTGCTA ATGTTGATTT 2460

ATACAAAACA TCTGGTCACT GGGATCACTA TCAAGAAGAT ATGTTCCCAC CAATGGAGTT 2520

AGATGAAACT GAATCTATGG TATTACGTCC AATGAACTGT CCACATCATA TGATGATTTA 2580

TGCGAATAAA CCACATTCAT ATCGTGAATT ACCTATCCGT ATCGCTGAGC TAGGAACGAT 2640

GCATAGATAT GAAGCAAGTG GTGCTGTATC AGGATTACAA CGTGTTCGTG GTATGACTTT 2700

AAATGATTCA CATATCTTTG TTCGACCTGA TCAAATTAAA GAAGAATTCA AACGCGTTGT 2760

AAACATGATT ATTGATGTGT ATAAAGACTT TGGTTTCGAG GATTATAGCT TTAGATTAAG 2820

TTATAGAGAC CCTGAAGATA AAGAAAAGTA CTTTGATGAT GATGATATGT GGAATAAAGC 2880

TGAAAATATG CTTAAAGAGG CAGCGGATGA GCTTGGCTTA TCGTACGAnG AAgCGATTGG 2940

TGAAgCGGCA TTCTATGGTC CGAAACTAGA TGTTCAAGTT AAAACAGCGA TGGGTAAAGA 3000

AGAGACATTA TCAACAGCAC AACTTGATTT CTTATTACCA GAACGTTTTG ATTTAACTTA 3060

TATTGGTCAA GATGGTGAAC ATCATCGTCC AGTTGTTATT CATCGTGGTG TTGTATCAAC 3120
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AGCGCCAAAA CAAGTTCAAA TCATTCCAGT TAACGTTGAT TTACATTATG ATTATGCGCG 324 0 

CCAATTACAA GATGAATTGA AATCTCAAGG CGTTCGTGTA AGTATTGATG ACCGTAATGA 3300

AAAAATGGGT TATAAAATCA GAGAAGCTCA AATGCAAAAA ATACCTTATC AAATCGTAGT 3360

TGGGGATAAG GAAGTTGAAA ATAATCAAGT GAATGTGCGT CAATATGGAT CGCAAGACCA 3420

AGAAACAGTT GAAAAAGATG AATTTATCTG GAATCTAGTT GATGAAATTC GTTTGAAAAA 3480

ACATAGATAG ACAGTTGTCG CAATAAAATG CTTTAAAACT TTTATTGCGT ATCAAGTTTT 3540

ACAGGGTTGA TTATGCGTGA TGAATCCTGT ATATTACAAG TTAGTTAAAA TATTAAATTG 3600

AGTTAGAGGT TGCATGTTTA ATTAGTAACT TGTCAGAAGT ATTTATGGTA CATAAGTTGA 3660

ACAAGTGAAA GGTAAAGATG CCGAAATAGA TATAAACCAT AAATTATATC TATTGGGACA 3720

GTTTTCGAAT AGGAACTGTA CTGTCACAGA ATGTGATGTG CTACCTTATA TAGATAATTG 3780

CCAAAGTGGT TGCATATCTT AAAGGTATGT AGCCACTTTT TTACTTTTAA TATCACTATG 3840

TTCTGTAAAA AAGGGTATGA AAGTGAATAA AGGTTATTTA TTTCTTGGCC TCTAAAACAT 3900

GGAAAGGGAG CTTATATGTC AAAAGTTCAA AATGAAAGTA ACAATGTTGT CAAAAGGGGA 3960

CTTAAAGATC GTCATATTTC TATGATTGCG ATTGGGGGTT GTATTGGTAC AGGTTTATTT 4020

GTAACTTCTG GTGGAGCAAT TCATGATGCA GGTGCTTTGG GTGCATTAAT AGGATACGCA 4080

ATTATCGGAA TAATGGTATT TTTCTTAATG ACGTCACTTG GCGAAATGGC TACGTATTTG 4140

CCAGTATCAG GTTCATTTAG TACATATGCT ACAAGATTTG TTGATCCATC TTTAGGGTTT 4200

GCGCTTGGTT GGAACTATTG GTTTAACTGG GTAGTGACTG TAGCAGCAGA TATTACGATT 4260

GCAGCACAAG TCATTCAATA TTGGACACCA TTGCAAGGCA TACCCGCTTG GGCATGGAGT 4320

GCGTTGTTCT TAGTTATAAT TTTTAGTCTG AATTCGTTAT CAGTTCGCGT CTATGGTGAA 4380

AGT(3AATACT GGTTGGCATT GATAAAAGTG GTTACAGTTA TTGTTTTCAT TGCAATTGGT 4440

TTATTAACGA TTGTCGGAAT CATGGGTGGT CATGTTGTAG GATTCGAAAT ATTTAATAAA 4500

GGTGAAGGTC CAATTCTTGG TGGCAACTTA GGAGGAAGTT TGTTATCAAT TCTAGGTGTA 4560

TTCTTAATCG CTGGTTTCTC ATTCCAAGGT ACTGAGTTAA TTGGTATTAC GGCTGGTGAA 4620

TCAGAAAATC CTGAACGTGC TGTGCCGAAA GCAATTAAAC AAGTATTCTG GAGAATTTTA 4680

TTATTTTACA TTTTAGCCAT TTTTGTTATC GGTATGTTAA TTCCTTATGA TAGTAGTGCA 4740

TTAATGGGGG GTAGTGATAA TGTAGCAACG TCTCCATTCA CATTAGTGTT TAAAAATGCT 4800

GGATTTGCGT TTGCAGCATC ATTTATGAAT GCAGTCATTT TAACGTCTGT GTTA 4854
(2) INFORMATION FOR SEQ ID NO: 107: 
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(A) LENGTH: 2488 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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(xi) 


SEQUENCE DESCRIPTION: SEQ ID NO: 107:






ATCAAAAATT GATTGTTTTC nATTTTTTGT TTCAGCGCGG GATCTTTTAC GTCTTTTGTG 60

AAAACGaTTT TATTATTAAC TACTTTTACT GGATAACTTT TGTATGTCGA GTCAGTAGCA 120

TTTTTTCTAT CGTTTGTAGT TGTGTCATAT TCACCAgTTA TTTTATGTGT GTTCTTATCT 180

ACCTTTAACA ACATACGGTC TTCTTTTAAA AGCTCATCTG ATCCAACAAC TGAATAAOAG 240


GATTCTATAT 


ACCATGTGTC TTGATCATTA 


ill ILnlnni 


VjrVjrVyVo/il 1 Al U 




300 


ATTTCATAAA 


GCGTTTCTAA GTTTTTAATA 


GGATACGTAC 


1 LJ\\j 1AL 111 


TTTAAGACCA 


360 


TCTTTCAAAT 


GAATTTGTTC CCACTTCATT 


GCCAAAAACA 


lr> 1 \-\J\-.\~J\\*- 1 




420 


GAAATAATAA 


TAATTGCTGC TAAGTTTAAC 


CAGAAAATTT 


lAluluLl 1 1 


r* ATAr'A TTrr 


480 


CACCGTTTCT 


CAAAATACTT CATTAACACT 


ATAATAATAT 


ATTTTYI A A A A 


A HP A T^HPHP A ("* A T 


54 0 


CAGTATTAAA 


GTGAATATCA AATTTTAAAT 


TTATGAAAAT 


AAT*ARATATT 
™i x x n i. x 


TATAAAAAGC 


600 


GGAAAAGAGA 


TACAATAAAA AACTGCATGA 


CGTTTGAGAC 


GTCAC!APAGT 


(TTAAfTAAAA 


660 


ATTTAAAAAG 


TTGTTGCTAA TTTTTCAGCA 


TTATTAATAC 


X X X VJVv XXX 


A ATTTPTTP A 
■e\r\x x x \m 1 1 v»n 


720 


GTCTTATGAG GTTCAGCATT GTGTCCTTCA ATAATGATTG TTTCATATGA TGGCACACCT 780

AAGAATGTCA TAATTGTTCT TAAATAACGG TCACCCATTT CAAAATCAGC AGCAGGTCCT 840

TCAGTATAAT ATCCACCACG TGATTGAATG TGTAATACTT TTTTGTCAGT TAGTAAACCT 900

TGTGGTCCTT CAGCAGAATA TTTAAAAGTT TTACCTGCAA TTGAAATAGC ATCAATATAT 960

GCTTTAACTA CAGGTGGGAA AGAAAGGTTC CACATAGGCG TTACAAATAC ATATTTATCT 1020

GCACTTAAAA ATTCTTCTAA AATGTCACTC AATCTTGAAA CTTTCATTTG TTCATCATCA 1080

GTTAACGTTT CGCCATTACT CATTTTTCCC CAACCAGTTA ATACATCTTT GTCAATAACT 1140

GGAATATAAG TTTCArATAA ATCAATATGT TTCACTTCAT CATCAGGATG TTGTTGTTGA 1200

TATGTTTCGA TAAATGCTTT ACCAGCCGCC ATAGAATTTG ATACCAGTTC ATTAAAAGGG 1260

TGTGCTGTAA TATATAATAC TTTTGCCATT TGAAAATTCT CCTCTGkTTC TGTTATTTTC 1320

TTAAGTATAA TTATTATACT CGATATAAAA TTTAATATCA ATCAAAATAT TCAAATTACC 1380

ATCATTTTCT TCATCTATAT nTGGCAGTAC TACTAAAGTA TGAGTGCATT TAATTATGAa 1440

ATAGTTGATT TaGAATAtAT ACTTAATACC CAAAATATAT GAAGGATGGA TGCCACTATG 1500
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ATTATTTATA TAGATGACAT TCAAAAATGG TTTAACCAAT ATACCGATAA ATTGACACAA 1620 

AATCATAAAG GACAAGGACA CTCAAAATGG GAAGACTTTT TTAGAGGGAG TCGGATTACT 1680 

GAGACTTTTG GTAAATATCA ACATTCACCA TTTGATGGTA AGCATTATGG CATTGATTTT 1740 

GCATTGCCAA AAGGTACACC AATTAAAGCG CCGACGAATG GTAAAGTAAC ACGTATCTTT 1800 

AATAATGAAT TGGGCGGCAA GGTATTACAG ATTGCCGAAG ACAATGGAGA ATATCACCAG 1860 

TGGTATCTAC ACTTAGACAA ATATAATGTC AAAGTAGGTG ATCGAGTCAA AGCAGGTGAT 1920 

ATTATTGCAT ATTCAGGCAA TACAGGTATA CAAACGACAG GCGCACATTT ACATTTTCAA 1980 

AGAATGAAGG GTGGCGTAGG TAATGCATAT GCAGAAGATC CAAAACCGTT TATCGATCAG 2040

TTACCTGATG GGGAACGTAG CCTATATGAT TTGTAGTTAT AGAAGGGTGC CCGCAGTCTA 2100 

AAAAATTAAG CAATCATTGT GTGAGTATGA TACTTACATA ATGGTTGCTT TTTTCAATGA 2160 

AAATCGTAAT GCTAAGTCAT ACTTGTTTGA TTTAGATATT ACTTAAAATG TAAGACAAGG 2220 

TTGTTAGCAT TGGCAGTGAA ATATCGCACA TAAAAAACAT TATTGTCACA CTAGAAAATA 2280 

GTTGTGCACT ATATCAATTT TCTGTATAAA AGTTTAATTC TGACAGTAAT GTAAACGTTT 2340

ACAATTTATG ATTGACATTA ATAATGACTG AATATATGAT TTATGTAAGT ATTTGTGCAA 2400

CGTTTTCACA AAGTGTATTG CACaAyCAAA CTGtAAACaA aGTATGGGGg GCCATAACAT 2460

GGCAGAACTA AGTTAGAGCn TATTAAAA 2488
(2) INFORMATION FOR SEQ ID NO: 108: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4093 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 108:

TTTTCTTTAT TTCAAmCTGT ATATTaATGA TGTCACTTCA TTTGATACGA TTCTTGATAA 60 

CCTATTCAAA ATTCCGCCAA ATAACATAAA TATTATATAA ATGCCGATAC TTTTAATCAT 120 

TTTCTACTTT TTCTTCGATA CGGAAACTTG TTTTCGAATT GAACACTTCA CCAGCTTTTA 180

AAATTGACGG TGCTTTTTCA CCATATAAAT TAATATCATT TGGTAAAAAT TGTGTTTCTA 240

ATGTAAAGCC AGAATGTGGT TTATAAATAT TAAATGGACT ATCCCACTCA TCAGGCTGGT 300

TAAAAGTAAA GAACACAACA TGAGGCATAT CTGTATCGAC CTCTAACATA AATTCATGAT 360 

TTTCAACATA CATTTTATGT TCACCAACTG TAAATGGGTG ATOGAGACCA CCAAAACGTG 420 
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TATCTTCAAA 


CACTTCATGT AAATCTAGAA TATCACCTGT AACAATATTT CGCTCATCTA 540

ATACATACAT ATCTAATTGA TTACTTGAAA TGCGATGATT ATCAACGACA TTATTATCTC 600

GATTCAAATT GAAGTACACA TGATTCGTAG GACTAAACAA TGTGTCTTCT GATGCAACTG 660

CTTCGTATTC AATCGACCAT TGGTGATCCG CATCATAAAT ATGTGTAATC GTCACATCGA 720

TATCACCCGG GAAATGATCA TCAGCTGATT TCAACACCGT CTTAAATATA ACTTTAATTT 780

GAGCAATTTC ATTTCTAATT TCATAATCAA ATAACTTATT GTCCAAACCA TGACATCCAC 840

CATGTAAATG ATGTTCACCG TTGTTTTTTT CTAACTGATA TTCTTTACCT TTCAACTTAA 900

ATTTAGCATT ATCAATTCTA CCGCTATATC TTCCTATAGA AGCACCAAAT TTAAAAGGAT 960

TACTATGATa AAATTCATCC GCTTCAACAA CATTTCCAAG AACAATATTA TTATCATGAT 1020

ATTTCCAAGA CACTACTCTT GCTCCATAAT TCGTAAAAAT AATTTTAGTT TCATCATTAT 1080

CAATTTTGAT TAAATCTACA CCTTGTCTTT GGTGCTCAAC TTCAACTATC ATTTTTACTT 1140

CTCCCTTCTA ACCACAAGTG TTCAAGCTCT GCTGGGTAGC AACATTACTA AAACACCTAC 1200

AATACAAATG ATTGCACCGA TAACATCATA TTTATCTGGC ATTTGTTTAT CTACGACCAT 1260

CGCAAAAATC AAACTCATGA TGATAAATAC GCCACCATAT GCTGCATATA CTCTTCCGAA 1320

TGATGGAAAT GATTGAAATG TCGCAATGAC ACCATATAAC ATGAGTATCG CACCGCCTAT 1380

TAGCCCAACA AGTGAAGACT GTCCTTCCCT AAGCCACAGC CAAATCAGGT ATCCCCCACC 1440

TATTTCACAT AAGCCAGCTA ATATAAATAT AAAAATCGGA TATAACATGA AATCACTCCA 1500

TCACACATTT GCTATCAATA ATCTATCGGC TACATATCAT TTGTTTACAT TTCTTCTTAC 1560

TTCACATTCC CATTTTAAAA AGTTCGTTTT CACATTCATA TTGTACACTT TTTTAGACAT 1620

TATTCTATAG CTAAATATAA AAAAATAAGA GTAACACGCT TTCATCATCA TTTTATATGA 1680

TAAATGTGTG TCACTCTCAT CAATTTTATT TTTTAAATAC ACGTTTCATT GAATTAAATA 1740

AGCCACGTTC AAATGTAAGT ACTGAATCTT TATATGTTTT AATTGCAATC CATATCAAGA 1800

CAGCTACCAT TACAATTGAG ATTAAAGAAC TTAAGATGAC CTCATATATT TGAAGCCCTG 1860

AAGTTTGAGC GCGTACAACT AATTGAAATG GCGCTAAAAA CGGAATATAA CTTGTGATTA 1920

AAGCAAGTTG TCCATCAGGA TTATTTATCG TGAATATCGC GATATAAAAT GCAATCATAC 1980

CAAGTAATGT CAGTGGCATC AAAGATTGAT TTAAATCTTC TATTCTAGAT GTTAATGATC 2040

CGAGGATGGC TGCAAGTAAT ACATACGCCG TAATTCCAAC AATACTACTT ATAATTCCGA 2100

CAATAATAAT TTGCCAAGAC AATTGATTCA TTTCCACGTT AAAACCTTGT AGCAAGTCTT 2160

TTAAGTCAAA GGCAAAAATG CATATAACTG CCATCAATAC AATTAAAATA ATCTGAGTCA 2220
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TAATAATCAT 


TTCAATGACA CGCGATGTTT TCTCACTAGC AATTTCCATA GCTATTTGAG 2340

ATGCATAATT TAAAACAATG AAGAACATTA GAAAGATAAT GCCATmaGcT AAAGCATAGT 2400

TGAAAATCTT TTGTCCTTCT GATACTTTAT CGACTTCATC ATTAGAAATC ACCTTATTAT 2460

CAACTTTACT TTGTGCTTGT AATTTTTGTA AGTCTTCTTT GTTGATATTT AATTCCCCGG 2520

CTACCATATT TGTTTGAATA GCTGTAAGCA GTGCTTGTAC TTTTTGTGAA TCTTCATGAC 2580

TTACTCGCTT CTCACTAATG ATTGTCCCTT GTAACGTGCG ATTTTGATTC ACCTTGATAA 2640

TATAAGCTTT ATCAAGTTTA TGTTTTTTTA CTTCTTTTTC AGCATCTTCT ATAGAAACTT 2700

TAGTAAACTT AGCATCACTA TGAAATGTAT TCGCCTGTTG CTTGAAAACC TTATAGATTT 2760

GTTCATTCGG TGCTGCTACA CCAATTTTAT CTGGACCATC ATCAAACATG TTAATAATCT 2820

TATCAATGTT AGATAGGCCA ATCATTAAGG CAGCAATAAT AATCATAAAA ATTACAAATG 2880

ATTTAGCTTT AATTTTTTTG ATATATGTCA AAGTAAATGT CGCCCAAAAC TTATGCATCC 2940

TTGCCACCAA CCTTCTCAAT GAATATATCT TGTAATGATG GTTCTACAAC TTGGAATCGT 3000

TTAACATAAC CTTGATGTGC CACAACTTGA TAAATATCTT TGGCTACGTC TTCATTCTCA 3060

ATCGTCAACT GAAGACCTTG CTTCATGTTT TCACTATGAA TGATGCCTCT AATGTTTGTT 3120

AAATCTGGTA GTGTTGTTTC TGATTCAATG ACAACTTTCT TGTTACCATT AGATGCACGT 3180

ACATGATTGA TATCACCAGA AACAACAAGT TGACCTTTAT CTAAAATACA AACATCATCA 3240

CATAATTCTT CAACATGCTC CATACGGTGA GAACTATAAA CGATTGTACT GCCCCAATCA 3300

TTTAAGTCTT TAACTGCTTc TTTTAATAAC TCAACATTAA CTGGGTCTAG ACCACTGAAA 3360

GGCTCATCTA ATATTAGTAA TTCTGGTTTA TGTAACATAC TTGCTAACAG CTGAATTTTT 3420

TGTTGATTCC CTTTTGATAG ACTATCAATT CGTTTTTTGC GGTTTTCAGT AATATCAAAA 3480

CGCTCAAGCC AATACGATAT TTGCTGTTGT ATTTCTGTTT TTGACATTCC CTTTAAAGTT 3540

GCCAAATATT TCAATTCTTC TTCAACTGTC AATTTCCCAT GTAAACCGCG TTCTTCCGGT 3600

AAATAACCAA TACGATTGTA CATTGTTTTA TCTAGTTTTT TACCGTTATA CGTrrTGTGT 3660

CCTTCAGTTG GTTCACTTAA GCCTAAAATC ATACGAAATG TCGTTGTTTT ACmTGCACCA 3720

TTTCTTCCTA GAAAACCTAA CATTTTACCT GATTCTAACT TTAATGAAAT ATCATTTACT 3780

GCCGTCATCT TGCCAAAACG TTTCGTAACA TGTTCAATTA CAAGTCCCAT ACTTTGCCTC 3840

CTAAAAAnAT ATGTATTTAT CTTAATATAA CATTTCCATT CTCTATAAAT GCAATATTTT 3900

TAAAATGAAT TTATTTTTAA AATTTCTGAA ATTGAAAAAT TTAAATAGTG CCATTTTTGC 3960

ATGTTAAGTA TCATTAGCAC TAGATATGTT TTTTCCATGC CTTTATTGCC TTATTTGTAA 4020
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CTTnC CGGTG TTT 4 093 

(2) INFORMATION FOR SEQ ID NO: 109: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17846 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 109:

TGCCAAACTA CCTTTTGACA GTCGTTGCTG TACTTCAGGA TGATCAATCA CATATnTTAC 60

TTTATCAAAT AGGGCATCTT CATCATTTTT AGTAATTAAA TAACCATTGA AATCTGAAGT 120 

AATCAGTTCG TTAGGTCCAT ATTTAATATC ATAACTAATA ACTGGAACAC CATGTGCTAA 180 

AGATTCAAGT AGCGCTAAAG AGAAACCTTC CATGTTACTT GTTATTAAAC TCAAATAGGC 240

ATCGCTATAT TCTTGGTCTA GATTGCTTAA AAAGCCGCGT AAGTAAACAT GATTTTCCAA 300 

TCCATATTTT TGTATCAATT CATTTAATTT TTTACTTTCA GAaCCAAAAC CATACATATG 360 

AaGCTCTATT TTTGGGACAT ACGATACTAA GCGTTTAATT AATTCAATTT GTTGATGTAA 420 

TTGTTTTTCA GGTGAATAAC GAGCAACGGA AATTAATTTA ACACTGCGCT GATCTAATGT 480 

TTGGACTGGT GTATCAATTG TTTCACTATA GCCGACAGGA ATATTAACAA CTGGAATAGT 540 

ATGGTTAATA CGTTTTTCAA CATCTAATTT TTGCTGCTCA GTAGAAACGA TAATTGCACG 600 

ATATCGAGAT AAATTTTCAA ACATCGCTTT ATATACATTT TTAAATGGCG ATGAATCTAA 660

TGCATCAATA TTTTTAATGT GTGTACTGTG AAGCACAGCT ACTACTGGGA TTGACTCAGG 720

CGTTAAGTTG AAAATAGGTG CTGTGTACAC ATTACGATCA CTGAAAAATA AATCCCCATG 780 

TTGATATAGT TGTTTAATGA AAAATGCGCC TAATTCCGTT TCATTATTAA AGAAATATTG 840

TTTGTTAGCA TAGTAAACAA TAATTTTTTG TACTTCTGGT TTGCCATCCT TGTAAGAAAA 900 

ATACTTTTCT AATTTTGTGT CACCTTCTGG ATTATAGAAA AATTCACATA ATGTTTGTTG 960 

TTTATCAACA AGAATCCTAC TACAACTTAA AAAGCCACGC ACATCATAAA AATCACGTTT 1020 

TACTTtTCGT CTTTGACTAT CAAAATGATT TACATAATCT AATATACGAT ATTTAGGATC 1080 

TTGAAAATGG GCATACATTA AGAAACGCTC TTGATCATAT ATTCTAAAGT CATGACTATT 1140 

TTCAACATGT TTTAAAGTAT AATGACATTC ATCAGTCCAA TACGACAACC AGTCAAATGG 1200

TTCATTGCGT TCTAAATATG TTGCTTCTTG GAAGAAATCA TACATATTAA TATAGTCAGA 1260 

ACTAGTAATA TAATTTTGGG CATTTCTATA TAAATATCTA TTCCATGACA GAAATACACA 1320 
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CCCAGTTAAA TTAACACCTA AACTATTACC TACAAAATAA TTCATTTACA ACACCACTTA 1440 

TATCTATTTT TTATAATTAT ATCACATAAT ATTTAATTAC TTCTTTTAAC TGGAAGATGT 1500

GTTTATTTAT AAAACAACAA ATTTTGATAT TTATAATGAT AGTAGTTATT CAATCACTAC 1560

GACCcAATAT ATCATkGTAG AGCTTAGGAT ATTGATTTAT GACTCAGGCA CATCAAATGa 1620 

GAgGATTTAT AAArGAGATA TACAACTCTA GAAGGTATAA TAAAAACGCG CAACTAATGT 1680 


TACGCGTTTG AATTAATCAT ATGATATTAT TTGCGATACT TTAATTTAGC GAAAgcATCA 1740 

TGTTGATGGA TAGACTCTTC ATTACGACAT TCGATATCGA AACCGTCTAA CCAATCAAAT 1800 

TCAACTAAGT CCGCGGCAAT TAAACGAATT AAGTCTTCGA CAAAACGTGG ATTTTCATAT 1860

GCACGCTCTG TCACACGTTT TTCATCAGGA CGTTTTAAAA TAGGGTATAG AATTGAACTT 1920 

GCATTAGCTT CCATTGCATC TAAAATTTTA TTTTTATAGT CATCAACTAT GTCTTGATCT 1980 

TTATTAATAT ATGTTTTAAC AGTGACAACA CCACGTTGGT TGTGCGCTGA ATACTCACTT 2040

ATTTCTTTTG AACAAGGGCA TAGCGTTGTG ACAGTTGCTT CAATAGTAAG TTCTTTACGT 2100 

GTAnCTTTAT CACCGTCAAT TGCTAATCCA TAAGTGACAT CGGCATTACC AACTGCTTTA 2160 


ATATTTGTGG TTGGACTATA GCGATCAAAG AACCATTTCC CAGAAACATC AACGCCTGCC 2220 

GCATTTTGTT TCATATTCGT TTGTAAAGTG CGTAACACCT GATAAAGTGT ATTAAATTCA 2280 

AGTTCAATAC CATTATCATA GTGCTTTTCA ACACTTTCGA TTATACGGCT CATATTAATA 2340

CCTTTTTCGT CTTTTGTTAA ACTTGTTGAA AAACTAAATG TGCCAGCTGT TTGATACTGG 2400

TCAACAAGTA CAGGGTACAC TAAGTTTTTA ATACCAACTT CTTCTATTTC AAATAAAAAA 2460

TCTTTATGTG TACTTTGTAA ATCTGTCATT TCGTTCTTAG TAGTAGGTTT CGTGCCTTCA 2520

ATAGGATCTA CGGAACCAAA GTGTTTCCAA CGACCTTCTC GTGTCGATAA ATCAAATTCA 2580 

GTCATTTTTT TCCTCCGTTA AGATTTAAAG TGATATGTCC AATATGGTTC GACTGTTAAA 2640 

AAGCTGTGTT GTTTACCATC GATTTCAGGA CTTGCTAATT GTTTTAAAAA TGGACCTGTT 2700

TGAGAAGCAT GTGCTTCAAA TGCCTTAATT TTAAGTTCTT TAAAATCTGT AATATCATTT 2760 

TGAATATCAG GTTCTCCAAG AGCTTCGGTT GCATCATTAC TGAACGCAAC TAAAGTTAAA 2820 


CGAGGGCGTT CTTCTTTAGG CATGCGTTCA ACCGTTCGAA TTACAGCGTC TGCTGTTGCT 2880 

TCGTGATCAG GATGTACTGC ATATCCAGGA TAAAATGAAA TAATCAATGA TGGATTTGTA 2940

TCATCGATTA AAGATTTAAT CATACCATCT ATATGTTCAT AGGGTTCAAA TTCGACAGTT 3000

TTGTCACGTA AACCCATTTT TCTTAAATCA GTAATACCGA TAACTTTACA AGCTTCTTCT 3060

AGTTCACGCT CACGAATACT TGGTAATGAT TCGCGTGTTG CAAATGGGGG ATTACCTAAA 3120 
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TAATTTGCTA ATGTGCCTGC AGATGAGAAG GTTTCATCAT CAGGATGTGG AAATATTACT 324 0 

AATACATGTC TTTCGTCAGT CATGTTGATG CCTCCTCTAT AAATTAAATG GTCGCTCACT 3300 


AATTTGAAGT GCTGCAGCGA GTTGACCTTC GTAATTAAAA CCTGCAATTA AAAATTCATC 3360 

ATGCTCATTG ACCTCAAAAT GCGTTAGACC TTGTACATAA ACCCAACCAC CATTTGATAG 3420 

TTTAAGACCA ATGCGATAAG GTTCTTTATT ACCACCTTTT AGTTGTGCAT GCGTATATGT 3480 

TATTTGTATG TTTCTTAAAA AAGTACCAGC ATTAAAAACA CGTTGATCGA AATGGTTCGC 3540

ATAGGCCCCA TTTGTCGTTT CAACATGCAG ATACACAGGT TTATGTTCAA AAGAAGCAAG 3600

TAAATCTATA ACTTCTTGTT CTTTAATTGG TTCCAACACG TTCACTCCTT ACACTATCAA 3660

TGTGTTTATC TTTCTATTTT ACTAAAAACT ATTCGATAAT TGTATACGAT TGCTCAATTA 3720 

TTTATAAATT AATTTTCATG AAGGGTAATT ACTCAGGATT ACGTAATCAT ACAGCATTAG 3780 

TTTTTTACTT TTAAAAATCA AAAATTTGTT GGAATTTGAA AAGTGTTAAA CATTAAAAAT 3840

GATGCTATAT TAATGGTGTA TGAATGAATT CATAAGTTTT TAAAATGTAT TAAATTTGTG 3900 

GAGGCATGTA AACAATGAAA GTATTAAACT TAGGATCGAA AAAACAAGCA TCATTCTATG 3960 


TTGCATGTGA GTTATATAAA GAGATGGCAT TTAATCAGCA CTGTAAACTA GGTTTAGCAA 4020 

CTGGTGGTAC AATGACAGAT TTGTATGAAC AACTTGTTAA GTTGTTAAAT AAAAATCAGT 4080 

TAAACGTAGA CAATGTATCC ACGTTTAATT TAGACGAATA TGTAGGTTTA ACCGCATCAC 4140 

ATCCGCAAAG TTATCACTAT TATATGGATG ACATGCTTTT CAAACAATAT CCTTATTTTA 4200

ATAGAAAGAA CATTCATATT CCAAATGGAG ATGCCGATGA TATGAATGCG GAAGCGTgCA 4260

AAATATAATG ACGTTTTAGA ACAACAAGGT CAACGTGATA TTCAAATTTT AGGTATTGGT 4320

GAAAATGGTC ATATTGGATT TAATGAACCT GGTACGCCGT TTGATAGCGT TACTCATATC 4380 

GTTGffTTTGA CTGAAaGTAC TATTAAGGCT AATAGTCGAT ATTTTAAAAA CGAaGATGAT 4440

GTTCCAAAGC AAGCCATTTC GATGGGACTT GCTAATATTC TTCAAGCCAA ACGTATCATT 4500

TTACTCGCAT TTGGTGAAAA GAAACGTGCT GCTATTACAC ATTTATTAAA TCAGGAAATT 4560 

TCTGTTGATG TTCCAGCCAC ATTACTTCAC AAACACCCGA ATGTTGAGAT ATATTTAGAC 4620 


GACGAAGCTT GCCCGAAAAA TGTTGCGAAA ATTCATGTCG ATGAAATGGA TTGATTGCAA 4680 

TGTTTAATTA AGAAATGCCT CGGGAAAGGT TCCAATAGAA AGATAAAAAG CATTGGAAGG 4740

ATGATTTTTA GTGGAATTAC AATTAGCAAT TGATTTATTA AACAAAGAAG ACGCGGCTGA 4800

GTTAGCAAAT AAAGTAAAAG ATTATGTAGA TATCGTAGAA ATCGGTACGC CAATCATTTA 4860

CAAGGaAGGT TTACCAGCAG TTAAACATAT GGCAGACAAC ATTAGTAATG TAAAAGTATT 4920 
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CGCGGATGTA ATTACAATAC TAGGTGTTGC AGAAGATGCA TCAATTAAAG CAGCTATTGA 5040 

AGAAGCTCAT AAAAATAATA AACAATTACT AGTTGATATG ATTGCTGTTC AAGATTTAGA 5100 

AAAACGTGCA AAAGAACTAG ATGAAATGGG TGCTGATTAT ATTGCAGTAC ACACTGGTTA 5160

TGATTTACAA GCAGAAGGGC AATCACCATT AGAAAGTTTA AGAACCGTTA AATCTGTTAT 5220 

TAAAAATTCT AAAGTTGCAG TAGCAGGTGG AATTAAACCA GATACAATTA AAGATATTGT 5280 


CGCTGAAAGT CCTGATCTTG TTATTGTTGG TGGCGGAATC GCAAATGCAG ATGATCCAGT 5340 

AGAAGCTGCG AAACAATGTC GCGCTGCAAT CGAAGGTAAG TAATATGGCT AAATTTAGTG 5400 

ACTATCAATT AATTCTAGAT GAATTAAAGA TGACTTTGTC ACATGTTGAA GCGGATGAGT 5460

TTTCAACTTT TGCATCCAAA ATACTACATG CTGAACATAT ATTTGTAGCT GGCAAAGGAC 5520 

GTTCAGGATT CGTGGCGAAT AGTTTTGCAA TGCGCTTAAA TCAGCTCGGC AAACAGGCAC 5580 

ATGTTGTTGG AGAATCAACG ACACCTGCGA TTAAGTCGAA TGATGTATTT GTAATTATCT 5640

CTGGTTCAGG TTCCACGGAA CATTTAAGAT TATTAGCAGA CAAAGCAAAA TCAGTAGGTG 5700 

CTGACATCGT ATTAATTACT ACAAATAAAG ATTCTGCAAT AGGCAATCTA GCTGGGACGA 5760 


ACATCGTTTT GCCTGCAGGT ACAAAATATG ATGAACAAGG CTCGGCACAA CCATTAGGAA 5820 

GTTTGTTTGA ACAAGCATCT CAATTATTTT TAGATAGTGT TGTAATGGGA TTGATGACTG 5880 

AAATGAATGT TACGGAACAA ACGATGCAAC AAAATCATGC TAATTTAGAA TAAAATAAAG 5940 

ATAGTCGATA ATATGATGCC TAGGCAGAAA TATTATCGAT TATTTTTTTA TTTAAATAAT 6000

AAATTATAGT ATAATATCAA TAATAAACGA ATAGGGGTGT TAATATTGAA GTTTGACAAT 6060

TATATTTTTG ATTTTGATGG TACGTTGGCA GACACGAAAA AATGTGGTGA AGTAGCAACA 6120

CAAAGTGCAT TTAAAGCATG TGGCTTAACG GAACCATCAT CTAAAGAAAT AACGCATTAT 6180 

ATGGSAATAC CTATTGAAGA ATCATTTTTA AAATTAGCAG ACCGACCATT AGATGAAGCA 6240 

GCATTAGCAA AGTTAATCGA TACATTTAGA CATACATATC AATCTATTGA AAAGGACTAT 6300

ATTTATGAAT TTGCGGGTAT AACTGAAGCC ATTACAAGTT TGTATAACCA AGGGAAAAAA 6360 

CTTTTCGTGG TGTCTAGTAA GAAGAGTGAT GTATTAGAAA GAAATTTATC GGCTATTGGA 6420 

TTAAATCACT TGATTACCGA AGCTGTTGGA TCCGATCAAG TAAGTGCATA TAAACCAAAT 6480

CCTGAAGGCA TACACACAAT TGTGCAACGC TACAATTTAA ATAGCCAACA AACGGTGTAT 6540

ATTGGTGATT CAACGTTTGA TGTTGAGATG GCACAACGTG CTGGTATGCA ATCTGCAGCT 6600 


GTCACTTGGG GTGCACATGA TGCAAGGTCA TTACTTCATT CAAATCCGGA TTTTATTATT 6660 

AATGATCCAT CAGAAATTAA TACCGTATTA TAAAACTTGT TAAAACAGAG AATACCATGG 6720 
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10 






ATTTAAAATA AATATTTATT AAACATTATG AATTTTTAAA GAGTAATGTC TGACTCGTTG 6840 

ATAATTTATT TTTGTAAAAA TAAATTAAAG TAATGACAAA GTTATTGAAG TAAATTGAGT 6900 

ATAAACATTT AAATACGATG TCGAAAATGG CGATAGCATA TCACTTACAT GAAGTTGTGT 6960 

GctATCGCTA TTTTTAGTTA TAATTCCAAA AAGTTAATCG TTCGATGATT TAAGAATTAT 7020 

TATTGTTTAA TTCAAATGTA TGAGGGTATA AAATCATTGA ATTTAATTCG ATAAAGCGAA 7080 

ATTTTTGAAC AAACATACTT TTGTATTTAT ATAAAAGTTT AAATTCTTAT AAATTTGACA 7140 

AAACTAATTA ACTCCGTATA ATTATGAAAC ATACAAGAGG GAGTGTATGA ATTCATGGAT 7200 

TTTAATAAAG AGAATATTAA CATGGTGGAT GCAAAGAAAG CTAAAAAAAC CGTTGTTGCA 7260 

ACCGGTATCG GTAATGCAAT GGAATGGTTC GATTTTGGTG TCTATGCATA TAcAACTGCG 7320 

TACATTGGAG CGAACTTCTT CTCTCCAGTA GAGAATGCAG ACATTCGACA AATGTTGACT 7380 

TTCGCAGCAT TAGCCATTGC GTTTTTATTA AGACCAATTG GTGGTGTCGT ATTTGGTATT 7440

ATTGGTGACA AATATGGACG TAAAGTTGTA TTAACATCTA CAATTATTTT AATGGCATTT 7500 

TCAACATTAA CCATTGGATT ATTGCCAAGC TATGATCAAA TTGGACTTTG GGCACCAATA 7560 
CTATTATTGC TTGCAAGAGT ACTACAAGGG TTTTCAACAG GTGGAGAGTA TGCGGGGGCA 7620

ATGACATATG TTGCCGAATC ATCTCCAGAT AAGCGTCGTA ACTCATTAGG TAGTGGACTA 7680 

GAAATTGGGA CATTATCAGG TTACATAGCT GCTTCAATTA TGATTGCTGT ATTAACATTC 7740

TTTTTAACAG ATGAACAAAT GGCATCATTT GGTTGGAGAA TCCCATTCTT ACTCGGTTTA 7800

TTCCTAGGAT TATTCGGCTT ATATTTACGT CGTAAGCTGG AAGAATCACC AGTTTTCGAA 7860 

AATGATGTTG CAACACAACC AGAAAGAGAT AACATTAACT TTTTACAAAT CATCAGATTT 7920 

TATTACAAAG ATATATTTGT ATGTTTTGTA GCTGTTGTAT TCTTCaATGT TACAAACTAT 7980 

ATGGTAACTG CATATTTACC AACCTATTTA GAACAAGTTA TTAAATTAGA TGCAACGACA 8040

ACAAGTGTAT TAATTACTTG TGTCATGGCA ATAATGATTC CATTAGCATT AATGTTTGGT 8100

AAGTTAGCGG ATAAAATAGG TGAAAAGAAA GTATTTCTAA TTGGTACTGG TGGGCTAACA 8160 

TTATTCAGTA TCATCGCATT TATGTTATTA CATTCACAAT CATTTGTTGT AATAGTAATC 8220 

GGTATATTTA TATTAGGATT TTTCTTATCA ACTTACGAAG CGACAATGCC AGGGTCGTTA 8280

CCAACGATGT TTTACAGTCA TATAAGATAT CGAACTTTAT CAGTAACATT TAATATCTCT 8340

GTTTCGATAT TTGGTGGTaC GaCGCCATTA GTkGCAmCaT GGTTaGTTAC GAAAACTGGA 8400

GATCCATTAG CmCCTGCGTA TTATTTAACA GCAATCAGTG TTATTGGCTT TTTAGTTATT 8460 

ACATTCTTAC ATTTAAGTAC AGCAGGAAAA TCTCTAAAAG GTTCGTATCC AAATGTAGAT 8520 
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GAACGTAAGA ATTAGAGATT TTAATaAAAA GTATAAATCA ATCGTATATA AGCACTTTAA 8640 

AGCTAGTAGG TTCTGCTAAC TTTAAAGTGC TTTTTAAATT GAGAACTGTA ATTAGCCGTA 8700 

ATAAAGTTTT TGTATATACA TAAACCCCCA CTGCAATGAT TATCGCAATG GGGGAAAGAG 8760 

GGGACTTAAA GCATATGTTT AGCTTTGAAT ACTTAAAATT CTCTTGCTAT TGAAATGTTA 8820 

GGATGTAAAT ATGTCTTAGA GTATTTTGTC CAACGCAATT AATATTGAGA CTCTAACCTT 8880 

CAATATTATT ATAGAGAACA CAAACTTAAA TAGATTGGGT GACTTATTTG TGTCAGTTAT 8940 

TGCGATTGCG ATAACTTCTT TTCTCTATAT ACATATAGTA ACGTCTTATC TAATAAAAAA 9000 

CATGGTACTA CAGTATCAAA TTTATCTAGG GCTTAAGTTT GATTTTTATA ATAGGCAGGT 9060 

TTACCTGATA AAAATACTTA TTCATTATAT AATGTTAACA ATATGTATTT TAAAGTTTAC 9120 

ATTGAGTGAG GGATATTGAT GAACGTAATT TTAGAACAGT TGAAAACACA TACTCAAAAT 9180 

AAACCTAATG ACATAGCATT ACATATCGAT GATGAAACAA TTACATATAG TCAACTAAAT 9240 

GCCCGCATCA CTAGCGCAgT TGAATCTTTG CAGAAATATT CACTTAACCC TGTCGTTGCT 9300 

ATTAATATGA AATCACCGGT GCAAAGTATT ATTTGTTATT TAGCTTTGCA TCGTTTACAT 9360 

AAAGTGCCTA TGATGATGGA AGGTAAATGG CAAAGTACTA TACATCGTCA ATTGATTGAA 9420 

AAATATGGTA TTAAAGATGT AATTGGAGAT ACAGGTCTCA TGCAGAATAT AGACTCACCG 9480 

ATGTTTATTG ATTCAACGCA ATTACAGCAC TACCCCAATT TATTACATAT TGGTTTTACT 9540 

TCAGGGACAA CTGGACTGCC AAAAGCATAT TATCGTGATG AAGATTCATG GTTGGCTTCT 9600 

TTTGAAGTTA ATGAAATGTT GATGTTAAAA AATGAAAATG CAATAGCAGC CCCTGGACCA 9660 

CTATCGCACT CGTTAACATT ATATGCGTTA TTGTTTGCTT TAAGTTCCGG TCGTACTTTT 9720 

ATAGGACAGA CCACTTTTCA TCCTGAAAAG TTACTTAATC AATGTCATAA AATATCATCA 9780 

TACAAAGTTG CTATGTTTCT TGTTCCAACG ATGATTAAAT CATTATTGTT AGTTTACAAC 9840 

AATGAACATA CAATCCAATC ATTTTTTAGC AGTGGAGATA AGCTGCATTC TTCTATTTTT 9900 

AAAAAGATAA AAAATCAAGC AAATGACATA AATTTGATTG AATTTTTTGG TACATCGGAA 9960 

ACCAGTTTTA TCAGCTATAA CTTGAATCAG CAAGCACCAG TTGAATCAGT AGGTGTGCTA 10020 

TTTCCAAATG TGGAATTGAA AACAACGAAT CACGATCACA ATGGTATAGG AACTATTTGT 10080 

ATAAAAAGTA ATATGATGTT TAGTGGCTAT GTAAGTGAAC AATGTATAAA TAATGATGAA 10140 

TGGTTTGTTA CTAATGATAA TGGCTATGTA AAAGAGCAGT ATTTATATTT AACGGGACGT 10200 

CAACAGGATA TGTTAATTAT TGGTGGTCAA AATATATATC CAGCACATGT TGAACGCCTT 10260 

TTAACGCAAT CTTCGAGCAT TGATGAAGCA ATTATCATCG GTATTCCAAA TGAGCGTTTT 10320 
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CAATTTTTAA AAAAGAAAGT GAAaCgnTaT GAAATTCCAT CGATGATTCA TCATGTAGAA 10440 

AAGATGTATT ACACTGCAAG tGGTaAAATT GCTAGAGAAA AAATGATGTC GATGTATTTG 10500 

AGAGGTGAAT TATAATATGA ATCAAGCAGT CATAGTTGCA GCTAAACGAA CTGCATTTGG 10560 

GAAATATGGT GGCACTTTAA AACATTTAGA GCCaGAACAA TTGCTTAAAC CTTTATTCCA 10620 

ACATTTTAAA GAGAAGTATC CAGAGGTAAT ATCTAAAATA GATGATGTAG TTTTAGGTAA 10680 

TGTTGTTGGG AATGGTGGCA ATATTGCAAG AAAAGCATTG CTTGAAGCGG GGCTTAAAGA 10740 

TTCAATACCT GGCGTCACAA TCGATCGGCA ATGTGGGTCT GGACTTGAAA GTGTTCAATA 10800 

TGCATGTCGC ATGATCCAAG CCGGAGCTGG CAAGGTATAT ATTGCAGGTG GTGTTGAAAG 10860 

TACAAGTCGA GCACCTTGGA AAATCAAACG ACCGCATTCT GTGTACGAAA CAGCATTACC 10920 

TGAGTTTTAT GAGCGTGCAT CATTTGCACC TGAAATGAGC GACCCATCAA TGATTCAAGG 10980 

TGCTGAAAAT GTGGCCAAGA TGTATGATGT TTCAAGAGAA TTACAAGATG AATTTGCTTA 11040 

TCGAAGTCAT CAATTGACAG CGGAAAATGT AAAGAATGGA AATATTTCTC AGGAAATATT 11100 

ACCTATAACC GTTAAAGGAG AAATATTCAA CACTGATGAA AGTCTAAAAT CACATATTCC 11160 

GAAAGATAAC TTTGGCCGAT TTAAGCCCGT GATCAAAGGT GGGACCGTTA CCGCTGCGAA 11220 

TAGTTGTATG AAAAATGATG GTGCAGTTTT ATTGCTTATT ATGGAAAAAG ATATGGCATA 11280 

CGAATTAGGT TTCGAGCATG GTTTATTATT TAAAGATGGT GTTACGGTAG GTGTTGATTC 11340 

TAATTTTCCT GGCATTGGTC CAGTACCAGC CATTTCCAAC TTACTAAAAA GAAATCAATT 11400 

AACGATAGAA AATATTGAAG TCATTGAAAT TAACGAAGCG TTCAGTGCAC AGGTAGTTGC 11460 

CTGCCAACAA GCTTTAAATA TTTCAAATAC GCAATTAAAT ATATGGGGTG GTGCATTAGC 11520 

ATCAGGTCAT CCATACGGTG CAAGCGGTGC CCAATTAGTG ACTCGATTAT TTTATATGTT 11580 

TGACAAAGAG ACTATGATTG CATCTATGGG GATAGGGGGA GGTCTAGGAA ATGCAGCATT 11640 

ATTTACTCGA TTCTAACCAG CGATTAAATG TGTCATTTTC TAAGGATAGT GTGGCTGCAT 11700 

ATTATCAGTG TTTTAACCAA CCTTATAGAA AAGAAGTACC ACCATTAATG TGTGCGTCAT 11760 

TATGGCCAAA ATTTGATTTA TTTAAAAAAT ATGCAAATAG CGAACTGATT TTAACAAAAT 11820 

CAGCAATTAA TCAAACTCAA AAGATAGAAG TAGACACAAT ATATGTAGGG CATTTAGAAG 11880 

ATATTGAATG CCGACAGACT CGCAATATCA CACGTTATAC AATGGCTTTA ACATTAACTA 11940 

AAAATGATCA ACATGTCATA ACGGTtACAC AAACTTTTAT TAAGGCGATG AAGTAGAGAT 12000 

GGAGTTTAAT GAGATATGGA TAAATGAATA TTTGGCGCTC GTAAATGATG ATAATCCAAT 12060 

ACATAATGAG ATTGTGCCAG GACAATTAGT GAGTCAAATG ATGCTGATGG CTATGTCATT 12120 
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ATTCATTGAA CAACACGAAC ACGAAATTAT AGCAATTAAT GACGATGGAG AGATTAAAAT 12240 

AAAAATTTCT TTGAGCACAA AAAAATAACC GATATTAGCT GCATGAACGC ATATTAATTA 12300 

GGAGATGAAA GGACAGCTAA TATCAGTTAT GTATTGTTAT TATTATTGGG AACAGAGATG 12360 

AATATAGGTT ACGTTTCTTT CTTTGCACGG GGATGCATTA ATCTAAAATA ATAATAACAA 12420 

CTATATCAAT GTTTAATAAA TTCTGGATTA TTGGAACGAT TAGTCAATTT AACTAACTTT 12480 

CATATGATCT ATATCGTCTT GTAATAAAGA GAGCAATTTG AATATTTCAG TATCACTAAA 12540 

TGAATCGTCA CATTTAATTG AAACATGCTG AAACGTTTTG GTTATAATTT CATAAACTGG 12600 

TGCGCCTTCA TGGTGATACT GTCGATAAAT AATCATAACC TATATTACCT CCTTTGCTAC 12660 

TCTATGGTTA TATTATAAAT AACATTTTTA TGTGTGACAT CAACCTTAAG TATCAACTTT 12720 

TTATCAGACA TAGAACGTAT GATTTACTAA GACTATTTAT GTATAAAAGT TCTAAATAAA 12780 

TATATATTTA TAGAGTCGCC TGGCA6TCAT TTGGGaAATA TAACATATAT GATTAGAGAG 12840 

GCATCTATCG CAAAAGAATG ATAATGATAG AGGTATTGAG CATATAGATG AGTTTAAGTT 12900 

CATCTTGAAA ATAAAGGGTT ATTTAGTCAT AGATGTAGAT GTATAGGAAA TATTTGTATG 12960 

TATTGTTCGA TATGTATGAA ATTTTCAATA AAAGCTAATA ACGCTTATAT GTAACTTTCA 13020 

AATTTAAATT ATATACAGAG CATGATGATT ATAAAAAAAT AACCACATCA CATAAATTGA 13080 

GTTCATACCC AATTTAAGTG GTGTGGCTAA TAATGTTGAT TTATAGATGA ACCGCCTAAT 13140 

CGTTAAACCT CTGTTACTTC AACATCGATA TGTTCAATAC GGTTGTATGC ACCGTGATCC 13200 

ACAGGACCAA CAAAATCATT CATTTTCCAA CCGTTTTTAA TAGCAGAAGC GACGAAAGCT 13260 

TTCGCGCTAA TCACAGCTTC TTTCGGTGAC TTACCGTTAG CTAAATATGC AGTTGTTGCC 13320 

GCAGCAAATG TACAACCAGC ACCATGGTTA TAACTTTGTT GGAACATGTC TGTTGTTAGT 13380 

TGATAAAATG TTTGACCATC ATAGTATAAG TCATACGATT TATCTTGATC TAAAGCTTTG 13440 

CCACCTTTAA TGATGACATG CTGTGCGCCT TTATCAAAGA TAATTGTTGC AGCCTTTTTC 13500 

ATATCTTCAA TTGAATTTAA TTTACCTAAT CCTGATAATT GACCCGCTTC AAATAAGTTT 13560 

GGTGTCACTA CCGTTGCTTT AGGTAGTAAA TATTTAATCA TCGCCTCAGT ATTTCCAGGA 13620 

TTAAGCACTT CATCTTCGCC TTTACAAACC ATGACAGGAT CTACTACAAA ATATTGTGCA 13680 

TTAGATGCCT CATATACTTC TCCAGCACGT TTGATTATCT CCTCAGTACC TAACATACCT 13740 

GTTTTAATAG CATCAGGTCC GATTGATAAA GCCGTTTCAA GTTGTTTTTC AAATACATCC 13800 

ATTGGTAATG GTGTAACATC GTGTGACCAT GTATCTTTAT CCATAGTAAC GATGGCAGTT 13860 

AAAGCGACCA TGCCATACGT ATCTAATTCT TGGAACGTTT TCAAATCTGC TTGCATACcT 13920 
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CACTCCTACA TAATAATATT GTATTCATCA TATCATTTTT AACCTAATTG AAAAATATTA 14040 

AGCATTCAAT ATTTGATGAT TGTTGAAATG AATCATTCAT ACTATTGTAA CTTTTGAAAA 14100 

TGTCATTCAC TTTAGATAAG TGTGATATGT TAAAATATGT CCTGAGGTGA GATTGAATGG 14160 

AATGGTCGCA AATTTTTCAT GACATAACAA CGAAACATGA CTTTAAAGCT ATGCATGATT 14220 

TTTTAGAAAA AGAATATTCG ACTGCAATCG TATACCCTGA TAGGGAAAAT ATATATCAAG 14280 

CGTTTGATTT AACACCGTTT GAAAATATCA AAGTTGTTAT ATTAGGACAA GACCCGTATC 14340 

ATGGTCCAAA CCAAGCACAT GGATTAGCAT TTTCAGTGCA ACCTAACGCA AAATTCCCTC 14400 

CATCTTTACG TAATATGTAT AAAGAATTAG CAGATGATAT TGGATGCGTT AGACAAACAC 14460 

CGCATTTACA AGATTGGGCA AGAGAAGGCG TCTTGTTATT GAATACAGTT TTAACCGTAA 14520 

GACAGGGTGA AGCAAATTCT CATCGTGATA TTGGTTGGGA AACATTTACT GATGAAATTA 14580 

TTAAAGCAGT GTCTGATTAT AAAGAACATG TTGTCTTTAT TTTGTGGGGG AAACCTGCAC 14640 

AGCAAAAAAT AAAGCTTATC GATACATCTA AACATTGTAT TATAAAATCA GTGCATCCTA 14700 

GTCCACTGTC TGCATATAGA GGATTCTTTG GATCAAAACC GTATTCCAAA GCGAATGCCT 14760 

ATTTAGAGTC AGTAGGAAAA TCACCAATTA ATTGGTGTGA AAGTGAGGCG TAGATGTTGA 14820 

ATAGAGAAAC TTTAATAGCA CGAATTGAGC AAGAATTAGT ACAAGCAGAG CAGGCACAGC 14880 

ATGACCATGA CTTTGAAAAA CATATGTATG CCATACATAT ATTAACATCT TTATATGCTT 14940 

CAACATCAAA TACACCACAT ATTGGTGAAC AACAAATGAA TCGTCGTATT GCTAACCATA 15000 

ATCAAATGCC ACAATCACAA ATAACGCAGC CAACTCATCA AGTGACAGTT GCTGAAATTG 15060 

AAGCGATGGG TGGTAAAGTA AATACGCATT CAGCACATCA TCATAATAAG TCATATTCAC 15120 

AACCTTCAAA CCAACAACAA AGATTAGCGA CAGATGATGA CATTGGCAAT GGTGAATCCA 15180 

TATttGATTT TTAAAAAGCA ACAATGAAAC ATAATTACTT AATAGCTTGT TAAGTATGTA 15240 

GGTTAATAAT CAAGACGCAT ATACTTTTAT TCGAGTGTTC GGATTTAAAC ATTTATTAAT 15300 

ACTGAATTAT ATAAGGAGAG GTAGCAATGA AATTATTTAT TATTTTAGGT GCATTAAACG 15360 

CGATGATGGC TGTCGGTACA GGTGCATTTG GTGCGCATGG TTTACAAGGA AAAATAAGTG 15420 

ATCACTATTT ATCAGTATGG GAAAAAGCAA CGACGTATCA AATGTACCAT GGCTTAGCAT 15480 

TATTAATTAT AGGTGTAATT AGTGGTACAA CTTCAATCAA TGTTAACTGG GCTGGCTGGT 15540 

TAATATTTGC TGGTATTATT TTCTTTAGTG GATCATTATA TATTTTAGTA TTAACTCAAA 15600 

TTAAAGTTTT AGGTGCGATT ACGCCAATTG GTGGCGTATT GTTCATCATT GGATGGATAA 15660 

TGTTAATCAT TGCGACATTC AAATTTGCTG GTTAAATTTT AAAACTTTAG ATTACCTATG 15720 
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TGGGTATAGA ATACCTTCGA GGTGAGTTTT TATTTATGGA AAAAAAGAAT AAGCAAATAG 15840 

ATAGAGGCGA TTTAAAACAA AACCTATCTG AAAAGTTTGT ATGGGCGATT GCATATGGTT 15900 

CATGTATCGG ATGGGGCGCA TTCATCTTAC CAGGAGACTG GATTAAGCAG TCAGGTCCGA 15960 

TTGCAGCATC AATTGGTATA GTTATTGGTG CATTATTAAT GATATTAATT GCGGTTAGTT 16020 

ATGGCGCATT AGTAGAGAGA TTTCCAGTAT CAGGGGGCGC GTTTGCCTTT AGTTTCTTAA 16080 

GTTTCGGCAG ATATGTGAGT TTCTTCTCAT CATGGTTTTT AACTTTTGGT TATGTCTGTG 16140 

TCGTTGCTTT AAAtGCGACC GCATTCAGTT TACTAGTTAA ATTCTTATTG CCAGATGTCT 16200 

TAAATAATGG GAAACTATAC ACCATTGCGG GCTGGGACGT TTATATTACG GAAATCATTA 16260 

TTGCGACCGT ATTACTACTT GTATTCATGC TAGTAACGAT TCGTGGCGCA AGTGTATCTG 16320 

GATCATTACA ATATTATTTC TGTGTGGCGA TGGTAATCGT CGTATTATTG ATGTTCTTTG 16380 

GTTCATTCTT TGGTAATAAT TTTGCACTTG AAAATTTACA ACCGTTAGCT GAACCTAGCA 16440 

AAGGATGGTT AGTGTCTATT GTGGTTATTG TATCCGTGGC ACCATGGGCA TATGTTGGAT 16500 

TTGATAATAT TCCACAAACA GCAGAAGAGT TTAACTTTGC ACCAAACAAG ACATTTAAGC 16560 

TTATCGTGTA CAGTTTATTA GCAGCATCAT TAACTTATGT TGTCATGATT TTATACACTG 16620 

GTTGGTTATC AACAAGTCAT CAAAGTTTAA ATGGGCAGTT GTGGTTAACA GGTGCTGtTA 16680 

CACAAACAGC ATTTGGTTAT ATTGGATTAG GTGTATTAGC AATTGCAATT ATGATGGGTA 16740 

TATTTACTGG TTTAAATGGA TTCTTGATGA GTTCAAGTCG CTTGTTATTT TCTATGGGAC 16800 

GTTCAGGTAT TATGCCAACA ATGTTTAGTA AATTACATAG TAAATACAAA ACACCATATG 16860 

TCGCAATCAT ATTCCTAGTA GGAGTGTCGT TAATTGCACC TTGGCTAGGA AGAACTGCAT 16920 

TGACTTGGAT TGTAGATATG TCATCTACTG GTGTATCCAT TGCCTACTTT ATTACATGTT 16980 

TGTCJTGCAGC GAAATTATTC AGTTATAACA AACAAAGTAA TACGTATGCA CCGGTTTACA 17040 

AAACGTTTGC TATTATCGGC TCATTTGTAT CATTCATTTT CTTAGCGTTG TTATTAGTGC 17100 

CAGGTTCTCC TGCAGCACTG ACTGCACCGT CTTATATTGC ATTACTTGGA TGGTTAATCA 17160 

TCGGTTTAAT ATTCTTTGTG ATTCGATATC CTAAATTGAA AAATATGGAT AATGATGAAT 17220 

TAAGTCGCTT GATTTTAAAT AGAAGTGAAA ATGAAGTTGA TGATATGATT GAAGAACCTG 17280 

AAAAAGAAAA AACTAAATAA TAAAAGAATC GCACAATAAA CCTTCTTCAT TCGGAGGCGT 17340 

ATCGTGCGAT TTTTTGTATT ATAAATTGAC ATTTAAGACG AGGCAGCTGA ACCTTATATA 17400 

TAATTGCTAA GAGTTAGGGC TGAGCCATTT CTAACAAATA TTTATAATCG TTTAAAAGAT 17460 

TTCACGAACC CAGAAACAAT TAATTTGGAA ATTTGGTCGG CGAATAATAA ACCTAATGCG 17520 
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AAGACTAAAT TTTTTGTAGC ATCGTATGCT AAGCCACCAG GTACTAATGG AATGATACCC 17640 

GTTACCATAA AAATGATGGC AGGTTCTTTT TGTTTACGAG CCATATAATG ACTTAACAAG 17700 

CCTAATGCTA AACTACCAAA GAAACTAGAG TATATAGTGT GCACATTAAA GCCGTTGAAG 17760 

AATAAGGTGT AAACCATCCA TCCACACGTA CCAACGAAAC CACATGATAG ATATAATTTT 17820 

CTAGGTGCAT CAAAAATGAC GCAGAA 17846 


(2) INFORMATION FOR SEQ ID NO: 110: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 110: 





ATTGACACTT GGTGAAAGTA ATATCGCCGC GCTATTTTGG CAAAATGGAC ACTTAGAACC 60 

TGAGTTACAA GATGAACAGC CAATTAATAT ATTAGGATCT GkTCAAATCA ACGAATGGAA 120 

TGGTAATCAA TCACCGCAAA TAATTATTCA AGATATTGCG ATGAATGAAC AGCAAATATT 180 

AGATTATAGA AGTAAGCGAA AAAGTTTACC TTTTACAGAA AATGATGAAA ATATTGTCGT 240 

GCTTATTCAT CCTAAAAGTG ATAAAGTAAA TGCGAATGAA TATTATTATG GTGAAGAAAT 300 

TAAACAACAA ACTGATAAAG TAGTATTAAG AGATTTACCA ACGTCAATGG AAGACTTGTC 360 

TAATTCCTTG CAACAACTGC AATTTTCTCA ACTTTATATA GTTTTGCAAC ATAATCATTC 420 

GATTTACTTC GATGGTATAC CTAATATGGA TATTTTTAAA AAGTGTTATA AAGCATTAAT 480 

AACTAAACAA GAAACAAATA TCCAGAAAGA GGGTATGTTA TTGTGTCAAC ATTTAAGTGT 540 

GAAA£CAGAT ACACTTAAAT TCATGTTGAA AGTTTTCTTA GACTTAAAAT TTGTAACACA 600 

AGAAGATGGT TTAATTCGAA TCAATCAACA ACCTGATAAA AGATCGATTG ATTCCAGCAA 660 

AGTATATCAA TTAAGACAAC AACGTATGGA TGTTGAAAAG CAATTATTAT ATCAAGATTT 720 

TTCAGAAATA AAAAATTGGA TAAAGTCACA ATTGTCGTGA GCAATTTAGG AGGAAATATT 780 

AATGGATTTA AAGCAATACG TATCAGAAGT TCAAGATTGG CCGAAACCAG GTGTTAGTTT 840 

CAAGGATATT ACTACAATTA TGGATAATGG TGAAGCATAT GGCTATGCAA CAGATAAAAT 900 

TGTAGAATAC GCAAAAGACA GAGATGTTGA TATCGTTGTA GGACCTGAAG CGCGTGGCTT 960 

TATCATTGGC TGTCCTGTAG CTTATTCAAT GGGGATTGGC TTTGCACCTG TTAGAAAAGA 1020 

AGGGAAATTA CCTCGTGraAG TCATTCGTTA TGAGTATGAC CTAGAATATG GTACAAATGT 1080 
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ATTAGCTACT GGTGGTACGA TTGAAGCAGC AATAAAATTA GTTGAAAAAT TAGGCGGTAT 1200 

CGTAGTAGGT ATTGCATTTA TAATTGAATT GAAATATTTA AATGGTATTG AAAAAATTAA 1260 

AGATTACGAT GTTATGAGTT TAATCTCATA CGACGAATAA TAAATAATAT AATTTTATCA 1320 

AATGAAATCC TTCATCAAAT GTATAAGAAC CAATGACTTA ATTAAAAAAG TTGTTTAAGT 1380 

TTTCTTAACA TGAGATGTTA GGATTTTTTA TTTACTGAAA ATGTTAGATG ATTGAGCATT 1440 

ATACCTTAAT AACATCGTTT ATTTATTTCA TAAATTGTAG TATCATAGAA CTAATATTTA 1500 

AAAAATGAAA CAGTAGATTT AGGTCGAATT TTTGTAAAAG TTTTAAAAGT AGGAATAGTA 1560 

TACAAATTAA ACTCGCTCAA GTAAAATTAA TATTACGATT AATGACGACA GGATAAATAT 1620 

TTATCGTCGA CGGACGTATG ATTGGTGTGG GACAAATACT ATTCAACAAG AGTACCTAAA 1680 

TCATTGTTTA AGGCGAAGTA ATAAATATGA ATGGGGTGTA TCATATAATG AACAACGAAT 1740 

ATCCATATAG TGCAGACGAA tTCTTCACAA AGCAAAATCA TATTTGTCAG CAGATGAATA 1800 

TGAGTATGTT TTAAAAAGCT ATCATATTGC TTATGAAGCA CATAAAGGTC AGTTCCGAAA 1860 

AAACGGATTA CCATACATTA TGCATCCTAT ACAAGTTGCA GGTATTTTAA CAGAAATGCG 1920 

ATTAGACGGA CCGACGATTG TCGCAGGTTT TTTGCATGAT GTAATTGAAG ATACACCGTA 1980 

TACATTTGAA GATGTAAAAG AAATGTTCAA TGAAGAAGTT GCTCGAATTG TTGATGGTGT 2040 

GACGAAGCTT AAAAAAGTAA AATACCGCTC AAAAGAAGAA CAACAAGCTG AAAATCATCG 2100 

CAAGTTATTT ATTGCGATTG CCAAAGATGT ACGCGTAATT TTGGTGAAAT TAGCAGACAG 2160 

ATTACATAAT ATGCGTACCT TGAAAGCCAT GCCGCGCGAA AAACAAATTA GAATTTCTCG 2220 

AGAAACATTA GAAATTTATG CACCATTAGC ACATCGTCTT GGTATTAATA CAATCAAATG 2280 

GGAACTAGAA GATACGGCTC TTCGTTATAT TGATAATGTG CAATATTTTA GAATAGTCAA 2340 

TTTAATGAAG AAGAAACGTA GTGaACGTGA AGCGTATATC GAAACGGCTA TTGATAGAAT 2400 

ACGf ACTGAA ATGGACCGAA TGAATATCGA AGGCGA7ATA AATGGTAGAC CTAAACATAT 2460 

TTACAGTATT TATCGGAAAA TGATGAAGCA GAAAAAACAA TTTGATCAAA TTTTTGATTT 2520 

GTTGGCGATA CGTGTTATTG TCAATTCTAT TAATGATTGT TATGCGATAC TTGGGTTGGT 2580 

GCATACGTTA TGGAAACCGA TGCCAGGACG TTTTAAAGAT TATATTGCAA TGCCTAAACA 2640 

AAATTTGTAT CAGTCATTGC ATACTACAGT AGTAGGCCCA AATGGAGACC CGCTCGAAAT 2700 

CCAAATACGA ACGTTTGATA TGCACGAAAT TGCTGAGCAT GGTGTTGCAG CACACTGGGC 2760 

TTACAAAGAA GGTAAAAAAG TAAGTGAAAA AGATCAAACT TATCAAAATA AGTTAAATTG 2820 

GTTAAAAGAA TTAGCTGAAG CGGATCATAC ATCGTCTGAC GCTCAAGAAT TTATGGAAAC 2880 



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TGAGTTGCCA TATGGTGCTG TGCCGATTGA TTTTGCTTAT GCGATTCACA GTGAAGTAGG 3000 

TAATAAGATG ATTGGTGCCA AGGTGAATGG CAAAATTGTA CCAATTGACT ATATTTTACA 3060 

AACAGGCGAT ATTGTTGAAA TACGTACTAG TAAACATTCA TATGGACCAA GTCGTGATTG 3120 

GTTGAAAATT GTTAAATCGT CTAGTGCCAA AGGTAAAATT AAAAGTTTCT TCAAAAAACA 3180 

AGATCGTTCA TCTAATATTG AAAAAGGCCG AATGATGGTT GAAGCTGAAA TAAAAGAGCA 3240 

AGGATTTAGA GTCGAAGATA TTTTGACAGA GAAAAATATT CAGGTTGTTA ATGAAAAATA 3300 

TAACTTTGCA AATGAAGATG ATTTATTCGC AGCTGTAGGA TTTGGCGGCG TGACATCCTT 3360 

ACAGATTGTT AATAAATTAA CTGAAAGACA ACGTATTTTA GATAAACAAC GTGCTTTAAA 3420 

TGAAGCACAA GAAGTTACGA AATCATTGCC TATTAAAGAC AACATCATTA CTGATAGTGG 3480 

TGTCTATGTA GAAGGTTTAG AAAATGTACT TATCAAGTTG TCAAAATGTT GTAATCCTAT 3540 

ACCaGGTGAT GATATTGTAG GTTATATCAC CAAAGGTCAC GGTATTAAAG TACATCGCAC 3600 

TGATTGCCCA AATATTAAGA ACGAAACTGA ACGACTAATT AATGTTGAAT GGGTAAAATC 3660 

AAAAGACGCA ACTCAAAAAT ATCAGGTTGA TTTAGAGGTA AtGCGTATGA CCGAAATGGC 3720 

TTGTTGAATG AAGTACTACA AGCTGTTAGC TCGACAGCCG GCAATTTAAT TAAAGTTTCA 3780 

GGACGTTCAG ATATTGATAA AAATGCAATA ATAAATATTA GTGTCATGGT GAAAAACGTG 3840 

AATGATGTTT ATCGTGTGGT AGAAAAGATC AAACAACTTG GTGATGTTTA TACAGTAACA 3900 

AGAGTTTGGA ACTAGAGGTG CAAAATATGA AAGTAGTTGT ACAAAGAGTT AAAGAAGCAT 3960 

CGGTGACGAA TGATACATTA AATAATCAAA TCAAAAAAGG ATATTGTTTA TTAGTCGGTA 4020 

TCGGTCAGAA CTCTACAGAG CAAGATGCAG ATGTAATTGC AAAGAAAATT GCTAATGCAA 4080 

GATTATTTGA AGATGACAAT AATAAATTAA ACTTTAATAT CCAACAAATG AATGGTGAAA 4140 

TACTATCAGT TTCACAATTT ACTCTCTATG CAGATGTAAA AAAAGGTAAC CGTCCAGGTT 4200 

TCTCAAATTC TAAAAATCCT GATCaAGCGG TAAAAATTTA TGAGTATTTT AATGcaTGCG 4260 

CTACGAGCGT ATGGTCTTAC TGTGAAAACA GGTGAATTTG GAACACACAT GAATGTTAGC 4320 

ATAAATAATG ATGGTCCAGT CACTATTATT TATGAAAGTC AGGACGGCAA AATTCAATGA 4380 

AAAAAATAGA GGCATGGTTA TCTAAAAAGG GTCTTAAAAA TAAACGTACT CTAATAGTAG 4440 

TGATTGCCTT TGTCTTATTT ATCATCTTTT TATTTTTATT GCTGAATAGC AATAGTGAAG 4500 

ATAGTGGGAA CATCACGATA ACTGAAAATG CTGAATTACG TACAGGTCCA AACGCTGCGT 4560 

ATCCAGTCAT ATATAAAGTT GAAAAAGGTG ACCATTTTAA AAAGATTGGT AAAGTAGGTA 4620 

AATGGATTGA AGTTGAAGAT ACATCCAGTA ATGAAAAAGG TTGGATAGCT GGATGGCACA 4680 
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TAGTGCTTGA TCCTGGTCAT GGAGGTAGTG ACCAGGGTGC TTCAAGCAAT ACTAAATATA 4800 

AAAGTTTAGA AAAAGATTAT ACGTTGAAAA CAGCAAAAGA ATTGCAGCGT ACTTTAGAAA 4860 

AAGAAGGCGC AACTGTTAAG ATGACAAGAA CAGACGATAC ATATGTTTCA CTAGAAAATC 4920 

GTGATATCAA AGGCGATGCC TATTTGAGTA TACATAATGA TGCGTTAGAA TCATCTAATG 4980 

CAAATGGAAT GACaGTTTAT TGGTATCATG ATAATCAAAG AGCTTTAGCA GATACGTTAG 5040 

ACGCTACGAT TCAGAAGAAA GGTCTACTTT CTAATCGCGG TTCAAGACAA GAAAATTATC 5100 

AAGTGTTAAG ACAAACAAAA GTTCCTGCTG TTTTATTAGA ATTAGGTTAT ATTAGTAACC 5160 

CAACTGATGA AACGATGATT AAAGATCAAT TACATAGACA AATTTTAGAA CAAGCAATTG 5220 

TTGATGGCCT TAAAATTTAT TTTTCTGCGT AGGGCTTGCA AAAATATGTG AAAGTAGTTA 5280 

TCATTGATAT TGAATTTTAT AACTAAAACC GTTAGTATTC TTGAAATGGT AAATGAAATA 5340 

GGTAGCAATC TAACTAAGAT TGTGTAGGAA TATAATCCAT AGACTGAAAG ATTATGCTGA 5400 

GTAGTTTATA TACATTGAAC ACAAGAAGAG GTGCTTTATG AAAAGTAAAG CCGTTAAACG 5460 

TACGTTaAAC GTTTTGAGTG GGTTTATTAA ATGCACGCTT ATAAAAAGTA ATGATGATTA 5520 

CAATTAGGCA TGTTTTTTAA ACCA 5544 

(2) INFORMATION FOR SEQ ID NO: 111: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1067 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 111: 

AAAAGATTGC AAATATAAAT GGCATGTTTA ATATGTTAGA ACAACAAATC ATTCATAGCC 60 

AAGATATGGC TCATTTTAGA AGTGAATTTT TTTACGTCAA TCATGaGCAT CGAGAAAACT 120 

ATGAAgCACT CCTAATTTAT TACAAAAATA GTATCGACAA TCCTATTGTA GATGGTGCAT 180 

GTTATATTTT AGCCCTACCT GAAATTTTCA ATAGTGTTGA TGTTTTCGAA TCAGAGTTAC 240 

CATTTTCATG GGTATATGAT GAAAATGGCA TTACCGAAAC AATGAAATCA CTTAGCATTC 300 

CATTACAATA TTTAGTTGCA GCAGCTTTAG AAGTAACTGA TGTGAATATA TTTAAGCCTT 360 

CAGGATTTAC AATGGGAATG AATAATTGGA ATATTGCTCA AATGCGAATC TTTTGGCAAT 420 

ATACAGCAAT TATTAGAAAA GAAGCACTAT AACATTAATA ATTAATTAGC TATAAAGATG 480 

ATTCACAACA ATCATCTTTA TAGCTTTTTT ATGTCTAATT ATTTTTGAGG AAAATmACAA 540 
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AATTTTATGT TTTCAAAAGT AAACAATCAA AAGATGTTAG AAGATTGCTT CTATATAAGA 660 

AAGAAAGTGT TTGTAGAAGA ACAAGGCGTC CCTGAGGAAA GTGAAATTGA TGAATATGAA 720 

TCTGAATCTA TTCACCTCAT TGGATATGAT AATGGACAGC CAGTTGCCAC TGCTCGAATA 780 

CGCCCTATTA ATGAAACAAC TGTCAAAATA GAACGAGTAG CTGTGATGAA ATCACATCGT 840 

GGACAAGGAA TGGGTAGAAT GCTTATGCAA GCTGTAGAAT CATTAGCTAA AGATGAAGGT 900 


TTTTACGTAG CTACTATGAA TGCCCAATGT CATGCTATCC CATTTTATGA AAGTTTAAAC 960 

TTTAAAATGA GAGGTAATAT ATTTCTTGAG GAAGGCATCG AGCATATTGA AATGACAAAA 1020 

AAGTTAACCT CGCTTAATTA AAAAAAGTTG TATCTATTTT AGAAACA 1067 


(2) INFORMATION FOR SEQ ID NO: 112: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18613 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 112: 





AAGACGtAtG ATAACAACAA TACgTGTAGT GAAAGATTTT AATCTACATA TTACTGACAA 60 

AGAATTCATT GTATTTGTTG GACCATCGGG ATGTGGTAAA TCAACAACAT TACGAATGGT 120 

TGCTGGACTA GAGTCTATCA CATCTGGAGA TTTTTATATT GATGGGGAAC GCATGAACGA 180 

TGTTGAACCA AAGAATAGAG ATATTGCGAT GGTATTTCAA AACTATGCAT TATATCCACA 240 

TATGACTGTT TTTGAAAATA TGGCATTTGG GCTAAAGCTA CGTAAAGTAA ATAAAAAAGA 300 

GATTGAACAA AAAGTTAATG AAGCAGCTGA AATATTAGGA TTAACTGAGT ATCTTGGTCG 360 

TAAACCAAAA GCGTTATCTG GCGGACAGCG TCAACGTGTT GCTTTGGGCA GAGCTATTGT 420 

TAGGGATGCG AAAGTCTTTT TAATGGATGA ACCATTATCG AATCTTGATG CGAAyTtCGA 480 

GTACAAATGC GCACAGAAAT ATTGAAATTA CATAAGCGAC TTAATACTAC GACAATTTAT 540 

GTTACACATG ATCAAACTGA AGCATTGACG ATGGCTAGTC GAATTGTTGT TTTGAAAGAT 600 

GGCGACATTA TGCAAGTCGG CACACCTAGA GAAATATATG ATGCCCCTAA TTGCATATTT 660 

GTGGCGCAAT TTATCGGCTC ACCAGCAATG AATATGTTGA ATGCTACAGT TGAAATGGAC 720 

GGATTGAAGG TAGGAACACA CCATTTTAAA TTACATAATA AAAAATTTGA AAAGTTAAAA 780 

GCTGCTGGCT ACTTAGACAA GGAAATTATT TTAGGTATTC GAGCTGAAGA CATTCATGAA 840 

GAACCAATAT TTATTCAAAC TTCTCCAGAG ACACAATTTG AATCTGAAGT AGTTGTATCC 900 
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AAATTAGATT CAAGAACTCA AGTGATGGCG AACGACAAGA TTACACTAGC ATTTGATATG 1020 

AATAAGTGTC ACTTTTTTGA TGAAAAAACA GGAAATCGTA TCGTCTAAGG GGGAGTATTC 1080 

ATGTCTAAAA TTTTAAAATG TATCACGTTA GCCGTGGTAA TGTTATTAAT CGTAACTGCA 1140 

TGTGGCCCTA ATCGTTCGAA AGAAGATATT GATAAAGCAT TGAATAAAGA TAATTCTAAA 1200 

GACAAGCCTA ACCAACTTAC GATGTGGGTG GATGGCGACA AGCAAATGGC GTTTTATAAA 1260 

AAAATTACGG ATCAATATAC TAAAAAAACT GGCATCAAAG TAAAGCTTGT AAATATTGGT 1320 

CAAAATGATC AACTAGAAAA TATTTCGCTA GACGCTCCTG CAGGAAAAGG TCCAGATATC 1380 

TTTTTCTTAG CACATGATAA TACTGGAAGT GCCTATCTAC AAGGCTTAGC TGCTGAAATC 1440 

AAATTATCAA AAGATGAGTT GAAAGGTTTC AATArGCAAG CACTTAAAGC GATGAATTAT 1500 

GACAATAAGC AACTAGCATT GCCAGCTATC GTTGAAACAA CCGCACTTTT TTATAATAAA 1560 

AAATTAGTGA AAAATGCACC GCAAACGTTA GAAGAAGTTG AAGCTAATGC TGCCAAACTA 1620 

ACTGATAGTA AAAAGAAACA ATACGGTATG TTATTTGATG CTAAAAATTT CTATTTTAAT 1680 

TATCCGTTTT TATTCGGCAA TGATGATTAT ATTTTCAAGA AAAATGGCAG TGAATATGAT 1740 

ATTCATCAGC TAGGACTAAA TTCAAAACAT GTCGTCAAGA ATGCTGAACG ATTACAAAAA 1800 

TGGTACGACA AAGGGTATCT TCCTAAGGCA GCAACACATG ATGTCATGAT TGGTCTTTTT 1860 

AAAGAAGGAA AAGTAGGACA ATTTGTCACT GGACCGTGGA ACATTAATGA ATATCAAGAA 1920 

ACGTTTGGTA AAGATTTAGG AGTAACAACA TTACCTACAG ATGGTGGCAA ACCTATGAAA 1980 

CCATTTCTAG GTGTACGTGG TTGGTATTTA TCTGAATATA GTAAACATAA GTATTGGGCT 2040 

AAAGATTTAA TGCTGTATAT CACTAGTAAA GATACATTAC AAAAATATAC AGATGAAATG 2100 

AGCGAAATTA CTGGACGTGT TGACGTGAAA TCATCTAATC CAAATTTAAA AGTGTTTGAA 2160 

AAGCAAGCAC GTCATGCTGA ACCGATGCCT AATATTCCTG AAATGCGACA AGTTTGGGAA 2220 

CCGATGGGCA ATGCAAGCAT ATTTATTTCA AATGGTAAGA ATCCTAAACA AGCGTTAGAT 2280 

GAGGCGACGA ATGATATAAC GCAAAATATT AAGATTCTTC ATCCATCACA AAATGATAAG 2340 

AAAGGAGATT AGTTATGACG AAACGTAACC CTAAATTAGC GGCATTATTA TCTGTTATAC 2400 

CTGGTTTGGG ACAGTTTTAT AATAAAAGAC CCATTAAAGG GACGATATTT TTTATCTTTT 2460 

TCATCAGTTT TATTTCTGTT TTTTATAGCT TTTTAAATAT TGGTTTTTGG GGATTGTTCA 2520 

CATTAGGGAC AGTACCTAAG TTAGACGATT CTCGTGTCTT ACTTGCACAA GGTATTATTT 2580 

CTATCTTACT CGTTGCTTTC GCAATCATGC TATATATCAT TAATATTTTA GATGCATATC 2640 

GTAATGCTGA ACGATTTAAT CGCAATGAGG AAATAAAGGA TCCGAAGcGC GTATGGTGGC 2700 
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TGTAGTTGTA TTTCCATTAA TAyyTATGTT TGGAGTAGCA TTTACAAATT ACAATTTATA 2820 

CAACGCGCCT CCGAGACACA CATTAGAATG GGTTGGTTTA GATAACTTTA AAACGTTATT 2880 

CACAATTGGC GTTTGGCGTA AAACATTTTT CAGTGTTATT ACTTGGACAT TAGTATGGAC 2940 

GCTTGTTGCA ACGACACTTC AAATTGCATT AGGGCTGTTT TTGGCAATTA TTGTAAATCA 3000 

CCCTGTCGTC AAAGGTAAGA AATTTATCCG TACTGTGTTA ATCCTACCTT GGGCTGTACC 3060 

ATCATTTGTG ACAATTTTAA TATTTGTAGC GTTATTTAAT GATGAATTTG GTGCGATAAA 3120 

TAATGATATT TTGCAACCTT TATTAGGTGT AGCACCAGCA TGGTTAAGTG ATCCGTTTTG 3180 

GGCAAAAGTG GCATTAATCG GCATTCAAGT ATGGCTTGGA TTCCCATTTG TCTTTGCACT 3240 

GTTCACTGGA GTACTGCAAA GTATTTCATC AGATTGGTAC GAAGCAGCAG ATATGGATGG 3300 

TGCGTCTAGT TGGCAAAAGT TTAGAAACAT CACATTCCCG CATGTCATTT ACGCCACAGC 3360 

GCCATTGTTA ATTATGCAAT ATGCAGGTAA TTTCAATAAT TTTAATCTTA TTTATCTATT 3420 

TAATAAAGGC GGTCCACCAG TGTCAGGGCA GAATGCTGGT AGTACAGATA TCTTGATATC 3480 

TTGGGTGTAT AATCTGACAT TTGAGTTTAA CAACTTCAAC ATGGGTGCAG TTGTGTCATT 3540 

AATTATTGGA TTTATTGTTG CTATTGTCGC ATTTATTCAA TTCAGACGTA CAAGTACGTT 3600 

TAAAGATGAG GGAGGTTTAT AAGATGACAA AGAAGAAAAA CATATTAAAA GCAATCGGTA 3660 

TTTACAGTTT TATAGCGATG ATGTTTGTCA TCATTTTATA TCCACTACTG TGGACATTTG 3720 

GCATTTCCCT TAATCCAGGT ACGAACTTGT ATGGTGCCAA AATGATACCA GACAATGCAA 3780 

CATTTAAAAA TTATGCATTC TTACTATTCG ATGACAGTAG TCAATACCTG ACTTGGTATA 3840 

AAAATACGCT TATCGTAGCA TCTGCAAATG CACTGTTTAG TGTGATATTT GTCACGTTAA 3900 

CAGCATATGC TTTTTCTAGA TATCGCTTTG TTGGTCGTAA ATACGGGCTG ATTACATTTT 3960 

TGAOTTTACA AATGTTCCCT GTATTAATGG CAATGGTCGC AATCTATATT TTGCTAAATA 4020 

CAATTGGATT ATTAGATTCT TTATTTGGAC TAACACT3GT ATATATTGGT GGATCAATAC 4080 

CGATGAATGC CTTTTTAGTG AAAGGTTACT TCGATACGAT TCCAAAAGAA CTTGATGAAT 4140 

CTGCCAAAAT TGATGGTGCA GGGCATATGC GTATTTTCTT ACAAATTATG CTTCCATTAG 4200 

CTAAGCCGAT TTTAGCAGTT GTTGCTTTGT TCAATTTTAT GGGGCCATTT ATGGACTTTA 4260 

TATTACCTAA AATACTATTA AGAAGTCCTG AAAAATTCAC ATTAGCAGTT GGATTGTTCA 4320 

ACTTTATTAA TGATAAGTAT GCAAATAATT TCACAGTGTT TGCAGCAGGG GCAATTATGA 4380 

TTGCAGTACC TATAGCAATC GTATTCTTGT TCTTGCAACG CTATTTAGTA TCAGGTTTAA 4440 

CAACAGGTGC GACAAAAGGT TAGTTTGAAA TTAGGAGTGG GGCAGAATTG ATAAAGAACC 4500 
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GGGTGTGGTG GTATTGCGAA TGGCAAGCAC ATGCCAAGTT TACAAAAAGT TGAAAATGTT 4620 

GAAATGATCG CATTTTGTGA CGTAGACATT TCGAAAGCAG CGAGTGCGGC AGAAGCATAC 4680 

GGAACTGACA ATGCAAAGGT TTATGATGAT TACAAAGCAT TGTTAAAAGA TGACACGATT 4740 

GATGTTATCC ATGTTTGTAC GCCAAATGAC TCGCATTGTG AAATTACTGT AGCAGGGTTG 4800 

CATGCTGGTA AACATGTGAT GTGTGAAAAA CCAATGGCTA AAACGACAGC AGAAGCTCAA 4860 

AAAATGATAG ATACAGCTAA ATCAACAGGT AAAAAATTAA CAATAGGTTA TCAAAATCGT 4920 

TTCCGAGCAG ATAGTCAATT TTTACATCAA GCAGCGCAAC GTGGCGACTT AGGAGACATT 4980 

TACTTCGGAA AGGCACATGC CATTCGTCGT CGAGCAGTAC CAACATGGGG TGTCTTTCTA 5040 

GACGAAGAAG CTCAAGGTGG AGGACCATTA ATCGATATCG GTACACACGC TTTAGATTTA 5100 

ACGTTATGGA TGATGGATAA TTATGAACCA GAATCAGTGA TGGGTTCAAC ATTCCATAAA 5160 

TTAAATAAAC AGCATCATGC GGCAAACGCT TGGGGTTCAT GGAATCCAGA TGAATTTACA 5220 

GTTGAAGATT CTGCGTTTGG ATTTATTAAA ATGAAGAATG GAGCGACGAT CATTTTAGAA 5280 

TCCGCTTGGG CGATTAATTC TTTAGAAGTG GATGAGGCAA AATGTTCATT ATCAGGAACT 5340 

AAAGCAGGTG CTGATATGAA AGATGGTCTA CGTATTCATG GTGAAGACAT GGGTACACTT 5400 

TATACCAAAC ACGTTGAATT GGAAAACAAA GGCGTCGACT TTTATGAAGG TAATGAAGTG 5460 

GATGAAGCTG AAGAAGAAGC AAAAGCTTGG ATTGATGCAG TTGTAAATGA TACTGAACCA 5520 

GTTGTGAAAC CGGAACAAGC AATGGTAGTT ACAAAAATTC TTGAAGCGAT TTATCAGTCT 5580 

GCAAAATCAG GCAAAGCAAT TTACTTTGAA TAACATCATA CGGTAAGGAG GCACATCATG 5640 

ACAAAATTAA AAGTTGGTGT GATAGGTGTT GGTGGTATTG CACAAGACCG TCATATTGCA 5700 

GCATTGCTGA AACTCAAAGA CACAGTCTGA TTAGTTGCAG TACAAGATAT TAATACAGTG 5760 

CAGATGATTG ATGTTGCGAA gCGCTTTAAT ATACCTCATG CAGTTGAGAC ACCTAGCGAG 5820 

CTGTTTAAAC TTGTTGATGC GGTGGTCATT TGTACAGCTA ATAAATTCCA TGCTGATCTT 5880 

TCTATAGAAG CATTGAACCA TGGTGTCCAT GTATTGTGTG AAAAGCCAAT GGCGATGACG 5940 

ACGGAAGAGT GTGATCGCAT GATTGAAGCG GCTAATAAAA ATCACAAATT ATTAACTGTC 6000 

GCATATCATT ATCGTCACAC AGATGTGGCA ATTACTGCTA AAAAAGCAAT TGAATCAGGT 6060 

GTGGTTGGTA AACCTTTAGT AGCACGTGTA CAAGCGATGC GTAGGCGTAA AGTGCCTGGC 6120 

TGGGGTGTTT TTACCAATAA AGCGTTGCAA GGTGGCGGTA GTTTAATCGA TTATGGTTGC 6180 

CACTTGTTAG ACTTATCTTT GTGGCTACTA GGTAAAGATA TGGTGCCGCA TGAAGTGCTA 6240 

GGAAAAACAT ATAATCAATT GAGCAAACAA CCGAATCAAA TTAATGATTG GGGAACATTT 6300 
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GCAAGCATGC AGTTTGAATG TTCGTGGTCT GCAAATATCA AAGAAGATAA GGTTCACGTT 6420 

AGTTTATCAG GAGAAGATGG CGGTATCAAT TTATTTCCAT TTGAAATATA TGAGCCCCGC 6480 

TTTGGAACTA TTTTTGAAAG CAAAGCTAAT GTTGAGCATA ACGAAGACAT TGCTGGTGAG 6540 

AGACAGGCGC GTAACTTTGT CAATGCGTGT TTAGGGATAG AAGAGATTGT GGTGAAACCG 6600 

GAAGAAGCAC GCAATGTAAA TGCCCTTATA GAAGCGATTT ATCGTAGCGA TCTTGATAAC 6660 

AAGAGCATAC AACTTTAATG ATTATCATAT ATGATACAAA ATTCTCAATA TAAAAAGAAG 6720 

GAGTGCTTTT CAATGAAAAT AGGTGTATTT TCAGTATTAT TTTACGATAA AAATTTTGAA 6780 

GATATGTTAG ATTATGTCTC AGAATCTGGA TTGGATATGA TTGAAGTTGG AACAGGTGGT 6840 

AACCCAGGAG ATAAATTTTG TAAGTTAGAT GAGTTGTTAG AAAATGAAGA CAAGCGCCAA 6900 

GCATTTATGA AGTCAATCAC AGACAGAGGC TTACAAATAA GTGGTTTCAG TTGTCATAAC 6960 

AATCCAATTT CTCCAGATCC GATAGAAGCG AAAGAAGCCG ATGAAACGTT ACGTAAAACA 7020 

ATCCGTTTAG CAAATCTATT AGACGTGCCA GTTGTTAATA CATTTTCTGG CATTGCAGGA 7080 

TCAGATGATA CCGCTAAAAA GCCTAATTGG CCTGTTACAC CTTGGCCAAC AGCCTACTCT 7140 

GAAATTTATG ATTATCAGTG GAATGAAAAG TTGATACCAT ATTGGCAAGA TTTAGCTGAG 7200 

TTTGCAAAAG AGCAAGATGT AAAAATTGCC ATAGAGTTGC ATGCAGGATT TTTAGTGCAT 7260 

ACACCATATA CAATGTTGAA GTTACGTGAG GCTACAAATG AATATATCGG TGCTAACTTA 7320 

GATCCTAGTC ATCTATGGTG GCAAGGTATT GACCCAATTG CTGCGATTCG CATATTAGGC 7380 

CAAGCAAATG CAATTCATCA CTTCCATGCT AAAGATACGT ATATTAATCA AGAAAATGTA 7440 

AATATGTATG GTCTAACTGA TATGCAACCA TATGGTAACG TTGCGACAAG AGCATGGACA 7500 

TTCCGTACAG TTGGTTATGG ACATAGTCCA TATGTATGGG CAGATATCAT AAGTCAACTT 7560 

ATTATTAATG GATATGATTA TGTATTAAGT ATTGAACATG AAGATCCTAT TATGTCAGTA 7620 

GAAGAAGGTT TCCAAAAAGC TTGTCAAACT TTGAAATCTG TTAATATTTA CGACAAGCCA 7680 

GCAGACATGT GGTGGGCATA ATACGAACTC GAGGTTAGTC TGAAGTTTGT CTGAAGTAAG 7740 

ACTGGTGGCA GTGTTGAATA AATGCATATG TCGCCAAGCC ATTGCCAAAA ATTTCACACC 7800 

TTAAATCAAG TCATTGTTTG TAAAGAAGGT GTACTTTATA TAAGTATATA GCGATGGTCA 7860 

TACCCATTCA CAGTAACAAT CCTCACCATT GAAAAGAGTA TATAACCTTT TCAATAGTGA 7920 

GGTATATGAT AATAAAAAAA GCCTGTTGTC ACAATGGTCA TAGACACGAC ATACTTTAAA 7980 

GGTTTCTGAA TATAATATTT CAGAATGCAC TTTAAAGATG GACGTCGATG TAGACTAAAG 8040 

TGATGACAGG CTTTCATCTT TTTAAATATT CATTAATTTC TCTTCTTGTT TAATACGTAC 8100 
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5 



10 



15 



30 



35 



40 



45 



50 



TAATACACCG 


ATTAATTCAG 


GAATGATGTT 


TAAGAAGTAA TTTGGGTGTT 


TTGTAATTTT 


B220 


ATATAATCCA 


GATTTAATAA 


TAGGATGGTT 


AGGTAAAATG 


AATAATTTTA 


ATGTCCAAAT 


B280 


ACCACCTAAA 


GTTTTAATAA 


CCATAAATAA 


CATGATATAA 


GCAAAGATTA 


ATATAACTAA 


8340 


GCCAATACCA 


TTTGCAAAGC 


TAAATGTATC 


TTTATTAATA 


AATGCCTCTA 


CACCAGCCAA 


8400 


TACATAAATT 


AAAACGTGTG 


TTATTGCTAA 


AAACTTCGAA 


TTTTTAACGC 


CATATTCAAC 


8460 


TGCACCGTCT 


GCTTTTAATT 


GTTTTGAGTG ATTAATAGAT 


ATCTTTAAGC 


TGACAAGTCT 


8520 


GATACAGAAA AAGATAAGTA ATATAGATAG 


AATCATGATG 


TCCTCCGTCA 


TTATGTCATA 


8580 


TGTATAAGCG 


TTGATTTTGA 


CAACATAAAG 


TATTTTATAG 


ATAAAGCTTG 


TCAAATACTA 


8640 


TTAACTATTT ATTAATTTTA 


GTACATAAAT 


ATGTTTCTAA 


GTATGTGTTT 


ATGTTCAGTA 


8700 


TTTTGGATAA 


TTTAATAATT 


TTAAGGATAT 


TAAGCGCTTA 


CACCGACGTG 


ATATATTTGG 


8760 


CTTAACGAAA 


ATGATTGAGG 


TGACAGAGAT 


GAAcmriT 


GATATCCATA 


AGATTCCGAA 


8820 


CAAAGGCATT 


CCATTATCGG 


TACAACGTAA 


ATTATGGCTT 


AGAAACTTGA 


TGCAAGCTTT - 


8880 


CTTCGTAGTG 


TTCTTTGTTT ATATGGCTAT 


GTATTTAATT 


CGAAACAACT 


TTAAGGCGGC 


8940 


ACAACCGTTT 


TTAAAAGAGG 


AAATTGGATT 


ATCTACATTA 


GAACTTGGTT 


ATATCGGATT 


9000 


AGCATTTAGT 


ATCACGTACG 


GTTTAGGAAA 


AACATTACTT 


GGATATTTTG 


TCGATGGACG 


9060 


TAACACAAAA 


CGTATTATCT 


CGTTCTTACT 


TATCTTATCT 


GCGATTACAG 


TTTTAATTAT 


9120 


GGGATTTGTT 


TTAAGTTACT 


TTGGTTCTGT 


AATGGGATTA 


TTAATTGTAC 


TTTGGGGACT 


9180 


TAACGGGGTG 


TTCCAATCAG 


TTGGTGGACC 


TGCAAGTTAT 


TCAACGATTT 


CAAGATGGGC 


9240 


GCCAAGAACG 


AAACGTGGCC 


GATACTTAGG 


ATTCTGGAAT 


ACATCACATA 


ATATCGGTGG 


9300 


TGCCATAGCA 


GGTGGTGTTG 


CACTTTGGGG 


TGCTAATGTA 


TTCTTCCATG 


GAAATGTTAT 


9360 


AGGGATGTTC ATTTTCCCAT 


CGGTGATTGC ATTACTTATT 


GGTATCGCAA 


CATTATTTAT 


9420 


CGGAAAAGAT 


GATCCGGAAG AATTAGGATG 


GAATCGTGCT 


GAAGAAATTT 


GGGAAGAGCC 


9480 


GGTCGATAAA GAAAATATTG ATTCTCAAGG TATGACGAAA TGGGAGATCT 


TTAAAAAATA 


9540 


TATGCTGGGA 


AATCCTGTTA 


TATGGATTCT 


ATGTGTTTCA AACGTCTTTG 


TATACATTGT 


9600 


ACGAATCGGT 


ATTGATAACT 


GGGCACCGTT 


ATATGTGTCA 


GAGCATTTAC 


ACTTTAGTAA 


9660 


AGGCGATGCA 


GTTAATACGA 


TATTCTACTT 


TGAAATTGGT 


GCATTAGTTG 


CAAGTTTATT 


9720 


ATGGGGCTAC 


GTATCAGACT 


TATTAAAAGG 


TCGTCGTGCA 


ATTGTAGCTA 


TTGGCTGTAT 


9780 


GTTTATGATT 


ACATTTGTTG 


TCTTATTCTA 


CACAAATGCT 


ACAAGTGTCA 


TGATGGTTAA 


9840 


CATTTCATTG 


TTTGCATTAG 


GTGCGTTAAT 


CTTTGGTCCG 


CAATTATTAA 


TTGGTGTATC 


9900 
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CGCGTATCTA TTCGGTGACT CAATGGCGAA 
ACGTAACGGT TTAAACATCT TTGGATATAC 
CTTCTATGTT GCATTATTCC TAGGCATGAT 
AAAGAAAATT AGAAGTTTAA AAATTTAATA 
TGCAATATAG TTGGCAATCC TGCCCCGACG 

10 

TTCTACCCTT GCAAATATAT CATCTCTATG 
AAGTATCCTT ATTTGCCCGA CTTAGGGTGA 
ATATAGCGGT GTCAATATTG TAGGGAGTAT 

75 

ACTGAAAGAA AAGTGAAAAC TTCTCTATCA 
GCCATAATCA TTAGTTATGT GCTAATCTAT 

20 ATTTGTAAAT ATTTTTTAAA ATGATAGCTA 

TCTGCAATTT CATAGTGTTT GTAATGTCGA 
TAGCGATTTA AATAATCGAC AATTGTAATA 

25 TACGATTCAC TAACATCGAT ATGTTGAATT 

GATTGCTTAA TTTGATCCAC AATCTGGTTT 
AGTGGTTGGA AGGCATCATG ACAAGATGCT 

30 

AATAAGGTAC GGACAAGTCT TCCTAAAATA 
AATAAATAAT CAAGAACATG ATGTTGAATG 
CTCGATAATA TGATGACATT ACAATCTAGA 

35 

TTTTTACGTG GCATACGAAT ATCAGTAATT 
GATAATGCTT CAACACCATC TTTAGCAGTG 

40 GGAATGATTT GCTTTAATCC TTCTCGAATA 

AACATCTACA TTCCCCCTTG AAAGTGGTAT 
CTTTGAAAAA ATATGGAGTC GTGCATGTGA 

'' 4S ATGATTTAAT CCCAAATGCT TAGTATCAAA 
TTGCAGGCGA GATGACGACA TCCCGATACC 
ACGTGCCAAT GTCAGGCGTA TAGTAATGTC 

50 

AGCATTTTCT ATGAGTGGCT GAAGCATCAT 
AGAACTTTCA ATATGGAGCT TAATCATGTC 

55 



AGTTGGTTTG GCGGCTATTG CTGATCCAAC 10020 

ATTAAGTGGA TGGACAGATG TTTTCATCGT 10080 

TCTATTAGGA ATCGTTGCTT TCTATGAAGA 10140 

TAAATCGGAT TAAAAGTATC GCCAATCTAT 10200 

GCATGTGCGT GAAGAGATGA AAGATACTGC 10260 

TCTCGGGGCA GATCATAATT CCCTGTTATG 10320 

CTCAATGAAT TTACTCCTTA CAATAAAGAC 10380 

TGTTTTATAT TTAAACTCTC TAAAAAGCGG 10440 

GTCCG CTTTT TCATAGAACA AAATGGAGGC 10500 

TTTGCTTGCT TACAATAATC ACTTGGCGAC 10560 

i 

AACATTTTAT ACTCTGAAAA GCCTACTTTG 10620 

TCTAACAATT GCAGAGATTG TAAAATACGA 10680 

CCAACATGAT CTTTAAATGT TCGCATCGCA 10740 

AAATCTGAAA CAGtCACTTT CGTTTGATAA 10800 

ACA7AATAAT CATCGTATTC TACTTTTAAT 10860 

AAGCTACGGC CGTTCTGTGA TTGTTGCTCT 10920 

ACTTCTAATT GTGCATGGTC TACTGGTTTT 10980 

CCGGCTTTCA TATATTCAAA GTCATCGTAA 11040 

TGCGCAATAT CATTGAGTAA ATCGACGCCA 11100 

ACTAATTCTG GCTGATGTTG TTGAATTAGT 11160 

TATATTGTAT TGAAATGATA GTCTCCCCAA 11220 

ATTCGTTCAT CATCACAAAT AACTACCTTA 11280 

TTTATAACAA ATTAACGTAC CTTGATTACG 11340 

ACCATATTGA ATCATTGCTT TATTGTGTAA 11400 

TACATCATTA TTAAGAGATT GGCGTACATA 11460 

ATTGTCGCAA ACTAAAACAT GTAAATTCTG 11520 

CAATGACTCA GTATCTCTAC CATGTTTAAT 11580 

TTTACCAATT GTCTGGTGAC GCGCTTCTTC 11640 

ATCAAAACGG aTGTTTTGTA TTGCAACATA 11700 
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GTAACGTAAC ATTTGCGATA ATTGTTGGAC 


CACAGTTtGT 


GCTAATTTCG 


GAGATAACGT 


11820 




AATTAAATAT TGTATTGTTT 


GCATCGTATT 


GAATAGGAAA TGAGGCTGGA ATTGGCGTTC 


11880 


5 


TATTTCCTTT AACTGAATAT 


CACGCAAGCG 


ACGTTCTGTA 


TGCTCGATAG 


AATGGATCAG 


11940 




TTGCTCATTT GATTCAAATA AATCGTAAAT 


ATAATTATTA 


ATTTCTTCTA 


GTTCACTGTT 


12000 




GTTTTTTAAA GGCGTATATG 


TACCTAGATG 


ACGATTTTTG 


GCATAGTAAA 


TTTTTTGAAT 


12060 


10 


AATCGTTTCG ATATCTTTTG 


TTTGTCGTTT AGCCATATTA 


TCTGCGCTAA 


TGAAACCAAA 


12120 




TATTACTAGT AAAACAAGAA CTACGGCCAT AACAATTAAC AACGTGATAC 


CATCTTCAAT 


12180 


15 


GTTTTCATGT ATATCTTTAT AAATAATGAG ACGATGGTCA GCATGGTTTA ATTTTACAGA 


12240 


TTCATTCATA AATCCGAATT 


GTTGTGGTcT ATACTTTTCA 


CCTATAGTAA AACGGTCATC 


12300 




GTTGGCGTAT AAAATATTGT 


CATATTGATC 


AmCGATAAGT 


GCGAATTGTC 


GGTTATCTTT 


12360 


20 


CtTAATTTCA CTTAAACGTG 


GGGTGTtAGC 


CATATAAATt 


TTaAGCATAT 


ATGTACTATT 


12420 




TTTGAATTTA AGCTGATGCG 


TTGAAAATAA ATACATATTT 


TTAGTGTTTA 


AATGTTCATA 


12480 




ATTATTGGTT ATAAACTGAT 


TTGGTCCAGA 


TAATTCATAA 


TAAAGTGTTG 


CGGGCTGTTG 


12540 


25 


GkGTATTAAT TTTAATAATT CACGTTTTGT AGCGGTCACA TCATGATGAT TTGyTAAATC 


12600 




GAGCTCTTGA AACGAATTAT 


TATGCTGTGT 


AATAAATGTC 


TGAATCTGCT 


TTTCAGTATG • 


12660 




ATGTAAAGAT GACTGACTTT 


CATCAACATG 


TTGATGAATC 


GTACGATGCT 


CAATCCAAAT 


12720 


30 


ATAGATGGCA TAGAAGCTTA 


CTAGTCCAAT 


AATAATGACT 


AAAAATACTG 


GAAAAATAGT 


12780 




AGACnCAAAT AACGATCGTC 


TTAATTGATG 


TCTATAAGGT 


TTGTATGCCn 


TCATTGAATC 


12840 


35 


ATCTCCAAAA ATTTATGATG 


TGGAATATCC 


GGTAATTTAG 


ATTTCGGTAT 


TAAAGGTATG 


12900 


TTCTTAAGAT TTTCGATAGA 


CTGATCGCTT 


TGTTCACTAA 


CATCCTTTCG 


AATTGACTTG 


12960 




GCATCGAACT CTGCAACTAA 


TCGTtGTTGT 


ACTGAGCGGC 


TTGTTAAATA 


TTGCACTAAC 


13020 


40 


TTTTTACGCT TAGGATGAGG 


GTGTGCATTT 


TTAACTAAAG 


CAATrCCATC 


AACATTTAAC 


13080 




ATTGTTCCTT CAATTGGATA AACGATTGAT 


ACAGGATAAC 


CTTTGTTTTT 


CCATGTGCGT 


13140 




GCATCTTGTT CGTAGCTTAG 


ACCTGCGTAA 


TATTTACCTT 


TTGCAACATC 


TTCAATGACT 


13200 


45 


TTAGACGTCT TTGACAGTTG 


CATCGCATGG 


TTTTGGAATT 


GATG CACATC 


ACTTACTCGA 


13260 




TGATGCATGC TATAAATAGC 


ACGCATATGT 


TGATAGCCTG 


TCGTTGTTGT 


ATTTGGATTT 


13320 




GAGTACGCAA TTTTACCTTT 


AAGTATAGGT 


TGTAATAAAT 


CTTGATAACC 


TCGAATCTTA 


13380 


50 


ATATCTCCTT GTAAATCTGA 


ATTCACTACT 


ATAACTGTTG 


GCATTAATAG 


AAAACTAGTA 


13440 




ACATATTTAT TGTTCGAGCG 


ATAATCCTCT 


AATTGCTGTG 


TTACAGATGT 


ATCTTGATAG 


13500 



55 



648 



E P 0 786 519 A2 



CCACGCTCCG AAAAATCTTC GTTATGCAAG TTTGAAAGCA GTACTTGAGT AGATCCGTGT 13620

TTAATTTCAA TTTTGACATG CTCTTGTTTT TCAAATTCAT TTAAAATTGG ACGAATCAAG 13680

TTTGATTGAT ACGGAGAATA AACTGTTAAT ACATTTTTAT CGGATTCAGA GTGACGCGTA 13740

TTAGCGCATG CTGaTAAAAA AATGAGAAAT AATAGCAAGA TATAAATTTT TGATTTCATG 13800

ATATCCCATC AATTCTATGT ATATTTTAAT ACAATAATTT TAGCAATAAA TGACGCATAA 13860

GTAATGTTAA ATATTTAGAA ATGTTTATAG ATGACTTGTT AAGACGTTGC AAATGTTGTG 13920

ATAGCACAAA ATTTTTGTTT GTCAAGACGA TTTACCGAGG CTGTAAAATC AAACTGTTAT 13980

ATTTTATTTG TAGCTGTTAT ATAAAAATCG GCAAGATATT GAACGGTTCA AAAGTGAATT 14040

TTTACGTCAA TAAAAGTATT TAATCCAGTC TCTTCATATA TAAAAGTAAA TCTTTCTAAG 14100

TGTTGATTTA ACGCTTATCA ACAATCATTT TTTATAAACA AATATATACT CCTAAATTAA 14160

CTTTTAAAGC AATGAAAATA GTGAACATTA TAACTGTTGT GTAACAGAAT GCAATTAGCA 14220

TATTACTGTT ACACAAATTA GTACAGTTTC TATGTTTTGA CATACATTTG ATGAAAATTG 14280

TACATAATTT ATGTGAAAAA AATCACAACA AACATGCTAC AATGACTATG AAAACGTTAA 14340

CATAGCATTT CAAATTCACA ACATTATACA GATGGAGGCG TTTAGTATGT TAGAAACAAA 14400

TaAAAATCAT GCAACAGCTT GGCAAGGATT TAAAAATGGA AGATGGAACA GACACGTAGA 14460

TGTAAGAGAG TTTATCCAAT TAAACTACAC TCTTTATGAA GGTAATGATT CATTTTTAGC 14520

AGGACCAACA GAAGCAACTT CTAAACTTTG GGAACAAGTA ATGCAGTTAT CGAAAGAAGA 14580

ACGTGAACGT GGCGGCATGT GGGATATGGA CACGAAAGTA GCTTCAACAA TCACATCTCA 14640

TGATGCTGGT TATTTAGACA AAGATTTAGA AACAATTGTA GGTGTACAAA CTGAAAAGCC 14700

ATTCAAACGT TCAATGCAAC CATTCGGTGG TATTCGTATG GCGAAAgcAG CTTGTGAAGC 14760

TTA£GGTTAC GAATTAGACG AAGAAACTGA AAAAATCTTT ACAGATTATC GTAAAACACA 14820

TAACCAAGGT GTATTCGATG CATATTCTAG AGAAATGTTG AACTGCCGTA AAGCAGGTGT 14880

AATCACTGGT TTACCTGATG CATACGGACG TGGACGTATT ATCGGTGACT ATCGTCGTGT 14940

AGCTTTATAT GGTGTAGATT TCTTAATGGA AGAAAAAATG CACGACTTCA ACACGATGTC 15000

TACAGAAATG TCAGAAGATG TAATTCGTTT ACGTGaAGAA TTATCAGAAC AATATCGTGC 15060

ATTAAAAGAA TTAAAAGAAC TTGGACAAAA ATATGGTTTC GATTTAAGCC GTCCAGCAGA 15120

AAACTTCAAA GAAGCAGTTC AATGGTTATA CTTAGCATAC CTTGCTGCAA TTAAAGAACA 15180

AAACGGTGCA GCAATGAGTT TAGGTCGTAC ATCAACATTC TTAGATATCT ATGCTGAACG 15240

TGACCTTAAA GCAGGCGTTA TTACTGAAAG CGAAGTTCAA GAAATTATTG ACCACTTCAT 15300

AGACCCAACT TGGGTAACTG AATCTATCGG TGGTGTAGGT ATTGACGGAC GTCCACTTGT 15420

TACGAAAAAC TCATTCCGTT TCTTACACTC ATTAGATAAC TTAGGTCCAG CTCCAGAACC 15480

AAACTTAACA GTATTATGGT CAGTACGTTT ACCTGACAAC TTCAAAACAT ACTGTGCAAA 15540

AATGAGTATT AAAACAAGTT CTATCCAATA TGAAAATGAT GACATTATGC GTGAAAGCTA 15600

TGGCGATGAC TATGGTATCG CATGTTGTGT ATCAGCGATG ACAATTGGTA AACAAATGCA 15660

ATTCTTCGGT GCACGTGCGA ACTTAGCTAA AACATTACTT TACGCTATCA ATGGTGGTAA 15720

AGATGAAAAA TCTGGTGCAC AAGTTGGTCC AAACTTCGAA GGTATTAACA GCGAAGTATT 15780

AGAATATGAC GAAgTATTCA AGAAATTTGA TCAAATGATG GATTGGCTAG CAGGTGTTTA 15840

CATTAACTCA TTAAATGTTA TTCACTACAT GCACGATAAA TACAGCTATG AACGTATTGA 15900

AATGGCATTA CATGATACAG AAATTGTACG TACAATGGCA ACAGGTATCG CTGGTTTATC 15960

AGTAGCAGCT GACTCATTAT CTGCAATTAA ATATGCACAA GTTAAACCAA TTCGTAACGA 16020

AGAAGGTCTT GTAGTAGACT TTGAAATCGA AGGCGACTTC CCTAAATACG GTAACAATGA 16080

CGACCGTGTA GATGATATTG CAGTTGATTT AGTAGAACGC TTCATGACTA AATTACGTAG 16140

TCATAAAACA TATCGTGATT CAGAACATAC AATGAGTGTA TTAACAATTA CTTCAAACGT 16200

TGTATACGGT AAGAAAACTG GTAACACACC AGACGGACGT AAAGCTGGCG AACCATTTGC 16260

TCCAGGTGCA AACCCAATGC ATGGCCGTGA CCAAAAAGGT GCATTATCTT CATTAAGTTC 16320

TGTAGCTAAG ATCCCTTACG ATTGCTGTAA AGATGGTATT TCAAATACAT TCAGTATCGT 16380

ACCAAAATCA TTAGGTAAAG AACCAGAAGA TCAAAACCGT AACTTAACTA GTATGTTAGA 16440

TGGTTACGCA ATGCAATGTG GTCACCACTT AAATATTAAC GTATTTAACC GTGAAACATT 16500

AATAGATGCA ATGGAACATC CAGAAGAATA TCCACAGTTA ACAATCCGTG TATCTGGTTA 16560

CGClSTTAAC TTCATTAAAT TAACACGTGA ACAACAATTA GATGTAATTT CTCGTACATT 16620

CCAfGAAAGT ATGTAACAAA ATTTAAGGTG GGAGCACTAT GCTTAAGGGA CACTTACATT 16680

CTGTCGAAAG TTTAGGTACT GTCGATGGAC CGGGATTAAG ATATATATTA TTTACACAAG 16740

GATGCTTACT TAGATGCTTG TATTGCCACA ATCCAGATAC TTGGAAAATT AGTGAGCCAT 16800

CAAGAGAAGT CACAGTTGAT GAAATGGTGA ATGAAATATT ACCATACAAA CCATACTTTG 16860

ATGCATCGGG TGGCGGTGTA ACAGTCAGTG GTGGCGAACC ATTGTTACAA ATGCCATTCT 16920

TAGAAAAATT ATTTGCAGAA TTAAAAGAAA ATGGTGTGCA CACTTGCTTA GACACATCGG 16980

CTGGATGTGC TAATGATACA AAAGCATTTC AAAGGCATTT TGAAGAATTA CAAAAACATA 17040

CAGACTTGAT ATTATTAGAT ATAAAACATA TTGATAATGA CAAACATATT AGATTGACAG 17100

TATGGATTCG ACATGTCCTT GTGCCTGGTT ATTCTGATGA TAAAGACGAT TTAATTAAAC 17220

TAGGGGAATT TATTAATTCT CTTGATAACG TCGAAAAGTT TGAAATTCTG CCATATCATC 17280

AGTTAGGTGT TCATAAGTGG AAAACATTGG GCATTGCATA TGAATTAGAA GATGTCGAAG 17340

CGCCCGATGA TGAAGCTGTT AAAGCAGCCT ACCGTTATGT TAACTTCAAA GGGAAAATTC 17400

CCGTTGAATT ATAAATACAA TTCAGACCGA AAAGAAAGCA TATGCAACTT CAAGAGTGAA 17460

GGGGCATATG CTTCTTTTTC AATTGAGTAT TGAGTATTAG CAAGACGTAG TAAGTATATG 17520

AGACAACTTC TACAATGGTT GAAGGAAGAC GTTTTTGTAA GTAGCTATGC TGATAAAGAA 17580

TGTGATGTCT TGTTAAAGGT GGGGTTCCAA TATCATCATT TAGCTGATGT TGAATGGGTT 17640

ATTATTTGCT ACTTGCATAT GAATATGAGT CTTTTCAAAT TTTTATTGAC CCTGAGTAAT 17700

GAAAAATATT AAGATGAAAC TTAATATTAA AgCAATGCGG AGCGTGATTA TGAAGAGAAT 17760

TAGTAAAGAT ATATGGGCAG TATTTAAATT ACTGTATCaA AATAAAGGGC GTTTTAGCAT 17820

TAATGCCTTA CTATTGCAGT TAATCATGAT TTTTATTAGT AGTACATACT TAATTTTACT 17880

ATTTAATATG ATGTTAAAAG TAGCTGGcAA AGCCAACTTA CGATTAACAA TTGGACGGAA 17940

ATCGTTAGTC ATCCCGCCAG TGTGATACTT CTTATTATAT TCATATTAAG TGTTGCCTTT 18000

CTGATTTATG TAGAGTTTTC ATTGTTAGTT TATATGGTTT ATGCCGGGTT TGATCGACAG 18060

ATTATTACAT TTAAATCCAT TTTTAAAAAT GCCTTTGTAA ATGTGCGTAfc ACTCATAGGT 18120

GTACCAGTTA TTTTCTTTGT CATTTATTTA ATGTTAATGA TACCCATTGC CAACCTAGGA 18180

CTAAGTTCAG TATTAACAAA AAATATTTAC ATACCTAAAT TTTTAACGGA AGAACTTATG 18240

AAAACGACGA AAGGTATAAT CATTTACGGT ACCTTTATGA TTGCTGTATT TATATTAAAT 18300

TTTAAATTAA TATTTACTCT ACCGTTAACG ATTTTAAACC GCCAGTCGTT ATTTAAAAAT 18360

ATGAGACTAA GTTGGCAAAT TACGAAGCGA AATAAGTTTC GGCTTGTTAT AGAAATAGTT 18420

ATATTAGAAC TCATCATTGG TGCGATTTTA ACATTAATTA TTTCAGGAGC AACATATCTT 18480

GCTATTTGTG TAGATGAAGA AGGAGATAAG TTTTTAGTCT CATCAATTTT ATTTGTTGTA 18540

TTGAAAAGCG CATTGTTCTT CTATTATkTA TTtACGAAAT TATCATTAAT CAGTGTGTTA 18600

GTACTGCACT TAA 18613

(2) INFORMATION FOR SEQ ID NO: 113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1214 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 113:

AAAGTTTTAA AAGGGGTGAG ATACTTGGCG AATAATCCAT TCCAGCTTTG CGTTTAAAAG 60

GAATTATACT TGCCATTGTC GGTGCTTGTT TATGGGGATT AGGTGGTACT GTTTCTGATT 120

TCTTGTTCAA ATATAAGAAT ATTAATGTCG ATTGGTACGT CACTGCTCGA CTTGTAGTCA 180

GTGGTGTTTT CTTACTTATT ATGTACAAAA TGATGCAACC CAAACGTTCA ATATTTAGCG 240

TATTCCAAGA TCGACGTATG TTAGGCAAAT TACTTATCTT CAGTATACTG GGCATGTTAG 300

TAGTACAATA TGCTTATATG GCATCTATTA ATACAGGTAA TGCTGCGATT GCAACATTAC 360

TACAATACAT TGCGCCAGTT TATATTATTA TTTGGTTTGT CATAAGAGGC GTTGCAAAAC 420

TAACATTATT TGATGTGCTT GCTATTATCA TGACACTATT AGGAACATTT TTATTATTAA 480

CAAATGGTTC ATTTTCTAAT TTAGTCGTCA ATCCTGCAAG TTTATTCTGG GGTATTTTAG 540

CTGGTGTAGC ACTCGCTTTT TACACAATTT ATCCTTCAGA CCTACTTAAC CGCTTCGGTT 600

CGATTCTAAT TGTCGGGTGG GCAATGCTTA TTTCTGGTGT TGCGATGAAT TTACGCCATC 660

CAATTTGGCA CATTGATATC ACTAAATGGG ACATATCAAT TATATTATTT TTAATCTTTG 720

GTATTATCGG TGGTACCGCA CTCGCATTTT ATTTCTTTAT CGACAGTTTA CAATACATAT 780

CAGCGAAAGA AACAACATTA TTCGGAACTG TTGAACCTGT CGTAGCCGTT ATCGCAAGCA 840

GTCTATGGTT ACATGTGGCA TTCAAACCAT TTCAAATCGT AGGCATCATT CTTATTATGA 900

TTTTAATTTT ATTACTATCA CTTAAAAGAC AACCTGAAAC ATTAGATGAA TAAGAAAACT 960

CTGATAATCA CTTTAGCAAG TAACTATTAT TTAACAACGT AGTTACCTTA TAGGTGATAT 1020

CAGAGTTTTT TATTTTAGTT AATAATATTT TTCACTTGGT ATAAAAAaGC GTCGTCGCTC 1080

TGGTAATCGG AAATACTGGA ATAAAATATG GAATTGGGTA ATAATCCCAG GTAnTAAAAG 1140

TCCAJGTTCC GATAnCCTnT CCGCAnCTCC AACCAAATTT GCCGATAAGG TTCCAAAAGG 1200

CATCCTGGGG GTAC 1214

(2) INFORMATION FOR SEQ ID NO: 114:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9458 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 114:

ATTTTGGTTT CATTCACGAT GGGGTnATAC AGCAAACACA nCTAAAATAA CTATCAATAG 60

CTTAGACAAT AAAAAATATG CCACTACAAT CGCTAATATT ACGATTAAAA AAGAAGCGTT

AACGATTACT TTCATCGTTG TTCTATCTCT GAACATCATA TTAAAGACAA CTAGACTAAT

TGATAATGAA ACAGCAAAAA AAGTAATAGC TAACACTAAT TTCATCATAA ATAGACAGAC

TAAACCTATG ACTAATAATG TATTAGAAAT TACAGCTGAC GTTTTTAACA TTCTCGaATT

AATATGCACT CACCCTTTTT ATTTAAATAA CTTACATAAT CATAATAATA CATGATGTTT

CATAGGCCTG TCGATGATTG ATTCACAATA GCACGTGATT TTTTTGTTTT TCAATATTAT

TCATTTATTC CATCAAAAAC ACCCTTTTTA ATTTTTACAA AAATTAAAAA AAGTGCTCCT

ACACTGCTTG CATGTAGAAA CACTTTTTCA TTGTAATGTT ATTCTTCTCG AGACATACCT

TTTAGCATAT TAAGCATGTA TGTTAAACTA CGGTTCATGT CGTCATCTTT CAATACGCCC

AATAGACTTC TTATAGTTGT CTTAGCATTT GGACTCGCTT GATTGGCAAC GTGTAATCCT

TTATTAACTT TATTTAGGAA GTCGCTTAAA TCTGATACAT TGAGTTCACC TAATAAAAAT

ACCATTGAAG CCATATTAGA TAATAGCCCT GTATAAATAT CTTTATTAAG TTCAACTGCA

AATTTATTTA TGATGACTTG ACGTCCTCGA ATTGCACCAT TTAAAGCATC TAATAGTTTT

GCATCATCTA ATGTTTTAAT AAGCTTGATT GCTTTTAATA TACTATCTTT ATTCGCTGCA

ATTGCCTCTG TAACTTCATT TAAACTTTCT AACTTAATTT GTTCTTCTGA TTTTTCTAAG

CGTCTAATTT TAGAAGATAT TCTCTCAGCC ATTATTTATC CACCTGATTT CCCGGGAAAA

CATAATCTGA ACGTTCCCAT TTTTTCTGTA CTTGAACACT GTACTGCGGT TGACGTTTTT

TATTGACACG GAAATTATTA GGGTTCAACG GTGACTTACC ACGTTTCGTA ATTACCTCCA

AACGACAGCT AGTACGTTTA TAAGATGGTG TATCCGTGTA TTGATCAACA TCACTaTTAG

TTAATAAGTT AATTGCACCT AGATCTCCAT TTTCCATCGC aTCaTTATTT AATGGAATAT

AGATTTCTTT acctttaaca cgatctgtca CGTGAACTTG TAATACCGCT TCTCCTGTyT

CAGAAATCAG CTTAACTTCT GCACCTTCAT GAATGCCTCT ATCTTCAGCA AGCTCTGGAG

AAATTTCAAC AAATGCACGT GGCACTTTGT ATTTAATCAT TGGTGTTTGA TAAGTCATAT

TACCTTCATG GAAGTGCTCT AACAATCGAC CATTGTTTAC ATGAATATCA TAAATTTCAT

CTTGCTTAAA GTAATTATCA AATGATAATG GGAATAATTT TGCTTTACCA TTATCAAAAT

TGAATCCTTC TAAGTATAGA ATAGGCTCAT CAGTACCATC AGGTTGTACT GGCCATTGTA

AACTATTGAA TCCTTCTAAA CGATCATAAC TTACCCCAGC ATATAGAGGT GTTAAGCGTG

CTACTTCATC CATAATTTCA CTAGGATGCT TGTAATTCCA ATCAAATCCT AATCTATTAG

CAATTGCTTG GAAAATTTTC CAGTCAGGTT TTkAATCACC AAGAGGTTCT AATGCTTGGT



TTGCTGGCAA TACAACATCT GCGTATGTTG CTGTGAATGT TAAAAATTCA TCTTGGACTA 1980

CCATGAAATC TAATTTTTCA AACGGAGCTT GTACAAAATT AATATTTGAA TCCACAATAC 2040

CCGTATCTTC ACCATATAAG TACAATGAGT GTACTTCTCC GTCATGTATA CCTTCTACCA 2100

TTTCATGATT ATCTTTACCA GCTTTTGGAT TCAATTTAAC GCCATATTCT TTTTCAAATT 2160

TAGCGCGAAT ATCATCCGCT TCAATACTTT GATAACCAGT AATCTTATCA GGCATACTTC 2220

CCATATCACT ACATCCTTGA ACATTATTAT GTCCACGTAA TGGATACGCA CCAGTACCAG 2280

GACGACGATA ATTACCTGTT ACTAATAATA AGTTTGAAAT CGCTGTACTT GAGTCACTAC 2340

CAATGTCTTG TTGTGTAATA CCCATTGCCC AACAAATTAC AACAGATTCA GCTTTAGCAC 2400

ATTCTTCAGC AAATTTAATC AATTCTGATT CAGGAATACC TGTTGCTTCT TCAGCAAAAG 2460

CCATTGTAAA TGTTTCTAAT GATTTGTAAT ATTCATCAAA ATCATCTACC CACTCATCAA 2520

TAAATGCTTT ATCGTGTAAA TCATGATCAA TAATATACTT AGTCACTGCA CTTAACCACG 2580

CTAAATCCGT ACCTGGTTTA GGTTGATAAA AACGATCCGC ACGTTCTGCC ATTTCATGTT 2640

TTCTAATATC AAATACATGT ATTTTTTGAC CAAATAATTT TTGTGCACGT TTCATGCGTG 2700

ATGCGATAAC TGGATGAGCT TCGGCTGTAT TAGTACCTAT CAATACAGAC ATTGCCGCTT 2760

TTTCTAAATC TTCAATACTA CCTGAGTCAC CGCCGTGTCC AACCGTTCTA AATAAGCCTT 2820

TTGTTGCAGG TGCTTGGCAA TATCTTGAAC AGTTATCAAC GTTATTTGTG CCAATAACTT 2880

GTCTTGCTAA TTTTTGCATT AAATACGATT CTTCATTCGT CGCTTTAGAA GAAGAAATGA 2940

ATGATAGTGC ATCTGGGCCA TGCTTTTCTT TAATAGCTGT AAAATTATCT GCAATGACGT 3000

TTAAAGCTTC ATCCCATTCT ACTTCATGGA ACTCACCATT TTTCCTTACT AGTGGTTTAG 3060

TTAATCGTTG ATCTGAATTA ATATGTCCCC ATGAAAACTT ACCTTTAACA CAAGTCGCAA 3120

TTTTATTTGC TGGAGAATCA TGTGATGGTT GTACTTTTAA AATTTCTCTA TCTTTAGTCC 3180

AAACTTCAAA TGAACAACCC ACACCACAAT AAGTACACAC TGTTTTAGTT TTCTTAATAC 3240

GCTCTTTACG CATTTCTGCT TCTGAATCTG AGATTGCAAA TAGTGGACCA TAACCAGGTT 3300

CTGCTTTTTT AGTTAAATCA ATCATTGCTG CTAATGAACC AGGTTCCGTA TCAGTCATAT 3360

AACCCGCATT ACCTTCCATA TTCACTTCCA TCATGGCATT ACATGGACAT ACCGTCGCAC 3420

ATTGACCACA AGATACACAT GAAGACTCAT TAATCGGTAC ATCATTATCC CAAATAACAC 3480

GTGGATGTTC ACGATCCCAA TCAATTCTAA TAGTTTCATT CACTTCGATA TCTTGACATG 3540

CTTCTACACA ACGCCCACAT AAGATACATT GATTTGGATC ATAACGATAA AATGGGCCGT 3600

AATCTTTTTC GTATGGCTTC TCTTTATATT CATACGTTTG ATGCTGAAGC CCCCATGCAT 3660

TATGCTTTTC TAAAATTCGA TCAAGCGCTT CTTTTTGAGC ATCTTTCACA TCATTGTTCA 3780

CAGTATTTAC AGTCATTGGA CGATCAATCA CCGTACTACA TGAACGTTCA ATTTTACCGT 3840

CAATCTCAAC AGTACATGTA TCACATGTTT GAATTGGTCC CATCGACTCG TTATAACAAA 3900

TTGAAGGTAC AAAAGTATCT TGTGATTTAA TAAATTCAAG TAAATTCGTA CCTGGTTCTA 3960

CAAGATAATC TTTTCCATCA AGTGTAACCA CCAAATGTTC TTGCATATTA CTCACCCCGT 4020

CTATATATAT TTTCCGTAAA TGACTTTTAA TAAATTGCTC ATATCCACCT AAAATAACGA 4080

TGCCCCACAC ATCTTTCAGA TAGAATTAAT TTAATTGTAT TACTTTATGT ACTAGTTGTT 4140

AAGTAAAATT TTGTATTTTG CCTTTTTACA ATCATTTTTA TTTGAAATAT TTTGCGCGAA 4200

ATTAAATCAT CTTTTTGTTT AATTGAAAAT AATTATCATT ATTAGTTTTC CAATTATCTG 4260

TTTCACGCTT TTTGCCATAT CTTTCACAAC CTTATTAATG ACAATATTTA ATAATCACCT 4320

CACCTAAAAA TCGTTATACT ATTTATAAAT ACCCTTTTTC TGAAAATTAA TAACCCAAGT 4380

TTGATAAATA TCTACTATCA TTTAGAAGGT AATATTTATC TTTAAATTAA ATTTGTAATG 4440

GATTAATTTA TAAAAATCAA ATCAGGCATT AAATAAAATA GCCCATAAAT ACAAAGTGTT 4500

ATCACCTTCT ATTTACGGGC TATTAGTTCT ATTCGTTATT CTATTTACAG ATCATTCTAT 4560

CTAATTAATT TGTGTACAAT TTTGATAACT TATTTTCCCT TAGTTTACTA CTCTAGATTA 4620

TCTTTTAATA ACTTAGTACT TTCAGCTTTT GACTGCTCAC TAGGAATGAA GTAGTACAAT 4680

CCGTCACTTT GAATGCCGCC TTGACCACTC AATTGATGTT TATTAATCGT GTCATTAGCA 4740

TCTTTATAAT TGCTTCTAAT CGTATTCAAA TCACCTAATG TTAAATCTGT TTTAACATTA 4800

TTTTGAATTT CATTCATTAG ACTATTAAAA TGTGTAATCG ATGATGGGCT TGCAATCTTA 4860

TTGGCCATCG CTTCAAGCAC AATTTGCTGA CGTTGTTGTC GACCAAAGTC ACCACCAGCA 4920

CCTTCTTCTT TACGACTTCT AATAAACTTC AATGCTTGAT CACCATTTAC ATGTGTCTGC 4980

TGTCCTTTTG TAAAACGAAC ACCATCAACA GTGAATGTAT CATTACTTAC TACATCAACA 5040

CCGCCGATGC TATCTATCAT ATTATGCAAA CCATCCATAT CGATTGTCGC ATAATGATCA 5100

ATTGGCACAT TCATTAATTT TTCAAGTGAT TTAACAGCCA TATTTGGTCC ACCATATGCA 5160

TAGGCATGTG CAATTTTTTC AGTAGTACCA CGGCCAACAA TTTCCGCTCT TGTATCACGC 5220

GGTATACTTA CTATTTCAGT TTTCTTCGTT TTAGGGTTGA TAGATAAAAT CATAATACTA 5280

tCACTACGCT CTCCGCCACC CTTTTTCTTA CGATCAGCAT CTGAATCGAC ACCAAATAAA 5340

GCGATTGTGA ATGGATCACC ATCGTTTAAA CTCACTTTTT TATCTCTTAA TTCTGAATGA 5400

TTGCGATCTA ACGGATTGTG TATCTTATTA CCAGTAATAA AAATTTTAGC AGCTACATAC 5460

GGTAGGCTCA TTTTACTTTT AGACGAACGT TTCAATCCCA CCACTCCTTT ACTATTCCTT 5580

ACATACTTTG TCTGTTTTCT CTATTTATTA TATAGTAAAA TAATTTTTTT ACTATACTTC 5640

TGTAGACGTA TAACTATTTT TTATCATTTT TTATCTCTAG AGAATATCTA TCTGTATTTT 5700

TGATAACCAC CATTTGCATT TAAAATTTTA AGTACCGTTT CATGACATGC TTTATTACTT 5760

ATAATAAAAG GTGCACCCTT TAAATGATCA ATTGCCTTAC CATCTAAAGT CGTCATTTTT 5820

AGATTCAATA GTTCTGCAAA TAAAAACTGT GCAGCAATGT CCCAAGGTTT AGGATTTGTA 5880

TTAATATGTG CCCCAAATTG ACCTTTTGCC ACTCGCATAG AATCTAATCC GCAAGCACCA 5940

ACTAAACGAT AACTAAATGA GGCGTCAAAT AAATCTTGCA CCGTATCTAG ATTCATCACT 6000

TGTGCATTAA ACGATATAAT AGCGTCTTCC AATTTTAACG ATGGTGGTTC TTCCATCTTA 6060

ATTCCATTAC AAAAAGCACC TTCTCCTCGT ATTGCTTTAT AAAGCTTTTT ATGCGGATAA 6120

TCATATACGT ACGATAACAT TGGTTTACCT TCATAAAAAT ACGCCAATAT AATACAATAA 6180

TCTTCTTGCT GTTTTACTAA ATTGGCAGTT CCATCAATGG GATCCATAAT CCATAAATGA 6240

TTAATTTCAT TCGTAATCAT TTCATTACTT TTTTCTTCCG CTAATAGTTG GTGTTCCGGA 6300

AAATGTGTTG CTAAAAATTG TTGGAATTGT TGTTGAATCT GTTTATCTAC ATTTGTAACT 6360

AAATCAAATC GATGACGCTT AGTTTCTGTA GTCATTTCCA TAATTAATTG CGGAATAACA 6420

TTGTCTATTT GTTTCAACCA CGAACATATT AACTTATCTA TTTGCTGTAA TGTTTTATCT 6480

GTCATTTCGT CCACCACTTC TCATATCATT ATCATTTTAT TATTACCCTA TATTAAAAGA 6540

ATCAACAATA CAACTGAAGA CTTCTTCATT TTATGCATAA AAAAATCGGC TAGTCACGTG 6600

CTAGCCGACA AATAGAAAGG AAAGTAAGTA ATAAATATTG AAGATGTTGT GATGTAACTT 6660

GAACGATTAA AAGCTATCTG TTATATAGCT CTACCCCTTT GTTTAATCGC TCCCCCTGTT 6720

ACAAGTAATA TCATAGCACA ATCTTTTTTA AAATGTAAGC GTTTTCCACA AAATTTTTAC 6780

GATTTTTTTA AAAAGATATT GAAAATGTCC TCATTGTCAC TCTTATGTTA TACTTTGTGT 6840

AATATATCAT CTTTTAGGAG GTGGCTGTCA TGAATAAAGC TGAAAGGCAA AATTTAATAA 6900

TTACTGCAAT TCAACAAAAT AAAAAAATGA CCGCTTTAGA ATTAGCTAAA TATTGCAACG 6960

TATCCAAACG CACAATTTTA AGAGATATTG ATGATTTAGA AAATCAAGGT GTTAAAATTT 7020

ATGCGCATTA TGGGAAAAAT GGTGGTTACC AAATACAACA AGCACAATCT AAAATTGCAT 7080

TAAACTTATC TGAAACACAA TTATCAGCCT TATTTTTAGT GCTTAATGAA AGTCAGTCGT 7140

ACTCGACATT ACCATATAAA AGCGAAATCA ACGCAATTAT AAAACAATGT TTAAGTCTTC 7200

CACAAACACG CTTAAGAAAA TTGCTTAAAC GCATGGACTT TTATATTAAA TTTGATGACA 7260

ATGTGATGTT AGTAGATCAT AGGGTTGATG ATAATATTAA AGCTGAAAAC GTTATATTTA 7380

TTGGCCTTTT GTGTAAACAT GGACATTGGC ATGCAGTCAT TTATGACATT GCTCAAGACA 7440

AAACTGCCGA ACTCGAAATT GAAAATATTA TAGATATTTC GTATTCATTC GGTAAGACGA 7500

TTCAAACCAG AGACATATCC ATTGATAACT ATCATCAATT TTTAAACCCC ATCGATTCCT 7560

AAAAAACAGC AGTAAGATGA TTTTCAATTA GAAAATATCT TGCTGCTGTT CTCTATTTAT 7620

ACAATACTTC GTATTGAATG GnTTCGCTTT CCTAGGGTGC CGTCTCAGCC TTGGTCTTCG 7680

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC 7740

TTTGAAATAC TTAAAAAAAT AAAACACTTT GCCCAACTTA CACTACCAAT AGAAACTGCT 7800

GTTAGAATTC CTCAAAATGA TATTTCGCGA TATGTTAATG AAATTGTTAA AAAGATAGCT 7860

GATAGCGAAT TCGATGAATT CAGACATCAT CGTGGCGCAA CATCCTATCA TCTAAAAATG 7920

ATGTTAAAAA TCACCTCATA TTCATATACT CAATCTGAAT TTTCTGGCCG TAGAATAGAA 7980

AAATTACTTC ATAACAGTAT TCGAATGATG TGGTTAGCTC AAGATCAAAC ACCTTCTTAT 8040

AAAACTATTA ATCTTTTTAG AGTGAATCCT AATACTGATG CGCTAATTGA ATCTTTATTT 8100

ATTCAGTTTC ATAATAAAAT GCATATCAAA AAAGCTGATT TCTATCAAAT AATTAATAGA 8160

AATCAGCTTT TTTCaTTGCC TAAAAACTTA ATGTCCCGAC CTCTTTATCT ACGCATAAAT 8220

ACTTATTACT GATATAACGA AAGAAACAAA ATTATTTGCT ATATGTAATG CAATTGTTGA 8280

ACCTAGGTTT CTTCCAGATT TTAAATAAGT GAAAACTAAT ATGATGGATA GTATGAGATA 8340

TGGACCAAAC TCAAACGGCG ACTTTGCATC AGTCACATGA ATAAATGCAA ATAAGAACAC 8400

CGAAACAATA CTCATAGCTA TAAAATTAAA CTTCTTACCT AATTCTCCAA TTAAAATATG 8460

TCTAAATACG ATTTCTTCAA CTATTGGACC TACAATCACA ATTAATAAGA ATGCTACAGG 8520

TAA2AATGCA GGCACTTCAA ACATTTTATT TAGCTCAAGT TCATTGGCTG TTtCACTATA 8580

TTGCAAATGT TTAGGTAGAA ACTGTGTCAT ATATTCATAT GTATAAATTA AGATGAGAGC 8640

AATAATATAC GTTATTGACA ATCTAAGCCA ATATTTTTTG ATATACGCAA AACCAGCTCG 8700

AAGCCTTGAT GGCATCACTT TTAAATGAAA TAAATAAAAT GCGCCAATCC CAATCGTATA 8760

TGCTAAAGCT TGTGTGATAG TCGCTACAAA TATCAGATTA CTATCGATTT CATAATAACC 8820

AAACAAAATT GGTCCTATGT AAGCTGCAAT TGTGAGTGCA TAAAATATAA CACCTATAAT 8880

TGGAATTATA AGCAAATCTC TCCATGCTAT ATCTTTAAAC GTGTATTTCT TTTTTTCATT 8940

TTCCaCTGTT ATATCCtTTC CTGTTTAATA ATTGATTTTT GGAGGTACTT CTACATGATA 9000

AACGAAACTA AGTATATGAG ACAACA^TT ACTAATTTGA TTCAAATCAT TGATACGATT 9060

ATAGTTACTA ATGAATTGAA TAAGTTCAAA GGCTTTGAAA CATCATATAT AATAAACGAA 9180

AATCAAGTTT CCTATTATGA AATTATAACA CTACTTAATA AAOGTCCCCT CgACAAGTCG 9240

ACTATGGTAA CAAAATTCAA TATCTTAATT TTTATCATAC AGAACTATCT AACGCATTAT 9300

TTGCAATTAA ATTTGCCCAT TAACCTATTT TTCATAAAAT GTCATTTAAA CAAGTTATTT 9360

ATTAAAATTC ACTTTATTAC ATAAATTATA CAATTArAAA GTTTCTTCAA ATTGTAAAGA 9420

TGCATTAATC GAGTTATAAT CATAATGATT AAGATGGT 9458

(2) INFORMATION FOR SEQ ID NO: 115:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 910 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 115:



AnGCGTATCA TGTCACGCAT TTTAACTACT TCTTTACCAC AAGATTATAC AGTCACATTA 60

GTTGATCGTA TGCCATTTCA TGGATTGAAA CCAGAATTTT ATGCTTTAGC TGCGGGCACG 120

AAATCAGATA AAGATGTTCG TATGAAATTC CCTAATCATC CACAAGTGAA TACAGTTTAT 180

GGTGAAATTA ACGACATAGA TTTAGATGCT CAAATTGTCT CAGTCGGTAA TTCTAAAATT 240

GATTATGATG AGCTAATCAT TGGTTTAGGA TGTGAAGATA AATATCATAA CGTTCCAGGA 300

GCCGAAGAAT ATACACATAG TATTCAAACA CTCTCAAAGG CTCGGGATAC TTTCCATAGT 360

ATTAGTGAAC TACCAGAAGG TGCTAAAGTC GGTATCGTTG GTGCTGGATT AAGCGGCATA 420

GAACTTGCCA GCGAATTAAG AGAAAGTAGA TCAGACTTGG AAATATATCT TTATGACCGT 480

GGGCCGCGAA TTTTAAGAAA TTTTCCAGAA AAATTAAGTA AGTATGTTGC GAAATGGTTC 540

GCCAAAAATA ATGTTACCGT TGTTCCAAAT TCAAATATTA ATAAAGTTGA ACCTGGTAAA 600

ATATATAACT GTGATGAACC TAAAGATATT GATTTAGTTG TATGGACAGC AGGAATTCAA 660

CCTGTTGAAG TTGTTCGTAA CTTGCCGATT GATATAAATA GTAATGGACG CGTGATAGTT 720

AACCAGTATC ATCAAGTACC AACATATCGT AACGTCTATG TAGTTGGTGA TTGTGCTGAT 780

TTACCACATG CGCCAAGTGC TCAGTTAGCC GAAGTTCAAG GTGATCAAAT TGCCGATGTG 840

CTTAAAAAGC AATGGCTAAA TGAACCATTA CCTGACAAAA TGCCGGAACT AAAGGTACAA 900

GGTATCGTTG 910



(2) INFORMATION FOR SEQ ID NO: 116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10182 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 116:

10 TTTTTGATTC AAAGTGGTGA TTTAACAAGC ATTTTAAATA GCAATGATTT GAAAGTCACA 60 

CATGATCCTA CCACTGATTA TTATAATTTA TCTGGTAAGT TGTCGAACGA TAATCCAAAC 120 

GTTAAACAAT TAAAACGTAG ATATAATATT CCTAAAAACG CATCAACAAA GGTGGAATTA 180 

75 AAGGGAATGA GTGATTTAAA AGGCAATAAT CATCAAGATC AGAAACTTTA TTTTTATTTT 240 

TCAAGTCCTG GAAAAGACCA AATCATTTAT AAAGAAAGCC TTACTTATAA TAAAATAAGT 300 

GAACATTAAT ACTTATGCTG TAATTATAGA AACATCCAAA TCATCTATTA nAATCCTATA 360 

20 

TTATAAAAnC ACCTCACATA ACTCGTTCAA CTGTACCAAA CCACATTACA TTAGATTTTA 420 

GGCTAACTAT TGTGATGTAC ATCAAAAACG AATTTGTGAG GCGTTGTATA TTTTACAAAG 480 

GTGACTAGCG TTTCGTATAG CATTTCCAAC ATTACTACAC TCAAGCGTCA CGCTAAAGTT 540 

25 

CGAAATCGAA TGCTTTCATT CAACAAAAGC TCATATCCAC TACAAACTTC ATATCAAGCG 600 

TATAAACTAT CTTGTGATAC TATCTCGATC ATATCTATAG TATGCATTTG TGTTCCGTTT 660 

30 CACTGAAGTA TATGTATCAT CAGTTAAGTA TAAACCGTCA TCCTTCAATG TTACTTGATA 720 

AGCATATTTC CGTGCTAACC AGGCAATATC TATATAATTT TCTCCTGCGT TTTCATAACT 780 

TCTTAAATCT TCAATATGTG CACTAACTTC AGGGaAAATG ATTCTAACAA CACTTTCATC 84 0 

35 AACCCAATAT TTGTCATGCA TCCATCGCAC TTGATCTGCC AATAAAGGTA ACTGCACATC 900 

ATTGAAATAT AGACGAAAGC CGTCACTATC ATACATTTGC CGATATGGTA ATGGCTGTTT 960 

TCTAATCACT AACACCTCGC CACCCATTAC GGTGCCTTCT CTAGTATCAT CACTTCCACC 1020 

40 

CGAAGCTTCA TACGTTGTTG GGTCAACCTG TAGTCCATGT ACATCTCCAA TATAAGCATC 1080 

TGGTTTATGT TCCATTGCAT GTCCATGTGC AATCAATGCT AATATTGTAG ATTGTGAAAA 1140 

TTGAGGCTCC CATTCAATGC GATTAGGATG GCTACTATAA ATTCTAGGTT CATCTATAGC .1200 

45 

CTGCTGAATA TCCATGCCAA ACACTAATAC ATTGATTAAT GTTTGCGCAA CACTAGCAAT 1260 

GATACTTATG GCACCAGGTG CACCTACTGT TAATATTGGC TTCCCGTGAT ACATCACAAT 1320 

SO CGTTGGAGCC ATGTTACTTA GTGGTCGTTT ATATGGTGCA ATTTCGTTAA TACCACCATC 13 80 

TACTACATCA AAGCCATCCA TTGTCGTATT CAATAACACA CCGTAGCCTG GAATCGTGAT 144 0 

ACCTGAACCA TAAATCATAC CAATTGATGT CGTAAATGAA GCAATATTAC CTTCCTTATC 1500 
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w 



15 



ATCAGACACA ACACCATGCT CTATATCAAT ATTTGCTTTA TTGCTATCAA TGAGCGTACT 1620 

GCGTGCTTTT AAATAATCAT CATCAATTAA TGACTGTACA GGCACCTCAT GAAAATTATC 1680 

ATCCGCCAAG TATTGCGCAC GATCACTATA TGCTAAATGC ATCGCTTGTA TCAAATGATG 1740 

CAAGTAATCA ACAGATCTTG GACCCATAGA TGGTAAATCG ACATGTTCTA ATAACTTCAA 1800 

TATTTGAATT ACCGTGATAC CGCCAGAACT AGATGGTCCC ATTGaATAAA TGTCATAGTC 1860 

TTTAAATGTT GCACTGATTG GCGCTTTAAT CTGAATGTCA TATTTGGCTA GATCCTCTAA 1920 

AGTGATTGTC CCACCACATG CTTTGACAAC ATTGACTAAT TGTTTCGCAA TGTCACCTTT 1980 

ATAAAATGCA TTAAACCCTT GTTCTCTTAA TATTTGAAAT GTCTTACCTA ATTCGGGTTG 2040

TACAATCCAA TCACCTTCAC GCCAATATTG ATTTTCATGC GTAAATACTT GTGCCGTTTC 2100 

ATGATACTTT GTCAATCGTG CGTGTTGCTG GCGCGAATAT TTTTCAGTAG CCCAATTGGC 2160 

TGCATGACCT TCAATGGCTA GTTCAATTGC AGGATTAATT AAATCTTGCA ATGACAATTT 2220

AGCATAACGC TTGTGAATAT AATCAAACAG CTTTGGAATT GCTGGCACAG CGACAGTTTT 2280 

ACCATGTGTA GTCATATCAA AAAATGATTT ATATTCGCCT GAATCATCTA GATAAAATTG 2340 

TTTGTCTACA TGTTCAGGTG CTGTCTCACG TGCATCAAAC GCAGTTATAC TGCCAGTACT 2400

TTGCTCATAA TATAGCAAAT ACCCGCCACC ACCAATACCT GATGCAAATG GTTCTACCAC 2460 

ATTCAATGCC AGTTGAATTG CAATCACTGC ATCCATGGCG TTGCCACCTT GATCTAATAC 2520 

ATCCTTACCA ATTTTAGCCG CAAGAGGATG TGATACGGAA ATTAACCCTT CTTTAGATGT 2580 

TTTTGTCTGT TTGTCATTTA AGTTAATGAC CATACTATAT CCTCCTACTT TCTGTTAAAT 2640 

ATTTAAAACA TTATTGATTA ATGGCTTTTT CTACTTTTTC TAAATCTTGA CGTTGCTCGT 2700 

TACCAGTATC GACAAGTGGT GTAATCGGTG ATGCAATTTT AAATTTATCG CCACGATAAA 2760 

ACTfAATAAA TTGATCCTGA TCTATCGCAT TAACTACTGC TTGTCTCAAG TTTGGATGCG 2820 

TCTTAAATAT ACCTTTTTTA ATATTTAGCA TTAAAAAGAC TGACTTGCGT CCATTTTTGC 2880

GAATAATGCT TAAATTTTTA TCCGACTTAA TTAAATCAAA ATGTTTTTGA TTCACATCTG 2940 

CCAACATATC AATTGAATGA TTTCTAAGTT CTGACAATGC ATTATTCGGG TCACCATTAA 3000 

ACTTCAATGT AATATTTTTA ATTTTAGCTG GTCCATAACT ACCTTTTTCT GTTTCGTTGA 3060 

ATCCTGGATT ACGTTGAAAC GTTGCTTGAT ATGCATTTTT CTGTGTCATA ATGTATGCGC 3120 

CACTTGCATA CAGCGCATTT TTCCCATCTG AATTTGCAGG AATTGTACTG CTATCCCCAT 3180 

ATCCTTTTGG ATATTCTTGA TTTACTTGAT TAACAAATTT TTTAGATAAA ATGCCTGCCG 3240

AAGAGTGTGT TAAGTAATTT ACCTCTCGAG GQATCGATTG ATCTGTCGTA ATTTTAACAA 3300 

TATAAGCTTT AATCAACTTA TCATAGATTG ATTTATCGTC CTTGTCTTTC TCTTTACGCA 3420

ACTGATCGAT GTCCTCATCT TTTAATATCT TGATGTCATT TATATGTTTG TGCATATTGT 3480

AAGTATTATT GTTAGGCACA GACTTTTTAT CACGTGCTCT ATCTAAAGAA AACTTAACAT 3540

CTTCAGCCGA TACACGCTCT CCAGTATTAC GTGCTTGTCC ATTGACCACT TTCGCAAAAT 3600

AATCATCATC TCTTAACAAG AAATAAAATG CTTTATTGTC CTTATTCACA GCATAATCAT 3660

GACTTAACGA ACCTTTCGTT GTTAAATGAT CATTTTCATC TAATAATAAT AACCTTGTGT 3720

ACATATTCAT ATTAATTGAA TATACTGACG GCGCAATTGA ACGTATTGGA TCCAATGTAG 3780

GAATTTCACC ATCTTGTTGT GTCATCACAA GTGGCCGCGT ATCTCGTTCT CTACTATTGT 3840

TGTAATCAAA TTGTTGCCAT ATTAATGCAC GTGAATTTGG CAATCCAACA CTATTTTTAT 3900

CTAACACTTT ATTGTCATAT ACTAAATTCT TTTTTGATCC ATATAAAGGC GCCATATACC 3960

CTTTATCAAA TACAACTTCA TCTTCAATTT GCTTATATGT TTGTTTAACA TCTGCTTCAT 4020

TTTGAGTAGA AGCTTTATTT AACAACTGGT CTACATGTTT ATCTTTCAAT AAACTATTTG 4080

ATCCTGTAGA ACTAAATAAT GCCGTCATAG CATAGTTCGG GTCACCAAAC ACTGTCATCC 4140

AGTCATCAAT TTGGATATCA TAATTGCCGG CTTGACGTTG TGTACGATAG CTACCATAAT 4200

CTGGTTGGAT ATTCATCTTC ACGTTAAATC CTGCATTTTC CAATTGATCT TTAACGATAT 4260

TCATATCATT TTCATAACTT GCTTGTCCTA GGAAATGTAT TGTTGGTCGC TCGCCTTTCA 4320

CTTCAACTTT CGATGACTTT TGAGCCACTT CTGATTTCGT AGGGACACCA CAACCACTTA 4380

ATACCAACGC TAAAACTATA ATTGCGATAC TAATGATTTT CTTCACATCT ATCCCTACCT 4440

TTTTAATGAA TTCTTGGATC TAGTGCATCA CGCACTGCAT CACCTATAAA ATTAAATGCT 4500

AAAACGACGA ACATAATACA AACACCAGGT ACAATAGCTA AATTACTGTG CGTTTCCAAG 4560

TAG1TACTAC CGGTACGTAA AATGTTGCCC CATTCAGCTA CATCAGGTGC aacaccaagt 4620

CCTAGGAAAC TTAAACTACT TGTTGTTAAT ACAACCACAC CTATATTTAA TGAAAAACGT 4680

ACAATCATAG GCGCAATCGC ATTCGGTAAA ATATAACGCC ATATGATATT CCAAGTGTTT 4740

TCACCAGTGA TACGTGCTGC ATCTACATAT TCCATGCGTT TAATTTCTAA AACACTGGCA 4800

CGCATTGTCC GTGCAAATGA TGGTATATTA CCGATACTTA AAGCAATAAT TAAATTTGGA 4860

ATACTTGCTC CAAATGATGC AATAATTGCC ACCGCTAACA ATAATGATGG AATTGCAAAC 4920

ACTACATCTA AAATTCGCAT TATTAAATTA TCAATATGAT TAAAATAACC TGCGATAGTG 4980

CCTAGTAACA CACCAAAAAT AACTGCAATA ACTACTGAAA TAATTGAAAT TGAAAATGTC 5040

AGCTTCGTTC CTACAACTAC GCGTGTAAAT AAGTCTCTAC CGAAATCATC AGTACCAAAC 5100

GTATCAAATG TAAATTGTGA CACAATTGAT AATGTCAGCA TGTAGACTAA AATAAGTAAC 5220

CCGATAATCG CAATACGATG TCTAGTAGTT TTTCGTATAA ACGATTCCCA CCCGTTATAA 5280 

CTATGTATTT GCGATGTACG TTGGTAACGT CTAATACTTA CAAACATTAA TAATGTAAAT 5340 

ACGTTGCCTG TTAATGTCAT CAACAATAAC AACACTTCGA CGATACGTCG CCATAGGTCA 5400 

TGATGCTTCC ATGTTTGTTC CGTTGTTAAA ATAATAATTA AAATGATGGT TAAAACGATT 5460 

AGCAATGTTT CAGCAATATA GAACGTATCG GCCACATAAC CTTTAAAAAG ATTTAATGCA 5520 

CTCGTTAATA TAACTAAAAT ATAAGTTGCT ATGGCGTAAC TTGCGAATAA TTTTAAGGAA 5580 

GCTATCTTTG AATTAAGTTG TGCCATATGC CTCACTTCCT TTCGTTGATT TCACTACGTA 5640

ATTTTGGATC GATTAAAGCA TAAAATATAT CAATAATTAA GTTTGCTAAA GATATTACAA 5700

TTGATATATA TACGACCCCA CCCATGACTG CTGGAATATC AGGTATTAGT TGTTTTTGGA 5760 
CGATATAACG CCCGATACCA TTAATGTTAA ATACTTGTTC CGTCACTGCT GAACCGCCTA 5820

GTAACTCTGC CACTAGAAGA CCAACTAACG TTACAATTGG AATAATGGCA TTTTTCAAAA 5880 

TATGTTTAAT AACAACTTGT GTCGTCGATA ATCCTTTTGC ATAAGCAGTT AAAACATAAT 5940

CGctGCGCAT TACTTCAAGT ACAGAAGACC TTGTCATACG CGTGATAGAA GCAGCAATAC 6000

TTGTTCCAAT GACAAGTACA GGTAAAATCA ACGATATTGG ATGTTCTGGC ATATAAGATG 6060 

GTGGCAAAAT ATCCAATTTC AATGAGAACG CTAAAATGAA TAATAGCCCT TGCCAGAAAC 6120 

TTGGAATAGA TAAACCAATT AATGCAATTA TCATTAACGT GATATCAAGC CAACTATTTC 6180 

GCTTCATCGC ACTGATAATA CCAATTGGTA TTGCAATAAT TAATGCCACC ATTAGCGCTA 6240

ATACTGCGAC AATTATTGTA ATTGGAATTC TTTCGCCAAC TGCTTTAGTC ACAACCTCAT 6300 

TCCCTTTGTA AGTCGTACCT AAGTCAAAGG TAAAAACACC CTTGATGGTA TCCCACAATT 6360 

GAATSAAATA AGGTTCGTTA AGATGATGTA ATACATTGAA TTGATGTATC TGTGCCTTTG 6420 

TTGCATTTTG TCCCAGTATG CTATAAGCCG CATCAAGCGG TGAAAAATAC AGAATGGTAA 6480

ACACACTGAC AATAACACCA ATGATGACAA TCACAGCCAT GACAATTCGT TCAAAAATAT 6540 

ATCTAACTAA TGGCTGTAAA TAAAAAGTCA ATAAGATGAA CATCGGCAAG GCCAATATCA 6600 

CTTTGATCAT GATGAACTTA TGAAATAATA CATTTTCAAA GTATGTTGAA AAATGTGCTT 6660

GTTCAATATT CTTTGAACTC GTATTAGAAC TTTGTGCCTT GAATATTTTT AATGCTTCTT 6720 

TATGTATTTG TGTGGATGAC TTTTGCTGCG ATAAATATTT ATATTTTTGA TGTAACGCCT 6780 

GTTCAATTTC TGAAATTTCA GAATTATTAG CGTAAAAATT TTTCCTCTTA GCAGAAAAGA 6840 

AAAACTTTAT CJiCT Q CJiTKr AAAAATATTG GCAAGCTTAA TACCGATAAT ACAAACTTGT 6900

CTTGTAAAAT AATCTTGAGT AGATTACTAT GATATACAAA AGTATAGAAT AAATTTACAC 7020

ATTTGTGaAT AGGGAGGCAC AACATCATGT CAAATTTATT AGAAGTCAAC AGTCTGAATG 7080

TACAATTCAA TTATGATGAA ACTACAGTTC AAGCGGTAAA AAACGTCTCT TTCGAATTAC 7140

GAAAAAAACA TATCCTAGGT ATTGTTGGTG AATCAGGATC AGGAAAAAGT ATTACCGCTA 7200

AATCTATTTT AGGGCTACTA CCAGATTATC CAGATCACAC ATTAACAGGA GAAATTATTT 7260

TTAATGGGCA ATCGTTAAAT AATTTATCAA CTTCAGCGTT ACAACAAATT CGAGGTAAGG 7320

ATATTTCAAT GATTTTTCAA GATCCACTCT CTTCGTTGAA TCCAAGATTA ACGATTGGCA 7380

AACAAATTAC AGAAGTAATA TTTCAACATA AACGTGTATC TAAATCTGAA GCAAAGTCGA 7440

TGACAATAGA CATTTTAGAA AAAGTAGGTA TAAAACATGC AACTCGACAA TTTGATGCTT 7500

ATCCACATGA ACTTTCTGGT GGTATGCGTC AACGTGTCAT GATAGCAATG GCATTGATTT 7560

TAAAGCCACA AATTTTAATC GCAGATGAaC CAACAACGGC ATTAGATGCC AGTACACAAA 7620

ATCAATTACT GCAGTTAATG AAGTCCCTTT ATGAGTACAC AGAAACATCT ATTATTTTTA 7680

TCACTCACGA TTTAGGCGCT GTGTATCAAT TTTGCGACGA TGTGATTGTA ATGAAAGATG 7740

GAAGTGTCGT TGAAAGTGGC ACGGTTGAAA GTATTTTTAA ATCGCCACAA CATACCTATA 7800

CAAAACGCTT AATAGATGCG ATTCCTGATA TTCATCAAAC GCGTCCGCCA AGACCGTTAA 7860

ACAATGATAT TTTATTAAAA TTCGATCGCG TGAGyGgGAT TACACATCAC CGAGTGGCAG 7920

CCTATACCGA GCAGTTAATG ATATTAACTT GGCTATTAGA AAAGGCGAAA CATTAGGCAT 7980

TGTCGGTGAA TCAGGGTCAG GGAAATCGAC ATTAGCTAAG ACGGTCGTCG GTCTAAAGGA 8040

AGTGTCAGAA GGCTTTATTT GGTATAACGA ATTACCATTA AGTTTATTTA AAGATGATGA 8100

ATTGAAATCT TTACGACAAG AGATACAAAT GATTTTTCAA GATCCATTCG CATCTATTAA 8160

TCCAAGATTT AAAGTCATTG ATGTGATTAA ACGACCACTA ATCATTCATG GGAAAGTCAA 8220

AGATAATGAT GACATTATTA AAACTGTCGT ATCGTTGTTA GAAAAGGTTG GCCTAGATCA 8280

AACTTTCTTA TATCGCTATC CACACGAATT ATCTGGTGGG CAACGTCAGC GTGTAAGTAT 8340

CGCGAGAGCA CTTGCTGTTG AACCTAAAGT GATTGTTTGC GACGAGGCAG TGTCCGCTTT 8400

AGACGTTTCA ATTCAAAAAG ATATCATCGA GTTATTAAAA CAATTACAGT TAGACTTCGG 8460

CATCACTTAT TTATTCATCA CACATGACAT GGGTGTTATC AATGAAATAT GTGATCGCGT 8520

TGCAGTTATG AAAAATGGCG AAATCGTTGA ACTGAATAAC ACAGAAGATA TTATCAAACA 8580

TCCGCAGTCA GACTATGCAA AGCAACTTAT TTCAGAAGTA GCAGTTATTG CTAAATAAAA 8640

GTCATGCGTT GTGCAACTTT ATCACTGTAT GGTCTGAAAT AAATTGCGCG ACTTCTGATG 8700

TATCAAGTTT TAGGTGCTTT GCCATGATTT AAGAGTCACC CCCATACTTT GGGCATTTTA 8820

ACGCCAGAAT AAATCCCCCG CCACTATGTG AAGTGTGGGG GATTATTTAT ATTTTATTAG 8880

AATATTCAGA TTTTTGAGTG TGTCAACTTA GCTTAGTCAA TGTATATTTA ACGTCACTTA 8940

CTCTTTTTCT TTCATAATTA ACACATTCAA ATAAACTTTG ATCAAAAAAC ACAAAGTTAA 9000

AAGTACCATC TTGTAATATG CTCTCATACA TTATCCCGTC ATATTTAAGG CTTCGAATAT 9060

AATCAGCTAA ATATTGAAAT GGCAAATAAT CTATTCCTTG TTCATCGCTT GGATTTGTTA 9120

TTCCTTTATG AATCTTTTTT AATGTTTGGT AATTTACAAA ATACTTTCTA AATCCATCAT 9180

CGCCAGCTTT GATTGCATTA CTAGTTAAAT TAGTTAAATT CGCAATTTTC AATTTCTCTT 9240

TTGTCACGTT TTTTTGTAAC TTAACCTTAC CTATATAAAT AATGTCATTA TGCTTAGGTT 9300

TAACTTCTTC TATACTGACC TGTTCTTTTG TACTAAGGTA TAATACGCTT ATCCATTTAG 9360

AATTCAATCT TCCTGCCGTT GCAAATCCCT TTGGTGGTGA CATTAGTTCA CTTTTCTCTG 9420

TAATGAACTT AACTATTCTA GATCTATATA ATGGTTCAAA TCTTTCTCTA AATTCCTCAA 9480

TACTATAGTA ATTAGTAGTG ATATCGAGAA AGAACGCTAA ATTCTCTAAA TTGATCATAT 9540

TTTTATGAAA TCTATTTTTA TACTTCAAGC TCTCACAAAA TCCATCCCAG TCATTATTTG 9600

CTACAATTAG ATTTTTATTT GTATATTTTT TATCGTTTAT GATTTTAGCG CCTACTAAAT 9660

CTTCCAACAC TCGTCTATCT AAATTTTCAT CATCTTTAAA AAGTTCATTT AAAATACAAC 9720

TTATTTGAGC TTCCTCAACA TTAAATATAC TCCAGTCGTC TTTTAATGCT ATTTCAATCT 9780

TTTTACCTTC TTTTGGGCTA AAAGTATCTG GTAAATTTAT ACTAATATCA TATAATTCTA 9840

ATGCTGGTCT TAAATAATCT CTAATAAGTT CTAATTTATC TATGTCCTTA GTCGTATCAA 9900

ACCAAGATGA TTGTTATCAA TATCACAATT GTCAAATTTG CTATTTATCA 9960

TTTGCAATGA TTTCTACGAT TTCAGTATTA TTAAAACATT TTTCACATAT TTTCATTTTG 10020

AGACtCCAAG TATCTATTCA TAATTTCTAG GTGATGCATG ATAGATAACC TTTTAATTAA 10080

ACCTAATCCT GGATaCTTAT TATTTTCATT TAATTCTTCA AATTGTCCCA AGCGCATAAG 10140

ATCTATTTTT AATATCTAAG TTTTTTGACC ATGTTACTAA TT 10182

(2) INFORMATION FOR SEQ ID NO: 117:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3491 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 117:

AACTCAGGCA ATTGAAACAG CATTAGGTGC TTCATTACAA CATGTCATTG TAGATTCAGA 60

AAAAGATGGA CGCCAGGCTA TTCAATTTTT AAAAGAACGT AATTTAGGTC GTGCGACGTT 120

TTTACCATTA AATG7TATAC AGAGTAGAGT GGTAGCGACT GATATTAAAT CTATTGCTAA 180

AGAGGCAAAC GGATTTATTA GTATCGCTTC GGAAGCAGTT AAAGTAGCAC CAGAATATCA 240

AAATATTATC GGGAATTTAT TAGGTAATAC GATTATCGTT GATCATTTAA AGCATGCAAA 300

TGAATTGGCA CGTGCGATTA AATATCGAAC TCGTATTGTT ACTTTGGAAG GTGATATTGT 360

AAATCCTGGT GGtTCTATGA CTGGTGGTGG CGCTCGTAAG TCAAAAAGTA TTCTGTCTCA 420

AAAAGACGAG TTGACAACAA TGAGACACCA ATTAGAAGAT TACTTGCGTC AAACAGAATC 480

ATTTGAACAA CAATTTAAAG AGTTGAAGAT AAAAAGTGAT CAATTAAGTG AACTGTATTT 540

TGAAAAAAGT CAAAAGCATA ATACACTTAA AGAGCAAGTG CATCATTTTG AAATGGAGCT 600

CGATAGATTA ACTACACAAG AAACACAAAT AAAAAATGAT CATGaAGAAT TCGAATTTGA 660

AAAAAATGAT GGTTATACGA GTGACAAAAG TCGACAAACT TTGAGTGAAA AAGAAACTTA 720

TCTAGAAAGT ATTAAAGCAT CTTTAAAACG ACTAGAAGAT GAAATTGAAC GCTACACAAA 780

ACTTTCTAAA GAAGGTAAGG AAAGCGTTAC TAAAACACAA GAAACCTTAC ATCAGAAACA 840

ATCTGATCTT GCTGTGGTTA AAGAGCGTAT TAAAACACAA CAACAGACAA TAGATOGATT 900

AAATAATCAA AATCAACAAA CTAAACATCA ATTAAAAGAT GTTAAAGAAA AAATTGCATT 960

CTTTAATTCG GATGAAGTGA TGGGCGAACA AGCTTTTCAA AATATTAAAG ATCAAATTAA 1020

TGGTCAACAA GAAACGAGAA CACGCTTATC AGATGAATTA GATAAATTGA AACAACAACG 1080

TATTGAGTTG AATGAACAAA TCGATGCGCA AGAAGCTAAA CTACAAGTTT GTCACCAAGA 1140

TATTTTAGCT ATCGAAAATC ACTACCAAGA TATTAAAGCT GAACAATCAA AGCTAGATGT 1200

ATTAATTCAT CATGCGATAG ATCATTaAAT GATGrATATC AATTGACTGT TGAACGTGCG 1260

ArATCTGAAT ATACGaGTGA TGrATCGATg ACGCATTACG TAAAAAAGTT AAGTTAATGr 1320

AGaTGyCGAT TGATGrACTA GGTCCTGTAA ACTTAAATGC AATTGAACAA TTTGAAGAGT 1380

TAAATGAACG TTATACATTT TTAAGTGAAC AACGTACAGA TCTTCGTAAA GCTAAAGAAA 1440

CATTAGAGCA AATTATAAGT GAAATGGATC AAGAGGTTAC TGAAAGATTT AAAGAAACTT 1500

TCCATGCTAT TCAAGGACAT TTTACAGCTG TGTTCAAACA ATTGTTTGGT GGAGGCGATG 1560

CAGAATTGCA ATTAACTGAA GCCGATTATT TAACAGCTGG TATTGATATT GTGGtACAAC 1620

CACCGGGTAA AAAGTTGCAA CATTTATCGT TACTGAGTGG TGGTGAGCGT GCATTAACTG 1680

CTATTGCTTT ACTATTTGCA ATTTTAAAAG TAAGATCTGC ACCTTTTGTT ATATTAGrTG 1740

TATCAGACGA AACACAATTC ATTGTTATTA CACACCGTAA AGGAACAATG GAATTTGCAG 1860

ATAGGTTATA CGGTGTAACA ATGCAAGAAT CAGGTGTTAC TAAACTTGTG AGTGTGAATT 1920

TAAATACAAT AGATGATGTG TTGAAGGAGG AGCAATAATG AGCTTTTTTA AACGCTTAAA 1980

AGATAAGTTT GCAACAAATA AAGAAAATGA AGAAGTTAAA TCCTTAACAG AAGAACAAGG 2040

TCAAGACAAA TTAGAAGATA CACATTCTGA AGGTTCAACG CAGGACGCAA ATGATTTAGC 2100

AGAAAATGCT GAAGTGAAAA AGAAGCCACG CAAGTTGAGT GAAGCGGATT TTGATGACGA 2160

TGGCTTAATA TCAATTGAAG ATTTTGAAGA AATTGAAGCT CAAAAAATGG GTGCTAAATT 2220

TAAAGCAGGA CTCGAAAAAT CTCGTCAAAA TTTCCAAGAA CAATTAAATA ATTTGATAGC 2280

GAGATATCGT AAAGTAGATG AAGACTTTTT TGAAGCTTTA GAAGAAATGT TAATCACTGC 2340

AGACGTCGGT TTTAATACAG TGATGACGTT AACTGAAGAA TTACGTATGG AAGCACAACG 2400

ACGTAATATT CAAGATACTG AAGATTTGCG TGAAGTCATT GTTGAAAAGA TCGTAGAGAT 2460

TTACCATCAA GAAGATkATA ATTCAGAAGC TATGAACTTA GAAGATGGTC GTTTAAATGT 2520

CATTTTAATG GTTGGTGTGA ATGGTGTTGG TAAAACAACA ACAATTGGAA AATTAGCTTA 2580

CCGATATAAA ATGGAAGGTA AAAAAGTAAT GTTAGCTGCG GGCGATACTT TTAGAGCGGG 2640

TGCTATTGAT CAATTGAAAG TTTGGGGCGA ACGTGTTGGT GTAGACGTAA TTAGCCAAAG 2700

TGAAGGTTCT GATCCAGCTG CTGTTATGTA TGATGCgATT AATGCCGCTA AAAACAAAGG 2760

TGTTGATATT TTAATCTGTG ATACCGCTGG ACGTTTACAA AATAAmACAA ATCTAATGCm 2820

AGAATTAGAA AAAGTTAAGC GTGTAATTAA TCGAGCAGTG CCAGATGCGC CTCATGAAGC 2880

ATTACTATGT TTAGATGCTA CAACTGGTCA GAATGCGTTG TCACAAGCTA GAAACTTTAA 2940

AGAAGTAACA AATGTTACAG GTATTGTATT AACGAAATTA GATGGTACAG CCAAAGGTGG 3000

TATCGTATTA GCCATTCGTA ATGAATTGCA CATCCCAGTT AAATATGTAG GTTTAGGTGA 3060

GCAATTAGAT GACTTACAAC CATTTAACCC TGAAAGTTAT GTCTACGGCT TATTCGCTGA 3120

TATGATTGAA CAAAATGAAG AAATAACAAC AGTTGAAAAT GATCAAATTG TAACAGAAGA 3180

AAAGGACGAT AATCATGGGT CAAAATGATT TAGTtAAAAC GTTACGAATG AATTATTTGT 3240

TTGATTTTaT CAATCCTTAT TGACGAATAA ACAACGTaAT TATTTGGAAT TATTTTATCT 3300

TGAAGATTAT TCTTTAAGTG AAATCGCAGa TACTTTTAAT GTGAGTAGaC AAGCAGTTTA 3360

TGATAATATA AGAAGAACTG GCGATTTAGT TGAAGATTAT GAAAAGAAAT TGGAATTATA 3420

CCAGAAATTT GAGCAACGCC GAGAAATATA TGATGAAATG AAACCACATT TAAGTAATCC 3480

AGAACAAATA C 3491

(2) INFORMATION FOR SEQ ID NO: 118:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4253 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 118: 

AGTACGTTTT ATAATTATAA GTACGTAATT AACATATTAA CATATCGCAA GTATGTATTT 60 

AAATAAgATT GTTATAATTT CAAAGTTCAT CCAAGaTTAT GGCGTTTGCA TTTACCTATT 120 

AAAAACGTTA TTATATCAAA GATGCGAAAG ATAATACGGG TTTATTTTAT GAAAGTGAGA 180

AGGATAAAAT GGATAATGAG CAACGCTTAA AAAGAAGAGA GAATATAAGG AATTTCTCGA 240

TTATAGCACA TATTGACCAC GGAAAATCTA CATTGGCTGA TAGAATTTTA GAAAATACCA 300 

AATCAGTTGA AACAAGAGAT ATGCAAGATC AGTTACTAGA TTCAATGGAT TTAGAAAGAG 360

AACGTGGTAT TACAATCAAA TTAAACGCgT ACGTTTAAAG TACGAAGCTA AAGATGGAAA 420 

TACTTATACA TTCCATTTAA TCGATACGCC TGGACACGTC GATTTTACAT ATGAAGTGTC 480 

ACGTTcTTTG GCAGCTTGTG AGGGCGCGAT TTTAGTAGTA GATGCGGCTC AAGGTATCGA 540

AGCACAAACA TTAGCAAATG TTTATTTAGC ATTAGATAAT GAGTTAGAGT TATTGCCTGT 600 

TATTAACAAA ATTGATTTAC CTGCTGCAGA ACCTGAACGC GTGAAACAAG AAATTGAAGA 660 

TATGATAGGT TTAGACCAAG ACGATGTTGT TTTAGCAAGT GCTAAATCTA ACATTGGAAT 720

TGAAGAGATA CTAGAGAAAA TAGTTGAAGT TGTGCCAGCT CCAGATGGTG ACCCAGAAGC 780 

ACCACTAAAA GCGTTAATAT TTGATTCTGA GTATGATCCA TATAGAGGGG TAATTTCATC 840 

GATAAGAATT GTGGACGGTG TTGTTAAAGC CGGAGATAAA ATTCGAATGA TGGCCACTGG 900 
TAAAGAGTTC GAAGTAACAG AAGTTGGAAT TAATACACCT AAGCAGCTTC CAGTTGATGA 960

ATTAACAGTT GGTGATGTTG GTTATATTAT TGCAAGTATT AAAAATGTTG ATGATTCTAG 1020

GGTTGGTGAC ACCATCACAT TAGCTAGTAG ACCTGCATCA GAACCATTGC AAGGTTATAA 1080 

GAAAATGAAT CCAATGGTAT ATTGCGGACT GTTCCCAATA GATAACAAAA ATTATAATGA 1140 

TTTAAGAGAA GCATTAGAAA AATTACAATT GAATGATGCA TCATTAGAAT TTGAGCCTGA 1200 

ATCGTCACAA GCATTAGGTT TTGGTTATAG AACTGGTTTC TTAGGTATGT TACACATGGA 1260 

AATAATTCAA GAAAGAATTG AAAGAGAATT TGGTATTGAA TTAATTGCAA CTGCACCATC 1320 

TGTAATTTAT CAATGTGTTT TAAGGGACGG TTCAGAAGTG ACGGTTGATA ACCCAGCACA 1380

AATGCCAGAT CGTGATAAAA TTGATAAAAT ATTTGAGCCA TATGTTCGTG CAaCTATGAT 1440 


EP 0 786 519 A2 



TATAAATATG GACTATTTAG ATGATATTCG TGTAAATATT GTTTATGAAT TACCTTTAGC 1560 

TGAAGTTGTA TTTGATTTCT TCGATCAACT TAAATCTAAT ACTAAAGGAT ATGCATCATT 1620 

TGATTATGAA TTCATCGAAA ATAAAGAAAG TAATTTAGTC AAGATGGATA TTTTATTAAA 1680 

TGGTGATAAA GTGGATGCGC TAAGCTTCAT AGTTCATAGA GATTTTGCAT ATGAACGTGG 1740 

TAAAGCATTA GTTGAAAAAC TTAAAACGTT AATTCCAAGA CAGCAATTTG AAGTACCTGT 1800 

ACAGGCTGCA ATAGGACAAA AAATTGTAGC GCGTACAAAT ATTAAATCAA TGGGTAAAAA 1860 

CGTTTTAGCT AAATGTTATG GCGGTGACAT AAGCCGTAAA CGTAAATTAC TTGAAAAACA 1920 

AAAAGCAGGT AAAGCTAAGA TGAAAGCAGT TGGTAATGTT GAAATTCCAC AAGATGCTTT 1980

CTTGGCTGTA TTGAAAATGG ATGATGAATA ATTTTAAAAA ATCAATTAAC AATTTACAAT 2040

GAATAAAGTT TAATAACTAA AAAGAGGGAG CCTAGGATAA ATTAACGTCC TGGGCTTTAC 2100 

AATGTTATAT TGGCAGCCAT CGACAGAGTT AAAATGAGCT TATAACAATG GGGCCCCAAC 2160

ACAGAAGCTG ACGAAAAGTC AGCTTACTAT AATGTGCAAG TTGGGGTGGG GCCCCAACAT 2220 

AGAGAATTTC GAAAAGAAAT TCTACAGGCA ATGCAAGTTG GGGTGGGACG ACGAAATAAA 2280 

TTTTGCGAAA ATATCATTTC TGTCCCACTC CCTTATGCAT GAGTTTTACT CATGTAATTT 2340

TATTTTTAAG GACATATTAC ATCTGGCTAA TGTGTAAGAG CCACTACATA ATAAATCATT 2400

AGTGGTTCTT TATTATTTCT ATCTCACTCC CTCTAAACAA GAATAAATAT TAAAATGAAT 2460

CGATATATTA GACAATCATT GATTAAACGT TAAAGTTAAA AGTAAGAATA ATTGCAGATA 2520

GTCCAACAGG ATATAGCCGA TTGGATAAAA AGTCTGAGAA GCGGGGCATT AAAATGACGG 2580

TACAAAG7GC ATATATACAT ATTCCATTTT GTGTAAGAAT ATGTACATAT TGTGATTTCA 2640

ATAAATATTT TATACAGAAT CAACCTGTAG ATGAGTACTT AGATGCACTA ATCACAGAAA 2700 

TGTCTACAGC AAAATATAGG ATCTTAAAGA CCATGTATGT AGGTGGCGGC ACACCAACGG 2760 

CCCTTTCTAT TAATCaGTTG GAAAGATTAC TTAAAGCAAT ACGTGATACG TTTACAATCA 2820

CAGGCGAGTA TACATTTGAA GCAAATCCTG ATGAGTTAAC TAAAGAGAAA GTCCAACTAT 2880 

TAGAGAAATA TGGAGTAAAA AGGATTTCAA TGGGCGTTCA AACATTCAAG CCGGAGTTAT 2940 

TGTCTGTTTT AGGTAGAACG CACAATACTG AAGATATTTA CACTTCGGTG TTAAATGCTA 3000 

AAAACGCAGG TATTAAATCA ATCAGTTTAG ATTTAATGTA TCATTTACCG AAACAGACGA 3060 

TTGAAGATTT TGAACAAAGT TTAGATCTAG CTTTAGATAT GGATATTCAA CATATTTCGA 3120 

GTTACGGCTT AATACTTGAA CCTAAAACCC AATTTTATAA TATGTATAGA AAAGGCTTGC 3180 

TCAAACTTGC TAATGAGGAT TTAGGTGCTG ACATGTATCA GTTGCTGATG TCTAAGATAG 3240

AACATAATAA GGTTTACTGG TTTAATGAGG AATATTATGG ATTTGGAGCA GGTGCAAGTG 3360

GTTATGTAGA TGGTGTGCGT TATACGAATA TCAATCCAGT GAATCATTAT ATCAAAGCTA 3420

TAAATAAAGA AAGTAAAGCA ATTTTAGTAT CAAATAAACC TTCTTTGACT GAGAGAATGG 3480

AAGAAGAAAT GTTTCTTGGG TTGCGTTTAA ATGAAGGTGT GAGTAGTAGT AGGTTCAAAA 3540

AGAAGTTTGA CCAATCTATT GAAAGTGTCT TTGGTCAAAC AATAAATAAT TTAAAAGAGA 3600

AGGAATTAAT TGTAGAAAAG AACGATGTGA TTGCACTTAC AAATAGAGGG AAAGTCATAG 3660

GTAATGAGGT TTTTGAAGCT TTCCTAATAA ATGATTAAAA AAAATTGAAA TTTCGAGTCT 3720

TTAACATTGA CTTACTTTGA CCAATTTGAT AAATTATAAT TAGCACTTGA GATAAGTGAG 3780

TGCTAATGAG GTGAAAACAT GATTACAGAT AGGCAATTGA GTATATTAAA CGCAATTGTT 3840

GAGGATTATG TTGATTTTGG ACAACCCGTT GGTTCTAAAA CACTAATTGA GCGACATAAC 3900

TTGAATGTTA GTCCTGCTAC AATTAGAAAT GAGATGAAAC AGCTTGAAGA TTTAAACTAT 3960

ATCGAGAAGA CACATAGTTC TTCAGGGCGT TCGCCATCAC AATTAGGTTT TAGGTATTAT 4020

GTCAATCGTT TACTTGAACA AACATCTCAT CAAAAAACAA ATAAATTAAG ACGATTAAAT 4080

CAATTGTTAG TTGAGAATCA ATATGATGTA TCATCAGCAT TGACATATTT TGCAGATGAA 4140

TTATCAAATA TATCTCAATA TACAACTTTA GTTGTTCATC CTAATCATAA ACAAGATATT 4200

ATCAATAATG TACACTTGAT TCGTGCTAAT CCTAATTTAG TTATAATGGT TAT 4253


(2) INFORMATION FOR SEQ ID NO: 119:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3395 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 119:






TCCCTAATCG AACAAAATTA TGCGCATAAA CAAAGTAGAT TGATATAAAA TTCTTAATTA 60

TCAGAATATA TTTACAAATC TGAATTTTAT TAGTATATTG GrTAGTrTTC ATAGAGGCAT 120

GACGGTaTTT GAGCAGGATT TTAAATCGGg ATTTTATAAT CGATTTAAGA GAGGCCACtT 180

TGCTTGcACA TTAATACTGT CAATGGGAGG GGAATGTATA TGAGTrAAGC ACATCAATTA 240

ATTCAAGAGG ATGAACATTA TTTTGCGAAA TCAGGACGTA TTAAATATTA TCCGTTAGTG 300

ATTGATCATG GATATGGAGC AACATTGGTT GATATTGAGG GGAAGACATA TATCGATTTG 360

TTATCGAGTG CGAGTTCTCA AAACGTAGGT CATGCACCTA GAGAAGTAAC AGAAGCGATA 420

GTACGTTTAG CTAAGAAGCT TTGTGAGATT GCACCTGGAG ATTTTGAAAA AAGAGTGACC 540

TTCGGATTAA CCGGATCAGA CGCAAATGAT GGCATCATTA AATTTGCCAG AGCATATACA 600 

GGGCGTCCTT ATATCATTAG TTTCACTAAT GCATATCATG GTTCAACTTT TGGCTCATTG 660 

TCTATGTCAG CTATTAGTTT AAATATGCGC AAACATTATG GTCCGTTATT GAATGGTTTT 720 

TATCATATTC CGTTTCCAGA TAAATATCGT GGTATGTACG AGCAGCCACA AGCTAATTCA 780 

GTAGAAGAAT ATTTAGCACC CTTAAAAGAA ATGTTTGCGA AGTATGTACC TGCTGACGAA 840

GTAGCATGTA TTGTTATTGA AACGATACAA GGCGATGGTG GACTTTTAGA ACCAGTTCCA 900 

GGGTATTTTG AAGCGTTAGA AAAGATTTGT CGTGAACATG GTATTTTAAT CGCTGTCGAT 960

GATATTCAAC AAGGTTTTGG GAGAACAGGT ACATGGAGTT CAGTCTCGCA TTTTAATTTT 1020 

ACGCCTGATT TAATCACTTT CGGAAAATCC TTAGCAGGTG GTATGCCTAT GTCAGCAATT 1080 

GTTGGACGCA AAGAGATTAT GAATTGTTTA GAAGCACCAG CACATTTATT TACAACAGGT 1140

GCTAATCCAG TTAGTTGTGA AGCTGCATTA GCCACAATTC AAATGATTGA AGATCAGTCG 1200 

CTTCTTCAGG CTAGTGCGGA AAAAGGGGAA TATGTTAGGA AACGAATGGA TCAATGGGTA 1260 

TCTAAATACA ATAGTGTAGG CGATGTTAGA GGTAAAGGTC TGAGCATTGG TATTGATATT 1320 

GTTTCCGACA AAAAACTCAA AACACGTGAT GCCAGTGCGG CACTTAAAAT TTGTAATTAC 1380 

TGCTTTGAGC ATGGCGTAGT TATTATAGCT GTAGCAGGAA ATGTGTTGCG ATTCCAACCG 1440

CCATTGGTAA TAACATATGA GCAATTAGAC ACGGCGTTAA ACACTATAGA AGATGCACTG 1500 

ACTGCTTTGG AAGCAGGTAA CTTAGATCAA TATGACATAT CTGGACAAGG TTGGTAATAG 1560 

CGATTATCTT AATATAAAAT AAAAAATCAT TTCCACATCT GGATGTTAAT CAGATGGGAA 1620

ATGATTTTTT TTATTTTTTA TTTTGGTGGG TGGTATTCAG CTACGTCATT TTTCTTAGAA 1680 

TGTCTAAGTC CATAACTTAA ATATAGGATG ATACCAACAA TAAACCAAAT TAAAGTGTAT 1740 

AATTTCGCTT CGAATCCTAA TCCCCAGAAT ACTAGCAATA CTAAAACAAA TGTAATTGCT 1800

GGTAACACAG GATATAAAGG TAATTTAAAT GCAGGAATTG GTAGATCTTT ACCTTcACGC 1860 

TTTCTCAAAC GATACATTGC TAATGAAACG AACATAAATG CAACAAGTGT ACCTGCTGAA 1920 

ATTAATTGTG CTAAAAATGC GAATGGGAAC ATAGAACCAA TTAAAACACC AATAATAGTA 1980

AGTATAACTA GTGCGCGATT AGGTAAATGT TTGTCGTTTA AGTGGCTTAA CCATGAAGGT 2040

AATAAGCCGT CACGTCCAAA TGAATAAAGT AAACGTGAGC CTGCTAACAT CATACCAATT 2100 

AATGCTGTAA ACATACCGAT AACAGAGATA GCTTGAACAA TAGCTGCTAC AACACCATGA 2160 

CCACTTTGAC GTAAAGCCCA ACCAACAGGT TCAGCATTGT TTGCGTATTG TGAGTAATGG 2220

CCAAGAATAC CTCTAGGCAT TGTCTTTTGA GGATCAAGTG CTTCTGCTGA GTTTGCTGCG 2340

ATAGAATCGA AACCGATATA CGCTAAGAAA ATCATTGAAA CACCAGCATA TATGCCTTGC 2400 

CATCCACCAA AGTCACCTGT AGCAGTTACT TTGTGTTCTG GAATAAATGG CACATAGTTA 2460 

CTAACATTTA TTGCTGTTAA ACCTACGATG ACAAATAAAA TAATAGCTAA TACTTTTAAA 2520 

ATAACTAAAA TATTTTCCAT ACGAGCTGCT TCCGACATAC CACGTGATAG TAATAATGCA 2580 

GTTAATAAAA TAACGATAGC AGCAATAATA TCGATAAAAC CGCCATTTGT ACCAAATGGA 2640 

TTTGATAATG CTGCAGGTAA TTCGATGCCA ATTGGTTTCA CAAGTCCGCG TAAATTCGCT 2700 

GAGAATCCTG ATGCAACAAA GGCTACGGCG ATAAAATATT CAGCTAATAG AGCCCAACCG 2760 

GCAACCCATC CAAAAAATTC ACCAAATAAT ACATTGACCC AAGAATAGGC TGAACCTGCA 2820 

AATGGCATAG CGGCAGCCAT TTCTGCATAA GTAAATGCAA CTAAACCAGC AACAATAGCA 2880 

GCGAGTAAGA ATGATAACGC AACGGCCGGT CCTGCATGTT CTGCAGCAAC AATGCCAGGT 2940

AGCGTAAAGA TAGATGTCGA TACAATTGTT CCTACACCTA AAGCTAAGAA ATCACGCACC 3000 

CGAAGTGTAC GCTTTAAATG ACCATCTTTA TTTTGATAGA TAGCCGGATC CTCTTTTCGT 3060 

GCTATTTTAT TGAAAAAACT TCCCATAAAC TTTCCTCCCA AACATTCATA AACAATTCTA 3120 

TACGGTGTTT TTTAATATGT TATATCATAG CACAAATAAT CAATATTTTG TCTAAAAATT 3180 

CTGAAAAATC ACAACTTTAT GTTACGTATT AATGACTTGT CTTGATAACA TCCATAGATT 3240

TTTTAAATGA TAAAACTGAT TATAACAGAT ATTAAATGAA TAAGTACTAT TTTTTGCnAA 3300 

TTTTCTAACA ATTTTGCACA TTATATGTTT AAAATCAATT TCATGTTTAT GGTCTGATTG 3360 

GCTAGTGTGT ATGAAATGTA AnTCTTTGAC TnnGA 3395 
(2) INFORMATION FOR SEQ ID NO: 120:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13508 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 120:

ATCAGGTAAT GCCATGCGTT TAGCTGAAAA TTTTTTCAGA ACGTTTAAGT GATATCGGAC 60 

ATCAAGTTGT TTTGATGTCA ATGGATGAAT ATGATACGAC AAACATCGCG CAGTTAGAAG 120 

ATTTATTTAT TATTACGTCT ACTCATGGTG AAGGAGAACC GCCTGATAAT GCATGGGATT 180 

TCTTTGAATT TTTAGAAGAC GATAACGCAC CTAATTTAAA TCATGTGAGA TATTCAGTAC 240

TACTAGAAAA TCTAGGCGCT GAGCGTATAT GTAAGCGTGT AGATTGTGAT ATTGATTATG 360

AAGAAGACGC AGAAAAGTGG ATGGCAGACA TCATTAATAT TATTGATACC ACATCAGAAG 420 

GTATTCAAAG TGAATCGGTG ATAAGTGAAT CAATTAAGTC TGCCAAAGAA AAGAAATATT 480 

CTAAATCAAA TCCATACCAA GCAGAAGTAT TAGCGAATAT CAATTTAAAT GGTACCGATT 540

CAAATAAAGA AACACGACAT ATAGAATTTT TACTTGATGA TTTTAGTGAA TCATATGAAC 600

CAGGAGATTG TATAGTAGCA TTACCGCAAA ACGACCCTGA ATTGGTTGAA AAACTAATAT 660

CCATGTTAGG TTGGGATCCG CAATCTCCGG TGCCAATTAA TGATCATGGT GATACAGTTC 720 

CTATTGTTGA AGCACTAACA TCACATTTTG AATTTACTAA ATTAACATTG CCATTATTGA 780 

AAAATGCAGA TATCTATTTT GACAATGAAG AATTATCTGA ACGTATTCAA GATGAGTCAT 840

GGGCGCGTGA ATATGTTATA AATCGGGACT TTATAGATTT AATAACAGAT TTTCCAACTA 900

TAGAATTACA ACCTGAGAAT ATGTATCAAA TCCTTAGAAA ATTACCACCA AGAGAGTATT 960

CGATTTCTAG TAGTTTTATG GCAACGCcAG ATGAAGTGCA TATTACCGTT GGTACGGTTC 1020 

GTTATCAAGC ACATGGACGT GAGAGAAAAG GTGTATGCTC GGTTCATTTT GCTGAGCGAA 1080 

TTAAACCAGG CGATATAGTA CCAATTTATT TGAAGAAAAA TCCGAACTTC AAATTTCCGA 1140 

TGAAGCAAGA TATACCGGTT ATTATGATTG GACCAGGTAC TGrAATTGCT CCTTTTAGAG 1200 

CATATTTACA AGAACGTGAA GAACTTGGTA TGACTGGAAA AACATGGTTG TTCTTTGGTG 1260 

ATCAACACCG TAGTTCTGAC TTTTTATATG AAGAAGAAAT AGAAGAATGG CTTGAAAATG 1320

GAAACTTAAC ACGCGTAGAT TTAGCATTTT CAAGAGACCA AGAACACAAA GAATATGTAC 1380

AGCATCGTAT AATGGAAGAA AGTAAACGTT TCAATGAATG GATTGAGCAA GGCGCACAAT 1440

CTATATTTGT GGCGATGAAA AATGTATGGC GAAAGATGTC CATCAAGCCA TTAAAGATGT 1500 

ATTGCTAAAA GAACGTCATA TTTCTCAAGA AGAAGCAGAG TTATTATTGC GACAAATGAA 1560

ACAACAACAA CGCTATCAAC GTGATGTTTA TTAGCGATTG GTGTTAAATA TTTTAAGGTG 1620

TAATGATGTA AAAAGATATA AAGGATGTTG CTCAACATGA ATATGCCATT AATGATAGAT 1680 

TTAACAAATA AAAATGTCGT CATAGTTGGT GGAGGCGTCG TTGCAAGTCG TCGGGCACAA 1740

ACATTAAATC AATACGTTGA ACATATGACG GTCATCAGTC CGACAATCAC TGAAAAACTT 1800

CAAAATATGG TAGATAACGG TGTCGTCATA TGGAAAGAAA AAGAATTTGA ACCAAGCGAT 1860 

ATTGTAGACG CGTATCTAGT TATTGCAGCA ACCAATGAGC CACGTGTCAA TGAAGCGGTA 1920 

AAAAAAGCCT TACCTGAGCA TGCCCTTTTT AATAATGTTG GAGATGCATC AAATGGCAAT 1980

GTTGTATTTC CAAGTGCACT ACACCGCGAC AAGCTAACTA TCAGTGTATC AACTGATGGT 2040

TACAGTTCGT ATATCGACTT TTTATATACT TGCCGACAGA AAATAAAAGT ACTTGATATA 2160

ACATATAACG AAAAGCAACA GTTACTGTCA CAAATTGTGT CACAAGAATA TTTAAATCAT 2220

GACAAACAAG CTCAATTTTT AGCGTGGTTG GATGTAAGAT AATAATAGCG GACCGTCTAA 2280

CCGTCTAAGG TAAGTCTTCT TATTTTAACT TTAACGCTTA ATCATTGAAA TTAAGACATG 2340

GGCGGCTTTG TGAATAGTCT AATAATGAAG GATTTAAGCG ATAATGATAT GCGTTTTAAA 2400

TATGAATATT ACAATAGAGA AAAAGATACG TAGAACAAAC TTAATAAAAT AGGTGGATAA 2460

ATTGAAATCT GGTTGAAGTC GTTACTATCA TAGCGACCTT TAGCCAGATT TTTTGTGCAA 2520

TAGAAAGCAA TAATAAAAAT GATAGATCAA AATGAAATAC AGGACAGGAT ATACAAGGAT 2580

TAGTCATGCC ATGTTATCAA GTAGGAAAAT CAAACTTCAC TATTGATAGT TACGCAAAAA 2640

AGATTTTTTT GATAAAATGA GATAACTTAA ATATAAAAAA TTATATTAAT TATAATATTT 2700

AAGTTAAAGA GGGGGATTAT GTAAATTGTA TTAAAAGTGG AGGGAGAAAA TAATATGAAT 2760

AGTGATAATA TGTGGTTAAC AGTAATGGGG CTCATTATTA TTATTTCAAT TGTAGGTTTA 2820

CTCATTGCCA AAAAGATAAA TCCAGTTGTA GGTATGACAA TCATACCTTG CTTAGGGGCA 2880

ATGATTTTAG GATATAGTGT GACAGATTTG GTTGGATTTT TTGCTAAAGG GTTAGATCAA 2940

GTCATCAACG TTGTTATTAT GTTTATCTTT GCCATTATTT TCTTTGGCAT CATGAACGAT 3000

AGTGGTTTAT TCAAGCCGCT TGTCAAACGC TTAATATTAA TGACACGAGG CAATGTCGTC 3060

ATTGTCTGTG CAATGACAGC TTTAATTGGC ACAATAGCCC AATTAGATGG GGCCGGTGCG 3120

GTAACATTTT TGCTTTCTAT TCCTGCATTA TTACCTTTAT ATAAAGCGTT AAATATGAAT 3180

AAATATTTAT TGATTTTACT ATTAGCATTA AGCGCGGCGA TTATGAACAT GGTACCTTGG 3240

GGAGGTCCAA TGGCTCGTGT AGCTGCAGTG TTAAAAGCCA AAAGTGTCAA TGAATTATGG 3300

TATGDATTAA TACCTATTCA AATAATAGGT TTCATTCTTG TTATGTTGTT TGCGGTATAT 3360

CTTGGATTTA AAGAACAGAA ACGTATCAAA AAAGCAATAG AGAGAAATGA ATTACCGCAA 3420

ACACAAGATA TAGATGTACA TAAATTAGTT GAAGTATATG AACGAGATCA AGATGTAAGG 3480

TTTCCTGTAA AAGGACGTGC AAGAACAAAA TCATGGATAA AATGGGTGAA TACAGCTTTA 3540

ACTTTAGCTG TTATTCTATC GATGTTAATA AATATTGCGC CACCTGAATT TGCATTCATG 3600

ATAGGTGTTy CGTTGGCACT TGTTATTAAT TTTAAATCAG TGGATGAACA AATGGAACGA 3660

TTAAGAGCgC ATGCGCCGAA TGCATTAATG ATGGCTGCAG TGATTATTGC AGCAGGTATG 3720

TTTTTAGGTG TACTAAATGA AACCGGTATG CTTAAAGCGA TTGCGACCAA TTTAATCAAA 3780

GTGATTCCTG CAGAAGTAGG ACCATACTTG CATATTATTG TAGGTTTACT TGGCGTACCA 3840

ACAGCAGGGC AATTTGGTGT ACCGTCTGTA TCAACAGCTT ATTCAATGGT CATAGGGAAT 3960

ATTATAGGTA CATTTGTCAG CCCATTTTCA CCAGCCTTAT GGTTGGCAAT TGGTTTAGCA 4020 

GAGGCAAACA TGGGCACGTA TATTAAGTAT GCATTCTTTT GGATTTGGGG ATTCGCTATC 4080 

GTTATGTTAG TAATTGCAAT GTTGATGGGC ATTGTGACGA TTTAAGTATG AAAAAATAGA 4140 

AACTATGGTC ACGTTGCAAA ATGAAATAAT AGTTGCATAA ACATGTCGAA ATGACGGACG 4200

AATCTTTAAA CAATTTTAAA AATTAATGAA ATAATTGTGT AGAAATATGA ATTTCACTAA 4260 

ATGTTAATAA CTTTGTGACG TTTTAGTTAA CAGACTAATA AAAATTTGAA AATACTATAT 4320

ATAGTGGTAT AACGTAATGA GTAGACACAA TATATAGGAA GAAGGGGTAA AATGAATCAA 4380

ATCGAAGAAG CATTAACGGG TTTGATTTCT AAAGATCCTG CTATTGTTAA CGAAAATGCT 4440

AACAAAGATA GTGATACATT TTCAACAATG AGAGATTTAA CAGCAGGTAT CGTTTCTAAA 4500 

20 TCTTACGCAT TAAATCATTT ATTACCAAAG CACGTTGCAG ATGCACATCA AAGAGGGGAC 4560 

ATACATTTTC ACGACTTAGA TTATCATCCA TTCCAACCGT TAACTAACTG TTGTTTAATA 4620 

GATGCTAAAA ATATGCTACA TAATGGATTT GAAATAGGCA ACGCGAATGT AACTTCACCA 4680 

25 AAATCAATAC AAACTGCATC AGCGCAGCTT GTACAAATTA TAGCCAATGT TTCTAGCAGT 4 740 

CAATATGGTG GCTGTAcGGT TGACCgCGTT GACGAATTAC t TAGTACATA TGCACGACcA 4 800 

TAATGAAGAA CAACATAGGA ATATsCGCAA AGCAATTTGT CAAAGAATCT GAAATTGATC 4860 

GTTATGTTGA TCAACAAGTC ACTAAAGACA TCAATGATGC GATTGAAAGT TTAGAATATG 4920 

AAATTAATAC CTTATATACA TCTAATGGAC AGACACCTTT TGTAACATTA GGATTCGGCT 4980 

TAGGTACAGA TCATTTAAGT CGCAAAATTC AACAAGCTAT CTTAAATACT CGTATCAAAG 5040 

GCTTAGGAAA AGACCGCACG ACAGCGATTT TCCCGAAACT TGTATTTTCA ATTAAAAAAG 5100 
GAACQAACTT TAGTCCGCAA GATCCGAACT ATGACATTAA ACAACTAGCA TTAAAGTGTT . 5160 

CAACGAAACG TATGTATCCA GATATTTTAA ATTATGACAA ACTCGTAGAA ATATTAGGTG 522 0 

ATTTCAAAGC GCCAATGGGT TGTCGTTCAT TTTTACCAAG TTGGAAAGAT GCGGAAGGTC 5280 

ATTTTGAAAA TAATGGTCGT TGTAATCTTG GTGTTGTTAC ACTTAATTTA CCTAGAATGG 5340 

45 CATTAGAATC TGCCGGTAAT ATGACGAAAT TCTGGGAAAT CTTTTATGAA CGTATCGATG 54 00 

TGTTACATGA TGCATTACTT TATCGTATAA ATCGTTTGAA AGATGCTGTA CCGAATAACG 54 60 

CACCGATTTT ATATAAAAGT GGCGCATTTA ACTATAAATT AAAAGAAACA GATGATGTTG 5520 

CTGAGTTATT TAAAAATAAA CGTGCAACGA TTTCAATGGG CTATATAGGG TTGTATGAAA 5580 

CAGCTACTGT TTTCTATGGT CCAGACTGGG AAACATCTCA AGAAGCAAAA GCATTTACGC 5640 



30 
GGTTCAGTAT TTmCAGTACG CCGAGTGAAT CGCTAcGGAT CGTTTTTGTC GTTTAGACCA 5760

AGAGAGATTT GGAGATATTA AAGACATTAC AGATAAAGGA TATTATCAAA ACTCTTTCCA 5820 

5 

TTATGATGTA CGTAAAGATG TTACACCTTT TGAAAAGTTA GATTTTGAAA AAGATTATCC 5880 

TTATTATGCG AGTGGTGGTT TCATTCACTA TTGTGAGTAT CCGAAATTGC AACACAATTT 5940 

GAAAGCACTA GAAGCGGTAT GGGACTACTC TTATGACAAA GTTGGTTACT TAGGTACAAA 6000 

10 

TATTCCGATT GATCATTGTT ATGAATGTGA TTACGATGGA GATTTTGAAG CAACTGAAAA 6060 

AGGATTTAAA TGCCCGAACT GTGGCAATGA TAATCCTAAA ACAGTTGATG TCGTTAAACG 6120 

1S AACATGTGGT TACCTAGGCA ATCCAGTTCA ACGTCCAGTA ATTAAAGGCC GTCATAAAGA 6180 

AATTTGCGCA CGAGTAAAAC ATATGAAAGC GCCTAAAGAA TGATACTTTT AGACATTAAA 624 0 

CAAGGACAAG GTTATATTGC TAAAATAGAA TCAAATAGCT TTGTTGACGG TGAAGGAGTA 6300 

20 AGATGCAGTG TTTATGTATC AGGATGTCCA TTTAATTGTG TTGGATGTTA TAACAAAGCC 6360 

TCACAAAAGT TCAGATATGG CGAGAAATAC ACTGATGAAA TATTAGCAGA AATATTAGAT 6420 

GATTGCGATC ATGATTATAT ATCTGGGCTA AGTCTATTAG GTGGCGAACC ATTTTGTAAT 6480 

25 TTGGATATTA CATTAAATCT TGTCAAAGCA TTTCGAGCAC GTTTTGGAAA TACAAAGACA 654 0 

ATTTGGGTAT GGACTGGATT . TTTATATGAA TATTTAGCAA ATGATTGTAC AGAACGTCGA 6600 

GAGTTATTAT CATACATTGA CGTTTTAGTA GATGGTCTAT TTATACAACA CTTATTCAAA 6660 

30 

CCTGATTTAC CATATAAAGG TTCTTTAAAT CAACGCATTA TAGATGTACA ACAATCACTC 6720 

TCGCATGCGC GTATGATTGA ATATATAGTT AGTTGAATAT GTATTAGAAG TCAAGGTAAC 6780 

ATTCGTTGCC TTGGCTTCTT TTTAGGTTAG GTACATAATT GAAAGTTAAT AAAAGCAATT 6840 

35 

CTTTATAAAA ATATATTGAT AGAATATGAC CTAACAATCA TTTTGATACC AATACTAAAA 6 900 

GTTGCATATC CGTTTTTTAA AAAAGTTGAA AGAGAAAAGT GGTATTTTAG TGGGAAGGAA 6960 

40 GTCTAACTTT TTGGTAGCGT TTTACAATAA ATAAATATTC GTTAATAACG TATAAATATT 7020 

CTTAAATGCC ATTCTAGTAA AATTTGTTAA ATTCGTTAAA TCGTAACTTA ACACTGTTAT 7080

TTTAGCGCTA TTAAGGTTTT GTTTATTACG GGAAAAATTA TATAAATATT CAATAATTGC 7140 

45 s . CAAGTTTCAA ATTGTATGAA ATTTGCATTA TTATTAAATG TTAGTTATTG TCAATTTTGT 7200 

GAATCAATAT AATTATTACA TTTTGAGATA AATCGAAACA GGATTCATAA AATTAATAAT 7260 

TAGGGGGAGC ACAATTGAAA AAAGAGAAAG TTATGGACTG GACGACCTTT ATAGGGACAG 7320 

SO 

TAGCTGTACT TCTTTTTGCA GTTATACCTA TGATGGCTTT TCCAAAAGCA AGTGAAGATA 73 BO 

TCATCACTGG TATTAATAGT GCCATTTCTG ATTCAATTGG TTCGATATAT TTATTTATGG 7440 

55 
TTGGTAAAGC AAGTGATAAA CCAGAATTTA ATACATTTAC ATGGGCGGCA ATGCTGTTTT 7560

GTGCAGGCAT AGGCTCTGAT ATTTTATACT GGGGCGTTAT TGAATGGGCT TTTTACTATC 7620 

AAGTTCCACC AAATGGCGCG AAAAGTATGA GTGATGAAGC ACTCCAATAT GCGACGCAAT 76 BO 

ATGGTATGTT CCACTGGGGG CCAATTGCTT GGGCTATTTA TGTTCTACCA GCATTACCAA 7740 

TTGGTTATTT AGTATTTGTT AAAAAACAAC CGGTGTATAA AATTAGTCAA GCTTGTCGTC 7800

10 

CGATTTTAAA AGGTCAAACA GATAAATTTG TAGGTAAAGT TGTAGATATC TTATTTATCT 7860 

TTGGATTGCT AGGTGGTGCG GCAACATCAC TAGCGTTAGG TGTGCCATTA ATTTCTGCAG 7920 

1S GCATAGAAAG ATTAACTGGT TTAGATGGTA AAAATATGAT TTTACGTTCG GCCATTTTAT 7980 

TAACAATCAC GGTTATATTT GCCATTAGTT CATATACAGG ATTGAAAAAA GGTATTCAAA 8040 

AGTTAAGTGA TATCAACGTT TGGCTATCCT TTGTACTTTT AGCCTTTATA TTTATTATTG 8100 

20 GACCGACTGT TTTTATTATG GAAACGACAG TGACAGGGTT CGGAAATATG TTGAGAGATT 8160 

TCTTTCATAT GGCAACATGG TTAGAACCAT TCGGTGGTAT TAAAGGTCGA AAAGAAACGA 8220 

ATTTCCCACA AGACTGGACA ATATTCTACT GGTCATGGTG GTTAGTATAT. GCGCCATTTA 8280 

25 

TCGGTTTATT TATCGCTAGA ATTTCAAAAG GTCGACGCCT TAAAGAAGTC GTGCTAGGAA 834 0 

CAATTATTTA TGGAACGCTT GGATGCGTAT TATTCTTTGG TATTTTTGGT AACTATGCTG 8400 

TGTATTTACA AATTTCTGGA CAGTTTAATG TAACACAATA TTTAAATACA CATGGTACAG 846 0 

30 

AGGCAACCAT TATTGAAGTG- GTGCATCATT TACCATTCCC ATCATTGATG ATTGTACTAT 852 0 

TCTTAGTATC TGCTTTCTTA TTCTTAGCAA CAACATTTGA TTCGGGTTCA TATATTTTAG 8580 

CGGCAGCATC TCAGAAAAAA GTGGTAGGCG AACCATTACG TGCCAATCGT TTATTCTGGG 864 0 

35 

CATTTGCATT GTGCTTATTG CCATTTTCAT TGATGCTAGT TGGTGGTGAA CGTGCATTAG 8700 

AAGTWTTGAA AACTGCTTCA ATACTGGCAA GTGTGCCATT AATTGTTATT TTTATTTTCA 8760 

40 TGATGATATC ATTTTTAATC ATTTTAGGGC GCGATAGAAT TAAACTTGAA ACGCGTGCTG 8820 

AAAAATTAAA AGAAGTTGAA CGTCGTTCAT TGCGAATCGT TCAAGTATCa GAAGAAGAAC 8880 

AAGACGATAA TTTATAATTC AAAGCGGGTC TGGGACGACG AAATGaATTT TGTGAAAATA 894 0 

45 TCATTTCTGT TCCaTTCCCC TTTTTTTAGT AGCATTGTAG GATGAACTTT TAGGTTTTCA 900 0 

TTAATGTTGT ACTAAAAGAT TTAATTTTTT AGTGCTCCAA GTACTTATTT ATTGTATGAA ■ 9060 

GCATATTCTA AATCGAAGTT TGAAAGACTC TCATTGATTA TTAAATTAAA TAAAGGGTAT 9120 

50 

GCGTATGTAC AATTCAAATT AATCGAAGGA TGAAATAAAA TGACTAATCA ATTTAAAAAT 9180 

AAACAGTCCA AATTACATGA CAGTTTAGAA TCCATCACAA AAAACTTATA TGCGACACCT 924 0 
ACAGAAXATT GTTATCTATC ATTCCGGACA
ATTAGGTGGT GTTTCAAAAG TATTGATGAA 

5 

TCAAGTTGAA GCCCCTTACT TTATACATGA 
TTCTGTTCAA AAACAATTTA GTAATCGTGT 
CGCGCCTGGA CATACACCAG GGACGACACT 

10 

CTTATTTACT GGAGATTTTA TATGTTTTGA 
TTCAAGTGAT AGAGAAAAAT CTATTCAAAG 

1S TGTACTTGTA CCTTGGGTTA CTATCAAAGA 

ATATGAAAAA CGTGAACAAA TACAAAATAT 
CTAATTGAAA TATATTGGCG AAgCAATGTA 

20 CCATAATTGA TTGTCATATA AAACAAAAAC 

CCGTTTATTT CGTGGACCTA TTGCATTGTT 
TTGAGTCAAT TTTAATTTTC AGAATCAGAA 

25 ATTTCACCGC TTTCAATCGC TTCGGTTAAC 

TGATCAAGCT GAGCTTTAAT TTTTTGTGCC 
CTTTCAAGTA ATGGATTGAT AATAGCTTGA 

30 

ATTTCACGCT GGAATTTTTT AGTGAAAAAG 
GCGATAAAAT GTTGTTTGCG GCTTCCTCTC 
TGCAATTTCT TAACGCCAGC ACTCATACTT 

35 

TCATCAAGCG TCATACTGCC TTCAAACACC 
TTAGTGCCAT ACAAATCCAT TGTCTGTCCA 

40 TCTAATTGTT GCTGTTTGTT CTGAGAACGA 
GTCACGTTAA AATAAATACT AATACATTAT 
TATTTAAAGC ATTAAAGTTA AATGTTTCGT 

45 TAATTCTTGA GTTGTTTTGT AAATTGACAC 
TAAAATTcGT TAAATAAAAA TTTAACAAAC 
aACATTTATC TCAGCGTCAA TATATTGATG 

50 

CAAGAGATAT TATCAATCCT TACAATCAAG 
AAGAGGATGC AGAACGTGCA ATCTTAGCTG 

55 



CTTAGGTGAC TCCCAACAAG ACATTGCATC 9360 

TCATGATCAT GAATCTATAG GAGGTTCTAA 9420 

AAATGATGTG GCTGCACTGA AACATAAGAT 9480 

AATGTTGGAT AAGGATTTAG AAGTTATTCC 9540 

ATTTTTATGG GATGATGGTC ATCACCGTTA 9600 

AGGGAAGAGA TGGCGTACAG TTATATTAGG 9660 

TTTAGAGATG GTTAAAGAAT TAGATTTTGA 9720 

TGAACCGTTA GTTTATTTTG TAGAAAATGA 9780 

TATTGATAGA GTACGTGAGG GCGAGAATAG 9840 

ACGAATCTAA GAAAGCCCTA GAAAATACCT 9900 

GGTAATTTCT ATTTATTGAG ATAGAAATTA 9960 

TTTATCATGC ATAATCATCA TTGTCGTTGT 10020 

GGCTGTTCTG GAATTGGGAA ATATTTGAAA 10080 

TGTTCTAACC ATTCGTAATA AACATGTGTA 10140 

TCTTGTGTTT CAGCTTCAGT TAAATCACTG 10200 

GCATCTTTTA CTGCTTCGAC ATTGATGTCA 10260 

TTTCGGAAAA AGATGAAAAA GTCTTTCTCG 10320 

_ GTAAATTGTT GTTTAACAAT ATCAAATTCC 10380 

GGTTTGCTCA TTTGCAATTG ATGACGCATT 10440 

ATTGTGCCAT ATAAGTTTCC TACACTTCTA 10500 

ATTGAATTAA TTACAATATC TTTTGCTTGT 10560 

GTCATCATTG CACCTCCGTA CATCATTTTG 10620 

AAAACCTTTT CTAAAAAAAG ACATTAAAAA 10680 

TAAATAAAAA TCTAACGAAC TTACAAAACT 10740 

ATTTTTCATT TCTATGCTAA CATAAGTnTG 10800 

TTAACGGr GG TTGTTGAAkG GrACTTTTAA 10860 

GTGAGTGGGT TGAAAGCGCG AATAAAAATA 10920 

AAGTGATATT TACGGTTTCT GAAGGGACAA 10980 

CAAGACGTGC GTTTGAGTCT GGTGAATGGT 11040 
AACATCgCGA AgCgTTAGCA CGATTAGAAA CATTAGATAC TGGAAAAACG TTAGAAGAAT 11160

CATATGCAGA TATGGATGAT ATTCATAATG TGTTTATGTA TTTTGCTGGA TTAGCAGATA 11220 

5 

AAGACGGTGG CGAAATGATT GATTCACCAA TTCCAGATAC AGAAAGCAAA ATTGTTAAAG 11280 

AACCAGTAGG TGTAGTTACA CAAATTACAC CTTGGAATTA TCCGTTATTA CAAGCATCAT 1134 0 

1Q GGAAAATTGC GCCAGCGCTT GCTACGGGTT GTTCACTAGT TATGAAACCA AGTGAAATTA 11400 

CACCATTAAC AACAATACGT GTTTTTGAAT TAATGGAAGA AGTTGGTTTC CCTAAAGGAA 11460 

CAATTAATCT TATTCTAGGT GCAGGTTCTG AAGTTGGTGA CGTAATGTCA GGTCATAAAG 11520 

15 AGGTTGACCT TGTATCATTT ACAGGTGGCA TTGAGACTGG TAAGCATATT ATGAAAAATG 11580 

CTGCTAATAA TGTTACGAAT ATTGCCTTGG AACTTGGCGG TAAAAATCCA AACATTATCT 11640 

TTGATGATGC TGATTTTGAA TTGGCAGTAG ACCAAGCGTT AAATGGTGGA TATTTCCATG 11700 

< 

20 CAGGTCAAGT TTGTTCAGCA GGATCAAGAA TATTAGTACA AAACAGTATT AAAGACAAAT 11760 

TTGAGCAAGC ACTTATTGAT CGCGTGAAAA AAATCAAATT AGGTAATGGT TTTGATGCTG 11820

ATACTGAAAT GGGACCAGTG ATTTCAACAG AACATCGTAA TAAGATCGAA TCTTATATGG 11880 

2S 

ATGTAGcTAA AGCAGAAGGC GCAACAATTG CTGTTGGTGG TAAACGTCCA GATAGAGATG 1194 0 

ATTTAAAAGA TGGTCTATTC TTCGAGCCAA CAGTCATTAC AAATTGTGAT ACGTCAATGC 1200 0 

GTATTGTACA AGAAGAGGTT TTCGGACCTG TCGTTACTGT AGAAGGCTTT GAAACTGAAC 12060 

30 

AAGAAGCGAT TCAATTAGCG AATGATTCTA TATATGGTTT AGCAGGTGCT GTATTTTCTA 1212 0 

AAGATATTGG AAAAGCACAA CGCGTTGCTA ACAAGTTGAA ACTTGGAACG GTGTGGATTA 12180 

3s ATGATTTCCA TCCATATTTT GCACAAGCGC CATGGGGTGG ATACAAACAA TCAGGTATCG 12240 

GTAGAGAATT AGGCAAAGAA GGCTTAGAAG AGTACCTTGT TTCAAAACAC ATTTTAACAA 123 00 

ATACAAATCC ACAATTAGTG AATTGGTTTA GCAAATAAAA ATTAGATAAG GTGAGTGCCA 12360 

40 TTGTAAGAAC ACAAGACACT CACTTTGTTT TGTATAAGTG GCGAAATGTT GATTGATAAT 12420 

TTGGACTAAA CGCAAAATGA ATCATAGATT ATTTCATTAC TGTTAGTAAC AATCGTAAAA 12480 

GGAAAAGCGA GTGTTTTGGT TAGCTAAGTT TAGCAATTCA ACGATAACCA ATCAGCCACT 12540 

4=> AACAAATATT TCATGCAATA CTCACTTTGA AATACAACAA ACTTTGGAGG TCATAACGAT 12600 

GAGTAACAAA AACAAATCAT ATGATTATGT CATCATTGGA GGAGGCAGTG CAGGTTCTGT 12660 

ACTAGGTAAT CGTCTGAGTG AAGATAAAGA TAAAGAAGTC TTAGTATTAG AAGCGGGTCG 1272 0 

SO 

CAGTGATTAT TTTTGGGATT TATTTATCCA AATGCCTGCT GCGTTAATGT TCCCTTCAGG 12780 

CAATAAATTT TACGATTGGA TTTATTCAAC AGATGAAGAA CCACATATGG GCGGTCGTAA 1284 0 
TCAACGTGGT AATCCAATGG ACTATGAAGG CTGGGCAGAA CCAGAAGGTA TGGAAACTTG 12960

GGATTTTGCG CACTGTTTAC CGTATTTTAA AAAATTAGAA AAAACATACG GTGCAGCGCC 13020 

5 

TTATGATAAA TTTAGAGGCC ATGATGGACC AATTAAGTTA AAACGAGGGC CAGCAACGAA 13080 

TCCTTTATTC CAGTCATTCT TTGATGCAGG TGTTGAAGCA GGCTATCATA AAACACCTGA 1314 0 

TGTGAATGGA TTTAGACAAG AAGGTTTTGG ACCGTTCGAT AGTCAAGTAC ATCGTGGTCG 13200 

10 

CCGAATGTCA GCTTCAAGAG. CATATTTACA TCCAGCGATG AAGCGTAAAA ACTTAACCGT 13260 

TGAAACACGT GCCTTTGTAA CTGAAATTCA TTATGAAGGT AGAAGAGCAA CTGGTGTTAC 13320 

15 GTATAAGAAA AATGGCAAAC TACATACCAT CGATGCTAAT GAAGTCATTT TGTCTGGTGG 13380 

GGCATTCAAT ACGCCACAAT TACTACAATT ATCTGGTATC GGTGATTCAG AGTTCCTAAA 13440 

ATCAAAAGGC ATTGAGCCAC GTGTTCATTT ACCTGGTGTG GGTGAAAACT TTGAAGATCA 13500 

CTTAGAGG 13508

(2) INFORMATION FOR SEQ ID NO: 121: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7646 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 121: 






GTAAGTATTG TCTTGATTTC 


CTAATAAAGT 


TATATCTTGT 


AATTCATCTT 


GTTGACGGCC 


60 


ATGTGCCATA TAAAGCGCTC 


CTTTAAATTT 


ATTTTTTTAT 


TATTTTGGCG 


TCTCGGCGTG 


120 


CTTTTTCAAA CATGTAATAA 


CTTGCACCGA 


TAATAACGAC 


GTAACCTAAT GTTGCATAGA 


180 


AATC^GGAGA TTCTCCGAAT 


AGAATAAATC 


CAAGTATTGC 


TGTGAAAATT 


ATAGATGCAT 


240 


ACGTAAAAAT AGAAATATCT 


TTTGCTGCTG 


CAAAACTATA 


TGCTAAAGTA ACACCAATTT 


300 


GACCCACAGC GGCAgCTAAG 


CCAGCCCCTA 


ATAGATAAAG 


TATTTGCATC 


TGACTCATTG 


360 


GTTCATAAGT ATATGCAGTG 


AAAGGTATTA AAACGATGAC 


AGAAAATAAG 


GAGAAGTAAA 


420 


ATACTATAGT ATATGGTGCT 


TyTCTTGTAC TAAGTGCTCG 


AACACATGTA 


TATGCTGATG 


480 


CTGCAAAAAT ACCTGAGAAT 


AAGCCAGCTA ATGATGGAAT 


CATAGATGAT 


GAAAATTCAG 


54 0 


GTTTCACTAT TAAnAGCAaC 


CTAAAATAGC 


AATTATCATT 


GCTGTAATTT 


GaTACTTCCT 


600 


TACCTTTTCA TGtAAGAaaA 


CAATGCTTaA 


TAAAATCGTC 


CAGAAAGGAT 


TGAGTTTCAT 


660 


TAATGAATCG GCATCACTAA 


GTACCATATG 


ATCAATGGCA 


TAAATATTTA 


ACAATACACC 


720 
TGGCTGATGG TATTTATATA TAAAAAATAA TGGAATAAAC ATTGCTACTA AGTTTCGTGC 840

TAATGATTTT TGAAAAACAG GAAGGTCACC TGCAAGTCTG AAAAACACTG ACATAAAACT 900 

5 

GAAACCAATA GCCGAAATTA AAATGGCAAT GATACCTTTT ACTTTAGGAT TCAATTTTAT 960 

CGCCTCTTTT ATATAAAATT AACGTATTTA TATTAGCATA AAACAACATG TTGTGCATAA 1020 

ATAGTTGAAA TTTACTATAA AAAGACTATA ATAGACTGTA GCGAACAAAC GTTCTGTGTT 1080 

10 

TATTTGTCGG AATAATAGGG CATTACACTT TTATGAATGT TTGTGTTATT ACATAAAACA 1140 

AATATCAATT CAGTATCAAG CTAATAAGCT TTTTCTTGAT TTCTGTTGAT ACAATTGAGA 1200 

75 TTGACACAGA TTTAAAAAAA TCAAGTGATA TCTACTAAAA AATTTTTTTA AATTTGTTCA 1260 

AGTTTTTCTA ATTTAGTATT GGTGCCTAGT TGGAACGTTT TACGAACATT CGATTAGAAA 1320 

ATGGCACTTT AAATCATAGT GTGTCTTATG TATAATGAAA CACATAATAT AGTGTTGGTG 1380 

20 AAACGAAAAA gACACAATAT CTTGTGTTTT GTATGCAAAT GCTTTATTTA TGAAGAAATT 144 0 

ACATTTAAAA GTAATTTAAC ACAGAAATTT AATAGTTATT ATCAATTAAT AGTCATATTT 1500 

TTAGAAAATG TACTGAGCAA ATGGAAGATA TCCAATGATG TAAACACTAC ATATAGTGAT 1560 

25 

TTTTATACAT TCAACCCATA TAAGCTACTA TTTTCTCAAA TATAAATCTA TGCAATTGGT 162 0. 

TTACATTTGA GAAAATAAGT AGCTTCATTA TAGTTAATAC AATGCTGAGA TAACCATAGT 1680 

AACCATGTTG TTAAAGCATT TTTTAATTGG AATGACTACT TTATTTAAAA GGGTTGAAGA 174 0 

30 

AAGAAGGTGA TCCAATGAAA ATAATATATT TTTCATTTAC TGGAAATGTC CGTCGTTTTA 1800 

TTAAGAGAAC AGAACTTGAA AATACGCTTG AGATTACAGC AGAAAATTGT ATGGAACCAG 1860 

TTCATGAACC GTTTATTATC GTTACTGGCA CTATTGGATT TGGAGAAGTA CCAGAACCCG 1920 

33 

TTCAATCTTT TTTAGAAGTT AATCATCAAT ACATCAGAGG TGTGGCAGCT AGCGGTAATC 1980 

GAAATTGGGG ACTAAATTTC GCAAAAGCGG GTCGCACGAT ATCAGAAGAG TATAATGTCC 2040 

40 CTTTATTAAT GAAGTTTGAG TTACATGGAA AAAACAAAGA CGTTATTGAA TTTAAGAACA 2100 

AGGTGGGTAA TTTTAATGAA AACCATGGAA GAGAAAAAGT ACAATCATAT TGAATTAAAT 2160 

AATGAGGTCA CTAAACGAaG AGAAGATGGA TTCTTTAGTT TAGAAAAAGA CCAAGAAGCT 2220 

45 TTAGTAGCTT ATTTAGAAGA AGTAAAAGAC AAAACAATCT TCTTCGACAC TGAAATCGAG 22 80 

CGTTTACGTT ATTTAGTAGA CAACGATTTT TATTTCAATG TGTTTGATAT TTATAGTGAA 2340 

GCGGATCTAA TTGAAATCAC TGATTATGCA AAATCAATCC CGTTTAATTT TGCAAGTTAT 2400 

SO 

ATGTCAGCTA GTAAATTTTT CAAAGATTAC GCTTTGAAAA CAAATGATAA AAGTCAATAC 2460 

TTAGAAGACT ATAATCAACA CGTTGCCATT GTTGCTTTAT ACCTAGCAAA TGGTAATAAA 2520 
ACATTTTTAA ACGCAGGCCG TGCGCGTCGT GGTGAGCTAG TGTCATGTTT CTTATTAGAA 2640

GTGGATGACA GCTTAAATTC AATTAACTTT ATTGATTCAA CTGCAAAACA ATTAAGTAAA 2700 

5 

ATTGGGGGCG GCGTTGCAAT TAACTTATCT AAATTGCGTG CACGTGGTGA AGCAATTAAA 2760 

GGAATTAAAG GCGTAgCGAA AGGCGTTTTA CCTATTGCTA AGTCACTTGA AGGTGGCTTT 2820 

AGCTATGCAG ATCAACTTGG TCAACGCCCT GGTGCTGGTG CTGTGTACTT AAATATCTTC 2880 

CATTATGATG TAGAAGAATT TTTAGATACT AAAAAAGTAA ATGCGGATGA AGATTTACGT 294 0 

TTATCTACAA TATCAACTGG TTTAATTGTT CCATCTAAAT TCTTCOATTT AGCTAAAGAA 3000 

75 GGTAAGGACT TTTATATGTT TGCACCTCAT ACAGTTAAAG AAGAATATGG TGTGACATTA 3060 

GACGATATCG ATTTAGAAAA ATATTATGAT GACATGGTTG CAAACCCAAA TGTTGAGAAA 3120 

AAGAAAAAGA ATGCGCGTGA AATGTTGAAT TTAATTGCGC AAACACAATT ACAATCAGGT 3180 

20 TATCCATATT TAATGTTTAA AGATAATGCT AACAGAGTGC ATCCGAATTC AAACATTGGA 3240 

CAAATTAAAA TGAGTAACTT ATGTACGGAA ATTTTCCAAC TACAAGAAAC TTCAATTATT 3300 

AATGACTATG GTATTGAAGA CGAAATTAAA CGTGATATTT CTTGTAACTT GGGCTCATTA 3360 

25 AATATTGTTA ATGTAATGGA AAGCGGAAAA TTCAGAGATT CAGTTCACTC TGGTATGGAC 3420 

GCATTAACTG TTGTGAGTGA TGTAGCAAAT ATTCAAAATG CACCAGGAGT TAGAAAAGCT 34 80 

AACAGTGAAT TACATTCAGT TGGTCTTGGT GTGATGAATT TACACGGTTA CCTAGCAAAA 3540 

30 

AATAAAATTG GTTATGAGTC AGAAGAAGCA AAAGATTTTG CAAATATCTT CTTTATGATG 3600 

ATGAATTTCT ACTCAATCGA ACGTTCAATG GAAATCGCTA AAGAGCGTGG TATCAAATAT 3660 

CAAGACTTTG AAAAGTCTGA TTATGCTAAT GGCAAATATT TCGAGTTCTA TACAACTCAA 3720 

35 

GAATTTGAAC CTCAATTCGA AAAAGTACGT GAATTATTCG ATGGTATGGC TATTCCTACT 3780 

TCTGAGGATT GGAAGAAACT ACAACAAGAT GTTGAACAAT ATGGTTTATA TCATGCATAT 384 0 

40 AGAtTAGCAA TTGCTCCAAC ACAAAGTATT TCTTATGTTC AAAATGCAAC AAGTTCTGTA 3900 

ATGCCAATCG TTGACCAAAT TGAACGTCGT ACTTATGGTA ATGCGGAAAC ATTTTACCCT 3960 

ATGCCATTCT TATCACCACA AACAATGTGG TACTACAAAT CAGCATTCAA TACTGATCAG 4 020 

45 ATGAAATTAA TCGATTTAAT TGCGACAATT CAAACGCATA TTGACCAAGG TATCTCAACG 4080 

ATCCTTTATG TTAATTCTGA AATTTCTACA CGTGAGTTAG CAAGATTATA TGTATATGCG 4140 

CACTATAAAG GATTAAAATC ACTTTACTAT ACTAGAAATA AATTATTAAG TGTAGAAGAA 4200 

50 

TGTACAAGTT GTTCTA7CTA ACAATTAAAT GTTGAAAATG ACAAACAGCT AATCATCTGG 4260 

TCTGAATTAG CAGATGATTA GACTGCTATG TCTGTATTTG TCAATTATTG AGTAACATTA 4320 
ATGTTTTGGA GACAAAATAT ATCTCAAATG TGGGTTGAAA CAGAATTTAA AGTATCAAAA 4440

GACATTGCAA GTTGGAAGAC TTTATCTGAA GCTGAACAAG ACACATTTAA AAAAGCATTA 4500 

5 

GCTGG TTTAA CAGGCTTAGA TACACATCAA GCAGATGATG GCATGCCTTT AGTTATGCTA 4560 

CATACGACTG ACTTAAGGAA AAAAGCAGTT TATTCATTTA TGGCGATGAT GGAGCAAATA 4620 

CACGCGAAAA GCTATTCACA TATTTTCACA ACACTATTAC CATCTAGTGA AaCAAACTAC 4680 

10 

CTATTAGATG AATGGGTTTT AGAGGAACCC CATTTAAAAT ATAAATCTGA TAAAATTGTT 4 740 

•< 

GCTAATTATC ACAAACTTTG GGGTAAAGAA GCTTCGATAT ACGACCAATA TATGGCCAGA 4 800 

?5 GTTACGAGTG TATTTTTAGA AACATTCTTA TTCTTCTCAG GTTTCTATTA TCCACTATAT 4860 

CTTGCTGGTC AAGGGAAAAT GACGACATCA GGTGAAATCA TTCGTAAAAT TCTTTTAGAT 4920 

GAATCTATTC ATGGTGTATT TACCGGTTTA GATGCACAGC ATTTACGAAA TGAACTATCT 4 980 

20 GAAAGTGAGA AACAAAAAGC AGATCAAGAA ATGTATAAAT TGCTAAATGA CTTGTATTTA 5040 

AATGAAGAGT CATACACAAA- AATGTTATAC GATGATCTTG GAATCACTGA AGATGTGCTA 5100 

AACTATGTTA AATATAATGG AAACAAAGCA CTTTCAAACT TAGGCTTTGa ACCTTATTTT 5160 

25 GAGGAACGTG AATTTAACCC AATCATTGAG AATGCCTTAG ATACAACAAC TAAAAACCAT 5220 

GACTTCTTCT CAGTAAAAGG TGATGGTTAT GTATTAGCAT TAAACGTAGA AGCATTACAA 5280 

GATGATGACT TTGTATTTGA CAACAAATAA CAATTAAATT AAAAGACCTT CACATGTAAA 5340 

30 

GGGAAATAGC GATTCGTTTC GTCTTGTCTC CTACATGTTG AAGGTCTTTT TTTATGTGTA 5400 

TCTAACTCAT TATGAGTCTG AGTAAGAAAT CAATGCTCTA AGATGTACAA TGCTATTTAT 546 0 

ATTGGCAGTA GTTGGCGGGG CCCCAACACA GAAGCAGGCG GAAAGTCAGC TAACAATATT 552 0 

35 V 

GTGCAAGTTG GCGGGGCCCC AACATAGAAG CAGGCGGAAA GTCAGCTAAC AATAATGTGC 5580 

AAGTTGGCGG GGCCCCAACA TAAAAGCAGG CGGAAAGTCA GCTAACAATA TTGTGCAAGT 5640 

40 TCGGgCGGGG CCCCAACATA AAGAAAAACT TTTTCCTTTA GAAATTATCA CTTCCaCaTG 5700 

AGTTTTACTC ATGTATTCCT ATTTTTAAGT ACACATTAGC TGAGGCTAAT GTTAAGAACC 5760 

ACTACTTAAT CAATCATTAG TAGTTTTTAT CATTTCCACT ATTCCCaGAC ATCaAAATCT 5820

45 TAAGTGTTCT ATTTTACTTT AAGTAAACAA AATACACATT CCGAAAAATT AAATTTCAGT 5880 

TTAATTGCAA ATATCAATAA AATTGACACT AAATTATTTG AAAGGCTATT GAAATTATGG 5 94 0 

TCAAAAAACG CTACTATTAA TGAGAAATAT TATCAATGAT AATGATTATC ATTAATTTAA 6000 

SO . • 

AGGGAGAAAA ATTTGTAATG AAGTATTTAT TAAAGGGAAA TATTTTGCTT CTATTACTAA 6060 

TATTGTTGAC AATTATTTCG TTGTTCATAG GTGTGAGTGA ACTATCAATT AAAGATTTAC 612 0 
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wlAi X X infti 


TGTTGGAAGT 


• 

TCGTTGGCTT 


TAGCAGGCTT GATAATGCAA CAAA X ga x vj»l. 


6240 


a a a at aa<ttt 


TGTTAGTP PG 


ACTACAGCTG 


GAACGATGGA ATvGGCTAAA L IAWjia; i 1 


€300 


IJ'wl 1 X 


AX XVjXX^XXX 


CCAACCGGTC 


ATA x X 1 X ATT AAAACTAG x A ill OL. xvj x X A 


6360 


T W T*TY2 f A rt *P & T 


TTGPOGTA PC 


TTTTTATTTG 


TTAAAATCAT TGA X X X X AX A AAAUiuAftAb 


6420 




TGTAPPGPTT 


TTAGGAATTA 


TGATGGGT GG GATTGTTGuA AG 1 X CALAAL 


64B0 


V»X X\-jH.X V X L-rt 


TTRPGPAPGA 


ATGCTGTTCA 


AAGCATTGGT AACTGGCTTA ACGGGAACTT 


654 0 


ioLwii XAX \- 


A P A Art Tfifi AP 


GCTATGAAAT 


TTTATATTTA AGTATTCCTC XxxXAGUAlX 


6600 




I ■ ■ IN I u t. Wf*TV >V ^Pf 

XXX VjV— X MAX V— 


ATTTCACGAT 


TGTAGGAATG GGTAAAGACT xxACTAATAA 


6660 


X X iKw X X i« 


ariTT A fY5 A AA 


AATTAATTAA 


CATCGCATTG TTTATTACTG CAAuiATTAL 


6720 


AGL-AX XVro XM 


O X VjV> X fcJMv, X V3 


TTGGAACATT 


ACCGTTCTTA GGACTAGTAA TACCAAATAT 


6780 


m]i m^™mi^» » » ^*T* 
ini 1 X Lnn.1 X 


^wnw X O 


ATCATTTGAA 


AAATGCTATC CCTCATACGA TGATGTTAGG 


6840 


TGCCATCTTT 


olnl lni 1 x X 


CTGATATAGT 


TGGCAGAATT GTTGTTTATC CATATGAAAT 


6900 


AAAXAx 1GG1 


J. iHALAAlMu 


GTGTATTTGG 


AACAATCATT TTCCTTATCT TGCTTATGAA 


6960 


AGO X AGGAAA 


AAx iniuLkjL 


aACAATAATA 


aAAAAATAAT GCTTTTAATT GCAGTAACGT 


7020 


^TV ^^PTv TV^^PTVf? 

XAX 1 AAX X AO 


TaTY^nYiTar 


TTATTTGTAG 


G TATTGATTT TGAAATATTT GAATATCAAT 


7080 


ill LAAw 1 L.G 


TTTa araaar 

1 I IAauaaAu 


TTCATATTAA 


TTATTTTAGT AGGTGCTGCC ATTGCAACTT 


7140 


UAo XGVj X LjA X 


XIX ILaaLjLu 


ATTACAAATA 


ACCGTCTATT GACACCATCA ATAATGGGGT 


7200 


X AGA x G LAu X 


^p^p a ^p^p^p & ^p^p^p 
1 1A1 1 1a1 X X 


ATCAAAGTAT 


TGCCAGTCTT TTTATTTGGA ATTCAATCGG 


7260 




TAATPTATAT 


TTGAACTTTA 


TATTAACACT TATAACGATG GrGxXAxXtJG 


7320 


*ta rtpht 


ATTPPAARrtT 

nil L>wVWU X 


ATCTTTAAAA 


TCGGACATTT TTCAATTTAT TTT AT C TT AC 


7380 


*T*^P ft ^PHP^^T 1 !*? T* 
X x/\X AwlvjJ. 


CLl X X InlJUn 


ACATTTTTTA 


GAAGCATAAC AGG X X X X A X T LAAL X GA X X A 


744 0 


X Ijwii l«Llun 


O X ^-rV.X XXX in 


GCAATACAAA 


GXAGXAXGIX IGLXAAXllX AAXGuXXUXA 


7500 


Al X W**J/V\X X X 


AGTTAPTTTP 


TCAGCAGTGC 


TATTAGTAAT CTTATTAGTC ATTACAATTT 


7560 


TACTATTGCC 


TTATTTAGAT 


GTATTCjCTTT 


TAGGTCGTGC TGAAGCAATT AATCTTGGGA 


7620 


TATCGTATGA 


AAAATTAACG 


CGAATT 




7646 


(2) INFORMATION FOR SEQ ID NO: 122: 





(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1194 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 122:

ATGAATATAT TTnnAAATAA ATTATTATGG ATTGCACCAA TnGCCACTAT GATTATCTTG 60 

GTAATCTTTT CTTTAGCTTT TTATCCTGCA TATAATCCTA AACCAAAAGA TTTACCAATT 120 

GGTATATTAA ACGAGGATAA AGGTACAACG ATTCAAGATA AAAATGTTAA CATTGGTAAA 180 

AAATTAGAGG ATAAATTATT AGATAGTGAT TCTAATAAAA TTAAATGGGT TAAGGTTGAT 240

AGTGAAAAAG ACCTTGAAAA AGATTTGAAA GATCAAAAAA TCTTTGGAGT AGCTATTATT 300 

GATAAAGACT TTTCAAAAGA TGCTATGAGT AAAACACAAA AAGTAGTTAT GGATAGTAAA 360 

AAAGAAGAAA TGCAACAAAA AGTTGCTTCA GGTGAAATTC CGCCACAAGT GGTTCAACAA 420 

ATGAAACAAA AAATGGGGAA TCAACAAGTA GAGGTTAAGC AGGCTAAATT TAAAACGATT 480

GTAAGTGAAG GATCAAGCTT ACAAGGTTCA CAAATTGCAT CAGCTGTGTT AACTGGTATG 540 
GGTGATAATA TTAATGCTCA AATTACGAAG CAAAGTTTGG AAACATTAAC GAGTCAAAAT 600

GTTAAAGTCA ATGCCGCGGA CATCAATGGT TTGACGAATC CAGTAAAAGT GGATAATGAA 660 

AAACTTAATA AAGTTAAAGA TCACCAAGCA GGTGGTAATG CACCATTCCT AATGTTTATG 720 

CCAATTTGGA TAGGTTCAAT CGTAACGTCT ATCTTATTGT TCTTTGCATT TAGAACTAGT 780 

AACAATATCG TCGTGCAACA TCGTATCaTT GCtTCAATTG GACAGATGAT ATTTGCAGTT 840 

GTTGCAGCAT TTGCAGGTAG CTTTGTTTAT ATTTATTTCA TGCAAGGCGT TCAAAGATTT 900

GATTTTGACC ATCCAAATCG TATCGCAATT TTTGTAGCAT TTGCGATTCT TGGTTTCGTG 960 
GGCCTTATTT TAGGTGTTAT GGTATGGCTA GGTATGAAGT CAGTTCCAAT TTTCTTCATT 1020

TTAATGTTCT TTAGTATGCA ACTTGTAACG TTACCTAAAC AAATGTTGCC TGAAAGTTAT 1080

CAAAAATATG TATATGATTG GAATCCATTC ACACACTATG CAACAAGTGT AAGAGAcTAT 1140

TATACTTGAA TCATCATATT GAATTAAATA GTACAATGTG GATGTTTATA GGGT 1194 
(2) INFORMATION FOR SEQ ID NO: 123:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 558 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123:
GACCGACCTA TACATCCGTA TAAGTATTTC TTGATATAAG TCTTCTAAAT CATAATGATT 60 
AAATCCAAAT GTTTTGATGC GTCGAATAAT TAATGGTTGT AGATCCATTA CTAACTTTTC 120 
GTATTTCAAA TATTAAACTA ACCCCTTCTA TCTAAAATTT AAGGTTAGTT TAATATTGTT 240
ACATTCAAAA TTTCAAGATG ACGGAAATGT CATTTCTTAT GATGTCCTCT TOGTATTTTT 300 
TCAAATTCTG CAAGGATTTC AGAAGATAAC GGAATTCGAG TTCTTGGCTT GTTTTCACTT 360

ATATCATCTA ATGATTTACT CACATCAATT TCATTTTCTT TTAAATCTCT CCACATTTCG 420 
CGAGATGATA TTCTATATGC ACCTGATCCA AAGATAGCAT GTTGcTCACT CaTATCACTT 480 


GTTACAACTG TAATATGcTT AGtATGCTTG tCaTAAAGtT CaTAAACCAT AACGGTTCTA 540 
ATGGAAACCA ATCAGCTG 558 
(2) INFORMATION FOR SEQ ID NO: 124: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 124: 





GCTTCAGACA 


TnTGATGATA 


TAATCTCTCA 


TCATCGATTA 


ATTCTTTTGC 


AGCTTGATAC 


60 




ACATnTTGCT 


TATTTGTTCC 


AATGACTTTT 


AATGTGCCAG 


CTTCAACACC 


TTCAGGACGT 


120 




TCTGTAACAC 


TTCGCCAAAA 


CTAAAACTGG 


CTTATTAAAT 


GATGGCGCTT 


CTTCCTGAAT 


180 


30 


TCCACCTGAA 


TCTGTCAAAA 


TAAAATAAGA 


TTTTnTAGCA 


AAATTATGGA AATCTATACG 


240 




TCCAAAGGTT 


CAATCAATTC 


AATTCTGTCA 


TGACTACCTA 


AAATCTTTTG 


AGCCACCTCT 


300 


35 


CGAACTTTCG 


GGTTTTTATG 


CATTGGATAT 


ACCAGTGCTA 


AATCAGTATA 


CTCATCTATT 


360 


AAGCGTCTAA 


CCGCTTTAAA 


TATATTTTCC 


aTGGGTTTCC 


CGATATTTTC 


TCGTCGGTGT 


420 




GCTCTCATrA 


GAATGAATTT 


JcTtGTCATGG 


TATTTATCCA 


TGATGTTAGA 


TTTATAATTG 


480 


40 


TCATCAACTG 


TATATTTCAT 


AGCATCAATC 


GCAGTATTAC 


CAGTGACAAC 


AACACTTTCT 


54 0 


GAATATTTCC 


CTTCACTTAA 


CAAATGCGAT 


GCAGCATTTT 


TAGTAGGTGC 


AAAATGTAAG 


600 




TCAGCTAATA 


CACCAACTAA 


TTGTCTATTC 


ACCTCTTCTG 


GAAAAGGTGA 


ATATTTATCA 


660 


45 


TAACTTCTAA 


GCCCTGCTTC 


AACGTGTCCA 


ATCGGCACTT 


GGTTATAAAA 


TGCCGCTAAA 


720 




CCACCTGCAA 


ATGTCGTCAT 


CGTATCACCA 


TGTACAAGTA 


CCATGTCTGG 


TTTTTCTAAT 


780 




TGAATCACTT 


GTTCTAATTG 


AGTGATTGAT 


TTAGAAGTTA 


TCTCAGAAAG 


TGTCTGTCCT 


840 


50 


GATTTCATAA 


TATTCAAATC 


GTATTTTGGT 


TTGATTTCAA 


AGGTACTTAA 


TACTGAATCA 


900 




AGCATTTCTC 


TATGCTGTGC 


TGTAACAACA 


ACAATTGGCT 


CGAGCATTTT 


TTCTTGTTCC 


960 
ATCTTTTTCA


TCAAACTACT 


TATCTCCGAT 


TCTTCTATTT AGTACCAAAC AATCTATCTC 


1080 




CAGCGTCGCC 


TAACCCTGGT 


GTGATATATG 


CTTTGTCATT aGCTTTTCAT CAAGTGCAGC 


1140 


5 


AATATAAATA TCTACATCTG 


GATGTGCTTC 


ATGCATCTTT TCTACGCCTT 


CTGGTGCTGC 


1200 




AATTAAACAC 


ATGAAGCGAA 


TATTTTTAGC 


GCCACGTTTC TTCAATGAAG 


TAATAGCTTC 


1260 




AATTGCTGAT 


GCGCCTGTTG 


CTAACATAGG 


ATCAACAACA ATGATTTGTC 


TTTCAGTAAT 


1320 


10 


ATCTTGAGGT AACTTAGCAA AATACTCTAC AGCCTTTAAT GTTTCGGGAT CTCGATATAA 


1380 




ACCGATATGT 


CCAACTCTGG 


CTGCAGGTAC 


TAAACTTAAA ATACCATCAG 


TCATACCTAA 


1440 


15 


ACCAGCTCTT AAAATTGGAA CGATAGCTAA TTTTTTACCA GCTAATCGTT TAGCCGTCAT 


1500 


TTTAGTTACA GGCGTTTCAA TATCAACATC CTGAAGCTCT AAGTCTCTAG TTACTTCATA 


1560 




TGCCATCAAC 


ATACCAACTT 


CGTCTACAAG 


TTCTCTAAAT TCTTTAGTAC 


CTGTATTTAC 


1620 


20 


ATCTCTAATA TAGCTTAGTT 


TGTGTTGAAT 


TAATGGATGA TCGAAAACGT 


GTACTTTACT 


1680 




CATAAAAATT ACTCCTATCT 


TTGTGTATGT 


TTATTGATAT AGAGGATATT 


CAGCTGTTAA 


1740 




TTTCGCAACG 


CGTTCTTTAG 


CTTGTTGTAA 


TTTTTCTTCA TCTTTACTAT 


TTTTCAATGC 


1800 


25 


TAAACTGATG 


ATTTTTGCAA 


CTTCCTCAAA 


AGCTTTTTCA TCAAATCCAC 


GCGTTGTTGC 


1860 




AGCAGGTGTA 


CCTAAACGTA 


TACCACTCGT 


TACAAAAGGT TTTTCTTGAT 


CGAACGGAAT 


1920 




GGTATTTTTG 


TTACATGTGA 


TACCAACTGA 


ATCTAAAGTC TCTTCAGCTT 


CTTTACCAGT 


1980 


30 


AAGTCCTATA 


GACCCTTTTA 


CATCAACAGC 


TACTAAGTGA TTATCTGTAC 


CGCCAGAAAC 


2040 




AATTCTAAAT 


CCTTCATTAA 


TTAATGCTTC 


TGCAAGAACT^ TTTGCGTTTT 


TAACCACTTG 


2100 




TTGTTGATAC 


GTTTTGAAAT 


TATTTTCTAA 


CGCTTCTCCA AAAGCAACTG 


CTTTtGCTgC 


2160 


35 


AATAACATGC 


TCAAGAGGTC 


CACCTTGAAT 


ACCAGGGAAA ATTGTTTTAT 


CTATGTCTTT 


2220 




TTTATATTCT 


TCCTTACATA AAATCATACC 


ACCACGtGGT CCGcGTAATG 


TTTTGTGTGT 


2280 


40 


TGTAGTTGTT 


ACAAAATCAG 


CATATTCTAC 


TGGATTTGGA TGTAAACCTG 


CCGCTACTAA 


2340 


TCCTGCAATA TGTGCCATGT 


CTACCATTAA 


CTTAGCGTTT ACTTCATCTG 


CGATTTCTTT 


2400 




AAACTTTTTG 


AAGTCAATTG 


TTCTTGAATA 


TGCTGATGCT CCTGCCACAA 


TAAGCTTAGG 


2460 


AC 


CTTATGCTCT 


AACGCTAATT 


TACGAACTTC 


ATCATAATTG ATTCGTTCTG 


TGTCTTTATC 


2520 




TACTCCATAT 


TCAACGAAAT 


TGTAGAATTT 


ACCACTAAAA TTAACAGGCG 


CTCCATGTGT 


2580 




CAAGTGACCA 


CCATGACTCA 


AATTCATACC 


TAAAACTGTG TCGCGCATTT 


CTAATGCAAC 


2640' 


50 


TAAGTAAACA 


GCCATGTTCG 


CTTGTGAACC 


TGAATGTGGT TGAACATTGA 


CATGTTCAGC 


2700 




TCCAAACAAT 


GCTTTAGCAC 


GATCAATTGC 


GATGCTTTCA GTAACATCTA 


CAAACTCACA 


2760 
TTGTGCTTCC ATAACCGCTT CCGATACAAA ATTTTCCGAT GCGATTAACT CTATGTTGCT 2880

ATTTTGTCTC TGAAATTCTC TCTCGATTGC TTCTGCGATA ACTTTATCTT GCTTGGTGAT 2940 

5 ATAAGACATA AAATCTCCCC TTCTTTCAAA AAAACTTATT GGTATTTAGC ACGTTCGCCA 3000 

CCAATCTTTT TCGGCCTAGA TGTGGCAATA GTTACAATTG CCTGTCCTAC TTGCTTTACT 3060 

GAGGTCCTTA CAGGTACACA TACATGTTTA ATATGCATGC CTATTAACGT TTGACCAATA 3120 

TCAATTCCAC AAGGAACAGT AATATGTTCG ACCACGATCG GATCCTTCAT ATGCTGAAAA 3180 

GCGTATGTTG CCAAACTCCC TCCAGCATGT ACATCTGGAA CGACGGAAAC TTCTTCCATT 3240 

GTTAATGGAT TATACTGAGA TTTTTCTATT GTTATCGCTC TGTTGATATG TTCACATCCT 3300 

75 

TGAAAAGCAA AAGTAACGCC TGTCTCTTTA CTCACAACAT CTAATGCATT AAAAATAGTT 3360 

TCTGCAACTT CCaTCGAACC GACAGTCCCT ATTTTTTCGC CAATGACTTC CGATGTTGAA 3420 

CATCCAATTA AACATATATC TCCTTTATTA AAAAAGGACA TATCTTTTAA TTCGTCTAAT 34 80 

20 

AACATTGTCA AATCTTTCAT AAAAGCCCAC CCTTCCTAAA AATAAAAAAG GAATATAGCA 3540 

AAGTGCTACA CTCCTCTATT ATAACTTATT TAACTGTTAA CATATACTAA TTATACAGAA 3600 

25 TTCCTACTAG CAAATAATAT CTTTTAATTT TAAAATTAAA CTTACAAGTT CTTCATAGGT 3660 

ATGTACATAC ATTTCTTTTG TTCCACCGTA TGGATCTATA ACTTCTCCTG CTTCTTTtAC 3720 

ATATTCATGC AATGTGAAAA CATGATTTTG CAAACCAAAG TGTGCCTCTA TTAATTCTTT 3780 

30 GTGCGAATAC GACATCGTCA AAATAATATC TGCTTTCAAA TCTGCTTCAG TAAATTGTTG 3840 

CGATAAGGTC GTTTCAGCTA AATGATGTTC TTCAACTAAG TCTTCAACAT AATTCGAAAC 3900 

ACCTTGATTG TTCACAGCGA ATATACCTCT TGATTCAAAT TGATGATTTG GCATAACCTC 3 960 

35 TTTTGCAATA CTTTCCGCTA ATGGGCTACG ACATGTGTTA CCTGTACAAA CGAATAAAAT 4 020 

CTTCATAGTT CACATCCTTT AATAATGTGA TTACCTGCAG CTTTTAACAT GCGATTCATA 4 080 

ATTGCTTCTG TATTATCATT CAGCTCAAAG CCGTATATAT ACGCCGCTGA AATATTTTCA 414 0 

40 

TTTTCATCAA GTGAATGTAA CACATCATAA AGATTATGAC TTGCTTGTTT AACATCATTG 4200 

TCATCCTGAC ATAATTGAAT GAATTGCGCT TCACTTGGTA TAAACGCCAC CTTATTACTC • 4260 

GGCACAATAA AAGCTATAGA AGACCAATCT TTACCGTCAT TTCCAATTTT GCTCTCAATA 4320 

45 

TCTGTAATAA TTGTAAGTGG TGTATTGGGT GAGTAATGCT TATACTTCAT ACCTGGTGCA 43 80 

ATTGGCTGTT CAGTATCATT ATAATCAGCA TGGGCGATAC TATTCGGAAG TATTTCTGTA 4440 

50 ATCATTGCTG CTGTTATAGA ACCAGGTCTT GCAATTTTAT AAGGAAAAGA TGTGCAATCT 4500 

AAAACCGTAC TTTCTAATCC TTCTTCACTT TGTTCAGCTT GAACAATACC ATCGATACGG 4560 

55 
GCACTTGGAG CAGCTAGAGG TTCATTTATG ATTTGTAATA ATTGTCTACC TACAGAATGG 4680

CTTGGCATTC TAACAGCAAC TGATGATAAA CCTCCAGAAA CTTTTCGACA TAGATAGCCT 4 74 0 

5 AGCTTTAACG GCAATATAAA CGAAATAGGG CCCGGCCAGA ATGCCTGCAT TAACTTTTCT 4 800 

ACGCGTGGAT CCAAAGTATA TGTAAAATCT TTTAATTGAC CTTTACTGTG TATATGAACA 4 860 

ATAAGCGGAT TGTCAGATGG ACGGCCTTTA GCTTCATATA TTTTAGCTAC AGCTTCTTCA 4 920 

10 TCTGTCGCAT TTGCTGCAAG TCCATAAACT GTTTCAGTTG GTAAACCTAT TAAACCACCG 4980 

TTTAAAACAA TGTCTTTTAT TTCATTAATT TTAGGATATT GCTGTAAATC TTCATTATAT 504 0 

TCTCTAACAT CCCAAATTTT AGTATCCAAC TTAATCAGGC CTTTCTTATT TATCATAATA 5100 

TAAAGCAAAA AGCTATGCAC TTAACTAATC ATAGCAAAGG CATAACTTCT AATTACCATT 5160 

TAAATGAGAC GATTCGATCG TGGCCATTTA TATCTTTAAT AATGTCGATT TTTTTGTCAG 522 0 

GAAATTTATT TAAAATTATT GATTTAAGTG CCTCACCTTG ATTGTAACCA ATTTCAAAAA 5280 

CAACTGGGCT GCCTTTTTCC ATAACGTGAG GTAAATCTTC AATGATTGAT TCATAAATAG 534 0 

CATATCCATG GTTATCTGCA AACAATGCCT GATGTGGTTC GAATCTCGTA ACCGTTGGAG 5400 

ACATCGTAAC CATATCTTTT TCATCTATAT ATGGTGGATT AGATATCAAG CCGTTCAACT 5460

TGATACCTTC ATTAATTAAG GGCTTTAATG CATCCCCTGT TAAAAATTGT ATTTGTGATT 5520

GATGCTTCTC AGCATTATTA CGAGCCATAT TCATTGCTTC AAGTGAAATA TCAGTAGCAA 5580

TAACATTTAA ATCCGGCTTT TCACATTTCA AAGTAATTGC AAGTACACCA CTACCCGTTC 5640

CGATATCTAC GATTGTTGCA TCATCTTCTA ACTGTTGTAA GAAATGCAAC ATTACTTCTT 5700 

CAGTTTCAGG TCTTGGTATC AAACAATTTG AGTTTAGATC AAACGTTCTA CCATAAAATG 5760 

AGGCAAAGCC AACTATATAC TGTATAGGCT CTCCTAATAA CATACGTTGT AATGCTAAGT 5820

CGAACTTCAT AATCATCGCT TTCGGCATAT CATCATGCAT GTGGACTACA AAGTCCGTAC 5880 

GCGTCCATTG AAATACATCT AACATTAACC ATTCAGCTCG TGTTTGTTCA AACCCTTTTT 5940 
GTTGTGTTAA ATGAATTGCT TCATCTAACT TTTCTTTATA ATTCACCATT ATTAAGTTCT 6000

TTCAATTTAT CTGTCTGCTC TGATAAAGTC AGTGCATCTA TAATTTCTTC TAAATGGCCT 6060 

TCCATAATTT GCCCTAATTT TTGAAGCGTT AGACCTATAC GATGGTCTGT TACACGGCTT 6120 

TGTGGATAAT TATAAGTTCG AATACGTTCT GAACGATCAC CAGTACCGAC TGCTGATTTA 6180 

CGTTGTGACG CATACTTTTG TTGTTCTTCT TGAACTTTCA TATCGTATAA ACGTGCTTTT 6240 

AACACTTTCA TTGCTTTTTC ACGGTTTTGA ATTTGAGACT TCTcAGAAGA TGTTGCAATG 6300

ACACCAGTTG GTAAATGGGT AATACGTACT GCAGAGTCAG TTGTGTTTAC GTGCTGACCA 6360 
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ACATCTTCAA CTTCTGGTAA AACTGCCACT GTAGCTGTTG AAGTATGAAT ACGTCCACCT 6480 

GATTCTGTTT CAGGCACACG TTGAACGCGG TGCGCACCAT TTTCAAATTT CAATTTACTA 6540

TACGCGCCAT TACCAGAAAC TGAGAAACTA ATTTCTTTGT AACCACCATG GTCACTTTCA 6600

GACGCTTCTA CTATTTCAGT TTTGAATCCT TGTGATTCAG CATACTTTGA ATACATACGC 6660 

ATTAAATCAC CAGCAAAAAT CGCAGCCTCA TCACCACCTG CTGCTGCTCT TATTTCTACA 6720 

ATAACGTCTT TGTCATCATT AGGATCTTTA GGAATCAATA ATATTTTAAG CTCTTCTTCA 6780

AGATTTGGAA GTTCAGCTTT AATACCATTA CTCTCCTCTT TTAACATTTC TACTTCTTCT 6840 

TTATCATCAG TCTCACTTAA CATTTCTTCA ATATCAGCTA ATTCTTCTTT TTTAGCTTTA 6900 

TAGTTACGAT AAACATCTAC AGTTTTTTGT AAATCAGCTT GCTCTTTAGA ATATTTACGT 6960 

AATTTATCTG AATCATTTAC AACATCTGGG TCACTTAACA GTTCATTTAA CTGTTCGTAT 7020 

CTTTCTTCTA CAATATCTAA TTGATCAAAC ACTTATAATT CCTCCTTATT ATTATCACTA 7080

GGTGCTACGA TATGGTGCGC GCGACAACGT GGCTCATAAC TTTCATTGGC ACCTACTAAG 7140 

ATAATCGGAT CATCGATTTT AGCTGGTTTA CCATTTATTA ATCGTTGCGT TCTACTAGAT 7200 

GAAGAACCAC AAACAGCACA AACTGCTTGA AGTTTCGTTA CTTGTTCACT GACAGCCATC 7260

AATTTAGGCA TTGGTTCGAA CGGTTCGCCC CTAAAATCCA TATCTAATCC AGCAACAATA 7320 

ACACGGTGTC CATCTGCTGA TAGTTTTTCT ACTATACTTA CAATTTCATC GTCAAAAAAT 7380 

TGCAcTTCGT CTATTCCTAT AACATCAACA TTAGTTAAGT CGTGCGTCAT AATTTCACTT 7440

GCTTTAGAAA TATTAATCGC TTCAATGGCA TTACCATTAT GAGAGACCAC TTTTTCTTTA 7500

TGATATCGAT CATCAATCGC CGGTTTAAAT ACAACGACTT TTTGTTTAGC GTATATACCC 7560 

CTTCTTAGAC GTCTTATTAG TTCTTCGGAT TTACCGCTAA ACATACTACC TGTAATACAT 7620 

TCTATCCAAC CGGAATGGTA AGTTTCATAC ATTGAGAGTll CCACCTTTTT CAAAACATAA 7680 

TCGCTTTATT ATATCATATT TCAAATATTC ATAAATGTCT TTnTCATAAT TATATCGATA 7740 

TTGTACATGA ACAATTATTT TA 7762
(2) INFORMATION FOR SEQ ID NO: 125: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2583 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 125:

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TAAAAAAATT ATTATCAATG ATGAACTAGA ATTGACTGAA TTCCACCAAG AACTTACTTA 120

TATTTTAGAC AACATAnAAG GGAATAATAA TTATGGTAAG GAATTTGTTG CAACCGTTGA 180

AGAAACATTC GACATTGAAT AaAGCGGGGT GgaAGCACTA TGAATCAATG GGATCAGTTC 240

TTAACACCTT ATAAGCAAGC GGTTGATGAG TTGAAAGkGA AcTTaAAGGC ATGCGCAAAC 300

AATATGAAGT TGGTGAACAA GCGTCGCCAA TAGAATTTGT TACTGGTCGT GTTAAACCAA 360

TCGCTAGTAT TATAGATAAG GCAAACAAAC GACAAATACC ATTTGATAGG TTAAGAGAAG 420

AAATGTACGA TATCGCTGGT TTAAGAATGA TGTGCCAATT TGTTGAAGAT ATTGATGTTG 480

TCGTCAATAT TTTAAGACAA AGAmAAGATT TTAAAGTAAT TGAAGAACGA GATTATATTC 540

GTAACACTAA AGAAAGTGGT TACCGCTCGT ATCATGTCAT TATTGAATAT CCAATTGAAA 600

CATTACAAGG CCAAAAATTT ATATTGGCTG AGATTCAGAT TCGTACATTA GCAATGAATT 660

TCTGGGCAAC GATTGAACAT ACTTTACGAT ATAAATATGA TGGTGCTTAT CCGGATGAAA 720

TTCAACATCG TTTGGAAAGA GCGGCAGAAG CAGCGTATTT ACTTGATGAA GAGATGTCTG 780

AAATTAAAGA TGAAATTCAG GAAGCTCAAA AATATTACAC GCAAAAACGT TCTAAAAAAC 840

ATGAAAATGA TTAACGAGGT GTTATAAATC ATGCGTTATA CAATTTTAAC TAAAGGTGAC 900

TCCAAGTCTA ATGCCTTAAA GCATAAAATG ATGAACTATA TGAAAGrTTT TcGCATGaTT 960

GaGGATrGTG AAAaTCCTGA AATTGTTATT yCAGTTGGTG GTGACGGTAC ATTACTACAA 1020

GCATTCCATC AGTATAGCCA CATGTTATCA AAAGTGGCAT TTGTTGGAGT TCATACAGGT 1080

CATTTAGGAT TTTATGCGGA TTGGTTACCT CATGAAGTTG AAAAATTAAT CATCGAAATT 1140

AATAATTCAG AGTTTCAGGT CATTGAATAT CCATTGCTTG AAATTATTAT GAGATACAAC 1200

GACAACGGCT ATGAAACAAG GTATTTAGCA TTAAATGAAG CAACGATGAA AACTGAAAAT 1260

GGCTCAACAC TTGTTGTGGA TGTTAACTTA AGAGGGAAAC ACTTTGAGCG ATTTAGAGGC 1320

GATGGATTAT GTGTATCAAC ACCTTCGGGT TCAACGGCTT ATAACAAAGC GCTAGGTGGC 1380

GCACTGATAC ATCCTTCACT TGAAGCAATG CAAATTACAG AAATTGCCTC GATAAATAAT 1440

CGTGTGTTTA GAACGGTAGG ATCACCACTT GTATTACCAA AGCATCATAC ATGTTTAATA 1500

TCACCAGTTA ATCATGATAC CATTAGAATG ACGATAGATC ATGTTAGTAT CAAACATAAA 1560

AATGTTAATT CAATACAATA CCGTGTAGCA AATGAAAAAG TGAGGTTTGC ACGTTTTAGA 1620

CCATTCCCAT TCTGGAAACG TGTGCACGAT TCTTTCATAT CAAGTGATGA AGAACGATGA 1680

AATTTAAGTA TCATATATCA CAACAAGAAA CTGTTAAAAC TTTTTTAGCA CGACATGATT 1740

TTTCTAAGAA GACAGTGAGC GCCATTAAAA ATAATGGCGC TTTAATTGTT AATGATGAAC 1800

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AAATACCGAG TGTTAATTTA ATACCTTATG CTCGTAAGCT AGAAGTATTG TATGAAGATG 1920 

CTTTTATCAT CATAGTTACT AAACCAAACA ATCAAAATTG TACGCCTTCG AGAGAACATC 1980 

CTCATGAAAG TTTAATCGAA CAAGTACTAT ATCATTGTCA GGAACATGGT GAAAATATTA 2040

ACCCACATAT TGTTACGCGT CTAGATCGTA ATACAACTGG TATTGTGATA TTCGCTAAAT 2100 

ATGGACATAT CCATCATTTA TTTTCTAAAG TAAACTTGAA AAAAATATAT ACTTGCCTTG 2160 

TATATGGTAA AACCCATACA TCTGGTATTA TTGAAGCTAA TATTAGACGG TCAAAGGATA 2220 

GGATTATAAC TAGAGAAGTT GCCTCGGATG GTAAATACGC TAAAACATCT TATGAAGTAA 2280 

TAAATCAGAA TGATAAATAC AGTTTATGCA AAGTTCATTT GCATACGGGA CGTACACATC 2340 

AAATTCGTGT ACATTTTCAA CATATTGGGC ATCCAATTGT GGGAGATTCT TTGTATGATG 2400 

GTTTTCATGA CAAAATTCAT GGTCAAGTAC TGCAATGTAC GCAAATATAT TTTGTTCATC 2460 

CAATCAATAA GAACAATATT TATATTACAA TTGATTATAA GCAATTACTT AAATTATnCA 2520

ATCAACTCTA ATnCACACAG GGGGTGTAAG TATGTCAATG AnCACAGATG AAAAAGAGCG 2580 

TGT 2583 
(2) INFORMATION FOR SEQ ID NO: 126:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1818 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 126:

ATCAAGTGAT ACATTTAACT GGTAAAGGAT TAAnAGATGC TCAAGTTAAA AAATCnGGAT 60

ATATACAATA TGAATTTGTT AAAGAGGATT TnACAGATTT ATTnGCAATT ACGGATACAG 120 

TAATAAGTAG AGCTGGATCA AATGCGATTT ATGAGTTCTT AACATTACGT ATACCAATGT 180 

TATTAGTACC ATTAGGTTTA GATCAATCCC GAGGCGACCA AATTGACAAT GCAAATCATT 240

TTGCTGATAA AGGATATGCT AAAGCGATTG ATGAAGAACA ATTAACAGCA CAAATTTTAT 300 

TACAAGAACT AAATGAAATG GAACAGGAAA GAACTCGAAT TATCAATAAT ATGAAATCGT 360 

ATGAACAAAG TTATACGAAA GAAGCTTTAT TTGATAAGAT GATTAAAGAC GCATTGAATT 420

AATGGGGGGT AATGCTTTAT GAGTCAATGG AAACGTATCT CTTTGCTCAT CGTTTTTACA 480

TTGGTTTTTG GAATTATCGC GTTTTTCCAC GAATCAAGAC TTGGGAAATG GATTGATAAT 540

GAAGTTTATG AGTTTGTATA TTCATCAGAG AGCTTTATTA CGACATCTAT CATGCTTGGG 600 



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
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



CTCATGTTAA AGCGCCACAA AATTGAAGCA TTATTTTTTG CATTAACAAT GGCATTATCT 720

GGAATTTTGA ATCCAGCATT AAAAAATATA TTCGATAGAG AAAGACCTAC ATTGCTGCGT 780

TTAATTGATA TAACAGGATT TAGTTTTCCT AGCGGTCATG CTATGGGATC AACTGCATAT 840

TTTGGAAGTG GTATCTATCT ATTAAATCGA TTAAATCAAG GTAATTCAAA AGGTATTCTT 900

ATAGGGTTAT GTGCAGCTAT GATTTTATTG ATTTCCATAT CACGTGTATA TCTAGGTGTA 960

CATTATCCAA CAGATATTAT TGCCGGCATT ATTGGTGGAT TATTTTGcAT TATTTTATCA 1020

ACGTTATTAC TTAGAAATAA ATTAATAAAT TAAATAGTAA AAAAACAAAA GCAGTAAACC 1080

TAAAGTGTCG TAAGGGTTTA CTGCTTTTAT AAAACGTTGT TATAACGTAT ATTGTCTTTT 1140

ACGGGCATAT AAnAGGGGAA TATTTGAnAA TGACCAATCC AACAAGAACG AAACGTTGTG 1200

GGGGGGATGT TCTATGTGGT ATTGATAATC ATTTTCAACT ACTATTATAC ATTAGTGAGA 1260

ATCATTGTCA ATTAGAAACT AAAACTTTTT TTGAATATTT TTTAAGAATA GTAAATAAAA 1320

CGCATGATTA CGCTATTTTA GAAAATAAAA AAATTTGTAT TTCTCATTAG AATTAGAATA 1380

TTTAAAAGTG ATGAGGTTTA AACATTATAT TGTTTACATA CTCCTTTTGA ATTCATACAT 1440

TATGAAATGT tACTTCCAAG TTCAAAATCG CACATTGAAA TGATGTGTGA AATGTTTAAA 1500

CTACGGTCAT tTTGTGTnAAA TAAAGrTAAT AACTATTCAT TTTACAATAG TGAAAAGTCA 1560

GTATATGACA ACAATTAATA TTGCGGTAAG GCCTTGTGTT ACAGTATTCT ATATTTAAGT 1620

ACTGCAATCA GAATTAACAG AATGCCATTA ACTGATTATT AAATATTTGA GTTAATAAAT 1680

AATTAATGAT TGTAGCTTGA AAAATTTAAA ACATGGTTAT TGATTTGTGA TAAAATTTAA 1740

ACGTAAACAA ACTAATTTAA AAAGCAACTA TTGTATAGAA AAATACAAAA TTTAAAATAT 1800

ATTACCTTAT TAGAAAAA 1818



(2) INFORMATION FOR SEQ ID NO: 127:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 12658 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 127:

TGTTTAAACA ATAGGGGGAA TCTTATGATT GAAAAATTAG TAACCTTTTT AAATGAGGTT 60

GTTTGGAGTA AGCCATTAGT TTATGGTTTG CTAATTACTG GTGTGCTATT TACATTGCGT 120

ATgCGATTTT TTCAAGTTAG ACATTTTAAA GAAATGATTC GATTAATGTT TCAAGGAGAG 180 
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GGTACAGGTA ATATTGTCGG TGTATCTACT GCAATATTTA TAGGAGGACC TGGTGCAGTA 300 

TTTTGGATGT GGATTACTGC GTTTTTAGGT GCAAGTAGTG CTTTTATTGA ATCTACACTT 360 

GGTCAAATAT TCAAGAGAGT TGAAAATAAT GAATACCGTG GTGGACCAGC GTATTATATT 420 

GAATATGGTA TTGGTGGTAA ATTTGGTAAA ATTTACGGAA TTATCTTTGC TATTGTTACG 480 

ATTATCTCAG TAGGTCTATT GCTTCCTGGT GTGCAATCTA ACGCTATAGC AAGTTCTATG 540

CATAATGCGA TTCATGTTCC ACAATGGTTA ATGGGTGGTA TTGTTGTAGT TATTTTGGGA 600 

TTAATTATTT TTGGTGGTGT ACGTATTATT GCCAATGTTG CAACAGCCGT TGTACCATTT 660 

ATGGCAATTA TTTACATACT GATGGCTGTC ATTATCATTT GTATCAATAT ACAAGAAGTG 720

CCAGCGTTAT TTGCATTAAT TTTCAAATCA GCATTTGGAT TACAATCTGC TTTTGGTGGT 780 

ATCGTTGGCG CAATGATAGA GATTGGTGTT AAACGTGGAT TATATTCAAA TGAGGCTGGT 840

CAAGGTACAG GTCCACACGC AGCAGCGGCa gcAGaAGTAT CACATCCAAG TAAACAAGGT 900

CTAGTACAAG CATTTTCAGT TTATATTGAT ACATTATTTG TATGTACTGC AACTGCTCTG 960 

ATTATACTTA TTTCTGGTAC ATATAATGTG ACTGATGGTA CGGTTAATGC GAATGGCACA 1020 

CCGCATTTAA TTAAAGATGG CGGTATTTAT GTTgAAAATG CAACAGGTAA AGATTATTCA 1080

GGTACTGCGA TGTATGCACA AGCCGGCATt GATAAAGCGT TCCATGGCAG TGGTTATCAA 1140

TTTGATCCTA CTTTCTCTGG CGTAGgTTCG TACTTTATTG cATTTGCTTT ATTCTTCTTT 1200 

GCATTTACTA CAATTTTGTC GTACTACTAC ATTACAGAAA CAAATGTTGC TTATTTAACG 1260 

CGTAATCAAA ATAATCAAGT TTCATCGATA TTTATTAATA TTGCTCGTGT GATTATTTTG 1320 

TTCGCTACAT TTTACGGTGC AGTTAAAACA GCTGATGTAG CATGGGCATT CGGTGATTTA 1380 

GGTGTAGGTC TAATGGCTTG GTTAAATATG ATTGCGATTT GGATTTTACA TAAGCCTGCC 1440 

GTAAATGCTT TAAAAGATTA TGAAATTCAA AAGAAACGTT TAGGCAACGG TTATAATGCA 1500 

GTTTATCAAC CTGATCCGAA TAAATTACCT AATGCTGTCT TTTGGTTGAA GACGTATCCA 1560 

GAACGTTTAA AACAAGCACG TGCCAAAAAG TAATCTACTT TTGTTTATAG TATATGTAGT 1620 

GATCATTTGA TAAAAAAGAA AAGTATTGAG AATTTTAGGt GCTCAGAAAT TTGAATTTTA 1680 

AAAATATAGT GTCTCTTGGT ACAATAACAA TACAACTACT AGGGGCACTT TTTTATGTCA 1740

GAATTTAAAA CTGGTAAGAT TAATAAACAT GTTTTATATA GTAATATTTT AAATAGAGAT 1800 

GTCACGTTAA GTATTTATTT ACCAGAATCT TATAATCAAC TTGTTAAATA TAATGTCATT 1860

CTTTGCTTTG ACGGATTAGA TTTTTTACGT TTCGGGAGAA TACAACGTAC ATATGAATCG 1920

TTAATCAAAG AAGCGCGTAT TGATGATGCG ATCATTGTTG GATTCCATTA TGAAGACGTT 1980 
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GTCGGTAAAG AAATATTGCC ATTTATTGAC TCGACGTTTT CTACACTGAA AGTAGGTAAT 2100

GCAAGGTTAT TAGTAGGGGA TAGTTTAGCG GGTAGTATTG CCTTATTAAC GGCGTTGACC 2160

TATCCAACGA TTTTTAGTCG TGTAGCAATG TTAAGTCCAC ATTCAGATGA AAAAGTATTA 2220

GATAAGCTAA ATCAATGTGC AAATAAAGAA CAATTGACAA TTTGGCATGT CATTGGTCTA 2280

GATGAAAAAG ATTTTACTTT ACCAACAAAT GGTAAGCGTG CCGATTTCTT AACACCGAAT 2340

AGAGAATTAG CTGAACAAAT TAAGAAATAT AATATAACTT ATTATTACGA TGAATTTGAT 2400

GGTGGTCACC AATGGAAAGA TTGGAAACCA TTGCTGTCAG ATATATTATT GTATTTTTTA 2460

AGTAAAAACA CAGATGATCA ACTTTATGAA TAATTTACAT TAGTAGATTT AGTATGAATT 2520

GTCTTCATAT AGTCTGGTCT ATAATATAAT TTATAAAAGA TTTTACTGTT TAATTTAATT 2580

TAAATTTGAC GAAATTGCAA AAGATGTATA ATGAATTATT TTTAATGTAA CGGTTTTCAA 2640

AGAAATTTGA TATAATAGCA ATAGGTTAAA CAAAGGAGGA ATTCAGATGA TTTTAGGATT 2700

AGCATTAATT CCATCAAAGT CATTTCAAGA AGCGGTGGAT TCTTACCGTA AAAGATATGA 2760

TAAACAGTAT TCACGAATTA AACCACATGT GACAATTAAA GCGCCATTTG AAATTAAAGA 2820

TGGTGATTTA GATTCTGTCA TTGAACAGGT TAGAGCTCGT ATTAATGGTA TACCAGCAGT 2880

AGAAGTTCAT GCTACAAAAG CTTCTAGCTT CAAACCAACG AACAATGTGA TTTACTTTAA 2940

AGTTGCGAAG ACGGACGACT TAGAAGAATT GTTTAATCGC TTTAATGGAG AAGATTTCTA 3000

TGGAGAAGCT GAACATGTTT TTGTGCCACA CTTTACAATA GCACAAGGAC TATCTAGCCA 3060

AGAATTCGAA GATATTTTTG GTCaAGTAGC ATTAGCTGGG GTAGACCaTA AAGAAATTAT 3120

CGATGAATTA ACTTTGTTAC GTTTTGACGA TGACGAAGAT AAATGGAAAG TTATTGAAAC 3180

GTTTAAATTA GCTTAAGTAA CATAATAGTA TTGTTAATCG TAGTATGTTT GAATTAATAA 3240

GAAAATGGTC ATTTTTATTG AATGTAATAA AAATGACCAT TTTCTTTATT TTAAAATACG 3300

TTTTAACCTT ACTTAGCTTT TTCTCTATTT ACTATAAAGT rGCTTCCATA AAATACAGCT 3360

AAGACTAAAA AGATTAATGC CGAGAAATAA AATGTATTGT TTAAATTGTT GGTAAATTGT 3420

GTAATTAATC CGCCAAATAA TGGCCCTATC ATTGAGCCGA ATCCTTGGAT ACTATTAAAA 3480

ACACCCCAAG TTTCTTCTTG TTCATCTGAT TTGATAAATC GTGCCATAAA GGTATTCCAT 3540

GCTGGTAATA AGATGCCATA CATTAGACCG ATAGCTAAAG CGATAATCCA CAAGATGTGA 3600

ATATTAACAA TCATAGATAG AGTAAAAATT AATATCATGT ATAAAATAAA TCCGCTTAGA 3660

ATAACACCAT ACATAAAGTT TCTGCTGCGG TTATCTATTA GTTTCGATAA AAATAGCATC 3720

GAAACTGCAC AGCCGATACC ACCAATAATG ATTGCAACAG TATATTCAAT TGTGCTTACG 3780

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

TGTAAAAGAA TACCAGGGAA CaACAATAAA TGGcGCTTTG TCACATCAAC AATTTGTCTC 3900

AATTGAGCTT TAACTGGACG AGTATTATAA TTTGTTAACT TTACATCGAC AAAATAATAT 3960

AATATCCATG CAATTAAAAC GACTAAAGAC ATCATGAAGG CAAAGCGTGT TGGGTGCACT 4020 

TTGATAAGTA GATTCATAAA AACCATACCT ACCAATAGGC CTAACAACCA TGAAAAATAA 4080 

ACATAGCCCA TTTGTTTGCC ACGTTTATCT TCTTCAACAC TGGATAACAT AATGACCCAA 4140

ATAGGACTAA CTGCAATACC GAGCATCATA GCACTAAATA TGATTACAAA AGGTGATGGT 4200 

GGAAACCAAA TAACTAAAAA TAAACTTGTA AATGCTAAAA TAAATCCAGT CGTTAAAACG 4260 

ATTTTTGTGC CGAATTTTTT CAGTAAAAAT CCTATAACAA AGTTTGTAGA TGCATCAGCA 4320 

ATAAAATGTA TTGAAAATGC TAGAGACGTT ATTGCTACAG CAATGGATGT AACTGTTGGC 4380 

AAGAAATTAA TATAGCTTAG GATATACATG CCTCTCGCAA ATTCCATTAA AAATAAGATA 4440 

ATAAGCaTTA AAATGAAATT TTTATGATTA GCGTAATTAT TTAACGAAGA ATCTTGCATA 4500 

TAAAGGAACC TTTCCATAAA TCTCTTGTGG TTGTGATGAA TGACCGATTA AATCAAGTAA 4560 

GTCTCGACAT ATTGTCTGTG TAGCATACTT AATTTTATCT TGTTCCATTG TACTAATCAT 4620 

GTTAGTTAAT TGCTCATTAC CGTTAGTTAA ACTTGGTACA ATTTTTATTG CTTCTTCTGG 4680

AGTATCAGCG ATTTTACCAA AACCTTTTTC TTCAAAGTAA AGGGCATTTT CAAGCTCTTG 4740

ACCAGGTGCA GGATTTAGGA AAATCATTGG AATACAACGG GCGAAACCTT CAGTTATTGT 4800 

GATACCACGA GGTTTCGTAA TCATAAGTTG ACTTGATGCC ATCCATTCAT TCATGTGTTT 4860

GGTATAACCT AGAATCAATA CATTCTCGTT AGATTTAAAC TTAGCTGTTA AAGAACGCTT 4920

TAGCTCTTTG CTCTTACCAC AAATCATAAC TACTTGTGCA TTTGCaCTTT tCGCTAATAT 4980

ATCAGTAATC ATCGTGTCAA AACCTTTAGA TACACCAAAT GCACCAGCTG aCATTAAAAT 5040 

AGTTTGCTTA TCTGGATCTA AGTTGTTGTC TATTAACCAC TGCTTTTGAT TAATAGGCGT 5100 

TTCAAATTTG TTATCAATAG GAATACCTGT CaCTTTAACT GTTGAAGGAT CAATACCTAC 5160

GTCTATGAAG TCTTGTTTCG TTTCTTTTGT TGCCACATAA TATCTTGTTG AATACGGCGT 5220

AATCCAGTTT TTATGTAAGC GATAGTCTGT CATCACTGTA GCAACTGGAA TATTAATGTT 5280

AAATTGCTCA GTTAGTACCG ACATAACTGG TGTAGGAAAC GTTAATAATA TTAAATCTGG 5340

CTTTTCTTTT ATCAATAAAT TAATTAACTT ATTAAGTCCA TAGTATTTGT AAAAACATTT 5400

GTCTAGTTTA TCTGGGCGGC TGTAATAAAA CCCTTTGTAC ATATTTCTAA AATATTTAAA 5460

GCTATTGATA TACCATTTTT TACAAATAGA AGTCAAAATT GGATGAGCTT CCATAAATAA 5520

ATCGTGCTCA ATGACGCTTA AATGGTCTAG ATTCATATCA TTAAGTTGAT TAACGATACT 5580 
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TTGAGTAACC ATTAATAGCC ACCCTCCGTT AGTTTGAAAA TTTTATTTAA GTGTAACTTA 5700

TTTTACGGCA TTATAAAAGA AATAAAGACG CAAAGTCGTT ACATTTATAG CAATTTTAAT 5760

CTATAGATGA ATTGATACAA AATAAAACGT TATTTTATAA AGCAATTTAT TGTTCTATGT 5820

TTTATTTGTA TATTTAAAAT TATCCAGTAT ACAATTATAG CATATTTTTG GAAACAATTA 5880

TGATATTATA CCATGTTACA AGATGGTTTT AATAATTTAA GATGAGCCAT AATTGTAAAA 5940

CTAATTCATA ATACCGTATG TTTTATTTTT AATAGTAGAA ATTAGAAAAT GCTGATTAGT 6000

AGGATATAAC AGTGAAATTA TAAATTTATT AACATCAACA AAACGTGTAT AATAAACATA 6060

TTGTAGAAAA AGGAGCGGTT CAGTTTGGAT GCAAGTACGT TGTTTAAGAA AGTAAAAGTA 6120

AAGCGTGTAT TGGGTTCTTT AGAACAACAA ATAGATGATA TCACTACTGA TTCACGTACA 6180

GCGAGAGAAG GTAGCATTTT TGTCGCTTCA GTTGGATATA CTGTAGACAG TCATAAGTTC 6240

TGTCAAAATG TAGCTGATCA AGGGTGTAAG TTGGTAGTGG TCAATAAAGA ACAATCATTA 6300

CCAGCTAACG TAACACAAGT GGTTGTGCCG GACACATTAA GAGTAGCTAG TATTCTAGCA 6360

CACACATTAT ATGATTATCC GAGTCATCAG TTAGTGACAT TTGGTGTAaC GGGTACAAAT 6420

GGTAAAACTT CTATTGCGAC GATGATTCAT TTAATTCAAA GAAAGTTACA AAAAAATAGT 6480

GCATATTTAG GAACTAATGG TTTCCAAATT AATGAAACAA AGACAAAAGG TGCAAATACG 6540

ACACCAGAAA CAGTTTCTTT AACTAAGAAA ATTAAAGAAG CAGTTGATGC AGGCGCTGAA 6600

TCTATGACAT TAGAAGTATC AAGCCATGGC TTAGTATTAG GACGACTGCG AGGCGTTGAA 6660

TTTGACGTTG CAATATTTTC AAATTTAACA CAAGACCATT TAGATTTTCA TGGCACAATG 6720

GAAGCATACG GACACGCGAA GTCTTTATTG TTTAGTCAAT TAGGTGAAGA TTTGTCGAAA 6780

GAAAAGTATG TCGTGTTAAA CAATGACGAT TCATTTTCTG AGTATTTAAG AACAGTGACG 6840

CCTTATGAAG TATTTAGTTA TGGAATTGAT GAGGAAGCCC AATTTATGGC TAAAAATATT 6900

CAAGAATCTT TACAAGGTGT CAGCTTTGAT TTTGTAACGC CTTTTGGAAC TTACCCAGTA 6960

AAATCGCCTT ATGTTGGTAA GTTTAATATT TCTAATATTA TGGCGGCAAT GATTGCGGTG 7020

TGGAGTAAAG GTACATCTTT AGAAACGATT ATTAAAGCTG TTGAAAATTT AGAACCTGTT 7080

GAAGGGCGAT TAGAAGTTTT AGATCCTTCG TTACCTATTG ATTTAATTAT CGATTATGCA 7140

CATACAGCTG ATGGTATGAA CAAATTAATC GATGCAGTAC AGCCTTTTGT AAAGCAAAAG 7200

TTGATATTTT TAGTTGGTAT GGCAGGCGAA CGTGATTTAA CTAAAACGCC TGAAATGGGG 7260

CGAGTTGCCT GTCGTGCAGA TTATGTCATT TTCACACCGG ATAATCCGGC AAATGATGAC 7320

CCGAAAATGT TAACGGCAGA ATTAGCCAAA GGTGCAACAC ATCAAAACTA TATTGAATTT 7380

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

GTTTTAGCAT CAAAAGGAAG AGAACCATAT CAAATCATGC CAGGGCATAT TAAGGTGCCA 7500 

CATCGAGATG ATTTAATTGG CCTTGAAGCA GCTTACAAAA AGTTCGGTGG TGGCCCTGTT 7560 

GATTAATAAA AGATTTATTG ATGAAGGTAA AACTATTGAT GTTTATTTAT TCGAAGCATT 7620

AAATAACCAG ATAATCATTG CTATACCAGA TTGGTTTTGG TCATATCAGA TGGCAATGAC 7680 

ATTAGATGAA GAAACTTGTT TTGAAGCAAT ACTCATGCAA TTGTTTGTTT TTAAAGAAGA 7740 

GGAAGAGGCA GAATCGATTG CATCACAACT AACAGATTGG ATAGAAACAT ATAAAAAGGA 7800

GAAAGACTAA TGAACTTAAA GCAAGAAGTT GAGTCTAGAA AGACTTTTGC GATTATTTCA 7860 

CATCCCGATG CAGGGAAAAC AACGTTAACT GAAAAACTAT TGTACTTCAG TGGTGCTATT 7920 

CGTGAAGCGG GTACAGTTAA AGGGAAGAAG ACTGGTAAAT TTGCGACAAG TGACTGGATG 7980 

AAAGTTGAAC AAGAGCGTGG TATTTCTGTA ACTAGTTCAG TAATGCAATT TGATTACGAT 8040 

GATTATAAAA TCAATATCTT AGATACACCA GGACATGAAG ACTTTTCAGA AGATACGTAT 8100 

AGAACATTAA TGGCAGTTGA CAGTGCTGTC ATGGTCATAG ACTGTGCAAA AGGTATTGAA 8160 

CCACAAACAT TGAAGTTATT TAAAGTTTGT AAAATGCGTG GTATTCCAAT CTTTACATTC 8220 

ATTAATAAAT TAGACCGAGT AGGTAAAGAA CCATTTGAAT TATTAGATGA AATCGAAGAG 8280

ACATTAAATA TTGAAACATA CCCTATGAAT TGGCCAATTG GTATGGGACA AAGTTTCTTT 8340 

GGCATCATTG ATAGAAAGTC TAAAACAATT GAACCATTTA GAGATGAAGA AAATATATTA 8400

CATTTGAATG ATGATTTTGA GTTGGAAGAA GATCATGCAA TTACAAATGA TAGTGATTTT 8460

GAACAAGCGA TTGAAGAATT AATGTTGGTT GAAGAAGCGG GTGAAGCCTT TGATAATGAC 8520 

GCGCTGTTGA GTGGAGACTT AACACCTGTA TTTTTCGGTT CAGCTTTAGC TAACTTTGGT 8580 

GTACAAAATT TCTTAAATGC ATATGTTGAT TTTGCGCCAA TGCCAAATGC GAGACAAACA 8640

AAAGfiAGACG TTGAAGTAAG CCCGTTTGAT GATTCATTTT CAGGATTTAT CTTTAAAATT 8700 

CAAGCCAACA TGGACCCTAA ACACCGTGAT AGAATTGCCT TTATGCGTGT CGTTAGTGGT 8760 

GCATTTGAAC GTGGTATGGA TGTTACTTTG CAACGTACTA ATAAAAAGGA AAAGATCACA 8820 

CGTTCAACGT CATTTATGGC AGACGATAAA GAAACTGTGA ATCATGCTGT AGCAGGCGAT 8880

ATCATTGGAC TATATGATAC TGGTAATTAT CAAATTGGAG ATACTTTAGT TGGTGGAAAA 8940 

CAAACCTACA GTTTCCAAGA TTTACCACAA TTTACGCCAG AAATTTTTAT GAAAGTTTCT 9000 

GCTAAAAACG TCATGAAACA GAAGCATTTC CATAAAGGTA TTGAACAATT AGTACAAGAA 9060

GGTGCGATTC AATACTATAA AACATTACAC ACAAACCAAA TTATTTTAGG TGCTGTTGGT 9120

CAGTTACAAT TTGAAGTTTT CGAACATAGA ATGAAAAACG AATATAATGT TGATGTTGTT 9180 
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AAGATGAACA CATCAAGATC GATTTTAGTG AAAGATAGAT ATGACGATTT AGTATTCTTA 9300 

TTTGAAAATG AATTTGCAAC AAGATGGTTT GAAGAGAAAT TCCCTGAAAT TAAATTGTAT 9360 

AGTTTACTTT AACAGCTCAA TTGTATAATC GAATTTGTTA CATTAAAAAT AATTGTTTCG 9420 

TTGAAGAAAA ATAAATTGTA TATTTTAAAA GAAAAAGGTA TACTATGATG TATCAAATGA 9480 

ATAACCTATG GCATTTTGTC AGAGGGGAGT AACTTAAGAA TCATGACCGT ATAAATGaTT 9540

CGACACTTTA TCGTCATTAC GArGATATCT TCCGGTAAAG TGGGCAATTT AAATTGCTTA 9600 

GTGAGACCTT TGCTATTTAT TTAGCATAGG TCTTTTTGTT TGTACTTAAC TTATTTATTT 9660 

AAAGGAGTTG TACATGTTAA TGGATCCAAG TTTGATCTTA CCTTATTTAT GGGTACTTGT 9720 

CGTTTTAGTA TTTTTAGAAG GCTTATTAGG AGCAGATAAC GCGATTGTTA TGGCTGTAAT 9780 

GGTTAAGCAC TTACCACCCG AACAACGTAA AAAAGCTTTG TTTTACGGTT TGTTAGGTGC 9840 

ATTTGTATTT AGATTTTTAG CATTATTCTT AATTAGTATT ATCGCGAACT TTTGGTTTAT 9900 

TCAAGCTGCA GGAGCGGTTT ACTTAATTTA TATGTCAATC AAAAATCTGT GGCAGTTCTT 9960 

TAAACACCCA GAAATTGAAA GTCCTGAAGC TGGAGATGAT CATCATTATG ATGAATCTGG 10020 

TGAAGAGATT AAAGCAAGTA ACAAATCATT CTGGGGAACT GTGTTGAAAA TAGAATTTGC 10080 

AGATATCGCA TTTGCCATTG ATTCTATGCT TGCTGCTTTA gCTATTGCTG TAACACTTCC 10140 

TAAAGTTGGT ATTCACTTTG GTGGTATGGA CTTAGGTCAG TTCGTAGTCA TGTTCCTAGG 10200 

TGGAATGATT GGTGTTATTC TAATGCGTTA TGCAGCAACA TGGTTTGTAG AGCTATTAAA 10260

CAAATATCCA GGACTTGAAG GTGCAGCCTt CGCGATCGTT GGTTGGGTAG GTGTTAAATT 10320 

AGTTGTCATG GTATTAGCGC ACCCAGACAT CGCTGTATTG CCTGAGCACT TCCCACATGG 10380 

CGTATTATGG CAATCTATTT TCTGGACAGT ACTAATTGGA TTAGTAATTA TCGGTTGGTT 10440

AGGTTCAGTT GTTAAAAATA AAAAATCGCA TAAATAATTG ATGTGAAGCG GACAATCTTA 10500 

ATTTAGTTTA AGGTTGTCCT TTTTCATTTA ATTGAGTGAT TTATGAAAAA TGGATTTTGA 10560 

AGAATGTGAA TCAAAAGATG CGATATAGTA TTAAGAAAAT GTGCCTTTTA TATTTAGCAT 10620 

TTTTTCAATA GAAATTATAT AGATTTTAAA GCAAATTAGG TGTTAATGTG TCATAATGAT 10680 

AAGTGATTTT ATTGAATGGA GTGGACATTA GTGGATATTG GTAAAAAACA TGTAATTCCT 10740 

AAAAGTCAGT nACCsaCGTA AGCGTCGTGA ATTCTTCCAC AACGAAGACA GAGAAGAAAA 10800 

TTTAAATCAA CATCAAGATA AACAAAATAT AGATAATACA ACATCAAAAA AAGCAGATAA 10860 

GCAAATACAT AAAGATTCAA TTGATAAGCA CGAACGTTTT AAAAATAGTT TATCATCGCA 10920 

TTTAGAACAG AGAAACCGTG ATGTTAATGA GAATAAAGCT GAAGAAAGTA AAAGTAATCA 10980 
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AAATTCATTA GATTCAGTGG ACCAAGATAC AGAGAAATCA AAATATTATG AGCAAAATTC 11100 

TGAAGCGACT TTATCAACTA AATCAACCGA TAAAGTAGAA TCAACTGAAA TGAGAAAGCT 11160 

AAGTTCAGAT AAAAACAAAG TTGGTCATGA AGAGCAACAT GTACTTTCTA AACCTTCAGA 11220 

ACATGATAAA GAGACTAGAA TTGATTCTGA GTCTTCAAGA ACTGATTCAG ACAGCTCGAT 11280 

GCAGACAGAG AAAATAAAAA AAGACAGTTC AGATGGAAAT AAAAGTAGTA ATCTGAAATC 11340 

TGAAGTAATA TCAGACAAAT CAAATACAGT ACCAAAATTG TCGGAATCTG ATGATGAAGT 11400

AAATAATCAG AAGCCATTAA CTTTACCGGA AGAACAGAAA TTGAAAAGAC AGCAAAGTCA 11460 

AAATGAGCAA ACAAAAACCT ATACATATGG TGATAGCGAA CAAAATGACA AGTCTAATCA 11520 

TGAAAATGAT TTAAGTCATC ATATACCATC GATAAGTGAT GATAAAGATA ACGTCATGAG 11580 

AGAAAATCAT ATTGTTGACG ATAATCCTGA TAATGATATC AATACACCAT CATTATCAAA 11640 

AACAGATGAC GATCGAAAAC TTGATGAAAA AATTCATGTT GAAGATAAAC ATAAACAAAA 11700 

TGCAGACTCG TCTGAAACGG TGGGATATCA AAGTCAGTCA ACTGCATCTC ATCGTAGCAC 11760

TGAAAAAAGA AATATTTCTA TTAATGACCA TGATAAATTA AACGGTCAAA AAACAAATAC 11820 

AAAGACATCG GCAAATAATA ATCAAAAAAA GGCTACATCA AAATTGAACA AAGGGCGCGC 11880 

TACGAATAAT AATTATAGTG ACATTTTGAA AAAGTTTTGG ATGATGTATT GGCCTAAATT 11940 

AGTTATTCTA ATGGGTATTA TTATTCTAAT TGTTATTTTG AATGCCATTT TTAATAATGT 12000 

GAACAAAAAT GATCGCATGA ATGATAATAA TGATGCAGAT GCTCaAAAAT ATACGACAAC 12060 

GATGAAAAAT GCCAATAACA CAGTTAAATC GGTCGTTACA GTTGAAAATG AAACATCAAA 12120 

AGATTCmTCA TTACCTAAAG ATAAAGCATC TCaAGACGAA GTGGGATCAG GTGTTGTATA 12180 

TAAAAAATCT GGAGATACGT TATATATTGT TACGAATGCA CACGTTGTCG GTGATAAAGA 12240 

AAATCaAAAA ATAACTTTCT CGAATAATAA AAGTGTTGTT GGGAAAGTGC TTGGTAAAGA 12300 

TAAATGGTCA GATTTAGCTG TTGTTAAAGC AACTTCTTCA GACAGTTCAG TGAAAGAGAT 12360 

AGCTATTGGA GATTCAAATA ATTTAGTGTT AGGAGAGCCA ATATTAGTCG TAGGTAATCC 12420 

ACTTGGTGTA GACTTTAAAG GCACTGTGAC AGAAGGTATT ATTTCAGGTC TGAACAGAAA 12480

TGTTCCTATT GATTTCGATA AAGATAATAA ATATGATATG TTGATGAAAG CTTTCCAAAT 12540 

TGATGCATCA GTAAATCCAG GTAACTCGGG TGGTGCTGTC GTCAATAGAG AAGGAAAATT 12600 

AATAGGTGTA GTTGCAGCTA AAATTAGTAT GCCAAACGTT GAAAnTATGT CATTTGCA 12658

(2) INFORMATION FOR SEQ ID NO: 128: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6048 base pairs 
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(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 128:

TGAAATnGAA TAGTACTATT GCAAGTGTAA AGAGGTTAAT TTTTGCCnCA CGCGGGACTT 60 

AAAAAGGCAA CCACTGGTTG TGACATATCC TTATTTACAT TTATAAATAT AAGGAGGAGG 120

TAGTAGTGAA AGACTTATTG CAAGCACAGC AAAAGCTTAT ACCGGATCTC ATAGATAAAA 180 

TGTATAAACG TTTTTCTATT CTTACTACTA TCTCAAAAAA TCAGCCTGTC GGACGTCGAA 240

GTTTAAGCGA ACATATGGAT ATGACTGAAC GTGTACTGCG TTCTGAAACA GATATGCTTA 300

AGAAACAAGA TTTGATAAAA GTTAAGCCTA CCGGAATGGA AATTACAGCT GAAGGTGAGC 360 

AACTGATTTC GCAATTGAAA GGTTACTTTG ATATCTATGC AGATGATAAT CGTCTGTCAG 420 

AAGGTATTAA GAATAAATTT CAAATTAAGG AAGTTCATGT TGTTCCTGGT GATGCTGATA 480

ATAGTCAATC TGTTAAAACA GAATTAGGTA GACAAGCAGG TCAATTACTT GAAGGCATAT 540 

TACAAGAAGA CGCGATAGTT GCTGTAACTG GCGGATCCAC GATGGCATGT GTTAGTGAAG 600 

CAATTCATTT ATTACCATAT AATGTATTCT TCGTACCAGC CAGAGGTGGA CTAGGCGAAA 660

ATGTTGTCTT TCAGGCAAAC ACAATTGCAG CCAGTATGGc aCAACAAGCT GGCGGTTATT 720 

ATACGACGAT GTATGTACCT GATAATGTCA GTGAAaCAAC ATATAATACA TTGTTGTTAG 780 

AGCCATCAGT CATAAACACT TTAGACAAAA TTAAACAAGC AAACGTTATA TTACACGGCA 840

TTGGTGATGC GCTGAAGATG GCGCATCGAC GTCAATCACC TGAAAAGGTC ATTGAACAAC 900 

TTCAACATCA TCAAGCTGTC GGAGAGGCAT TTGGTTATTA TTTTGATACA CAAGGTCAAA 960

TTGTCCATAA GGTTAAAACA ATTGGACTTC AATTAGAAGA CCTTGAATCA AAAGACTTTA 1020 

TTTTTGCAGT TGCAGGAGGC AAATCGAAAG GTGAAGCAAT TAAAGCATAC TTGACGATTG 1080 

CACCCAAGAA TACAGTGTTA ATCACTGATG AAGCCGCAGC AAAGATAATA CTTGAATAAG 1140

AGATAAAAAG TTTAATACTT TTTAAATATC ATTTTAAAGG AGGCCATTAT AATGGCAGTA 1200 

AAAGTAGCAA TTAATGGTTT TGGTAGAATT GGTCGTTTAG CATTCAGAAG AATTCAAGAA 1260 

GTAGAAGGTC TTGAAGTTGT AGCAGTAAAC GACTTAACAG ATGACGACAT GTTAGCGCAT 1320 

TTATTAAAAT ATGACACTAT GCAAGGTCGT TTCACAGGTG AAGTAGAGGT AGTTGATGGT 1380

GGTTTCCGCG TAAATGGTAA AGAAGTTAAA TCATTCAGTG AACCAGATGC AAGCAAATTA 1440 

CCTTGGAAAG ACTTAAATAT CGATGTAGTA TTAGAATGTA CTGGTTTCTA CACTGATAAA 1500 

GATAAAGCAC AAGCTCATAT TGAAGCAGGC GCTAAAAAAG TATTAATCTC AGCACCAGCT 1560 
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ACAGTTGTTT CAGGTGCTTC ATGTACTACA AACTCATTAG CACCAGTTGC TAAAGTTTTA 1680 

AACGATGACT TTGGTTTAGT TGAAGGTTTA ATGACTACAA TTCACGCTTA CACAGGTGAT 1740 

CAAAATACAC AAGACGCACC TCACAGAAAA GGTGACAAAC GTCGTGCTCG TGCAGCGGCA 1800 

GAAAACATCA TCCCTAACTC AACAGGTGCT GCTAAAGCTA TCGGTAAAGT TATTCCTGAA 1860 

ATCGATGGTA AATTAGATGG TGGTGCACAA CGTGTTCCTG TAGCTACAGG TTCATTAACT 1920 

GAATTAACAG TAGTATTAGA AAAACAAGAC GTAACAGTTG AACAAGTTAA CGAAGCTATG 1980 

AAAAATGCTT CAAACGAATC ATTCGGTtAC ACTGAAGACG AAATCGTTTC TTCAGACGTT 2040 

GTAGGTATGA CTTACGGTTC ATTATTCGAC GCTACACAAA CTCGTGTAAT GTCAGTTGGC 2100

GACCGTCAAT TAGTTAAAGT TGCAGCTTGG TATGATAACG AAATGTCATA TACTGCACAA 2160 

TTAGTTCGTA CATTAGCATA CTTAGCTGAA CTTTCTAAAT AATTTTAGTA TAGTTTTTAT 2220 

TCAAATACGC TAGTGCTCAG AACTATTTAG CATTAATTAA AGCTTATGAG TAAGCGGGGA 2280

GCACAAACGC TTCTCCGCTT ATTTTTATAT AAAATTTCCT AATTACAAGG AGGAAACACC 2340 

ATGGCTAAAA AAATTGTTTC TGATTTAGAT CTTAAAGGTA AAACAGTCCT AGTACGTGCT 2400

GATTTTAACG TACCTTTAAA AGACGGTGAA ATTACTAATG ACAACCGTAT CGTTCAAGCT 2460

TTACCTACAA TTCAATACAT CATCGAACAA GGTGGTAAAA TCGTACTATT TTCACATTTA 2520 

GGTAAAGTGA AAGAAGAAAG TGATAAAGCA AAATTAACTT TACGTCCAGT TGCTGAAGAC 2580 

TTATCTAAGA AATTAGATAA AGAAGTTGTT TTCGTACCAG AAACACGCGG CGAAAAACTT 2640

GAAGCTGCTA TTAAAGACCT TAAAGAAGGC GACGTATTAT TAGTTGAAAA TACACGTTAT 2700 

GAAGATTTAG ACGGTAAAAA AGAATCTAAA AATGATCCAG AATTAGGTAA ATACTGGGCA 2760

TCTTTAGGTG ATGTGTTTGT AAATGATGCT TTTGGTACTG CGCATCGTGA GCATGCATCT 2820 

AATGTTGGTA TTTCTACACA TTTAGAAACT GCAGCTGGAT TCTTAATGGA TAAAGAAATT 2880 

AAGTTTATTG GCGGCGTAGT TAACGATCCA CATAAACCAG TTGTTGCTAT TTTAGGTGGA 2940

GCAAAAGTAT CTGACAAAAT TAATGTCATC AAAAACTTAG TTAACATAGC TGATAAAATT 3000

ATCATCGGCG GAGGTATGGC TTATACTTTC TTAAAAGCGC AAGGTAAAGA AATTGGTATT 3060 

TCATTATTAG AAGAAGATAA AATCGACTTC GCAAAAGATT TATTAGAAAA ACATGGTGAT 3120 

AAAATTGTAT TACCAGTAGA CACTAAAGTT GCTAAAGAAT TTTCTAATGA TGCCAAAATC 3180 

ACTGTAGTAC CATCTGATTC AATTCCAGCA GACCAAGAAG GTATGGATAT TGGACCAAAC 3240 

ACTGTAAAAT TATTTGCAGA TGAATTAGAA GGTGCGCACA CTGTTGTATG GAATGGACCT 3300 

ATGGGTGTAT TCGAGTTCAG TAACTTTGCA CAAGGTACAA TTGGTGTATG TAAAGCAATT 3360 
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TCTTTAGGTT TTGAAAATGA CTTCACTCAT ATTTCAACTG GTGGCGGCGC GTCATTAGAG 3480 

TACCTAGAAG GTAAAGAATT GCCTGGTATC AAAGCAATCA ATAATAAATA ATAAAGTGAT 3540 

AGTTTAAAGT GATGTGGCAT GTTTGTTTAA CATTGTTACG GGAAAACAGT CACAAGATGA 3600 

CATCGTGTTT CATCACTTTT CAAAAATATT TACAAAACAA GGAGTGTCTT TAATGAGAAC 3660 

ACCAATTATA GCTGGTAACT GGAAAATGAA CAAAACAGTA CAAGAAGCAA AAGatTCGTC 3720 

AATACATTAC CAACACTACC AGATTCAAAA GAAGTAGAAT CAGTAATTTG TGCACCAGCA 3780

ATTCAATTAG ATGCATTAAC TACTGCAGTT AAAGAAGGAA AAGCACAAGG TTTAGAAATC 3840 

GGTGCTCAAA ATACGTATTT CGAAGATAAT GGTGCGTTCA CAGGTGAAAC GTCTCCAGTT 3900 

GCATTAGCAG ATTTAGGCGT TAAATACGTT GTTATCGGTC ATTCTGAACG TCGTGAATTA 3960 

TTCCACGAAA CAGATGAAGA AATTAACAAA AAAGCGCACG CTATTTTCAA ACATGGAATG 4020 

ACTCCAATTA TATGTGTTGG TGAAACAGAC GAAGAGCGTG AAAGTGGTAA AGCTAACGAT 4080

GTTGTAGGTG AGCAAGTTAA GAAAGCTGTT GCAGGTTTAT CTGAAGATCA ACTTAAATCA 4140 

GTTGTAATTG CTTATGAACC AATCTGGGCA ATCGGAACTG GTAAATCATC AACATCTGAA 4200 

GATGCAAATG AAATGTGTGC ATTTGTACGT CAAACTATTG CTGACTTATC AAGCAAAGAA 4260 

GTATCAGAAG CAACTCGTAT TCAATATGGT GGTAGTGTTA AACCTAACAA CATTAAAGAA 4320 

TACATGGCAC AAACTGATAT TGATGGGGCA TTAGTAGGTG GCGCATCACT TAAAGTTGAA 4380 

GATTTCGTAC AATTGTTAGA AGGTGCAAAA TAATCATGGC TAAGAAACCa ACTGCGTTAA 4440 

TTATTTTAGA TGGTTTTGCG AACCGCGAAA GCGAACATGG TAATGCGGTA AAATTAGCAA 4500 

ACAAGCCTAA TTTTGATCGT TATTACAACA AATATCCAAC GACTCAAATC GAAGCGAGTG 4560 

GCTTAGATGT TGGACTACCT GAAGgACAAA TGGGTAACTC AGAAGTTGGT CATATGAATA 4620 

TCGGTGCAGG ACGTATCGTT TATCAAAGTT TAACTCGAAT CAATAAATCA ATTGAAGACG 4680

GTGATTTCTT TGAAAATGAT GTTTTAAATA ATGCAATTGC ACACGTGAAT TCACATGATT 4740

CAGCGTTACA CATCTTTGGT TTATTGTCTG ACGGTGGTGT ACACAGTCAT TACAAACATT 4800 

TATTTGCTTT GTTAGAACTT GCTAAAAAAC AAGGTGTTGA AAAAGTTTAC GTACACGCAT 4860 

TTTTAGATGG CCGTGACGTA GATCAAAAAT CCGCTTTGAA ATACATCGAA GAGACTGAAG 4920

CTAAATTCAA TGAATTAGGC ATTGGTCAAT TTGCATCTGT GTCTGGTCGT TATTATGCAA 4980 

TGGATCGTGA CAAACGTTGG GAACGTGAAG AAAAAGCTTA CAATGCTATT CGTAATTTTG 5040 

ATGCCCCAAC TTATGCAACT GCCAAAGAAG GTGTAGAAGC AAGCTATAAT GAGGGCTTAA 5100 

CTGACGAATT CGTAGTACCA TTCATCGTTG AGAATCAAAA TGACGGTGTT AATGATGGAG 5160 
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CGAACAGAGC 


ATTCGAAGGC TTTAAAGTTG AACAAGTTAA AGACTTATTC TATGCAACAT 5280

TCACTAAGTA TAATGACAAT ATCGATGCGG CTATCGTCTT CGAAAAAGTT GATTTAAATA 5340

ATACAATTGG TGAAATTGCA CAAAATAACA ATTTAACTCA ATTACGTATT GCAGAAACTG 5400

AAAAATACCC TCACGTTACT TACTTTATGA GTGGTGGACG TAACGAGGAA TTTAAAGGTG 5460

AACGCCGTCG TTTAATTGAT TCACCTAAAG TTGCAACGTA TGACTTGAAA CCAGAAATGA 5520

GTGCTTATGA AGTTAAAGAT GCATTATTAG AAGAGTTAAA TAAAGGTGAC TTGGACTTAA 5580

TTATTTTAAA CTTTGCTAAC CCTGATATGG TTGGACATAG TGGTATGCTT GAGCCGACAA 5640

TCAAAGCAAT CGAAGCGGTT GATGAATGTT TAGGAGAAGT GGTTGATAAG ATTTTAGACA 5700

TGGACGGTTA TGCAATTATT ACTGCTGACG ATGGTAACTC TGATCAAGTA TTGACGGaTG 5760

ATGATCAACC AATGACTACG CAwACAACGA ACCCAGTACC AGTGATTGTA ACAAAAGAAG 5820

GCGTTACACT TAGAGAAACT GGTCGCTTAG GTGACTTAGC ACCTACATTA TTAGATTTAT 5880

TAAATGTAGA ACAACCTGAA GATATGACAG GTGAaTCTTT AATTAAACAC TAATATTGTA 5940

AAAGATGTTA AGTAAACGCT TAATGACACT TATTTTTTGA AAATAATAGT AATATCnTTT 6000

TGTTAAATGA AAGAATAAAG CTATAATAAT TATAGAATAA CTATTTAn
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(2) INFORMATION FOR SEQ ID NO: 129: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5602 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 129: 
AAAGAAGTGC AAGATATCAT CGCATTAATT AAGTCGTTAC AAAgTGTAAT TGTAGACaTC 60 

GCTTCCAATA ATGTTGATAC AATTATGCCT GGTTATACTC ATTTACAGCG TGCACAGCCA 120

ATTTCATTTG CACATCATAT TATGACTTAT TTTTGGATGT TACAACGAGA CCAACAACGA 180 
TTTGAAGATA GTTTAAAACG AATCGATATT AATCCTTTAG GTGCAGCAGC CTTAAGTGGT 240

ACCACATACC CTATCGATAG ACACGAGACA ACAGCATTGT TGAACTTTGG CAGTCTCTAT 300

GAGAATAGCC TAGATGCTGT TAGTGACAGA GACTATATTA TTGAAACATT GCATAATATT 360 
TCTTTAACGA TGGTTCACTT ATCACGCTTT GCAGAGGAAA TTATTTTCTG GTCCACAGAC 420 

GAAGCTAAAT TCATTACATT ATCAGATGCA TTTTCAACTG GCTCATCTAT TATGCCACAA 480 
AAGAAAAATC CTGATATGGC AGAATTAATT AGAGGTAAAG TTGGTCGAAC GACTGGTCAT 540 
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GAAGATAAAG AAGGTTTATT CGATGCTGTC CATACAATTA AAGGTTCTTT ACGTATTTTC 660 

GAAGGTATGA TTCAAACGAT GACAATTAAT AAAGAACGAC TCAATCAAAC TGTTAAAGAA 720 

GATTTTTCAA ATGCAACGGA ACTAGCAGAT TATTTAGTAA CTAAAAATAT TCCATTTAGA 780

ACTGCACATG AAATTGTAGG AAAAATCGTC TTAGAATGTA TACAACAAGG TCATTATTTA 840 

TTAGATGTTC CTTTAGCAAC ATATCAACAA CATCATTCTA GTATTGATGC CGATATTTAC 900 

GATTATTTGC AGCCTGAAAA TTGTTTAAAA CGACGTCAAA GTTACGGTTC AACAGGTCAA 960 

TCATCGGTCA AACAACAACT TGATGTTGCT AAACAATTAC TATCACAATA AATACGTTAA 1020 

TCTACCTACC CACAATGTCT ATTAAAATTA CATTGTGGGT ATTTTAATGC TCTCTTCGTC 1080

TTGTTGAACA TCACATTTTT AAGATTCCTA AAATGTTTGA TAATTCTTTT AAATTTATAT 1140 

TACAAAAATG TTATAAATTG TAAAAGAAAT GTGTAAAGCG TTTTCACAAG CAGGTTTTTG 1200 

TAGTATTTTA AAATTGTTAG ACTACAAATA AAGAGATGAA AGGATAAAGA CTATGACTAA 1260

CTCTTCGAAA AGCTTCACTA AATTTATGGC TGCTTCTGCT GTTTTTACTA TGGGATTTTT 1320 

ATCAGTACCT ACTGCTGGCG CTGAACAAAC AAATCAAATT GCAAATAAAC CTCAGGCTAT 1380

TCAATGGCAT ACAAATTTAA CGAATGAGCG ATTCACTACT ATCGCACATC GTGGCGCAAG 1440 

TGGCTATGCA CCCGAGCATA CGTTTCAAGC ATATGATAAG AGTCATAATG AGTTAAAAGC 1500 

ATCTTATATC GAAATTGATT TACAACGTAC CAAAGATGGC CATTTAGTTG CTATGCATGA 1560 

TGAAACTGTT AACCGTACAA CAAATGGACA CGGTAAAGTT GAGGATTATA CCCTTGATGA 1620

ATTAAAACAG TTAGATGCAG GAAGTTGGTT TAATAAAAAA TATCCAAAAT ACGCAAGAGC 1680

AAGTTATAAA AATGCTAAAG TACCCACTTT AGATGAAATT TTAGAACGTT ATGGCCCGAA 1740

TGCAAACTAT TATATTGAAA CAAAGTCACC TGATGTATAC CCAGGAATGG AAGAACAATT 1800 

ATTASCTTCA TTGAAAAAGC ATCACCTTTT AAATAACAAT AAATTAAAAA ATGGACATGT 1860 

AATGATTCAA TCATTTTCTG ACGAAAGTTT AAAGAAAATT CATCGTCAAA ATAAGCATGT 1920

GCCATTAGTA AAATTAGTTG ATAAAGGTGA ACTACAACAA TTTAACGACC AACGCTTAAA 1980 

AGAGATACGC TCTTATGCGA TTGGATTAGG TCCTGATTAT ACAGATTTAA CTGAACAAAA 2040

TACCCATCAT TTAAAAGACT TAGGATTTAT AGTACATCCT TATACAGTGA ATGAAAAAGC 2100 

TGATATGTTA CGATTAAATA AATATGGCGT TGATGGTGTC TTTACAAATT TCGCTGATAA 2160 

ATATAAAGAA GTCATTAAGT AGTAATGTTA AACTAGAAAA CATAAATACA AAAATATAGC 2220 

TATTACTATA AAAAACAGCA GTAAGATATT TCCAAATTGA AATTATCCTA CTGCTGTCTT 2280 

TTTGGGAGTG GGACAGAAAT GATATTTTCG CAAAATTTAT TTCGTCGTCC CACCCCAACT 2340
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TTGTCTGTAG 


AAATTGAGGA GCTAATTTCT CTGTGTCGGG GCTCCACCCC AACTTGCACA 2460

CTATTGTAAG CTGACTTTCC GCCAGCCTCT GTGTTGGGGC CCCGCCAACT TGCACACTAT 2520

TGTAAGCTGA CTTTCCACCA GCCTCTGTGT TGGGGCCCCG ACTATTTTTG AAAAGAGCGT 2580

GTTACACGGG CATTGTTTTA CAGTCAACTA CTGCTAAAAT AAAATTAACG AGCTTAGGGC 2640

TTTGTTTTCT GTCCCAAGCT CGTTAAATCA CATATGATAA TTAATTATGC CCAACCACGA 2700

TATCTAGCTG CTTCTGCTGT ACGTTTAATA CCTATGATAT ATGCTGCAAG TCTCATATCT 2760

ATTTTTCGGT TTTGAGACAA TTCGTAAATC GTATCAAATG CCGCTTCTAA TTTTTCACGT 2820

AGCTTTTCAT TAACTTCTTC TTCAGACCAA TAATAACCTT GATTATTTTG TACCCATTCG 2880

AAGTAAGAAA CCGTtACACC ACCAGCACTT GCTAATACGT CTGGAACTAA TAATATACCA 2940

CGTTCAGTTA AAATACGTGT TGCTTCTGGT GTTGTAGGTC CATTAGCAGC TTCAACAACG 3000

ATACTAGCTT TAATATCATG TGCATTGTCT TCTGTAATTT GGTTTGAAAT AGCCGCTGGT 3060

ACTAAAATGT CACAATCTAA TTCAAACAAT TCTTTATTTG AGATTGTTTC TTCAAATAAA 3120

TTTGTTACCG TACCAAAACT ATCACGACGG TCTAATAAAT AATCTATATC TAAGCCATTT 3180

GGATCGTGTA ATGCACCGTA AGCATCAGAG ATACCTACAA TTTTTGCACC TAAATCATAT 3240

AAGAATTTAG CTAAGAAACT TCCGGCATTA CCGAAACCTT GAATAACAAC CTTGGCACCT 3300

TCAATTTGCA TATTACGACG TTTTGCAGCT TGTTCAATTG CAATAACTAC ACCTAGTGCA 3360

GTTGATCTGT CGCGTCCATG AGAACCACCC AATACAATTG GTTTACCTGT GATGAAACCT 3420

GGTGAATTAA ATTTATCTAA TGCACTATAT TCATCCATCA TCCAAGCCAT AATTTGTGAG 3480

TTTGTAAATA CATCTGGTGC TGGAATATCT TTGTTCGGAC CTACGAATTG TGAAATTGCT 3540

CTTACATATC CGCGTGATAA ACGTTCAACT TCATGAATGC TCATTTGACG TGGATCACAA 3600

ACGMACCAC CCTTACCACC ACCGTATGGT AAGTTTACAA TGCCACATTT CAAAGTCATC 3660

CACATTGATA ATGCTTTTAC TTCTTCTTCA TCAACATCTG GGTGGAAACG CACGCCCCCT 3720

TTTGTTGGTC CAACAGCATC ATTATGTTGC GCACGGTAAC CTGTGAATGT TTTTACTGTG 3780

CCATCATCCA TTCGTACAGG GATACGCACT TGTAACATTC TTAAAGGTTC TTTAATTAAA 3840

TCGTACATTC CTtCGTCAAA TCCCAATTTA TGCAATGCTT CTTTAATAAT TCCTTGAGTA 3900

GAAGTTACTA AATTATTGTT CTCAGTCATG ATCCTTTTCG CCTCTTCTTT ACCTAATGAT 3960

TTCGCTTTCA AACATATTGT AACATAACGT ATTCCTTTTT AAAGCCCTTA CAAACTGATT 4020

GTTACAACTT TTTGACATTA TTGAAATACA TGTCTTATTT TTTCAAGTGC AAGGTCCAAT 4080

TCTTCTTTAG TAATAATTAA TGGTGGTGCA AAACGAATGA CAGTATCATG CGTTTCTTTA 4140
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- ACACCTATAA ACAAACCACG TCCACGGACT TCTTTAATTG ATGGATGATC AATTTGCTTT 4260 

AATTGTTCTT TAAAATAATC TCCTAATTCT AAAGAGCGGC CTGGTAAATC CTCATCAACG 4320 

ATAACATCTA ATGCAGCAAT TGATGCAGCA CAAGCAAGTG GATTACCACC AAATGTTGAA 4380

CCATGTGAGC CAGGTGTAAA GACATCTAAT ACTTCTTTAT CTGCTAATAC AACAGAAATT 4440

GGGAAGACTC CACCACCTAG TGCTTTACCT AAAATATAGA CATCAGGTTT TACATTATCC 4500 

CAATCCGTAG CAAATAATTT ACCCGAACGA CCTAATCCTG CTTGGATTTC GTCAGCAATA 4560 

AATAAGACAT TATGTTCATC ACATAATTCT CTAATTGCTT TCAAATATCC TTCTGGCGGT 4620 

ATATTTATAC CCGCTTCACC TTGAATTGGT TCTACTAAAA CTGCTGCAGT ATTTTCATTA 4680 

ATTGCAGCTT TCAATGCATC TACATCTCCA AAATCAACTT TTCTAAATCC ATCTAATAAC 4740

GGACCATAAC CACGTTGGTA TTCTGCTTCT GAAGATAATG AAACTGGCGC CATTGTTCGA 4800

CCATGGAAGT TACCATTAAA TGCAATGATT TCTGCTTTAT TTGGCTCAAT TCCTTTAACA 4860

TCGTATGCCC AGCGTCGTGC TGCTTTCAAA GCTGTTTCTA CTGCTTCAGC ACCTGTATTC 4920 

ATTGGTAAAG CTTTATCTTT ACCTGCCAGT TTACAAATTT TTTCGTACCA TTCACCTAAG 4980 

TTATCACTAT GAAAAGCACG TGAAACTAAA GTCACTTTAT CAGCTTGATC TTTTAATGCT 5040

TGAATAATTT TCGGATGTCT ATGACCTTGG TTAACAGCGG AATATGCAGA TAACATATCC 5100 

ATATATTTAT TGCCTTCAGG ATCTTTAACC CATACCCCTT CAGCTTcTGa AATGaCAATT 5160

GGCAATGGTA AATAATTATG TGCTCCGTAA TGATTTGTTA ACTCAATAAT TTTTTCAGAT 5220

TTAGTCATCA TATCTCCCCT TTTCATCATT TATAACTATT ATACATGAAA CATTATCCAA 5280 

ATAATTACAT TAGTTTTCAA AGCAGATACT TTTCCACCAA AAAAGATGAA ATAATCACTA 5340 

AGTTTCATTA AATTTGTCTA TTTTGAAAAC CCTTACATTT ATAATGACAT AATTACTTAA 5400

ATGaJTACAA GCAAAAGAAT TGATAATTTT ACACTTAATC AAAAGTATAT TTTACTAAGA 5460 

ATATTTTTAT TTATAAATAT TGAAAACCAC TAACAAATTG CATACACAAT ATCATTAGTG 5520 

GTAACAGTTA AACACTTATT TATCTTTACG GGGTAATGGG TTAAAACCCT TnCATTAAAA 5580

TTGGATGnCC ATAAAATTAG GG 5602
(2) INFORMATION FOR SEQ ID NO: 130:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5924 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear
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10 





TAACCCCATT TTACCTGGAA AAATCgTTTG CGATGCaATm GCaTTtGaAT ATAaATACAT 60 

TTTACGTATa GAATTATAAA AgGTTTCATT CaAATCTTAG GGTCAAAAAT GTTATAATAT 120 

TTTTATGTCA AATTTAAAAC AGTAACACTT ATTTACAAGG TTGCAATATT TTGAAGTAAT 180 

AAAGGAAGTG TCGCGTATTT TAACTTTTTC AGAGCAAAAT GCACTCGCGA AAATAGATGA 240 

TTTAATGAAT ACTTATTGCA ATCAATGTCC AATCAAAACT CGTCTGCGTA AATTAGAGGG 300 

GAAAACGAAG GCGCATCATT TTTGTATCAA TGAGTGTTCA ATAGGGAAAG AAATAAAACA 360 

ATTAGGAAAT GAACTTCAAT AGGAGGAAGT CAAATGAAAA TTATATCTAT ATCAGAAACA 420 

CCGAACCACA ACACAATGAA GATTACACTT AGTGAAAGCA GAGAAGGTAT GACATCAGAT 480

ACGTATACTA AAGTTGATGA TTCACAGCCA GCATTTATTA ATGACATCTT AAAGGTTGAA 540 

GGCGTTAAAT CAATTTTCCA TGTTATGGAC TTTATTTCAG TAGATAAAGA AAATGACGCA 600 

AATTGGGAAA CAGTATTGCC AAAAGTAGAG GCTGTATTCG AATAAATTTT TCATCAACTA 660

GTATTCGGGG GGAATAAAGT ATATGGAAAT TTTACGTATA GAGCCAACAC CAAGTCCAAA 720 

TACAATGAAA GTTGTTTTGT CATATACAAG AGAAGACAAG TTATCTAATA CTTATAAAAA 780 

AGTAGAAGAA ACACAACCAA GATTTATAAA TCAGTTGTTA TCTATAGATG GTATCACTTC 840

CATTTTTCAT GTCATGAACT TCTTAGCTGT TGATAAGGCA CCAAAAGCTG ATTGGGAAGT 900 

CATATTACCT GATATTAAAG CTGCTTTTTC TGATGCGAAT AAGGTTTTAG AATCTGTAAA 960 

TGAACCTCAA ATTGACAATC ATTTTGGTGA AATTAAAGCT GAATTATTAA CTTTTAAGGG 1020 
TATACCGTAT CAAATTAAGC TAACTTCTGC TGACCAAGAA TTAAGAGAAC AATTACCACA 1080

AACATATGTT GACCATATGA CTCAAGCGCA AACAGCACAT GACAATATTG TTTTTATGCG 1140

TAAATGGCTA GATTTAGGAA ATCGCTATGG AAATATTCAA GAAGTAATGG ATGGTGTCCT 1200 

AGAAGAAGTG CTAGCTACCT ATCCAGAATC ACAGTTACCC GTATTGGTAA AACATGCTTT 1260 

AGAAGAAAAT CACGCAACTA ATAATTATCA TTTCTATCGA CATGTCTCTT TGGATGAATA 1320

TCATGCAACT GATAATTGGA AGACTCGATT ACGAATGTTA AACCATTTTC CAAAGCCGAC 1380 

TTTTGAAGAT ATACCGCTGC TTGATTTAGC TTTATCTGAT GAAAAAGTAC CGGTTAGACG 1440

TCAAGCGATT GTATTATTAG GTATGATTGA AAGTAAAGAA ATTTTACCGT ATTTATATAA 1500

GGGGCTTCGT GATAAAAGTC CTGCTGTAAG AAGAACAGCA GGGGATTGCA TAAGCGATTT 1560 

AGGGTATCCA GAGGCACTAC CAGAAATGGT GCTACTATTA GATGATCCAC AGAAAATCGT 1620 

TAGGTGGCGT GCTGCTATGT TTATCTTTGA TGAAGGTAAT GCAGAGCAGC TTCCCGCACT 1680 

AAAAGCCCAT ATTAATGACA ATGCGTTTGA AGTTAAATTA CAAATTGAAA TGGCCATATC 1740 
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AATTTAATTG GAGGAATTAA ATATGAATGC ATATGATGCT TATATGAAAG AAATTGCGCA i860 

ACAAATGCGT GGCGAATTAA CTCAAAATGG TTTTACAAGT TTAGAAACGA GCGAACAGct 1920 

ATCGGAGTAT ATGAACCAAG TAAATGCTGA TGACACTACT TTTGTAGTTA TTAACTCTAC 1980 

ATGCGGCTGT GCAGCTGGAT TAGCAAGACC AGCTGCAGTA GCAGTTGCAA CACAAAATGA 2040

ACATAGACCT ACAAATACAG TTACAGTTTT TGCTGGGCAA GATAAAGAAG CAACTGCTAC 2100 

AATGCGAGAA TTCATTCAGC AAGCACCATC TAGTCCTTCG TATGCTTTAT TCAAAGGTCA 2160 

AGATTTAGTT TATTTTATGC CTAGAGAATT TATCGAAGGT AGAGATATTA ATGACATTGC 2220 

AATGGACTTA AAGGATGCCT TTGACGAAAA TTGTAAATAG TACACATAAA TAAATATAAA 2280 

GGTTAACACA TTTTATAATA TTAAAAATGG TGTCTGTCAT TGAAAATAGA GAATATAGTT 2340 

GTATTCTATT TGTTAAATAA AGTCCGTTTT TACCaACTAT ATTTTCTAGA AATTTAACTG 2400 

TTTTAATAGG ACATCAAACA TAATATTCaA ATCaTGTGTT AAGCTCTTTT TTAAAATTTT 2460

TTAGCATTAA AGTTATAGAT TTGGGTAAAC AATTACCAAT TGGAAACATA TATCACGTTA 2520 

CGATGGGGTA GGTACTTAAT CAGCATTTTA TAAATAAAGT AACGGAATTG ATGATATTAA 2580 

TATCATATTC CTAAAATGAG TGATAACAAA ATGCTACATA AAGTTAAGTT ATATCAAACT 2640 

AAATATACAT ACTATAAATA ATGAAAATGA GGTGTTATCG CATATGTTGA ATTCATTTGA 2700 

TGCAGCATAT CACAGTCTTT GTGAAGAAGT TTTAGAAATA GGAAATACAC GAAATGATCG 2760 

CACAAATACA GGTACGATTT CGAAATTTGG TCATCAACTT CGCTTTGACT TATCTAAAGG 2820

ATTTCCACTA TTAACGACAA AGAAAGTTTC TTTTAAATTA GTAGCAACCG AATTATTATG 2880

GTTCATTAAA GGAGATACAA ACATCCAATA CTTATTAAAA TATAATAATA ATATATGGAA 2940 

CGAATGGGCT TTTGAAAATT ATATCAAATC AGACGAGTAT AAAGGTCCAG ATATGACAGA 3000 

TTTCBGGCAT CGTGCATTGA GTGATCCTGA ATTTAACGAA CAATATAAAG AACAAATGAA 3060 

ACAATTTAAG CAACGTATTC TTGAAGATGA TACATTTGCG AAGCAATTCG GGGATTTAGG 3120

AAATGTTTAT GGTAAACAAT GGCGAGATTG GGTTGATAAA GATGGTAATC ATTTTGATCA 3180 

ACTTAAAACA GTAATTGAAC AAATTAAGCA TAATCCAGAT TCAAGGCGAC ACATCGTATC 3240

TGCATGGAAT CCAACAGAAA TTGATACAAT GGCACTTCCG CCTTGTCATA CCATGTTCCA 3300

GTTTTATGTC CAAGATGGTA AGTTAAGTTG CCAGTTATAC CAACGTAGCG CAGATATCTT 3360 

TTTAGGTGTG CCATTTAATA TCcGCagctA CGCTTTATTG ACACACCTTA TTGCCAAAGA 3420 

ATGTGGACTT GAAGTGGGTG AATTTGTGCA TACATTTGGA GATGCACATA TTTATTCAAA 3480

TCATATTGAT GCGATTCAAA CACAATTAGC ACGTGAAAGC TTCAATCCTC CAACATTAAA 3540 
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TGAATCACAT CCAGCAATAA AAGCTCCAAT 
AGCAGTGTAG TCATTGCATA GTTAGCTAAC 3660

CATATAGACA TCAAAATGAC ATCATAGTAT TTTCAAGTGC AAAAAAGTAC TTTTTTGTGT 3720

TAAACGTTTT CATAAATTAT GCAAAATCAT TATTTCTATC ACACTTTATG ATAAAAATTG 3780

TGTTAAATTA AAGATAACTT AGTAATAAAA AATGAAATGA TAGAAGAAGG AGGATAATTA 3840

TGACTTTATC CATTCTAGTt GCACATGACT TGCAACGAGT AATTGGTTTt GAAAATCAAT 3900

TACCTTGGcA CCTACCAAAT GATTTGAAGC ATGTTAAAAA ATTATCAACA GGTCATACTT 3960

TAGTAATGGG TCGTAAGACA TTTGAATCGA TTGGTAAACC ACTACCGAAT CGTCGAAATG 4020

TTGTACTTAC TTCAGATACA AGTTTCAACG TAGAnGGCGT TGATGTAATT CACTCTATTG 4080

AAGATATTTA CCAACTACCG GGCCATGTTT TCATATTTGG AGGGCAAACA TTATTTGAAG 4140

AAATGATTGA TAAAGTGGAC GACATGTATA TTACTGTTAT TGAAGGTAAA TTCCGTGGTG 4200

ATACGTTCTT TCCACCTTAT mCATTkGAgr CTGGGAAGTT GCCTCTTCAG TTGAAGGTAA 4260

ACTAGATGAG AAAAATACAA TTCCACATAC CTTTCTACAT TTAATTCGTA AAAAATAAGG 4320

GGGAAAACGA CCATGACAAA ACAGATTATA GTAACAGACT CAACATCCGA TTTATCTAAA 4380

GAATACTTAG AAGCAAACAA CATTCATGTA ATTCCTTTAA GTTTAACTAT TGAAGGAGCT 4440

TCATACGTTG ACCAAGTAGA TATTACATCA GAAGAATTTA TTAATCATAT TGAAAATGAT 4500

GAAGATGTAA AGACAAGTCA GCCAGCCATA GGTGAATTTA TATCTGCTTA TGAAGAACTA 4560

GGAAAAGATG GCTCTGAAAT CATAAGTATT CATCTTTCTT CAGGATTAAG TGGTACATAT 4620

AACACTGCTT ACCAAGCAAG TCAAATGGTA GATGCTAATG TAACTGTTAT TGATTCAAAA 4680

TCTATTTCTT TTGGTTTAGG GTATCAAATA CAACACCTAG TAGAGCTTGT AAAAgAaGGT 4740

GtCTCAACTT CTGAAATAGT TAAAAAGTTA AATCATTTAA GAGAAAACAT TAAATTATTT 4800

GTAGTTATAG GGCAATTGAA TCAATTAATT AAAGGTGGCA GAATTAGTAA AACAAAAGGT 4860

TTGATTGGTA ATCTTATGAA AATTAAACCA ATTGGTACAC TAGATGATGG TCGCTTAGAG 4920

CTTGTGCmCA ATGCGAGAAC TCaAAATTCk AGTATCCAAT ACTTGAAAAA GGAAATTGCT 4980

GAATTTATAG GAGATCATGA AATCAAATCC ATTGGTGTCG CACATGCTAA CGTCATTGAA 5040

TATGTTGATA AATTGAAGAA AGTTTTTAAT GAAGCTTTTC ATGTGAATAA TTACGATATA 5100

AATGTAACTA CACCAGTTAT TTCTGCACAT ACTGGTCAAG GTGCGATTGG CCTCGTAGTC 5160

CTTAAGAAGT AAATTTAATC TTTTCAGTGT TAATTACTTC CATTTCAATC CTTTATAGAC 5220

TAAATTTATA ATTAGATAGA TAGAGGAGGT AATTCATATG ACAAAAGAAT ATGCAACATT 5280

AGCAGGAGGA TGTTTCTGGT GCATGGTTAA ACCATTTACA TCATATCCAG GCATCAAGTC 5340
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GAATCAAACC GGCCATGTCG AAGCAGTACA AATTACGTTT GATCCAGAGG TTACTTGCTT 5460 

TGAAAATATA TTAGACATAT ATTTCAAAAC ATTTGACCCA ACTGATGATC AAGGGCAATT 5520 

TTTCGATAGA GGCGAAAGCT ATCAACCAGT CATTTTCTAT CATGATGAAC ATCAGAAAAA 5580 

GGCTGCTGAG TTTAAAAAGC AACAATTAAA TGAACAAGGT ATTTTCAAGA AACCAGTGAT 5640

TACACCTATT AAACCATATA AAAATTTCTA TCCAGCTGAA GACTACCATC AAGATTATTA 5700 

CAAAAAGAAC CCGGTACATT ATTACCAATA TCAACGTGGT TCAGGTAGAA AAGCGTTTAT 5760 

AGAATCACAT TGGGGGAATC AAAATGCTTA AAAAAGATAA AAGTGAACTA ACAGATATAG 5820 

AATATATTGT TACACAAGAn AACGGCACTG AACCACCATT TATGAATGAA TATTGGAATC 5880

ATTTTGCTAA AGGATTTATG TAGATAAAnT TCnGGTAAAC CTTG 5924 
(2) INFORMATION FOR SEQ ID NO: 131: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 9280 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 131: 

CCAATGTTTT AAAATCGCCa 60 

TTTTaCCAAA AGCATCaAAC 120 

TGACTCTCTT CGGTATACGT 180 

AACGTTCATC AACAATCACT 24 0 

ATAAATAAAT GCCTTCTTGG 300 

GCAATATAGT GATACCTTGT 360 

TTCCAATAAC ATTACTTAAA 420 

AGCTCATAAT GTATGCAATT 480 

CAGCAAATTT CAGAATCAAA 54 0 

GCTTTGCACC TTCTATAAAA 600 

TCATTGCTCT TCTTAAAATA 66 0 

TTTGCATAAT TTGCTCTTGA 720 

AATGCGGATG TAAATATTGA 780 

TTTCTTCCAT TGGACCTGGT 840 
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30 



35 



GGCCGTTnAA AATCTCCAAA ATAnAAAAAC CCATCTTGTT 
TCCaACACTT GaTCaATAGC TTGCAACAAC GTTGAACGTG 
GCTCCCACTA AAATCAGTGC TTCAAGTAAC TTTCTCGTTT 
CTAGCAAAAT CAAAGAAATC TTTAAATTTG CCGTTCTGAT 
TTCACACTTT GATAACCAAC ACCTTTAATT GTACCAATTG 
GAAGGTTTAT AAAACCAATG ACTTTCGTTA ATGTTCGGTG 
40 TTTTTTGCTT CTTCTATCAT TTGAGCAGTT TTCTTCTCAC 
ATATTTGCGT AAAAATAATT TGGATAATGG ACTTTTAAAA 
TTAGAATAGC TGACAGCATG TGCTCTAGGA AAACCATAAT 
TCAAATATTT GCTTACTAAT GTCTTCGTGA TAACCATTTT 
TGTTGACGCT CACTTTCAAG AACAGCTCTA TTTTTTTTAC 
TCCGCTTCAC CATAACTGAA GTTTGCAAAT GTGCTCGCTA 
TAAATAATAA CACCGTAAGT ATTTTTTAAT ATAGGTTCTA 
ACTTTG CTTG GATCATGTCT TCTTGTAATG TAAGTTGGAA 
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ACACTTCTTA CACCGTCAGA CTCTAATTGG 


AATATGCCAG TCGTATCTCC TTGCGACAAC 960

AATTCAAACA CTTTTTGATC ATCAAACGGA ATCTTTTCGA TATCAATATT AATACCTAAA 1020

itl llillljA LiiuiuilAA CiATTTGATGA ATAATCGATA AGTTTCTCAA CCCTAGAAAA 1080

lulAXiiilA AlAACt-CAAT ACGTTCGGCT TCAGTCATTG TCCATTGCGT TAATAATCCT 1140

v> A Al 11 1 1 1 APJVaKj GGCATATTCA TATAATGGAT GGTCATTAAT AATAATTCCT 1200

GCCGCATGTG TAGATGTATG TCTTGGTAAA CCTTCTAACT TTTTACAAAT ACTGAACCAG 1260

CGTTCATGTC GATGGTTTCG ATGTACAAAC TCTTTAAAAT CGTCAATTTG ATATGCTTCA 1320

TCAAGTGTAA TTCCTAATTT ATGTGGGATT AAACTTGAAA TTTCATTTAA TGTAACTTCA 1380

TCAAACCCCA TAATTCTTCC AACATCTCTA GCAACTGCTC TTGCAAGCAG ATGACCGAAA 1440

GTCACAATTC CAGATACATG TAGCTCGCCA TATTTTTCTT GGACGTACTG AATGACCCTT 1500

TCTCGGCGTG TATCTTCAAA GTCAATATCA ATATCAGGCA TTGTTACACG TTCTGGGTTT 1560

AAAAAACGTT CAAATAATAG ATTGAATTTA ATAGGATCAA TCGTTGTAAT TCCCAATAAA 1620

TAACTGACCA GTGAGCCAGC TGAAGAACCA CGACCAGGAC CTACCATCAC ATCATTCGTT 1680

TTCGCATAAT GGATTAAATC ACTTACTATT AAGAAATAAT CTTCAAAACC CATATTAGTA 1740

ATAACTTTAT ACTCATATTT CAATCGCTCT AAATAGACGT CATAATTAAG TTCTAATTTT 1800

TTCAATTGTG TAACTAAGAC ACGCCACAAA TATTTTTTAG CTGATTCATC ATTAGGTGTC 1860

TCATATTGAG GAAGTAGAGA TTGATGATAT TTTAATTCTG CATCACACTT TTGAGCTATA 1920

ACATCAACCT GCGTTAAATA TTCTTGGTTA ATATCTAATT GATTAATTTC CTTTTCAGTT 1980

AAAAAATGTG CACCAAAATC TTCTTGATCA TGAATTAAGT CTAATTTTGT ATTGTCTCTA 2040

ATAGCTGCTA ATGCAGAAAT CGTATCGGCA TCTTGACGTG TTTGGTAACA AACATtTTGA 2100

ATC»AACAT GTTTTCTACC TTGAATCGAA ATACTAAGGT GGTCCATATA TGTGTCATTA 2160

TGGGTTTCAA ACACTTGTAC AATATCACGA TGTTGATCAC ggactxtttt AAAAATGATA 2220

ATCATATTGT TAGAAAATCG TTTTAATAAT TCAAACGACA CATGTTCTAA TGCATTCATT 2280

TTTATTTCCG ATGATAGTTG ATACAAATCT TTTAATCCAT CATTATTTTT AGCTAGAACA 2340

ACTGTTTCGA CTGTATTTAA TCCATTTGTC ACATATATTG TCATACCAAA AATCGGTTTA 2400

ATGTTATTTG CTATACATGC ATCATAAAAT TTAGGAAAAC CATACAATAC ATTGGTGTCA 2460

GTTATGGCAA GTGCATCAAC ATTTTCAGAC ACAGCAAGTC TTACgGCATC TTCTATTTTT 2520

AAGCTTGAAT TTAACAAATC ATAAGCCGTA TGAATATTTA AATATGCCAC CATGATTGAA 2580

TGGCCCCTTT CTATTAGTTA AGTTTTGTGC GTAAAGCTGT AGCAAGTTGC TCAAATTCAT 2640
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CAATATCATT AATAATCAAT TGCCCTTTAG AACGTAATCG ACATCTGATT TCATTACCTT . 2760 

CATCGACTGC AAATACCCAT ATTTTCAAGC CTTTGATGTC AGCAATTGTA TTAACAAACT 2820 

GAGATGCTTC ATTTGGCTGA ATACCGAATT GCTCCAATAC ATCTTCAGTT ATTTTAACTT 2880 

GGCAGAATCC ATCATCCATA AGTTCGAAAT GTTGTAAAAC ATAACCTTGA AACGGCAACA 2940

TTTTTGGGTC CTTCTCCATC ATTTTATTTA AAAGCGCATT ATGATCAATA TCATGCCCAA 3000 

TTAACTTTCC AGCAATTTCC ATAGTATGTT CTGAGGTATT GTTAAAAAGG AATCGCCCAG 3060 

TATCACCGAC GATACCAAGA TATAAAACGC TCGCGATATC TTTATTAACA ATTGCTTCAT 3120 

CATTAAAATG TGAGATTAAA TCGTAAATGA TTTCACTTGT AGATGACGCG TTCGTATTAA 3180 

CTAAATTAAT ATCACCATAC TGATCAACTG CAGGATGATG ATCTATTTTA ATAAGTTTAC 3240 

GACCTX3TACT ATAACGTTCA TCGTCAATTC GTGGAGCATT GGCAGTATCA CATACAATTA 3300

CAAGCGCATC TTGATATGTT TTATCATCAA TGTTATCTAA CTCTCCAATA AAACTTAATG 3360

ATGATTCCGC TTCACCCACT GCAAATACTT GCTTTTGCGG AAATTTCTGC TGAATATAGT 3420 

ATTTTAAACC AAGTTGTGAA CCATATGCAT CAGGATCTGG TCTAACATGT CTGTGTATAA 3480

TAATTGTATC GTTGTCTTCG ATACATTTCA TAATTTCATT CAAAGTACTA ATCATTTTCA 3540

TACTCCCTTT TTTAGAAAAG TTGCTTAATT TAAGCATTAG TCTATATCAA AATATCTAAA 3600 

TTATAAAAAT TGTTACTACC ATATTAAACT ATTTGCCCGT TTTAATTATT TAGATATATA 3660 

TATTTTCATA CTATTTAGTT CAGGGGCCCC AACACAGAGA AATTGGACCC CTAATTTCTA 3720 

CAAACAATGC aAGTTGGGGT GGGGCCCCAA CGTTTGTGCG AAATCTATCT TATGCCTATT 3780

TTCTCTGCTA AGTTCCTATA cTTCGTCAAA CATTTGGCAT ATCACGAGAG CGCTCGCTAC 3840

TTTGTCGTTT TGACTATGCA TGTTCACTTC TATTTTGGCG AAGTTTCTTC CGACGTCTAG 3900 

TATGCCAAAG CGCACTGTTA TATGTGATTC AATAGGTACT GTTTTAATAT ACACGATATT 3960 

TAAGTTCTCT ATCATGACAT TACCTTTTTT AAATTTACGC ATTTCATATT GTATTGTTTC 4020 

TTCTATAATA CTTACAAATG CCGCTTTACT TACTGTTCCG TAATGATTGA TTAAAAGTGG 4080 

TGAAACTTCT ACTGTAATTC CATCTTGATT CATTGTTATA TATTTGGCGA TTTGATCGTT 4140 

AATTGTTTCA CCCATCTGAG GCTGTCTTCC TAAAAGTTGC ATAGACTTTA AAACATCTTG 4200

TCTATTAATC ACACCCACTG TCTTTTTATT ACTCGAAACG ACAGGAATCA ATTCAATACC 4260 

TTCCCAAATC ATCATATGCG CACAACTTGC TACTGTACTC ATAGCATTTA CATAAATAGG 4320 

ATTTCGCGTC ATCACTTTAT CTATTTCGTC GTCGTCCTTT GTATTAATCA TCTCTCGACT 4380

TGTTACAATA CCTACTAATT TATACGACTC ATTGACTACC GGAAATCTTG TATGGCCAGT 4440 
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ATCTAATGGC 


GTCATTATAT CTTGAACTAT TAAGATATCT TTTCGTATTT TCTGATTAAA 4560

AAGTGCTTTG TTGATAATAT TTGCAACTAG GAATGTATCA TAACTTGATG ATAGAACAGG 4620

TAAATCATGT TCATTCGCAA AATTAATAAC TTTATTAGAT GGCTTAAATC CACCAGTAAT 4680

TAATATAGCC GTACCTCTTT TTAAAGCTTC AATCTGCACA TCTTCACGAT TTCCGACAAT 4740

CAATAATGTC TTTGGACCAA TATACTTTAA AATATCTTTG AGTTCCATTG CTCCAATTGC 4800

AAATTTAGAT ACCATCTTAG TGATACCTTT GTTGCCACCT AACACTTGGC CATCAATAAT 4860

ATTGACAATT TCATTAAAAG TTAAATGTTC AATTTCATTA CGATTACGTT TTTCGATTCG 4920

AACCGTACCA ACACGATCTA TCGTTGCGAC CATGCCCATT TTATCAGCAT CTTTmATTGc 4980

ACGATATGCT GTCCCytCaG ATACGTTTAA AAATTTAGCG ATTTTACGCA CCGAAATTTT 5040

AGAGCCTATA GATAACGATT CAATATAATC TAAAATTTGT TCATGTTTTG TCATTCTTTA 5100

CCTCTTCTTT TCGAACAGTA TTAACTACAT TATAACTTTA TTTTGGATAA AAAGCATTGA 5160

AGTGAAATGA AATAATGATC GTTtCACCTA TTTTATTTTT TGAAAATATA CAACAAACAC 5220

AAAGATCACA AAATCTTTAA TTTTAAATGG AAAAATCCAT TATTATTTAT TAGAATGTAA 5280

GTGAGGAGGG ATGTACTAAT GTATAAAAAT ATATTACTTG GTGTAGACAC TCAGTTAAAA 5340

AATGAAAAAG CACTAAAAGA AGTGTCTAAA TTAGCTGGCG AAGGTACAGT CGTAACAGTT 5400

TTAAACGCAA TCAGCGAACA AGaTGCTCAA GCATCAATTA AAGCAGGTGT TCATTTAAAC 5460

AAACTTACTG AAGAACGAAG CAAGCGATTG GAAAAAACAC GCAAAGCTTT AGAAGATTAT 5520

GGTATTGATT ATGACCAAAT AATTGTTCGT GGTAATGCAA AAGAAGAACT ATTAAAACAT 5580

GCTAATAGCG GTAAATATGA AATTGTTGTT TTAAGTAACC GTAAAGCAGA AGACAAAAAG 5640

AAATTTGTAC TTGGAAGTGT CAGCCACAAA GTAGCAAAAC GTGCGACTAT CCCTGTATTA 5700

ATCQJTAAAT AAAATTTTTA TCCAGAATCA CAAATAATCT TTCAATCATG ATGCAGTCTC 5760

AAACGACTGA GTAAATACAA GAAACGATTA TGACTGTGGT TCTGGATTTT TTATATCGTA 5820

GTAAATTTAT AATCAATGTC TAATTGTATA AAACTAAAAT TACGAGAGTA GGTCAGAAAT 5880

GATAAAGAAC CACTGATGTC CCCCGTCCAC GTCGTAACTG AATCAGTAGA ATATAAAAAC 5940

ACCCACTAAA AATATGCAGA CGATAACTTC CACATAGATT AGCGAGGTGT TTTTTAGTGT 6000

AAAATCTATA TTCTATTTAA AACTGAACAG ATTCACCTGG TTTTAAAATT TGCACGTCCC 6060

CTACATTAAC AGCATCTTTA AATTGTTGTG GATCTTGTTC GATTAATGGG AATGTATCAT 6120

AATGAATCGG TACAGAAATT TTTGGTTTAA TAAATTCATT AATAGCATAA CTTGCATCAT 6180

CAATACCCAT CGTAAAATTA TCTCCAATTG GTACAAAACA TACATCAACT GGATGACGTT 6240
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TTCAACTTCA AACACGATAC CCATTGGCAT ACCTAAATAA ACTGGgAATA CCATTTTCAT 6360 

GTGTAAAACT TGAACTATGA AATGCTTGAA CAAATTTAAC GCTTCCGAAA TCAAaGTTTG 6420

CTTTACCACC AaTATTCATA CCATGAACAT TTTCAACACC GTGATATGAA GAAAGATAGT 6480

CAGCCATTTC TGCACTTCCA ATTACTGTTG CTCCTGTTTT CTTTGCTAGT TCCACAACAT 6540

CACCAAAATG ATCAAAATGA CCGTGCGTTA AAACGATATA GTCTACCTGC ACTGTTTCAA 6600 

TATTCAAATC ACACTTAGGG TTATTTGAAA TAAACGGATC TACGATAACC TTTTTGTTGT 6660 

TCCCTTCTAA ATAAATCGTT GATTGACCAT GAAATGATAA CTTCATTTGA GCATCCTCCT 6720 

ATCAATTACT ATATAAATTT AGTACCCTTT TGCCACTTAA TTATAACAAA TTCTCAAATT 6780 

TTAAAAATTG AAAATCTAGT TAATGTATTA GCTCGATTTT GAAATCTAAT AATAATTGGC 6840 

ATAAAATGGA AGTAATATTA TGTTGAGGAG TGTTTATAAA ATGACAAAAA TATCAAAAAT 6900 

AATAGACGAA TTGAACAATC AACAAGCTGA TGCAGCATGG ATTACAACAC CGTTGAATGT 6960

ATATTATTTT ACTGGATACC GTAGCGAACC CCATGAAAGA TTATTTGCAT TATTGATTAA 7020 

GAAAGATGGT AAACAAGTAC TATTTTGTCC AAAAATGGAA GTCGAAGAAG TCAAAGCATC 7080 

ACCTTTCACA GGTGAAATCG TTGGATATTT AGACACTGAA AACCCTTTTT CACTTTATCC 7140

TCAAACAATC AATAAATTAC TAATTGAAAG CGAGCACTTA ACAGTAGCAC GCCAAAAACA 7200 

ATTAATCTCT GGTTTCAATG TCAATTCATT CGGAGATGTT GATTTAACAA TCAAACAATT 7260 

GAGAAATATT AAATCCGAAG ATGAAATTAG CAAAATACGT AAAGCTGCTG AGTTA&CAGA 7320

TAAGTGTATC GAAATAGGTG TTTCTTATTT AAAAGAAGGT GTGACTGAAT GTGAAGTAGT 7380 

CAACCATATT GAGCAAACTA TCAAACAATA TGGCGTCAAT GAAATGAGTT TTGATACGAT 7440

GGTTTTATTT GGAGATCATG CCGCATCACC TCATGGCACA CCAGGAGATC GCAGATTAAA 7500 

AAGCAATGAA TATGTACTAT TTGATTTAGG TGTAATTTAT GAGCATTATT GTAGCGATAT 7560 

GACACGTACT ATTAAATTTG GTGAACCTAG CAAAGAAGCA CAAGAAATTT ATAATATTGT 7620 

ATTAGAAGCA GAAACATCTG CAATCCAAGC AATTAAACCT GGAATACCAT TAAAAGATAT 7680 

CGATCATATC GCTAGAAATA TTATTTCAGA AAAAGGTTAT GGTGAATATT TCCCTCATCG 7740 

CTTAGGTCAT GGCCTAGGAT TACAAGAACA TGAATATCAA GATGTTTCAA GTACTAATTC 7800

TAATTTGTTA GAAGCTGGCA TGGTTATTAC AATCGAACCA GGTATTTATG TACCTGGTGT 7860 

TGCAGGTGTA AGAATTGAAG ATGACATACT TGTCACTAAT GAAGGATATG AAGTATTAAC 7920 

ACATTACGAA AAATAAGGAG TGGGATAAAA ATGAAAAGCT TGTTACAAGC GCATTCTCAT 7980
TCAGTCAAAC ACTGCCAATA TAACATTGTA GCGCCTAAGA CATAAATTTT TATCCAAGTC 8040
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TGTAATGAAT CAAATCAATA TCATTCATGT TCGATGATTT CTTCGCATTG TTTCTAGCTT 8160 

TAATTTATCA TTATTTAATT TTAATAACCA AGGAGATGAT AACGTCATTC TTTAGTACGC 8220 

TGTAATCCAT TCCCTTTTCA TCAAATTCAA ATTATAATTG TAATGCTTCT TCTACAGATT 8280

TATATTCCAT TTCAAATGCC TCTGCAACGC CTTTATTGGT TACGTGACCT TTGTAAGTAT 8340 

TTAAACCTAA TGATAATGGT TGATTTGATT TAAATGCTTC TCTATACCCT TTATTAGCTA 8400 

GCATOAGCGC ATAAGGTAGC GTAgCATTAT TTAAAGCTAA CGTCGAAGTA CGCGGTACTG 8460 

CACCTGGCAT ATTTGCAACT GCATAATGAA CCACACCATG CTTAATATAT GTAGGATCAT 8520 

CATGTGTCGT AATTTTATCA GTTGtTTCAA AAATACCGCC TTGATCAATA GCAATGTCAA 8580 

TAATAACTGA CCCATTTTTC ATTTGTTTAA TCATGTCTTC TGTTACAAGT CTTGGCGCTT 8640 

TAGCACCTGG AATTAAAACT GCACCTATTA CTAAATCACT TTGTTTAACA TACAACTCAA 8700 

TATTCAACGG ATTTGACATA ATTGTATGTA CACGTCCACC GAATAAATCA TCTAATTGTT 8760 

GTAAACGCTT TGGATTAACA TCTAAAATCG TAACATCTGC ACCTAGTCCT AGTGCAATTT 8820 

TAGCTGCATT TGTTCCTGCT TGACCACCAC CGATAATAGT TACTTTACCC TTAGGTACTC 8880

CTGGGACACC ACCTAGTAGA ATTCCCATAC CACCATTAAG TTTTTGTAGG AACTCTGCGC 8940

CAACTTGAGC TGACATTCTT CCTGCTACCT CACTCATTGG TGATAACAAT GGTAAAGATC 9000 

GGTCTGGTAA CTGCACAGTC TCATATGCAA TACTAATTAC TTTTCTATCT ATCAAAGCTT 9060 

GTGTTAATTT TTCTTCATTT GCTAAATGAa gatAaGTGAA TAATACAAGC CCTTCTTTAA 9120

AATATGGATA TTCAGATTCA AGTGGTTCTT TAACTTTAAT AACCATATCC ACATCCCAAA 9180 

CTTTTGCTTG TTCAGCAACA ATCTCAGCAC CTGCTTCTTT GTAATCTACA TCTTCAAAGA 9240 

ATGATCCTGA ACCCGcATTT GTTTCCACTA AAACAGTATG 9280

(2) INFORMATION FOR SEQ ID NO: 132: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4669 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 132: 

CTGATTAATC TCTTGTTGTC GTGTATTTAC TAATTGAATC GTTGGTGTCT GAACACGTCC 60

CAGGGATAGC TGTGCATCAT ACTTTGTTGT TAGTGCACGC GTTGCATTAA TCCCAACAAT 120

CCAATCTGCC TCACTTCTCG CTAACGCTGC ATAATACAAA TCGTTATATT GACGACCGTC 180 
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ACGGATTGGC TTTTTGTTAC CAACTTTATC CAAAATCAAT CTTGCAACTA GTTCACCTTC 3 00 

TCGTCCaGCA TCTGTTGCAA TAATAATATC TTTCACTTTA TTATCTAAAA TTAACGCTTT 360 

TACTGTTTTA AATTGTTTGC TTGTTTTACC AATAACAACA GTTTTCATAT ATTTAGGTAT 420 

AATTGGAAGG TCTTCTAATC GCCATTCCTT TAAATTTTTA TCGTATTGTT CAGGTGTCGC 480 

ATTTGTCACT AGATGACCTA ACGCCCACGT GACAATATAT TGGTTATTTT CAAAGTAACC 540 

ATTACGCTTC TGATTTATTT GTAAAGCATC AGCAATATCT CTTGCGACTG ATGGTTTTTC 600 

AGCTAATATT AAAGATTTCA TAAATTATCC TTTCTCATAC GTTCTTTTAT TTCGAACGTG 660 

CTTCATCTAT TCCACTAATC TTTGATTTAA ATTCAATGAT TGCAAATGAT GTGTTAAATG 720 

TATTGTAACA TGTTAATATC ACTATTAACT TTCATTTCAG TTGAAATACT ATATAATAAA 780 

AGTAACAAAA AGTACGGAGG TAATGACATG AGCATAGTTC AGTTATATGA TATTACACAA 840 

ATAAAATCGT TCATTGAACA TTCGAATTAT GAATCAGCAT CATACTTATA TAAACTTCCT 900 

CAACAGTAGA ATGAAATAGA TGTATTAATA ACCGATGCGA TTGAATCACC TGGTGTATTT 960 

TCGATTAAAG AAAACGATTC AATCAAAGCA ATCATATTGT CTTTTGCATA CGATAAAAAT 1020 

AAATTCAAAG TCATAGGCCC TTTCGTGGCT GACAATTATG TATTATCTGT CGATACGTTT 1080

GAAACGCTAT TTAAAGCAAT GACTTCGAAC CAACCTGACG ATGCCGTCTT TAACTTTTCT 1140 

TTTGAAGAAG GCATTCAACA ATACAAACCA TTAATGAAAG TTATTCAAGC AAGTTATAAC 1200 

TTCACTGACT ATTACATAGA AGCCCGTACA AGATTAGAAG AAGATATGCA CCAACCAAAT 1260

ATCATTCCTT ATCACAAAGG GTTTTATCGT GCTTTCAGCA AATTACACAC AACTACATTT 1320 

AAATATCAGG CACAGTCACC ACAAGATATC ATTGATAGTT TAGACGACCA TCATCATTTG 1380 

TTTTTATTTG TTAGCGAAGG TTTACTTAAA GGTTATTTAT ACCTTGAAAT TGATTCACAA 1440

CAGTCAATCG CCGAGATTAA ATACTTCAGT TCTCATGTAG ATTACCGTTT GAAAGGTATC 1500 

GCTTTCGAGT TGCTTGCGTA TGCATTGCAA TATGCTTTTG ATAATTTTGA TATTAGAAAA 1560 

GTTTATTTTA AAATTCGTAA TAAAAATAAT AAACTCATCG AACGATTTAA TGGTCTAGGT 1620

TTCCATATCA ACTATGAGTA CATTAAATTC AAATTCGAAT CACGTAACGT AAAAGATCAA 1680

ACAATCCCTG AATAAAACAC CAAGCAAATA CCCTACAGTA CATCATTAGC ATGTATTGTG 1740 

GGTTTTTCTA CTTTTTGTAA ATATTGAAAA TTATAAGTAG TTGTTTTTTA CTATTAGGGC 1800 

AGAATGCTTT ACAATAACAT GCAAGTGTCA ATTAAGGGGA GCACTTGCAT AAATAGTATA 1860 

GGAGAGTGAG TAGTCTTGCA ATTTCTTGAT TTCTTAATCG CACTTTTACC TGCTTTATTC 1920

TGGGGAAGTG TCGTTCTTAT TAATGTGTTC GTCGGCGGTG GACCTTACAA CCAAATTCGT 1980 
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TTCAATAATC 


CTACTGTAAT TATTGTCGGT CTTATTTCTG GTGCATTATG GGCGTTTGGA 2100

CAAGCGAATC AGCTTAAATC TATTAGTTTA ATCGGTGTAT CAAATACTAT GCCAGTTTCT 2160

ACAGGTATGC AATTAGTTGG TACAACATTA TTCAGCGTTA TCTTTTTAGG TGAATGGTCT 2220

TCAATGACTC AAATTATCTT TGGTTTAATC GCCATGATAT TATTAGTTAC TGGTGTAGCA 2280

CTTACTTCAC TTAAAGCTAA AAATGAACGT CAATCAGATA ATCCTGAATT TAAAAAAGCA 2340

ATGGGTATTT TAATTGTATC TACAGTTGGA TATGTAGGTT TCGTTGTACT TGGTGACATC 2400

TTTGGTGTTG GTGGAACTGA TGCATTGTTC TTCCAATCTG TCGGTATGGC AATTGGTGGC 2460

TTTATCCTAT CCATGAATCA TAAAACATCA CTTAAATCAA CAGCACTTAA TCTATTGcCA 2520

GGTGTGATTT GGGGAATTGG TAACTTGTTC ATGTTCTATT CTCAACCAAA AGTTGGTGTA 2580

GCTACAAGTT TCTCATTATC ACAGTTACTT GTTATCGTTT CAACCTTAGG CGGTATTTTC 2640

ATTTTAGGAG AAAGAAAAGA TCGTCGTCAG ATGACGGGTA TTTGGGCAGG TATTATTATT 2700

ATCGTGATAG CTGCTATAAT TCTAGGTAAT TTGAAATAGA AAGTTAAATA CTCATGTAAC 2760

GTAAAAATGT AATCACTTCT GAAAATAACC ATTCACTTAT AGAATGATTA AAATTAATTT 2820

TCGGGAATTT TACGTTGAAT GTTCCTCTAT ATGTCCTAGG AAATACGTGG CTCTAAAAAC 2880

AAAACGCAAT AACACATCAT GACATTAATC ATGCGTTTTA AGACTTTAAA ATTAGCGATA 2940

CTTTTAAAAT CTTGATGATA TTCATATATC AAGTATGCGC CATACATATG AAGTGGATAG 3000

CTGCATAACG CACTGCATTA TCAACTTGAA TGTATGAGTT GAACAACTAT GTCATAAATA 3060

AAAGCCCCCT TTTCACAATA TACATTTACA TATTGTGGTA AAGGGGGCTC TCATTTTCTA 3120

CGAATACTAA AATGGATTTT ATTTTCAAAT GTGTAAACTA GACAAACACT GCCTGATACA 3180

CGTACAAAAT AATGATACTA ATAATGATTG TCAAATTGGT CGTCATACCT ATAAATGGCA 3240

GTGTTCGATA TTTAAACTGA ATACCATAAG AAATAATTGC AACACcTACC GGGAACATCC 3300

AAGTGACCAA CAATGTCGTC TTAATCATAT CATCTGATAC TGGTAACAAC ACATATACTA 3360

ACAATCCCGC AACTAATGCT AATCCATAAT GCAAACATAA ATATTTAATA GTAGCAGGTA 3420


TATACTTTCT 


TTCCAGAGTA 


AAATTCAACA 


TGACACCTAG 


CAAAATCATT 


GATAACGGCA 


3480 


TATTTGCATG 


GGAAAGTATG 


CTAAAGAAAT 


CGATTGCCAC 


ATGTGGTAAA 


TGGATGTGAC 


3540 


TTATATTCAA 


TATAAACATT 


ACAATGTATG 


TAACGAGTGG 


CACTGATTGT 


AATAATTTCT 


3600 


TACCTAAATA 


TTTAAAATCG 


AATTGATCAC 


TACCTTCACT 


AAAGTAGCTA 


CCTACAAAGT 


3660 


AAGTAATTCC 


AAACATCACA 


AAGGCACCAC 


CTATATCAGC 


CATAACAAAA 


TAAATAAGTC 


3720 


CCGTTTTAGG 


CCATATCACT 


TCAATTAGTG 


GATATGCAAA 


CAATCCAATA 


TTCATAGCAC 


3780 
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CAATCATTTT CGCCACAATA CCATATATAA TCATTAAAAT TGGTAAAATG GAGAATGACA 3900 

ATTTTAATTC TGCACTGTTT AAATTCACAA TAACTAAAGA TGGGAGTGTG ACATTAAGAA 3960 

CTAATGTAGC AATGACTTGA CTATCTGTTG CTTTTATAAA ATTAATGCGC TTCAAAAAGT 4020 

AACCAAGCGC AATTAATAAA ATAATCATAG TAAATTGTTC TGTCACTGTT ATCCCTTCTT 4080

TCAATAATCT TCATAATTTA TAACTTTAAC ATACTCCACA GATATTTTAG AAGTCTACTG 4140

TTTCATGCTA TAATCTACAT TAAATGCACT TAATTATATT TCAAAGGAGT GTTATAGTAT 4200 

GTCTTTAGAA AACCAACTAG CCGAACTTAA ATATGATTAT GTTOGTCTTC AAGGTGACAT 4260 

AGAAAAACGG GAATCTTTGA ATTTAGATAC TTCCGCACTT GTTCGTCAAC TTAAAGATAT 4320 

TGAAAATGAA ATTAGAAACG TTCGTGCTCA AATGCAAGAT TAATAATCTA TCATTCAAGC 4380

AATAAATGCT TTTTGTTACA TAAATTTGAC TAGCATTGCT CTGAATACGT TATATTGATG 4440 

AATTGCTTCA TTTTTCGCTC AATTACATCT AGAATCACAA GATGTTGTCG TGTTATGATT 4500 

TAGTGTTTCA TTAACAACAT ACACGCATAT CTATCCCAAC ACTGCTATTT ATGTTTTCTA 4560 

CGCTGnTGTA CTACATGAAC CCTTTGAAAC GGAGAGGAAG TTATCATATG CAATTTTAnC 4620 

TGATTTTACT AGCAATACTT TAACnAATTG nTAGTTTAAT AGAATTTTA 4669
(2) INFORMATION FOR SEQ ID NO: 133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 133: 

TTTGCACCCA TCTGaTACAA TGCACCATGC GGTTTAACAT GATTAATTTT AACTTGATGA 60 

ATGCGACAAA ACCCTTGTAA TGCACCTAAT TGATAAATCA TCAAATTATA AATCTCGTCG 120 

TTAGAGATAT CTATATTTCG TCTGCCAAAG CCTTTCAAAT CAGGTAAACC AGGATGTGCA 180

CCTACTGCAA CATTATGTGC TTTGGCAAGT TTTACCGTTT CATTCATTAC ATTTTCATCA 240

CCAGCGTGAA AACCACAAGC AACATTCGCA CTTGTAATTA ACGGAATAAT TTGATGATCA 300 

CCACCAAAGG AATAATTTCC AAATGCTTCG CCTAAATCAC AATTCAAATC AACTCGCATT 360 

ATAATTCCAC CCCTTTAACA ATTTGATGTT TTTCTAAAAA TTTAATATCA ACATCTTTTG 420 

CATCTCCATC ACGATATAGT GGATAATTTA AAACTGCATA TAAAAAATCG GCAGTTGTAG 480 

AAAATCCATC TATCACCATT TCATCTAAGG TGACTTTCAA CTTATCAATT GCTGAAGCTC 540 
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AACCGTGATA 
TAGTAAAGAA TCGACTCGCA CATTAAAGCC TTGAGGTAAA TGTAACGCTG 660

TCACTTTACC TGGTGTTGGT TGAAATTTCT TTTCaGGATT TTCGGCATTT ATTCTCGCTT 720

CTATCACATG ACCATTAAAT TGAATATCGC TTTGTGAAAA AGGTAAATGA TTATGTTCCA 780

ATAAATACAG TTGTGCTGCA ACCAAATCAC GTTCTGCTCG CATCTCTGTA ACAGTATGTT 840

CAACTTGTAT TCGAGCATTC ATTTCAATAA AGTAATGTGC GGTATCAGTT ACTAAAAATT 900

CAATCGTACC TGCACTTCTA TAATTTGCTG CACGTGCAAC TTTAACAGCA TCGTTACATA 960

TTTGTTGTCG TCTTTCTTCA GTTAATGCTG CACAAGGAGA TTCTTCGATT AATTTTTGAT 1020

TTTTACGTTG TACAGAACAA TCACGTTCCC CTAAATGTAC ATAATTATCC TGCCCATCTC 1080

CCaTAACTTG AACTTCAACA TGTTTTGcAA CAGGTATAAA AGCCTCAACA TAAACACGAT 1140

CATCATCAAA GTATTTTTTT CCTTCACTTT TAGCTTCTTT AAATGCCTTT TCTAAATCTT 1200

CAGCTTTCTT TACAATACGT ATACCTTTAC CACCACCGCC ACTGGCAGCT TTGATAACAA 1260

CTGGATAACC GATGTCTTTG GCAAGATTCT CAATTTCAGA CACATGATTC ACAGCACCAT 1320

TTGATCCTGG AATCACAGGA ACACCTGCAT GATGAACTGT TTGTCTTGCT GTTATTTTAT 1380

CCCCCATCAT TTCCATCGTT TTTTTAGTAG GCCCTATAAA CGCTATGCCT TGTTCCTCAA 1440

CGGTTTGAGC AAATTTTGTT GATTCTGATA AAAAGCCATA TCCTGGGTGA ATTGCATTAG 1500

CACCAGTGAT TTGTGCAGCA GATATGATGC GGTCAATATT TAAATAACTA TCTAAAgCAT 1560

TArcwTCCCC AATACATATA GCTTGATCTG CTAAATGTAC ATGCAAGCTT TGCTCGTCCC 1620

CTTTTGCATA AACTGCTACA GTTTCAATCC CATATTCTCT GCAAGCTCTT ATAATCCTTA 1680

CAGCAATTTC ACCTCTGTTC GCAATTAAAC AACGAAGCAT TTACTTACCC CCTTTACTTA 1740

ATACGTACCA AAACTTGGTC GTATTCAACA TTTGTGCCAT GATCAGCTAC TATTTCAGTA 1800

ATT&TCCAG CAACATCTGT TGTTACCTCG TTTAATACTT TCATCGCTTC AACATATCCT 1860

ATAATATCTC CCTTGTTAAC TTTGTCACCG ACATTCACAA TTGGTTCAGT TAATTCTTTA 1920

CTATCTTGTA AAAAGAATGT ACCTATCATT GGTGATTTAA TGTCATGATA ATCATTTGTC 1980

GAAACATCGG AGTTATCATT CGCTTTTGAA GCTGTCAAAT CATTATTGTT CATACTTTGA 2040

TTTGATTGAT TACTGTGTGC AGCCAAATGA TTCGAGTCAG TGAAGTCAAT TTCTATTTCA 2100

TCTTCAAAAT TTTTATATTT AAATTTCTTA ACATCATTTT CCTTCACTAA TTTGATTATT 2160

TGTTGGATTT nTTCAATATT CATTTTACAA ATCCCCTTTT AAAATTGTTG CTAATTTTTT 2220

CGAAGTATGT CGCAAGCTAG ATGTATCAAA AATTGGAGTC TTTTGATGAC TCTTAAGAAT 2280

TTCATTAAAC AGAGACATTT GTTCCCGATT CTTATCTACA GCTTCTTGGA ATGATATCCA 2340

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TACAGTTGCA ATTTTGGTAT AACCACCTAT CGTTTGTTTA 


TCATTAAGCA GAATAATAGG 2460

TTGACCATCA TTTGGTACCT GAACACTACC AAGAGCAACC GGTTCAGAAA TGATATCTGC 2520

TTGATTAAtT GGTGCAACGC TGTCACCTTC CAAACGATAG CCCATACGGT CTGATTGTTC 2580

AGTAATTAAA TATGGATGAT TTACAATTTT CGCTCTAGCC TCTTCAGAAA ATGCCTCGAA 2640

TTGAGGTCCT TGAAGAATGT GTATAATATT ATTTTCTGGC AATAAATCGT CCTGTAAATG 2700

AATCGTCTTT CCAATGTTTT CTTTAAAGTC ATTATTTATT TTCACTGTTA TTACATCATC 2760

AGCTAATAAC TTTCTACCTT TGAAT 2785

(2) INFORMATION FOR SEQ ID NO: 134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1010 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 134:

AATGGAAACG GTTGAAACAG CAATTATTAC TATTTCTATG GGTGAAGGTA TTTCAGAGAT 60

ATTTAAATCA ATGGGTGCCA CACATATCAT TAGTGGTGGA CAAACGATGA ATCCTTCTAC 120

AGAAGATATC GTTAAAGTCA TTGAACAATC AAAATGTAAA CGTGCAATTA TTTTACCGAA 180

TAATAAAAAT ATCTTAATGG CAAGTGAACA AGCAGCGAGT ATTGTTGATG CAGAAGCTGT 240

TGTTATTCCA ACGAAATCTA TTCCTCAAGG TATAAGCGCA CTATTCCAAT ATGATGTGGA 300

CGCAACACTT GAAGaAAATA AAGCGCAAAT GGCTGATTCA GTAAATAACG TTAAATCTGG 360

TTCATTAACG TACGCTGTTC GTGATACGAA AATTGATGGC GTTGAGATTA AAAAAGACGC 420

GTTTATGGGC TTGATTGAAG ATAAGATTGT AAGCAGCCAA AGTGATCAAT TAACAACGGT 480

TACTGAGTTG TTAAATGAGA TGTTAGCAGA AGATAGTGAA ATATTGACTG TGATTATTGG 540

TCAAGATGCA GAGCAAGCAG TTACAGATAA CATGATAAAC TGGATCGAAG AGCAATATCC 600

AGATGTAGAA GTGGAAGTTC ATGAAGGTGG ACAACCAATT TATCAATATT TCTTTTCAGT 660

AGAATAAAAA TTTAAAATAA AAAACTACCA ATGATAAATC ATCAGTTGGT AGTTTTTTAT 720

TTTGCTATTT TAGTGATATT GCGGGTTAAA AGTATCGTTC TCGAGTTGCT AACAATGTCA 780

TGTTCAACTT AGTCATGATA AAATAAATAA CATACTAAAT GATACGTAAA ATCAAATAAA 840

ACATAGGTGA TTTATTTTGG CTAAAGTAAA CTTAATAGAA AGTCCATATT CTCTTTTACA 900

ATTAAAAGGT ATAGGTCCTA AGAAAATAGA AGTATTGCAA CAACTAAATA TTCATACAGT 960

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(2) INFORMATION FOR SEQ ID NO: 135: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1540 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 135:

TGTAGTTGAA CATGAACAAC AAAAGAAAGA AAAGACAAAA AAGCAATACA AGCCATTTTG 60

GATTGTCATG AGTTTTATAA TACTTATAGT TGTACTATTA CTCCCGGCAC CTTCAAGTCT 120

GCCGATAATG GCTAAGGCAG TACTAGCTAT TTwAGCTTTT GCAGTTATTA TGTGGGTAAC 180

GGAAGCTGTA TCATATCCGG TGTCAGCAAC TTTAATTATT GGCTTAATGA TATTACTTTT 240

AGGATTTAGC CCTGTTCAAA ATTTAGGGGA GAAGCTAGGT AATCCGAAAA GTGGCAGTGC 300

TATTTTAGCT GGAAGTGACC TTCTAGGAAC TAATCATGCA TTATCATTAG CGTTTAGTGG 360

ATTTGCAACT TCAGCTGTAG CTCTCGTTGC AGCTGCATTA TTTTTGGCTG CTGCTATGCA 420

AGAAACGAAT TTGCATAAAA GACTAGCTCT TTTAGTGTTA TCAATTGTTG GTAATAAAAC 480

TAGAAATATA GTTATTGGAG CAATTATCGT TTCAATTGTA CTTGCATTTT TCGTTCCTTC 540

TGCAACAGCT AGAGCAGGGG CAGTTGTACC AATCTTGCTG GGTATGATTG CGGCATTTAA 600

AGTTTCCAAA GATAGCAAGT TAGCGTCTTT ATTAATAATT ACTTCAGTAC AAGCTGTGTC 660

AATTTGGAAT ATTGGTATCA AAACGGCGGC AGCACAAAAT ATCGTAGCGA TTAATTTTAT 720

AAACCATCAA TTAGGATTTG ATGTTTCATG GGGCGAGTGG TTCTTATATG CAGCGCCTTG 780

GTCCATAGTT ATGTCCGTAG CTTTATATTT CATCATGATT AAAGTGATGC CTCCAGAAAT 840

TAAT&CAATA GAAGGTGGTA AAGATTTAAT AAAAGAAGAA TTGCATAAAC TTGGCCCCGT 900

TAGCCCACGT GAATGGCGTT TAATTGTTAT ATCGATGTTA TTATTACTGT TTTGGTCAAC 960

TGAAAAAGTA TTACATCCGA TTGACTCTGC ATCCATTACT ATTATTGCTT TAGGTGTTAT 1020

GTTAATGCCG AAAATTGGTG TCATGACATG GAAACATGTT GAAAATAAAA TACCATGGGG 1080

AACAATTATC GTGTTTGGTG TAGGTATTTC ACTAGGTAAC GTTCTTTTGA AAACAGGTGC 1140

AGCTCAATGG TTAAGTGATC AAACTTTTGG TGTTTTAGGT TTAAAACATT TACCTATTAT 1200

CGCGACAATT GCACTTATCA CGCTTTTTAA TATATTGATT CATTTGGGCT TTGCGAGTGC 1260

AACAAGTTTA TCATCAGCGT TAATACCTGT TTTTATTTCG CTAACCTCTA CGTTACACTT 1320

AGGAGACCAG TCTATAGGAT TTGTTTTAAT TCAACAATTT GTTATTAGTT TTGGTTTCTT 1380
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10 



AGATTTCTTG AAGGCAGGTA TACCATTGAC AATTGTAGGG aATAtCtAgT GaTAGTTTTT 1500

AGCATGACTT ATTGGAAATG GGTAAGGTTG OlTTAATTAA 1540

(2) INFORMATION FOR SEQ ID NO: 136:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11823 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 136: 

ACTTCTCACA ATAAGAAATA TGAAATTGTT ATGTGTTAGT TGAGATTCAG TGATGAATTA 60 

CTTTTATCAT TTAAAATGTT GTTATCATTG TCATGCGTTA CCAAATCGCT TACGTATACA 120 

CGATTCCCAA TCTTAACATA GACGATTTGT ATATCAGAAT TTTCTGATTA CTAACAGTTT 180 

ACCTAAGTTT AAATATCTGT TCAATGATTT TCAGTTATTT TTAAAAGAAA AATCGTAATG 240 

CTGCCATGAT AACAATCCCA CTAATAATTG TAATAGTTAA AtACGCGTGA TTATAGATAA 300 

AATAACCGTC GGAATGAGCG CGATAATGTA AGGGATGTTT AATGTATACC CCTCACCATG 360 

AGGCGTCTGT TGAATAATGC TGTCAATGAC AAGTGCCGTA AATAGTGTGA TTGGGATAAA 420 

TGATAGCCAT CGAACCACGA CATCAGGCAA TTGCACTTTT GAAATCATGA TAAAAGGTAT 480

AATTCGAATT AATAGCGTTA CGATACCACA CAATAAAATA AGTATTAACA TGTTCATATG 540 

AGTTATCATT GTTCCATCAT CACTCCTAAC GCTGCTGAAA TTGTGGCTGC AATTAATATT 600 

GCTAGATATG AAGGCATAAA CATACTTAGC GATAACATCA TTACTATGAC GGCAATAATG 660 

AGTACTATGT AAATTCTTAA TCGCGATTTA GTAATTGATT CAAATTGCGC AATGGCCAAA 720 

AAGMAAACA TAGCCGTGAT AGCAAAATCT AACCCTAGCG TTTGCGGATT TGAGATATAT 780 

TCGCCAAATA AAGCCCCAGC TACACATGAA ATTGCCCAAA ATAAATATGC TGTGATGTTA 840 

AGACCATGCA TCCAACGATC ATTGATAGCT TCTCCTTTTA AATAAGGTGT AATGGCGACG 900 

CCAAACGTTT CGTCAGTTAC TAATGAACCT AATCCAACAC GGTTCCAAAA CCCATATGTC 960 

TTGAAGTTTG GTGCAAGCGA CATACTTAAA AGGAACATTC TTGAATTTAC GATAAATACA 1020 

GTTAGTACAA TCGCTGATAT AGGTGTACCT GCTATAAACA ACGCGCACAT AATAAATTGC 1080

GCAgcaCCGG CATATATAAC AAGACATAAC AAGACAATTT CTAAAATACT AAAGTTTTGA 1140

GACGAAGCCA CAATACCAAA TGAAATACCA ACACCGGCAT AACCCAATAA TGTTGGGATA 1200

CACTCTTGCA CGCCTTGTCT AAAACTTAAA TGTGTTGTCA TCTCAATTAC CTCCTTTGCC 1260 
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TAAGCAATAA CATTAGACAT CAGTTTGTCT GAGGTTAGAC ATTCCGGAGT CTTTAGTCAG 1380 

CTTCATATTA ACTTTTTATT TTTGAGAATT TTCAATTTTT TATTTAAGAC TACCTCCATA 1440 

TTTTCTATGG aTTTGTAGTT GTTTTTAAGT ATCAATTTTA TAAATTTTTA TATCTGATGA 1500

TGAGTCTGGG aTATTGaTTC ATGTACCACT CCCTTaTaAT CATCCCCTCC CCCTaCCCTA 1560

CTCCATCGAT ATAACTCATA CTACATATCA ACGAAATCAG TATTTTATCG CTTCCTTTCC 1620

TATATTAGTG ATGCTCAAAC TTGTTACGTT TTAGATTGTT TTAGTTCATC ATAATTATCC 1680

CGTATTGTTG CTATAATGAA ATGCGTTCAC CCCATTAAAC CACAAACTTA ATTTATTGTT 1740

GTTATGTGCA TTGGCTCACT ATTATATTTT TACAGCACAA AAAAAGTGGC GACAGTTCGT 1800

CACCACTTTT TAAAATATTA TTTAAAGTAT CTTGCCCTTG CTTTAAGTAT ACGTAGATAT 1860

ATACTTTTTA AAGCTTGTAG CTAAAGCCTT TATTTAACTG GTTTTGAAAT TTGTGTTTTA 1920

CCACCCATAA ATGGTACTAA TGCTTCTGGA ATTGTTACTG TTCCATCTTC ATTTTGGTAA 1980

TTTTCAACAA TAGCAGCAAA TGTACGTCCA ACTGCTAAAC CACTACCATT TAATGTATGT 2040

GCTAATTCTG GTTTAGCTGC TTTGTCACGC TTGAAGCGGA TGTTAGCACG ACGCGCTTGG 2100

AAATCCGTAC AGTTTGAGCA TGAACTAATT TCTTTATAAT CATTGTAGCT TGGTAACCAA 2160

ACTTCTAAAT CATATGTTTT GCTTGCACTA AATCCAATAT CACCTGTACA TAAAATAACA 2220

CGACGGTATG GTAAACCTAA CTCTTGTAGA ATTGCTTCTG CGTTTGTTGT CATTTCTTCT 2280

AAAGCATTCC ATGAATCTTC AGGTTGTTCA AAACGTACCA TTTCCACTTT ATCGAATTGA 2340

TGTAAACGAA TTAATCCTCT TGTATCTCTA CCTGCTGATC CTGCTTCACT ACGGAAACAT 2400

GCAGATTGAC CAGTGAATTT TTCAGGAAGT ACACCTGGTT GAATAATTTC ATTACGGTAG 2460

AAATTCGTTA ATGGTACTTC AGCAGTTGGA ATTGTATATA ATCCTTCTTT TTCTACTTTA 2520

AAT^ATCTT CTTCAAATTT AGGTAATTGA CCTGTACCAT ACATTGTATC TGCGTTCACA 2580

AGCTGTGGTA CCATCATTTC TGTATAACCA TGTTGTGTTG TATGTTTTGT AATCATATAG 2640

TTCATTAAAG CACGCTCTAA TTGCGCACCT TCATTTGTTA AATATACAAA ACGCGCACCT 2700

GAAACTTTTG CTGCACGATC AAAATCAGCC ATTTTCAATT CTTCTACAAT ATCCCAATGT 2760

GCTTTGGGTT CAAATGAAAA CTCaCGTGGT GTACCCCACT TTTTAACTTC AACGTTATCT 2820

TCATCAGATT CACCTTGAGG TACATCATCA CTTATTAAAT TTGGAATACG ACAAAGGATA 2880

CCTGTCATTT TATTATCAAT TTCATTTAAT TGACTATCTT TTTCTTTAAT ATCGTCACCT 2940

AATGTGCGCA TTTCAGCAAT CACATCATCA GCATTTTCTT TATTACGTTT TTTTAATGCG 3000

ATTTCTTCGC TTACTTTATT ACGACGTGCT TTCATTTCTT CTGTTGCACT AATTAATTTA 3060


TCAATTTTGC TCTTAACTGT GTCAGGCTCA TTTCTGAATA ATCTAATGTC TAACATTAAC 3180

CTTCATCCTT TCCCAAATAA TTATCATTTA TTATGGAATG ACGTACGTCT TTATTTTTTA 3240

GAAAATAAAA AAAGACCACA TCCCTACAAG GGACGTGGTC TACGCGTTGC CACCCTATTT 3300

AACAATTTAA GTTATAAAGA TACACTAAAC CTAAATTGCA CTTCACTAAA ATAACGGTTA 3360

TCACCGATTG TTCTTTTAAA TTAAGTAGGT AGATTCATAT ATATGTTGAT TCTTGTTCAC 3420

ACTAACCACA AGCTCTCTGA TATCGAACAC TATATATTAC TTGTCCTACG AACAATGTCT 3480

TATTAAGTTA TTTTTAATAT AGCAAACTAT ATTTGCTTTT TCAAGTAACG ATTTCAAACA 3540

TCACTCATGT CGATTTAGTG ACATGCAGTC GTTTGATAAA TTGATTGCTT TAAATACTGT 3600

GCAACCGCTT CAATATCTTT ATGAAATTGA CGATCATGTG TAATGGATGG CACGATACTT 3660

CGAAACTCAT CATACTTGCG ACGTGTTTTT GGTGATAATC CTTCAACACC TTTTAACTCT 3720

GCTGCTTGTA ATGCAATAAC ACATTCGATT GCCAGCACAC GTCTTGCATT TTCAATAATT 3780

TGATAACCAT GTCTAGCAGC TGTAGTTCCC ATAGATACGT GATCTTCTTG GTTCGCAGAT 3840

GAAGTGATAG AATCAACACT CGCTGGATGC GCTAAAGTTT TATTTTCAGA AACGAGACTT 3900

GCAGCAGCAT ATTGCATAAT CATCGCGCCA CTTTGCAATC CTGGCTCTGG ACTAAGAAAT 3960

GCTGGTAAAT CACCATTTAA TTGAGGATTT ACTAGTCGCT CTAGACGACG TTCCGATACG 4020

TTTGCTAATT CACTTACACC TAATTTAAGA TGATCTAATG CAAAAGCAAT AGGTTGTCCA 4080

TGGAAGTTAC CACCTGAAAT AACAAACGTT TCATTTGCTT CCTCAAATAT AAGTGGATTA 4140

TCATTAGCCG CATTCATTTC AAATTCTAAT TGCTGTTTAA CATAATTGAA TACTTGAAAA 4200

CTCGCGCCAT GGATTTGTGG TATACAACGC AACGTATATG CATCTTGTAC ACGTATTTCT 4260

GATTGTCGCG TCGTTAATGT TGATCCTTCT AACCAATCAC GCATACGCGC TGCCACATTA 4320

ATCTCTTCTT GAAAATTACG AACTGCGTGC ACATCATGTC GATATGCATG TATAATGCCA 4380

TTAAGAGACT GATGCGTTAA TGCAGCAATC CATTCAGATT GGTAACCTAA ATCTTCTGCT 4440

TCTATATAAC TAATGACACC TTGAGCTGTC ATAGCTTGCG TACCATTAAT CAATGCTAAA 4500

CCTTCTTTAG CCTGAAGGTT CAAAGGTTGT CTATTTAATT CTCTTAATAC ATCGTCACTA 4560

TCCTTTTCTT CCCCTCTGTA CAATACTTTC CCTTCACCAA 4620

GATAATGGCG CTAAATCTCC TGATGCACCG AGAGAGCCTT GCTGTGGGAT TATCGGTATA 4680

ATACGTTCAT TTATAAAAAA TTGTAATTGT CTCACTAATT CTAAAGTGGC ACCTGAATGA 4740

CCTTTTAATA ATGTATTCAA TCGTAAAATC ATCATGACTA ATGCTACTTC TTTTGAAAAT 4800

GGCTCACCTA GTCCACAGGC ATGTGAGCGT ATCAGATTCA CTTGTAATTC ATTATATTGC 4860

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TCCTCATTTT CAATAATACG TTCAACTACC GCTCTACTTT TTTTGACACG TTCTAACGCA 4 960 

TCATCAATAA TTTCAATCTT TGATTGTTGT TGTAAAAATG ATTTAATATC CTCAATTGTT 5040

AGTGTTTCAC CATCTAAATA TAAAGTCATA TATGTTACCC CCTTGTTTAT ATTAAGTAAC 5100 

CCATCCTTCT TGAAGTATAC GTTTTCATTT TTATTGAAAC AATGGTTTTA CGTACATTTA 5160 

TAACCTATTA TCAGAGCACT ATTGTAGTGC GTTAAAGGAT ATTAAGATTG TTGTAAGCAT 5220 

ATTTAATAAT TTATCTATTG ACGAATTGCA TATACAGGTA TAGTATTTTC TATTGTATTT 5280 

AACGACAAAT AATAATGAAT TCAGAAATTT ATAATACATT TTGTTAAAAG TTACTATATA 5340 

TTTTTAAAAT TGAATAAATT CGGAAAAGGC TTTTACATGG GAGGTTATAT CACTATGGAA 5400

ACGTTAAATT CTATTAACAT TCCTAAGCGT AAAGAAGATT CACATAAAGG TGATTATGGC 5460 

AAAATTTTAT TAATTGGTGG ATCTGCTAAC TTAGGTGGTG CCATTATGTT AGCGGCTCGT 5520 

GCATGTGTAT TTAGCGGTAG TGGTTTAATC ACTGTAGCTA CACATCCAAC AAATCATTCA 5580 

GCATTACATT CTCGTTGCCC AGAAGCGATG GTTATTGATA TTAATGATAC GAAAATGTTG 5640

ACGAAAATGA TTGAAATGAC TGACAGTATA CTAATTGGTC CAGGTCTTGG CGTTGATTTC 5700 

AAAGGAAATA ATGCCATTAC ATTCCTACTA CAAAATATAC AACCGCATCA AAATTTAATC 5760 

GTAGACGGCG ATGCGATTAC AATCTTTAGT AAACTGAAAC CGCAATTACC TACATGTCGT 5820

GTGATCTTTA CACCACACCT CAAAGAATGG GAACGATTAA GTGGTATTCC TATTGAGGAA 5880

CAGACATATG AGCGTAATCG TGAAGCAGTT GATCGTTTAG GTGCAACTGT TGTACTTAAA 5940

AAACATGGTA CTGAAATTTT CTTTAAAGAT GAAGACTTTA AATTGACAAT CGGTAGCCCA 6000 

GCAATGGCGA CTGGTGGTAT GGGCGATACA CTTGCTGGTA TGATTACAAG CTTTGTCGGT 6060 

CAATTTGATA ACTTAAAAGA AGCGGTTATG AGTGCCACAT ATACACATAG TTTTATTGGC 6120 

GAAA&CCTTG CAAAAGATAT GTATGTGGTG CCACCATCAA GACTTATCAA TGAAATACCT 6180 

TACGCAATGA AACAATTAGA AAGTTAGTCA TTACTAATCA TTGAATATAG TAAAGCATTA 6240

CTTTCTAGCA TAAAAATAAG ACTCCCCTAC ATATAGGGAA GTCTTATTTT TTATTATTCT 6300 

TCATCTGATG ATTGTTGTAT ATCTTCTTCA ACACGATCCA TGAAATCTTG TCTTACTTCA 6360 

ATACGTCCAT CTTCATCATT TTCTTCTGAA TCAATCACTT CAGTATGAAT TGCATTTCCT 6420 

GGTGTTTCAT CATTTaCAAC CGCTTCACGT TGTTGTTCAG TACCATCTTC AGATACAGTT 6480

GAAGTAGATT GCTCATCTTC ATTCGTTTCA TCTTCTGCAT CTTCTTTTAC TTTAGCAACC 6540

GTTGAAACAA ATTGATCATC ACCTAAGCGA ATTAAGCGAA CACCTTGTGC TGCACGACCA 6600 

TTTTGAGAAA TATCTGCAAC ATCTAGTCGA ATAATGACAC CTGCATTAGT AACAATCATT 6660 
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GTAGCTGTTT TAATACCTTT ACCACCACGA TTTGATAAGC GATAGTCATT AACTGGCGTA 6780 

CGTTTACCAT AACCATTTTC AGTAACTACT AATACTTCAT CAACACTGTT TGCATGAGCT 6840

ACATCAAGCC CTACAACTTC GTCACCTTCA CGAAGTGTAA TACCTTTCAC ACCCGTTGCT 6900

GTACGGCCTA AAGGACGTAA TGTTGATTCA GGGAATCGAA TTAATGATGC ATGTGATGTA 6960

CCAATCAAGA TATCTTCTTG ACCACTTGTT AAGCGAACTG CAATTAACTC ATCATCTTCT 7020

CTGAACGAAA TCGCAATCTT ACCATTTCTA TTTATTCTTG AGAAGTTACT TAATGCTGAA 7080

CGTTTAACGA CACCACGTTT AGTTGCAAAC ACTAAGAAGT TGTCTTCACT TTCAAGGTCT 7140

TTAACAGCAA TCATTGTACT AATGACTTCA TCATTTTCAA GTTCAATAGC ATTCACTACA 7200

GGAATACCTT TAGACTGTCT TGATAACTCA GGCACTTCGT AACCTTTAAG TTTGTATACA 7260

CGACCTTTGT TAGTAAAGAA CAATACATGG TCATGTGTAC TTAAAGTTAC CAATTGACTG 7320

ACAAAATCTT CTTCCAATGT ATTCATACCT TGAACACCAC GACCACCACG GTTTTGAGCA 7380

CGATATGTAG ATACCGGCAA ACGTTTAATG TAGTTATTAT GGCTTAGTGT AATTACTATT 7440

TGTTCTTCTG GAATTAAGTC TTCGTCCTCT AAGTCTTCAA ATCCACCTAA TTGAATTTCT 7500

GTACGACGAT CATCACCGAA ACGATCTCTA ATTTCAGTCA ATTCATCTCT AACTAACTGT 7560

AATAACACTT CTTCATCAGC TAAGATTGCT TCTAATTCAC TAATATAATT TAATAACTCA 7620

TTATATTCAG CTTCAATTTT GTCTCTCTCT AAACCTGTTA GACGTCTTAA ACGCATGTCT 7680

AAAATAGCTT GAGCTTGTTT TTCAGAAAGT TTGAAGCGTT GTTGCAAGCT TTCCATTGCA 7740

ACTTTATCTG TATCTGACTC ACGAATCGTT GAAATAATTT CATCGATATG GTCAAGTGCG 7800

ATACGTAATC CTTCTAAAAT GTGGGCACGA TCTTTAGCTT TACGTAAgTT GTATTGCGTA 7860

CGTCTTCTAA CAACTGTCTT TTGATGCTCT AAATAATGTA CCAACGCTTC TTTTAAATTA 7920

ATAAGCTTCG GTCTACCATT TACAAGTGCA ATCATATTCA CACCAAATGA TGTTTGAAGA 7980

GGTGTTTGTT TGTATAAGTT ATTTAAAATG ACACTAGCAT TTGCATCCTT ACGCACATCA 8040

ATAACGACAC GCACACCAGT ACGTAAACTT GTTTC^TCAC GTAAATCAGT GATACCGTCA 8100

ATTTTCTTGT CACGAACGAG CTCTGCAATT TTTTCAATCA TACGAGCCTT ATTCACTTGG 8160

AAAGGAATTT CAGTGACAAC AATACGTTGA CGTCCGCCTC CACGTTCTTC AATAACTGCA 8220

CGAGAACGCA TTTGAATTGA ACCACGACCT GTTTCATATG CACGTCTAAT ACCACTCTTA 8280

CCTAAAATAA GTCCAGCAGT TGGGAAATCA GGACCTTCAA TATCCTCCAT TAACTCAGCA 8340

ATTGAAATAT CAGGGTTCTT ACTTAAGCTA AGTACACCAT TGATTAATTC TGTTAAGTTA 8400

TGTGGTGGAA TATTCGTTGC CATACCTACC GCGATACCTG ATGCACCATT GGCTAATAAG 8460 
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AAATCTATTG TATCTTTATT AATATCACGT AACAGTTCAA GTGTGATTTT AGTCATACGC 8580 

GCTTCAGTAT AACGCATTGC TGCTGCGCCA TCTCCATCCA TTGAACCAAA GTTACCTTGG 8640

CCATCAACAA GCGGATAACG ATAACTGAAA TCTTGAGCCA TACGTACCAT TGCTTCATAA 8700

ATAGATGAGT CACCATGAGG GTGATATTTA CCCATTACGT CACCAACGAT ACGTGCTGAT 8760

TTTTTATATG ATTTATCCGG TGTCATACCT TGTTCATTTA ATCCATATAG TATACGACGA 8820

TGTACTGGTT TTAAACCGTC ACGAACATCT GGCAATGCAC GAGCAACGAT AACACTCATC 8880

GCATAATCTA AAAATGATTC ACGCATTTCA CTGGTAATAT TTCGTTCATT TATTCTTGAT 8940

TGAGGTAATT CAGCCATCAA GAGTTCCTCC TTCAAAAGTT CAGTTCACAG CGCTTAGAAG 9000

TCTAAGTTTG CATAAACTGC ATTATCTTCT ATAAATTGTC TACGGTTTTC TACAACGTCA 9060

CCCATTAACA TTTCAAATGT TTGGTCCGCT TCAATCGCAT CTTCAAGTTT TACTTGTAAA 9120

AGAGCGCGGT GCTCAGGGTT CATTGTTGTT TCCCAtAATT GATCTGCATT CATTTCTCCA 9180

AGACCTTTGT ATCGTGCAAT AGACCATTTT GGTGTTGGAT TCAATTCAGA TTTAAGTTTA 9240

TCAAGTTCCC TATCATTGTA TACATAATAC TTTTGTTTAC CTTGTGTCAG TTTATACAAC 9300

GGTGGCTGTG CAATATACAC ATAGCCTGCT TCAATTAACG GTCTCATAAA TCGATAGAAG 9360

AATGTTAATA ACAATGTTCT AATATGCGCT CCATCCACAT CGGCATCAGT CATAATGACG 9420

ATTTTGTGAT ATCTTGCTTT CGCTAGATCA AAGTCGCCAC CGATTCCTGT ACCAAATGCT 9480

GTGATCATTT GACGAATTTG ATTGTTATTC AAAATTCTAT CTAATCGTGC TTTTTCAACA 9540

TTTAATATCT TACCTCGTAA TGGTAAAATC GCCTGCGTTC TAGAGTCACG ACCAGATTTT 9600

GTAGACCCCC CGGCAGAGTC CCCTTCGACT AAGAAAATCT CACATTCTTC AGGACTTTTA 9660

CTAGAGCAAT CGGCTAATTT ACCTGGAAGG CTTGCTACAT CTAACGCTGA TTTACGACGT 9720

GTTACTTCAC GCGCTTTTTT CGCAGCAACA CGTGCACGTG CCGCCATAAT ACCTTTTTCA 9780

ACCACTGTAC GTGCGACTTG TGGATTTTCA TATAAAAATC GTTCAAAGTG CTCTGAGAAT 9840

AATTTATCTA CAACTTGACG CACTTCAGAA TTACCTAATT TTGTCTTCGT TTGACCTTCG 9900

AATTGAGGAT CACCATGTTT GATAGATATA ATTGCTGTCA TACCTTCACG TGTATCTTCA 9960

CCAGAAAGTC TATCTTTTTC TTCTTTCATA ATCTTGCTAC TTAAACCATA ACTATTTAAG 10020

ACACGCGTTA ATGCACGTTT GAATCCGTCT TCATGCGTAC CACCTTCATA CGTATGAATG 10080

TTATTTGCGT AAGTTAAAAG ATTTGTGGCA TATCCTGAGT TATATTGAAT CGCAATTTCT 10140

ACTTCAATAT CATCTTTAGA TTGATGAATA TAAATTGGCT CATCATGAAT AGGTTCTTTA 10200

TTTTCGTTCA ATAACTCAAC GTACGATTTA ATACCGCCCT CATAGTGATA GGAGTCTTCT 10260

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GCAAGCTCTC TAATACGCTG - CTGTAATGTT TCATAGTTGT ATACAGTTGT CTCTGTGAAG 10380 

ATTTCTCCAT CTGCTTTAAA ACGAAtGaCA GTACCTGTCT TAtCAGTnGT GCCAACTTCT 10440 

TTTAAGTCAA ATTGAGGTAC ACCTTTTTTA TATGCTTGAT GATATATAGT CTCATTTCTG 10500

TGTACATATA CTTCTAAGTC TTGTGACAAT GCGTTTACAA CTGATGAACC AACACCATGT 10560

AAACCACCAG ATACTTTGTA TCCGCCACCG CCAAATTTAC CACCAGCATG TAAAACAGTT 10620

AAAATAACTT CGACAGCTGG ACGTCCCATT TTTTCTTGAA TATCAACTGG GATACCACGT 10680

CCGTTATCCG TTACTTTAAT CCAGTTATCT TTTTCAATAA CAACTTCAAT TTGATTTGCA 10740

TAACCaGCTA ATGCTTCATC GATACTATTA TCGACAATTT CCCACACTAA ATGGTGCAAA 10800

CCTCTCTCTG AAGTCGATCC TATATACATA CCTGGTCTTT TACGTACTGC TTCTAAACCT 10860

TCTAATACTT GTATTTGCCC AGCACCATAA TTATCCGTGT TGTTTACATC TGACAATGCA 10920

GTCACCATCG CTTTCTGTTA CTTTATAATT TCACCTTGAT TAATACGATA CAATTTAGCG 10980

TTATTCATGA TTTCATGATC AATACCATCT ACAGATGTCG TAGTGACAAA TGTTTGTACT 11040

TTATGCTGAA TCGTACTTAA TAAATGCGTT TGACGCGAAT CATCTAATTC ACTGAGTACA 11100

TCGTCTAATA ATAAGATGGG ATATTCCCCA ACTTCGATAT TCATTAACTC AATTTCAGCT 11160

AATTTAATGG ACAAAGCCGT TGTACGTTGC TGTCCTTGAG AACCATATGT TTGAGCATCC 11220

ATGCCATTCA CATCAAAACT TATATCATCT CGATGTGGTC CGAATAAGCT AATGCCTCGT 11280

TCTTTTTCTC TTTGCATATT ATCGCTAAGA ATAGACATAA TTTCTTCAAG TCGTGCCGCT 11340

TCATTTTGAG CATAATCAAA TTTAAGACTA GGTAAATAAT TCAGCGACAA CGCTTGTTTA 11400

TCATTTGTGA TACCAGCATG AATCGGTTTA GCTAACGACT CTAGCTCTTG AATAAAATGT 11460

GCACGTTTAT CAGTTACTTT CATTGCATAT TCAGCAAACT GCTGATTTAA TACTTCCAAC 11520

ATTGTTAAGT CCTTTTTTTG GCCTAATTGT AACTGCTTTA AGTAATTATT CTTTTGCTTT 11580

AAAATACGTT GGTATTGAGC TAAATCATTT AAGTAAACAG CAGAAATTTG GCCCAACTCC 11640

ATATCTATAA AGCGTCGTCT TATTrGrGGr GAGCCTTTTA CAATATTCAA ATCTTCTGGC 11700 

GCAAATAGAA CCACATTGAG GTGTCCAATA TATTGAGTTA GACGACTTTG CTCTAAGTGn 11760 

ATTCACTTTG GACTTGTTTA CCTTTnTTAG TTATAAACAT TGTTAATGGG CATCGTGCCG 11820 

TGT 11823 

(2) INFORMATION FOR SEQ ID NO: 137: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 692 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 137: 



ATAATTATTA ACATGGTGTG TTTAGAAGTT ATCCACGGCT GTTATTTTTG TGTATAACTT 60

AAAAATTTAA GAAAGATGGA GTAAATTTAT GTCGGAAAAA GAAATTTGGG AAAAAGTGCT 120

TGAAATTGCT CAAGAAAAAT TATCAGCTGT AAGTTACTCA ACTTTCCTAA AAGATACTGA 180

GCTTTACACG ATTAAAGATG GTGAAGCTAT CGTATTATCG AGTATTCCTT TTAATGCAAA 240

TTGGTTAAAT CAACAATATG CTGAAATTAT CCAAGCAATC TTATTTGATG TTGTAGGCTA 300

TGAAGTTAAA CCTCACTTTA TTACTACTGA AGAATTAGCA AATTATAGTA ATAATGAAAC 360

A A Af2 A A A PA 21 C A A A A PPTTP TACTGAAACA ACTGAGGATA ATCATGTGCT 420

TGGTAGAGAG CAATTCAATG CCCATAACAC ATTTGACACT TTTGTAATCG GACCCGGTAA 480

CCGCTTTCCA CATGCAGCGA GTTTAGCTGT GGCCGAAGCA CCAGCCAAAG CGTACAATCC 540

mTTATTTATC TATGGAGGTG TTGGtTTAGG aAAAACCCAT TTAATGCATG CCATTGGTCA 600

TCATGTTTTA GATAATAATC CAGATGCCAA AGTGATTTAC ACATCAAGTG AAAAATTCAC 660

AAATGAATTT ATTAAATCAA TTCGTGATAA nA 692



(2) INFORMATION FOR SEQ ID NO: 138: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7900 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 138:

ATACTGTAGC GCAAATTTCA CAATGGCATG TTATAGAAGA TTTAGTTACG AATGAATTAG 60

GTAtTAGTAT TTTACCAACA TCAATTTCAG AgCAACTAAA TGGAGATGTG AAGCTGtACG 120

CATTGAAGAT GCTCATGTAC ATTGGGAATT AGGTGTTGTT TGGAAGAAGG ATAAACAATT 180

AAGTCATGCC ACAACGAAAT GGATAGAATT TTTGAAAGAC CGTTTAGGCT AACATATTAA 240

TAAAGCACTC ATTATTTAAG GCGCATCATT ACGTGGGTCA TTGAAATAAT GAGTGTTTTT 300

TTGTGAAAAT GAAGTGAAAT TTAGAGAGCG TTTCCATAGA AAATAGTAAT ACAAACTATA 360

AAAAAAGAGT ATTTTTATAT TGTGTACGCC ATCTTTATAA TAGTTATTGT AACAATTTAG 420

ACATATTTAG AAAGGGATGG CGCCATGCAC AAAGTCCAAT TAATAATCAA ACTACTACTA 480

CAACTAGGAA TCATCATTGT GATTACTTAT ATTGGCACAG AAATTCAAAA GATTTTTCAT 540
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ATTGTACCGC 


TAACTTGGGT AGAAGACGGT GCAAACTTTT TATTAAAGAC GATGGTCTTT 660

TTCTTCATAC CGTCAGTTGT AGGtATTATG GaTGtgCTTC CGAAATTACG CTAAATTATA 720

TACTCTTTTT CGCAGTCATT ATCATAGGAA CATGTATCGT TGCATTATCT TCAGGTTATA 780

TTGCTGAAAA AATGTCyGtT AAACwTAAAC ATCGTAAAGG TGTAGACGCt TATGAATGAT 840

TACGTGCAAG CCTTATTAAT GATTTTGTTG ACTGTCGTTT TATATTATTT CGCTAAAAGG 900

TTACAACAAA AATATCCGAA CCCATTTTTG AATCCAGCAT TAATTGCATC TTTAGGAATT 960

ATTTTTGTCT TACTTATCTT TGGAATTAGT TATAACGGGT ATATGAAAGG TGGCAGTTGG 1020

ATCAACCATA TTTTAAACGC AACGGTCGTA TGTTTAGCGT ACCCACTTTA TAAAAATAGA 1080

GAGAAAATTA AAGACAATGT CTCTATCATT TTTGCAAGTG TATTAAcTGG CGTCATGCTG 1140

AATTTCATGT TAGTGTTCTT AACACTTAAA GCATTTGGCT ATTCTAAAGA CGTCATTGTA 1200

ACGTTATTGC CCCGATCTAT AACAGCCGCA GTAGGTATCG AAGTGTCACA TGAACTAGGT 1260

GGTACAGATA CGATGACCGT ACTTTTTATT ATCACAACGG GTTTAATCGG TAGTATTTTA 1320

GGTTCGATGT TATTAAGATT TGGAAGATTT GAATCTTCTA TCGCCAAAGG ATTAACGTAT 1380

GGGAATGCGT CACATGCATT TGGCACAGCT AAAGCACTAG AAATGGATAT TGAATCCGGT 1440

GCATTTAGTT CAATTGGGAT GATTTTAACT GCAGTTATTA GTTCAGTGTT AATACCTGTT 1500

CTAATTTTAT TATTCTATTA ATTTAGATAT TTAAAATGAT AGACAGAAAG GGAGGCTATT 1560

AGTAATAATG GCAAAAATAA AAGCAAATGA AGCATTAGTT AAAGCATTAC AAGCaTGGGA 1620

TATAGATCAC TTGTATGGTA TTCCAGGAGA CTCAATCGAC GCATAGTCGA TAgTTTACGT 1680

ACAGTGAGAG ATCAATTTAA ATTTTATCAT GTACGTCATG AAGAAGTAGC AAGCTTAGCG 1740

GCTGCTGGTT ACACAAAATT AACTGGTAAA ATCGGTGTGG CATTAAGTAT CGGTGGCCCT 1800

GGTTTAATTC ATTTATTAAA TGGTATGTAT GATGCCAAAA TGGATAATGT ACCGCAATTA 1860

ATATTATCTG GACAAACGAA TAGTACAGCA CTTGGAACGA AAGCATTCCA AGAAACAAAT 1920

TTACAAAAAT TATGTGAAGA TGTAGCCGTT TATAATCACC AAATTGAAAA AGGTGACAAT 1980

GTGTTTGAAA TCGTTAACGA AGCAATTCGT ACGGCATATG AACAAAAAGG TGTAGCTGTT 2040

GTTATTTGTC CTAACGACTT ATTAACTGAA AAAATTAAAG ATACAACGAA TAAACCAGTA 2100

GATACATCAA GACCAACAGT AGTATCACCA AAATATAAAG ACATCAAAAA AGCGGTTAAA 2160

CTAATTAATA AAAGTAAAAA GCCTGTCATG TTAATTGGTG TAGGTGCGAA ACATGCGAAA 2220

GATGAGCTAC GTGAATTTAT TGAAATGGCT AAAATTCCTG TCATTCATTC ATTACCAGCT 2280

AAAACAATCT TGCCGGATGA TCATCCATAT AGTATCGGtA ACTTAGGTAA AATCGGTACC 2340

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CCATATGTGG ATTACTTACC TAAGAAAAAT 
AAAAATATCG GACATCGTTT CAATATTAAT 

TTGCATCAGT TAACTGAAAA TATTAAACAT

TTAGAACGTA AAGCGGTTTG GGATAAATGG 
CCATTACGTC CAGAACGATT AATGGCATCA 

ATTTCAGCAG ATGTAGGTAC AGCAACAGTT
AATAACAAGT TCATCATTTC AAGTTGGTTA 
ATTGCATCAA AAATTGCATA TCCAAATAGA 
TTCCAAATGG TAATGCAAGA CTTCGCTAGA 
TTTGTACTTA ATAACAAACA GTTAGCATTT 
TTAGAATATG CAGTTGATTT TTCTGATATG 

GGTAAAGGTT ATACAATTAA GAGTGCTAGC 
GCACAAGATG TACCAACGAT TGTAGATGTA 
GGTAAAATTG TAAATGAAGA AGCGCTTGGT 

25 

GAAGATAAAC ATTTAGATTT AGATCAAATT 
TTATAACTGA TTTAAAGGTT ATCACAATTG 
TCAACAAAAT GGGAATTGCC GTTTTGTTTA 

30 

ATAAAATTGT GAAAAAGTTG TTGAAAACGC 
AGATATCACT TGCGTGTTAC TGGTAATGCA 

3S AGTCTTGTTT GTTCATGCCT GCTTTTTTTG 
GTTTffTATGT TTAAGAAATT GTTTGGACAA 
CCTGTTGCGA TTTTACCAGC AGCTGGTATT 

40 GAACAATTAG TAGAAATTGC ACCATGGTTA 
GTCATGGAAG CAGCAGGACA AGTTGTATTT 
ACAGCACTTG GATTAGCAGG AGGAGACGGT 

45 TTAATTATGA ATGCAACAAT GGGGAAAGTG 
TATGCCAAAG GGGCAAAAGA ATTAAGTCAA 
TTAGGTATTC CAACGTTACA AACGGGTGTG 

50 GCATGGTGTT ACAACAAATT TTATAATATT 
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ATTAAAGCCA TTCAAATTGA CACAAATCCT 24 60 

GTAGGAATTG TTGGAGATAG TAAAATTGCG 2520 

GTTGCTGAAA GACCATTCTT AAACAAAACG 2580 

ATGGAACAAG ATAAAAATAA. TAATAGTAAA 2640 

ATCAATAAAT TTATTAAAGA TGATGCAGTG 2700 

TGGTCAACTC GATACTTAAA CCTTGGTGTA 2760 

GGTACAATGG GTTGCGGTCT TCCAGGTGCA 2820 

CAAGCCATCG CAATTGCTGG TGACGGTGCA 2880 

GCAGTACAAT ATGATTTACC TTTAACTGTA 2940 

ATTAAATATG AACAACAAGC AGCTGGTGAA 3000 

GATCATGCAA AATTTGCTGA GGCAGCAGGT 3060 

GAAGTAGATG CTATAGTCGA AGAGGCATTA 3120 

TATGTTGATC CTAATGCTGC GCCATTACCA 3180 

TATGGTAAGT GGGCATTTAG ATCAATTACT 3240 

CCACCAATTT CAGTGGCAGC AAAACGTTTC 3300 

AATTGAACTA TAAAAACGGT AATTTCTATT 3360 

TTTATCACAA ATGATCGTAC TGAATTGATG 3420 

TTTTACAAAT ATGTATAATA GCTATGAATT 3480 

GGCATGAGCA AACAACCGCA CTATGAGAAT 3540 

TACATGGAAG CGGAAATTGA GATAGGGGAT 3600 

TTGCAACGTA TCGGTAAAGC ATTAATGTTA 3660 

TTATTAGCGT TTGGTAACGC AATGCACAAC 3720 

AAAAACGATA TCATTGTAAT GATTTCGTCG 3780 

GATAACTTGC CATTATTATT TGCAGTTGGT 3 840 
GTTGCAGCAT TAGCAGCGCT AGTAGGTTAC • 3900 

TTGCACATTA CAATTGATGA CATTTTCTCA 3960 

GCAGCGAAAG AACCAGCACA TGCTTTAGTA 4 020 

TTTGGTGGTA TTATCATGGG TGCTTTAGCC 4 080 

ACACTACCAC CATTTTTAGG ATTCTTTGCA 414 0 



55 
AGCTTTGCGT GGCCACCAAT TCAAGATGGA TTAAATAGTT TATCGAATTT CTTATTAAAT 4260
AAAAATTTAA CATTAACAAC GTTTATATTC GGTATTATTG AACGCTCATT AATTCCATTT 4320
GGTTTACATC ATATTTTCTA TTCACCGTTC TGGTTTGAAT TCGGAAGTTA TACAAATCAC 4380
GCAGGTGAAT TGGTTCGTGG TGACCAACGT ATTTGGATGG CACAATTGAA AGATGGCGTA 4440
CCATTTACTG CTGGTGCA7T TACTACTGGT AAATATCCAT TTATGATGTT TGGTTTACCA 4500
GCGGCGGCAT TTGCTATTTA TAAAAATGCA CGACCAGAAC GTAAAAAAGT CGTGGGTGGT 4560
TTAATGTTAT CAGCAGGATT AACTGCATTT TTAACTGGTA TCACTGAGCC ATTAGAATTT 4620
TCATTCTTAT TTGTAGCACC AGTACTTTAT GGAATTCACG TATTATTAGC TGGTACATCA 4680
TTCTTAGTAA TGCATTTATT AGGCGTTAAA ATTGGTATGA CATTCTCAGG TGGTTTCATA 4740
GATTATATTT TATATGGTTT ATTAAACTGG GATCGTTCAC ACGCATTATT AGTTATTCCA 4800
GTCGGTATTG TATATGCTAT CGTGTATTAC TTCTTATTCG ACTTTGCAAT TCGTAAGTTT 4860
AAATTGAAAA CACCAGGTCG TGAAGATGAA GAAACTGAAA TTCGTAACTC TAGTGTCGCA 4920
AAATTACCAT TTGATGTCTT AGATGCAATG GGTGGAAAAG AAAACATTAA ACATTTAGAT 4980
GCATC3TATTA CACGTCTACG CGTAGAAGTG GTTGATAAAT CAAAAGTAGA TGTAGCAGGT 5040
ATTAAAGCTT TAGGCGCATC AGGTGTATTA GAAGTTGGAA ACAATATGCA AGCTATCTTT 5100
GGTCCAAAAT CAGATCAAAT TAAACATGAT ATGGCCAAGA TTATGAGTGG TGAAATTACG 5160
AAACCAAGTG AAACGACAGT GACTGAAGAA ATGTCAGATG AACCAGTTCA CGTAGAAGCA 5220
CTTGGAACAA CAGACATCTA TGCACCAGGT ATCGGTCAAA TCATTCCATT ATCAGAAGTA 5280
CCTGATCAAG TATTCGCTGG TAAAATGATG GGTGATGGTG TTGGCTTTAT CCCTGAAAAA 5340
GGTGAAATTG TAGCACCGTT TGATGGTACA GTGAAAACAA TCTTCCCTAC GAAACATGCG 5400
ATA(S3ATTAG aatctgaaag tggcgtcgaa gtacttattc atattggtat CGATACAGTG 5460
AAACTGAATG GTGAAGGATT CGAAAGTCTG ATTAACGTTG ATGAAAAAGT AACACAAGGT 5520
CAACCATTAA TGAAAGTGAA TTTAGCATAC TTGAAAGCAC ACGCACCAAG CATCGTTACA 5580
CCAATGATTA TTACAAATCT TGAAAATAAA GAACTTGTCA TTGAAGATGT ACAAGATGCT 5640
GATCCAGGTA AGCTAATTAT GACAGTCAAA TAATGATTAA AAATGAAACA GCATATCAAA 5700
TGAATGAACT TTTAGTCATT CGTAGTGCGT ATGCGAAGTA GCGAGTTGAA AGAGAATACG 5760
TTACAAAAGG CAGTAGCTTA AAATGAAGCT ACTGCCTTTT TAGTGCGCAA TGATGTATAG 5820
CAGGTGTGTT GATGrTAATA AGTTAAATAT TAGTGTTAGA TATAGAAAAC ATTGCTTATG 5880
TTTTTGTCAC ATTTTAGAAA AATGCATCTT CGCGACTAGC CAAATTAATA GTCTCATTGA 5940
AATAAATTAA CATGATTTTA AATCTATTTG TAAGATAAGG AGATTTGTCA TTATGACAAC 6060
AGAAGGTCTA TTAGTTGCAG AGAAAGAAAT CGAAGTGAAT GGTTACGACA TTGATGCGAT 6120
GGGTGTCGTT AGTAATATCG TTTATATTAG ATGGTTCGAA GATTTGAGAA CAGCGTTTAT 6180
TAATCAGCAC ATGAATTACT CAACAATGAT CAATCAAGGC ATTTCACCTA TACTTATGAA 6240
AACGGAAGCA GAGTATAAAG TACCTGTCAC AATACATGAC AAACCAGTAG GTCGTATTTA 6300
CTTAGTTAAA GCAAGCAAGA TGAAATGGGT GTTTCAGTTT GAAATTGTGT CCGCACATGG 6360
CGTGCATTGT ATTGGTACAC AGACAGGCGG TTTTTACAGA TTGAGTGATA AGAAGATAAC 6420
CTCTGTGCCA CAAGTGTTTC AAGACATTTT AGCAACAAAA TAATGACTTC ATTTTAAAAT 6480
ATAAAAAGTA AGAAGGTGTT CGAAATGGTT AAGCAATTAA ATAGTGTCGA AGCATTCCGT 6540
GAATTTATTC ATCAATATCC GTTAGCAGTT GTACATGTCA TGCGCGATCA GTGTAGCGTG 6600
TGTCATGCCG TTTTACCACA AATTGAAGAC TTGATGCAAT CATATCCCAA TGTGCCATTA 6660
GCTGTGATTA ATCAAAGTCA GGTGGAAGCT ATTGCTGGAG AATTAAATAT TTTCaCTGTA 6720
CCTGTGGATT TAATTTTTAT GAATGGAAAA GAAATGCATC GTCAAGGGCG TTTTATCGAT 6780
ATGCAACGTT TTGAACATCA TCTTAAGCAA ATGAATGATA GTGTAAATAA CGATGTCGAT 6840
GAGCATTAAT ATCGCAAATG ATTAGCATTG CTAAGATTAT GTAGACATCA TAACTTATTT 6900
CCCAGTAAAT ATTGGTAGTA ATTAGAATCA GCATGGTACA GTAGAACTAT AGTAGAAATC 6960
ATCAAAGAGG AGTGACGACA AATGCGTAAA AAATGGTCTA CACTTGCGTT TGGATTTTTA 7020
GTTGCAGCAT ACGCACATAT TAGAATTAAA GAAAAACGCA GTGTGAAAAG TTATATGTTA 7080
GAACAAGGTA TACGATTATC TAGAGCTAAG CGTCGTTTTA TGTATAAAGA AGAAGCGATG 7140
AAAGCATTAG AAAAAATGGC GCCACAGACA GCAGGCGAAT ATGAGGGAAC CAATTATCAG 7200
TTTAAGATGC CAGTAAAAGT GGATAAGCAC TTCGGTTCAA CCGTTTATAC CGTTAACGAT 7260
AAACAAGATA AGCATCAACG CGTTGTATTA TATGCACATG GAGGCGCATG GTTCCAAGAC 7320
CCACTCAAAA TTCATTTCGA ATTTATTGAT GAACTTGCAG AAACACTCAA TGCTAAAGTC 7380
ATCATGCCAG TATATCCGAA GATTCCGCAT CAAGATTATC AAGCGACGTA TGTGCTTTTT 7440
GAAAAGTTGT ACCATGATTT ATTGAATCAA GTAGCAGATT CTAAACAAAT CGTTGTAATG 7500
GGTGACTCTG CGGGCGGTCA AATTGCTTTA TCATTTGCTC AATTGTTAAA AGAAAAACAT 7560
ATTGTGCAAC CAGGACATAT TGTATTAATT TCACCAGTTT TAGATGCAAC GATGCAGCAT 7620
CCTGAAATTC CTGACTACTT AAAGAAAGAC CCAATGGTAG GTGTGGATGG CaGTGTGTTC 7680
TTAGCTGAAC AATGGGCAGG GGACACACCT TTAGATAACT ACAAAGTATC ACCAATTAAT 7740
CCAGATGCTT TGAACTTATC GCAATTGTTG AGTGCGAAAG GTATCGAACA TGACTTTATA 7860
CCTGGATATT ACCAATTCCA TATTTATCCA GTATTTCCGA 7900
(2) INFORMATION FOR SEQ ID NO: 139:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1984 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 139:

GTCTAAATAA ACAAAATTAT CATTGATTaC TGAACTGGCA TTTCGAAGTA ATGCTTCAAT 60
ATCATTCGAA TATTTCTTCA ATTTATGATT GTGAAATAAT TCTTGCATCA AAAATGGTCT 120
TTGGTCACAT GAATGTGCAT CTGAAGCTAC AAAATGAGCC AAATTACATT CTATAAATTG 180
TAATGATAAC TTTTGAATGT TTTTACCAAA TCCACCAACT AAAGAACTCG ATGTTAATTG 240
ACTCAGTGCC CCATTTGCAA CCAATTCATA TAATATTTCC GGATTTTTGG CGATACTTCT 300
ATTTCTTTCA GGATGTGCAA TGATTGGTAT GTAACCTCTC GATTGTATTT CAAAAAACAA 360
TTGTTTTGTA TAATGTGGTA CTTCGCCCGT TGGAAATTCA ATTAATAAAT ATTTCGAACG 420
ATTAATACCT TGAATACTAC CATTATCTAA GCCTTTCAGA ATCGAATCTG TAATTCTAAT 480
TTCTTGCCCG GGAAATAATT TAATATCCAA TGCTTGAACT TCTGGATGCG TTCTTAACTC 540
CGCCAATTTC ACAAGCACTT GTTGAAATGT ATTATCATAT CTCGGATGCA AATGATGAGG 600
TGTCGCTACA ATACTTGTTA CACCTTCATC CTTAGCTTGC TTTAATAGTG CAATACTCTT 660
TTCAATTGTT TTAGGACCAT CATCTATATC AACTAATATA TGGTTATGAA TATCAATCAT 720
GATTCATCAG TCCCATAATA TGCATAGTAA CTAGCACTTT TATCTTTAGG CATTCTATTT 780
AAGACTACAC CTAATAATTT AGCACCTGTT GCTTCAATAA GTTCTTTTCC TTTTTTAACT 840
TCATCTCTAT TATTATTTTC CGAATTAACT ACGTAGACAA CATTGCCGGT AAACTTTGAA 900
AATAATTGCG CATCTGTAAC TGTGTTCACT GGTGGCGTAT CGATAATTAC AAAGTTATAA 960
TTCATCAATA ATGTGTCATA CAAATTTGCA AATGCCCTTG ATGTAATTAA CTCTGACGGA 1020
TTCGGTGGGA TTGGCCCAGA CGTCAAGACG TCTAAATCTT GAATTTCAGT TGAGATAATA 1080
CTGTCTTGAT AAGTTGACCA ATTTAGCAAT AAACTTGATA GGCCTTCATT GTTTGGCAAA 1140
TTAAAAATAT AATGCTGCGT AGGTTTACGC ATATCCCCGT CTACGATTAG TGTTTTATAA 1200
CCTGCTTGCG CATATGCAAC TGCTAAATTT GCTGCAATTG TAGACTTACC TGCGCCTGGT 1260
GATCTTATGC CTCGAAATTT CTCGCTAATA GGTGACTTTG GTTGTTCATG GACAATTAAA 1380
CTTGATGTAC TTCyTCGTGT ATTCGTCATG GTAATTCCTC GTAAATTAAA ATTTTTGTAT 1440
TGAACCTAAA ATAGGTAATC CTAGTTGCGA TTCAACATCT TCTTCTGTCT TAATACGCTT 1500
ATCTAATAAT TCTTTTAAGA AAATAATCAA TATTGCTAAA ACAATACCAA CAATAATGCT 1560
GATAACTAAG TTGACAGATA CTATTGGAGA TACTTTTACA GCATTATCAT GTGCTGAGGA 1620
AAGTATCGTA ACATTATCAA CACTCATAAT TTTAGGCATG TCATGAGCAA AAACTTTAGA 1680
TATTTTATTA ACAATTTTGT CAGATTCAGA TTTATTCCCA GTGGTAACTG ATACAGTAAT 1740
AATTTGAGAG TTTGTTTGAT TGGTTACTTT TAAAAATGAA TTCAACTCAG CTGTTGAATA 1800
CTGACCATCA AnTTCTCTAG ATACTTTATC TAGAATTCTA GGACTTTTGA TAATTTCCGT 1860
ATATGTATTA ACAGACTGCA AACTACTTTG AACATTTTGG AAAGCTAAAT CACTTGAGGA 1920
CTTTTTCATG TTCACTAATA TTTGAGTAGA AGCAGTATAT TTGTCAGGCA TAACAAAAAA 1980
GGTT 1984
(2) INFORMATION FOR SEQ ID NO: 140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6272 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 140:

CAAATCCCTT GGTGATGAtA AAtGtATTGC TGTGTAGCCA AATAATCTTC GTATATATGA 60 

35 CTGACGTTCA ACAACAGCTT GCAATCGTTT CGTTGGTACA GTTACTTTCT TCTTGTTAAA 120 

GA<^CCATAT TCAATTTTAA GTTGCTCATT TTCAAGCATC ACCGAAAAGC CATAAAATCT 180 

TATCATTGTT ATAATCGTTC CAATAATATA TGCCACTATT AATACTAGTA AAATGATGAT 240 

40 TAATACTGAA ATACTTACAA TTTGAACCCA TTGACTAATT TCATGATTTA GCTTCGACCA 300 

TGGGATCAAC TCTCTTACAG CCCCGTAAAT CGGTACTAAA GCTGCTAACG TTACACCAAT 360 

GGCGCCACTG GTCATTGCCA TAAATAGTGA TTCTTTAAAA TTCATCTGAT ATATAGGAAT 420 

GCGTTTATTT TTCTGATTAA GCATACTATC AGTGTTCTGC ACTTCATCTA AGCGACCTTC 480 

TGCGATGTCT TCCACATTAC CTTCAATGTC ATGATTACAG TTGTCATTCT TCTCAGCACT 540 

AGACTTTTGC GCCACTTCTG TCTTCAACTC TGTTTGCAAT TGATCAATAT ATCGTTCAAG 600 

ATATTCACCT TGTTTTTTCG AAATAACACT TAAGACAATA CCATCACTTG GTGTTTTGAT 660 
AATACGTTTT ATATTTAATT CTTTACGCTT TTTATTAAAA ATACCTGTTG TTAAAATGAA 780

ATAATTATCC tCAATCCAAT ATCGCGTGTT CATAATTCCG ACAATTTGAG AAATGTATGA 840 

TATTAAAAAG AATACAAATA CAATACCTAT CCATAAATAT GATTCGGGAT TCGTATAATC 900 

AAAATCTTTC AATTGAAAGA TAATGAAAAT AAAAAAGACG ACTATGTTTT GTTTGATAGC 960 

ATTGATTATG CCATTAAAAT ATGAAATCGG ATGTAATTTT TGAGGTTCAG ACATCACTTT 1020 

CAACCCCTCT CAAATTCGAC ATAGTTCTCT CTTCGATTAT TTTAACATCG TCATGAGACA 1080 

TCATCGGTAA ATAAATAGTA TGACCTGCAG TCATAAATGC AACTTTATAC AAATTAAGCA 1140 

75 CTTTACTAAT TGGATTAGAT TTAATCGACA AGTATTGTAA ACGTTCAATT CGACTCGTTT 1200 

CTTCTTTATA TATAAAAAAT GATGTACGAT ATTGTACACT TAGTTGATCA ACTTTATAAA 1260 

AGCGACAATG ATATTGCCAT AAAGGCTTAA TAAATAATTT TAATGTACTC AGAGCACCTA 1320 

20 AAACCAACAA AATATAAAGT AAGTAATGTG GCCATTCAAA TCTTAACCAT ATAAAATAAA 1380 

AAATGACATA CACAGCTACA CTCAATATAA ATTCTAAGCC ATTCGTAATG TAGTAATACA 1440 

ACAATGCTGA CTTAGGACTC TTAGTCAACT TAGTATAATC TGACATATAC CCCTCTCCCC 1500 

AAATAAAAAA TTATACGGAT TTATAATCTA TTTCATTTTA TTTTTATATG ATGATAATTA 1560 

TAGCATATGG AATATTTCAT GCTAATTTAT TCTTCCTAAA GGTACATCTA AAAATTTAAT 1620 

TAAGCAGAAA GTGCTTGAAT TGCTAAAAAG ACACCATGTT ATAATTTTAT CAACATGATG 1680 

CCTTTCATCT ATAATCAATC TTTCATCTTA TCAAGAGCGA TATTTAGTTC AAGCACATTC 174 0 

ACATAATCAT TTGTTAACAC ACCACGCTGC TTACGATGTT GAATCAAGTC GGCCACTCTT 1800 

3S GAAGTAGATA CATGACGAGC ATCAGCAATA CGAGGTGCTT GCTTCAATGC ATTTTCGACC 1860 

GTAATATGCG GATCTAAGCC CGACCCAGAA CTTGTTGCAG CATCTATTGT TACATTTGAA 1920 

TTCOCAAATT TAACATGATG TTTCATGCGT GCTATTAATT CGGTGTTTCC ATTCGATTCA 1980 

40 TTACTTCCAC CTGAAGATAC GCCGTTTTTA TATAATTTTT CAGGATTCAT ATTATAATCA 204 0 

ACTGCACTCG GTCTCCCGTG AAAATATCGT GTCTCTGTCC AGTGCTGTCC AATCAATTTT 2100 

GATCCAACTA TACGATTGTC ATACGTAATT AAACTGCCAT TTGCTTGTTG ATAAAAAAAT 2160 

ATTTGACCAA TTAACGTGAT AGCTAACGGG AATAAAAATC CACATAATAC CATAGTTATT 2220 

ATCGTTAAAC AAATACTATT TCTTATCGTA TTCATGGTAC AGGCTCCTTC CTCTTTACAC 2280

AAAAAATTGT ACAATCATAT CTATTAATTT AATGCCTAAA AACGGGACGA TTAATCCACC 2340 

TAATCCATAA ATCAACATAT TATTTATAAA GATTCTATCA ATGCTGTAAC CCTTTACTTT 24 00 

TACACCTTTC ATGGCAATTG GAATTAAGGC AACAATGATT AATGCATTGA ATATCAAAGC 24 60 
AATTGTTGAC ATCATTAGTG CAGGTAAAAT TGCAAAGTAT TTTGCTACGT CATTAGCCAA 2580

ACTAAATGTC GTTAATGCAC CTCTCGTCAT TAATAATTGT TTGCCTATTT TTACAACCTC 2640 

TATTAACTTT GTAGGATTCG AATCTAAATC AATTAGATTA GCTGCCTCTT TAGCACTAAT 2700 

TGTCCCTGAG TTCATAGCTA ATCCTATATT . CGCTTtGTGc tAGCGCAGGT GCATCATTTG 2760, 

TACCATCTCC TGTCATCGCA ACAATATGGC CTTTCGCTTG TTCATCTTTG ATGACTTTAA 2820 

TTTTATCTTC GGGTTTACAC TCTGCAACAA ATCTATCAAC CCCGGCTTCT TTTGCAATTG 2880

TAGCTGCTGT TAAAGCATTA TCACCTGTAC ACATAACTGT TTCAATCCCC ATTTTTCTCA 2940 

,5 ATTCAGTAAA TCGTTCTACA AGACCATCTT TAATCACATC TTTTAAATAA ATCACGCCAA 3000 

GCATGACATT GTTTTCAATG ACTATTAAtG GnGTGCCACC TTTACTCGAT ACATCCATAC 3060 

AGAGAGACTC AATATTAAGA GGAATATTGC CTTGTTGTTG TTTGACAAGA TTTATCATAC. 3120 

20 TATTAGGTGC ACCTTTGAAT ACCGATATTT CATTTGTAAT GATTCCGCTC ATTCTAGTTT 3180 

CAGCTGTAAA AGGCTTATAT GTGCCATCAA TGTCTTTAGG CAGCTCATTT ATATACATcT 324 0 

GcttCGCTAA TCGTACAATA CTTTTTCCTT CTGGCGTATC ATCGTAGATT GATGACATAT 3300

AAGCAGCGAC TATCAATTTT TCAAGCATTT GTTGATTCAC TGGTAAAAAT TCACTAGCGA 3360 

TTCGATTGCC ATAAGTGATT GTGCCTGTCT TGTCTAAAAT CATTACATCG ACATCTCCAC 3420 

ATACTTCTAC AGCACGCCCA CTTTTCGCTA ATACATTGAA TTGAGTAACA CGATCCATGC 34 80 

CTGCAATACC AATCGCCGAT AACAAACCAC CGATTGTCGT TGGTATTAAA CATACTGTTA 354 0 

ACGCAATGAG CATCGCAATA GGTAAAATTA AATGCAGGTA AGATGCTATT GGATATAACG 3600 

35 TTACAATAAC GACTAAAAAT ATAATTGTTA ACGTTGTTAA TAATGTAAAA AGTGCAATTT 3660 

CATTTGGTGT TTTATTTCTT TCCGCCCCTT CAACTAAGGC AATCATTTTA TCTAAAAAAG 3720 

ATGTACnCGC TTCACTCTCA ACACGTATTT CTAACCAATC AGATGTTACA AGTGTACCGC 3780 

40 CAATGACTCC ATCAAAATCG CCACCTGATT CTTTTATCAC AGGTGCAGAC TCACCAGTAA 3840 

TTGCAGATTC ATCAACGGTT GCTAATCCAT TTATTACAAC GCCATCAGCA GGGATTGTTT 3 900 

CTCCATTTTC TACCCGAATA TTTTGTCCGG CTTTTAACTC TGTGGCGTTC ACTATCCGAT . 3 960 

ACGCACCATT TTCTTCTATC AATCGAGCAG TTAAATTTGA TTGTGCTTGT CTTAAACTAT 4 020 

CAGCTTGCGC TTTTCCACGA CCTTCAGCAA AGGCTTCTGA AAAATTAGCA AACAATATAG 4080 

TTATTAATAA TATGATAAAA ATTGTAATCA AATAACCTCG CGATAGATAG CTAGTTCCAA 4140 

ATATGTCAGG AAAACATATT AATATCAACG TTAAAATCAT TCCAACCTCA ACGACAAACA 4200 

TTATCGGATT TTTTATTAAT TGTTTAAGAT TCAGCTTATA AAAACTCATT TTCAAAGCTT 4260 
TTTATTTTAA AGTTAAAAAT TCACCAATAG GACCAAGTAA TAGTACTGGA ATAAATGTCA 4380

AACCACTTAG TAAAACGATA AATACGATTA GTGATACGCC AAAATAAGGT TTATCAATCG 4440 

CTATTGTATA TTTATCTTGA TGGTATGATT TTTTATTCAC TAAACTTGAT GCAATCATTA 4500 

ATTGCAAAAT AATTGGTATA TAACGAGAAA GCAACATAAT GATTCCTGTA GAGATATTCC 4560 
AGAATGTTGT ATCATCTTTC AGTCCTTCAA ACCCTGATCC ATTGTTCGCA GCAGCTGATG . 4620 

TCATTTCATA CATAACTTGT GAAATACCAT GAAAAGACGG ATTCGTtATa CTTtCACTTG 4680 

CTCCAGGAAT CATAAAAGCA AGTGCTGAAA ATACTAAAAT TAAAATTGGG TGTATGAGAA 4740 

75 AGACTAAGAC AATACATTTC ATTTCACGGG CGCCAATTGG CATATTTAAA TATTCTGGTG 4800 

TTTTACCAAC CATCAAACTG CATATAAACA CCGTCAGTAA GACAAATATC AATAAATTCA 4860 

TGAGTCCTAC GCCTTCGCCA CCAAATACAA CATTTAGCAT CATTAATACC ATTGGTCCTA 4 920 

20 ATCCACCTAT AGGCGTTAAG CTATCATGCA TGTTATTAAC AGAACCCGTT GTAAATGCCG 4980 

TCGTAATAAC TGTAAATAGT GCTGACAAAC CTGCTCCAAA CCGTACCTCT TTACCTTCCA 5040 

. TATTCGGTCC ATAAATGCCT AAATTCGCTA GTATTGGATT ACCACGATAC TCACTCCACA 5100 

TAGTTAATGT AAGAATTGCT ATAAAAATGA AAAACATTGC GACAAATAAT ATCAACGCAT 5160 

GACGATGTAC TCGTTTACCA TGTCTACTTA ACATGCGACC AAATAAGAAC AACATTGACA 5220

TAGGAAGTAA CATCATACTG CCCATTTCTA TAAAATTGCT CCAAATATTT GGATTTTCAA 5280 

AAGGTGTTGC AGAATTTCCT GCTAAAAATC CTCCACCATT CGTACCAAGA TGTTTTATTG 5340 

ATTCAAGTGA TGCAATAGGT CCAAATGCAA TATGTTGAAT ATGTCCGCTT AAAGTCCGAA 54 00 

3S TCATTAAATT AGCATGCAAC GTTTGTGGTA CaCCTTGAGT CATCAATAAA ATACTAATTA 54 60 

AACATGATAA TGGTAAAAGT ACTCGGACAA TAAACCGAAC AATATCTTGA TAAAAATTAC 5520 

CAATGATATT AGTTAATCCA. GTTAAACGTC TCAACATCGC. TATACAAACG GCGTAACCTG 5580 

40 ATGCACTAGA TGTAAACATT AAATATGTCA TTACAATCAT TTGCGTTAAA TATGTCACAT 5640 

CTGaTTCACC GTTATAGTGT TGtAAATTAC TATTTGTTAA AAAAGATATT GCTGTATTAA 5700 

ACGCTAAATC TATCGATTGG TTTAAATTAT GATTTGGATT TAAAAAAAGC CATTGCTGAA 5760 

CTATTAGCAA TACAAATGTT ATAAACCCCA TAAATCCATT AAATGCCAGA AAATGTTTGA 5820 

CATATGTTTT AGCTGACATG TGTTCTAAAT CTGTGCCGAT AATTTTAAAA CACATATTTT 5880 

CAAATCTAGT AAATATTAAA TCTACTCTTG ACGATTGCAC CAATGCTACG CGATATAGAT 5940 

ATCCACTAAA AACATACGTA ATCATAACCA TCATTGTTAG AAACAAAATT ATTTCCATGA 6000 

TAACCCTCAC TTAATATATT TCTAAAATTT TTCACTACGA ATTAAGGCAT AAAATAAATA 6060 
ACACAACAAC ATCGTAACAA CTTGTTTATG AGAGAAATnT TAATTTTCAA ACTTAGTTAT 6180

TAAGAAAnCA TTAAGATGTG TATGCAGAAA TAAATTTTAT AGCATTTAAT TGTGAAGAAT 6240

ATTATGATAT TGCTATCGAG GTGAAGGTTA TG 6272

(2) INFORMATION FOR SEQ ID NO: 141:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1978 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 141:

AAATGATGTT TTACAATAAA TATAnAAACG TATCAACATA TATCATCATA TTTTTAGTTT 60 

20 CAAGTGCAGC CTTTGCAATA TTCTTGTTAA GTGCGnACAT TAGTGCTCAC TCGGAACAAG 120 

TGTACGAAAT GACTGACCAT CAAATTAAGA ACAATACGAT AAATAAAGCA TACGAACATA 180 

AAGACCCTAC AAACAATAGC GAACAAAGAG ATGGGAAAGT GTTCGCTTTA ATAAATTGAT 240 

ACATTGTCAC AACGTTATTT TGCCTATTTT TGCGmAATAG CGTTTTTTAT TACwTTTTTG 300 

CTGATsTTAA ATTTGTTATA TTTTGTTAAA GTATTATAAT GATTGAATAA ACAAATTGAA 360 

GGTAGGTTTT TTAATTGAGT AATTCTGATT TGAATATCGA AAGAATTAAC GAGTTAGCTA 420 

AAAAGAAAAA AGAAGTAGGA TTAACTCAAG AAGAAGCAAA GGAGCAAACA GCCTTAAGaA 4 80 

AAGCTTATCT TGAGAGTTTT AGAAAAGGGT TTAAACAACA AATTGaAAAT ACTAAAGTAA 54 0 

TTGATCCAGr AGGTAATGAT GTAACACCTG AAAAAATTAA AGAGATACAA CAAAAAAGAG 600 

ATAATAAAAA TTAAATCACA AATCTGTAAA GAATTTTCTG ACATTATAAC TTGAAATAAG 660 

TATTTTACTT ATCTTTTTAT TTTAAAATAA GTTATAATGT ATTTGATAAA ATTGAAGAAG 720 

40 GGAAGATACA CAAGATGTTT AATGAAAAAG ATCAATTAGC TGTTGATACG CTACGTGCAC 780 

TAAGTATCGA CACAATCGAA AAAGCGAATT CTGGTCATCC AGGATTACCT ATGGGAGCTG 840 

CCCCAATGGC TTACACTTTG TGGACACGTC ATCTGAATTT TAATCCACAA TCTAAAGATT 900 

45 ACTTCAATAG AGACCGTTTC GTATTATCTG CAGGGCATGG TTCAGCATTA TTGTATAGCT 960 

TGTTACATGT TTCTGGTAGT TTAGAATTAG AAGAATTAAA GCAATTTAGA CAATGGGGTT 1020 

CTAAAACACC AGGTCATCCT GAATACAGAC ATACAGATGG TGTAGAAGTT ACTACCGGAC 1080 

CACTTGGACA AGGTTTTGCT ATGTCAGTAG GATTAGCTTT ACAGAAGATC ACCTAGCAGG 1140 

gAAATTTAAT AAAGAAGGAT ATAATGTTGT AGATCATTAC ACATATGTAT TAGCTtCTGA 1200
AAGTAAATTA GTTGTTTTAT ACGATTCAAA TGATATTTCA TTAGATGGCG AATTAAACAA 1320

AGCTTTTTCT GAAAACACAA AAGCTCGTTT TGAAGCATAT GGTTGGAATT ACTTACTAGT 1380 

TAAAGATGGT AATGATTTAG AAGAAATTGA TAAAGCGATT ACTACAGCTA AATCTCAAGA 144 0 

AGGACCAACG ATTATTGAAG TTAAAACAAC AATCGGATTT GGTTCACCGA ATAAAGCAGG 1500 

AACTAATGGT GTTCATGGGG CACCTTTAGG TGAAGTTGAA AGAAAATTAA CATTCGAAAA 1560 

TTACGGTTTA GATCCTGAAA AACGTTTTAA TGTTTCAGAA GAGGTATACG AAATTTTCCA 1620 

AAATACTATG TTAAAACGTG CTAATGAAGA TGAATCTCAA TGGAATTCAT TATTAGAAAA 1680 

ATATGCAGAA ACATATCCTG AATTAGCAGA AGAATTTAAA TTAGCGATTA GTGGTAAATT 174 0 

GCCTAAAAAT TATAAGGATG AATTACCACG TTTTGAACTG GGTCATAATG GTGCATCTCG 1800 

TGCTGATTCT GGTACTGTTA TTCAAGCAAT CAGTAAAACT GTCCCTTCAT TCTTTGGTGG 1860 

ATCAGCAGAC CTTGCTGGTT CAAACAAATC CAATGTAAAT GATGCAACTG ATTATAGTTC 1920 

TGAAACACCT GAAGGtAAAA ATGTGTGGTT TGGTGTACGT GAATTTGCTA TGGGTGCT 1978 
(2) INFORMATION FOR SEQ ID NO: 142:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 142:

TAGTAGTATT TATTAAATTA TACGAAGGGA CCcAACACAG AAAATTCATT TTATTGAATT 60 

TTACATTTAT GTGCCAAGTT GGGAAAAATG TCTTATTTTT TCaAAGTATT TAAAAGTAAA 120 

ATTACATGTT AATACGTAGT ATTAATGGCG AGACTCCTGA GGGAGCAGTG CCAGTCGAAG 180 

ACCGAGGCTG AGACGGCACC CTAGGAAAGC GAAGCCATTC AATACGAAGT ATTGTATAAA 24 0 

TAGAGAACAG CAGTAAGATA TTTTCTAATT GAAAATTATC TTACTGCTGT TTTTTAGGGA 300 

TTTATGTCCC AACCTTTTTA GAATATTAAA TTTCTACAAT TTCGTCATCT TCAACAATAA 360 

45 AGCCCATTGT ATTGACGCTG TTATTTAAGA AAGTCAGAAT ATAACGCATT ACTTCATCAC 420 

GTTCTGGCTC ATTGTGAACC TCGTGGTAAA AACCTTGCCA AGCTTTAAAA TATAATTCAG 480

GTGTTTGATA TTTTTCTTTA AACTCATCAA TTGCCCTAGT ATCAACAATT AAATCCTTCG 540 

TTCCATACAT TAATAGCGTT GGCATTGGTT GAATGTCATG AATATGAGCC ATCGTATCTT 600 

TCATCGTCTC ATTAATTGTA TTATACCAAT GATACGTTGC TTTTTTTAAC ATTAAACCAT 660 
CATTAAAACG TGTGTCTTTT GAAATTTTAC CTATATTTGA AACAAGTTTA TCTTTACGAT 780
TTTTTCCATT CTTTTGAAGT TCTAGCATAG GAGAAATTAA CATCATCCCC TCGATTGGCA 840
ATTCTACTTT TTCAAGTAAA TTTAATAAAA TCAAACCGCC AAGTCCTACC CCTAATACAT 900

AAGTAGGAAT TTTATATTCA TTAGCTATCT TTAACCAGTC TAGCAAACTT TCGTGATACG 960. 

TTTGAAAGTT TTCAATTTGT CCTTTATTAG CTCTTGAAGT TTGACCTTGA CCAGGCAAAT 1020 

CTCCCATAAT CACATGATAG CCATTTCTTC TTAACATCGT AATAACATAT GCATATCTTC 1080

CCGTATGTTC TAATATATTA TGAGCAATAA CAACGACGCC TTTCGCATCA TTTTCAGCTT 1140 

CCCACTTCCA CATTATTATA CTGCCCCTTT TTCATTAATC TTCAATAACA TAATTATAGC 1200 

AAATTCACTA TGTAGATTTC TATTTATAGT ATTATTGTTG TCCATATTAT TATATATAAA 1260 

TGAAATCAAC ATCAATAATA GTGTAATTAT ACATAATTAT TTTTGATTGT TTTTGATGAA 1320 

AAGGCTTTCT CGAATATTTT TTTCATGCTA AACTTATTGT AAACACAAGG GTTTGGAGGA 1380

GTAGCAATGG CACTATTAAA GAATTTTTTT ATCGGATTAT CTAATAATAG TTTTTTAAAC 144 0 

AACGCAGCAA AAAAAGTGGG CCCACGTTTG GGCGCCAATA AAGTCGTTGC CGGAAATACA 1500 

25 ATTCCAGAGT TAATTAATAC AATCGAATAC TTAAATGACA AGAATATCGC TGTTACGGTA 1560 

GACAATTTAG GGGAATTTGT CGGTACAGTT GAAGAAAGTA ATCATGCTAA AGAACAAATT 1620 

TTAACAATTA TGGACGCGCT TCATCAACAT GGCGTAAAGG CACATATGTC TGTTAAATTG 1680

30 AGTCAGTTAG GTGCAGAATT CGACTTAGAA TTAGCTTACC AAAATTTAAG AGAGATTTTA . 174 0 

CTTAAAGCAA ATACTTACAA CAATATGCAT ATAAATATTG ATACTGAAAA ATATGCTAGC 1800

CTGCAACAAA TTGTTCAAGT TTTAGATCGC TTAAAAGGCG AATTTAGAAA TGTTGGTACT 1860

GTAATTCAAG CATATTTATA CGATAGCCAC GAATTAGTTG ATAAGTACCA AGATTTACGA 1920 

TTACGTTTGG TTAAAGGTGC ATATAAAGAA AACGAATCAA TTGCATTTCA ATCTAAGGAA 1980 

GACGTAGATG CAAATTACAT CAAAATAATT GAACAACGTT TGTTAAACGC ACGCAATTTC 204 0 

ACTTCAATTG CAACACATGA CCATCGCATC ATTAATCATG TAAAACAATT TATGAAAGAA 2100

AATCACATTG AAAAAGATCG TATGGAATTC CAAATGCTCT ATGGTTTTAG ATCAGAGTTA 2160 

45 GCAGAAGAAA TCGCAAATGA AGGCTATAAT TTCACTATTT ATGTACCTTA TGGCGATGAT 2220 

TGGTTTGCGT ATTTTATGAG AAGATTAGCA GAACGCCCAC AAAACCTATC TCTTGCTGTA 2280 

AAAGAATTTG TGAAACCTGC TGGCTTAAAA CGTGTTGGCA TAATTGCAGC TTTAGGAGCT 2340 

50 ACAGTTATGT TAGGTTTAAG TACAATTAAA AAATTATGCC GTAAATAGAG CAAGACATAA 2400 

ACAATAATTT AGGAGTCTGG AACAATAATC AATGTTCTAG GCTCCTAAAT GTTATATTGG 24 60 
TAGATTTTAA TAAATTAGCC ATTTCAATTG CACTTACTGC TGCTTCAGCA CCTTTATTGC 2580
CAGCTTTCGT ACCTGCTCTT TCCACAGCTT GTTCAATACT TTCAGTCGTT AAAATACCAA 2640
ATATGACTGG TACATTAGTT TGATCATTCA CTTTAGAAAC ACCTTTCGCG ACTTCATTAC 2700
AAACATAATC ATAATGAGAC GTAGCACCGC GAATTACGCA TCCTAATGTA ATTACTGCAT 2760
CATAATTTCC TGATGAGGCT AATTTTTTAG CTACTAAAGG AATTTCAAAC GCACCTGGCA 2820
CAAATGCTAC ATCAATATTG TCTTCATTAA CATCATGTCG AATCAAAGTA TCTTTTGCAC 2880
CTTCAAGTAA TCTTCCAGTG ATAAAATCAT TAAATCGACT AACTACGATT GCAACTTTCA 2940
AATCTTTTCC AATTAATTTA CCTTCAAAAT TCATGTTAAA ATCCTCCTAT ATTAAATGAC 3000
CCATTTTTAT TTTTTTCGTT TCCATATAAT CATGATTATG TACCGTTTCT GGTACGATAA 3060
CTTCAATTCT TTCTGCAATA TCAATGCCAT ATTGTTTTAA TCCCTCAAAT TTACTTGGAT 3120
TATTACTTAA TAAATTGATA TGTTCGATGT TAAAATATTT TAAAATCTGT GCAGCAATAT 3180
GATAATCTCG CAAATCTTCA TCAAAACCTA ATGCTAAATT TGCAGTTACT GTATCATATC 3240
CTTGCTCAAT TAATTCATAT GCGCGTAATT TGTTTAACAA TCCTATGCCA CGACCTTCTT 3300
GAGGTAGATA AATAATCATG CCACCATGTT CATTGATATA CTTCATAGAC GATTCAAGTT 3360
GAGCACCACA ATCACAACGT TGACTATGGA AAATATCGCC TGTAAGgCAC GCAGAATGTA 3420
AGCGTACATT TTCATGTTGT CGAATTGCAC CTTTTGTCAG TACAACTATC TCTTCATCTG 3480
TGTATGTCGC TTTAAAACCA TACATATCAA ATGTTCCGAA ATCTGTAGGC ATTTTCACTT 3540
TTGCCTTAAA TTCAATTTCT GGTTCTAATT TTTTACGATA TTCAATTAAA TCATCAATCG 3600
TAATCATCTT TAATTGATGT TTTTCTTTAA ACTTTTGTAA ATCTTGTCCT TTCGCCATCG 3660
TGCCGTCATC ATTCATAATC TCACAAATGA CACCAGCGGG CTTGGCACCA GTAAGTTTAG 3720
CTAAATCAAC AGCCGCTTCT GTGTGTCCAT TTCTAGCTAA TACGCCTTTA TCTTGTGCTA 3780
CTAATGGAAA TAAATGACCA GGACGATTAA AATCTTTAGC TTCACTACTA GGATCAATGA 3840
GCTTTTTGGC AGTCAATGTA CGTTCATAAG CACTAATTCC TGTTGTTGTA TCTACATGAT 3900
CAATACTCAC TGTAAATTGC GTACCAAAGA TGTCGGAGTT ATCATCAACC ATTTGTACCA 3960
AATCCAAACG TTGTGCAATA TCTTTAGACA CTGGTGCGCA TATTAATCCC CtTGCTTCTT 4020
TCGCCATAAA ATTAATGGTA TTATCGTTCA TCCATTCAGT AACCGCTACT AAATCACCTT 4080
CATTTTCACG ATTCTCATCA TCTACTACAA TAATTGGTTC TCCATTTTTT AAAGCCATTA 4140
AAGCACTGTC AATATTATCG AATTGCATGC TACCCCTCCt AAAAACCAAA TGCTCTTAAT 4200
TTATCTACAG ATAATTGGTC TTTATCTTTA TTTAAAATAT TTTCAACATA TTTAAACAAA 4260
CTCGTTTCTG GAATAAGATG AATGTCAAAA CTGTTATCAT GCTTATCAAA TACCGTTAGA 4380
CTAACACCAT CCACAGTAAT AGACCCTTGC TTAACTAACT GATTATTAAT ATGTTGGCTA 4440
CATTGAATCG TAATAATTTT TGCATTGGCT GTTTCATTTA TTTTTGAAAC TGTTCCTAGT 4500
TCATCTACAT GACCGAGGAC AAAATGTCCA CCAAACCTAC CGTTACCACT CATGGCACGC 4560
TCTAAATTTA CTTCTGATTQ TCGCTTAACA TCTGCTAAAT AGGTTTTATT TTCAGTGCCT 4620
TTAATTACTT GAACAGTAAA AGATGTCTGA TTAAAATCAA TCACTGTTAA ACATGCACCA 4680
TTAACACTGA TGGAATCACC AATATGCATA TCTGCCGTAA TCTTATGTGC TTCAATTTCA 4740
ATCGTCCTGA CTGATTGACG AATTTGAACA CTTTTAACGA CACCTATTTC TTCAACGATG 4800
CCAGTAAACA TGCATCATCA CTTCTTTCGT AAAGTTAATT TAACATTTTG ATTTAATAAC 4860
TCGGAATGAA CAATTTCAAA TTGGTTCGCA TCTGGTATCT CAATCACATC ATTTGTTTGA 4920
TAAAATTGAT AATTTCCAGA TCCGCCAATT AATTTCGGGG CATAATAGAG AATAAATTCA 4980
TCTATATAAT TAGATTGGAG AAATTCTGAA GTAGTGGTTG GACCTGCCTC GACTAGCAAA 5040
GTTCCAACTC CTCTTTTATA TAAATTGTGA AGAATTGTTG TTAAATCGCA AGACTTCAAG 5100
TAAATAATTT CAATATGTGT TTGATTGGTT GTTAAATTTG GATTTTCAGT ATATATCCAA 5160
ATTGGTGTTG ATTCATCTTG ATAAATTTGC TGATTAAAAT GAATATTCCC AGACTTAGAC 5220
AATATTACTT TTATAGGG7T TTTTCCATCT TGAATACGTG TAGTATATTG TGGATCATCT 5280
AATTCAACTG TACGTCTTCC AGTTAACACT GCGTCGTGTC GATGTCTTAA CTTATAGACA 5340
TCTTGTTTAA CCTCTTTGTT AGTAATCCAT TGACTTTGTC CATTATCATT CGCTTGTTTA 5400
CCATCTAAAC TTGCAGATAC TTTCACTGTA ATTTGTGGCA GTTGCTTTGC TTTTGCTTTA 5460
AAAAAGTCTT GGTATAATTG TGATGCCCGT TCATCATCAA CGCATTCAAC CTCAATACCG 5520
TGAGCCCGTA ACGTCTCATC ACCATGTGTG TCTAACGAAT TGTCTTTTGT TGCGTATACT 5580
ACTTTTGCTA TCTTACAATC AATTATTTTG TTAACACAGG GTGGTGTTGA ACCAAAATGA 5640
CTACATGGCT CTAACGTAAT ATAAATCGTC GCACCTTCAG CATTTTGTTG TGCCATATCA 5700
AGTGCTTGAA CCTCCGCATG CTTGTCACCT TTTCTCAAGT GTGCACCAAT ACCAACAATC 5760
CTACCTTCTT TAACTACAAC AGCGCCAACG GGTGGATTAA CACCTGTTTG ACCTTGTACC 5820
ATATTTGCAA GTTGAATCGC ATAATCCATA AATTGACTCA AATGATCACC TCTATAAACA 5880
AAAATCCTCA CATCATGAAT TAAGATGCAA GGAGaAAAAT TTATCGTTAA ATAAGCCTAT 5940
TTGTACACAT TTTTACAAAT ACGCTACATT ATCTTTGTCG ATAATTAACA TTCTTTCTCC 6000
CATCCAGACT TTAACTGTCG GCTCTAGAAT CTCACTAGAT CAGCCACTAA TATGAAACAT 6060
TTaTATATGA AATTGTTATA GATTATTTGA GTACGTAGTA TGTCAACTAC ATTTAAAATG 6180

ATACTATATG TTTTCTGAAA AAACAATTAA TGACGGTTTT AATTTAATAT AATCTGAGTA 6240

CTATAGGCAT CTCATTGATA TGATTCTTAC TAACAGACAT TAAAATCAAA CCTTCAATTC 6300 

GTCTCTATAG AGCGTTCTCT TTATTATCTT CTAGTTACAA ATTATTGATT GtCACtGCGC 6360 

TGTTGTTGCT CATTCGATTC TAAAGCATCA TATAATTGAG ATACTGTATG CGCAACTTGT 6420 

TCTACAATCA TTTTCACACC GTTTCGTAGT TTATTAACAC CGTTTGTCAT TTGACCTATC 64 80 

GCAATCATAT TTGTTAATGT TCCAAACCTT GGACTAATAA CTTGATTGGT TTCCGGAATG 654 0 

ATTTGTATGC CTCCCATTGG GTGTGCTTGT ACAATTTGTC TATTTTCAAG ATTTCTAATT 6600 

AATTGATCAT CTTGATCCAA TTCATTTAAA TGACTTTTTG CACCTGTCGC GTTAATGACA 6660

ACATTATATA TGTCTACTGA TTCTTGGTTT TTGTATGAAA AATAATACAA CTTGCCATaC '6720 

ATGTTCACAT CTTCTAAATC TTTTTTCAAA ATTAAAGACT TATTTTCTAT TAATTCAATA 6780 

ATTAGTTCAG CAGTTCTTGG AGGCATTGGA TTTGAATTTA ATTGAATCAT CTTTGAGTAT 6 840 

TTTTGATTAA ATTGATGTTG GTCTTCAATA CTTAAGCTAT TCCATATCCA ATTTAAATTC 6900 

25 TCTTTCAAAT GTTCAATCAT ACTTTGGAAA ATGCCCaTTT CTGTTGGACG CGCTAAATCA 6960 

TACTTCAAAT CTGCAATATG ATTTCCTGTA CGTCTATGTA CTAATTTTTT AAAATCAATG 7020 

TCATATTCAG CACATTCTTT TAAAAATAAA GAAACTAAAG TATCAAGCGG TGCATTGCCG 7080 

30 AAATGATGTT TTTTAATGTC ATTTAATTTG TCTTTAGTTA AGTACTTGAA TGTCACGTCT 7140 

ATCATTGTAC CTCTTACACT TGGTAAATGA GCAGAACGAC TCGTCATAGT AATTGGTAAT 7200 

TTTGGATGAT GAGCAGCAAC ATAACGGACA ACATCTAAAG TGGCAAGGCC TGTACCAATA 7260 

ATCGCAATAT CGTCCAGTTC ATTTACTTCG TCTAACGTAT TATATGTTGG ATAAGGCGTA 7320 

gcGAXATATC CTTTTTTACC CTTTAAGTTA TATGGATCAT GGTAGGCAAA TGTACCACAT 7380 

GTTAAAAATA CATAATCGTA CGCTTGCCAT GATTGTCCTG AATTTGTAGT ACATATGTAA 7440 . 

TAAGTTAAAT TCGTTTCATC GATATTAGAA TTTGTATAAA TCTCTTGAAC TTTATTATAA 7500 

TTAGTTGATA TATTTGGATA TTTTTTCGTG AACATAGATA AATAAGATTT CATATAATGT 7560 

CCGAATACAA ATCTCGGTAA ATATGCAG 7588 
(2) INFORMATION FOR SEQ ID NO: 143:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10320 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 143:





nCTAGGTATT TTAAACCTAA TCTAGATAAA CTAGCTTCGT AAGCAGCTGC TACATTTTCA 60
CGACCGAAAT CCTCAAAATA TAATTTTGAA GTAATAAATA AGTCTTCTCT AGCAATACCA 120
GTTGACTCCA ATCCGGCACG AATGCCAGCA CCTACTTGTT CTTCATTCCC ATAAACTTTT 180
GCGGTATCAA TACTACGATA TCCTTGTTCA ATGGCATACT TAACACTTTC CATGCAATTT 240
TCATCATTTT CCACACGAAA TGTCCCTAAA CCAATTTGTG GCATCGTGTT TCCATTATAA 300
AATGTTTTAA CCTCCATAAA TATCGCCTCA ccrrmxsAT GTATTATACC CTGTTATCAT 360
AACAAATCTG AGTTGAATAC ATGAGAAAAA ACACTTAGAG CAATCAACCA CTAAAATTCT 420
AGTAATATCT CTCAAATATT AATCAAATTG TAAAAGTAAT TCTGTTTAAT TTATGACAAA 480
CTAAAAAAGC CGAAGTAACA ACATATAGTC ATCACTTCAG CCTAACATTT AATTGAATGA 540
TTCAATTTTA TCCATCATTT GTTGTAAGTC TTCCACGTTG TATTGAATAC GACCATGGAA 600
TACAAATTTG TTAAAGAACT CGTCTAATTG TTCAGCACCG ACAAGCACTT TGACAGCACT 660
ATTTTGATTA TAATTTGAAA TCGTTACATC GCCTTCATTT TTAAGATTAA AGTATAAAAT 720
TGAAGTTGGT GTATATTTGG CACCTAATTC TTTTTGTAAG TCTTCAGCCA ATTGTTTAAT 780
CGCCTCAATT TGATCTGAAT AATTTACAAA TGATAATGAA CGTTTGTCAT CATTTTGATC 840
CATCACAATA GTTTGCGGTC TAGATTTATC TAAATCCAAT GTATCAAATA CTTGTTCCAT 900
TGGTGGTAAA TCTTTAAATT GACCGCCACT AATACCATTA TAAACATGAC CTTTTAACAA 960
TTGAGAATCA ATAATATAAA GACCAGTTCT TGTTAATACT AAATGACTAA TTCGTTCAAT 1020
ATTATTAAAG CCATCCTTTG GTAAAAAGAT ATTTGCCATA ATGTGCATAT CTTCTGGTCG 1080
AATTCGTTTT TCTTTAACTA ATCTTTCACG AATACCAATT AATCTCATGT CCGTTACATA 1140
TTCACTATGA TTTTTCGAGA ACAATTTTAA TGCGTCAATC TCACGATCTT TTGTACTAAC 1200
CATGTGATTA TAATCTTCTT GTTGTTTTGT AATTGTCTTT TTATTTTGAA TACGCTCTTT 1260
CTCTAAAGCT TCTTCATGAG ACTTTTTAAT GTTTTGTTCT TGTTGTTCAT ACTTTTCTTC 1320
TGTTTGTCGC TTAACTTTTT TCTTACTACC TAAGGCAACT AAAAAAAGGA CAAAAAAGAT 1380
TAATGCAATG AgCTACTGCA ATAATGAGTC CAATGACTAT CGGTGAAGAT AAATCCATCA 1440
CAACAACGCT CCTTTTTAAT ATATGAATAA CTTTAATTAT AATAGAaAAG CTAAAGATTT 1500
TCGATACATA TTATCATTTA TATACCGAAA ATCTTTTATT TAGCTATATT CAATTCATCT 1560
TATTATTTTA CTGCGTCTTT TAATTCTTCC ACTTTGTCTA ATTTTTCCCA TGGGAATAAG 1620
ACATCTGTAC GTCCAAAATG ACCATAAGCA GCAGTTTGTT TGTAAATCGG TTGTTTCAAA 1680
AGTTGCCCTT CAGAAACTTT ACCTGTTCCA AATGTATCAA TTGCAATTGA CACTGGTTCT 1800

GCAACACCAA TCGCATATGC CAATTGTACT TCACATTGAT CTGCTAAACC TGCTGCAACA 1860

ATATTTTTAG CCACATAACG TGCAGCGTAT GCAGCTGAAC GGTGTACTTT TGTAGGATCC 1920

TTACCACTGA AGCATCCGCC ACCATGACGT GCATAGCCAC CGTACGTATC AACAATGATT 1980

TTACGTCCTG TTAATCCTGC ATCACCTTGA GGTCCACCGA TTACAAAGCG TCCTGTAGGA 2040

TTGATGTAGA ATTTAGTTTG TTCATTAATC AAGTTTTCTG GAACAGTTGG ATAAATGACA 2100

TGTGCTTTAA TGTCTTCTTG AATTTGTTCA AGTGTCACAT CCTCAGCATG TTGTGTTGAT 2160

ACGACAATCG TATCAATACG TACTGGGTTA TCATTTTCAT CATATTCAAC AGTGACCTGA 2220

ACTTTACCGT CTGGTCGTAA ATAATTTAAC GTACCATCTT TACGCACATC TGATAAACGT 2280

TTTGCCAATT GATGTGATAA ATAAATTGCT AGAGGCATAT ACGTCTCTGT TTCATTCGTT 2340

GCGTAACCAA ACATTAAACC TTGGTCACCT GCACCTGTTG CTTCAATTTC TTCTTCGCTA 2400

TCTTTATCAC GATACTCTAA TGCTTTATCC ACGCCTTGTG CAATGTCAGG TGATTGTTCA 2460

TC3ATCGCAG TTAAAATTGC CATTGTTTCA TAATCATAAC CATATTTTGC TCTTGTGTAT 2520

CCAATTTCTT TAATTGTTTC TCTAACAACT TTCGGAATAT CAACATATGT TGTTGTAGAA 2580

ATTTCGCCGG CGATCAATGC CATACCTGTT GTAACAGTTG TTtCACAAGC TACACGTGCA 2640

TTTGGATCGT CTTTTAAAAT AGCATCTAAT ATTGCATCTG ACACTTGGTC AGCGATTTTA 2700

TCTGGGTGTC CTTCTGTAAC AGACTCTGAA GTAAATAATC GTTTGTTATT TAACATAGTT 2760

TGCTCCTTTA AATTTATATT ACGAAAATTC TCTCTCTGTG AGCTAAATAA AAAAGACCTT 2820

CTAACTATTA ATATAGAGAG AAGGCCTAAT ACGTCCATTC GCTCTTATCG TTCAGACCTA 2880

TTTGTCTGCA AAcGGTTTGG CACCTTTCTT TTATAAAAAA GAGGTTGCTG GGTTTCATTG 2940

GGTCCATGTC CCTCCACCAC TCAGGATAAG AGAATCCGTT AAAAATAATA GTACCTAATT 3000

AATGAATTAA TGTCAATTTT TCACAAATAA ATTTACAGTA AAATATTGTA GATTAATTAT 3060

GTTAATGTGT TATACTAATT AAATGTAAAG GCTTACATTT AAATTATCGC TTTGGAGGGA 3120

TTTAGGATGT CAGTAGACAC ATACACTGAA ACAACTAAAA TTGACAAATT ACTGAAAAAA 3180

CCAACGTCAC ATTTTCAACT TTCGACGACA CAACTTTATA ATAAAATCTT AGACAATAAC 3240

GAAGGGGTAT TAACAGAACT TGGTGCTGTT AATGCAAGTA CTGGAAAATA TACTGGTCGT 3300

TCGCCTAAAG ACAAATTTTT TGTCTCTGAA CCTTCATATA GAGATAACAT TGATTGGGGA 3360

GAAATTAATC AACCTATCGA TGAAGAAACT TTCTTGAAGT TATACCATAA AGTACTAGAC 3420

TATTTAGATA AAAAAGATGA ACTATACGTA TTTAAAgGcT ACGCTGGTAG CGATAAAGAT 3480
ATGTTTATTA GACCTGAATC AAAAGAAGAA GCTACAAAGA TTAAACCTAA CTTCACTATC 3600

GTTTCTGCAC CACATTTTAA AGCAGATCCA GAAGTTGATG GTACTAAATC TGAAACCTTT 3660 

GTCATTATTT CATTTAAACA CAAAGTCATT TTAATCGGCG GTACTGAATA CGCTGGTGAA 3720 

ATGAAAAAAG GTATCTTCTC TGTAATGAAT TATCTCTTAC CGATGCAAGA TATTATGAGC 3780 

ATGCATTGCT CAGCAAACGT TGGTGAAAAA GGCGATGTTG CATTATTCTT TGGTCTATCT 3840

GGCACTGGTA AAACAACCTT ATCGGCTGAC CCACACCGTA AACTAATCGG TGATGATGAA 3900 

CACGGCTGGA ATAAAAACGG GGTCTTTAAT ATCGAAGGTG GCTGCTATGC AAAAGCAATT 3960 

AATCTTTCCA AAGAAAAAGA ACCACAGATT TTTGACGCAA TCAAATATGG TGCAATTTTA 4020

GAGAACACTG TAGTTGCAGA AGATGGTTCA GTGGACTTTG AAGACAATCG TTATACAGAA 4080

AACACGCGTG CCGCTTATCC AATTAATCAC ATTGACAATA TTGTAGTACC ATCTAAAGCA 4140 

GCACATCCAA ATACAATTAT TTTCTTAACT GCGGATGCAT TTGGTGTTAT TCCACCGATT 4200 

TCAAAGTTAA ATAAAGACCA AGCAATGTAT CATTTCTTGA GTGGTTTCAC TTCTAAATTA 4260 

GCTGGTACAa GCGTGGTGTG ACAGAACCTG AACCATCATT CTCAACATGT TTCGGAGCAC 4320 

CGTTCTTCCC GTTACACCCT ACTGTTTACG CTGATCTATT AGGTGAACTT ATCGATTTAC 4380

ATGATGTTGA TGTTTATCTT GTTAATACTG GATGGACTGG CGGAAAATAT GGTGTAGGAC 4440 

GTAGAATCAG CTTACATTAC ACACGTCAAA TGGTAAACCA AGCGATTTCT GGCAAATTGA 4500 

AAAATGCAGA ATATACAAAA GATAGTACGT TTGGTTTAAG CATTCCTGTA GAAATTGAAG 4560

ATGTACCGAA AACAATTTTA AATCCAATTA ATGCTTGGAG CGACAAAGAG AAATATAAAG 4620

CACAAGCAGA AGATTTAATT CAACGTTTTG AAAAGAACTT CGAAAAATTT GGTGAAAAAG 4680 

TTGAACATAT TGCTGAAAAA GGTAGCTTCA ACAAATAAAT TTGAATACTA AATCaAAACC 4740

ACC&TGTGA ACGGGTGGTT TGTTCTGCGG CTATAAGCCT TCCTTACTGG CCAGCCCTAA 4800

AAGGGCACTG ACAAGTCAGC CAACTGCACT ACTATTCCAG CAACCCTAAA GGGTTACTCT 4860

TTTTTCTTTC TTTTTTTATT TTTCTCTCCA GTGAAAGGAT CTAAATATTC TTCCATTGAG 4920

ATTTGGTCTG CAACGATATC CTCTTGTAAT TGATTACGAA TATAATTTTC AATCACTTTT 4980

TTATTTCTAC CTACTGTATC CACATAAAAT CCTTTACACC AAAACTTTCT ATTTCCATAT 5040

CTATACTTTA AGTTAGCATG TCTATCAAAT ATCATTAAAC TACTITTTCC TTTTAAATAG 5100 

CCAACAAATG ATGATACCCC AAGTTTGGGT GGTATACTAA CTAACATATG GATATGATCT 5160 

TTACATGCCT CTGCTTCAAT TATCTCTACA CCTTTTCTTT CACATAATTG ACGCAATATA 5220 

ATCCCTATAT CTTTTTTTAT TTTTCCATAT ATCACTTGTC TTCTGTATTT AGGTGCAAAG 5280 
AAATAGCATC TCCTCGTGTT GATTATTTTG GTTGGCTGAC CAATATTTAT TCTAGCACGT 5400

AGAGATGCAT TTTTTGTGAC AATGGTAGAA CCTTTTCtGa ACCATACGCA TAGCGTATGG 5460 

TTTTCTTTTT ACAATTAAAG AGCCAACCGT TGTTATAGTC TAACAATGGT TGGCTCCTCT 5520 

TATTTTATGT GCTAAAAATT TATAGGCAAT TTTATTACAA CAATGTACAT TTAAGGTGAC 5580 

CTTCATGCCA AAATCGCATC ACTCATTTAA TGGAAGCAGC ACGTCTTCAT ATAAAGTACC 5640

GATCCCTAAT TCAACGCATG TAGTACCACA TCTTCAAAGC TTGATAGTTC CCATGCGCAC 5700 

ACCACGTTTC ATACTAGCTA TGCGACTCAA CTTGGTTCAT AAACTCTTTA ATATAAGTCA 5760 

ATGTTTCAAC CATCGCTGGT GGTCTTGGCA CATGTCCTTC TGCCATTTGA TAAAATGTTT 5820 

CATGCGTGGC ACCTTTTAAC TCTAGTTGGT CCGCTAAATA ATACGCATGA TGAATACCAA 5880 

CTTGCTGGTC TTTCCCTCCA TGTACAATTA ATATTGGCGG ACTGTTTTCA TTAATGTTTG 5940

GAATCGCTTG GCGTGCCTCA TATGCCGCTC GATCTTTTTT CGGATGACCA ATCATTCTTC 6000 

GTAGCATGCC TCTTAAATCG ACACGTTCTT CATACATTAA ATCAATATCT GAGACACCAC 6060 

CCCAGATTGT ATAACTTGTT ACTGGTAAGT CTTGAAATGT CAACAATCCT TGTAAACCAC 6120 

CTCGCGAAAA ACCAACCATG TGGATAAATG CATGTGGATA TTTATCATGT AGCAACCTTA 6180

ATAATTGCGT CACATCATTT AAATCGCCAC GGTAAAATTC GTCTTTGCCT TGACTCCCAT 6240

TGTTACCTCG GTAGTATGGC CCAATCACTA AAGTTTGACT ATCTGAAAAT TGCATTAATC 6300 

TACCTGCGCG CACACGTCCT ACTTGACCTT TGCCACCTCG CAAATAAACT ACAATGCGAT 6360 

TTACTTCATG ATGTGGTGTC ATCATTAAAG CTTTTACTTG TAAGTCATCT GACAAATATG 6420

TAATTTCTTC GAATTGATGC GTAAAATATT CAATTGGCAT TCGTTTACGT TTGATAAAAC 6480 

CCAAGTGATT GCACCCTCTC TACGCATTTT AAAATGGTAC TATCTTGCAG TAAGAAACTC 6540

CGTTGTGCGA GTTCAATATC ATTGATACAG TTAAACAACA CTGGCCCTGC TGTTTCTAAA 6600

TAATCGTTCT TGCTTACCAA TGATTCAACT TCGATAAAAT ATACATCTTT TACAAAATCA 6660 

GTTTGATCAT GTGTTTCAAT GGTATATTGT GCTATGTAAT AAATATTTTT AACTTTGGCG 6720 

CCTGTTTCTT CATATAATTC aCGTGTAACT GCTTCAGCAC TACTTTCCCC GCGTTCCCTT 6780 

TTACCACCAG GAAATTCAAT CCCCCGTAAA TTATGTTTGG TAAAAAGCAA TTGATTTTTA 6840

AACGTTGGAA TAGCTAGCAC ATGATTGCCA TCTGCTATCT CATTATCCTT TTTAAATGTC 6900 

AAATTAACTT GACGATTATC TTTATCCCTA AACTTCACGC GCATCACATC CCTACATTGT 6960 


ATGTTAATAT AATAGTTAAT TACTATCGTT GGAGGCATTA ATTATGAAAA AGATATTCTT 7020 

GGCGATGATT CATTTTTATC AACGTTTCAT TTCGCCACTC ACTCCACCAA CTTGTCGTTT 7080 
CCTTTATTTA GGTATCCGTC GTATTTTAAA ATGTCATCCG CTTCATAAAG GCGGCTTTGA 7200

CCCTGTTCCG TTAAAAAAAG ACAAGTCAGC AAGCAAGCAT TCACATAAAC ATAACCATTA 7260

ATATGGTTGT AATTGAGTTA TATCCACTAA AGGGGGGCGA AATTCGAGTC GCCCCTCTTT 7320

TAATATGCCT GAATGCGCCA CCACATCTTG TTCAAAATAA TAACCTGCTG GTGTAACATC 7380

TCCTGGATAA TCACCTTTAC GAGCAAGCAT CGCTGTAAAA TAGCGGCTTA AACCATATTC 7440

GTACATGCCG CCAATAACCA CTTTTGCACC ATGACTTTTC AAAGTATCAA TTGCCGTTTG 7500

CACTTTATCA ATGCCACCTA GACGAAATGG TTTTAATACA ACAACTTTCA CATTGTATAA 7560

TTCTATCAAA TTAATTATGT CCaACAACGA TGTTGCCTTT TCATCAAGGG CTATTGGAGG 7620

TATTGTTCCA TCCGCTACTT CATCAAGCAT GGAGATATCT TTAAATGGCT CTTCGATATA 7680

AAGAACCTGT TCACGCGCTA ATAACTGTAA CTGTGTGAAA TCTTGACGAT CCAAGGACTC 7740

ATTTGCATCT ATAACCAATT GAAAGTGAAA GTCTAATTCC CGTAACACTC TAATTTGATG 7800

CATGATTTGA GGCGTCCATT TTAATTTAAT TCTGGTCGGC TTTGTTGCTT TTAATGACTC 7860

TAGTTGTTTA TTTGATAAGC CGCTGGcTGT CGCTCCATAT GCTACTGAAA ATGAAGGCAG 7920

TACATGAAAC ATTTGATACA ATGCCATGAC AATAGTTGCC CTTGCAGCAG GCGTATTTTC 7980

CAATGAATCT ACTAATTTTA GTGCTGCTTC ATACGTTTCA AATGATTTAT TTCTATTATC 8040

TTCGAACCAT TGCTCAATTA CATGTTTCAC TGAGGCAATT GTTTCATGAT CATACCAATC 8100

TGTTTGAAAA GCGTTACATT CCCCGAAATA TGCATTTCCT TTGTCATCAA TCAATTCGAT 8160

AAACAAACAA TCACGATGCG TTAAAGTGAC TTTCGGTGTT ACAATTTGTG ACTTAAATGG 8220

CTCACTATAT TTATAAAAAT GCAAAGCTGT CAACTTCATC AAATCATCCT CTATACAACT 8280

TATTTCTTTG TAATTTACCT GTTGATGTAT AAGGTAAAGT ATCAACCTTT TCAAAGTGTT 8340

TCGCTACTTT ATATTTCGCT AAATGTTGTG ATAAATATGC AATCAATTGT GCCTTTGAAA 8400

TGTCACTTTC ACTGACAAAA TATAATTTAG GCACTTGGCC CCAAGTATCA TCAGGATGCC 8460

CTACACATAC TGCGTCACTG ATACCTGGAA ATTGctTCGC TACCGTTTCA ATTTGATATG 8520

GATAAATATT TTCACCGCCA CTAATAATTA AATCTTTACG TCGGTCATAA ATCATGACAT 8580

AACCTTCATG ATCTATTTCA GCAATGTCAC CCGTATTAAA ATAACCATTT TCAAACGTAC 8640

CCGTTAAATC TGTTGGATAC AAATATACAT TCATCACATT GGCGCCTTTA ATCATTAATT 8700

CTCCATGACC TTCTTTATTA GGATTTTTAA TTTTTACGTC AACATTGGCA CTTGGCATCC 8760

CTACAGTGTC AGGACGTGCA TGCAACATTT CCGGTGTTGC TGTTAAAAAT TGCGAACATG 8820

TCTCAGTCAT ACCAAATGAA TTATAAATTG GCAGGTTATA TTGTAATGCC GTCTCTATCA 8880



AACCTTGTTG CATAAGCCAA TTTAAAGTTT GTGGCAC^AG CGAAATGTGC GTGATTCGTT 9000 

CATTTTTAAT CATCGTTAAA ATTTGTTCGG CATTGAATTT ATCAACAATG CGCACAGTAA 9060 

AACCTTCAAT AACAGCTCTT AAAAGTACAC TGAGACCCGA AATATGATAA ATCGGCAAGA 9120 

CAGATAGCCA ATTAGTGTCA CGATCAAATC CCAAGCTCTC TTTACATCCG ATTGCACTGG 9180

CATAATGATT ACGAAACGTT TGTGGCACCG CTTTTTGAGG GCCCGTTGTC CCTGATGTAA 9240

ACATAATCGA TGCAATGTCA TCTAAATTAA ATGATGTATT TAATATGTTG GACGGCGACT 9300

CTTTCGGCAC CACAGTTTCA TTCGATGTTT CATATTGGAT ACCCATTGTG TTGTCCAACA 9360 

AACTGTTCGT TGTAATATCC CTTCCAGCGA ATTCAATATC ATCCAGCGAT ACAATTTGAA 9420 

ACCCTCGTAA TTCCAGTGGC AAGGTACAAA AAATCAATTG TACATCGATT GACTTCATCT 9480

GATTCGTCAT CTCATTAGGT GTCAACCTTG TATTAATCAT CGCAATTTCA ATATTTGCCA 9540 

ACCAACATGC ATGTATTAAA ATGATCGATT GAATCGAATT ATCTATGTAT AGCCCAACAC 9600

GAGATTGTTG ATAAGCCTTG AGTCTTTTAG CCAATAGACT CGCTTCACAG TATAAATTTT 9660 

GATAAGTATA AGATTCTTGA CCGTCTGTTA TCGCAATATG ATGTCCATTT TGTTGTGCTT 9720 

GTTTATATAA CCAAAAGTCC ATGCGTTATT CCTCCAAAAT CATTTACATT ATAATTATAA 9780

CGATTTTATG ACATTCTAGC AGTGGTTATG TTTAAAAATA TAAAAAAGTA GACGAATTGA 9840 

TGCATTGATA TGATTGTTAT AATGCTCAAT ACATATCGTT ATATCATTCG TCTACTATTA 9900 

TCAGTTATTT TTATTTAATT TTAGTGTCAT TCTGTCATTT TGATGTGGTG ATTTACCCAT 9960 

TGTTGCCACA TCATCTGCAA TGTCAATTGG TATACGGTTC ATGTCTTGTA ATGCACTTAA 10020 

ATGGAATACT TCATCATCTA AATTTTCAAT GAGATATACA TAATATGTTA CCTTGTCCTT 10080 

TTTATATTTT AACGTTTTCC AAAAGTCCGG CTTGCAATTC AATACATTAT CCGGAATATA 10140 

TTCffATAAAT AAGTAACGTT TGCTGCCTAC TTTGTCTATG AAATATTTTG CAGTGCCTTT 10200 

TTCTATACCT CTTATATGTG CATAGTCTGC TGAAAAGTAA ATACTACCTA TTGTTTCATT 10260 

ATGTTGTTGT ATTTCAAATC GTTGGCCTAC TATTTTATTA TTTGTGCTAC nGGGGACTTA 10320 

(2) INFORMATION FOR SEQ ID NO: 144: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1477 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 144: 
GTGTGGATTG GATTTTAAAA TCACCCTCAT AAATACTGTC ATCAATATGA TAAGTTACAA 120

TTTCACCTAT TATTAAATCA GCCCCATCTA ATACATCTCC AAGCAATATC ATTTGCGmTA 180

GTTTACATTC GAATCTCATT TTCGCATCTT TAATTCCTGG CGTCTTAATC GTTGTAGATG 240

TTAAAAGTGA TAATTCTGTA CGACTCAACT CACTGTCACC ATATGCTAAC GGCGCTGCAG 300

TCTCATTAAT ATCTTGAACA TTATCTTCGT CTGTAATATG CACAACAAAG TCTCCAGTCC 360

GTTCTATATT TAATGCAGTA TCTTTTCTCT TACCTCCTGC ACGTTGAACT GGAATAGCAA 420

TCATTGGCGG ATGATTATTA ACAATATTAA AAAAGCTAAA TGGTGCTGCA TTTACTGATG 480

CATCTTGATT TAATGTTGTA ACAAAAGCTA TAGGTCGTGG AATAATTGAA CCAATTAATA 540

ATTTATAGTT TTCTCTAGCA GTTAATGATT GTGCATCAAA CGTATACATA ATACCTACCT 600

CTTTTCTAAG TATATCTAGG TATTTCTCCG ATTTTGGTTA ATTTAAACAT CTATTCTCCT 660

CTGAAAATCA CTTGTATTTA TTTAGCAAAT CTTTTGAAAT ATGACACATA TGCATATCTT 720

CTGGATATTT TTCTAAATGT TGCTGATGTT CTTCAGCACT TTTAATGTAG TTAGACAGCG 780

GTAAGACTTC CACTGCAATT TGATCTCTGT CTTTACGTCG TTCAATGAAC TGACGCGCTT 840

CAATTAAGTG GTCATCTACA CAACTATATA AACCCGTTCG ATACTTTTGT CCAATATCAT 900

TTCCTTGTTG ATTCACACTG TAAGGATCAA TGATTTCAAA TAAATAATTC ATAATGTCTG 960

TAATTGTTAA CATACGATCA TCGAAATGAA GTTTGACACA TTCAGCATAA CCATCATACG 1020

GACCGTCTAA TTTAGAGCTT CTTCCATTTG CTCTTCCTGC TTCTGTATGT ATAATTCCAG 1080

GTATTGTTGC AAAAAATGCT TCAACACCCC ATAAACATCC TCCTGCTACA TAAACAACTG 1140

CCATATTTAC ACCTCATCAT CCTTTTTTAT ATTTTTAACA AGGTTATACC ATTTAATACC 1200

GCCATGACAT GATTCTGATA CACCTTCATT ACGATACCCA TATTTTTCAT AAAATGAAAT 1260

TAATGATTCT CGACATGTTA ACGTTACACC ATGTCGATGA TGATTCTTAG CAAGAGTTTC 1320

AAAATAGTTT AGTAAGCGAC CTGCAATACC CTGACCTTGA TAATTTGGTG CTACAACAAG 1380

ACCTAACACA CTAATATAGC CACCTTCACT ATTATTTGTG GAGACATTTT TAAATAAATC 1440

ATCGCTAATG TAACGCTCTT TTATGACTGG ACCGTTG 1477


(2) INFORMATION FOR SEQ ID NO: 145: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3976 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
AGGTGATTAT CCTAAAAATG CTCATGAGGT CGCTATTAAT GATAAGTTAG CTGCAGACAA 60

CATTAGAGTC GGGGATAGAT TACATTTTAA AAATAATTCA ACTAGTTATA GAGTTTCTGG 120

TATTTTAAAC GACACAATGT ATGCGCATAG TTCCATTGTG CTATTGAACG ATAACGGATT 180

TAATGCATTG AATAAGGTTA ATACGGCATT TTATCCAGTG AAAAATTTAA CACAACAACA 240

ACGTGATGAG CTTAATAAAA TAAATGACGT TCAAGTTGTG AGTGAAAAAG ATTTAACAGG 300

TAATATTGCG AGTTATCAAG CAGAGCAAGC ACCGTTAAAT ATGATGATTG TTAGTTTGTT 360

TGCTATTACA GCAATCGTTC TAAGTGCATT TTTCTATGTT ATGACGATTC AAAAAATATC 420

ACAAATTGGC ATTTTGAAAG CAATTGGTAT TAAGAGAAGA CATTTATTGA GTGCGTTAGT 480

TTTACAAATT TTAACACTAA CAATAATTGG GGTAGGTATT GCTGTGATCA TCATAGTAGG 540

ACTATCATTT ATGATGCCGG TAACGATGCC TTTTTACTTA ACAACGCAAA ATATTTTATT 600

AATGGTGGGG ATATTTATAT TAGTAGCGAT TTTAGGTGCC TCACTATCAT TTATCAAATT 660

ATTTAAAGTG GATCCTATCG AAGCAATTGG AGGTGCAGAA TAATGGCATT AGTCGTTGAA 720

GATATCGTCA AAAATTTCGG AGAAGGTTTG TCTGAAACAA AAGTTTTAAA AGGTATTAAT 780

TTTGAAGTGG AACAAGGGGA ATTTGTCATT TTAAATGGTG CCTCTGGTTC TGGGAAAACA 840

ACATTGCTAA CGATATTAGG CGGATTGTTA AG7CAAACGA GTGGTACAGT GCTTTACAAT 900

GATGCGCCAT TGTTTGATAA ACAGCATCGT CCTAGTGATT TACGATTGGA AGATATTGGT 960

TTTATTTTTC AATCTTCACA TTTAGTTCCT TATTTAAAAG TGATAGAGCA ATTGACACTC 1020

GTAGGTCAAG AAGCGGGAAT GACCAAACAA CAAAGTTCAA CAAGAGCAAT ACAACTTTTG 1080

AAAAATATTG GTTTAGAAGA TCGCTTGAAT GTATATCCGC ATCAGTTATC TGGCGGTGAA 1140

AAGCAACGTG TTGCGATTAT GAGAGCATTT ATGAATAATC CGAAAATCAT TTTAGCAGAT 1200

GAGGCCACAG CAAGTTTAGA TGCCGATAGA GCAACAAAAG TTGTTGAGAT GATACGTCAA 1260

CAAATTAAAG AACAACAAAT GATTGGTATT ATGATTACAC ACGATCGAAG ATTATTTGAA 1320

TATGCAGATC GAGTGATTGA ATTAGAAGAT GGCAAAATAA CTGATTAGTG GCTTGTAAAG 1380

ACGCTAAATG TTAATGATTT AAGACATAGT AGTATAAAAG TTAGATAACA GAATACGATT 1440

TGGGTTTACA AAAAACAGGC TGGGACATTA AGTTCTTAGG CAATGTAAAA AAGCTGATTT 1500

CTATTAATTA TTTGATAGAA ATCAGCTTTT TTGATATGTA TTTTATAATG TACAGCTCGT 1560

TGCATTCATA TAGCTTGAAG TCACGTTTAA AACCATATCT ATCATTATGG TATGCATATC 1620

TTTTAAAACC TATTCTTTTG TTATTAGGAC ATATAAATTC ATCATTAAGT TCGTCATATT 1680

TCCAATTTTG AGTGTTAAAA ATGTCACTTT TAAACTTTCT AGTTTTATCT TTAATAAACA 1740
CACTATCATA ACATGCATCA GCTACAATAT ACTCCGGTAA ATAACCGAAG nTATTTTgAA 1860

TCATTGTTAA AAATGGAATT AAAGTTCTAG TATCTGTTGG GTTTTGAAAT AGGTCATAGG 1920

ATAAAACAAA TTGAGAATTT GTCGCTATTT GTAAATTGTA TCCTGGCTTA AGTTGGCCAA 1980

AGTGTCTTAT TTTTTTAAAG TATTTAAAAG TAAAATTACA TGTTAATACG TAGTATTAAT 2040

GGCGAGACTC CTGAGGGAGC AGTGCCAGTC GAAGaCAGGG GCCCCAACAC AGAArcTGAC 2100

ATATAGTCAG CTTACAACAA TGTGCCGGTT GGGGTGGCTG AGACGGCACC CTAGGAAGGG 2160

ACCCGTCATC AAAAATTCTA TTTATAGAAT TTTACAGTAA TGTGCCAGAT GGGCATAGCG 2220

AAgcCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC AGTAAGATAT TTTCTAATTG 2280

AAAATTATTT TACTGCTGTT TTTTTTAGGG ATTAATGTCC CAGACTCTTT AGTTTATTTA 2340

TTTTCAATAT AACAATTGTC TAATCAAGGA TTAACGAATA TTTAAAGATA GTTTGACGCA 2400

ATATTAGAAA CAACCTATAA TAATAGTTTG TTTGTGGATT AACTATTATA AATAAAAGCG 2460

GCGTAAAGAC ATATAAACCA ACTACTTGAA CAATATAACG TTAATAACAA TCTATACTGA 2520

TACATTACGC CTAGATAATC TTTGATGAGC ACATGTAAGA AAAAGTGATA TGGTGTATGA 2580

CTTCCGACAC CATCGATAGA TAAACCTAAT TTTTGGGCTA GTCGTAAGGC GCGCAATACA 2640

TGAAACTGAC TTGTtACACA AACAATTTTA ACTGCTTCAT GATACAAATT GTTGATGATT 2700

TGTTTAGAAT ATAAAAAGTT TGTGTATGTA TTTATAGAGT GAGATTCCAT TAGTATATCT 2760

GTTTTATCAA CACCATGTGC AATCAAATAA CGTTGCATAG CTAAAGCTTC AGAAATTGGT 2820

TCGTCTGGTC CTTGTCCGCC AGATACAATG ATCTTTGTTG CTGATGCTTG TTGTTGATAG 2880

ATATCAAGTG CACGATCTAA ACGCGCTGCA AGCATTGGTG TGACAAATTC GGTAAAAATA 2940

CCAGCACCTA ACACAATTAT GATATCAACT TCTTTGTTGT ATGATCTATG TCTATATGAT 3000

ACTGTCCAAA CGAGATAACA AATAAAGGTT AGTAACAGGG AAAGACATAA TATAGCTAAC 3060

CACATAGACA AACCTTTCAC AATAGGTGAC TGAATCGTAC TTATAAATAG AAGTGCTGAT 3120

GTGTAGAGTA CAAATTTATA TGAAAAAGAT AATAATTTTT TAATAAATAA GCGACTAGAA 3180

GTATGAGAAA ATAAATATCT ATGTTTGAAT AGCATGATAA TACTGATTAT TATAAATGTT 3240

ACAAACATAG ACCAAGGGAA AGTATAGGTC ATGATGCTAT AGATGAGTGA CAAAAATATC 3300

GATATGACAA CTAAGATGTA GCATGTTAAA TTTAACGTCA GAGTATAGTT GAAAATTAAC 3360

GGACAAATAA CGATAAGTAT AAATATTAAT AATAAATTCA ATAACATACT GACACCTCGC 3420

TTATAATAAA TATTAAATAT AAATGTAGAT GATTTAATTT ATTAAAGCAA GGAGAAAGCA 3480

GCAACATGTA AATCTTAATT TGTTATATTA TATATGGGTC AATATTTTTG TGTTTTTTAG 3540
TATGGTAAAA CATTTACAAG ACCATATTCA ATTTTTAGAG CAGTTTATAA ATAACGTTAA 3660

CGCATTAACT GCAAAAATGT TGAAAGATTT ACAAAATGAA TATGAAATTT CATTAGAGCA 3720 

GTCTAACGTA TTAGGTATGT TAAATAAAGA ACCTTTGACA ATTAGTGAAA TCACGCAAAG 3780

ACAAGGTGTA AATAAGGCCG CAGTAAGCCG ACGAATTAAA AAGTTAATCG ATGCTTAATT 3840

AGTTAAGTTA GATAAACCAA ATTTAAATAT TGATCAACGT TTGAAATTCA TAACCTTAAC 3900

TGACAAAGGT AgAGCATATT TGAAAGAACG TAATGCGATT ATGACAGATA TTGCGCAAGA 3960

TATTACTAAT GATTTA 3976 
(2) INFORMATION FOR SEQ ID NO: 146: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 146:



GCTACCTAGG CATTTAAGAG ATCAAAAAAT GTATGAATAT GAACGTTATT TTTATGAGCA 60

AGAACTTAAT GGCGTTGATG aAGGGGAAAT TTTAAAGAAG TTAAAAGACC CACAAGATGT 120

TGCAGCTGAA ACAAAAGCTA GAAGTGTTAT TGATTATGCT GAATCTAAAC CAACATTTGA 180

AAATATTTCA AGAGCTGTTG CTGCTTCATT AAGTTTAGGC ATTCTATCTA TTTTTGTCAT 240

CCTTATACCA GTATCTATAG TTGGATTATT TGTATTAGCA TTATTTTTAA TATCACTTTT 300

GCTGCTGTTT TGTCCAATTA TTTTATTAGC ATCAGCAATA TCCAGAGGAA TTGTGGACTC 360

AATTAGTAAT GTAmTrixs CCATATCATA TTCAGGATTA GGATTAGTAT TTATCATTGT 420

CATASTTAAG ATTTTAGAAT ACATTTATCG TTTAATCTTA AAATATTTAC TTTGGTATAT 480

TAAAACTGTC AAAGGAAGCG TTAGAAAATG AAGAAATTCT TTTTTATTGG GCTTTTAGTG 540

TTTGTTGTCT TTTTTACAGC AGCAACCATT ATTTGGTTCA GCTATGATAA AAACAAATAT 600

GGTACTAAAC AATATGATAA AACATTCAAA gACGATGCTT TTGACAATGT ATCTATAAAT 660

TTGGATAGTA CAGAACTTCG TATAAAACGG GGGAATCAAT TTAGAGTTAA ATATGATGGT 720

GACAATGATA TATTAATTAA TATAGTAGAT AAGACGTTGA AGATTAGTGA TAAAAGGTCT 780

AAGACAAGAG GATATGCAAT TGATATGAAT CCTTTTCATG AGAATAAGAA AACGTTAACG 840

ATTGAAATGC CTGATAAAAT GATTAAACGT TTAAATCTAT CATCTGGAGC AGGAAGTGTT 900

AGAATCAGTG ATGTTGATTT AGAGAACACA AGTATTCAAA GCATTAACGG TGAAGTAGTT 960
AGTAAAAGTA ACATTAAAAA TAGCAATATT AAAGTTGTTA TTGGTACGCT ACAAATCGAC 1080

AAGAGTCAAA TTAAACAATC CATATTTTTA AACGATCATG GTGACATTGA ATTTAAAAAC 1140

ATGCCATCAA AAGTAGATGC AAAAGCTTCT ACTAAACAAG GAGATATTCG TTTTAAGTAT 1200

GATAGTAAAC CTGAAGACAC TATACTAAAG CTAAATCCGG GAACGGGTGA TAGCGTAGTT 1260

AAAAATAAAA CATTTACTAA TGGtAAAGTT GGGAAAAGCG ACAATGTTTT AGAATTTTAT 1320

ACGATTGATG GTAATATCAA AGTTGAATAA ATAAAGGATG TAAGCACCGA TATTAGGAAG 1380

CATAATTTCT CTAATATCGG TGTTATTTAT TTGTTGGCAA AAGTTAAGTC GGTATCTATA 1440

TTGCCAGTAA AGTGAGTGAT ATTAAGGTCT TGACCATCTA ACCATGATTT GAAATCTATT 1500

ATTTCTGGTG GCGCATTTTC TCCCAATGTA AAATATGCAG TTAATGTTTC AGGTTGATAC 1560

ATTGATGTAT GGATGGTGCC AGACCAGCTT TTGAATAGTT TACTGTAAAT TTCATACTGA 1620

GGATTATTGA ATAACTTAAA TGCTGTAGTC ATATCTAAAT TATCATTAGT TTGTGAAATG 1680

GTACGCGCCA GTCTTTCTTT AGATTCTTTT GTATAATTAC GATTTTCATG TGTTAATATT 1740

TCAAAATGAT TTGTACATAT ATTATCATAA CGAACATCTA TTGATCTCGG TGTCACTTCA 1800

ACAATTGCAT GGTTCAATGA TTTGTCCATC AGTATGTAGC TAAATGAGCT TCTGTGTGGT 1860

ATTTCTTTCA ATAATTGGAT TGCTTCTGTT ACATTTCGGC AATTTTCAAG AATTAGACGA 1920

CCAATCATAT AACATACAAA ACCATTTGCT GGTTTCTTCC GGTGCATAAA GTTATAGCCC 1980

ATAGTTAATC CTGACTCATT CATACCATCC ATTCTTCCAG TTACCCTTGA TACAGGACCA 2040

ATTTGAGCTA AACCGCTATC TGTAGGTTGA TAAAGTAAGT AGCGACCATC ATAAGTTGCA 2100

GGGTGGTAAT CATAATTTCT AACCATGAAG TCTTTGCCTT GAAAGACCGT GCAaCCACTT 2160

TCTTTTAAAT CGGTAAAACG ATAATGTCCA AAGTTTAAAA TAATTTGGCG TGTTGGCATT 2220

TTGAGTATAC TTTGTAGTCC CATTAATTCT TCCCATATTT GAGGTGCGTA TGTTTGGAAT 2280

ATTTGATAAG TTTCATTTAC ATCTATATCG AAACGTGGGA CaCnTTTTTT CCATTCTTTT 2340

TCTCGATTTT TTAGAAGAGG TGTTTGTTGA AGCCATTTAC CAGTTTTAAC ACCTAACTCG 2400

AAATGTGAAC CTCTAAAAGT CATGATATCT GATGTCACTT GTTGCATATC ATCGGCCCCT 2460

TTCTTTTTAG TTGTAATATA TTGTAAATAA ATAGTAATCG TATGTATATT GAATGTCATG 2520

TTAAATAAAG TTATATTTTA CTAAATGAAA TATAAAATTG TTTGAGGTGA TTTCTCGGTG 2580

TATAAGACTT ATCAATCAGT TAAAACATAT TTTTATAGAT GGTGGGGATA TTGAGTTAAA 2640

AACTTAAAAT CATCTTATCA TAAATATCAA TCTTAAGTTA GCATTCACGA TAATAGTCAT 2700

TGTTAACATT AGCATATAAG GTCATGTCAC GTTGAAACAG AGGTTCCTCG GCATTTTTGA 2760
TTATTTAATG ATTATTCTAT ATATGATAGT ATAATGAAAT GTAGATAGGT ATTTAATTTA 2880

ACAGAGGTGA AATTGAGATG TGGAATTTTA TTAAATGtGT GkTTAAATTC GTATTTAGCT 2940

TAGTTGCTAT TACAACATTA GTTGCTGGTG TTGGTGTAGT AGCATTTGCT TATATCTTTA 3000

AAAAAGATTT TGAAGATATT GAAAGAAAAA CTAAAGAAAT TATTTCTGAT ATTGAAAGTA 3060

AAAATAACTA ATAACATTTA GAGGCTGGGA CATAAATCCC TAAAAAACAG CAGTAAGATA 3120

ATTTTCAATT AGAAAATATC TTACTGCTGT TCTCTATTTn ATcAmTACTt CGTATTGAAT 3180

GGCTTCGCTT TCCTAGGGTG CCGTCTCAGC CTTGGTCTTC GACTGGCACT GCTCCCTCAG 3240

GAGTCTCGCC ATTAATACTA CGTATTAACA TGTAATTTTA CTTTGGAAAT ACTTTTAAAA 3300

AATAAGACAC TTTGGCCCAA CTTGGCACAT AAATGTAAAA TTCAAT 3346

(2) INFORMATION FOR SEQ ID NO: 147: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 147: 

GTTGAAGAAA GAAATATAAC AGTCAATTAT AATTATAACC TTGTTGAAAT CGACGGTGAC 60

AAAAAAGTGG CTACATTCGA ACATATCAAA GCATACGATA GAAAAACAAT AAGTTATGAT 120

ATGTTACATG TAACACCACC TATGGGTCCC TTAGATGTAG TAAAAGAAAG TACACTTTCA 180

GATAGTGAGG GTTGGGTAGA TGTTAACCCA ACCACATTAC AGCATAAAAG CTACTCTAAT 240

GTATTTGCAC TTGGTGATGC TTCAAATGTA CCTACTTCAA AAACAGGCGC ACTATTcGTA 300

AGCAAGCACC TATCGTCGCT AATAATTTAT TGCAAGTGAT GAATAATCAA ATGTTAACGC 360

ATCATTATGA TGGTTATACT TCATGCCCTA TTGTTACTGG ATATAATAGG TTAATACTTG 420

CAGAGTTTGA TTATAATAAA AATACTAAAG AAACAATGCC GTTTAATCAG GCCAAAGAAC 480

GTaGAAGTAT GTATATATTT AAGAAAGATT TATTACCTAA AATGTATTGG TACGGCATGC 540

TAAAAGGATT AATATAATAA AGTACAGAAA ACAATAAATT TTTAATGAAA AATCTTTTAC 600

TATAAAAGAT TAAGTATTTA AATGACGTGT CAGTGTTGTG TTTATATGTC GTGAATTTTT 660

AGCTCTAAAT AGTATAAGAT TGAAAAAGTT GTTACTGTTT TAAATGATCA CGATGAAGTC 720

ATTCAATAAG AATGATTATG AAAATAGAAA CAGCAGTAAG ATATTTTCTA ATTGAAAATC 780

ATCTCACTGC TGTTTTTTAA AGGTTTATAC CTCATCCTCT AAATTATTTA AAAATAATTA 840

AGATATTCAA ACCACGTGTA CTCAAAATGA TAGCTTGGTA TGTACCTCCA ATAGTAATTT 960

CAATAACTTT GTCTGTTGAA CACTAAGAGC AATTTTAATT TCATAATGTG TTGTAAACAT 1020

TTTTTTTGAT TGGAGTTTTT TTCTGAGTTA AACGATATCC TGATGTATTT TTAATTTTGC 1080

ACCATTTCCA AAAGGATAAG TGACATAAGT AAAAAGGCAT CATCGGGAGT TATCCTATCA 1140

GGAAAACCAA GATAATACCT AAGTAGAAAG TGTTCAATCC GTGTTAAATT GGGAAATATC 1200 


ATGCATAAAC TTTATTACTC ATACTATAAT TCAATTTTAA CGTCTTCGTC CATTTGGGCT 1260 

TCAAATTCAT CGAGTAGTGC TCGTGCTTCT GCAATTGATT GTGTGTTCAT CAATTGATGT 1320 

CGAAGTTCGC TAGCGCCTCT TATGCCACGC ACATAGATTT TAAAGAATCT ACGCAArCTC 1380 

TTGAATTGTC GTATTTCATC TTTyTCATAT TTGTTAAACA ATGATArATG CAATCTCAAy 1440 

ArATCTAATA GTTCyTTGCT TGTGTGTTCG CGTGGTTCTT TTTCAAAAGT GAATGGATTG 1500 

TGGAAAATGC CTCTACCAAT CATGATGCCA TCAATACCAT ATTTTTCTGC AAGTTCAAGT 1560

CCTGTTTTTC TATCGGGAAT ATCATCGTTA ATTGTTAACA ATGTGTTTGG TGCAATTTCG 1620

TCACGTAAAT TTTTAATAGC TTCGATTAAT TCCCAATGTG CATCTACTTT ACTCATGCGT 1680

TTGATAAAAA CTTAAATAAT ATTAATTCGG TCATCAGTGG CGTTAAATCT TTTATCATTT 1740

TTAGTTATAG TTGATAAATT TATATTTATA AGCATATATG GATATTTCAT CAAAAATTTT 1800 

TATTTATATA AATCCGAACT GCATACATAT TTGTTTAAAT AAGAGGTATT ATTTTTCGGG 1860 


AAATTGCTGT CTGAGTTAAA AGGATTAGTT TTATAAAATG AGTTGAACTA TAGCCAAAAA 1920 

CGATTAAAAT ACTGATAATC CATTTTTGtA TTATGTTAGG GACTTTTTTA CTTAATTTTA 1980 

ACCCTATTGG aGCmAATATA ATACTCCCTA TTATAAGGAA TAAGGCGTCA TATAAaGGGA 2040 


TATAACCTTG AATAAGTTTG ATGACAAAAG CACCAATTGA AGATATAAAA GCAATTACTA 2100 

TACfATTAGC GACTACAGTA TTCATTGGTA ATTTGAATAA AACCAATAAT ATAGGAATAA 2160 

TAATGAAGGC ACCACCTGCA CCTACTATAC CTGAAATAAT ACCAATGAAA AGGCCAATGA 2220

TAACTAATAA ATATTTATTA AATGAAGACT TTTCGGAACT AGGTTtCACT TTAATAAACA 2280 

TTAATGTTAA TGCAAGTAAA GCAATAATGA TATATACCGT ATTTACAAAT GTAGCATCAA 2340 

ATAAATTTGC TAGAAATGCA CCTAACATAC TCCCT 2375

(2) INFORMATION FOR SEQ ID NO: 148:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6115 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 148:





GAGGTTTCTA GACAAGCTTT TAATAACTTA CCAAACTCAT TAAgrTGGTT gTGtTGGACT 60

GCCtATTATC TnAAGtATTAT GaGTTGTTTA ATATTAGtGC TAArACATAC GAAGAGTGGT 120

TTAAACAATT TAGTAGTAAG AAAGCACAAT TCAGTATTAA TCTCACGGAT AAATGGATAA 180

TTCAAATCGC ATATGGTAAA TTAATAATAA TGGCTAAAAA TAATGGCGAT ACATATTTTA 240

GAGTTCAAAC AATTAAAAAG CCAGGTAATT ATATTTTTAA CAAATATCGA TTAGAGATAC 300

ATTCTAATTT ACCAAAATGT TTATTTCCGC TTACAGTGAG AACACGACAA AGTGGCGATA 360

CATTTAAACT GAATGGGCGC GATGGTTATA AGAAAGTGAA TCGCCTGTTT ATAGATTGTA 420

AAGTGCCACA GTGGGTTCGG GATCAAATGC CAATCGTATT GGATAAACAA CAGCGCATTA 480

TTGCGGTAGG AGATTTATAT CAACAACAAA CAATAAAAAA ATGGATTATA ATTAGTAAAA 540

ATGGAGATGA ATAGCGTTAT GCATAATGAT TTGAAAGAAG TATTGTTAAC TGAAGAAGAT 600

ATTCAAAATA TCTGTAAGGA ATTGGGAGCA CAATTAACAA AGGATTATCA AGGTAAACCA 660

TTAGTATGCG TGGGTATCTT AAAAGGCTCA GCAATGTTTA TGTCAGATTT AATTAAACGA 720

ATTGATACCC ATTTATCAAT TGATTTCATG GATGTTTCTA GTTATCACGG AGGCACTGAG 780

TCAACTGGTG AAGTTCAAAT CATTAAAGAT TTAGGTTCTT CTATTGAAAA TAAAGACGTA 840

TTAATTATTG AAGATATCTT AGAGACTGGT ACTACACTTA AGTCAATTAC TGAATTATTA 900

CAATCTAGAA AAGTTAATTC ATTAGAAATA GTTACTTTAT TAGATAAACC AAACCGTCGT 960

AAAGCGGACA TTGAAGCTAA GTATGTAGGT AAAAAAATAC CAGATGaATT TGTTGTTGGt 1020

TACGGTTTAG ATTATCGTGA ATTATACCGA AACTTACCAT ATATCGGTAC GTTAAAACCT 1080

GAAGTGTATT CAAATTAATT TTTTAATCAA TTTCAGTTAT TATTACTATG CGTTTGAGAA 1140

ATAATAGTGT AGACTCAAAA ATATGAAAAA TGTATTTCAT ATATATTTAA TTTTAGACAA 1200

GACATATGTC TTGAAAAGTT GAAAAATATA GAGATTGATA AAACTAATAC GGGTGTGAAT 1260

GACATTGATG TTAAGCTCAA TTACTAGCTT ATAAAACATG TCATATGTTA CAATTTTTGT 1320

TAGTTTTATT ATGGGAAGTA GGAGGAAATG ACGCATGCAG AAAGCTTTTC GCAATGTGCT 1380

AGTTATCGTA ATAATAGGCG TTATTATTTT TGGTCTATTT TCATATTTAA ACGGTAATGG 1440

AAATATGCCG AAACAGCTTA CATATAATCA ATTTACTGAG AAGTTGGAAA AAGGTGACCT 1500

TAAAACTTTA GAAATCCAAC CACAACAAAA TGTCTATATG GTAAGTGGTA AAACGAAAAA 1560

TGATGAAGAC TATTCATCAA CTATTTTATA TAACAACGAA AAAGAATTAC AAAAAATTAC 1620

TGATGCTGCT AAAAAGCAAA ACGGTGTAAA ATTAACGATT AAAGAAGAAG AAAAACAAAG 1680
TTTCTTCCTA AGCCAAGCAC AAGGTGGCGG TAGTGGCGGT CGTATGATGA ACTTTGGTAA 1800

ATCTAAAGCA AAAATGTACG ATAATAATAA ACGTCGTGTT CGTTTCTCTG ATGTAGCAGG 1860 

GGCAGATGAA GAAAAACAAG AATTAATTGA AATTGTTGAT TTCTTGAAAG ATAATAAAAA 1920

ATTCAAAGAA ATGGGATCTA GGATTCCTAA AGGTGTCTTA CTTGTTGGAC CTCCAGGTAC 1980

TGGTAAAACA TTACTTGCTA GAGCGGTTGC AGGTGAAGCT GGCGCACCAT TCTTCTCTAT 2040

TAGTGGTTCA GACTTTGTAG AGATGTTTGT TGGTGTTGGT GCGAGCCGTG TTCGTGACTT 2100 

ATTCGATAAT GCTAAGAAAA ACGCGCCTTG TATCATCTTT ATCGATGAGA TTGATGCTGT 2160 

TGGTCGTCAA CGTGGTGCAG GTGTTGGTGG CGGTCATGAT GAACGTGAAC AAACCCTAAA 2220 

CCAATTATTA GTTGAAATGG ATGGTTTCGG TGAAAATGAA GGTATCATTA TGATAGCTGC 2280 

TACAAACCGT CCTGATATCC TTGACCCAGC CTTATTACGT CCAGGTCGTT TTGATAGACA 2340

AATTCAAGTT GGTCGTCCAG ATGTGAAAGG CCGTGAAGCA ATTCTTCATG TTCATGCTAA 2400

AAACAAACCA CTTGATGAAA CGGTTGATTT AAAAGCAATT TCACAACGTA CACCTGGTTT 2460 

CTCAGGTGCT GATTTAGAGA ACTTATTAAA TGAAGCATCT TTAATTGCTG TACGTGAAGG 2520 

TAAAAAGAAA ATTGACATGA GAGATATCGA AGAGGCAACG GATAGAGTTA TAGCCGGACC 2580

TGCTAAGAAA TCTCGAGTTA TTTCTAAGAA AGAACGTAAT ATTGTTGCTC ATCACGAAGC 2640

TGGTCATACA ATTATCGGTA TGGTACTTGA TGAGGCAGAA GTAGTGCATA AAGTTACTAT 2700 

TGTTCCACGT GGACAAGCAG GTGGTTATGC AATGATGCTA CCTAAACAAG ATCGTTTCTT 2760 

AATGACTGAA CAAGAGTTAT TAGATAAAAT CTGTGGTTTA CTTGGTGGAC G7GTATCAGA 2820 

AGATATTAAC TTTAACGAAG TATCAACAGG TGCTTCAAAT GACTTCGAAC GTGCAACACA 2880 

AATCGCACGC TCAATGGTTA CGCAATATGG TATGAGTAAA AAATTAGGAC CATTACAGTT 2940

CGGYCATAGC AATGGTCAAG TATTCTTAGG TAAAGATATG CAAGGTGAGC CTAATTATTC 3000 

AAGCCAAATC GCATATGAAA TTGATAAAGA AGTTCAACGA ATCGTTAAAG AACAATACGA 3060 

ACGTTGTAAA CAAATTTTAT TAGAGCACAA AGAACAATTA ATTTTAATTG CTGAAACATT 3120

ATTAACAGAA GAAACATTAG TTGCTGAACA AATTCAATCA TTATTCTACG AAGGTAAATT 3180 

ACCTGAAATT GATTATGATG CAGCTAAAGT TGTTAAAGAT GAAGATTCTG AATTTAATGA 3240

TGGTAAATTC GGTAAATCTT ATGAAGAGAT TCGTAAAGAG CAATTAGAAG ATGGACAACG 3300 

TGACGAAAGT GAAGATCGTA AAGAAGAAAA AGATATTGCT GAGGATAAAA AAGAAGCTGA 3360 

TAAATCTGAT GAAAAAGATG AACCAGCACA TCGACAAGCC CCAAATATCG AAAAACCTTA 3420

CGATCCAAAT CACCCAGACA ATAAATAATC GATTATATTC AGTACCTCTT TCTATGATAA 3480
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AATTGTTATA GCAGAAAATA ATTGTAAAAC AAGTTACTTC ATTATTTAGA ATGATGGGTG 


3600 




TAGAATAAGT ACAATTGTTG CATTTTATGA AGTAAAGTAA TTTTTTAAAT ATAGAGTAAT 3660

AGAGGAGATT GAAATAATGA CACACGATTA TATTGTTAAA GCATTAGCAT TTGATGGAGA 3720

GATTAGGGCT TATGCTGCTT TGACAACTGA AACTGTTCAA GAAGCACAAA CGAGACATTA 3780

TACATGGCCG ACAGCATCTG CTGCAATGGG AAGAACAATG caCAGCAACA GCTATGATGG 3840

GCGCAATGTT GAAAGGTGAT CAAAAATTAA CTGTCACTGT AGATGGCCAA GGACCTATTG 3900

GACGAATTAT TGCCGATGCA AATGCTAAAG GCGAGGTGCG TGCTTATGTA GACCATCCAC 3960

AAACTCATTT TCCATTAAAT GAGCAAGGTA AACTTGATGT AAGACGAGCG GTAGGGACAA 4020

ATGGATCTAT TATGGTTGTT AAAGACGTTG GAATGAAAGA CTATTTCtCT GGAGCAAGTC 4080

CaATTGTTTC AGGAGAACTT GGTGAAGATT TTACTTATTA TTATGCTACA AGTGAACAAA 4140

CACCTTCATC GGTAGGTCTT GGTGTATTGG TAAATCCTGA TAATACGATT AAAGCAGCAG 4200

GAGGATTTAT CATTCAAGTT ATGCCAGGTG CCAAAGATGA AACAATTTCA AAATTAGAAA 4260

AAGCAATTAG TGAAATGACA CCAGTTTCTA AATTAATTGA ACAAGGATTA ACGCCAGAAG 4320

GATTACTAAA CGAAATCTTA GGTGAAGACC ATGTGCAAAT TTTAGAGAAA ATGCCTGTTC 4380

AATTTGAATG TAATTGTAGT CATGAGAAAT TTTTAAATGC TATTAAAGGA TTGGGCGAGG 4440

CTGAGATTCA AAATATGATT AAAGAAGATC ATGGTGCTGA AGCAGTATGT CATTTCTGTG 4500

GAAATAAATA TAAATATACT GAAGAAGAAT TAAACGTGTT GCTAGAAAGT TTAGCGTAAT 4560

TTAATTTAAA TCAATACGCT AAAATGTTTA TTTTTAGCGG TTTAGTGAAA TGTAGAACTA 4620

AATAGTTGTA TAATCCTTAG TGATTTTGTT TGCTTTCTAG AATTTATTTG ATAAAATAAT 4680

TCTATATCCG ATAAATAAAC TAAGATTTCA ACAACTAACT AAAAAGGAGT GTTCTTAATG 4740

GCAC2AAAAAC CAGTAGATAA TATTACTCAA ATTATTGGCG GTACACCGGT AGTCAAATTG 4800

AGAAATGTAG TAGATGACAA TGCAGCAGAT GTTTATGTAA AATTGGAATA TCAAAATCCA 4860

GGTGGTTCTG TAAAGGATAG AATTGCTTTA GCAATGATTG AAAAAGCAGA GCGAGAAGGC 4920

AAAATTAAAC CTGGCGATAC AATTGTAGAA CCAACAAGTG GTAATACAGG TATCGGTTTA 4980

GCATTTGTAT GTGCTGCTAA AGGATATAAA GCAGTATTTA CTATGCCCGA AACAATGAGC 5040

CAAGAGCGTC GTAATTTATT AAAAGCATAC GGTGCGGAAT TAGTTTTAAC GCCTGGATCA 5100

GAAGCGATGA AAGGTGCAAT TAAAAAAGCT AAAGAATTGA AAGAAGAACA TGGTTACTTC 5160

GAGCCAGAAC AATTTGAAAA CCCTGCGAAC CCTGAAGTTC ATGAGTTAAC TACAGGTCCT 5220

GAGTTATTAC AACAATTTGA AGGGAAAACT ATCGATGCGT TCCTAGCTGG TGTTGGTACT 5280
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GTTGCTATAG AGCCTGAGGC TTCTCCAGTA TTGAGCGGTG GTGAGCCAGG TCCACATAAA 5400 

TTACAAGGTT TAGGTGCTGG ATTTATTCCA GGCACTTTGA ATACAGAAAT CTATGACAGT 5460

ATTATTAAAG TAGGAAATGA TACAGCGATG GAAATGTCTC GTCGAGTTGC TAAAGAGGAA 5520

GGTATTTTAG CAGGTATTTC ATCAGGTGCT GCGATTTATG CTGCCATTCA AAAAGCAAAA 5580 

GAATTAGGAA AAGGTAAAAC AGTAGTAACA GTATTGCCGA GTAATGGTGA ACGCTACTTA 5640 

TCAACACCTT TATATTCATT CGATGACTAA TTAATGTCAT TTAAAAGAGT GAGTTATCTT 5700 

TTTGAGATAA CTTGCTCTTT TTTTCTACCA TGTATATTTT TAAAAATATG AGCGTTAAAT 5760 

TAAACATTTT TCTGATAAAA ATATCCAGTG AATGATAAGA TAATAAACGT ACATACTAAT 5820 

AACTAGTAAA TAGCAGGAGT AAATTTTATT AGAGTTAAAC AATACATAAT TAAAGGGTGG 5880 

TTAACATGAC TAAAACAAAA ATTATGGGcA TATTAAACGT CACACCTGAT TcATTCTcAG 5940 

ATGGTGGAAA ATTTAATAAT GTTGAATCAG CTATAAATAG aGTGAAAGCC ATGATAGATG 6000

AAGGTGCTGA CATTATAGAT GTTGGAGGTG TTTCAACGAG ACCCGGTCAT GAAATGGTTT 6060 

CATTAGAAGA TGAGATGAAC AGAGTATTAC CTGTTGTTGA AGCTATTGTC GGTTT 6115 

(2) INFORMATION FOR SEQ ID NO: 149:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10401 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 149:

TAGATACTGG GnTAAAcaTc AAAAATAtyT GCtTaTTCaC GTGTTTAcGc TCCCtCAAAC 60

GCAACGTTAA TTGCGTGTAA TCATTTAGTG TGAATTcAGA CGCTTCTTCC ATGACTATGT 120

CTGATATGCC TTTTATCGAC TTTATTTTCT CTGGGTTATC TAATCCTTTA AACAAAAAAA 180

CTGCGCCGTT TGGCAATTCA ACTTTGTTAT CAGTCTTATT CCAAAGGCAC ATGTCCCAAA 240

TACCAAAGTT TATCAAACAA TCTTTAACAT CTTCGAACAA ACTATCTTTA ATTGTTGATT 300

GTACTTTTCT AAGCCACAGT ATACGCCTAG GATATTTCCA ATCTTGCAAT GCTTTGAGTA 360

CAACTTTTTG TATAACGCCG TGAGACTTAC CGCTCGAACC TCCACCGTAA TGkACTTCAG 420

TGAAGTtATC GTAATTGGTT AGTATTTCGA ATATGTTTCT ATTGAAAACA TTAGACGGTT 480

TGTTAAAGTT TAATTTAACT TTCGTCATCG TACTCACCAA TATTAATCTC AATATTCTTC 540

TGAGTAATTT CTTTTTTATC GATATACGCA CCATGTACTT TTAGTATGTG GTCAATAGAT 600
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10 



TTTAAATGGT CATATTTCTT ACTGTAAGCC TCTTGAGGTT CTCCTCTAGC AATAGAAGCA 720 

GATAACGCTA AAGCTTCTGT AATACTCATT AAACGCTCTT CTTGTATCTG TTCTAATCGT 780 

TCTTTAATAT ATTCCGAAAC ATTAACATTT CTTAACAATC GACTTGCTAA AGACTCTGCT 840 

GTTTTCTTAC TATAACCTGC TGTAATTGCT GCTTTTTTAC CATTACATCC ATTCATTATA 900

TATTCATCTG CGAATCTCTT TTGTTTTTCG TTCATTTCAT TTACCACCAA CTCTCGCGCT 960

ATACGCTTTT TAAAATTAAA AAAGGATTGG CTATAATCAG CCAACCCACA TAGATCCTTT 1020

ATTCCTAATT GCGATAAGGG AAACGCAGTA CGATAGTCAA TATCCTACAC TATCATAATA 1080 

TCTCATTTAA GGTATCAAAA ACTGCCACTT TACTGCGAAT TTCAGTCTTC CCCTAACTCT 1140 

TCCGCCAATC TAGATATGAT TTTTCTTTTG ATTCTATGAG CAGTTCTATC AGAAATGTGT 1200 

ATGTCAACAC AAACTTTCAC TAATTCCTTT TTATTAAAAT AATACTCTTG AATGAATTCG 1260

CGTTCTTTCC TGCTTGATGT GTTGATTATA CGTTCAATAG CGCTCTTAAA CTCAAGGATT 1320 

TTACCTCTTC GTATACTACA AAGATAATTA GTTACTGCCA TTTCTGTTTT CGATGTATTA 1380 

GACGGTACAA ACTCCCCGCC TATATTTGTA TCTGTTGGAA TCCACGGTGT CATTATTTCA 1440 

CTTCTTAAAT CTTCAAGTTG TTTATGATAA TTAGGATAAT CACACAACTC ATCTTCTAAC 1500 

TTTCGAACTG TTGATAATTT TAATCCGTAT TTCTTTTTAG TCATGAATAC CCTCCGTACA 1560 

AATATGTTTA ATCTTCAAAG TGTCTCAATC TACTTCTTAA TATCTCTATC TCTCGCTCTT 1620 

TAACTTTTAC ATCACCTTTT AACTGTTCCG CTTGTAACAT CACACCAAAC AATAAGATGA 1680 

CTAGTAATAT AATTGCTATG ATTAACCACA TCATCTACTC CGACACCTCC GCCCTCATCA 1740 

AATCAGACTG ATCACTCAAC TTTGCGAAGT CACTTGGCGC CTCTACATCA TCATTAGCCG 1800 

TCATCATAAT ATATACTTGC TCAGTTACAT ACTTACCTAA CTCATACATC GCTAGTAAGA 1860 

ATAATAGTCT CAAAATTTCT TTAACCACCA CTAAACACCC CATGTTAATT TATCGATAAT 1920 

TTGTATAGCT TGTTTTAATG CGTCTCTTTT TTCTTTGATA TCTCTATTAT CGCCATCTTC 1980 

ATCAGCTGAC ATTAACTCAC TGTCATATTC ATATAATAGT TCTGATATTT CATTACTAGC 2040

TACTACTAAT AAGTTTTCAT CTACATCAAT CGTTACCGTT TTCTTTGGCA TCTCCATCTC 2100 

TCCTTATCTT AACTTGTGCC TCGTATTTGC GCTCAGCTTC TTCTTTACTC TCTGCCTCAA 2160 

CAACTGTAAA CGTCTGATTA TCTCTAGCAG TAGTAAAATG TTCATGTGGT TGTCCTGTTG 2220 

AATCTTTGAA TGTTGTGACT AAGTATTGCG TCACTTCTTA TCACTCCTTT GAATGATTCT 2280 

AAGTTTTTCT ACGAATAAAA GTATTAGTAC AACACTCAAT GTAGCCAACA TATTTTTTTG 2340 

CTTTGCAAAA TCTACTATAA CGATTAAGAC TAATAACATT CCAATTCTGC ATGTAAATAA 2400 
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TACAAGTATT 


GGAACTAATG 


TAATGATGTA ACTCACTTCC 


CCAAAACCTC 


CTTGACTCGA 


2520 




TCTAAGATGT CTTTACACTC CGCTACTTCC GAAGCCTTTT TCTCCACGTT CTGAAACACT 2580

TTCGAATTCC TCCACTTGCT TTAGTTCAGG TGTCCATATA GGCACGATAA CCAATTGAGC 2640

TAGTTTGTCT CCTTCGTTGA TTTGATAAGT TCCGTATTGT CTTATGGCGT CACTCAAATC 2700

GATTTCTCCT TTAATATCAA AAACACCTGG TGTGATATAA CCATTCGATG CAATAGCGTC 2760

ATTCTTGATA TTAATCCCTA AATTGCCGTG ATATCCCGCG TCTATCTTGC CTGTTTCAAT 2820

CACTAAATGC GTTTTACTAC TTACACCACT ACGGCTAGTT AATAGTCCGA CATAGCCCTC 2880

TGGTATGCTT ACAGCTACAT CTGTTTTAAT CACTGCCTTT TCTTGTGGCT CAAGTACGAC 2940

AGTTTCAGCT GAGAATATGT CATAACCTGC ATCCGTCTTA TGATTTCGTT CGGGCATTCT 3000

AGCATXTIX-T GATAATAGCC TTACTTGTAA TGTGTTAGTC ATTTTCCTGC TCCTCCCTAG 3060

CTGTAGCAAA CGCTATTCTC AATTTCAATC TTTCAACAAT ATGAATTAGT GCGGTATTGA 3120

GGAATATTTC AAATTCTTCA ATGTTCTCAT CTATAAAATC AAGTATTTCT TCCTCTTGTT 3180

CACTGTCAAA CTCGCTTAGT ACA7CCCAAA TATTTATGTC GCTTTTGCTC GTTTCTAATA 3240

CTCTTTTGAT TATTTCTGAA TTACTTTTAT TACTCATTTT CCTTGTTCCT CCTCATATTT 3300

ATAGACAACT TGACCTGCCA TAATCCCTAC TGCTTCATCA AGTTCAATAC CTTCTTTAAC 3360

TGAATGTTGA ATAGCATTTG TCATTCCCTC AAGTATTTCA TCAAACGCTT GTGCTCTCTT 3420

ATACACGTCC TCAATCTCTT TTAGTAATCC CTCTGTGTCA TTACCGTTAT ACGCACTAGC 3480

ACTGATCACT GATTGTTCAA TTTGTTCGCG GTTATTCATC ATTTCCATCT CCTCTAAAAT 3540

AAAGTTAGTT GCTTCTGCTC CTCGTATTCC AAACCATGTT GCTTTATATA TGTTTCGAGC 3600

TCTTCCGCTG TATCAAATGT CTTTTTCACG CCTTGCCAAC CTGGCACGAT ATGCCCATGa 3660

AAGTAATAAG TGCCGTTCAC TACATGGATA TGTGCCACTC GTTCGTTATC CTGATACAGA 3720

TATCTCTTAG ATCCGAAAAA TTGGTTTAAG TATTCTTTAC ATGCGCTATC GGTTTTAGGC 3780

ATTTATGCTT CC7GCCATTT CTTAAACATT TGGTTATAAG TAGTATCAAA CCAGTACGGA 3840

TCACGTGAAT GTTTTTGAGG CACATTAAAC AAATGTGGCT TCTTCTTACG TAGTTCAGCC 3900

TCTTTACGTC GTTGCCTAGC CATTTCACGC TCTTTGCTCT CTCGCTCCAT GATTTTGGAT 3960

AACACAATTT CTTTATACTC AGCTAAGCGC ATACCATAAG GTGCATGTAA GGCTTCTAAC 4020

AACGCCCAGC CACCTCGTAC TCTTTTTGCA ACCATTCCTG GAGTTAAACC GTTCTTTTTT 4080

ATCAATTCAT TTTCATGTTC GGTAAATTTA TATGGTTTAC CGTTAATCTT TACGATACTC 4140

ATTTATTCCA CCTCTATACA TTTACTTTTT TTAATCCAAT CCTCTAATTT GTGCGTGTTG 4200
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ACATTTAAGT TAACCATCTC AGCTTTTCCG 


TTTTTATATC 


CACTAATAGT 


TGATCTTGAT 


4320 




ACGCCAGTTT CATTGTGCAA ATCTTGGACA CTTACGTTAT CTCTAGCCAT GATTACCCTT 4380

AAATTAGTTG CGAATACTtC GTTCAACTTC ATTTATTCCA CCTCTATATA TGCATGTCTT 4440

ATTGTTATGT TGTCATACTT TAGTAATTCG TCCGGATTGT CATCTAAGCG CTTTGCCAGC 4500

GTATCTTTTT CTTTATCCAC ATCATCGTAA TGCTGATATT CAACTTCTGT AGGTATTCTT 4560

ATATCAATCG TTGCGTTTAT ATATGCTTGT TGTTGCATTA GATCACTTCA TTTCTCTTTT 4620

TCTTTTACGT CTGACTTTCA CTAAGTCCTC ATATACCATC CATTCTTGAC CTGTGTATTT 4680

AGGCGCTTTA CATATCCACG TTAAATTCAC ATCTCTATAC TGATATCTGA ATATCTTCGG 4740

TTTGATGTTG GCAACTTCAG TCGCCTTACC TTTAACGTCT ATAACTTCAA CCAGTTTCCC 4800

TTCCTTCCAC AAAGAGAAAT CGGCTATATA CGTAATCGGT CTTTGTTTCC CGAATTTAGG 4860

TTGTAATTCA AATTTCGGTT GTATTTCGAT ACGATCATAG TTAGTGCCAT TCATATTACT 4920

TTCTAAATAT TGGTAATATT CGCACTCTAC TTTGCTATCA AATACAATTC CTTTGTACTC 4980

AACTTTCTTA GCATTGTATT TACTCATTGT GCCACCTCTA AATATCAAAT ATCGTTGCTT 5040

GCAATCCTAG CTCTTGCTCA TATAGAAGCC CGTGAGCGCC TTTGAATCGT TTTAGGTCAC 5100

TATCAGTCAT AATTTTCTTT TCGTCGCTGA AATGGGCTCC TGTGAGCGAA TAAACTTCAT 5160

TTACGTTGTC TTTATACTTG ATGACCTTAA TATCTTCCGT GCCATCTTCT CGGTATAAGT 5220

AATATTTTTC TTTCGGCATT TTTTAACACT CCTTAATGTG TGTTTTCTTC CAGTTGATTT 5280

CATTCATGAT TTTCTTTTCA ACTCTGTCGT AATCATCGAA AGGCGATAAC TCGTTATTGT 5340

CCAACAATCT ATTGACCGCC CAACCAGTCT CGATATATAC ATTTGCTACA ATCGGGTCGC 5400

TTTGCTTTGT CTCTTCATAC ATCGATTTCA ATAAGCTTTT GAATTGCATT ATATTCATGT 5460

GAAAAACCTC TGAGTCTTCT TGTAATACTC AAATTCAATT ATTCCGGTTT CGCCGTCTTT 5520

GTTTTTGGCT ATGTTACATT CAACAATAGA TTTGCCAGTG ATACTGTCAT CTTCGTCACG 5580

GTTATAATAA TCATCACGGT AAAGTAGCAT CGCTAAACTC GCATCTGCTT CTATTCCGCC 5640

TGATTCTTTC ATGTCCGATA GCATTGGTCT TTTATCCTGT CTAGACTCGA CACCACGATT 5700

CAGTTGTGAA AGTAGTACGA TGATTGCGCC TGTCTCGTTA GCGATTATCT TTAAGTCACG 5760

TGATATCTTT TCTACTGCTA CACGTCTATC AACTTTCGCA TCAGTATCCA TCAGTTGAAG 5820

ATAATCTATA AAAATAACTT GTTGCCTGTC TGAATGCCTC ATTGtTGCGC TCGCACATCT 5880

TGCGGTGTGA TATTACTTTT ATCAGAAATA TCGATGCCTA ATTTCATGAT TTTATCCATC 5940

GCATTCGTTA ACTTTGTTAA GTCATCCGGC GTTAAGTTCC TGATTTCTTT TATCTTTGTT 6000
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AGACTAAAGA AAGATGTTTT 


GTATCCATTT 


TGTGCTATGT 


TCAGCATCAT 


GTTTAATGCA 


6120 




AAACCTGTCT TACCCACTGA GGGACGCGCT GCGATGACGA TTAATTGTGA TGGTTCTAAT 6180

CCCCCTATTT TGTAATCCAT TAGCTTGTAA CCCGTCTTAA TTTGCTTCTT AGGGCTATCG 6240

CTGTATAACT CTTCGACAAA CTCCTCAACA AACTTCTTGG TTCCATCTTC TTTTTTGTTA 6300

GTAATTGTTT TTAAATCCTT GAGTTCATCA ATCAAGTTGT TAAAGTTTTG GTTCGTAGGT 6360

TGTTGTTTGA ACTCAGTTAC CAATTCGTTA GCTTTGTTGA GCTGATAACT TTCCAATAAT 6420

TCTTGTTGAT AACGTTCAAA GAAGCCATAT CCAATGAAAT CGGAGTTGTA AAGTTTAGTT 6480

ATAGTATCTG CATCTAAAAA TTCTTTATCT TTAGTTGCTT TTAAATAGAT TTCTTGATGA 6540

TCTATCTTTC CGACGTCCAT TACATAATTG AAAAAGGTTT TAAACTTTTC GTTCGTAAAC 6600

ATGTAATCTT TAACTCTTAT CTTTTCTAAT ACGTCCGGTT GTTTAAGTAG CGTAGCGATT 6660

ATTGTACTTT CAATTTCGAA TTGTCCGTAA TTCATTCGTT TTCGCCCCCA AATTCTGCCA 6720




ACTTATTCAT 
GCATTTTCCA 


GAACTTATCT 


AGCGCTATTT 


TTCTTTGTCT 


GACATATTCG 


GGGTCATTCT 


6780 






GCGGTTTCGT 


TATCTACTGG 


CTCGATAGAT AL 1 1 1 1 1 TAG 


6840 


GTTCCTTACG CATGATTGCT GGTAAGTTAG GCGGGTACGG GTTGTTACTG TTGATATAAA 6900

CATCTACCGC TTTTACAGTT GGTTGATAAT CTCCATTTTG ACTTAATACA TCAATCCACA 6960

TTTCTAACTT CGGTTTATCA AAATCAATGT TGTATACGTA CCTAACTTTT TTAATAATTT 7020

CTAATGCTTG TGTTTTGCTC ATCGGCATTA GTCATCACTC AATTCTTTTT CCATTTGTGC 7080

AATGACATCA TCAGTAGTAT TTTTTCTAGG TGCTATTTTA TTTTCTGCAT CTTCTTTTGT 7140

TTTGACATTC TCTTTAGCCC AGTTGTTTAA AACTTTAATT AAATAGCCAC CATGCGCACT 7200

TTTGCTTTTA GTGTACTCAA CACCTACTTT TACAACTTCA AAAGCGTTTG TACCTATATC 7260

ATCAATAGCA AACCCTAATT GTTCCATTTG ATTAGGTGTT AACTTATCAT CCAAATTTGC 7320

AATTATATAT TTTATTGAAG ATGAGAAGAC GGCTTCTCTT TCTTCTTCTT TATTCTTATA 7380

TTCTTCTTCT TTTTCTTCTT CTCTTTCTTC TTCTTCTTCT GTATCGTTAC GTAACGTTAC 7440

GGTAACGTTA CGTTTTGCTT CTAGTAACTT TTTCTGTTTC TCACGATAGC GTTGTTGTCG 7500

CAATTTATTT TTTTCTTTAT GCTTAGCTTT GCTATCTAAG CTTTGATGCT TCTCCCAGTT 7560

TGTCACTTTT ATGACACCAT TAACTTTTTC AATCATGCCC AATGTCTCAA AAGTTTGAAT 7620

TGCTAACCTT ATTGAGTTAA TAGGTCTATT AAATTCATTT GCTAACATTT CTTCGTTGTA 7680

CGGCAAGTTT TCGGATAGCA TAATATAACC TTGTTCATTG TACTTTCCTG ATAAAGTTAG 7740

TAACTTAACC CAAATAGTTA TGATCGTATC TCTTTCGGGT AAAGCTTCGA TATATTTGAT 7800
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CTCCTTTCAG CATTTTGTTG AGCCTCTCAT CAA CTTTT AT CCACGAGTCA TGCAAGTGAT 7920 

ATTTATCATC AAACGACTTA ACGCCAATTG CGTGCTGTTC ATTATGATGT TGTCTACACA 7980

GTGCTAACAC ATGTTTGTCG TAGTGATTCA TTTTGTTTCT GTTCATGCCT CTGCCGACTG 8040 

CTTCATAATG TGCCAGGTCT GCGTGAGGCT TTCCGCATAT TACACAGTTG CGGTTGATTG 8100 

TAGCCCAATA TAATAACGCT TTATCTTCGC TTAACAACTT ACTCGTTTCT ACACTCATAG 8160 

GTATTTGATG ATGAAACATA AACGCTATAA TCAGTTCTAT TAACTCCCTT GCAACTTTCA 8220 

TAGAACAGTC GCGCAGACTG ATTTCTTCAT AACCTTTCAT AATTTCCAAT TCTGTTTGTA 8280 

ATAATTTTCT AGTTGATTCT ACTGGTTCGC CCCAGTGAAG TTCTATATCT CTACACATTG 8340

CGAATATTTT TTTGCGTTGT TCTATAGATA GTTTTTTATT GTCCGGAACC TCTACTTCTG 8400 

CTTTTAGTGG ATATCCGTTT TCTAGTAAGT CAATGTGACt TTGTTCAAGT TCAACACCAG 8460 

TAGCAACGAC GGAATAAGTA CCGTCATTGT CTTTCTGGTA TCTTGTAATG TATTGCATTT 8520

AAACCACGTC CTAGAACGGT AAATCATCAT CATTGATTTC TATTGGACCA TTAGCATTAG 8580 

CGAAfGGGTT TGATTGTTGA CTCATTGGCG TCTGTTTCCC ATTTGCTTGC TGTTCTTTTT 8640 

GTTTCATCTC ATCAGTTTTA GGTTCTGGTT TATTAACTAC TTCATCGTCT TTATTCCAAA 8700

CTTTTACATA TGAGAGTCTT ACAAAATACT TGCCTTGTTC CTCGTTAAAT TTATTTTTAA 8760 

GTACAATAGT TCCGATTTTG TTAATTAATT GATCTGTGTC AAAAGTTAAA TCTGGTAAGT 8820 

TCAATTTAAT TCCTAATCTA CTAAGTAACT CGATATATTG TTTTTCTTGA TAATCTTGTT 8880

GGAATGGTGG GACGAATTGG TTGTGTTTGT ATTGTTTACC TTCGTTGTTT TCAAAAACAA 8940 

TCGTGAAGTA TCTGTTTTCT CTGTCGTTAA ACTCGACATT TGCAACTTTT ACTGTAAATT 9000 

CTCCAGCTCC TAAAAAGTCC CCACCTTTCA TGAATGCCTC TTGATTAGTT TCTTGAATGT 9060 

ATTGTGTTCT ACCAGTGATT TTCATAATTT TTATACCGTC CTTTTAATTA ATTTTTAATT 9120 

ACCATTTCTA ATTGCTTGTA CAAGATCGTT AATACTTGGA TTAATGAAAC GTTTGTTGTT 9180 

AATTTTGATG TTGCTTGAGT GTCTTATCTT TGTCTCGAAT AAATTTGATG GTTCAGCGTT 9240 

AAGTACATAT TGATAAGTTT TTTCGGCGTC TTGCTCATGT TCTTCTATTG TCATTCTTGC 9300 

TAACACGTCA GATTGACTGA TGACTGCTTT TTTTATTTGG TCTTGTGCCT CTATCGTGAT 9360

TGTTGGATTG ATAGTACTTC CCTCATCATC TTTGTCTTTG TTAATGCCCT CGTGTCCGCT 9420 

TATAGCAAGA TGAAATTGAT AATGTTCTTG TAATTTAGAA ATATAACGAT AAATACTTAC 9480

AATGCGTGTA GCACACTCGC CCCAATCATT AAATGTCGGT TTCTTTGATT TACCGTCCAT 9540

GATGTCGTCC ATAGTGATAT CACGTAACTT TTGGATTGTT TCAATCACTA CAACATCAAT 9600 
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AAAATGCTTA TAATTCTTAA TCTGCACAAC TGCCCCATCT TCTGTTACCG TTGTTCCGTC 9720 

CTCATTTATA TCTAGTACTA AGGCATTGTT ATCTTTTGTT AAAAACGTAG TTTTACCAGT 9780 

ACCGAACTTG CCGTATATCG CAAATTTATA AAACTTGTTT GCATTTTGTT TGCTGATGTC 9840

TTTTACACCT AGTTGCGTTA AAATATCGAC ATCTTGATTA GTTTTTTCAG TCATCTATTC 9900 

TCCCACCTTT ACCGTGTATG ACGTTGGTTT CTCCACAATG CTAGCACCCT CTAAAACTTC 9960 

GCCGTTTGCG TCAATCAATG TGCCGTTTTC AGTTACATTG AAATCTTTCT TAATGTCTGA 10020 

TTGGCTAAGT TTTTTAGTTA CTTTTACATA GTTGTCAAAA CCTCGTTGCT CAAGTTGTnT 10080 

AATGACTTCT TGCTCATTGC TAACTTGAAT GACTTTTGAA CCTTTTCTGG CTGTCACTTT 10140 

TCCGTAAGtG TATTCAACTT GAATTTGCTA TCTTGTTCTT TTTGTATTCT GTAATATTCA 10200 

ATTACAAGGC TTTGTAAATA TTCTTTGCCA CTCTGTAATT TTTCTACTTC TTTATCTTTC 10260 

CATTCGTTTA TGCGTTCAAT TTCTTTATTT GCTAAATCGT TGATTTCATT CTCTTTAGTT 10320

GTGATTGCAT CCAGTTTCTn AAAAACCCAG TTAGCACTGT CTAGATCAGT nACTTTGAAT 10380 

CGGTCGTCTT GTTCGAATGT n 10401 

(2) INFORMATION FOR SEQ ID NO: 150:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2989 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 150:

TTTCTCTCTA TTATTCTCGA TGCGTAGATA ATTGTTTAAA TTTAAGTTTA TAGTAATGTT 60 

GAGTTTATAA TTTCATATAT CTAAAAACAG GTGTTGTATA TATAATCATT CATCTAGTTA 120 

TACTTACTTT AAAAATAATA TAATTTCATG CGATGCAATT CATTGATGGA TGTTTTTAAT 180 

CTTAATCAAA TCCAaATAAA GCATATATTT TTAAATTCAC TTTCTTTCGA ATCGATTTTT 240 

ATCTCTTGnA TTAAACTTTT CCATTGTTTC ATTAAAGCTC TCTGTCATAT CTATTCCCAT 300 

TGAATTCGCT AAACATAACA ACACAAATAA ATTATCACCT AATTCTGCTT TAATCGTATT 360

TGCTTCCTCT GAATCTTTCT TCTTTTTTTC ACCATAGGTA TGATTTATTT CACGTGCAAG 420 

TTCGCCCACT TCTTCAGTCA ATCTAGCTAA GTTAGCTAAT GGTGAAAAAT ATCCTGTTTT 480 

AAATTGTCCA ATATATTCAT CAACTTCACG TTGCATTTCT ACCATTGATT TCATTTCTAC 540

GTTCTCCTTA TATTGCATTT CTAATATAGT ATATATCAAT TTGAAGTCTC ATGCATGTTT 600 
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AATTCAGTTT ATATAAATGT AATGCATTCC TAACTAAATT AAATCAATTG AAATTGGGAT 720 

TATAACTTTA TGATACGTAC CACTACAATA AAATAATATA GTGAATAATC TACCATTAGA 780 

AAAATAAGCA CAAAAAAACT AGCAACCACA CAAAAATGTG ATTAGCTAGT TAATAAGTGT 840 

CTAATTTAAG TTAATTGTTA ATCTATAAGA TTAATCACTT GAACGCGCAA TCAAAATAAT 900 

ACGTACAAGC TCTGCTACAG CGACTGCAGT TGCTGCAACA TAAGTCATTG CTGCTGCAGA 960 

TAATACTTTA CGCGCATGCT TGTATTCTTT TTCATTTACA ATGTTCAATG CCGTAATTTG 1020 

TTTCATCGCT CTTGAACTCG CATCAAACTC AACTGGTAAC GTAACAATTG AGAATAATAC 1080 

CGCTAATGAC ATTAAACCAG CACCAATCCA TAAAGCAGTT GAACCaAATG CACTACCTAT 1140 

CGCTGTTAAG ATAATACCTA ACATGATGAT CATATAACTT AATGAACTCC CTAGGTTTGC 1200

AACAGGTACT AATGCTGCTC TGAATCTTAA GAACCAATAT CCTTGGTGAT CTTGAATGGC 1260 

ATGACCAACT TCGTGGGCTG CAATTGCAGT TCCAGCAACT GATGGTCTGT CATAGTTTGC 1320

AGGAGATAGT GAAACAACTT TCTTTTTAGG ATCGTAATGA TCTGTTAAGA ATCCTTCACC 1380 

TTTAACAACT TCGACATCAT AAATACCGTT TGCATGTAAA ATTTCTAATG CAACTTCACG 1440

ACCQSTTTTA CCACTAGTTG ATCTAACTTG TGAATATTTC TCATAGTTAG ATTTAACTTT 1500

GTGTTGTGCC CATAAAGGAA GCACCATTAA TATTACGAAA TAAATTATCA TAGTAAAAAT 1560 

TGAAGACAAT AAACTCACTC TCCTTTATAA ATATTTTACT GTCATTTGCC GTTTTTATCA 1620 

AATCATTTAC ACTTTAATAA TTTGTTTAAT TCAATATAAA GCAAAAGTCC AAAAACACTT 1680 

AGACAACATG ATAATACACC AATTTGCCAC ACATGTGTAG TTATAAAATC ATAATATGGA 1740 

AATTGAAGGT GAAAATAGTC AATATAATCA TTCAAAAACA CCCAAATCAT yGCTACACTG 1800 

ATTCCAATCA TAGAACGTTT AAACCTAGGA TAGAAGTAAA TTGCCTGAAC AGCCATTATA 1860 

CTGTGGGAAA ACATTAATAC CAAACCATTT ACTGTAATAT CACCTTGTTC AATAATAAAT 1920 

AATATATTCA TTATAACTGC CCAAATCCCA TATTTGAATA ATGTTACAAA TGCCAGTGCA 1980 

TCGATAATAC TATTTTGTTT TTGAATTAAT ATCAATGAGA TAGAAATAAC TAAGTATAAT 2040

ATTGCAGTTG GGCTATCTGG AACAAAAATC TTAAAATGCC AGGGCGTATG ACTTAATTGT 2100 

TCACCATACC ATATATAACC ATAAATCATC CCTAATATAT TACAAATGAG TAGCATCATT 2160

AACCAAGAAC GTTGATAAAG TGTATATTGC CAAAATGCTT TAATTGTCAT CTGCTAAGTC 2220 

CTCAAATTGA TTATGTTTAT TTACTAGCTT GAGTGTATTT AAAATTTGCG TTAGTTGATA 2280

AAAACGTTGC TTTTCATTCA TCTGTAAACT TAAATCAATA TTGTGTAACA AGTAATCTAT 2340

TAATAACGGA TGTTTATGCC GATCTATAGC CATACTATTT AAGTCATGAA GATAAGTTTG 2400
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TGACACGTTT GCGAAGTGAA TTTGAATATC AAAAGCACAG TTATGATTAG CGATATAATC 2520 

AAATATTTCA TTTGTATTCA TTAACTTTAT ATTACGCTTA GTAAATTGAA TTGCAGAAGC 2580

GTGACTTCCC ACTTCTGCAA TTTCTAATGT TTCATGATGA TTAATTTTTG TATCTACAAA 2640

ATGAATGTTT GCCAATTTCG CCTCATTCAC TTTTATATAG TTAAGCACCC AAACTGCAAT 2700 

ACGCGACTTA AATCGATATT GAAAAAGTAA ATATTCAATA AAACTTTCTT TAATTTGATT 2760 

GAGTGTCTCT GACATCAAAT ACCCCATTTT AAGATTGCAA TCTTGaTAAT TCGTCATGCC 2820 

AATTTTCGTT ACTTGGcTCT AGTTCCAACA ATTGATTTAA AATAGTAATT GCTTGTTCCT 2880

TTTGACCAAT TTCAATTAAA TAGAAATAAT AATCACTCAT AAAATCAATA TTTGTTTTCA 2940 

TCGTTGGATA TGCTAATTCA AAGAAATGTT GAGCTTCTTT ATCTCGCTC 2989 
(2) INFORMATION FOR SEQ ID NO: 151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 151:

CATCAACTCC TTAATTACAC TGTAAATGAT ATGCGTCTTT TTGACAACTA TATTTGTCAA 60 

ATCTACACCA AAAAATATGA TTATCCACCT ATGTATGACA TTTTGAAACA AACACCTCAA 120

CGCCTACAAG TCATAATTGT TTACTTTCGT TACACCTTCC TGCATAATTA ACAGCATTCT 180 

AATTTTAGTA TGATGCACGC ATTTTCACTA AATCAAACCA TTCAAAGGAG ACTATTATGG 240

CATTTACATT ATCTGCAATT CAACAAGCAC ATCAACAATT TACTGGTGTT GACTTTCCAA 300 

AACmTTCAA AGCTTTTAAA GATATGGGGA TGACTTACAA TATCGTCAAC ATTCAAGATG 360 

GCACTGCAAC ATACGTACAT CAATCAGAAG ATGATATCGT TACGTCATCT GTAAAAAGTA 420 

ATCATCCTGT TGCTCAAAAA TCAAACAAAA CAATAGTTCA AGACGTCTTA ACTAGACATC 480

AACAAGGGCA AACAGATTTT GAAACATTTT GTGATGAAAT GGCTGAAGCT GGCATTTATA 540

AATGGCATAT CGATATTCtnA GCGGGCACTT GTACTTATAT CGACTTGCAA GACCAAGCTG 600

TTATTTCAGA ATTAATCCCT CAATAAACTA TATTTATAGC AACATTTTAA TTATTTCATA 660 

AAATTTTATT GATAATCATT ATCGTTCGGT ATAAAGTAAA TACTATATAC TACTTATGAG 720 

TGAGGTTGAT TATCATGATA ACTAACACTT TTATTTTAGG CATCACAGGC CCAACAAGTC 780

TTGTCGTCAT TAGCATTATC GCTTTAATTA TTTTTGGTCC GAAAAAATTA CCACAATTTG 840
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AGTCTCACGA TACACCCAGT AAGGAATCGA AACAACAGCG AGAGCAATAG CACTGACCAC 960 

ACCTTACTGG TTCACTTTAG CGAACTACGC CA7CGGTTAG TAAAAATTTT ATTGTCGTTC 1020 

GTCATTACGG TCATCGTCGT ATATGTyTCA TCATTTTGGT GGATGACACC ATTCATAACG 1080

TATATyACCC GgCACATGTG TcCTTACATG CATTTcATTC ACAGAAATGA TACAAATAAC 1140 

GTG 1143 



(2) INFORMATION FOR SEQ ID NO: 152:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7953 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 152:

CAACGCCTGA ACGTAAACCA TATCGTTTCG CGATTTCCTC ATCTTGACTA TTTACTAAAA 60

ACTCTCTCAT GGCGATTAAT GTTTCTTTTT CTTCTTTAGT TAATGGTAAT TCTAACTCAG 120

CTGCTTTTTG ACGCAAAGTT GGATGACCAT CTCTAATGAT GTCTTTCATT GTTAACATAT 180

ATTGCACCTT CCTTATTTTA ATTTGTTTTA GTTGAATGAC AGTAAAAAGG TTGTTAAGAT 240

ACTCATACAT TTTTATGTGT AAATATCTAC AAAGTTAACC AACTACTGCC AATGTTTATT 300

TTAGATAGTA TATGTAAATT TTCAaGAtAT GCgTAATTGC gTTAAAAAAT GaTTAAAGTG 360

TTGGTTTCAA GCAATGaTAC TTTAGAAATT TATTTATCAT CTTGACTTTA AAAATTATAT 420

TATAAATGAC GTAACTGTCA ACAGATATAC TTAGTArTGA AGATGTGTAA TGTAATTGTT 480

TAAAATTGAT TTCCAAGCAG ATTTTATTTA TCATTTAATT TAAATAGCAA GTGGAGGTAC 540

AAGTAATGAA ATTTGGAAAA ACAATCGCAG TAGTATTAGC ATCTAGTGTC TTGCTTGCAG 600

GATGTACTAC GGATAAAAAA GAAATTAAGG CATATTTAAA GCAAGTGGAT AAAATTAAAG 660

ATGATGAAGA ACCAATTAAA ACTGTTGGTA AGAAAATTGC TGAATTAGAT GAGAAAAAGA 720

AAAAATTAAC TGAAGATGTC AATAGTAAAG ATACAGCAGT TCGCGGTAAA GCAGTAAAGG 780

ATTTAATTAA AAATGCCGAT GATCGTCTAA AGGAATTTGA AAAAGAAGAA GACGCAATTA 840

AGAAGTCTGA ACAAGACTTT AAGAAAGCAA AAAGTCACGT TGATAACATT GATAATGATG 900

TTAAACGTAA AGAAGTAAAA CAATTAGATG ATGTATTAAA AGAAAAATAT AAGTTACACA 960

GTGATTACGC GAAAGCATaT AAAAAGGCTG TAAACTCAGA GAAAACATTA TTTAAATATT 1020

TAAATCAAAA TGACGCGACA CAACAAGGTG TTAACGAAAA ATCAwAAGCA ATAGAACAGA 1080
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AAGAAAAGCA AGACGTTGAT CAATTTAAAT AATTAATATA ATACAGATGG TAGGAAACAA 1200 

CTAATACAGT TCCTATTATC TGTATCTTTT TTTATTAAAA CAGAACTTTT TCAAATGGTT 1260 

TAACAGTCCC ATTTATTTGT GGTACAATTA GTAAGGATAA AATGAATTTC TATACAATTA 1320 

TGGGAAAGGT ATTGTGAATT GAATGGCTCC TAAGTTACAA GCCCAATTCG ATGCAGTAAA 1380 

AGTTTTAAAT GATACTCAAT CGAAATTTGA AATGGTTCAA ATTTTGGATG AGAATGGTAA 1440 

CGTCGTAAAT GAAGACTTAG TACCTGATCT TACGGATGAA CAATTAGTGG AATTAATGGA 1500 

AAGAATGGTA TGGACTCGTA TCCTTGATCA ACGTTCTATC TCATTAAACA GACAAGGACG 1560

TTTAGGTTTC TATGCACCAA CTGCTGGTCA AGAAGCATCA CAATTAGCGT CACAATACGC 1620 

TTTAGAAAAA GAAGATTACA TTTTACCGGG ATACAGAGAT GTTCCTCAAA TTATTTGGCA 1680 

TGGTTTACCA TTAACTGAAG CTTTCTTATT CTCAAGAGGT CACTTCAAAG GAAATCAATT 1740

CCCTGAAGGC GTTAATGCAT TAAGCCCACA AATTATTATC GGTGCACAAT ACATTCAAGC 1800

TGCTGGTGTT GCATTTGCAC TTAAAAAACG TGGTAAAAAT GCAGTTGCAA TCACTTACAC 1860 

TGGTGACGGT GGTTCTTCAC AAGGTGATTT CTACGAaGGT ATTAACTTTG CAGCAGCTTA 1920 

TAAAGCACCT GCAATTTTCG TTATTCAAAA CAATAACTAT GCAATTTCAA CACCAAGAAG 1980

CAAGCAAACT GCTGCTGAAA CATTAGCTCA AAAAGCAATT GCTGTAGGTA TTCCTGGTAT 2040

CCAAGTTGAT GGTATGGATG CGTTAgcTGT nATATCAAGC AACTAAAGAA GCACGTGACC 2100 

GCGCAgTTGC AGGTGAAGGT CCAACATTAA TTGAAACTAT GACATATCGT TATGGTCCTC 2160 

ATACAATGGC TGGTGACGAT CCAACTCGTT ACAGAACTTC AGACGAAGAT GCTGAATGGG 2220 

AGAAAAAAGA CCCATTAGTA CGTTTCCGTA AATTCCTTGA AAACAAAGGT TTATGGAATG 2280

AAGACAAAGA AAATGAAGTT ATTGAACGTG CAAAAGCTGA TATTAAAGCA GCAATTAAAG 2340

AGGCTGATAA CACTGAAAAA CAAACTGTTA CTTCTCTAAT GGAAATTATG TATGAAGATA 2400

TGCCTCAAAA CTTAGCAGAA CAATATGAAA TTTACAAAGA GAAGGAGTCG AAGTAAGCCA 2460

TGGCACAAAT GACAATGGTT CAAGCGATTA ATGATGCGCT TAAAACTGAA CTTAAAAATG 2520 

ACCAAGATGT TTTAATTTTT GGTGAAGACG TTGGTGTTAA CGGCGGTGTT TTCCGTGTTA 2580 

CTGAAGGACT ACAAAAAGAA TTTGGTGAAG ATAGAGTATT CGATACACCT TTAGCTGAAT 2640

CAGGTATTGG TGGTTTAGCG ATGGGTCTTG CAGTTGAAGG ATTCCGTCCG GTTATGGAAG 2700 

TACAATTCTT AGGTTTCGTA TTCGAAGTAT TTGATGCGAT TGCTGGACAA ATTGCACGTA 2760 

CTCGTTTCCG TTCAGGCGGT ACTAAAACTG CACCTGTAAC AATTCGTAGC CCATTTGGTG 2820

GTGGCGTACA CACACCAGAA TTACACGCAG ATAACTTAGA AGGTATTTTA GCTCAATCTC 2880 
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5. 



CTATTAGAAG TAATGACCCA GTCGTATACT TAGAGCATAT GAAATTGTAT CGTTCATTCC 3000

GTGAAGAAGT ACCTGAAGAA GAATATACAA TTGACATTGG TAAGGCTAAT GTGAAAAAAG 3060

AAGGTAATGA CATTTCAATC ATCACATACG GTGCAATGGT TCAAGAATCA ATGAAAGCTG 3120

CAGAAGAACT TGAAAAAGAT GGTTATTCTG TTGAAGTAAT TGACTTACGT ACTGTTCAAC 3180

CAATCGATGT TGACACAATT GTAGCTTCAG TTGAAAAAAC TGGTCGTGCA GTTGTAGTTC 3240

AAGAAGCACA ACGTCAAGCT GGTGTTGGTG CAGCAGTTGT AGCTGAATTA AGTGAACGTG 3300

CAATCCTTTC ATTAGAAGCA CCTATTGGAA GAGTTGCAGC AGCAGATACA ATTTATCCAT 3360

TCACTCAAGC TGAAAATGTT TGGTTACCAA ACAAAAATGA CATCATCGAA AAAGCAAAAG 3420

AAACTTTAGA ATTTTAATAC ATTTTAAAAG TTAACGAAGT TAGCGTATTT TAGTCTCATT 3480

GATTAAAATG AAATGTTTAA TTTACGAAAT CTTAGGAGGG CAAAAACGTG GCATTTGAAT 3540

TTAGATTACC CGATATCGGG GAAGGTATCC ACGAAGGTGA AATTGTAAAA TGGTTTGTTA 3600

AAGCTGGAGA TACTATTGAA GAAGACGATG TTTTAGCTGA GGTACAAAAC GATAAATCAG 3660

TAGTAGAAAT CCCATCACCA GCATCTGGTA CTGTAGAAGA AGTTATGGTA GAAGAAGGTA 3720

CAGTAGCTGT AGTTGGTGAC GTTATTGTTA AAATCGATGC ACCTGATGCA GAAGATATGC 3780

AATTTAAAGG TCATGATGAT GATTCATCAT CTAAAGAAGA ACCTGCGAAA GAGGAAGCGC 3840

CAgcAGaGCA AGCACCTGTA GCTACTCAAA CTGAAGAAGT AGATGAAAAC AGAACTGTTA 3900

AAGCAATGCC TTCAGTACGT AAATACGCAC GTGAAAAAGG TGTTAACATT AAAGCAGTTT 3960

CTGGATCTGG TAAAAATGGT CGTATTACAA AAGAAGATGT AGATGCATAC TTAAATGGTG 4020

GTGCACCAAC AGCTTCAAAT GAATCAGCTG CTTCAGCTAC AAGTGAAGAA GTTGCTGAAA 4080

CTCCTGCAGC ACCTGCAGCA GTAACATTAG AAGGCGACTT CCCAGAAACA ACTGAAAAAA 4140

TCCOTGCTAT GCGTAGAGCA ATTGCGAAAG CAATGGTTAA CTCTAAGCAT ACTGCACCTC 4200

ATGTAACATT AATGGATGAA ATTGATGTTC AAGCATTATG GGATCACCGT AAGAAATTTA 4260

AAGAAATCGC AGCTGAACAA GGTACTAAGT TAACATTGTT ACCTTATGTT GTTAAAGCAC 4320

TTGTTTCTGC ATTGAAAAAA TACCCAGCAC TTAACACTTC ATTCAATGAA GAAGCTGGTG 4380


21 H Zl TVYTTY* £ 
AAAiUoi lUi 


TAAACATTAC 


TGGAATATCG 


GTATTGCAGC 






****** \J 


TAGTACCTGT TGTTAAACAT GCTGATCGTA AGTCTATTTT CCAAATTTCA GATGAAATTA 4500

ATGAATTAGC TGTTAAAGCA CGTGATGGTA AATTAACAGC CGATGAAATG AAAGGTGCTA 4560

CATGCACAAT CAGTAATATC GGTTCAGCTG GTGGACAATG GTTCACTCCA GTTATCAATC 4620

ACCCAGAAGT AGCAATCTTA GGAATTGGCC GTATTGCTCA AAAACCTATC GTTAAAGATG 4680
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ATGGTGCAAC TGGCCAAAAT GCAATGAATC ACATTAAACG TTTATTAAAT AATCCAGAAT 4800 

TATTATTAAT GGAGGGGTAA AACATGGTAG TTGGAGATTT CCCAATTGAA ACAGATACTA 4860

TAGTAATCGG AGCAGGTCCT GGTGGATACG TTGCAGCAAT TCGTGCAGCT CAATTAGGAC 4920 

AAAAAGTAAC AATCGTTGAG AAAGGTAATC TTGGTGGTGT TTGCTTAAAC GTAGGATGTA 4980

TTCCTTCAAA AGCATTACTA CATGCTTCTC ACCGTTTTGT TGAAGCACAA CATTCTGAAA 5040 

ACTTAGGTGT TATTGCTGAA AGTGTTTCTT TAAACTTCCA AAAAGTTCAA GAATTCAAAT 5100

CATCAGTTGT TAATAAATTA ACTGGTGGTG TTGAAAGCTT ACTTAAAGGT AACAAAGTTA 5160 

ACATCGTTAA AGGTGAAGCA TATTTCGTAG ATAACAATAG CTTAOGTGTT ATGGACGAAA 5220

AGAGCGCACA AACATACAAC TTTAAAAATG CAATCATTGC AACAGGTTCA AGACCAATTG 5280 

AAATTCCTAA TTTCAAATTC GGTAAACGTG TTATCGACTC AACAGGTGCT TTAAACTTAC 5340 

AAGAAGTACC aGGTAAATTA GTTGTAGTTG GTGGAGGATA CATTGGATCA GAATTAGGTA 5400

CAGCATTTGC TAACTTTGGT TCAGAAGTAA CCATCCTTGA AGGTGCTAAA GATATCTTAG 5460 

GTGGCTTCGA AAAACAAATG ACACAACCTG TTAAAAAAGG TATGAAAGAA AAAGGTGTTG 5520 

AAATCGTTAC TGAAGCTATG GCTAAATCAG CTGAAGAAAC AGATAACGGA GTTAAAGTTA 5580

CTTATGAAGC TAAAGGCGAA GAGAAAACAA TCGAAGCTGA TTATGTATTA GTAACTGTAG 5640 

GTCGTCGTCC AAACACAGAC GAATTAGGCC TAGAAGAATT AGGTGTTAAA TTCGCTGACC 5700 

GTGGATTATT AGAAGTTGAT AAACAAAGCC GTACGTCTAT CAGCAATATC TATGCAATTG 5760

GTGATATCGT TCCAGGTTTA CCACTTGCTC ACAAAGCTAG CTATGAAGCT AAAGTTGCTG 5820 

CTGAAGCAAT TGATGGTCAA GCTGCTGAAG TTGATTACAT TGGTATGCCA GCAGTATGCT 5880 

TTACTGAACC AGAATTAGCT ACAGTTGGTT ATTCAGAAGC GCAAGCTAAA GAAGAAGGTT 5940

TAGCAATTAA AGCTTCTAAA TTCCCATATG CAGCAAATGG TCGTGCATTA TCATTAGATG 6000 

ATACTAACGG ATTTGTTAAA CTTATTACAC TTAAAGAAGA TGATACTTTA ATCGGTGCTC 6060 

AAGTAGTTGG TACTGGTGCA TCAGATATTA TCTCTGAATT AGGTTTAGCA ATTGAAGCTG 6120 

GTATGAATGC TGAAGATATC GCATTAACAA TCCATGCACA TCCAACATTA GGTGAGATGA 6180 

CTATGGAAGC AGCAGAAAAA GCTATCGGAT ACCCAATCCA TACAATGTAA TAACTGATTA 6240

TCTATAAAGA TTCAGTCATT AAAAGCTGTA GCATATGCTA CGGCTTTTTT GTTTTAGGTA 6300 

AAGTAATGTA AGGAAATTGA TTTGAGATAT CGTTAACATG TGACATGCAT GTTATACTAG 6360 

CGATGCTAAT AAAAGAATTG AAATGGAGGG TTCAACAATG GAATATGAGT ATCCAATTGA 6420

TTTAGACTGG AGTAATGAAG AGATGATTTC AGTGATAAAT TTCTTTAATC ATGTAGAGAA 6480 
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AATTGTGCCT GCTAAAGCAG AGGAAAAACA AATTTTTAAT ACTTTCGAAA AAAGTAGTGG 6600 

CTATAATAGT TACAAAGCAG TTCAAGATGT AAAAACTCAC TCTGAAGAAC AAAGAGTAAC 6660 

AGCTAAAnAA TAATTCGTTC GAAATTAACA CAATTTAATA GGAATTTTTC TTTAAAACTA 6720 

TTGCTAATAA AGCTATATTT TGATACCTTT ATCAAGTGTT AAACAAAATG TTTGATAAAA 6780 

GTAAACTTAA TATAGCTTTT TTAGGTGGAA AAATAAATGA ACATAGGTAA TAAAATTAAA 6840

AATCTTAGAA GAATTAAAAA TTTAACGCAA GAAGAACTTG CTGAACGTAC AGACTTATCG 6900 

AAAGGCTACA TTTCACAAAT AGAAAGTGAA CATGCCTCAC CAAGTATGGA AACTTTCTTA 6960 

AATATTATAG AGGTGTTAGG AACGACGCCA AGTGAATTTT TTAAAGACAG TGAAAATGAA 7020 

AAAGTATTAT ACAAGAAGGA AGAACAAGTT ATTTATGATG AGTATGATGA AGGTTATATA 7080 

TTAAATTGGT TAGTTTCAAA GTCAAATGAA TATGATATGG AGCCATTAAT ATTAACTTTA 7140 

AAGCCTGGAG CATCATATAA AAATTTTAAT CCATCAGAGT CTGATACGTT TATTTATTGT 7200

ATGTCAGGTC AGATAACACT TAATTTAGGC AAAGAGATAT ATCAAGCACA AGAAGAAGAC 7260

GTTTTGTATT TTAAAGCACG AGATAATCAT CGTTTGTCAA ACGAATCAAA CAATGAAACA 7320 

CGAATACTTA TTGTAGCGAC AGCTTCATAT TTATAGGGGG GATCTTATTT GGAACCGTTA 7380

TTATCATTAA AATCAGTTAG TAAAAGCTAT GATGATCTTA ATATCTTAGA TGACATAGAT 7440

ATTGATATTG AATCAGGATA CTTTTATACA TTATTAGGTC CTTCAGGTTG TGGTAAAACA 7500 

ACAATTTTAA AATTAATTGC AGGGTTTGAA TATCCTGACA GTGGTGAAGT GATTTATCAA 7560 

AACAAACCAA TTGGTAATTT ACCACCAAAT AAACGTAAAG TGAATACAGT CTTTCAAGAT 7620 

TATGCATTAT TTCCACACTT AAACGTCTAT GATAATATCG CTTTTGGTTT GAAATTAAAA 7680 

AAATTATCAA AAACCGAAAT TGATCAAAAA GTAACTGAGG CATTAAAATT AGTAAAACTT 7740

TCAGCTTATG AAAAAAGAAA TATTAATGAA ATGAGTGGCG GACAAAAGCA ACGTGTTGCA 7800 

ATTGCACGTG CTATCGTAAA TGAACCAGAA ATATTATTGT TAGATGAATC TTTATCCGCA 7860 

TTAGATTTGA AATTGCGTAC TGAAATGCAA TATGAATTAC GAGAATTGCa ATCTAGATTA 7920 

GGtATTACAT TTATATTTGT aACACATGAT CCA 7953
(2) INFORMATION FOR SEQ ID NO: 153:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2347 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GGCGTGATCA 


TACGACCGTC ATTCATGCTC ATGAAAAAAT ATCTAAAGAT TTAAAAGAAG 60

ATCCTATTTT TAAACAAGAA GTAGAGAATC TTGAAAAAGA AATAAGAAAT GTATAAGTAG 120

GAAACTTTGG GAAATGTAAT CTGTTATATA ACAGCACTAA TGATnACAAT CATTTTTTAC 180

ATTTCTATAT GCTAATGTGG CAAGATGAGC AAAACTCATT TTGTGGATaA TGTTTaAAAG 240

TCATACACAC CATACACAAG TTATCAACAT GTGTATAAyT CGCCAAATCT ATGTTTTTAA 300

GACTTATCCA CCAATCCACA GCACCTACTA CTATTACTAA GAACTTAAAA CCTATATAAT 360

TATATATAAA CGACTGGAAG GAGTTTTAAT TAATGATGGA ATTcACTATT AAAAGAGATT 420

ATTTTATTAC ACAATTaAAT GACACATTAA AAGCTATTTC ACCAAGaACA ACATTACCTA 480

TATTAACTGG TATCAAAATC GATGCGAAAG AACATGAAGT TATATTaACT GGTTCAGACT 540

CTGAAATTTC AATAGAAATC ACTATTCCTA AAACTGTAGA TGGCGAAGAT ATTGTCAATA 600

TTTCAGAAAC AGGCTCAGTA GTACTTCCTG GACGATTCTT TGTTGATATT ATAAAAAAAT 660

TACCTGGTAA AGATGTTAAA TTATCTACAA ATGAACAATT CCAGACATTA ATTACATCAG 720

GTCATTCTGA ATTTAATTTA AGTGGCTTAG ATCCAGATCA ATATCCTTTA TTACCTCAAG 780

TTTCTAGAGA TGACGCAATT CAATTGTCGG TAAAAGTGCT TAAAAACGTG ATTGCACAAA 840

CAAATTTTGC AGTGTCCAcC TCAGAAACAC GCCCAGTACT AACTGGTGTG AACTGGCTTA 900

TACAAGAAAA TGAATTAATA TGCACAGCGA CTGACTCACA CCGCTTGGCT GTAAGAAAGT 960

TGCAGTTAGA AGATGTTTCT GAAAACAAAA ATGTCATCAT TCCAGGTAAG GCTTTAGCTG 1020

AATTAAATAA AATTATGTCT GACAATGAAG AAGACATTGA TATCTTCTTT GCTTCAAACC 1080

AAGTTTTATT TAAAGTTGGA AATGTGAACT TTATTTCTCG ATTATTAGAA GGACATTATC 1140

CTGATACAAC ACGTTTATTC CCTGAAAACT ATGAAATTAA ATTAAGTATA GACAATGGGG 1200

AGTTTTATCA TGCGATTGAT CGTGCCTCTT TATTAGCGCG TGAAGGTGGT AATAACGTTA 1260

TTAAATTAAG TACAGGTGAT GACGTTGTTG AATTGTCTTC TACATCACCA GAAATTGGTA 1320

CTGTAAAAGA AGAAGTTGAT GCAAACGATG TTGAAGGTGG TAGCCTGAAA ATTTCATTCA 1380

ACTCTAAATA TATGATGGAT GCTTTAAAAG CAATCGATAA TGATGAGGTT GAAGTTGAAT 1440

TCTTCGGTAC AATGAAACCA TTTATTCTAA AACCAAAAGG TGACGACTCG GTAACGCAAT 1500

TAATTTTACC AATCAGAACT TACTAAftkAT AAATATAAAT AAAGGATGAC GTGATTAATT 1560

AAAACGTCAT CCTTTATTTT TTGGCAAAAA TAATTCTAGG TGCGTATGTA AAATAAATTT 1620

GGCAGCATTT TAAACAGCAA ATAAAAGACG CCAATTAAAT TTATGACAAA TGTATCCAAA 1680

ATTTAATAAG TGTGCTTATA TGCCCTTTAA ATTTAAAATT TTAATAGTCA ATAACAAGTT 1740
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AAAAATAAGA ATTAATTATT TATATGTAAA CGGTTTCTAC CTCTATTTTA AATGAAATTT I860 

GTGACAAAAA AAGGTATAAT ATATTAATGA CATACAAAGA AATGGAGTGA TTATTTTGGT 1920 

rCAAGAAGTT GTAGTAGAAG GAGACATTAA TTTAGGTCAA TTTCTAAAAA CAGAAGGGAT 1980

TATTGAATCT GGTGGTCAAG CAAAATGGTT CTTGCAAGAC GTTGAAGTAT TAATTAATGG 2040 

AGTGCGTGAA ACACGTCGCG GTAAAAAGTT AGAACATCAA GATCGTATAG ATATCCCAGA 2100 

ATTACCTGAA GATGCTGGTT CTTTCTTAAT CATTCATCAA GGTGAACAAT GAAGTTAAAT 2160

ACACTCCAAT TAGAAAATTA TCGTAACTAT GATGAGGTTA CGTTGAAATG TCATCCTGAC 2220 

GTGAATATCC TCATTGGAGA AAATGCACAA GGGAAAGACA AATTTACTTG GAATCAATTT 2280 

ATACCTTAGC TTTAGCAAAA AGTCATAGAA CGAGTAATGG ATAAGGGACT CCATACCGTT 2340

TTAATGC 2347
(2) INFORMATION FOR SEQ ID NO: 154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13542 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 154: 

ACAAGACGTn TCTATAACTT ATCTGAAATC GCTCGTCAAG ATAAAGATTA TGCAACTATC 60 

TCATTCTTAA ACTGGTTCTT AGATGAACAA GTCGAAGAAG AATCAATGTT TGAAACTCAC 120 

ATCAATTATT TAACTCGTAT CGGCGATGAC AGCAATGCAT TATATCTTTA CGAAAAAGAA 180 

CTTGGCGCTC GTACATTCGA CGAAGAATAA TTAAACATCA CTACAATAGA CAGATAAATA 240

TCATACGACA TGATAGGCAT TTGGGTCACT TACAATAACC CAATGTCTAT ATTATTTTGC 300 

TTTACGGAGA TCACTAGATT CATTTTCTGA ATCATTGATG TGCGTTTTTT CATTTTCAAG 360 

GCTAATTATT GTATTTTTAG TCATTTATTT TTTAAACTAC TAATGTTAAT AACTCTAAAT 420 

TTGATGTTGA ATTAATTTGA CGATTTTAAA GCATATCATC ATTTACTTTT TAATCAGAGT 480

TACATCCAAA TGATAGATTT CACGTTATAC CTTCACGTAT AATATTATGT ATCGTTTGTA 540

AGCAAATGAC TAAAAGTCTA TTAATATATA CATTTAATTA ATTGAAAGGA TTGACTACAT 600 

GATACAAGAT GCGTTTGTTG CACTTGATTT TGAAACAGCA AATGGTAAAC GTACAAGTAT 660 

TTGTTCTGTC GGAATGGTTA AAGTCATTGA TAGTCAAATA ACAGAAACAT TTCATACTCT 720

TGTGAATCCG CAAGACTATT TTTCACAACA AAATATTAAA ATTCATGGCA TACAACCAGA 780 
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aGATTTACCT GTTGTCGCAC A7AACGCGGC ATTTG ATATG AACGTCTTAC ATCAAAGCAT 900 
TCAAAATATT GGTTTACCAA CTCCAAATTT AACTTACTTT TGTAGTTATC AACTTGCTAA 960 

AAGAACCGTT GATTCGTATC GATACGGTTT AAAACATATG ATGGAGTTTT ATCAATTAGA 1020

TTTTCATGGT CATCATGATG CATTGAATGA TGCCAAAGCA TGCGCAATGA TTACTTTTAG 1080 

GCTACTGAAA AATTATGAAA ATTTAACATA TGTAACTAAT ATTTATGGTA AAAATCTAAA 1140

AGATAAAGGC TAGGACTAAA TAAAATACTC CCTTCAAAAG TAAGCATTGT AAAAATGTAA 1200 

ACTTTGCAGG GAGCTTTATT TTATATAAAG TCATATATCG TCATATTTTT ATAAGTTGAT 1260 

TGTTCTAAAT TACCTACAGT GACACCAATA AGTCGAATTG GTACATCAGG GTCTTTTAAA 1320 

TCGTTATAAA GTAAATATGC AATATTATAA ATATCTTCTT CAGAACTAAC CGAATCTCTT 1380 

AAACTCATCT GTTTAGATAG CGTTTCAAAT TGATAAGTTT TAATTTTAAC CGTTACAGTT 1440 

TTAGCTGACT TCTGTAATTT ATTTAGACGT TCAGCTGTTT TACCTGnACA ATTCCCATAC 1500

TTTTCTTAAA ATCTCTTCAT CATCATTCAC GTCTGTTGCA AATGTGCGTT CAGTCCCTAC 1560 

TGATTTTCTT ACTCTTGATG ATTTCACTTC ACTATGGTCA ATACCGCGTG CCTTGTTATA 1620 

TAAACCCCGA CCTCTTTTTC CAAACAAACG TATTAATTCA AATTCCGTTT TCTCATATAA 1680

ATCTCTACCG TTAAAAATAC CATTATCATG CATTACTTTT TTGGAAGCTT TACCTACGCC 1740 

TGGaAAATCT CCAATATCCA ATGTCATCAA AATATCATGG aCATTTTGAT AATCAATCAC 1800 

AGTCATACCA TCAGGTTTAT TCATACCACT CGCTAATTTA GCTAAAAATT TGTTATAAGA 1860

AACACCTGCA GATGCTGTTA AATGTGTCTG CTCTAGAATA TCTTTTCTAA TATACTGAGC 1920 

AATTTTCGAA GCAGGAAGGT CTGGTCTCAC TAATTCTGTA ATATCTAAAT ACGCTTCATC 1980 

CAATGACATC GGTTCTACCT TATCTGTATA ACTTCGGAAA ATAGACATAA TCTGCGCAGA 2040 

TGTTTCTCGG TAAGCACCAA AATTACTTGT GACAAAGTAT CCATTTGGAC ATAATTTATG 2100 

CGCTTGTGAC ATAGGCATTG CTGAATGGAC GCCGTATTTT CGTGCTTCAT AGGATGCCGT 2160 

AGAGACAACA CCCCTACTGC TTGCTTTACC ACCAACAATG ACTGGTTTCC CTTTCAATTT 2220 

GGGGTTATCT CTCATTTCGA CTTGTGCAAA AAAATAGTCC ATATCTATAT GAATAATTCG 2280 

TCTCTCAGTC AAGTGCTCAC CTCCCTACTA ATTTTTACTT TTATAACGCA CAAAAATATC 2340

TCAACATAAT TATACGCTGT GTACGATTTT TTTACATAAA TCTTGCACTT AGCGATAACT 2400

ATATTGaGAT AACTACAAGT TGTTATaAAA TCAATTGCTA TTTAAGCATG ATGATGAAGA 2460 

CGATTGAGTA AGAAAACATA GGTAATCTGA AATAATTCAA GCAAATTCAT TTTGTTGGTA 2520

TCATCATATT AAAATTTATT ATTGAGTCGG CTTTTGATGA TACAAATAAA TACTATCTTC 2580 
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AAAGCAATAA GCGGTATGCA TACTAAACAT AAAAATAAGT GATGAATAAC CAAATACCTT 2700 

AATTAAAATA AGCAAGCCAG TACTTAATAG GATTAGTGGT GACAGCATAA TAATTGAGAA 2760 

TTGCCATTTG TTGAAGCAAG CATCTGCTGT TTGGAATAAG ATTCTGTCTT TTTTTATATT 2820 

AAACATAGGT TTGCTATCTT TTTTAAATAA AAGAAATAAT GCTCTATGGA TAAGTTCATG 2880 

TAAAATCAAT AAAATAATGA ATCCAGCAAA CCCATATACA AGATTGATGA TGATATTTTG 2940 

ATCGACAACC GCTGTGACAC CTAACGCCCA CTTATACGTA AATAAAATCA CGAATAACGG 3000 

AATAACAAGT TGCAAGATAA TAAACCTTCG CATTTGAAAA TTATTTGTCG TTAAATCAAT 3060 

TTTATGCATT ACCAACCCTC CCGATCATGA CATTCTTATT CTTCTTTAAA TATAGTATAC 3120

AATGTCACAT TTAATTTAAA AAGTTCATAT CAAGAAAGTA AATTGGCTGT AATAAAATTT 3180 

TAATATACGA CTTCTTTCTT CACTTATTAA GGCGAAATTT TATCtCAAAT CATGTGCGCT 3240 

ATTTCAAATT GAATAATGCC ACTGTCTCAA CATGTGTTGT TTGTGGAAAC ATATCTACCG 3300

GTGTTACCTC TTCAAGTTGA TATTTTTCAG CTAATAATAA TGCATCACGT TGCTGTGTTG 3360 

CGGGATTACA TGAAATATAG ACAATACGCT TAGGTTCTAA TGTAAGCAAA GTCTGAATAA 3420 

ACGTTTCGTC ACAGCCCTTT CTTGGCGGAT CAACCATTAC AACATCTGGT TTAATCCCTT 3480

GTGCTTTCCA TTGTAAAATA ACTTCTTCAG CTTTCCCACA GACAAAAGTT GTATTATTGC 3540 

ATTGGTTTAT AGTCGCATTT TGTTGTGCGT CTTCAATTGC AGAAGGTACT ACTTCAACAC 3600 

CGTATACATG TTTTGCAAGT GGTGCCATAT ATAGCCCTAT TGTTCCAATA CCACAATAGG 3660 

TATCTAATAC AACTTCATTA CCTGTCAATT GCGCATACTC AATTGCTTTA TTATATAATT 3720 

TCTCTGTTTG TTCAGAATTA ATTTGGTAGA ATGACTGATC ACTTATTTTA AATGTACTAT 3780

CTGTTAATTG ATCAATAATT GTATCTTTAC CATATAGCGT TATAGATTGA CGTCCCATAA 3840

TAACATTAGA GTGGCTATCA TTAATGTTTT GTTTAATGCT TGTCACATTA GGAAATGCAT 3900 

CTAATATCTT CTCAACAACA GCATTTTTTT GTGGCCACTT TTTACCATTA GTTACAAAAA 3960

TAATCATCAT TTCGTCTGTA TGATATCCTG TTCTTACAAC CAAATGTCTC ATTAAACCTT 4020

TTTTCAATTG TTCTTGATAA ATACTTACAT TTAAATCTTT TAAAATAGAT TTAACTTCAT 4080

TCATCACTTC TTGATGTTGT GAATCTTGTA TTAAACAACT TTCCATGTCA ATAATGTCAT 4140

GGCTTCTTTG ACGATAAAAG CCCATAATAA CTTCATTCTG TTCATTCTTA CCAACTGGAA 4200 

TCTGGGACTT GTTTCGATAT CTCCAAGGAT CTGTCATGCC AACTGTATCG TTAATCTTAG 4260 

AATTATCAAA ATGCGCTTTT CGCTGAAACA AATTAATCAC TTGTTCCTTT TTCATTTCAA 4320

GTTGTGCTTC GTATGATAAG TGTTGAAGTT GGCACCCACC ACAACGTTCA TAATATATAC 4380
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AGTTCTTTTT TACTTTGATA ATTTTATATT CAATTTGTTC ATTAATTAAA GCTTGTGGTA 4500 

TGAAAATAGG AAAGCGATCT ATTTTTACGA CACCATGGCC TTCATGCGTT AAATCAACAA 4560

CTGTTCCCGT TTTTATGTCA TTTTTAGCTA TTGCTTGCAA AATTTTACCT CCAAAATGAA 4620

CAGGTTAGGA ACAAAATTAT GCGCTTCCTA ACCTGCCATT ATATATTTCA CTATTTCTGT 4680 

TTATTCTTCG ATTAAATTGT CATCAACATG ATCATTATTT ATTAACTCTT CATTTACAAT 4740 

ATCATTAGGT GCAAAGACAT CTATATGACG TTCTAGATTT AAGAAATTCG CTGGTAATTT 4800

ACCACCATAT TCTCCATCTA CATTTAGTTG TAAGTCTGTG AATGATGAAA TATTAATTGC 4860 

CTTTGCTTTT TCATAAATAA CTTTAGGATG CTTAGTATGT TCTCCTCTTG AAGCTAAAGT 4920 

CATAATATGA CCAAGTTCTG CAAGGTTTGA TTTTTCAACT ATAATTAACG TAAAATAGCC 4980 

GTCATCTAAC TTAGCGTCCG GCACTAATTT TTCAAATCCT GCCATTGAAT TTGTTAAACC 5040 

TAAAAAGAAT AATAATGCTT CTCCTTGGAA AACATTACCA TCATATTCAA TTCTTAAATC 5100

TACAGCTTTC ATTTGAGGTA ACATTTCGAA ACCTTTGaTG TAATAAGCAA ATGGACCAAC 5160 

AATAGATTTC AATTTACTCG GTGTTTCATA AGAGACTTGC GTCAATTGTC CGCCTGCAGC 5220 

TAAATTAATA AAGTATCGAT TATTCATTTT ACCAATATCT ACTTTAGTAG AATGACCTTC 5280

AATGATGACA TCAAGTGCCC CCATGATGTC ATTAGGTATA TGCAATGCAC GTCCAAAGTC 5340

ATTAACAGTA CCCATAGGAA TGACACCTAG CTTAGGACGA TTAGGCTTTT CTGCGATACC 5400

ATTAACTACT TCATTTAATG TTCCATCACC ACCTGCAGCG ATTAATACAT CATAATTTTC 5460

ATGCATAGCT CTTTCTGCTT CAAGTGTGGC ATCACCTATT TTCTCOGTTG CATATGCACT 5520 

CGTTTCATAT CCCGCTTTTT CTAATTTTAT TAAGGCATCA GGTAATTCTC TTTTAAATAG 5580 

CTCTTTACCT GATGTCGGGT TATAAATGAT TCTAGCACGT TTCCTCATAT CTTATCCCTC 5640

TACTTAAAAT TCATATATTT TAACTTCATC TTTGTTTCGT CTAATAGGGA GTGGGACAGA 5700 

AATAATATTT AACAAAATTT ATTTCGTTCT ACGCCAACTT GCATTGTCTG TAGAATTTCC 5760 

TTTCGAAATT CTCTATGTTG GGGCCCCACC CCAACTTGCA CATTATTGtA AGcTGACAGA 5820 

AAGTCAGCTT CTTTGTTTGG GGGCCCCGCC AACTTGCACA TTATTGTAAG CTGACAGAAA 5880

ATCAGCTTCT ATGTTGGGGC CCCACTAGAA TTGAAAAAAG CTTGTTACAA GCGTATTTTC 5940

TTTCAGTCAA CTACAGCCAA TATAACATTG TAGTGCCTAG GACATTGAAT TTATGACCCA 6000

GGCTCAGTCT TATTTCATCA TTCTTAATAT CGTTAAAGAC CAACTTGTAT CTTAAACAAA 6060 

TACTATCTCA ATATGTACAA AGCTTGTTAT TTATTCAGCA TTTTTTGCCG TTCTTCATTA 6120

TAtAGcTTCG TCAGTTATGC TATTTTACCT TTAAAATGAT GTTGTAAATA TAATGTTGTC 6180 

EP 0 786 519 A2 



AACGCATTAA TAAAATTAAT ATTTTTACCA TTAACATGTA CAATGAATAA AGTTAAAAGT 6300 

AATTTQACTT CTATAGATAT AAATAAACCC TCGATTGCAT CTAAGTCAGC AATCAAGGGT 6360 

TTATTTTTTA AATCTTCATA GTTTGATGAT TTAAATTATC TTTTATCTAA TTCTTGTTTT 6420 

AATAGTTGAT TTACTAATTG TGGATTAGCT TGACCTTTAG ACGCTTTCAT AATTTGACCA 6480 

ACTAAGAAGC CCATAGCTTT GCCTTTACCA TTTTTGTAAT CTTCAACTGA TTGTTCGTTA 6540 

TTGTCTAATG CTTCATTTAC AAATTTTAGA AGTGTTGCTT CATCAGAAAT TTGAACTAAG 6600 

CCATTATCTT CCATAATCTG TTTAGCATTA CCACCTTTAG CTGCTAACTC TGGGAAGACT 6660 

TTCTTCGCAA TTTTACTGCT CATTGTTCCG TCTTCGATAA GTTTAATCAT ACCTGCTAAA 6720 

XTTTCTGGTG TTAATTTAGT ATCTAATAAT TCTACTTGAT TTTTATTTAA ATATTCGTTT 6780 

ACGCCACCCA TTAACCAGTT AGATGTTAAT TTAACATCTG CACCGTGTTC AATTGTTGAT 6840

TCAAAGAAAT CTGACATTTC TTTAGTCAAT GTTAATACGT GTGCATCGTA TGCAGGTAAA 6900

CCTAATTCAT TTACATACTT AGCTTTACGT TCATCTGGTA ATTCAGGAAT TGTCTGACGA 6960 

ACACGCTCTT TCCAAGCATC ATCAATATAT AAAGGTACAA TGTCAGGCTC TGGGAAGTAA 7020 

CGGTAATCAT CAGAACCTTC TTTAACACGC ATTAAAATTG TTTTACCTGT AGATTCATCA 7080

AATCGACGTG TTTCTTGTCC GATTTCTCCA CCATTTAACA ATTCTTCTTC TTGGCGTTTT 7140 

TCTTCATATT CTAAACCTTT ACGTACATAG TTAAATGAGT TTAAGTTTTT CAATTCGGCT 7200

TTAGTACCAA ATTTTTCTTG ACCATATGGA CGTAAAGAGA TGTTAGCATC ACAACGTAAA 7260 

GATCCCTCTT CCATCTTAAC GTCTGATACA CCAGTGTATT GAATAATTGA ACGCAATTTT 7320 

TCTAAATATG CATATGCTTC TTTAGGTGAA CGAATATCTG GTTCAGATAC GATTTCAATT 7380

AGCGGTGTAC CTTGACGGTT CAAGTCAACT AATGAATACT CACCTTTATG TGTTGACTTA 7440

CCAGCATCTT CTTCCATGTG AAGACGAGTA ATACCGATTC GTTTTGTTTC ACCGTCGACT 7500 

TCGATATCGA TATATCCATT TTCACCAATT GGTTGATCAA ATTGAGAAAT TTGATATGCT 7560 

TTTGGATTAT CTGGATAGAA ATAGTTCTTA CGGTCAAACT TAGATTCTGT TGCGATTTCC 7620 

ATATTTAGTG CCATTGCAGC ACGCATTGCC CAGTCTACTG CACGCTTATT AACAACTGGT 7680 

AAGACACCTG GATATGCTAA GTCGATAACA TTTGTATTTG AGTTAGGTTC TGCTCCAAAA 7740

TGCGCTGGTG ATGGAGAAAA CATTTTTGAG TCCGTTTTTA ACTCTACGTG AACTTCAAGT 7800

CCTATAACTG TTTCAAAATG CATGATTTCC ACTCCTTATA ATTTTTCATA AACGTCATGT 7860 

50 AAATTGTATT GTGTTTCATA TTGATAAGCG ACACGATATA ACGTTTTTTC ATCGAATGGT 7920 

TTACCAATGA ACTGTAAACC GATTGGTCGG CCATTTGATT GTCCACAAGG AACAGAAATA 7980 
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GGATCATCAA TTTCTTCACC TAAATTAAAC GCaGTgTnAG GCGCTGTTGG ACCAACTACT 8100 

ACATCATAAT TTTCGAATAC TTTATCAAAG TCATTTTTAA TCAATGTTCT AACTTTTTGA 8160 

GATTTTTTAT AGTAAGCATC ATAGTAACCT GAACTTAATG CAAATGTACC TAAGAAAATA 8220 

CGACGTTTTA CTTCTTTACC GAAACCTTCA GATCTTGACA TTTTATATAA TTCTTCTAAT 8280

GAATGAGCTT CTTTAGAATG ATAACCATAA CGAATTCCGT CAAAACGAGA AAGGTTTGAC 8340 

GAAGCTTCTG ATGATGCAAT CACGTAATAT GATGGAATAC CAAATTTAGT ATTTGGCAAT 8400

GATACTTCCT CAACGACAGC ACCTAAAGAT TTTAAAGTTT CTACAGCGTT TTGAACTGCT 8460 

TCTTTTACGT CATCAGCTAC ACCTTCACCT AAGTATTCTT TAGGTAATGC AACTTTTAAT 8520 

CCTTTAATAT CTTTACCAAT TTCAGATGTA AAGTCTACAT CATCAACTGG TGCACTTGTA 8580 

GAGTCATTAA CATCTGCACC AGAAATAGCT TCTAATAOGA TTGCATTATC TTTTACATTT 8640 

CGAGTCAATG GACCAATTTG GTCTAATGAA GATGCAAAAG CAACTAATCC AAATCGAGAT 8700

ACACGACCGT ATGTTGGTTT CATACCGACA ACGCCACAAT ATGCAGCCGG TTGTCTAATT 8760 

GAACCACCTG TGTCTGAACC TAAGCTAAAT GGTACTAAGC CAGCTGCAAc TGCTGCTGCA 8820 

GATCCACCTG ATGAACCACC TGGCACTGCT TTATGGTCAA ATGGGTTAAC TGTTTTTTTG 8880

AAATAAGATG TTTCTGTTGA ACCACCCATT GCAAACTCAT CCATATTTAA TTTACCGATT 8940 

AAAACGGCAT TTTCATTATG TAGTTTTTCC ATTACAGTAG ATTCGTAAAT TGGCACAAAA 9000 

CCTTCTAACA TTTTACTTGC ACATGTTGTT TCTAATCCGT TTGTAATAAT GTTATCTTTT 9060 

ATACCCATTG GAATACCAAA TAATTTGCCA TCCATTTGAT CTTTTGCTTG TAATTCATCC 9120

AATTCTTGCG CTTTTTTGAT TGCATTTTCT TTATCCAGCG CTAGAAAAGA CTTAATTGTT 9180 

GGATCAGTCT CTTCAATTGC ATCATATATA TCTTTAACAA CATCAGATGG TTTGATTTTT 9240 

TTGTCTTTTA TTAAAGTTAA TAAATTCTCA ACCGATTCGT AGCGAATGCT CATCTTACGC 9300

GTCCTCCTCA TTCATGATTG TAGGCACTTT AAATTGTCCA TCTTCTGTTT CTTTGGCATT 9360 

TTTCAAAGCT AATTCTTGTG GAATACCTTT AATTGCTTTA TCTTCACGTA AAACGTTTTG 9420 

TAAATCTAAA ACGTGATATG TAGGTTCAAC GCCTTCTGTA TCAGCGCTAT CATTTTGTTT 9480 

TGCAAAATCT AAAATGCTTT CTAATGTGTT GGCCATTTCT TCCGTTTCTT CAGGAGAAAT 9540 

TTGAAGTCTT GCAAGATTCG CGATATGCTC AACTTCTTCA CGTGTTACTT TTGTCATTAA 9600 

TAAAAGCCTC CTTTAAGTCA TTCATCACTA AATTGTATCA AATTTCCAAT TAAAAATCTA 9660 

AGTATTTATG AGGTGCTACT TTAATTTCAT ATAAACTGTA TAAACATTAT CATTCGTTTA 9720

TCAAATCATT TTTTATGAAA ACAACACTCT TTTAATATTA GACAACCCAA TTCAATATTA 9780 
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TATATTGGTA TGCAAGTATT TCAAAAAGAA TAAATTTAAT TTTCCTACTT TTCTAAACAT 9900 
TTATCTTTAT GTATAATGTT TTCAAGTAAC TAAATTATAA ATTAAATAAA GGGAGTGTTT 9960 

ATCATGCTTA CAATGGGGAC AGCATTAAGT CAACAAGTAG ATGCCAATTG GCAAACTTAT 10020

ATTATGATTG CCGTCTACTT CTTGATACTA ATCGTTATTG GCTTTTACGG TTACAAGCAA 10080 

GCAACTGGTA ACCTAAGCGA GTACATGTTA GGTGGACGTA tATTGGACCG TATATTACTG 10140 

CATTATCAGC TGGAGCTTCA GATATGAGTG GATGGATGAT TATGGGGCTA CCTGGTTCTG 10200 

TCTATAGCAC TGGTCTATCA GCTATGTGGA TTACAATCGG TTTAACATTA GGTGCTTATA 10260 

TAAATTACTT TGTTGTTGCT CCTAGACTTC GTGTTTATAC CGAATTAGCT GGAGATGCAA 10320 

TTACATTACC AGATTTCTTT AAAAATCGTT TAAACGATAA AAATAATGTG TTAAAGATTA 10380 

TTTCTGGATT GATTATCGTA GTATTCTTTA CATTATATAC ACATTCTGGT TTCGTATCTG 10440 

GTGGTAAACT ATTTGAAAGT GCTTTTGGAT TAGATTATCA TTTCGGTTTA ATATTAGTTG 10500

CTTTCATTGT CATTTTCTAT ACTTTCTTTG GTGGATATTT AGCTGTATCA ATTACAGATT 10560 

TCTTCCAAGG TGTCATTATG TTAATTGCGA TGGTTATGGT CCCTATTGTT GCTATGATGA 10620 

ATTTAAACGG CTGGGGAACG TTTCATGATG TAGCAGCTAT GAAACCTACA AATTTAAATT 10680

TATTTAAAGG GTTATCATTT ATAGGAATTA TCTCTCTATT TTCATGGGGA TTAGGTTATT 10740 

TCGGTCAACC TCATATCATT GTAAGGTTTA TGTCTATTAA ATCACACAAG ATGCTACCTA 10800 

AAGCTAGACG TTTAGGTATT AGCTGGATGG CTGTTGGTTT ATTAGGCGCT GTGGCTGTTG 10860

GTTTAACAGG TATTGCATTC GTACCTGCTT ATCATATTAA ACTAGAAGAT CCTGAGACAT 10920 

TATTCATCGT GATGAGTCAA GTACTCTTCC ATCCTCTTGT AGGTGGTTTC TTACTTGCTG 10980 

CGATTCTAGC TGCAATTATG AGCACGATTT CTTCACAATT ACTTGTAACA TCTAGTTCAC 11040 

TAACGGAAGA CTTTTATAAA TTAATTCGTG GTGAAGAAAA AGCTAAAACG CACCAAAAAG 11100 

AATTTGTTAT GATTGGAAGA TTATCTGTAT TAGTTGTAGC AATTGTTGCC ATCGCGATTG 11160

CATGGAATCC AAACGACACA ATTCTAAACT TAGTAGGTAA CGCTTGGGCC GGATTTGGTG 11220 

CATCGTTCAG TCCACTTGTG CTATTTGCAC TTTACTGGAA AGGTTTGACA CGTGCCGGTG 11280 

CTGTAAGTGG AATGGTTTCA GGTGCCTTAG TCGTTATCGT TTGGATTGCA TGGATTAAAC 11340

CATTGGCACA TATCAACGAA ATATTCGGCT TATATGAAAT TATTCCTGGA TTTATTGTAA 11400

GTGTAATCGT TACATATGTT GTAAGTAAAC TTACTAAAAA ACCTGGTGCA TTTGTTGAAA 11460 

CTGACTTAAA CAAAGTTCGT GACATCGTTA GAGAAAAATA ATTCATAAGT CTTAACAAAT 11520

TAAAAAGGTA CTAATGTTAA TCAAAATTAT GACTAACATT GGTACCTTTT TATTATCTTT 11580 
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AATTAAAGCA CGTGGTTGGT TACCATCTTT AATACGAATT TCATAGTTAT CGATTTTATC 11700 

GAAATATTTA TTCGCTTGTT CAGTAACGTA CTGTGTAATA CCAATTGTTT CAGCTTGTCC 11760 

ATAGTAATCG ATTGGTAAAT CTACTACTAA TCGTTGTGGC TTTTTATCAA CAAATTTAAG 11820

TTTGCCTACT GCTTGTGTGA AATTAGAAAA ATATGATTGC AAATTATCAT TAAATTGCTT 11880

GAAATTATTA TTTAAATTTT CATCATAATC TGCTGCTGTT GAAGAAGGTA ATAAAGCTGA 11940 

TTTTTCATTG ATATTATGCC ATTCATTAAG CTTTGTTTGA CTCTTTTCTG CAGTCGCTTG 12000 

AGTGATAAAT TCACCTGGTG TGATTGAATC TTCACTTGAT TGCTTATAAA TTGCAAAATG 12060

AATTGGTATA TCTTTTAAAT CATCATTTTC ACGTAACCTT GATAATATCT CACTAGCCAT 12120 

TTGTTTACCT TGCTTTTTAA CTcGCTATCA TCTAGTTTTT TACTAAAAGT CGATCCATCT 12180 

TTTTCTTTTT TATAGTAATA AACACTATTC ATAGCTAAAC CAATCGTCAT ACCTTTAATA 12240

TTCTTACCTT TTGTATCTCC ACCACCATAA AAATCTTGCT CTAAAATGTT AGATAAATAG 12300

GCTGGTGATT TTTCTGCAAT CTTTTCAGGA TCTGTTTCAC CTtCGTGTGA TGGATTAAGT 12360 

CCTAAATTTT CATTCGCTTT CTTGTCTTTT TTATCTTTTT CAGACATTTT ATCGATTTCA 12420 

CGTTTTGTAT ACTTAGGATT TAAATAGGCA TTAATTGTTT TCTTGTCCAA AAATTGACCA 12480

TCTTGATACA AATATTTATC TGTTGGAAAT ACTTCTTTAC TTAAGTTCAA TAAACCATCT 12540 

TCAAAGTCGC CGCCATTATA ACTATTTGCC ATGTTATCTT GTAAAAGTCC TCTTGCCTGG 12600 

CTTTCTTTAA ATGGTAACAA TGTACGATAG TTATCACCTT GTACATTTTT ATCCGTTGCA 12660

ATTTCTTTTA CTTGATTTGA ACTATTGTTA TGTTTTTGAT TATCTTTTCC AGCCTGGTCA 12720 

TCCTTATGGT TACCACAAGC AGCGAGTATA AAGATAGCTG TAATCAATAA TACTAATGTA 12780 

CGCTTCATCG ACATACCCCT CTAACTATTT AATTCATTTT GCTTATCTAC AAATTGTTGC 12840

TCTGTCCAAA TTTCAATACC TAAACTTTGT GCTTTTGTTA ATTTTGAACC TGCATCTTCA 12900 

CCAGCAATAA CGACATCTGT ATTTTTAGTA ACGCTACTTG TAACTTTAGC ACCTTGTGAT 12960 

GCAAGCCATT TAGATGCTTC ATTGCGTGTC ATTTGATGTA GCTTACCAGT CAGTACTATC 13020 

GTTTTACCAC TAAATTCAGG ATGTCCTTCA ATATCTGATG TTTTGATACC TTTATAAATC 13080

ATATTAACAT GTTTATCTTT TAATTTTTGA ATTAAAGCAC GAATATCTTC ATTTTCTAAA 13140

TAAGTAACTA CAGATTGTGC TACTTTATCA CCTATATCAT GAATTTCTAC TAATTCCGCT 13200 

TCAGTTACCG TTAGTAATCG ATCTATCGTT TCATATTTTT CTGCTAACAC TTGGCTCGCT 13260 

TTAACACCTA AATGCCTAAT ACCTAGACCA AATAATAAAT TTTCTAAAGA GTTGTCCTTA 13320

GCTTGTTGAA TGGCAGCTAA TAAATTATCA ACTTTTTTCT GCCCCATTCT GTCTAAAGGT 13380 
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TAAAGCTGTT GAATAATTTT AGTGCCTAAA ~CCATCAATAT TcATGGCTTG TCTTGaTACA 13500 
AAGTGnATCa ATCCtTcAAC AAGTTGTGCT TGGTCATTTT GG 13542
(2) INFORMATION FOR SEQ ID NO: 155: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1893 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 155:





CAGTAAACAC CTCTGATTAC GAATATTTAT ACATTTATTT TAACACATGC ACTGATTTAC 60

GACTACTAAA CACCTTTACG TAAAAAGGGT AAACATGGTT TATCTATCTT GGTTATCTAT 120

TTATAAATAT TTnTCATATT ACGCATAACA ATTGCTTAAA ATATGTATAA AAATGAATAT 180

ATGTGTAATA AACTTGCTAA TTATTAGATT TAATAAGCGT CAATTGTTTG AACATATTtA 240

ATTAAAATCA CATTGATATC ACAGATACGA ATATTGTCGT ATAGAAATTG AAAATTCTAT 300

TTTTTAAATG AAAGTCTTCA ACATAATTTT AAGTTTCAAC ATGAGAAAAA TCGATTAACA 360

AACAACGTCA GTTGAATATG CCTTTTGAGA CATTTCAAAC TTTACAATTG TTGCTAATCG 420

ATATATTTGC TTTTAGTGAT CCCTGCTATA AAATAAATCA ACGATTTCTA ATTkAGTGTTT 480

TGTATTGAAT TGTTCATCAA TTTGCGTTAG TTCATCCACT GCTGCGTCTC TATGATAAGT 540

CAATTTATCT TCTGCGCCAT CTTTCCCTAA TAAACTCACG TACGTACTTT TATTATTTTC 600

AAGATCGCTG CCCACTTTTT TACCTAACTT TGCTTCATCA CCATAGCAGT CTAATAAATC 660

ATCTTTAATC TGGAACATCA TACCTAAATG ATAACTATAA CTTTCTAAAT GTTCTTTAGT 720

TGTJpTCATCG ACATTAGCGA TATCTGCTGC ACTCATAACC GCAAAAGTTA ATAATGCTCC 780

TGTTTTTGTT TTGTGTATCA TTTCCAAAGT TTCAAGATCA ATTGGTTGGC CTTCGCTTTG 840

CATATCTAAC ATTTGACCGC CGACCATTCC AACATGACCA CTTGCTATTG ACAGCCGTTG 900

TAGAACTTTT ATTTTTACTT CATCAGTTAA TCTATCATCA CTTGAAATAA GTTCAAATGC 960

TTTAGTTAAT AAAGCATCAC CTGCTAATAT CGCAGTCCAC TCACCATATA CTTTATGATT 1020

TGTTAATTTT CCTCGTCGAT AATCATCATT ATCCATCGCT GGTAGGTCAT CATGAATAAG 1080

TGAATATGTA TGAATCATJT CTAGTGCAAT TGCGCTCTTC ATACCTAACT CATACTCGGT 1140

ATTTAGTGAA TCTAAAGTGA GTAATAACAG AACTGGTCGG ATGCGTTTAC CTCCAGCATT 1200

TAATGAATAC AACATACTTT CTTCTAGCTG AGTATCCATT ACTGATTTAT TTATCGCAAC 1260
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CATCCTCAGC TT CTTCTTTT ATTAAGTCAT TCACCTTTTT TTCGGCATTT TTTAAAGTTG 1380 

TGTCACAAGC TGCTGATAGT TTCATACCAC GTTGATATAA ATCTAATGAT TCCTCTAAAG 1440

ATACTGTTTC ATTATCTAAT TTTTGAACAA TTTGCTCTAA TTCTTGCATC ATTTCTTCAA 1500

AACTTTGCGT TTCTTTAGTC ATTATTACAC CTTACTTTCG TAACTTTTGC ATCTACTAAG 1560 

CCATCTTTCA TTGTTAACGT CAATTGATCA TTTTCTGTTA AATCTTTAGT ACTCGTAATG 1620 

ACTTCGTCTT TTTTATTAAC AATTGCATAT CCACGCAACA TTGTATTAGT TGGACTTAAA 1680 

TTGTTTAAGT TTTCTACTTT ATTTTTCAAA TCATTTTTAT AACTTAATAT CTTAGAATTC 1740 

AATAATTTAA CAAGTTGGTT TGTCAATTGA AGATTATnTT GTTGTTCTTG ATTAACACTA 1800 

CTTAGTAATG CTTTTAAATn ATAACGTTGG TGCAACAGCA TTAAATCGAG GCCCCGGTGG 1860 

TCCAAAGTTG CCCGAATTnG TGGTTTCAGG CCC 1893 
(2) INFORMATION FOR SEQ ID NO: 156:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 156:



AAAATATATT CCTTCACTTA ATATTCAATT AGAGAAAAAC ATGGTGATTG TAATATGTTG 60

TGCAATATTT CTGGGTGTTT TAATACTTTT TTTATTTCTG AATCGTAAGC TAAGGTTGGA 120

AATTTATAAT AATAACTCTA GTAAAGGGAA AATAATTTTA TTTCCTTCAT TAAAAAACTT 180

TTGTTTCACA ATATTTTATT ATTTTTTATT TGGCGGTCTT TCAATAATGG CTCTAAGTAT 240

GTTATTAACT TTAAATCCTC AAAATATAAT AGGCTTTATT GGTTGGTTGG TAATGACTGC 300

AGGTTTCTTT CTGTTAAACA TGTCATCGAT TATTGACAAA AAAATTTATG TATTATCTAA 360

AACTAACACG GTGGAAAAAT GATGGTTTAG CTGGATTTAC TGCAGGTTCT ATTTCGGCAA 420

TACTTGTATA TTGGACCAAT CAAAAAAATG AATTTGGAAT AAAAGATAAA AACGATTGGA 480

TAGGACATAA ACTAGACGTT GGTATAGATG CTGTAGAAAA ATCTGCAGAA AAAACAGTAG 540

ATGGTGTTGA AAATGTCATG GTGAAGCTTC AAAAAGTATT TCTAATCATA TAAGCCCTAA 600

GAAATGGAGC TGGTAAATGT TGCTATGCGA ATCTAAAATC ATCAATAAAA ACCCAAAATA 660

TAGAATTATT AAATATAATG ATGAATACTT AATGGTCGAT ATAATAAGCA CTTGGATTAG 720

TTTATTTTTT CCTTTTATTA ATTGGTTCAT CCCaAAAGaA TACGTCAAAA TTAGTAGAGA 780
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(2) INFORMATION FOR SEQ ID NO: 157: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2343 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 157: 





AGTAAGATAA TTTTCAATTA 


GAAAATATCT TACTGCTGTT 


CTCTATTTAT 


ACAATACTTC 
ACTGGCACTG. 


60 
120 


IS 


GTATTGAATG GcTTCGCTTT 


CCTAGGGTGC 


CGTCTCAGCC 


TTGGTCTTCG 




CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACAT GTAATTTTAC TTTGAAATAC 180

TTTTAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA ATAAAATGAA 240

TTTTCTGTGT TGGGTCCCTT CTTATAATTT AATAAATACC ACTAAACTAA ATTAACGAGG 300

TGCCTTATGT ATAAAAATTA TAACATGCCC CAACTACACT ACCAATAGAA ACTTCTGTTA 360

GAATCCCTCA AAATGATATT TCACGATATG TTAATGAAAT TGTTGAAACr ATACCTGATA 420

GCGAATTCGA TGAATTCAGA CATCATCGTG GCGCAACATC CTATCATCCA AAAATGATGT 480

TAAAAATCAT CTTATATGCA TATACTCAAT CTGTTTAATT ATGTTCAAAG CATTAAGGTA 540

ACAAGACAAT ATCTAAGATA TCAAAGATAG AAATTTTTTG ACGTTGTTGC TGATTGTAAA 600

CATAACCATC AATTTCATAA TTAATAGCAT CAATACGATA AATGGTTAAG CGTACTGAAT 660

CTACAAAGCC ATTATTATAA AATTTAACTT CTACAGGTTG GGCATATTGT AGCGCCTCGT 720

GCCAATTGAT CATCTGATAA TACAGGACGT GTAATTTTGT 780

TTTGGTCGAT AATGTATTGT TGAATCGTTT CGAATTGTTC GGGTAATGTT GCAAAAGGAG 840

CCCSITTAAT CATGCCTCTT CCCATAGGTA TATTGTTATC TAGTAATTCT CTTGGAACGT 900

TACGATAATC AGTTTCTTCT TCATAACTTG TCATCCTTAA TTCACCCCAA TCTGATAATT 960

ACATTATACG AACATGTGTT CTATTTTGCA ACAAAAATTT TGTGGaAGCA TAAACGCGTT 1020

AATAATTAAT GCTCGTGtAA GTAAAAAAGA GGGATTAATT AAAATCGAAT AATGaCATAT 1080

CACaGCAAAT AGTTCTTTTA AAGTAGTTAA ATAGTTTTAG CTTTAAGGAA aTGATAAaTG 1140

ATTGTwAATT CTAGCTAAAA TTTAATAAAA TGAAAATAAG ACTAACATGG AGGGGTAAAA 1200

GTAATGACAA ATGGATATAT TGGTTCTTAC ACTAAAAAGA ATGGTAAAGG GATTTATCGT 1260

TTTGAATTAA ACGAAAATCA GTCACGTATT GATTTATTAG AAACAGGATT TGAATTAGAA 1320

GCGTCTACAT ATTTGGTGCG TAATAATGAA GTTTTATATG GAATCAACAA AGAAGGAGAA 1380
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5 



TGTTTGTCTT CAAAAGCTGG TACAGGTTGT TATGTATCGA TTTCAGAAGA TAAACGATAT 1500

TTATTTGAAG CGGTATATGG TGCTGGCATC ATACGTATGT ATGAATTAAA TACGCACACA 1560

GGTGAAATTA TACGTCTAAT TCAAGAACTT GCACATGATT TTCCAACAGG TACACATGAA 1620

AGACAAGATC ATCCACACGC ACATTATATT AATCAAACTC CAGATGGTAA GTACGTTGCA 1680

GTAACAGATT TAGGTGCTGA TCGTATCGTT ACTTATAAAT TTGATGACAA CGGGTTTGAA 1740

TTTTATAAAG AATCTTTATT TAAAGATAGT GATGGGACAA GACATATTGA ATTTCATGAT 1800

AATGGAAAAT TTGCTTATGT CGTACACGAA TTATCAAATA CTGTGAGTGT TGCAGAATAT 1860

AATGACGGTA AATTTGAAGA GCTCGAGCGT CATTTAACAA TTCCTGAAAA CTTTGATGGA 1920

GATACTAAAC TTGcAGCAGT GCGTTTATCT CATGaTCAAC AATTCTTATA TGTATCTAAT 1980

AGAGGGCATG ATAGCATTGC AATTTTTAAA GTTCTTGATA ATGGTCAACA CTTAGAACTA 2040

GTAACAaTTA CTGAAaGTGG TGGTCAATTC CCAAGAGATT TTAATATTGC CTCATCAGAT 2100

GACCyTTTAG TTTgTGCTCA kGaGCaAGGA GATTCAGTTG TAACTGTTTT CGAAAGAAAT 2160

AAAGAAACAG GTAAAATTAC GCTATGTGAT AACACTCGTG TAGCATCTGA AGGTGTATGT 2220

GTCAJATTTT AATCTTTAAT TAATCATGAT AAAAAGAAAA CCATGTTTCC AAAAAATTTG 2280

TGTATACCTT GAAATTTATT GnTTTCCAGn ACATCAATTA TGGGAAGCAT GGnTTATTTT 2340

TGT 2343



(2) INFORMATION FOR SEQ ID NO: 158: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4837 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 158:



AAATTGCCAG TTGGTATCGC TTCTGGTGCA GTAGTCGAAG GTTTCTTCCA AGGTATCATT 60

CCGATTGGCT ATATCGTTAT GATGGCAGTA TTGTTATACA AAATTACTGT TGAATCTGGA 120

CAATTTTTAA CAATTCAAGA TAGTATTACA AATATTTCAC AAGACCAACG TATTCAAGTT 180

TTACTTATTG GATTTGCATT CAACGCATTT TTAGAAGGTG CAGCAGGATT TGGTGTACCA 240

ATTGCAATTT GTGCACTTTT ATTAACACAA TTAGGATTTA ATCCATTAAA AGCTGCGATG 300

TTATGTTTAG TCGCAAATGC AGCGTCTGGT GCTTTTGGTG CGATTGGTAT CCCTGTAGGT 360

GTTGTAGAAA CGTTGAAATT ACCTGGAGAT GTTTCAGTAT TAGGTGTTTC TCAATCAGCA 420
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GGTTTTAGAG GTGTTAAAGA AACATTACCA GCAATTTTAG TAGTTTCAAT CACTTATACA 540

CTTACTCAAG GATTATTAAC TGTATTCAGT GGACCTGAAT TAGCAGATAT TATTCCACCG 600

TTATTAACAA TGTTAGCATT AGCAGTATTT TCTAAAAAAT TCCAACCAAA ACACATTTAT 660

CGTGTTAATA AAGATGAAGA AATTGAACCT GCAAAAGCAC ATTCTGCAAA AGCAGTATTA 720

CATGCATGGA GCCCATTCAT TGTATTAACA GTCATTGTAA TGATTTGGAG TGCGCCATTC 780

TTTAAAAACT TATTCTTACC AAATGGTGCT TTATCATCAT TAGTATTTAA ATTCAACTTA 840

CCTGGaACAA TCAGCGAAGT TACGCATAAA CCATTAGTAT TGACTTTAAA TATTATTGGA 900

CAAACAGGTA CAGCTATTTT ATTAACTATT ATTATTACAA TTTTAATGTC TAAAAAGGTT 960

AACTTTAAAG ATGCAGGTAG ATTATTCGGC GTTACATTTA AAGAGTTGTG GTTACCAGTT 1020

CTTACAATTT GTTTCATCTT AGCAATTTCT AAAATCACAA CTTATGGTGG TTTAAGTGCA 1080

GCAATGGGTC AAGGTATTGC TAAAGCAGGT AATGTCTTCC CAGTTCTATC ACCAATTTTA 1140

GGTTGGATAG GTGTGTTTAT GACAGGATCA GTTGTAAATA ACAACTCATT ATTTGCACCA 1200

ATTCAAGCTT CTGTTGCACA ACAAATTGGA ACAAGTGGTT CACTTCTTGT ATCTGCTAAT 1260

ACAGTTGGTG GTGTAGCGGC AAAATTGATT TCACCACAAT CAATTGCAAT TGCAACTGCA 1320

GCAGTAAAAC AAGTTGGTAA GGAATCAGAA TTATTAAAAA TGACATTGAA ATACAGTGTA 1380

TGTTTACTAA TATTCATCTG TATTTGGACT TTCATCTTGT CATTATTATA AAAAAACGTA 1440

TTTCAAAATA TAAATATACA GAAGGTGAGA TGTTTTCTAA CATCTCATCT ■i-rrrriTATG 1500

GATCATTAAT GAAAGAAGTT TGACATTATA ATAATGGTAG CGCTTTATGT TAAAATGAAT 1560

AGTGAGTAAT CAGCAATCAA ATTAAATTGG TTGATAGCTG TTAAGGTTTG TGGTTTTGTC 1620

TTTGTGCTAT CGCnCATAAA GTATATAATT AAAGTAGTTT CGTTATTATA AAATATTAAT 1680

ATACATAGTA GATAGTAATA GAGCATCACC ATGGGAACCT ATTGAGACAC TTATTGATTT 1740

AAAGTGGTAT TAATATGTCG TATTTCTCGA ACGTTCCATT ATTCATTTTA AAAAGGGGGA 1800

CTGTATTTGT TATGACAACA CAACATAGCA AAACAGATGT CATCTTAATT GGTGGCGGTA 1860

TTATGAGTGC aCATTAGGAA CATTACTTAA AGAATTATCA CCTGAGAAAA ATATTAAAGT 1920

GTTTGAAAAA TTAGCACAAC CTGGCGAAGA GAGTTCAAAT GTATGGAATA ATGCCGGTAC 1980

AGGGCATTCA GCACTTTGCG AGTTGAACTA TACAAAAGAA GGTAAGGATG GCACAGTTGA 2040

TTGTAGTAAA GCAATTAAGA TAAATGAGCA GTACCAAATT TCAAAACAGT TTTGGGCATA 2100

TTTAGTTAAA ACAGGACAAT TAGATAACCC AGATCGCTTT ATTCAAGCGG TGCCACACAT 2160

GAGTTTTGTC ATTGGCGAAG ATAATGTAGC TTTTATAAAA AGTCGTGTTG CAACGTTAAA 2220
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GGTACCGTTA ATGATTGAAG GTCGTAAGTC TGATGAACCA ATTGCTTTAA CTTATGATGA 2340

AACTGGTACa gATGTTAACT TTGGTGCGTT AACTGCAAAG TTATTTGATA ATTTAGAGCA 2400 

ACGTGGTGTG GGAATTCAAT ATAAGCAGAA TGTATTAGAC ATCAAGAAAC AGAAATCTGG 2460 

GGTATGGCTA GTTAAAGTTA AAGATTTAGA AACTAATGAA ACGACAACAT ATGAATCTGA 2520 

TTTTGTATTT ATTGGTGCTG GCGGTGCGAG TTTACCATTA GTCCAAAAGA CTGGGATTAA 2580 

ACAATCAAAA CATATTGGTG GTTTCCCGGT AAGTGGATTA TTCCTGCGCT GTACAAATCA 2640 

AGAAGTGATT GATCGTCATC ATGCTAAAGT GTACGGAAAA GCAGCAGTGG GTGCGCCACC 2700 

AATGTCAGTG CCGCACTTAG ATACACGTTT TGTAGACGGC AAGCGTTCAT TGTTATTTGG 2760 

TCCATTTGCA GGTTTCTCAC CTAAATTTTT AAAAACAGGT TCACATATGG ATTTAATTAA 2820 

ATCGGTTAAA CCAAATAATA TCGTGACGAT GTTATCTGCA GGTATCAAAG AAATGAGTCT 2880 

TACGAAGTAT TTAGTGTCAC AATTGATGTT ATCTAATGAT GAGCGTATGG ATGATTTAAG 2940 

AGTCTTTTTC CCAAATGCTA AAAATGAAGA TTGGGAAGTG ATTACAGCAG GGCAACGTGT 3000

CCAAGTAATC AAGGATACTG AGGATTCTAA AGGTAACTTA CAATTTGGTA CTGAAGTTAT 3060

TACGTCAGAT GATGGCACAT TAGCTGCATT ACTTGGTGCA TCACCTGGTG CGTCAACAGC 3120

TGTAGATATT ATGTTTGATG TTTTACAGAG ATGCTATCGT GATGAATTCA AAGGATGGGA 3180 

ACCAAAGATT AAAGAAATGG TGCCGTCATT TGGTTATCGC tTAACAGATC ATGAGGATTT 3240

ATATCATAAA ATTAATGAAG AAGTAACTAA GTATTTACAA GTTAAATAAT AAACGAAACG 3300

GTAATGTCTT TTTTAATGTG ATAGACATTA CCGTTTTTTA GTGGTTAATA AAAATCATTT 3360 

TAATTGTTTC AGTTGCTTGT TAATAGTGTC TACGTAGTTC TTGTTTTTAA AGAATTGAAT 3420 

TATCCAAATT AATACATAAA CCACAATGAA GATAATTGTG AATATGATTA GATAATGCAC 3480 

TGTTAGTGGA AACCAACCGG CAAGCATTGC TAAAGGCAAG AATCCGACAT ACGTTGTTAT 3540

GAAATGCATT ATAGTTGCTT TAGTAATGCT CCAATCTGTG TATTTAAAGA TAAAATCTCC 3600 

AAGGAAAAAG ACGACGCCTA TGAGTAACCA TAAAATGATA GAAATCAACA TTACGGTAGT 3660 

TTCTGTGAAA TGCGTATAAT ACAATATGCC AATAGTTGAT TGTGGGTTCA GTGGATAATA 3720 

TTTGCCGTCT GCAAATAACA TACTAAAGAA CAGTGAAAGG GACAAACCAA TGATTAAGCT 3780 

AATAAATAAT GAGTTTTTCA AATTTTTCAT ATTGATAAGC GCTCCTTTAT AGATTTTAAA 3840 

TAACGTCTAG AAGAATAGGT GTAGTGTGCA TCTTTAAGAT ACATACGTAT AAGTCCATTT 3900

GGCTCTAATA ATAATTTTTC AATGTAATAC TTGTTGACGA TTTCTGATTT GGAAATGCGA 3960

ATGAAATGTT GTGGTAACTG TTTTTCTAGT TCATAAAGTC GTAATTTTAG TTTGAATTTT 4020
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ACATTAATGA TATGGATTTC TTTGTCTATG TATCCGACTA ATGTATGTGA TTTGTCTAAA 4140

TCATTGACTG CATTAATAAT ACTTTGAACG TTATCATTCA TTTTAGGTGC ATGTATATCA 4200

ATATAAGATT CCGTCTCATT TGCATTGATA AATAAATTGA GTTTCATCAT AGGTTAATGC 4260

CTCCTTCAAA ATTATTAAAC CATAAATGAC CATCGATATA TTTAAATTTT GTTGAATGGT 4320

AGAAATTAAA TGTTAAGTGG CTAGAAAGCG CTAATCAATA TAAAAGATAC CTCCTGAAAT 4380

AAAAACAGAA ATGTTTTTTC AGGAGGTAGA GATTAAAGTG AATTATTTGG CAGTGTAATA 4440

GTAAAGGTGG TTACATACTC GTTACTTTGT GTGAATTGGA TTGTACCATG ATGCAATTCA 4500

ATGATGGATT TTGTAATTGC AAGACCTAAA CCATTGCTAT TATCATGTTT GCTCACTTTA 4560

Innnnnwl 1 \**v\i\±nnt\\~\3

TTGGTGAACC ATCATTACTT

ATTGTGAAAA TGATATTGTT GTGACTATGT TGCAAAGCGA TGTCAATGGC ACCACCAACA 4680

TCTGTATACT TAATAGCATT TATTAATAAA TTACTCAATG CTTGATGTAA CAAACGTTGA 4740

TTTCCTAGGA AATTGATGAT TCTAGGTCAG CTAAnATGAT TAACGACTTT TCATCAGCAG 4800

CAnATTGTTC ATGTCGAATG ATATCnTTAA TGAGCTG 4837



(2) INFORMATION FOR SEQ ID NO: 159:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1600 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 159: 

ACAATTATTG GATTATTATC AAGCAACGTT AATGGATGAC TTCCACTTAC AACAGAAATG 60

CCCATAGATT CTAAATCTtT TGCATGAGCA TCTTGTGATA AGTCTTTTCC ATCATTGAGA 120

GTTACATTCG CACCTAATTT ACTTAATAAT TTAGCTGCTT CATAACCACT TTTTGCCAAA 180

CCGACAACTA ATACATTTTT ATTTTCTAAC CCTGTATAAT TAAGCATCTT AATGCACTCC 240

AATCCATAAA CCGATTAAAC CTGAAATCAG ACCAACAGCC CAAAATACTG TAACTACTTT 300

CCATTCGCTC CATCCTATCA ATTCAAAATG ATGATGAATC GGACTCATTT TAAATATACG 360

CTTTCCAGTC AATTTAAAGC TAGCGACTTG TAACATAACA GATAATGTTT CAATTACGAA 420

TACTAAACCT ATAAAAATTA ATGATAATTC CTGATTAAGC ATGATTGAAA TGGTAGCAAA 480

TATACCACCT AAAGCTAAGC TACCTGTATC TCCCATAAAC ACTTTAGCAG GGTTAATGTT 540

ATATGGTAAA AATCCTAAAA GTGCAAACAA CATAATGATA CAGAAAATAC CAATTGCCGT 600
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10 






TGCTAATCCA TCTAAACCAT CTGTTAAATT TACTGCATTA GAAAAACCTA CTTGCCAAAA 720 

AACAATGAAA ATAACATATG CAAATGATAG TGGGATTGCT ACATTCGTAA ATGGAATATG 780 

TATGCTCGTA GAAAAATTCA CCAAATGAAA CACATTACTT AAAACAAAGA ATATAATCGC 840

AATACCAATT TGCGCCAAAA ACTTCTGTTT ACTTGTTAAA CCTTGGTTAT TCTTTTTAAC 900 

AACAATAATA TAATCATCTA TAAAACCAAT TAACCCAAAA CCAATCGTCA CAAATAATAA 960 

CAGTATGATT GGATTAGCTT GATCTACAAA TATAATAGCC ACCAAAGACG TTATCACAAT 1020

ACTTAATAGA AATGTTAGTC CACCCATCGT TGGTGTACCA GTCTTCTTCA TATGGCTTTG 1080 

TGGACCTTCT TCTCGAATAC TTTGACCAAA TTTCATCCTT TTTAATGTAG GTATTAAAAC 1140

AGGTACCAAA ACAAATGTAA TCACTAGCGC TAATAACGCA TATACAAAAA TCATAACTAT 1200 

CTCCTCTTCT TAATCCAGAC TTTTTTAACC ACTAATATAT TATCAATTTT TCAATTAAAT 1260 

AAACAAAGTT GTAATCAAAA TTTATAATTT TTCTTTTTTA CGGCATAAGA GGCCAGTATA 1320

AAAAGTTTGC CTATAACAAA CAAGTTAATC TGACCTCGTC TACCTTAAAA TTCTCTATCA 1380 

ACACTTATTT ATAAAGATTA AATGAAGATG TTGTTTTCTA TCACAGCATT ACTTTAGTAA 1440

AAACAAATAG TGACAATACA TCCTAATTTA ATGTAGCCAT TCTTGTTAGT CCGACTTATC 1500

CTTGTCAGTT TTACTGTCAG ATTTCnTCTT ATCATCTGAA TTTGAATCAG AATTATTCGT 1560 

CGAATTGCTG TCTACATTCT CTGGATGGAA AATTCTACGT 1600 
(2) INFORMATION FOR SEQ ID NO: 160: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1186 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 160: 


ATTGCCTTTG TTTTAATTTT AAATCAAAAT mGCCTATGAA AGATTTAAAT CAATTAATTT 60 

CTATAATATT ATCATTTTTA AAGCATATCA TTGTTTAGTT TTTTTATAAT TGGATAAATA 120 

CTAATAGTTA CTTTATAAAA CATTACATAG AGAAAGGTTA AGGAGTGCAC ATGTCGAAAA 180

AGGATCACTC TTCTTCAAAA TACCTTAATT CTGTTAAGGA AGCGCAAGAG GAGTCAAAAA 240

AGAAAAATAA AAGTAATCCC AAAATTGATG TTGATCGTAC ATATATTGAA CCTCAACAAT 300

TCCAATCTAA GAAACCTAAA AAAGATGATC AGGTTTTCTT CTTATCAAGA TTAAATAAAC 360

CTGCAAAATA TAAGAAAGAC TCTAATTTCT TATCATATCT CATCTATCGC ATAGGAAAAG 420 
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TGTTGCTTTT CCTATTAACA TTATTACCAT TTTTCAATAT TAAGCAGAGT CAAATTACTA 540 

ATATGTTAAG CAATGCACCC GCTGAAACAT CTACTCTAAT TAAGAGTGTA ATTGGTGATA 600 

TAACTCAAAA CTCCAGTGGT GGCTTATTAT CTATCGGTTT GATTTTAGCA ATTTGGTCAG 660 

CTTCAAATGG AATGACTGCA ATTATGAATT CTTTCAATGT TGCTTACGAT GTAGAAGATA 720 

GCCGTAATGG AATCGTATTA AAACTACTAA GTGTTGTCTT CACTGTAGTT ATGGGCGTTG 780 

TGTTTGTAGT TGCTCTAGCA TTACCAACGC TTGGTTCTGT AATTAGTCAT TTCCTATTCG 840

GTCCACTTGG aTTTGACGAA CAAGTGAAAT GGATTTTTAA CCTTATTAGA ATTGTGTTAC 900 

CAATCATTAT TATATTTATC ATATTTATCG TGTTATATTC GGTTGCACCT AACGTTAAAA 960 

CGAAGCTTAA GTCAGTATTA CCAGGTGCAG TATTTACTTC AATTATTTGG TTAGCTGGTT 1020 

CATTTGGTTT TGGTTGGTAT ATTTCAAATT TTGGTAACTA TTCTAAAACA TATGGCAGTA 1080 

TCGCGGGTAT CATCATTTTG TTACTATGGT TATATATCAC AAGTTTTATT ATAATTGTCG 1140

GnGCTGAAAT CAATGCAATC ATTCATCAGC GTAGTGTAAT TAAAGG 1186 

(2) INFORMATION FOR SEQ ID NO: 161: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7872 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 161: 

TCTTGAGCCA TCTTTTGAGC TAACTGACTA GATTGATACC CAAAAATCAT AGTTACCAAC 60

ATAAACTTTA ATTTTACCGA AGTCTAAATC AGCGATATGA GTACATACAT TATTTAAGAA 120 

ATGACGGTCA TGCGATACTA CGATAACAGT ATTATCAAAG TTAATTAAGA AATCTTCTAA 180 

CCAACTGATT GCTGGAATAT CGAGACCGTT AGTAGGCTCA TCCAGTAATA GTACGTCTGG 240

TTCACCGAAT AAACTTTGCG CTAATAATAC TTTAATTTTT TGGTTGTTTT CTAATTCAGC 300 

CATTTTTTTA TCGTGTAAAG TTGGATCGAT ACCTAAACCA GATAAAAGGT TAGCAGCATC 360 

AGCTTCAGCA TTCCAACCAT TCATTTCTGC AAATTCACCT TCAAGTTCAG CAGCACGGAT 420

ACCATCTTCA TCACTGAAAT CTGGCTTCAT ATAGATTTCA TCTTTTTCTT TCATAACCTC 480

ATAAAGACGT TCGTGACCTT TAATTACAAC ATCAAGCACG CGTTCATCTT CATAAGCATA 540

GTGGTCCTGT TTTAAAACAG CTAGACGTTC ATTTTTCCCT AATGAAACAT GTCCTGTTTG 600

AGAATCTAAT TCACCAGATA ATATTTTTAA GAATGTTGAT TTACCTGCAC CATTCGCACC 660 
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ATCTCCAAAA CGTAAACTCA CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT 780

TCGATATTCT AACGGAAGAA TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT 840 

TGTAATGATA GAATGACTCA CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA 900

ATAACATGGC ATTAGACAAA GATATAGTAG GTTCTATAGA ATTCCTTGAA GTAGTAGGGT 960 

TACAAGGTTC AACTTACCTT TTAAAAGGAC CAAACGGTGA AAACGTAAAG TTAAACCAAT 1020

CAGAAATGAA CGATGATGAT GAATTAGAAG TAGGTGAAGA ATATAGTTTC TTCATTTATC 1080

CAAACCGTTC AGGTGAATTA TTTGCAACTC AAAATATGCC TGATATTACG AAAGATAAAT 1140

ATGACTTTGC TAAAGTACTT AAAACGGATC GCGATGGGGC ACGTATAGAT GTTGGATTAC 1200

CCCGTGAAGT GTTAGTACCA TGGGAAGATT TACCAAAAGT GAAATCACTA TGGCCACAAC 1260

CTGGTGATTA TTTGCTAGTT ACATTACGAA TTGACCGTGA GAATCATATG TATGGACGTT 1320

TAGCGAGTGA ATCTGTTGTA GAAAATATGT TTACACCTGT ACACGACGAT AATTTAAAAA 1380

ACGAaGTCAT TGAAGCCAAA CCTTACCGCG TATTACGAAT TGGTAGCTTT TTATTAAGCG 1440

AATCAGGTTA CAAAATTTTC GTACATGAAT CAGAACGTAA AGCTGAACCA AGATTAGGTG 1500 

AATCTGTTCA AGTTAGAATT ATCGGGCATA ATGATAAAGG TGAGTTAAAT GGTTCATTTT 1560

TACCACTTGC ACATGAACGT TTAGACGATG ACGGCCAAGT CATCTTTGAT TTACTAGTTG 1620 

AATATGATGG TGAATTACCA TTCTGGGACA AATCAAGCCC TGAAGCGATT AAAGAAGTAT 1680 

TCAATATGAG TAAAGGTTCA TTCAAACGTG CAATCGGTCA CTTATATAAA CAGAAGATTA 1740

TTAATATAGA AACAGGTAAA ATCGCTTTAA CTAAAAAAGG TTGGAGTCGA ATGGACTCAA 1800 

AAGAATAATC ATTTTTACAC GTGTCGTAGG ATGCGTGTTT TTTTTATTCA ATATTAAATC 1860 


GGACAGATGA AGTAGTTTTT TAAACATTCC TTTCAAAGTA AAAAATTAAA TAATTCAAAC 1920 

GAATAGGCTG GGaCATTAAG TTCTTAGGCA ATGTAAAAAA GCTGATTTCT ATTAATTATT 1980

TGATGGAAAT CAGCTTTTTT GATATGTATT TTATAATGTA CAGCTCGTTG AGCTGCTATT 2040

TTCCTTATAT TAAGTGCCAT TAATACAAAA CCTAGCTCTC GTTTAACTTT ATTTATTCCT 2100

CGAACTGACA TTCGAGTGAA aCCCAAAATA GCCTTCATAA ATCCAAAAAC AGGCTCTACA 2160

TAAATTTTTC TATGACTATA GATTTTTTTC GTTTCTGGTT CAGAAAGCTT TTGaTTAATT 2220

TGGGCTTTAA TGTATTTCAA AGTAAAATTA CATGTTAATA CGTAGTATTA ATGGCGAGAC 2280

TCCTGAGGGA GCAGTGCCAG TCGAAGACAG GGGCCCCAAC ACAGAAGcTG ACATATAGTC 2340

AGCTTACAAC AATGTGCCGG TTGGGGTGGC TGAGACGGCA CCCTAGGAAG GGACCCGTCA 2400

TCAAAAATTC TATTTATAGA ATTTTACAGT AATGTGACAG ACGGGCAAAG CGAAgCCATT 2460 
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CTTACTGCTG TTTTTTTAGG GATCTATGTC CCAGCCATTT TTGTATTCAT ATTTAAATTT 2580 

CGATAATTTT TCAGGAAGCA TTTTAATTTT ACTAATGAAG CAATATTTTT TAGATTAACA 2640

AAAATTAATA TTTACATTTT CTTAACAATT TTTTATGTAA CATTTACAGT TTCTAAAAAT 2700

GAGGTTAATA ATTCAAGGTT AAGATAAAGA TGTAATCAAT ACAAATACTA TTTGTTGTTC 2760

ATACAGGGAG GATATTTCAA TGAAAAAATG GCAATTTGTT GGTACTACAG CTTTAGGTGC 2820

AACACTATTA TTAGGTGCTT GTGGTGGCGG TAATGGTGGC AGTGGTAATA GTGATTTAAA 2880

AGGGGAAGCT AAAGGTGATG GCTCATCAAC AGTAGCACCA ATTGTGGAGA AATTAAATGA 2940 

AAAATGGGCT CAAGATCACT CGGATGCTAA AATCTCAGCA GGACAAGCTG GTACAGGTGC 3000 

TGGTTTCCAA AAATTCATTG CAGGAGATAT CGACTTCGCT GATGCTTCTA GACCAATTAA 3060 

AGATGAAGAG AAGCAAAAAT TACAAGATAA GAATATCAAA TACAAAGAAT TCAAAATTGC 3120 

GCAAGATGGT GTAACGGTTG CTGTAAATAA AGAAAATGAT TTTGTAGATG AATTAGACAA 3180

ACAGCAATTA AAAGCAATTT ATTCTGGAAA AGCTAAAACA TGGAAAGATG TTAATAGTAA 3240 

ATGGCCAGAT AAAAAAATAA ATGCTGTATC ACCAAACTCA AGTCATGGTA CTTATGACTT 3300

CTTTGAAAAT GAAGTAATGA ATAAAGAAGA TATTAAAGCA GAAAAAAATG CTGATACAAA 3360

TGCTATCGTT TCTTCTGTAA CGAAAAACAA AGAGGGAATC GGATACTTTG GATATAACTT 3420

CTACGTACAA AATAAAGATA AATTAAAAGA AGTTAAAATC AAAGATGAAA ATGGTAAAGC 3480

AACAGAGCCT ACGAAAAAAA CAATTcAAGA TAACTCTTAT GCATTAAGTA GACCATTATT 3540

CATTTATGTA AATGAAAAAG CATTGAAAGA TAATAAAGTA ATGTCAGAAT TTATCAAATT 3600 

CGTCTTAGAA GATAAAGGTA AAGCAGCTGA AGAAGCTGGA TATGTAGCAG CACCAGAGAA 3660 

AACATACAAA TCACAATTAG ATGATTTAAA AGCATTTATT GATAAAAATC AAAAATCAGA 3720 

CGACAAGAAA TCTGATGATA AAAAGTCTGA AGACAAAAAA TAATAAGACG CAATTTCAAA 3780 

TGTGTCTTGA AACATGATTT TGATGGTGAA TCATTATTTA GAGTACAAAG CTTGATTTAT 3840

CGAGACGCTG ATTTTGACAT TCAGTTAGTC TAcAAGCTTA TCAACTTAAA ATAGTGGTTC 3900 

ATCATTATTT TACAAATCTA ATTATTTTGG GAGTAATAGA AAGAGGTTTG ATTATGACTT 3960

CATCTACTAA TGTTAAAGCT TTAATCGAAA AAAATAATAA TAAAAAAGGA AAGCATAATG 4020

ACAAAATTAT ACCAGTTATT TTAGCCGCAA TTTCAGCGAT TTCCATTTTA ACAACACTAG 4080

GTATATTAAT CACATTGCTT TTAGAAACCA TCACTTTTTT CACCAGAATT CCAATAACTG 4140

AATTTCTATT TTCTACTACT TGGAATCCTA CCGGTTCAGA CCCTAAGTTT GGTATCTGGG 4200

CATTGATAAT AGGGACTTTA AAAATCACAG TTATTGCGAC TATATTTGCA GTTCCAGTCG 4260 
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AACCGATATT AGAAATTTTA GCAGGAATAC CAACAATTGT GTTTGGTTTC TTTGCATTAA 4380 

CCTTTGTTAC ACCAGTATTA AGATCTTTCA TACCAGGTCT TGGAGAGTTT AATGCTATAA 4440

GTCCCGGCTT AGTTGTCGGT ATTATGATTG TCCCTCTCAT CACAAGTTTG AGTGAGGaTG 4500

CAATGGCATC TGTACCAAAT AAAATTCGAG AAGGTGCCTA TGGACTTGGA GCAACTAAAT 4560

TAGAAGTAGC AACTAAAGTC GTACTTCCCG CAGCAACATC AGGTATTGTA GCTTCAATCG 4620

TTCTCGCGAT TTCAAGAGCA ATTGGAGAAA CGATGATTGT ATCATTAGCG GCAGGTAGTT 4680

CGCCAACAGC TTCATTAAGT TTAACAAGTT CGATTCAAAC AATGACTGGA TATATTGTTG 4740

AGATAGCGAC AGGTGATGCA ACATTTGGAT CAAATATTTA TTACAGTATT TATGCTGTAG 4800

GGTTCACACT ATTTATCTTT ACCTTAATCA TGAATTTACT TTCTCAGTGG ATTTCTAAGC 4860

GTTTTAGGGA GGAGTATTAA TATGGAAACG ACAGATAATA ATAGACAATC ACTCGTCGAT 4920

CAACAACTTG TCCAAAAACA TTTATCATCC AGAACGGTTA AAAATAAAGT GTTCAAACTC 4980

ATATTTTTAG CATGTACATT ATTAGGACTT GTCGTACTTA TTGCGTTGTT AACTCAAACA 5040

TTGATTAAAG GGGTAAGTCA TTTAAATTTA CAGTTTTTCA CTAATTTTTC TTCTTCAACA 5100 

CCATCTATGG CTGGCGTTAA AGGCGCGTTA ATCGGTTCAC TTTGGTTAAT GTTAAGTATC 5160

ATTCCATTAT CAATCATCCT AGGAATAGGT ACAGCTATAT ACTTAGAAGA ATATGCGAAA 5220 

AACAACAAAT TTACTCAGTT TGTTAAAATC AGTATTTCCA ATTTAGCTGG TGTACCATCA 5280 

GTTGTATTTG GGTTATTAGG TTATACTTTG TTCGTTGGTG GTGCAGGGAT TGAAGCCTTG 5340

AAAATGGGTA ACAGTATATT GGCAGCAGCG CTAACAATGA CCTTACTGAT ATTACCAATT 5400 

ATTATTGT7T CAAGTCAGGA AGCAATTAGA GCTGTACCTA ACTCAGTACG CGAACTTcTT 5460


ACGGCTTAGG TGCTAATAAA TGGCAAACGA TAAGACGTGT TGTCTTACCA GCAGCGTTAC 5520 

CTGGTATTTT AACTGGATTC ATTTTGTCTC TTTCAAGAGC ACTGGGAGAA ACAGCGCCAC 5580 

TTGTGCTAAT CGGTATACCG ACTATATTAT TGGCAACACC TAGAAGTATA TTGGATCAAT 5640 


TTTCAGCATT ACCTATCCAA ATATTTACTT GGGCGAAAAT GCCTCAAGAA GAATTCCAGA 5700 

ATGTTGCATC GGCAGGCATT ATCGTTTTAC TAGTTATCTT AATCTTAATG AATGGCGTTG 5760 

CGATTATTTT ACGTAACAAA TTTAGTAAAA AATTCTAATT TAAACAATCA ATCTCATTTA 5820

TCTATTAAAA AGGGAGTTTT AAATATGGCG CAAACACTTG CACAAACTAA ACAAATATCT 5880 

CAAAGTCATA CGTTTGATGT CTCACAAAGT CATCATAAAA CACCAGATGA TACAAACTCA 5940

CATTCTGTTA TATATTCAAC ACAAAATTTA GACTTATGGT ATGGCGAAAA TCATGCATTA 6000

CAAAATATTA ATTTAGATAT TTATGAAAAC CAAATTACTG CCATTATAGG TCCATCTGGT 6060 
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AAAACAGCTG GTAAAATATT ATATCGAGAT CAAGACATTT TTGATCAAAA ATATTCTAAA 6180 

GAACAATTAC GTACAAATGT GGGCATGGTC TTTCAACAAC CTAATCCATT TCCAAAATCA 6240 

ATATACGATA ATATTACTTA CGGTCCAAAG ATTCAOGGTA TTAAAAATAA AAAAGTTCTT 6300

GATGAAATCG TTGAGAAATC ATTACGTGGC GCTGCAATTT GGGATGAATT AAAGGATAGG 6360 

TtGCACACAA ATGCATATAG TTTATCCGGT GGGCAACAAC AACGTGTTTG TATCGCGCGT 6420 

TGTTTAGCAA TTGAACCTGA AGTCATTTTA ATGGATGAAC CGACATCAGC ATTAGATCCA 6480

ATCTCAACAT TAAGAGTAGA AGAGTTGGTT CAAGAACTAA AAGAAAAGTA TACAATTATT 6540 

ATGGTtACAC ATAATATGCA ACAAGCAGCT CGTGTATCAG ATAAAACTGC ATTTTTCTTA 6600 

AATGGTTATG TCAATGAATA TGATGATACT GATAAAATTT TCTCTAACCC ATCAAACAAG 6660

AAAACAGAAG ATTATATTTC AGGAAGGTTT GGTTGATATA TAATGGCAAT AATTAGACAA 6720 

CGATATCAGG AGCAACTTGA TGATTTAATA AAAGAATTAC GTCGGTTAGG TGCaAATGTC 6780 

TATGTGAGTA TTGaAAATGG TATAAAAtCA TTAAGTATTG aCGATAGAGG cTTTGCACGA 6840

CAAACAGTTA AAAACGATAA ACATATCAAT CAATTAAATT ATGATATTAA TGAGCGAGTT 6900

ATCATGTTAA TTACAAAGCA ACAGCCCATT GCGAGTGATT TGCGTATGAT GATTTCTTCA 6960

TTAAAAATCG CCTCCGATTT AGAAAGAATA GGAGATAATG CCTCGAGTAT TGCCAATATT 7020

CGATTGCGTA CAAAGATTAC AGATGATTAT GTGTTAACCC GTTTAAAGAC AATGGGTAAA 7080 

TTAGCTATGT TAATGTTAAA GGACTTAGAT CAAGCATTTA AAAAGAAAGA TACCGTATTA 7140

ATAAGAGAAA TAATTGAGCG TGATGAAGAT ATCGATGACT TATATAGTCA TATTATTAAC 7200 

GCAACGTATC TTATTGATAA CGtCCATTTG TCGCTGCACA AGCTCATTTA GCAGCAAGAC 7260 

ATTTAGAACG TATTGGTGAT CATATTATTA ACATCGCTGA AAGTGTTTAT TTTTATTTAA 7320 

CAGGTACACA TTACGAACAA TAACTTAAAG TTATTACTAT AAAATCCCTT ACGATAAATA 7380

TATATTTCTA TTATTCATAA ACCCTCAAAA AAACCAAGAT TCTCACAATT AGTAATGTGA 7440

AAATCTTGGT TTATATTGTT CTACTATAAA TTGTCTCGCA TCTTAGTTAT TTGCTTGCTC 7500 

AATTTCATCT GTTAATTTTT CAACTTCATC GACTAAATCA GAAATATATT GAATTGTAGA 7560 

TTTAAGTGGC TGTTCTGTAG TAATGTCTAC ACCTGCAATG TTTGCAAGTT CGACAGGTGA 7620 

TACACTACCA CCTTTTTTCA ATGTTTCTAA CCAAGCATCA ACAGCTGGTT GGCCTTCATT 7680

TTTAATCTTT TGAGAAACGA CAGTTCCGAT TGTTAAGCCA GCAGAATACG TATACGAATA 7740

TAATCCCATA TAGTAATGAG GTTGACGCAT CCATGTTAAT TCAGCACCCT CAGTCATGTC 7800

TACTGCATCT CCAAAAAATT GTTTATAAAC ATTTAGCATT ATTTCATTTA ATGTnCGGCG 7860 
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(2) INFORMATION FOR SEQ ID NO: 162: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 162: 





TTTTTTCTTT TCTTCATTTG AAAATTGATC ATTCAGCAAT ATAAGCGTAT TTGTTAATGA 60

TTTAGGTGTT CCAATTTCAT AATCCCACCA ATTTAAGTTG GTATTCTTGC CAGTTGTTTT 120

AGTAAAATTC TCACTTAATT CTTTTACTTT TTTATCTGGT TCTTTTCCAT ATGCATTTTT 180

ATGCAGCCAC TCAAGGGCAT CTTTCACTTT CTTCTTATTT TCGTCAGTAT TTAAAGTGGT 240

TTTAGGATTC CTCATCGCTT CTGCGATTTT CTCAATATTA CGATAGGTAC GAGTCATATG 300

AGAAGAATTA GTTTCAAGGG TTTCCGCTCC TGACCACAAG TATTTCCTAC CACTTTCAGT 360

TTTCATTTCC TTGAGTAAAT TCGTCGCCTC TTTCTCTGTA GCATCAAACT TCTTCTTCAT 420

ATCTGGATTA TTCTCATCAT ACTTATCATA ACCATAGTTA ACGTCCAGCC ATGTGTTCCT 480

CAATTTTTCA TAATCTGGCG TTTGAACATT CGTATCAGCC ACAGCGATTT GATGTTTATC 540

AACACTTCTG AATTCACCAC CATTCAAAGT AATCACACCA GCCATTAATA ACGTAATGGT 600

GGATAATTTT TGCCATTTCT TTATTCTATA TGTCATTGaC ATGTCTCCTT TTTGTGTTGC 660

GCGTGCGCAA TGAATATTAT GATTAAATAA TGATTCAATT TTTCAAAATT CGTTAACGTA 720

TACAAATGAC TGTCTACTGT CAAACAATCC ACAAAGAATG TTGATGtCAT ATaAACAATC 780

GATCACCCAA ATTTTCCG
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(2) INFORMATION FOR SEQ ID NO: 163: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5132 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 163: 
TACAGGTTTT ACTATAATGG ATGGTATTTT GGCTAAACGA CATTGGTTTA GTCTTCTTTT 60 
TTTnACTTCC TAnATTTACA ATGGTATAAA TAATAATGCT ATATTTAGAA TGATGAGTAT 120
ACTTACTGAA ACTAAATTAA AAGTGTCTGG TTCTTTACTA AAGATAGCTG CTATCCTTGC 180 






AATACAAGTT CCAATGAGCG CAATTAAAAG TACTAACCCA ACGATGAAAC TCTGTTTGTC 300

ACTTAACTCA AAGAAACTAT AGATAGGATA TTTTTTAATA ATCAAGCCAC CTAAAATCAT 360

CCATAAAAAT ACGATAATTC CATAAGTCAC ATTTATAACA TACGTTATTT TTTGGTCACC 420

AAATCGGACT AATGTATTTC GTAGAATCAG CATACCAATG ACAACACCTA AAATAACGAT 480

ACTAGCTATA TAAAGTAAAA ATGCAATTGT CACATCAAAT GTACCCAAAT CTAAAAACCT 540

AGGAATTAyA AyGACTGCTA AAATAAAAGC GAAGyACAAA GTAATATAkT TATACAAACC 600

GGTAGTAAGA CTTATCTCAG GTGATAATTG ATCAGCCATT GACTTAATCG GTGTATTAAT 660

AATTGAACTT GTATCTTCGT taittttttc AGCCATAGTT AAATGATCTT CGAGCTCTTC 720

CAATAACTCT TCTACTTCTG CTTCAGTCTT ACCTCTAAAT AACAATTCAA CACGTAATTT 780

TTCTAAAAAA TCTTGAGATT GTTTACTTAA CATCGTTTTC CCCTCCAAAC AAGTTAATCA 840

TCCCTTTATT CAAAACTTGC CATTTCGATT TAAATACTTT TAGTTCCTTT AAACCTGAAT 900

CGGTAATCGT ATAGTATTTC CGCCTCGGGC CGCCATTACT AGA'n-rrriT ATTGTCGTAT 960

CAACGTATCC TTTTTTGTTT AAACGCATTA AAACTGGATA AATACTACCC TCACTTATCT 1020

CTGGAAACTC TTGATTCTTA AGTTTCGTCA TAATTTCATA TCCATACGTT TCGCCTTGGG 1080

CAATGAGACC TAATATCGCC CCATCTAAGA GACCTTTCAT AATCTGATCT GACACTGACA 1140

TTTTAATCAC CTACTATCTT ACATAATAAG ATAGTACATT GAGAACTTTT CGTCAACTAT 1200

CTTTTATTGT AAGGTAGTTG TTGTACACAT TCCTTAAATG ACTAACAACT TTGTTAATAG 1260

GGTAATACTT ACGGAAGTAT ATTTTATTTA TGGGGGAGGA ATTAATAATG ACTACAAAAA 1320

CAGTATTTGA TGTCATTGAT ATGGGGTTAG GATATTTAGT AAATGTGTAT GATGCTTGGA 1380

AAGTTGAAAA GGTACTTGAT GATTATCATA AGCCTTTTTC TAATACCATT CATTGGCAAT 1440

TTGGiCATGT ATTAACAATT TTTGAATCGG CCTTAGCTGT TGCTGGTAAA GAGAATATTG 1500

ATTTAAATAT CTATAGACCT TTATTCGGAA ATGGTTCGTC TCCAGATGAA TGGAAGGATG 1560

AAGTACCGAG TATTGAAAGG ATTTTAGAAG GTCTCCAAAC TTTACCTGAA CGTGCACGAA 1620

ATCTAACTGA AGATGATTTA GGAATTGAAT TGAAACAGCC AATTGTCGGT TGTAATAACT 1680

TAGAAGAGTT ATTAGTATTA AATGCCATTC ACATCCCACT TCATGCTGGT AAAATTGAAG 1740

AGATGTCTCG TATATTAAAA AATTTAAAAT AAATATGTGC TTATTAACCG TTAACAACAC 1800

GTTAACGGgT TTTTTATTTG TTTAAAAGGT CACTTTTTTG AATTTAATAA ACACCATCTA 1860

TACCAGTTCT TCACCGATTC TCGAAAAATA ATTATATTAA TGATTTCGTT AATTTAATTT 1920

TATATTTAAT TATTACTGTA CATCTTTTGT AGTTAGCTTT ATTCTTAAAT TGAAATATGT 1980
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TACTCCCTAT CGTTGTAGGT CTCCTTATTT GGGCACTTAC ACCTTTTAAA CCGGATGCTG 2100

TGGATCCAAC AGCATGGTAT ATGTTCGCAA TATTCGTCGC GACAATCATT GCTTGTATTA 2160

CACAACCGAT GCCAATTGGG GCCGTCTCTA TAATTGGATT TACAATCATG GTACTCGTTG 2220

GCATTGTTGA CATGAAAACG GCTGTCGCTG GTTTTGGTAA TAATAGCATT TGGTTAATTG 2280

CTATGGCATT TTTCATTTCG AGAGGATTTG TGAAAACAGG TCTTGGTAGA CGTATCGCAC 2340

TTCATTTCGT CAAATTATTT GGTAAAAAAA CATTAGGATT AGCATATTCT ATCGTCGGTG 2400

TAGATTTAAT TCTAGCGCCT GCTACACCAA GTAATACCGC GCGTGCTGGT GGAATCATGT 2460

TCCCAATTAT CAAATCACTT TCTGAATCAT TTGGTTCGAA ACCGAAAGAC GGATCAGCAC 2520

GCAAAATGGG TGCATTTCTT GTTTTCACAG AATTCCAAGG TAATTTAATT ACTGCGGCTA 2580

TGTTTTTAAC TGCAATGGCC GGTAACCCCC TTGCACAAAA TTTAGCATCT AGCACATCTA 2640

ATGTTCACAT TACATGGATG AATTGGTTTC TAGCTGCTTT AGTTCCTGGA CTTGTTTCCT 2700

TAATTGTTGT ACCTTTTATT ATTTATAAAA TTTATCCACC AACTGTTAAA GAAACACCAA 2760

ATGCTAAGAG TTGGGCTGAA AATGAATTAG CGACTATGGG TAAAATCGCT TTAGCTGAAA 2820

AATTTATGAT TGGTATTTTT GTCGTTGCGT TAACACTATG GATTGTCGGA AGTTTCATTC 2880

ATATTGATGC AACTTTAACG GCCTTTATTG CGCTAgcATT gTTATTATTG ACAGGCGTCT 2940

TAACATGGCA AGACATTTTA AACGAAACAG GTGCTTGGAA CACATTAGTA TGGTTCTCAG 3000

TATTAGTGTT AATGGCCGAC CAATTAAACA AGCTTGGATT TATTCCTTGG TTAAGTAAAT 3060

CCATTGCTAC AAGTCTTGGT GGCTTAAGCT GGCCTATAGT CCTGGTCATT TTAATATTGT 3120

TCTACTTCTA TTCACATTAC TTATTTGCAA GTTCTACAGC ACATATCAGT GCGATGTATG 3180

CAGCATTACT AGgCGTTGCC ATCGCAGCCG GTGCACCACC ATTATTCAGT GCATTAATGT 3240

TAGGTTTCTT CGGTAACCTA TTAGCTTCAA CAACACACTA TAGTAGTGGT CCAGCGCCGA 3300

TTCTATTCTC TTCAGGTTAC GTGACTCAAA AACGTTGGTG GACAATGAAC TTAATATTAG 3360

GTTTCGTCTA CTTTATTATC TGGATTGGTT TAGGATCACT TTGGATGAAA GTAATTGGTA 3420

TATTTTAAAA TATTTAAATT AGCGCTCGAA TCTCATTGAT TTGGGCGCTT TTTAATTTGT 3480

ATTTAAAATC AACCTTTGCT AAATCAAGAC TCCCTTTTTA AAATACGTTT ATCCTTTAAA 3540

TCATTGCGTG CTTCACTGAA AATTTGTATA AAGATTTAAG TCATTACGTA ACATCACATA 3600

AAATACATTT CTATACTATT CCGCTTCATT GATTAACATT ACGTATGCCC TCATAAATCA 3660

TCATACAAAA AACACCTTCG TTTAAATTCA TTTTAATTGC GAATTCAACG AAAGTGCCTT 3720

ATTTCATATT TAATGTTTCA AATTTATACG TCTGTCACTG TTACTGCACA CATACCTCAG 3780
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TTATAGGGTT TTTGCGACCG GATGTTTCTT CAATTTAATG TATTGAGAAA GACTATATAA 3900 

CACAATACCT GTCCAAATAA ATATAAACGT AATTAATTGA TCTATACTAA AAGGCTCTTT 3960 

GAAAACAAAT ATGCCGAGTA CAAACATTAT TGTTGGTCCA ACGTATTGAA TAAATCCTAT 4020

TAGCGAAAGT GGAATACGTT TTGCCCCGGC TGAGAATAGG ATTAGTGGTA TTGCCGTAAT 4080 

AGCACCAGAA AATAACAACC AAAATGATGA CATGTTCAAT CCAAATGACA TCTGATGTTG 4140 


CTGCCATAAA TAAATAACGT ATATTAGTCC AGCAGGTGCG GTAACAATAC ATTCAATCGT 4200 

AATACTGCTG ATGGCATCAA TATGTACTAC TTTTTTCAAT AATCCGTATG TACCAAAGGA 4260 

TAACGCTAAT ATAATAGAGA CGATTGGGAA TTCTCCAATC TTGAGCGTCA TATATAATAC 4320 


ACCGATGAAT GCGAATAAAA TGGCTAGCCA TTCAAATTTA TTGAATCTTT CTTTTAAAAA 4380 

GATAAGTGCG AGCAAAATGC TAACAAGTGG ATTTATATAA TAACCTAAAC TTGTTTGTAG 4440 

GACGTGACCG TTCGTTACAG CCCAAATAAA TGTACCCCAA TTTAATGTAA TGACATAGCC 4500

TGCTACGACA ATCGCTAATA GCTGAATGGG CTTGCCTAAC AATTGATTCA TATCTCGTTG 4560

AAATGCATTG CGTTGTTTTT GTCCAACCGC GAGTATGAAA ATCATGAATA TTGCTGAAAA 4620

TATAATACGA AAGGCTAAAA TTTCAAATGC GCCTATTGCA TCAACGAACT GCCAATATAT 4680

AGGTAGTATT CCCCACAGAA TGTATGCACT GAGTGCTAAA AATATGCCTT TTTTATACTC 4740

TGAATTCACC TTCAAACCTC CTTACTTTCC TAATTTTTAA TTTACTGCAT ACGCTCACTT 4800

GGTTATGCTA ATATAACGAT TTTACTAATA ATATTTCGAT AAAGATATCA TTTTGTTTAT 4860

ATTTCCCACA TTTATTCACC AACCACTAAA CAATATTAAT TTTATAAATA ATTCTGTACA 4920

AATCAGGGTA TATTGCCAGA AAGACTACCA TACAACATAA AGGATGGATA CAAATGACTT 4980 

TACCTAAAAT TGGAAAGCCT GCAACACGCG CGCTAAATTC ACAAGGTATA TACACATTAG 5040 

AAGGAGTATC ACAATATACG AAGTCATCTC TAATGGAGAT GCATGGCGTT GGTCCTAAAG 5100 

CTATATCAAT ATTGGAACAA GCTTTATTTC AG 5132 


(2) INFORMATION FOR SEQ ID NO: 164:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 22243 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 164:

AAGTAAATTA TATTATGAAT TTGCCTGTCA ATTTCTTAAA GACATTCTTA CCGGAACTAA 60 
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TAGAAGCAAT TAATAATGCy mAAGAAAAGA CAGCTAATAA TACCGGCTTA AAATTAATAT 180 

TTGCAATTAA TTATGGTGGC AGAGCAGAAC TTGTTCATAG TATTAAAAAT ATGTTTGACG 240

AGCTTCATCA ACAAGGTTTA AATAGTGATA TCATAGATGA AACATATATA AACAATCATT 300

TAATGACAAA AGACTATCCT GATCCAGAGT TGTTAATTCG TACTTCAGGA GAACAAAGAA 360 

TAAGTAATTT CTTGATTTGG CAAGTTTCGT ATAGTGAATT TATCTTTAAT CAAAAATTAT 420 


GGCCTGACTT TGACGAAGAT GAATTAATTA AATGTATAAA AATTTATCAG TCACGTCAAA 480 

GACGCTTTGG CGGATTGAGT GAGGAGTAGT ATAGTATGAA AGTTAGAACG CTGACAGCTA 540 

TTATTGCCTT AATCGTATTC TTGCCTATCT TGTTAAAAGG CGGCCTTGTG TTAATGATAT 600 


TTGCTAATAT ATTAGCATTG ATTGCATTAA AAGAATTGTT GAATATGAAT ATGATTAAAT 660 

TTGTTTCAGT TCCTGGTTTA ATTAGTGCAG TTGGTCTTAT CATCATTATG TTGCCACAAC 720 

ATGCAGGGCC ATGGGTACAA GTAATTCAAT TAAAAAGTTT AATTGCAATG AGCTTTATTG 780

TATTAAGTTA TACTGTCTTA TCTAAAAACA GATTTAGTTT TATGGATGCT GCATTTTGCT 840

TAATGTCTGT GGCTTATGTA GGCATTGGTT TTATGTTCTT TTATGAAACG AGATCAGAAG 900 

GATTACATTA CATATTATAT GCCTTTTTAA TTGTTTGGCT TACAGATACA GGGGCTTACT 960

TGTTTGGTAA AATGATGGGT AAACATAAGC TTTGGCCAGT AATAAGTCCG AATAAAACAA 1020 

TCGAAGGATT CATAGGTGGC TTGTTCTGTA GTTTGATAGT ACCACTTGCA ATGTTATATT 1080 

TTGTAGATTT CAATATGAAT GTATGGATAT TACTTGGAGT GACATTGATT TTAAGTTTAT 1140 

TTGGTCAATT AGGTGATTTA GTGGAATCAG GATTTAAGCG TCATTTCGGC GTTAAAGACT 1200 

CAGGTCGAAT ACTACCTGGA CACGGTGGTA TTTTAGACCG ATTTGACAGC TTTATGTTTG 1260 

TGTTACCATT ATTAAATATT TTATTAATAC AATCTTAATG CTGAGAACAA ATCAATAAAC 1320 

GTAAAGAGGA GTTGCTGAGA TAATTTAATG AATCTCAGAA CTCCTTTTGA AAATTATACG 1380 

CAATATTAAC TTTGAAAATT ATACGCAATA TTAACTTTGA AAATTAGACG TTATATTTTG 1440

TGATTTGTCA GTATCATATT ATAATGACTT ATGTTACGTA TACAGCAATC ATTTTTAAAA 1500 

TAAAAGAAAT TTATAAACAA TCGAGGTGTA GCGAGTGAGC TATTTAGTTA CAATAATTGC 1560 

ATTTATTATT GTTTTTGGTG TACTAGTAAC TGTTCATGAA TATGGCCATA TGTTTTTTGC 1620

GAAAAGAGCA GGCATTATGT GTCCAGAATT TGCGATCGGT ATGGGGCCAA AAATTTTTAG 1680

TTTTAGAAAA AATGAAACAC TTTACACTAT TAGGTTATTG CCTGTTGGTG GATATGTTCG 1740 

TATGGCAGGA GATGGCTTAG AAGAGCCACC AGTCGAGCCC GGTATGAACG TTAAAATTAA 1800

ACTTAATGAA GAAAATGAAA TAACACATAT CATATTAGAT GATCATCATA AGTTTCAACA 1860 
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CACTGCTTAT GATAATGAAA GACATCATTT TAAAATTGCT AGAAAGTCTT TCTTTGTTGA 1980 

AAATGGTAGC TTAGTTCAAA TTGCTCCGAG AGACAGACAA TTTGCACATA AAAAGCCATG 2040 

GCCGAAATTT TTAACATTAT TTGCGGGACC GTTATTTAAC TTTATATTAG CTTTAGTCCT 2100

ATTTATTGGT GTTGCATATT ATCaAGGcAC GCcTACGTCT ACTGTAGAAC AAGTCGCAGA 2160 

TAAGTATCCA GCTCAACAAG CAGGATTACA AAAAGGTGAT AAGATCGTCC AAATTGGCAA 2220 

ATATAAAATA TCTGAATTTG ATGATGTTGA TAAGGCGTTA GATAAAGTTA AAGATAATAA 2280 

GACGACTGTT AAATTTGAAC GTGATGGTAA AACAAAGTCA GTTGAATTAA CACCTAAAAA 2340 

GACTGAAAAA AAACTGACTA AAGTAAGTTC AGAGACGAAG TATGTTCTCG GATTCCAACC 2400 

AGCGAGTGAA CATACACTTT TTAAACCAAT TGTATTCGGA TTTAAAAGCT TTTTAATCGG 2460 

TAGTACTTAT ATTTTTACAG CTGTAGTAGG TATGTTGGCT AGTATATTTA CGGGCGGATT 2520

CTCATTTGAT ATGTTAAATG GTCCGGTTGG TATTTATCAT AACGTCGACT CAGTTGTTAA 2580

AGCGGGTATC ATTAGCTTAA TTGGTtnCAC TGCGTTATTA AGTGTAAACT TAGGTATTAT 2640 

GAATTTAATT CCTATTCCTG CACTAGACGG TGGTCGTATT TTATTTGTTA TATATGAAGC 2700 

GATTTTCAGA AAACCAGTTA ATAAAAAAGC GGAAACAACG ATTATTGCTA TTGGTGCCAT 2760

TTTCATGGTC GTTATAATGA TATTAGTAAC GTGGAATGAT ATTCGACGAT ATTTCTTATA 2820 

ATTTAGGAGG ATAAATAATT ATGAAGCAAT CCAAAGTTTT TATACCAACG ATGCGTGACG 2880

TGCCATCAGA AGCAGAAGCA CAAAGTCATC GTTTATTATT GAAATCGGGT TTGATAAAAC 2940

AAAGTACAAG TGGGATTTAT AGTTATTTAC CGCTAGCAAC ACGTGTGTTA AATAATATTA 3000 

CTGCAATTGT GCGACAAGAA ATGGAACGTA TCGATTCTGT TGAAATTTTA ATGCCAGCGT 3060

TACAACAAGC TGAATTATGG GAAGAATCAG GACGTTGGGG TGCATATGGC CCAGAATTAA 3120 

TGCGTTTACA AGATAGaCAT GGAAgACAAT TTgCATTAGG TCCaACACAT GAAGAATTAG 3180 

TTACATCAAT AGTAAGAAAT GAATTGAAAT CATACAAACA ATTACCGATG ACATTATTCC 3240 

aAATTCAATC TAAATTCCGT GATGAAAAGA GACCACGTTT TGGTTTAyTC GTGGGCGTGA 3300 

ATTTATTATG AAAGATGCAT ATTCATTCCA TGCTGACGAG GCATCATTAG ATCAAACGTA 3360

TCAAGATATG TATCAAGCGT ATAGCCGTAT TTTTGAGAGA GTTGGCATTA ACGCAAGACC 3420

AGTAGTTGCA GATTCAGGTG CTATAGGCGG TAGCCATaCA CATGAATTTA TGGCATTAAG 3480 

TGCTATCGGT GAGGATACAA TCGTTTACAG TAAAGAAAGT GATTATGCTG CTAACATCGA 3540 

AAAAGCAGAA GTCGTTTACG ArcCAaATcA TaAGCATACT ACTGTGCAAC CTTTAGAAAA 3600

AATTGAAACA CCAAATGTTA AGACTGCGCA AGAATTGGCA GACTTCTTAG GTAGACCAGT 3660 
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GCGTGGCCAT CATGAAATTA ATGACATTAA ATTAAAATCT TATTTCGGCA CAGATAATAT 3780 

TGAATTAGCA ACACAAGACG AAATTGTTAA TTTAGTTGGT GCAAATCCTG GTTCACTAGG 3840 

TCCTGTAATT GATAAAGAAA TCAAAATTTA TGCAGATAAT TTTGTGCAAG ATTTAAATAA 3900

TTTAGTTGTC GGTGCTAACG AAGATGGTTA TCACTTAATT AATGTAAATG TAGGTAGAGA 3960

CTTCAACGTT GATGAATATG GCGATTTCCG TTTTATTTTA GAAGGCGAAA AGTTAAGTGA 4020 

TGGTTCAGGC GTTGCACATT TTGCTGAAGG TATTGAAGTT GGTCAAGTAT TCAAATTGGG 4080 

TACTAAGTAT TCAGAATCAA TGAATGCTAC ATTCTTAGAT AACCAAGGAA AAGCTCAATC 4140 

TTTAATTATG GGTTGTTACG GAATTGGAAT TTCTAGAACG CTAAGTGCGA TTGTTGAACA 4200 

AAATCACGAT GATAATGGAA TTGTTTGGCC TAAATCAGTT ACTCCGTTTG ATTTACATTT 4260 

AATTTCTATT AATCCTAAGA AAGATGATCA ACGAGAACTA GCAGATGCAC TATATGCTGA 4320 

ATTTAATACT AAATTTGATG TGTTGTACGA TGATCGTCAG GAACGTGCAG GTGTTAAATT 4380 

TAATGATGCC GATTTAATTG GTTTACCACT GCGAATTGTT GTTGGTAAAC GTGCATCGGA 4440

AGGTATTGTA GAAGTTAAAG AACGTTTAAC AGGTGATAGC GAAGAAGTTC ACATTGATGA 4500 

CTTAATGACT GTCATTACAA ATAAATATGA TAACTTAAAA TAATTAAGAT CGAATGAATT 4560 

ATAAGAGTAG GAAAAAGCTG AAAGAAATCT GATGCTTATG TCCTGCTCTT ATTATTTTTG 4620 

ATATAATGAT TATTCGATGA AAAATGACTG AAGACATAGT ATAATTAAAG ATAAATTTGT 4680

TTTAACAATA TAATGATTAG CCAAATATAA AGCATTTAAT TTTCTATCAT TACTATGCTC 4740

ACATAATCTA AATATTGTTC GAACACGTAA AAGTAATTTC TATTTAAGGT GGTAATTGTC 4800 

TTGGCAATGA CAGAGCAACA AAAATTTAAA GTGCTTGCTG ATCAAATTAA AATTTCAAAT 4860 

CAATTAGATG CTGAAATTTT AAATTCAGGT GAACTGACAC GTATAGATGT TTCTAACAAA 4920 

AACAGAACAT GGGAATTTCA TATTACATTA CCACAATTCT TAGCTCATGA AGATTATTTA 4980 

TTATTTATAA ATGCAATAGA GCAAGAGTTT AAAGATATCG CCAACGTTAC ATGTCGTTTT 5040

ACGGTAACAA ATGGCACGAA TCAAGATGAA CATGCAATTA AATACTTTGG GCACTGTATT 5100 

GACCAAACAG CTTTATCTCC AAAAGTTAAA GGTCAATTGA AACAGAAAAA GCTTATTATG 5160 

TCTGGAAAAG TATTAAAAGT AATGGTATCA AATGACATTG AACGTAATCA TTTTGATAAG 5220 

GCATGTAATG GAAGTCTTAT CAAAGCGTTT AGAAATTGTG GTTTTGATAT CGATAAAATC 5280 
ATATTCGAAA CAAATGATAA TGATCAAGAA CAAAACTTAG CTTCTTTAGA AgCACaTATT 5340

CAAGAAGAAG ACGAACAAAG TGCACGATTG GCAACAGAGA AACTTGAAAA AATGAAAGCT 5400 

GAAAAAGCGA AACAACAAGA TAACAACGAA AGTGCTGTCG ATAAGTGTCA AATTGGTAAG 5460

GCAATAGAGG GTGTCATTTT TGATATAAAC TTAAAAGAAC TTAAAAGTGG TCGCCATATC 5580

GTAGAAATTA AAGTGACTGA CTATACGGAC TCTTTAGTTT TAAAAATGTT TACTCGTAAA 5640

AACAAAGATG ATTTAGAACA TTTTAAAGCG CTAAGTGTTG GTAAATGGGT TAGGGCTCAA 5700

GGTCGTATTG AAGAAGATAC ATTTATTAGA GATTTAGTTA TGATGATGTC TGATATTGAA 5760

GAGATTAAAA AAGCGACAAA AAAAGATAAG GCTGAAGAAA AGCGTGTAGA ATTCCACTTG 5820

CATACTGCAA TGAGCCAAAT GGATGGTATA CCCAATATTG GTGCGTATGT TAAACAGGCA 5880

GCAGACTGGG GACATCCAGC CATTGCGGTT ACAGACCATA ATGTTGTGCA AGCATTTCCA 5940

GATGCTCACG CAGCAGCGGA AAAACATGGC ATTAAAATGA TATACGGTAT GGAAGGTATG 6000

TTAGTTGATG ATGGTGTTCC GATTGCATAC AAACCACAAG ATGTCGTATT AAAAGATGCT 6060

ACTTATGTTG TGTTCGACGT TGAGACAACT GGTTTATCAA ATCAGTATGA TAAAATCATC 6120

GAGCTTGCAG CTGTGAAAGT TCATAACGGT GAAATCATCG ATAAGTTTGA AAGGTTTAGT 6180

AATCCGCATG AACGATTATC GGAAACGATT ATCAATTTGA CGCATATTAC TGATGATATG 6240

TTAGTAGATG CCCCTGAGAT TGAAGAAGTA CTTACAGAGT TTAAAGAATG GGTTGGCGAT 6300

GCGATATTCG TAGCGCATAA TGCTTCGTTT GATATGGGCT TCATCGATAC GGGATATGAA 6360

CGTCTTGGGT TTGGACCATC AACGAATGGT GTTATCGATA CTTTAGAATT ATCTCGTACG 6420

ATTAATACTG AATATGGTAA ACATGGTTTG AATTTCTTGG CTAAAAAATA TGGCGTAGAA 6480

TTAACGCAAC ATCACCGTGC CATTTATGAT ACAGAAGCAA CAGCTTACAT TTTCATAAAA 6540

ATGGTTCAAC AAATGAAAGA ATTAGGCGTA TTAAATCATA ACGAAATCAA CAAAAAACTC 6600

AGTAATGAAG ATGCATATAA ACGTGCAAGA CCTAGTCATG TCACATTAAT TGTACAAAAC 6660

CAACAAGGTC TTAAAAATCT ATTTAAAATT GTAAGTGCAT CATTGGTGAA GTATTTCTAC 6720

CGTAGACCTC GAATTCCACG TTCATTGTTA GATGAATATC GTGAGGGATT ATTGGTAGGT 6780

ACAGCGTGTG ATGAAGGTGA ATTATTTACG GCAGTTATGC AGAAGGACCA GAGTCAAGTT 6840

GAAAAAATTG CCAAATATTA TGATTTTATT GAAATTCAAC CACCGGCACT TTATCAAGAT 6900

TTAATTGATA GAGAGCTTAT TAGAGATACT GAAACATTAC ATGAAATTTA TCAACGTTTA 6960

ATACATGCAG GTGACACAGC GGGTATACCT GTTATTGCGA CAGGAAATGC ACACTATTTG 7020

TTTGAACATG ATGGTATCGC ACGTAAAATT TTAATAGCAT CACAACCCGG CAATCCACTT 7080

AATCGCTCAA CTTTACCGGA AGCACATTTT AGAACTACAG ATGAAATGTT AAACGAGTTT 7140

CATTTTTTAG GTGAAGAAAA AGCGCATGAA ATTGTTGTGA AAAATACAAA CGAATTAGCA 7200

GATCGAATTG AACGTGTTGT TCCTATTAAA GATGAATTAT ACACACCX5CG TATGGAAGGT 7260

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CTGCCTCAAA TCGTAATTGA TCGATTAGAA AAAGAATTAA AAAGTATTAT CGGTAATGGA 73 BO 

TTTGCGGTAA TTTACTTAAT TTCGCAACGT TTAGTTAAAA AATCATTAGA TGATGGATAC 7440

TTAGTTGGTT CCCGTGGTTC AGTAGGTTCT AGTTTTGTAG CGACAATGAC TGAGATTACT 7500 

GAAGTAAACC CGTTACCGCC ACACTATATT TGTCCGAACT GTAAAACGAG TGAATTTTTC 7560 

AATGATGGTT CAGTAGGATC AGGATTTGAT TTACCTGATA AGACGTGTGA AACTTGTGGA 7620 

GCGCCACTTA TTAAAGAAGG ACAAGATATT CCGTTTGAAA CATTTTTAGG ATTTAAGGGA 7680 

GATAAAGTTC CTGATATCGA CTTAAACTTT AGTGGTGAAT ATCAACCGAA TGCCCATAAC 7740

TACACAAAAG TATTATTTGG TGAGGATAAA GTATTCCGTG CAGGTACAAT TGGTACTGTT 7800 

GCTGAAAAGA CTGCTTTTGG TTATGTTAAA GGTTATTTGA ATGATCAAGG TATCCACAAA 7860 

AGAGGTGCTG AAATAGATCG ACTCGTTAAA GGATGTACAG GTGTTAAACG TACAACTGGA 7920 

CAGCATCCAG GGGGTATTAT TGTAGTACCT GATTACATGG ATATTTATGA TTTTACGCCG 7980

ATACAATATC CTGCCGATGA TCAAAATTCA GCATGGATGA CGACACATTT TGATTTCCAT 8040

TCTATTCATG ATAATGTATT AAAACTTGAT ATACTTGGAC ACGATGATCC AACAATGATT 8100 

CGTATGCTTC AAGATTTATC AGGAATTGAT CCAAAAACAA TACCTGTAGA TGATAAAGAA 8160

GTTATGCAGA TATTTAGTAC ACCTGAAAGT TTGGGTGTTA CTGAAGATGA AATTTTATGT 8220 

AAAACAGGTA CATTTGGGGT ACCAGAATTC GGTACAGGAT TCGTGCGTCA AATGTTAGAA 8280 

GATACAAAGC CAACAACATT TTCTGAATTA GTTCAAATCT CAGGATTATC TCATGGTACA 8340

GATGTGTGGT TAGGCAATGC TCAAGAATTA ATTAAAACCG GTATATGTGA TTTATCAAGT 8400

GTAATTGGTT GTCGTGATGA TATCATGGTT TATTTAATGT ATGCTGGTTT AGAACCATCA 8460 

ATGGCTTTTA AAATAATGGA GTCAGTACGT AAAGGTAAAG GTTTAACTGA AGAAATGATT 8520 

GAAA?GATGA AAGAAAATGA AGTGCCAGAT TGGTATTTAG ATTCATGTCT TAAAATTAAG 8580

TACATGTTCC CTAAAGCCCA TGCAGCAGCA TACGTTTTAA TGGCAGTACG TATCGCATAT 8640

TTCAAAGTAC ATCATCCACT TTATTACTAT GCATCTTACT TTACAATTCG TGCGTCAGAC 8700 

TTTGATTTAA TCACGATGAT TAAAGATAAA ACAAGCATTC GAAATACTGT AAAAGACATG 8760 

TATTCTCGCT ATATGGATCT AGGTAAAAAA GAAAAAGACG TATTAACAGT CTTGGAAATT 8820 

ATGAATGAAA TGGCGCATCG AGGTTATCGA ATGCAACCGA TTAGTTTAGA AAAGAGTCAG 8880 

GCGTTCGAAT TTATCATTGA AGGCGATACA CTTATTCCGC CGTTCATATC AGTGCCTGGG 8940

CTTGGCGAAA ACGTTGCGAA ACGAATTGTT GAAGCTCGTG ACGATGGCCC ATTTTTATCA 9000

AAAGAAGATT TAAACAAAAA AGCTGGATTA TCTCAGAAAA TTATTGAGTA TTTAGATGAG 9060 
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GAAATAATCA AGGTATTTAT TTAATGCGTA TGGCGTAGTC AAAGAAATAC AAAATTGTTG 9180 

CTGGACACAA AATTATGCCC GTATTTCTTT TCAATGTCTT ACGAGTCTAT TCAAATGTAA 9240

TGGTGAAATA AAGGAACAAA CTTTTACAAG AATCTCTGAT TAATAGTGAA GTCATTTGTT 9300 

TCAAGCATAA ACTTATGCTA TAATTAAGTT GCTTAAAAAT TAGTGAACTC AGGCAGAAGA 9360 

GTGGGAGATT CCCGCTCTTT TCTATTTGCC AAAAAGGGAG GCCTGTATGA GTAAAATTAC 9420 

AGAACAAGTA GAAGTGATTG TTAAACCAAT TATGGAAGAC TTGAATTTTG AACTTGTAGA 9480 

CGTTGAATAT GTCAAAGAGG GTAGAGATCA TTTTCTTAGA ATCTCTATTG ATAAAGAAGG 9540 

TGGCGTAGAT TTAAATGATT GTACGCTAGC TTCTGAAAAA ATAAGTGAAG CTATGGATGC 9600 

AAATGATCCT ATTCCTGAAA TGTATTATTT AGACGTAGCG TCACCTGGTG CAGAACGTCC 9660 

AATTAAAAAA GAACAAGATT TCCAAAATGC AATAACTAAA CCTGTATTTG TTTCTTTATA 9720 

TGTACCAATT GAAGGTGAAA AGGAATGGTT AGGCATTTTA CAAGAAGTCA ATAATGAAAC 9780 

AATTGTAGTA CAAGTTAAAA TCAAAGCAAG AACGAAAGAT ATAGAGATAC CGAGAGACAA 9840 

AATAGCAAAA GCACGTCACG CAGTTATGAT TTAACGTGAT GAGGAGGAAA AAACGTGTCA 9900 

AGTAATGAAT TATTATTAGC TACTGAGTAT TTAGAAAAAG AAAAGAAGAT TCCTAGAGCA 9960

GTATTAATTG ATGCTATTGA AGCAGCTTTA ATTACTGCAT ACAAAAAGAA TTATGATAGT 10020 

GCAAGAAATG TCCGTGTGGA ATTAAATATG GATCAAGGTA CTTTCAAAGT TATCGCTCGT 10080 

AAAGATGTTG TTGAAGAAGT ATTTGACGAC AGAGATGAAG TGGATTTAAG TACAGCGCTT 10140 

GTTAAAAACC CTGCATATGA AATTGGTGAT ATATACGAAG AAGATGTAAC ACCTAAAGAT 10200 

TTTGGTCGTG TAGGTGCTCA AGCAGCGAAA CAAGCAGTAA TGCAACGTCT TCGTGATGCT 10260 

GAACGTGAAA TTTTATTTGA AGAATTTATA GACAAAGAAG AAGACATACT TACTGGAATT 10320 

ATTGACCGTG TTGACCATCG TTATGTATAT GTGAATTTAG GTCGTATCGA AGCTGTTTTA 10380 

TCTGAAGCAG AAAGAAGTCC TAACGAAAAA TATATTCCTA ACGAACGTAT CAAAGTATAT 10440 

GTTAACAAAG TGGAACAAAC GACAAAAGGT CCTCAAATCT ATGTTTCTCG TAGCCATCCA 10500 

GGTTTATTAA AACGTTTATT TGAACAAGAA GTTCCAGAAA TTTACGATGG TACTGTAATT 10560 

GTTAAATCAG TAGCACGTGA AGCTGGCGAT CGCTCTAAAA TTAGTGTCTT CTCTGAAAAC 10620 

AATGATATAG ATGCTGTTGG TGCATGTGTT GGTGCTAAAG GCGCACGTGT TGAAGCTGTT 10680 

GTTGAAGAGC TAGGTGGTGA AAAAATCGAC ATCGTTCAAT GGAATGAAGA TCCAAAAGTA 10740 

TTTGTAAAAA ATGCTTTAAG CCCTTCTCAA GTTTTAGAAG TTATTGTTGA TGAAACAAAT 10800 

CAATCTACAG TAGTTGTTGT TCCTGATTAT CAATTGTCAT TAGCGATTGG TAAAAGAGGA 10860 
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GATGCGCGTG AAGCGGGTAT CTATCCAGTA GTTGAAGCTG AAAAAGTAAC TGAAGAAGAT 10980 

GTTGCTTTAG AAGATGCTGA CACAACAGAA TCAACCGAAG AGGTAAATGA TGTTTCAGTT 11040 

GAAACAAATG TAGAGAAAGA ATCTGAATAA TAGGTTGGAG TGAAGTATCT ATGAAAAAGA 11100 

AAAAAATTCC GATGCGAAAA TGTATTCTTT CAAATGAAAT GCATCCCAAA AAAGATATGA 11160 

TTCGTGTTGT TGTTAATAAA GAAGGCGAAA TCTTTGCGGA TGTTACTGGA AAGAAACAAG 11220 

GCCGTGGCGC ATATGTTTCT AAAGATGTTG CTATGGTTGA AAAAGCACAA CAAAAAGAAA 11280 

TTTTAGAAAA ATATTTTAAA GCATCTAAAG AGCAATTGGA TCCTGTTTAC AAAGAAATTA 11340

TTAGATTAAT TTATAGAGAA GAGATCCCAA AATGAGTATA GATCAAATAT TAAACTTTTT 11400 

AGGATTAGCA ATGAGAGCTG GTAAAGTAAA AACAGGTGAA TCAGTCATTG TTAATGAGAT 11460 

TAAAAAAGGA AATTTGAAGC TCGTTATTGT TGCAAATGAT GCGTCTGATA ATACAGCTAA 11520 

ATTAATTACA GATAAATGTA AGAGTTACAA AGTTCCATTC AGAAAGTTTG GAAATCGAAA 11580 

TGAATTGGGA ATAGCACTTG GAAAAGGTGA GCGTGTTAAT GTAGGGATTA CTGACCCAGG 11640 

CTTTGCTAAA AAGTTGCTAT CAATGATAGA TGAATATCAT AAGGAGTGAT TATATGAGTA 11700 

AACAAAGAAT TTACGAATAT GCGAAAGAAT TAAATCTAAA GAGTAAAGAG ATTATAGATG 11760 

AGTTAAAAAG CATGAATATT GAGGTTTCAA ATCATATGCA AGCTTTGGAA GATGACCAAA 11820 

TTAAAGCATT AGATAAAAAG TTCAAAAAAG AACAAAAGAA CGACAATAAA CAAAGCAGTC 11880 

AAAATAATCA CCAAAAATCA AACAATCAAA ACCAAAATAA AGGGCmACAA AAAGATAACA 11940

AAAAGAATCm ACAACAAAAT AATAAAGGCA ACAAAGGCAA TAAAAAGAAT AATAGAAATa 12000

ATAAGAAAAA TAACAAGAAT AATAAACCAC AAAATCAACC AGCTGCTCCA AAAGAAATAC 12060

CATCAAAAGT GACATATCAA GAAGGTATTA CAGTAGGCGA ATTTGCGGAT AAATTAAATG 12120 

TTGAATCATC AGAAATTATC AAAAAATTAT TCTTACTTGG TATTGTTGCT AATATCAATC 12180 

AATCATTAAA TCAAGAAACA ATCGAATTAA TTGCCGATGA TTATGGCGTT GAGGTTGAAG 12240 

AAGAAGTTGT GATTAATGAA GAAGACTTAT CAATCTATTT CGAAGACGAA AAAGATGATC 12300 

CAGAGGCAAT TGAGAGACCA GCAGTTGTAA CAATTATGGG ACATGTTGAC CATGGTAAAA 12360 

CGACTTTATT AGATTCAATT CGTCATACAA AAGTTACAGC AGGTGAAGCA GGCGGAATCA 12420 

CTCAACATAT TGGTGCATAT CAAATTGAAA ACGATGGCAA AAAAATCACT TTCTTAGATA 12480 

CACCGGGACA TGCTGCATTT ACAACGATGC GTGCGCGTGG TGCaCAAGTA ACAGATATTA 12540 

CTATTTTAGT AGTAGCAGCT GACGATGGTG TTATGCCACA AACAATTGAA GCAATTAACC 12600 

ATGCTAAAGA AGCAgAAGTA CCAATTATTG TTGCAGTAAA TAAAATTGAT AAACCAACTT 12660 
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GCGGCGAAAC AATTtTCGTc CACTTTCTGC ATTAAGTGGT GATGGTATCG ACGATTTATT 12 780 

AGAAATGATA GGATTAGTTG CAGAAGTTCA AGAACTTAAA GCAAATCCTA AAAACCGTGC 12840

TGTTGGTACA GTTATCGAAG CTGAATTAGA TAAATCACGT GGTCCTTCTG CATCATTATT 12900 

AGTACAAAAC GGTACATTAA ATGTTGGTGA TGCGATTGTA GTTGGTAATA CTTACGGCCG 12960 

TATCCGTGCA ATGGTTAATG ACTTAGGTCA AAGAATCAAA ACGGCTGGTC CATCAACGCC 13020 

TGTTGAAATT ACAGGTATTA ATGATGTGCC ACAAGCTGGG GATCGCTTTG TTGTATTTAG 13080 

TGATGAAAAA CAAGCTCGTC GTATTGGTGA ATCAAGACAC GAAGCTAGCA TTATACAACA 13140 

ACGTCAAGAA AGTAAAAATG TTTCATTAGA TAACCTGTTT GAACAAATGA AACAAGGTGA 13200 

AATGAAAGAT TTAAACGTTA TTATTAAAGG TGATGTTCAA GGTTCTGTTG AAGCTTTAGC 13260 

TGCATCATTA ATGAAAATTG ATGTTGAAGG CGTAAATGTT CGTATCATTC ATACAGCGGT 13320 

TGGTGCAATT AATGAGTCAG ACGTGACACT TGCTAATGCC TCAAATGGTA TTATCATTGG 13380

TTTCAATGTT CGTCCAGACA GTGGTGCAAA ACGTGCTGCA GAAGCTGAAA ATGTTGATAT 13440

GCGTTTACAC AGAGTTATTT ATAATGTTAT CGAAGAAATT GAATCAGCGA TGAAAGGTTT 13500 

ACTTGATCCA GAATTTGAAG AACAAGTTAT CGGACAAGCT GAAGTTCGTC AAACATTCAA 13560 

AGTTTCTAAA GTTGGTACTA TTGCTGGATG TTATGTTACT GAAGGTAAAA TTACGCGAAA 13620 

TGCTGGTGTA CGTATTATTC GTGATGGTAT TGTTCAATAT GAAGGCGAAT TAGATACACT 13680

TAAACGTTTC AAAGATGATG CTAAGGAAGT TGCAAAAGGT TATGAATGTG GTATTACAAy 13740 

TGAAAACTAC AATGACGTTA AAGAAGGCGA TGTTATCGAA GCATTTGAAA TGGTTGAAAT 13800

TAAGCGTTAA TTAAATAAAT TACAAGCTAA AAGTATAGTT AAGATTGATA TGCTCCCTAT 13860

AAATATTGCA CTTTTTAAGT GTCTACTTTA TAGGGAGCAT ATTTGATACT AGCTTTTGGT 13920

TTTTTATTAG AATAGATTAC CTATTAAAAG TTACGTTATA TGGACATGAT TTTGTATAAA 13980

ATTTTGTGGT GGCCTAGAAT GATTTTTAAT GACAAAATAT AATGTCGACT ATTATTGGAA 14040

AATTTTCTGT TGaAATGCCT ATCTTACGGC AAACTTTATT TGATTTTATA GGCTTAATTT 14100

ATTAAAATAA CGTGTGAGCT AAAATAATTG TTTAAGCATT GTTACACTAA AAAATGCAAA 14160 

TAACAATTGA ACTTAAAGAT AAAGAGGTGA CAAGAATGAG CAGTATGAGA GGAGAGCGTG 14220 

TTGGTGAACA AATGAAGAAG GAATTAATGG ATATCATCAA CAATAAAGTC AAAGATCCTC 14280 

GAGTTGGTTT TATTACAATT ACAGATGTTG TTTTAACAAA TGATTTATCG CAGGCTAAAG 14340 

TATTTTTAAC TGTATTAGGT AACGATAAAG AAGTAGAAAA TACATTTAAA GCACTTGATA 14400 

AAGCAAAAGG CTTCATTAAG TCTGAATTAG GTTCTAGAAT GCGATTACGT ATTATGCCGG 14460
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AAGATTTACA CAAACAAGAT AGATAATTTA GTGTTAGGTA TCTGGAAAAT GTTTGATAAT 14580 

TTCTTAATAT CGGTATATTA ACATTAAACA GTTAATACAT AGATGTGTAG AAATAGTTAA 14640

CATTTTCCAG TTTTTTTATG AATAAATTTA GTTGATACGC TATTAAAATA TATTTTAAAA 14700 

AAGAAGGTGA CTATATGTAT AATGGGATAT TACCAGTATA TAAAGAGCGC GGTTTAACAA 14760 

GTCATGACGT TGTATTCAAA TTGCGTAAAA TATTAAAAAC TAAAAAAATA GGTCACACGG 14820

GTACGCTTGA TCCCGAAGTT GCAGGCGTGT TACCGGTATG TATAGGTAAT GCAACGAGAG 14880

TTAGTGATTA TGTTATGGAT ATGGGCAAAG CTTATGAAGC AACTGTATCG ATAGGAAGAA 14940

GTACAACGAC TGAAGATGAA ACGGGTGATA CATTGGAAAC AAAAGGTGTA CACTCAGCAG 15000 

ATTTTAATAA GGACGATATT GACCGATTGT TAGAAAGTTT TAAAGGTATC ATTGAACAAA 15060 

TTCCGCCGAT GTACTCATCC GTCAAAGTAA ATGGTAAAAA ATTATATGAA TATGCGCGTA 15120 

ATAATGAAAC AGTTGAAAGA CCAAAGCGTA AAGTtAATAT TAAAGACATT GGGCGTATAT 15180 

CTGAATTAGA TTTTAAAGAA AATGAGTGTC ATTTTAAAAT ACGCGTCATC TGTGGTAAAG 15240 

GTACATATAT TAGAACGCTA GCAACTGATA TTGGTGTGAA ATTAGGCTTT CCGGCACATA 15300

TGTCGAAATT AACACGAATC GAGTCTGGTG GATTTGTGTT GAAAGATAGC CTTACATTAG 15360 

AACAAATAAA AGAACTTCAT GAGCAGGATT CATTGCAAAA TAAATTGTTT CCTTTAGAAT 15420 

ATGGATTAAA GGGTTTGCCA AGCATTAAAA TTAAAGATTC GCACATAAAA AAACGTATTT 15480 

TAAATGGGCA GAAATTTAAT AAAAATGAAT TTGATAACAA AATTAAAGAC CAAATTGTAT 15540

TTATTGATGA TGATTCAGAA AAAGTATTAG CAATTTATAT GGTACACCCT ACAAAAGAAT 15600 

CAGAAATTAA ACCTAAAAAA GTCTTTAATT AAAGGAGATA GAATTTATGA AAGTCATAGA 15660 

AGtGACACAT CCTATACAAT CTAAACAGTA TATTACAGAG GATGTTGCAA TGGCATTCGG 15720 

ATTTrTCGAT GGCATGCATA AAGGTCATGA CAAAGTCTTT GATATATTAA ACGAAATAGC 15780 

TGAGGCACGC AGTTTAAAAA AAGCGGTGAT GACATTTGAT CCGCATCCGT CTGTCGTGTT 15840 

GAATCCTAAA AGAAAACGAA CAACGTATTT AACGCCACTT TCAGATAAAA TCGAAAAAAT 15900 

TAGCCAACAT GATATTGATT ATTGTATAGT GGTTAATTTT TCATCTAGGT TTGCTAATGT 15960 

GAGCGTAGAA GATTTTGTTG AAAATTATAT AATTAAAAAT AATGTAAAAG AAGTCA7TGC 16020

TGGTTTTGAT TTTACTTTTG GTAAATTTGG AAAAGGTAAT ATGACTGTAC TTCAAGAATA 16080 

TGATGCGTTT AATACGACAA TTGTGAGTAA ACAAGAAATT GAAAATGAAA AAATTTCTAC 16140 

AACTTCTATT CGTCAAGATT TAATCAATGG TGAGTTGCAA AAAGCGAATG ATGCTTTAGG 16200 

CTATATATAT TCTATTAAAG kCACTGTAGT GCAAGGTGAA AAAAGGGGAA GAACTATTGG 16260

TGCTGTTAGT ATTGAAATCG GCACTGAAAA TAAATTATAT CGAGGGGTAG CTAACATAGG 16380

TGTAAAGCCA ACATTTCATG ATCCTAACAA AGCAGAAGTT GTCATCGAAG TGAATATCTT 16440

TGACTTTGAG GATAATATTT ATGGTGAACG AGTGACCGTG AATTGGCATC ATTTCTTACG 16500

TCCTGAGATT AAATTTGATG GTATCGACCC ATTAGTTAAA CAAATGAACG ATGATAAATC 16560

GCGTGCTAAA TATTTATTAG CAGTTGATTT TGGTGATGAA GTAGCTTATA ATATCTAGAG 16620

TTGCGTATAG tTATATAAAC AATCTATACC ACACCTTTTT CTTAGTAGGT CGAATCTCCA 16680

ACGCCTAACT CGGATTAAGG AGTATTCAAA CATTTTAAGG AGGAAATTGA TTATGGCAAT 16740

TTCACAAGAA CGTAAAAACG AAATCATTAA AGAATACCGT GTACACGAAA CTGATACTGG 16800

TTCACCAGAA GTACAAATCG CTGTACTTAC TGCAGAAATC AACGCaGTAA ACGAACACTT 16860

ACGTACACAC AAAAAAGACC ACCATTCACG TCGTGGATTA TTAAAAATGG TAGGTCGTCG 16920

TAGcATTTaT TAAACTACTT ACGTaGTAAA GATATTCAAC GTTACCGTGA ATTAATTAAA 16980

TCACTTGGTA TCCGTCGTTA ATCTTAATAT AACGTCTTTG AGGTTGGGGC ATATTTATGT 17040

TCCAACCTTA ATTTATATTA AAAAAGCTTT TTACAAATAT TAACATTTAT TATATGTTAA 17100

GCTAATATTG AGTGAATAAT AAGGTTACAA TGAGATAAAG ATGATATAAG TACACCTAGA 17160

GTAATAATCA AGATATTAAA AATAAAGTAT GTTTTTTTAA AAAATATAAC TTATATTTAT 17220

ACTGATAAGG GTGGGACGAT AAGTCTATTT TGTAAATAAT AGATGGATAT CCCGCTCTCT 17280

TTTTTTCCAA TTCAATATTT TATAACTAAT ATTAAAATAC GATAATAAAT GATATGATAT 17340

AACTATTAGA TTGAAGAGAG GAGATTTATA ATGTCTCAAG AAAAGAAAGT TTTTAAAACT 17400

GAATGGGCAG GAAGATCTTT AACGATTGAA ACAGGGCAAT TAGCTAAACA AGCAAATGGC 17460

GCTGTATTGG TTCGTTATGG AGATACAGTC GTGTTATCGA CGGCAACTGC ATCAAAAGAA 17520

CCTOGTGATG GAGATTTCTT CCCATTAACA GTGAACTATG AAGAAAAAAT GTACGCTGCG 17580

GGTAAAATTC CTGGTGGATT TAAAAAGAGA GAAGGACGTC CTGGTGACGA TGCAACATTA 17640

ACTGCGCGAT TAATTGATAG ACCAATTAGA CCTTTATTCC CTAAAGGATA TAAGCATGAT 17700

GTTCAAATTA TGAACATGGT ATTAAGTGCA GATCCTGATT GTTCACCACA AATGGCTGCA 17760

ATGATTGGTT CATCTATGGC GCTTAGTGTG TCGGATATTC CATTCCAAGG GCCAATCGCC 17820

GGTGTAAATG TGGGTTATAT TGACGGTAAA TATATCATTA ACCCAACAGT AGAAGAAAAA 17880

GAAGTTTCTC GTTTAGACCT TGAAGTAGCT GGTCATAAAG ATGCGGTAAA CATGGTAGAG 17940

GCAGGCGCTA GTGAGATTAC TGAACAAGAA ATGTTAGAGG CGATTTTCTT TGGTCATGAA 18000

GAGATTCAAC GTTTAGTTGA TTTCCAACAA CAAATCGTCG ACCACATTCA ACCTGTTAAA 18060

GAAGAAAAAG GACTTAAAGA AACAGTTTTA ACATTTGATA AACAACAACG AGATGaAAAT 18180

CTTGATAACT TAAAAGAAGA AATCGTCAAT GAATTTATCG ATGAAGAAGA TCCAGAGAAT 18240

GAaTTACTTA TTAAAGAAGT TTATGCAATT TTAAATGAAT TAGTGAAAGA AGAAGTTCGA 18300

CGTTTAATTG CAGATGAAAA AATTAGACCA GACGGCCGTA AACCTGATGA AATCCGTCCA 18360

TTAGATTCTG AAGTTGGTAT TTTACCTAGA ACGCATGGTT CAGGTCTATT TACACGTGGT 18420

CAGACTCAAG CACTTTCAGT TTTAACATTA GGTGCTTTAG GCGATTATCA ATTAATTGAT 18480

GGTTTAGGAC CTGAAGAAGA AAAAAGATTC ATGCATCATT ACAACTTCCC GAATTTTTCA 18540

GTAGGTGAAA CTGGTCCAGT ACGTGCGCCA GGTCGTCGTG AAATTGGACA TGGTGCGTTA 18600

GGTGAAAGAG CATTAAAATA TATTATTCCT GATACTGCTG ATTTCCCATA TACAATTCGT 18660

ATTGTAAGTG AGGTACTTGA ATCAAATGGT TCATCATCTC AAGCGTCAAT TTGTGGATCA 18720

ACATTAGCAT TAATGGATGC GGGCGTACCG ATTAAAGCAC CAGTTGCTGG TATTGCTATG 18780

GGCCTTGTTA CACGTGAAGA TAGCTATACG ATTTTAACTG ATATCCAAGG TATGGAAGAT 18840

GCATTAGGTG ATATGGACTT TAAAGTCGCT GGTACTAAAG AAGGTATTAC AGCAATCCAA 18900

ATGGATATTA AAATTGACGG TTTAACGCGT GAAATTATCG AAGAGGCTCT AGAACAAGCG 18960

AGACGTGGTC GTTTAGAAAT AATGAATCAT ATGTTACAAA CAATTGATCA ACCACGTACT 19020

GAATTAAGTG cTTACGCGCC AAAAGTTGTA ACTATGACAA TTAAACCAGA TAAGATTAGA 19080

GATGTTATCG GACCTGGTGG TAAAAAAATT AACGAAATTA TTGATGAAAC AGGTGTTAAA 19140

TTAGATATTG AACAAGATGG TACTATCTTT ATTGGTGCTG TTGATCAAGC TATGATAAAT 19200

CGTGCTCGTG AAATCATTGA GGAAATTACA CGTGAAGCGG AAGTAGGTCA AACTTATCAA 19260

GCCACTGTTA AACGTATTGA AAAATACGGT GCGTTTGTAG GCCTATTCCC AGGTAAAGAT 19320

GCGTTGCTTC ACATTTCACA AATTTCAAAA AATAGAATTG AAAAAGTGGA AGATGTATTA 19380

AAAATCGGTG ACACAATTGA AGTTAAGATT ACTGAAATTG ATAAACAAGG TCGAGTAAAT 19440

GCTTCACATA GAGCATTAGA AGAATAATAT TTAAAGTCAT ATGACGACAA TGTATCGTCA 19500

TGTGATTTTT TTATGCCACT TTTTACGAAG TGACCCGTTT TGAATTTGTT GTATTGAACA 19560

TTTTAAAACG CTTTATTATT TTGTGTGCAA CTGTTAATTA TCCTGTATGT ATAGTGATTA 19620

ATAGTGTACA TCAAGTGTTT TTTAACTTAT AATGAATAGT GAGTTTATAT ATGGACGGGT 19680

AACAAATTTA GGAGGTAAGA TTTTGAGTTT AATAAAGAAA AAGAATAAAG ATATTCGCAT 19740

TATACCATTA GGCGGTGTTG GCGAAATTGC TAAAAATATG TATATCGTTG AAGTAGACGA 19800

TGAAATGTTT ATGTTAGATG CTGGACTTAT GTTTCCAGAA GACGAAATGC TAGGTATTGA 19860

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CCTTACACAC GGACATGAGC ACGCGATTGG TGCAGTGAGT TATGTTTTAG AACAATTAGA 19980 

TGCACCAGTA TATGGATCTA AATTGACAAT AGCGTTAATT AAAGAAAATA TGAAAGCCCG 20040 

TAATATTGAT AAAAAAGTTC GCTACTATAC AGTTAATAAT GATTCAATTA TGAGATTCAA 20100

AAACGTGAAT ATTAGTTTCT TTAATACGAC ACACAGTATT CCTGATAGTT TAGGTGTTTG 20160 

TATTCACACT TCATATGGTG CCATTGTGTA TACAGGTGAA TTTAAGTTTG ACCAAAGTTT 20220 

ACATGGACAT TATGCACCAG ATATTAAACG TATGGCAGAG ATTGGTGAAG AAGGCGTATT 20280 

TGTCTTAATC AGTGATTCTA CTGAGGCAGA GAAACCTGGA TATAATACTC CGGAAAATGT 20340 

GATTGAACAT CATATGTATG ATGCTTTTGC AAAAGTGCGA GGTCGCTTGA TAGTTTCATG 20400

TTATGCTTCG AACTTTATAC GTATTCAGCA AGTTTTAAAT ATTGCTAGCA AGCTAAATCG 20460 

TAAAGTGTCA TTTTTAGGAA GATCACTTGA AAGTTCATTT AATATTGCTC GTAAAATGGG 20520 

GTATTTCGAC ATTCCTAAAG ATTTGCTAAT TCCTATAACA GAAGTTGATA ATTATCCTAA 20580

AAATGAAGTG ATAATTATAG CTACTGGTAT GCAAGGAGAA CCTGTAGAAG CCTTAAGTCA 20640 

AATGGCGCAA CATAAGCATA AAATTATGAA TATCGAAGAA GGCGATTCTG TATTTTTAGC 20700 

AATTACGGCT TCTGCTAATA TGGAAGTTAT CATTGCGAAT AcATTAAATG AGCtTgTtAC 20760 

GnCTGGCGCA CATATTATTC CAAATAACAA AAAGATTCAT GCTTCAAGTC ATGGTTGCAT 20820 

GGAAGAATTA AAAATGATGA TTAATATTAT GAAACCTGAA TACTTTATTC CTGTACAAGG 20880

TGAATTTAAA ATGCAGATAG CACATGCGAA GCTAGCAGCT GAAGCAGGTG TTGCACCAGA 20940

AAAGATTTTC CTTGTGGAAA AAGGAGATGT CATTAATTAC AACGGTAAAG ATATGATATT 21000 

AAATGAAAAG GTAAATTCAG GAAATATTTT AATAGATGGC ATTGGTATTG GGGATGTAGG 21060

AAATATCGTG TTGAGAGACC GTCATCTTTT AGCAGAAGAT GGTATCTTTA TTGCTGTTGT 21120 

AACOTTAGAT CCTAAAAATA GACGTATAGC TGCGGGACCT GAAATTCAAT CTCGTGGGTT 21180 

TGTATATGTA CGTGAAAGTG AAGACTTATT ACGTGAAGCA GAAGAGAAAG TACGTGAAAT 21240

AGTAGAGGCT GGTTTACAAG AAAAACGCAT AGAATGGTCT GAAATTAAAC AAAATATGCG 21300 

TGATCAAATT AGTAAACTAT TATTCGAAAG TACAAAACGT CGTCCTATGA TTATTCCAGT 21360 

AATTTCTGAA ATTTAATCAA AAAGTCATTA ACATAAAAGA GGTCAGAACA AGTCACTGAA 21420 

ATATAATGGT TGTCATGGAC AATTTACTTA TATTTTATGA TAGTCAATTG AAGGGGTAAC 21480

GATTAATCTG TTATCTTAAG TAAATTGATA CATAGATGAT ATTGTTCTAA CCTCTTTCAT 21540

CGTCTGTTTG GACTACATAT TCTAAACATC AAATAGGAAA TTATATATAA TAACGTCGTT 21600 

TTAACTAAGG CAACATAAGG AGGTGCGTCA ATTGGCACAA GCAAAAAAGA AATCGACAGC 21660

GATACGTTAT GTCATAGCTA TTTTAGTAGT TGTATTAATG GTGTTGGGTG TTTTCCAATT 21780 

AGGAATAATA GGTCGTCTAA TTGACAGCTT CTTTAATTAT TTATTTGGGT ACAGTAGATA 21840 

TTTAACATAT ATTTTAGTAC TCTTAGCAAC TGGTTTTATT ACATACTCTA AACGTATTCC 21900 

TAmaACTAGA CGAACGGCTG GTTCGATTGT ATTGCAAATT GCATTGCTAT TTGTATCACA 21960 

GTTAGTTTTT CATTTTAATA GTGGTATCAA AGCTGAAAGA GAACCTGTAC TTTCTTATGT 22020 

GTATCAGTCA TACCAACACA GTCATTTCCC AAATTTTGGT GGCGGTGTAT TAGGCTTTTA 22080

TTTATTAGAG TTAAGCGTAC CTTTAATTTC ATTATTTGGT GTATGTATTA TTACTATTTT 22140

ATTATTATGC TCAAGTGTTA TTTTATTAAC AAACCATCAA CATCGTGAAG TTGCAAAAGT 22200 

TGCACTGGAA AATATAAAAG CTTGGTTTGG TTCATTTAAT GAA 22243 
(2) INFORMATION FOR SEQ ID NO: 165:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5510 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 165:

TTATTAATnA TTAATATTTT TATTTTTAAA AATAAAGCGA GGAGCTATCA ATGGAACAAA 60 

TTACTTCTGC ACAAAATAAT AGAATTAAAC AAGCGAACAA GCTAAAAmAG AAACGTGAGA 120 

GGGATAAAAC TGGATTAGCT TTAATTGAAG GTGTGCATTT AATTGAAGAA GCTTATCAAA 180 

GTGGAATTGT AATTACACAA TTATTTGCAA TTGAACCGGC AAGATTAGAT CAGCAAATTA 240 

WCGCATACGC GCAAGAAGTT TTTgAAATAA ACATGAAAGT TGCTGAATCT TTATCAGGTA 300 

CAGTSACACC ACAAGGGTTT TTCGCAATCA TTGAGAAGCC GCATTATGAT ATTTCTAAAG 360 

CACAACAAGT ATTGCTCATC GATCGTGTTC AAGATCCTGG AAATTTAGGC ACATTAATTA 420 

GAACTGCGGA TGCTGCTGGA ATGGATGCTG TAATAATGGA GAAGGGTACG ACAGATCCTT 480 

ATCAAGATAA AGTGTTGCGA GCGAGTCAAG GTAGTGTTTT CCATTTGCCA GTTATGACAC 540

AAGATCTCGA TACGTTTATT ACTCAATTTA ATGGTCCTGT TTATGGTACA GCACTTGAAA 600 

ACGCAGTGgC ATACAAAGAA GTTACTTCAA GTGATTCTTT TGCATTACTA TTAGGTAATG 660 

AGGGAGAAGG TGTTAATCCT GAATTATTAG CACATACTAC ACAAAATTTA ATCATACCTA 720 

TTTATGGTAA AGCTGAAAGT TTAAATGTAG CGATTGCAGG TAGTATTTTA CTTTATCATT 780 
TGAAAGGTTG ACCGTGTTGA AAGTTTTCCG ATATAATTAT AATTAATTGT TTAACAGAAC . 84 0 
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10 



1$ 



20 



25 



30 



35 



40 



45 



50 
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ATAAATAATT 


GTTTTAGGGA 


GAATAATCGT 


GACTGCAAGT 


TATTCCAATT 


ATTTAAAGTC 


960 


TTTTCACCTT 


TTTGGTTACT 


TAAAGAGATT 


TAAGTCGGAA 


AGACAATCCG 


TTATCAATAT 


1020 


TAAACAAGTG 


TATGCTTAGG 


CATAAATTTG 


GGTGGTACCA 


CGGAAATGAC 


TTTCGTCCCT 


1080 


TATTTTTTAA 


GAGGATGAAA 




AGTTAAACAA 


CAAATATGAT 


AAATAGAAAA 


1140 


TGAATAGTTC 


GAATAGGGAG 


GTCAGTGACA 


TATGTCTGAA 


CAACAAACAA 


TGTCAGAGTT 


1200 


AAAACAACAA 


GCGCTTGTAG 


ATATTAATGA AGCAAATGAT 


GAACGTGCAC 


TGCAAGAAGT 


1260 


TAAAGTGAAA 


TACTTAGGTA AAAAAGGGTC AGTTAGCGGA 


CTAATGAAAT 


TGATGAAGGA 


1320 


TTTGCCGAAT 


GAAGATAAAC 


CTGCGTTTGG 


TCAAAAAGTG 


AATGAATTGC 


GTCAAACAAT 


1380 


TCAAAATGAA 


TTAGATGAAA GACAACAGAT 


GTTAGTTAAA 


GAAAAATTAA ATAAGCcAAT 


1440 


TGGcTGAAGA 


AACAATTGAT 


GTATCATTAC 


CAGGTCGTCA 


TATTGAAATC 


GGTTCAAAGC 


1500 


ATCCATTAAC 


ACGTACAATA 


GAAGAAATTG 


AAGACTTATT 


CTTAGGTTTA GGTTATGAAA 


15.60 


TTGTGAATGG 


ATATGAAGTT 


GAACAAGATC 


ATTATAACTT 


CGAAATGCTG 


AATTTACCTA 


1620 


AATCACACCC 


TGCACGTGAT 


ATGCAAGATA 


GTTTCTATAT 


TACGGATGAA 


ATTTTATTAC 


1680 


GTACGCATAC 


ATCACCAGTG 


CAGGCACGTa 


CGATGGAATC 


ACGTCATGGT 


CAAGGTCCAG 


1740 


TTAAAATTAT 


TTGCCCTGGT 


AAAGTGTATC 


GTCGTGACTC 


TGATGATGCG 


ACACATAGTC 


1800 


ATCAATTTAC 


ACAAATCGAA 


GGATTAGTTG 


TTGATAAAAA 


CGTTAAAATG 


AGTGATTTGA 


1860 


AAGGTACTTT 


AGAATTGTTA 


GCTAAGAAAT 


TATTTGGTGC 


TGATCGTGAA 


ATTCGTTTAC 


1920 


GTCCAAGTTA 


CTTCCCATTC 


ACTGAACCTT 


CTGTAGAAGT 


TGATGTGTCA 


TGTTTTAAAT 


1980 


GTAAAGGAAA 


AGGTTGTAAT 


GTGTGTAAAC 


ACACAGGATG 


GATTGAAATT 


TTAGGTGCTG 


2040 


GAATGGTACA 


TCCTAATGTA 


TTAGAAATGG 


CTGGTTTTGA 


TTCTTCAGAG 


TACTCTGGAT 


2100 


TTGCATTTGG 


TATGGGACCA 


GACCGTATTG 


CAATGTTGAA 


ATATGGTATA 


GAAGATATTC 


2160 


GTCATTTCTA 


TACTAATGAT 


GTGAGATTTT 


TAGATCAATT 


TAAAGCGGTA 


GAAGATAGAG 


2220 


GTGACATGTA 


ATGTTGATAT 


CAAATGAATG 


GTTGAAAGAA 


TATGTAACAA 


TCGATGATTC 


2280 


TGTAAGTAAT 


TTGGCAGAAC 


GTATTACGCG 


CACAGGTATT 


GAAGTGGATG 


ATTTAATTGA 


2340 


CTACACAAAA 


GATATCAAAA 


ATTTAGTTGT 


CGGCTTCGTT 


AAGTCAAAAG 


AGAAACATCC 


2400 


TGATGCTGAT AAATTAAATG 


TTTGCCAAGT 


TGATATCGGA 


GAAGACGAAC 


CTGTACAAAT 


2460 


CGTTTGTGGT 


GCACCGAACG 


TTGaTGCAGG 


ACAATATGTC 


ATTGTTGCTA 


AAGTAGGTGG 


2520 


CAGATTGCCT 


GGTGGTATTA 


AAATTAAGCG 


TGCCAAATTA 


CGCGGTGAAC 


GTTCAGAAGG 


2580 


TATGATTTGT 


TCGTTACAAG 


AAATTGGTAT 


TTCAAGTAAC 


TATATACCGA AAAGTTTTGA 


2640 
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ATATTTAGAT GATCAAGTAA TGGAATTTGA 
TATGATAGGT ACTGCTTATG AAGTTGCAGC 
5 GACAACATCA AATGAGCTTG ATTTATCTGC 

TGAAGATAAA GTACCATATT ATAGTGCACG 
GCCAATTTGG ATGCAAGCAC GCTTAATAAA 

10 

TGACATTTCA AATTATGTGT TATTAGAATA 
TGCGATTGGT TCACAACAAA TTGTTGTTCG 

1£ ATTAGATGAT ACAGAACGTG AATTATTAAC 

TCCAATTGCA TTAGCTGGTG TTATGGGTGG 
AAATATAGTG ATTGAAGGTG CTATTTTTGA 

20 TTTAAATTTA CGCAGTGAAT CATCTAGTCG 

AGATGAAGCA GTCGACCGTG CATGTTATTT 
AAAAGATAGA GTGTCTTCAG GAGAACTTGG 
TGATAAAATT AATCGCACTA TTGGATTTGA 
TAATCAACTA GGGTTTGATA CAGAAATAAA 
ACGTCGTAAA GATATTACAA TTAAAGAAGA 

30 

ATACGACGAT ATTCCATCAA CGTTACCTGT 
TGATCGCCAA TATAAAACTA GAATGGTTAA 

35 AGCTATTACG TATTCGTTAG TTTCTAAAGA 

TCAAACAATT GATTTATTGA TGCCAATGAG 
ATTJJCCACAT TTAATCGAAG CGGCATCATA 

40 ATTATTTGAA ATCGGCAATG TCTTCTTTGC 

TGAATATTTA AGTGGTATTT TAACTGGAGA 
AGAAACGGTT GATTTCTATT TAGCAAAAGG 

45 

TCTTGAATTT AGTTATCGCC GTGCTGATAT 
AATCTTATTA GAGAATAAAG TTGTTGGTTT 
TGATAATGAT TTAAAACGTA CGTATGTTTT 

50 

GTCGGTAGGT TACATTAATT ACCAGCCAAT 
TGCATTAGAA GTAGATCAAA ATATTCCAGC 

55 



TTTAACGCCG AATCGTGCAG ATGCTTTAAG 2760 
ATTATATAAT ACAAAAATGA CTAAGCCAGA 2820 

AAATGATGAA CTGACTGTGA CAATTGAAAA 2880 

TGTTGTTCAC GACGTGACAA TTGAACCCTC 2940

AGCGGGTATA CGTCCTATTA ATAATGTTGT 3000

CGGTCAACCA TTGCACATGT TTGATCAAGA 3060 

TCAAGCTAAT GAAGGCGAAA AAATGACAAC 3120 

GAGCGATATT GTCATTACTA ATGGACAAAC 3180 
CGATTTTTCA GAAGTTAAAG AACAAACATC 3240

TCCAGTTTCA ATTCGTCATA CATCAAGACG 3300 

TTTTGAAAAA GGAATAGCTA CTGAATTTGT 3360 

ATTACAAACT TATGCAAACG GAAAAGTGCT 3420 

TGCATTTATT ACACCAATCG ACATCACTGC 3480 

TTTGTCACAA AATGATATTG TTACTATTTT 3540 

TGATGATGTT ATTACAGTGC TAGTACCATC 3600 

TTTAATTGAA GAAGTTGCAC GTATATATGG 3660 

CTTCGATAAA GTTACTAGTG GTCAGCTAAC 3720 

AGAAGTGTTA GAAGGTGCTG GATTAGACCa 3780 

AGATGCTACT GCaTTTTCGA TGCAACAGCG 3840

TGAAGCGCAT GCGTCATTAC GTCAAAGTTT 3900

TAATGTGGCA CGCAAAAATA AAGATGTAAA 3960

TAATGGAGAA GGTGAACTAC CAGATCAAGT 4020 

TTATGTAGTC AATCAATGGC AAGGTAAGAA 4080 

TGTCGTGGAT CGAGTATCTG AAAAGTTAAA 4140 

TGaTGGATTA CATCCAGGTC GTACTGCTGA 4200 

TATTGGTGAA TTACATCCAA TATTAGCAGC 4260 

TGAGTTGAAT TTTGATGCAT TAATGGCTGT 4320 

TCCGAGATTC CCAGGCATGT CTCGTGACAT 4380 

AGCTGATTTA TTATCAACGA TTCATGCACA 4440
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AAAAGGTAAA AAATCAATTG CAATACGTTT AAATTATTTA GACACAGAAG AAACATTGAC 

AGATGAGCGC GTTTCAAAAG TACAAGCGGA AATTGAAGCA GCATTAATTG AACAAGGTGC 

TGTTATTAGA TAATGATTTA AACCCCATGT ATAAGGATAT CTGAAGTAGA TTGATATCCC 

TAACATGGGG TTTTATTTTT GGGTTCACCA ATTTGGTTCC AATGCATTTA AAAAGTCAAA 

GAGGAACAGC GGAATACAGA TGATGcTTCG CACAACTGCA TAAAAGCCTC TAATGATTAA 

AAATCAAAGA GGCTTTAAAA TTTTTTGGGC TTTTTCACGA TTTTTAAAAT GCTTTTTTGA 

AATGGTATCT AAACGTGAAA GACCGTATTT TTTTATAATT TTGGCGGCGA TTACATCGAC 

TTTAGCACCG GCACCTTTAG GAATCGTCAT ATTAATATTT TTTGATATTT GATCCATATA 

TGTAACAAAT GCGTATCGAG AAATTATGCT TGCCACTGCA ATGGCTAATG ACTTCGATTC 

TCCTTTTGTT TCAAATTTTG TTTTCTTTGG AAGTGGTATA TCTGATAATG CGTAATGGCT 

ATACACTTCG CGTTTTGCGA ACTGATCAAT GACGATATAG TCTAATTGAG ACGAATCAAT 

TTTTTCAAGT ACATTTTTGA TGGCTTCATT ATGAAGGGCA GCTTTCATTT TTACTTGAGT 

CCAGCCTTTT GCTTGCTGAA TATTATATTT TTCATTGTGT AGTGTTAATA ATGAATGTGG 

TATGAAAGTA ACCAATTGCT CAGCAAGTTC TACAATTTTG GTATCGGTTA ATTTTTTTGA

ATCATCTACA CCCAAAGTTT TTAAAATAGG GACATGCTCT TTGGTAACGA AAGCAGCACA 

CACAGTCAAC GGACCAAAGT AATCGCCACT TCCAGCCTCA TCACTACCAA TACAGTTAAA 

TTGrTCATAC ATTAaAGTTg TcCAgAAAAG AATTAGCCAT ATTTnCCTTT 

(2) INFORMATION FOR SEQ ID NO: 166: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9623 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 166:
GnTTATACTT ATAAATTTTA CGGGGGTAAT ATAATACTtA TTTACCTGTA ATATATGATA 
ATTCTTCAGC GGCAGCTGCG TTGATAGTTC TATGAGAAAT GATACCTAAT CCTTTAACAT 
TGGATTCTGA AATAACGATA GAACCATCAC TGTTAACTTT TTCAACAAAT GCTACATGAC 
CGTAATGTTG ATCTGCACCA AATTGTCCAG CCTCAAATAC AACAGCAGCA TGACGTTTTG 
GTGTATGACT TACTTGATAA TCACGGTATT GAGCTCGATT ATTCCAATTA TGTGCATCAC 
CTAAATCACC TGAGATAGAT GTACCAAATT GTTTCATACG GTTATATACG TACCAAGTAC 
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ATGAATCATC ATAATCCTTG ATAGAACGTT 
CGTCAAACTG AGTTAATTGA TAGTGTTTAA 
5 GATCTGTAGC ATATGTTTTA GATAAGTGTG 

ATTTCCATGT TGGTTTATAA ATTGTTCGAT 
AGTAATCTTT TAGTGATTCT TTCGTGCTTG 

10 

ACAATTGATT ACCATCAGCT TCTAATGTGT 
CTTTGATACC GAATAAATTA TGGTTTGGTG 

15 ATTCTAAGAT TGCTTGGGCA ATCATGACAG 

GATGTGCATC TTTAGCAATT GATTTGACAA 
TAAATTGTCC GCTATCATCA TTGTTAGATA 

20 CACGTGTATC CTTTTGATTA ACATCGTTAT 

TCAATTCATC TTGTGTTGGT AACTGTGGAT 
TTTTAGATTG AGATGCATAA TCTTTTTGTG 

25 AAATAGAGTC TAAAGCCGAA TCTGACATTG 

TTGCTTTATC GTCACTTGCT GGTTGACTAT 
AATTTGGTTG CTTATTAGAT GTACTTGGTT 

30 

TGTCTGCTTT ATCTTGTTTA GATGATTGCG 
TGTTTTTATT CGAATCATTT GTTGACTTTT 
3S TATCCGAATT TAAATTGAAT AAGTTTTGGA 

ATTTATTTTT GGTTAGCAAT TGGTTTATAT 

CAATffiATATT gttagagtct gaagtgctgt 
40 tgtcttggtt acttgtatta tttttgtctg 

TAGAAGTTTC ATCGTCATTA GATTTTTTTG 
CTTTTTGAGG TGTATCAGCA TAAGCGgTAG

45 

TTGATAGCAA ATAAATTAAA ATTTTATTTT 
TATAATAATT AAGTGTGATA ATAAACTATG 
AGTTGATAGG TATCAATCGA CTAAATATCT 

50 

GCAAAAATAA ATTAATTTAC AAAAAATATA 
TGTGACAATG AAGAACGCAT TTAAATTATT 

55 



CATATTTATC TAAATCTGGC ATGCGTTCAT 480 

TAATACTGTT TAATTTCTTA GCATAGTTTG 540 

ATGTTGCATC TTTATAAGAA TCGGCTTCCG 600 

TGCCATCAAT ACCATTTTTA ATAAGGTCAG 660 

GATATTTTCG GAATCCAGCA TTAATACTAT 720 

TAAAAGGAAC AGAATTCCCT TCaAAAGCAC 780

ACwTAGCTAA AGCACTACGA CCTGAGTCAG 840

ACGCATAAAT ATCGTTATCT TGACCAATGC 900 

ATTGACGTGT ATCTTTTGAG TCAACAACGT 960 

TACTAGGATC TGTTTCGAAT AATGATGTTG 1020 

TGAATGATTG AGCAGGTTTA GATTTATGTT 1080 

TCTTTGTATT AGATTTTTCA TTTTTGTCTT 1140 

TTTTCTTTGC ATCTTCACTG TATTGATCCA 1200 

ATTGATTATC TTTCGATGAA GATTTTTGAT 1260 

TTGATTGATT AGGTTGTGTT GGCTTTGGCG 1320 

TTGTATTGTT TGATTTAGGT GCTTTTTGAT 1380

TATCAGTGTC ATTTTTGATG CTATTGTCAC 1440 

CGCCATTACG AGGTTGTTCG TAATCAGAAA 1500 

TTAAAGTTGT TAATGAGTAA TTATCATCGT 1560 

TGGTTTGTGG TAAATTCTTA TAAATAAAAT 1620 

CGTCTATAGT TTTAAATTTT TTGTCGTTAT 1680 

CTTTATCAAT ATCTTTACTT GTAGTATCCT 1740 

AATCATGAGA TGTTGTCTTA GCTGTAGTAT 1800

GTGAAaCTAA AGTAGGTAAT ACGAGCGTAG 1860

TAGGCATATT TCGTATTCTC CCTTGAAAAA 1920 

ATTTGTTATA ATTTATCGTA TGCTGAAAAT 1980 

TCCAGTAAAT TGATTATACT AATTCACAAC 2040

TAAAAAATAT GAATAATTCC TACATAGGAG 2100 

TAAAATGGAT CTGAAGAAAG TAGCTAAGAC 2160 
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TAACTTATGG GCAATGTGGG ATCCATATGG 
TAATGAAGAT AAAGGCGACA CAATCAGAGG 
TAATACACTC AAGAAAAATA AAAGTTTTGA 
TCATGAGATA AAAATGGGTA AATATTTTGC 
TGAAATTACA GGGACACTAC GTAAGCAGCC 

10 

TCAGAAGATT AACGCTGTTG CGTCTAAGCT 
AAAAGCGAAT GAACAATTTA ATAAAACAGT 

15 AGCAGGTTTA ACTATTGAAG AAAATGTGCC 
TTCAGCAGAT AAAGCTTTAC CTAAGATTAA 
TAACCACCAA GCGGATTTAG ATAAATATGC 

20 AGGTGATATT TTAGATGCTC AGAAAAAATT 
TAATGAAAAG GCTAAGTTGA TATTAGCTTT 
GTTAAATTTT GCAGCTGATG ACGTGCCAGC 
CATTGCGAGT CAAGGTATTG ATCAAGCTAA 
CACACAAGTT AGAAGTAGAG TCGGTGATTA 
AAATCGAAGA AACCAGCAAC AGATTCCTCA 

30 

TAGTGCACCT GCAGCTGGTA ATGGTGTAGC 
TACTGCACCA AATAATAATG TTACGCAAAA 

3S ATCGACTACA CCACAAAGTA CAAGCGGGAA 
AACAACACAA GTCAGCACAG CTAACGAGAA 
ATCAtfTGGAA GCGGCATTAA CGGGCTCTTT 

40 AGCGAAAGCC GCACAAAAAG ATAGTCAGGC 
ATCGGACAAG CCTTCTGATT TTAGAGAGTC 
CACAACGCAA TATAATCAAC AATTTATCGA 

45 

TGTTGATTTA TCAAAAGAAA TTGATAAGGT 
ATTAAGGTTA GTTAATCAAT TAAGCAATGC 
AGCTACTAAA TTACTAGATC AACTTTCAAA 

SO 

TTATGTTAAA AAAGATCTTA ACAGCTCTTT 
ATTGAACAAA GGGCAAACTG CATTATCCAA

55 



CAACACGGGA CAGATCAAGG TCGCAGTCGT 2280 

GAAAAAAGTT AATGTCGGTA ATACGATGGT 2340 

TTGGCAGTTT GTAAGTAGAG AGAAAGCTGA 2400 

AGGTATTTAC ATCCCATCTA AGTTTACACA 2460 

TCAAAAAGCA GATGTAGAAT TTAAGGTGAA 2520 

AACAGATACT GGTTCGTCAG TTGTCGTTGA 2580 

AACTCGAGCA TTATTAGAAG AAGCTAACAA 2640 

GACAATTAAC AAGATAAAAA ATGCGGTATA 2700 

TGACTTTGCG AATAAAATTG TATATTTGAA 2760 

CAATGATTTT AGAAAACTAG GAAATTATAA 2820 

AAACGAaGTC AATGGTGCTA TTCCGCAACT 2880

AAATAATTAT ATGCCGAAAA TTGAAAAAGC 2940 

GCAGTTCCCT AAAATTAATC AAGGACTTAA 3000

TGGACAGTTA AATGATGCCA AAGGCTTCGT 3060 

TCAAGATGCA ATTCGACGCG CGCAAGATTT 3120 

AAATAGCGCG GCGAACAACG AAACATCAAA 3180 

ATCAACGCCA CCAAGTGCAC CAAGTGGCGA 3240 

TACCGCACCA AATAGTAATA ATGCGCCTGT 3300 

AAAAGATGGT CAAAGTTTTG TAGATATAAC 3360 

CACACAAAAC ATTACAGATA AAGATGTTAA 3420 

ATTATCATTA TCAAATAATT TAGATACCCA 3480

ATTACGTAAT ATTTCGTATG GGATTTTAGC 3540 

TTTAGATAAT GTTAAGTCCG GTTTAGAATA 3600

TACATTAAAA GAGATTGAGA AGAATGAAAA 3660 

AAAAGCAGCT AATAATCGAA TTAATGAATC 3720 

ATTAAAGAAT GGTAGTTCAG GAACTGCTGA 3780 

ACTAGATTCA TCATTATCAT CATTTAGAGA 3840

AGTATCAATA TCACAACGTA TTATGGATGA 3900 

TGTTCAGTCT AAATTAAATA CAATTGATCA 3960 
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AACAGTATTA CCAAGTATTG AACAACAATA CATTAGTGCT GTTAAAAATG CTCAAGCAAA 4080 

CTTCTCGAAA GTGAAAAGTG ATGTAGcTAA AGCTGCTAAC TTTGTGCGCA ATGACTTACC 4140 


ACAGTTAGAA CAGCGATTAA CTAATGCGAC AGCAAGTGTG AATAAAAATT TACCAACGTT 4200 

ATTAAATGGT TATGATCAAG CGGTAGGATT ACTAAATAAA AATCAGCCAC AAGCGAAAAA 4260 

GGCTTTATCA GATTTAGCTG ATTTTTCTCA AAATAAATTG CCTGATGTTG AAAAAGATTT 4320 

GAAAAAAGCG AATAAAATTT TCAAGAAATT AGACAAAGAT GATGCAGTCG ACAAATTAAT 4380

CGACACACTT AAGAATGATT TGAAAAAGCA AGCGGGTATT ATTGCAAATC CTATTAATAA 4440 

GAAGACTGTT GATGTTTTCC CAGTTAAGGA TTATGGTTCA GGTATGACAC CATTCTATAC 4500

TGCACTGTCA GTATGGGTAG GTGCACTCTT GATGGTAAGT TTATTAACGG TTGATAATAA 4560 

ACATAAGAGT CTAGAGTCAG TCTTAACGAC AAGACAAGTG TTCTTAGGTA AGGCAGGATT 4620 

CTTTATAATG CTTGGTATGT TGCAAGCACT CATTGTATCG GTTGGAGATT TGTTAATCCT 4680

AAAAGCAGGA GTTGAGTCAC CTGTATTATT TGTACTTATA ACGATTTTCT GTTCGATTAT 4740 

TTTCAACTCA ATCGTATATA CGTGCGTATC ATTACTTGGT AACCCAGGTA AAGCCATTGC 4800 


AATCGTATTG CTTGTATTAC AAATTGCAGG TGGTGGGGGA ACATTCCCAA TTCAAACTAC 4860 

GCCACAATTT TTCCAAAACA TTTCGCCATA CTTACCATTT ACGTATGCAA TTGATTCATT 4920 

ACGTGAAACA GTAGGCGGTA TTGTTCCGGA AATCCTAATT ACAAAATTAA TTATATTAAC 4980 

GTTATTTGGT ATAGGATTCT TCGTTGTAGG TTTAATTTTA AAACCTGTAA CAGATCCATT 5040

GATGAAGCGC GTATCTGAAA AAGTTGACCA AAGTAACGTT ACAGAATAAA AATTAAATCC 5100 

ACACATTAGG GTTATAGCTC CTTAATGTGT GGATTTTTAT GTTTTTAGAC AGAAGAGATA 5160

GTAATTTCTG TCTTTTATGG GACGGTTGTT ATCATTGCTA TTATCCAGGA TGACTTACTA 5220

TAGGACTAAT ATTACCGACA AAGTGAATAT CCTCGTCTTC CGTAGTTAAA ATAAAGCTAG 5280 

AACCTTTTTG GATGTCATAG TGCTTATCGT TTACTGTTAA AGTACCAGTA CCATCGATAA 5340

TTGTAACTAA GCAATAAGCA TGTGGTTTAT TGAATTTTAA ATCTCCATGA ATATCCCATT 5400 

TATATACTGC AAAATATTGA TTATCTACAA ATTGAGTTAC AGTGTGTGTG TCGATGTGAG 5460 


TTGTTATAGG AGTAGTATTT GGTTCATGAT TGCCTAATTC AATCACATCT TTACTTTGCT 5520 

CTAAGTGCAA ATCACGCAAT TGACCATTTT GATCTCGTCT ATCATAGTCA TAAATACGGT 5580 

ATGTCGTATC GGAGGATTGT TGTGTCTCTA AAATTAAAAT ACCCGAACCA ATGGCATGGA 5640 


CAGTGCCAGC AGGAACATAA TAAAAGTCAC CGGGCTTAAC AGGTATACGT TTGAAAAGAC 5700 

TGTCAAATTC ATGATTATCA ATCATGTCTA TTAACGTCTG TTTATTATGT GCATGTACGC 5760 
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GTTCGCCTTC GTGTTTTAAA GCGTAGTCAT 


CATCTGGGTG 


AACTTGAACA 


GATAATTTAT 


5880 




CATTGGCATC TAATACTTTA GTTAGCAGAG GGAAACTATC TCGTGAATCA TTATCGAATA 5940
ATTCACGATG TTGTGACCAA AGTTGATCTA GGGTCATATC CTTGTATGGA CCATTGATAA 6000
TTGTATTAGG ACCATTTGGA TGTGCAGAAA TTGCCCAGGA TTCACCAGTT GTTTCATTAG 6060
GGATATCATA GTTAAATGCT TTTAATGCAT GACCGCCCCA AATTCTGTCT TTAAAAACGG 6120
GTTGTAAAAA TAATGCCATA GTTAAAACTC CTCTATATTT TCATTAATAA GTTATAAATT 6180
TCTGTAGTAC TGTTTGCATT AATTAGTGAT TGGCGTGTCT CATCATTCAT TAACGCTTTA 6240
GATAAGCGCT GAAGTATTTT TAAATGTGTA TCCTGACTGT TGTTTGGTAC GGCAATTAAG 6300
AATATCAATT GAGGTAGACT ACCATCTAGA CTGTCCCATT TAACACCATG ATTATTTTTC 6360
ATAACAGCTA CAATCGGTTG TTTTACAACA TCAGACTTTG CATGTGGAAT GGCCACGTTC 6420
ATGCCAATAG CTGTCGTAGm tCcATTTCAC GTTCTAGTAT TGCATTTTTT AAATGCGATG 6480
TGTGCTCTAC ATAACGGCAA ATTTTAAGTT TATGAATCAA CATATCAATT 6540
GAGACATGTC GTGATCAGTA ATTATCATAG TTTGTTGATC AAAAACATGA GAAGGTTTAT 6600
TGAGATGTGA ATGTTTCGCG GTGTTATCTA CATTGTCAAC CTCTGTATCA TGTTGTGTAA 6660
TATCTGTATC ATGAAGTTGC GTGTGTTGCG CTGGTGCATC TACTGCTATA ACTGGTGTAT 6720
TGCGTTTTAA TAATAGTACA GTAGTCATTG TGACAAGACT ACCTACTATC ACTGCAAAGA 6780
TAAACCATAA TACATGATCA ATACCACCTA ATACAGCCAC GATTGGACCT CCATGTGCGA 6840
CTCTATCGCC GACACCACCA ATGGCTGCAA TGACTGATGC AATCATTGCA CCAATGATGT 6900
TTGCAGGTAT AATGCGCAAT GGATCTTGGG CTGCGAAAGG AATAGCACCT TCAGTAATAC 6960
CAAATAGTCC CATAGTGAAG GAAGCCTTAC CCATTTCTCT TTCGGAATGA TTGAATTTAT 7020
ACTTTTGAAC AAACGTTGCT AAACCTAAAC CGATTGGTGG TGTACATACA GCAACTGCGA 7080
CCATACCCAT AACGGCGTAA TTACCTTCAG CAATAAGTGC TGAGCCAAAT AAAAATGCTA 7140
CCTTGTTTAC TGGACCGCCC ATATCGAAGG CAATCATCGC ACCTATAATC ATCGCAAGTA 7200
TAATAATATT AGCACCTTGC ATACTTTTTA ACCAGGTTGT TAATGCCTCA AAAATATTAG 7260
AAATTGGTGC ACCGATTAAA AATATAAATA TCAATCCTAC AACGACCGAT GAAATAATGG 7320
GAATAATAAT GATAGGCATA ATTGGTGCCA TTGCTTTTGG AACTTTAATA TCTTTAATCC 7380
ACTTTGCGAT ATAACCTGCT AAGAAACCAG CAACAATACC ACCTAAAAAT CCTGCGCCTG 7440
CATCACTGCC ATAAAAACTA CCGTCAGCAG CGATAGCGCC GCCAATCATA CCAGGAACAA 7500
GACCGGGcTT GTCAGCGATA CTAACAGCGA TATATCCAGC TAGTATTGGA ACCATAAATT 7560
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ATCCTTTTGA TGTCGTTtCA . CCGCCTAGAG TCAGCGCGAT GGCGATAAGG AGTCCACCAA 7680 

CTACGATAAA AGGAACCATA AACGATACAC CGTTCATTAA ATGTTGATAC ACCATTTGAA 7740 


TACCATTTTT AGACTTACCG CGATCTTTCG AATGATAATT TGTTTCAGAT TGATAAATAG 7800 

GCGCATCTTG ATTAATGATA CGTTGAATTA GACCTCTCGG ATTATGAATC CCTTCGCGAA 7860 

CATTTTCATT AATCAACCGT TTACCAACAA ATCGGGACAG ATCAACTTGT TTATCAGCTG 7920 


CAATTATGAC ACCGTCAGCT TCTTCGATGT CTTGCGTAGT TAAAACATTT TCAGCACCAA 7980 

CACCGCCCTG TGTCTCTACT TTAATATCCA CACCCATTTC TTTTGCTACC TGCTCAAGCT 8040 

TTTCTTGAGC CATATATGTA TGTGCAATGC CATTTGGGCA TGAGGTAATA GCTACAATTT 8100

TCATAAAATC ATCTCCTTTT CTATATTGTA AGCGTATTCT CGATACTAAA AAAAAGAATA 8160 

ATTACCGTTA CTAGTGGCAA TTATTCTTGT AAGTATTCAA ATAACTGTTG CTTTAAACTA 8220 

TGATCATCTA AACTACATAA ATGGTTCACT GAATCATCAT CCAAGTTAGC AATTAATTGC 8280

ATCATTTGTT TTGTAAAAGC TTTGTCTTTA TGCGAAATCG CTAAGAAAAA GACAAGTTTG 8340 

ACATCGTGTT GTCGCCAAGG AAAAACATCT TTTGTGCGAA AAATAAGCAC ATGTGATTGT 8400 


AAAACTTTTT CAGGATCTCC ATGAGGAATC GCCATAAAAT TACCTATGTA TGTAGAAGAT 8460 

GATTTCTCAC GCTCTAAAGC TGATTCGATA TATCCTTCTA CAATCGCATG ATGTGCTTGT 8520 

AATATTTTTT GAGCTTCTTC AAAAATTTGC ACAGTATGCC GTGATTTTTG TTCAGTATTT 8580 


ACGACAAGGA AATTGACAGT GTCCATATGA TGATGTGCTT GAACCGGATT TTGCTTTTGC 8640 

TTCACAACGT GTCTGATTTT GTGACGATCA TCTTCAGAAA ATAATGGTGC AACCTTGATA 8700 

GTCGTCAGGT GCTTAGGAAG TATGTTTAGC GTTTGTTTAG GAATATCATG GGTCGTTATT 8760 


AATAAATCTA CATTGTCAAA GTGATAGTGT GTTATATTTT CTAGTTTAAT CGTATTTATC 8820 

ACTGACAACT CTTCGGATAA GTTATTTATT TTAGTTTCTA AAAAATTCGA CACACCTAGA 8880 

CCATAATAAC AAGCAATGAC TACATTTAAT TGTGTTTTGG TACGACGCTC GATGGCAGCT 8940

TGAAAATGAA TTGTTAAAAA TGCAATTTCA TCTTCGCTCA TCTCTATATC AGTATCAATT 9000 

GCTAATTTAT CAATCGCTTC AAAAAGTGTG TTAAACACAA AGGGATAGAG TTTTTTAATC 9060 

TCTATAACTA AAGGATTGTT TAAATAAATG TTTTGAGTGA TACGTAAATA TGCTTTACTA 9120

AAATGATTAT ATAAATTTTG TTGTAAAATC GAATCTTCAT TGAAAGGTAC ATGAATACGT 9180 

TGCTGCATCA ATTCGATTAA GCGATCAATA TAACTTTGTA TAAATATACG TTCTATGCCA 9240

ATATCGAGTT TATTAAAATG ATAAGCAATA AAGAATGAAA ACATATTGAT TACTTTTTCG 9300 

TTCAAGTCAT AACCTAATCT TTCGTTGATT TGCTTAATGC AAGATTGAGA TATCAATTTT 9360 
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AGATGAATTA AAAGCTGTTG TATTTGAATA TCAGTTGTTT CAATACTATG TTGTTGAAGT 94 80 

GTCTCTTGTA TAATATGCGA AATCATCCTT TGGTGTGAAT CAGGTAATTC aTTTAAAATT 9540

AGGTCTTCAA CATGTACATG CCCTGATGAT AATTGATTTA AATGGATGAT GGCATTAGTG 9600 
ATATCATTAT CTGTTCCATC GAC 9623 
(2) INFORMATION FOR SEQ ID NO: 167: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1021 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 167: 






ACCGTGGAAA [illegible] GCGATAAAAA TGTGACTAAA TCATCTCAAG 60
AGGAAAATCA AGCAAAAGAA GAATTACAAA GCGTTTTAAA CAAAATTAAC AAACAATCAA 120
GTAAGAATAA TTAAAAAATT TTGATATTGT CTATGTTTAT AGTTCACAAG CCATTCAACG 180
TATTGTAAAC TAAGGATAGT GTAITITTTT AATAGTAATT TGTCAGGAGG TGCCTATCTA 240
TGGAAGAACA TTACTACGTA AGTATTGATA TTGGATCATC AAGCGTAAAA ACAATAGTAG 300
GCGAGAAATT TCACAATGGT ATAAATGTGA TAGGTACAGG ACAAACCTAC ACGAGCGGTA 360
TAAAAAATGG TTTAATTGAT GATTTTGATA TTGCGCGACA AGCAATCAAA GACACAATTA 420
AAAAGGCATC AATCGCTTCG GGTGTTGATA TTAAAGAAGT TTTCCTGAAA TTACCTATCA 480
TTGGAACGGA AGTTTATGAT GAATCAAATG AAATCGACTT TTATGAGGAT ACAGAAATCA 540
ACGGTTCACA TATCGAAAAA GTATTAGAAG GTATTAGAGA AAAAAATGAT GTGCAAGAAA 600
CAG^AGTAAT TAATGTGTTC CCGATTCGTT TTATAGTCGA TAAAGAAAAT GAGGTTTCAG 660
ACCCTAAAGA ATTAATTGCC AGACATTCAT TAAAGGTTGA AGCAGGCGTA ATTGCTATTC 720
AAAAATCGAT TTTAATTAAT ATGATTAAAT GCGTAGAAGC ATGTGGTGTT GATGTATTAG 780
ATGTTTACTC TGATGCATAT AACTATGGTT CAATCCTAAC AGCTACTGAA AAAGAGTTAG 840
GTGCATGTGT CATTGATATT GGTGAAGACG TTACGCAAGT TGCTTTTTAT GAACGCGGTG 900
AATTAGTAGA TGCTGATTCT ATCGAAATGG CAGGGCGTGA TATTACaGAC GATaTTGCAC 960
aAGGrTTaAA CACTTCTnAT GAAACTGCTG nAAAAAGTTA AACACCAATn TGGTCATGCA 1020
T 1021



(2) INFORMATION FOR SEQ ID NO: 168: 
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(A) LENGTH: 7963 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 168: 

TAATCTATTA TAAAAACTGT CCATACCCTT TGATTACCTT CTCTTCAGGT ACAGGCCACA 60
CTTGAGGCCA TAAGCCATAT GCTTGCTGTG AATAAAATTG TGCCATTTGT AACAATATAA 120
TATATACAAA TAAACACCCA ATAATTGCTG TCACTAATGG ATATGATAAC CAAACCATTA 180
ATAAAACTGC AATAATTACT AACCTAAAGA TAATATTAAA TGCGTCTCTC CCTCTTATAA 240
AGCTTCTAAT AAATAAGAAT AAATACATCG CATTAGAGTT AAATTTACTA CCCTTTGGAA 300
CTGGTAAAAG TATATCTAGA TAACTTCTTC TGACTGCAGA TTCTTTCAAA TGTTTTACAT 360
CGGTGAACAT ATTAACAAAT TTATAATAAT TCATATGATG TCGATGTTCG ATTGCAATCA 420
TTTTCTCCCA AGGATACAAA AAGCCTGGTT TATATTTTTT AACTAAAAAT TCTATTAACA 480
CAGGCAAAGC AACCATCACA AATGCGATGT ACCATTTTGG AGCTAATAGT AAGTAATATG 540
TTAGAGCAAA GGTGATGAAT GATATTAAAT TAACTTGCCA TGTTTTAAGT CCCGATTGAT 600
ACCATTGCCA TCTTAAGCGT AAACCAACAT ATGGAAAAAT TAATGCACTG ACTCCAAAAC 660
AAATATAAAA TGCCACATTA TGTTGATTAA TATTGTAAAA CAACGGGAAC ATTACAATAA 720
CAATAATGAG TTGGATTAAT ATGCGCGCAA AGTAACTATA TAAAATCGCA TGACGCATAA 780
ATTGAGACAT GTGTTTTTCA AATGGTAATA AAAAGATTTT ATCCgCTTCT TTTAACAGTG 840
GTCsCmTTGG AAAAATAGrT GTCAACGCAA CAATCACTGC TGCTATTaAT GAAAAATTGa 900
TATTCGTTGG AATATGTTTT AACCATTcAC CATATCCArA AATAAATGCA CCCAGCAAAA 960
TAAGTAAAAA GACCATGAAA TGACCATTAA ATATAAACTT ATTATAATAA TTTTtCTCTT 1020
TACGAAGGGC ATGTAATCTT TTATTAAATA ATGTGGTAGC TTGGTTACGC ATGTACATCT 1080
CCACCTTGCG TCACATGAAT ATATATATCG TCTAATGTTT GATTATGTAA GCCAGTTTGT 1140
TGTCTCAATG CTTCTAAATC TCCAAATGCA ACGACTTCAC CTTCGTCTAG TATGaTAAAA 1200
CGATCACAGT AACGTTCAGC TGTTGCTAAA ATATGTGTAC TCATTAGAAC GGTTCTACCT 1260
TCGTTTTTCT TTTCAACCAT TAAATCTAAC ATGGATTGAA TTCCTAATGG ATCTAGGCCA 1320
AGGAATGGTT CGTCTATAAT ATACAATTCG GGATTAACGA TAAACGCACA AATAATCATG 1380
ACTTTTTGTT TCATCCCCTT AGAAAAATGA CTCGGAAAAA CTTTCAACTC ATTTTCTAAA 1440
CGGAATGTCT TTAATAATGG CATTGCTCGA TTCATCGTTT CATCACGATC AATATCATAT 1500
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TCCGGAATAT AAGATAACTT 


TCTTCTATAA 


GCCTCTATGT 


CATCATTAAT 


GTTGATATCT 


1620 




GAAATTGATA GAGATCCTTC CATAGGTGTA AGCAATCCTA GCATATGTTT AATCGTTGTA 1680
CTCTTACCAG CGCCATTAAG GCCAATAAGT CCAACAATTT CGCCTTTGTT TAATTCAAAA 1740
TTTATATCTT TAATTACAGG GCGTTTTCCA TATCCACCTG TAAGCTGTTC TACTTTAACT 1800
GTCATAAGGC ACCTCCATGA CTTATATTGT ACCAAAAATT ATAAAATGCT CATATTAAAT 1860
ACACATGTCC TAATATCGAA TTTTTAGCGA CAATGTTATA ATGAATGGTA ATACTAGTTG 1920
AAAAGGAGTG TAGTCATCAT GTCAGAAACA ATTTTCGGCA AAATTTTAAC TGGAGAAATT 1980
CCTAGCTTTA AAGTATATGA AGACGATTAT GTCTATGCCT TTTTAGATAT ATCACAAGTT 2040
ACTAAAGGAC ATACGTTATT AATTCCTAAA AAAGCTTCTG CTAATATCTT TGAAACTGAT 2100
GAAGAAACAA TGAAACATAT CGGTGCAGCA TTACCTAAAG TAGCAAATGC TATTAAGCGT 2160
GCATTTAATC CTGATGGTTT AAACATTATT CAAAATAATG GTGAGTTTGC AGATCAATCT 2220
GTATTTCATA TTCATTTCCA CTTAATTCCT CGATACGAAA ATGATATTGA TGGATTTGGT 2280
TATAAGTGGG AAACACATGA AGACATTTTA GATAACGATG CAAAACAACA AATTGCTGAA 2340
CAAATTCAAG CACAATTTTA AATGTATGCT TAATCTAAGC TCGAACGGGT ATAATATGAT 2400
TAATATTATA ACAATTGCGT TTGAAGTGAT AACATCAAGG TTAGCAATTT TAAACAAAAT 2460
GAGTTATCAA GATAACAGAT GTTAAAAGTG AGGAGAATAT AAATGAAAGC ATCACGCATT 2520
CTATTCGGTA TCGGTGTTGG CGTAGCAGCT GGTTTTGTAG TTGCACTTCA AGGACGTGAC 2580
GACAAAAGTG TCAAGAACAA CACGATCGAT CGTACTGCCC CTACTGGTTC AAAATCAGAA 2640
CTACAACGTG AATTTGAAAC GATTAAACAA AGTTTTAATG ACATTTTAAA CTATGGTGTT 2700
CAAATTAAAA ACGAAAGTGC GGAATTTGGT AGTTCAATTG GTGGTGAAAT TAAGTCATTA 2760
CTTGGAAACT TCAAATCTGA CATTAATCCT AATATTGAAC GTTTACAGTC ACACATCGAA 2820
AATTTACAAA ATCGTGGCGA GGATATTGGA AACGAAATTT CTAAGTAGCA GGTTACGTTC 2880
TCGATCACAA CTATTTTTAT TAGTAACAGC ATATTTATTT TTTAAAATTA AATGCCAAAT 2940
AAACGAGATG ACATTAGAAA TTAGATATTT CTTGTCATCT CTTTTTTAAA ACTCAAATGA 3000
ACTTATGTTT ACAAATTATA GGAAGACATT GTTTGTAGTG ATTTTCGCTT AAATCATATT 3060
TATGAATTGA TTGAAAACAT TGCTTAGGAT TCATTGTGTT ATCCtTGCAC TTTGATTACG 3120
CTTTACTTAA ATCATTATCG ACAAACAACA TACTTATATT TTCATTGAGC CGAACCTTAT 3180
ATACACATTA CATATACCTT ACTTGCACAA ATTATTAATC TGGTGTTTAT TATAATTACA 3240
TATCACTATA TTTTTAGCAT TTGTATAACT TAGTTGGTCA AAAGATGCTT TTGCATATGC 3300
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TTTCATAAGT GATGCTTTAT TAGCAAGAAT ATGTGTTCGC AGAAATTTGT TCTGCATTCT 3420 

ACTTCTACGC TAGTCAATCA GACAATTTTA CCAATCCCCA CTTTCGCGTT TCAAATCAAA 3480

CAATACGTCG CTCCTTTCTT CTTATATAAC AATTCTTCTA ACATGATATG TTACTATTGA 3540

ATTACTGAAC CTGAGTTAGT TATAATCTAA CTTATATTGA AAAGAGATGA GGCGTAAGAT 3600

ATGTTTTTAT GTAAAAGACA AATTGATATC AATGCACGAT TTGGTTTGCC TAGAATTGCA 3660 


TTTATGAGTG CAGTTGCAAC CATCATTATG TTTTTAGTTA GTTATGAAGT AATGTATTTT 3720 

TTATCTAATA CGCCATTATC AGATAGACAT TTTCTCATCT TTTTATTACT TGTATTTATG 3780

ACGTATCCAT TACATAAAAG TATACATTTA TTATTTTTCT TACCATATAG AAAATCGTTT 3840

AAAGTTCATA AGTTAACTAA AAGAAAATGG CTTATATTCT ATAATACCTA CGTCAATCAA 3900 

CCTGTACACA AATTTTATTT TTGCATTAAC TTAATATTGC CGTTAATTAT CTTATCTGCA 3960 

ATGTTCGTTT ATCTAACAAT TTCATTCCCG CAATATGGAC ATTATTTTAT GTTCTTATTG 4020

GCATTGAATT TCGGTATTTC CATTACAGAT TTATTATATT TAAAAATAAT TATATTTTCT 4080

AATTATGGAC AATATATAGA AGAACATAGT ACAGGTATTA ATATTTTGAA AAAAATTAAA 4140 

AATCCATATC ATTTATAACA AAATAATTAT AGCAAGGTGT TATTATTTGT TTTTAGGCTA 4200

TGTAATAgcT tACAATCAAA TGTATATAGA CCTTGTTTTT TTATTTTCAT CAATTTCTAC 4260 

CCCTAAACCT AATGCTCTAG TCTGATGTCA TGGGTTATTG ATTGGTGATA ATATAAAACT 4320 

ATGTTATATT CACGATGATT AACTTACAAA GGAGTTTCAA CTATGAAGAT GATAAACAAA 4380

TTAATCGTTC CGGTAACAGC TAGTGCTTTA TTATTAGGCG CTTGTGGCgC TAGTGCCACA 4440

GACTCTAAAG AAAATACATT AATTTCTTCT AAAGCTGGAG ACGTAACAGT TGCAGATACA 4500 

ATGAAAAAAA TCGGTAAAGA TCAAATTGCA AATGCATCAT TTACTGAAAT GTTAAATAAA 4560

ATTTTAGCTG ATAAATATAA AAATAAAGTT AATGATAAGA AGATTGACGA ACAAATTGAA 4620 

AAAATGCAAA AGCAATACGG CGGTAAAGAT AAATTTGAAA AGGCCCTTCA ACAGCAAGGT 4680

TTAACAGCCG ATAAATATAA AGAAAATTTA CGTACTGCTG CTTATCATAA AGAATTACTA 4740

TCAGATAAAA TTAAAATCTC TGATTCTGAA ATTAAAGAAG ACAGCArGAA AGCTTCACAC 4800 

ATTTTAATTA AAGTTAAATC TAAGAAAAGC GACmAAGAAG GCTTAGATGA TAAAGAAGCG 4860 


AAACAAAAAG CTGAAGAAAT TCAAAAAGAA GTTTCAAAAG ATCCAAGTAA ATTTGGTGAA 4920 

ATCGCTAAAA AAGAATCAAT GGATACTGGT TCAGCTAAAA AAGATGGCGA ATTAGGTTAT 4980 

GTTCTTAAAG GACAAACTGA TAAAGATTTT GAAAAAGCAC TATTTAAGCT TAAAGATGGT 5040

GAAGTATCAG AGGTTGTTAA ATCAAGCTTT GGATATCATA TTATTAAAGC TGATAAACCA 5100
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AAAAATCCAA AATTATTGAC TGATGCATAC AAAGATCTAT TAAAAGAATA CGATGTTGAC 5220 

TTTAAAGATC GTGATATTAA ATCAGTTGTC GAAGATAAAA TCTTAAACCC TGAAAAACTT 5280 

AAACAAGGTG GCGCACAAGG CGGACAATCC GGCATGAGCC AATAACACAA AACCGAGCGA 5340 

CCGTGGTTCA AAAATCATAC CACGGCCGCT CGGTTTTTTC GCATTAAAAA TCGGACAGAT 5400

GAGCTCATGT TTCAGTATAC TCATCTGTCC GATATCTTTT AATTCTTAAT CGAGTGATTC 5460

AGGATTGTAG AATCTACGAT TTTCAAGACC AAATATTTTA TCTGTAAACT GACCCTTGTC 5520 

AGTTTTTTTA TATGCCTTTT CAAACATATT CATTCTAGCA TCGATATTAT CGATATAGCA 5580 

TAAAATTTCT GCTTCTTTTA AGTATGGCAG TTTTGGAGAA CCATACTCTA ACTTACCATG 5640 

ATGAGATAAA ATCATATGTC TTAACAACAT GATTTCTTCT CCTTCAATGT TCAATTCACG 5700 

AGCTGCTTCA ACTACTTCAT CACTCGCAAT CGAGATGTGT CCTAATAAGT TACCTTCGAC 5760 

TGTATACGAC GTCGCAACAG GACCACTCAA TTCTCTAACT TTACCAATAT CATGCAAAAT 5820 

AATACCACTA TATAACAAAC TTTTGTTTAA CAATGGATAA ATGTCaCAAA TTGATTTTGC 5880 

AATACGTAAC ATCGTTAATA CATGATAGCT TAAGCCACTC GCAAAGTTAT GaTGATGAGA 5940

ACTAGCAGCT GGATATGTGT AAAATCGTTC TTGATATTTT TTCAATAAAT GACGTGTGAT 6000

ACGTTGTAAA TTAGCATTTT CAATATCTAG CAAATAATGA GAAATCTCTT CTTGTATTTC 6060 

TGCCGGTGAT AAAGGTGCAC CATCTACAAA TTGTTCTGTT TTTAATTGAT CTTCAGTTGT 6120

CGCTAGTCTA ATTTGGTTGA CTTTCATCTG TTTATTTCCG CGATAGTTTA TGATGTCACC 6180 

TTTAACATGT ACAATTTCTT CAGGCTTGAT TGTTGCCATA TCATTTTTTG TAGCCGTCCA 6240

AAATTTCGCT TCAATTTCAC CACTTTTATC TTGCAAATGT AATGTCATAT AATCTTTACC 6300

TTGTGCTGTT ACACCCTGTG TAGCTTTATG CACTAAGAAA AAGTGATCAA CTGAATCTCC 6360 

GGGATTTAGA TTCTCTATAT TTCTCATCGT TTCCCGCCTT CCTCTATTTT GTTTAATGTA 6420 

ATCACTTCTT TTGATGGAAC AATATTATCT TTTACACATG TAAAGTATAG TACTTGATAG 6480

TGTTCTGATA ATGATCGTAA ATAATTCAAC ATTTTTTCAG TACGTTTTTT ATCAAAATGA 6540 

ACAAATGCAT CATCAACAAT TAATGGGAAC GGATAATATG GTCTTAGTAC CTTAATTAAA 6600 

CTGATACGTA AAGCTACATA AAGTAATTCT TTTGTAGATT GACTTAGTTC AACAGGATCA 6660 

TATAATTGAC CATTAACATG TTTAACCGTA ATTGAATCTT CATTATAGTT AATCATCGTA 6720 

TATCTGCCAT CTGTTAAATG CTTCAATATT TCTACCGCTT CATTAATAAC TTGAGGCAAA 6780 

CGTTTATCTT TAATTTGTTT AATGTGTTCA TCAACTAAAC TTTGTAAATA ACTTAAACTT 6840

GCCCAATCTT TTGCGATATC ATTAAGTTGA TTTTTAAGAC TGTGATATTC ATGTCTTAAA 6900
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GCTTGCATTT CAAGATATTG CTCATTATAT TCGTCAACTT GAGTAGCCAA TAAATGATCT 7020 

TCTTCTTCAA GTTGTGCAGT TGTTTTTTCA CTTAAACTAG AACTTAATTC ATAAGAATAG 7080 

TTTTGGTTCT CAAGATATTT AGTTAAATCA TTAAAACGAC TCAAATTACT AGTATAAGTT 7140

TGGTAATCTT CATGATGTTG GTAAAAATCT TCTTCAGTAC CAACATTGAT AAAATCGAAT 7200 

AGTGCTGTAA TTTCTTTATT ATTTTCTTCT AATTGAGCAT TTAAATGATT TAATTCATTT 7260

GTAACAAGTT TGGTATTTTC AGCATTAATA CGCCATTTTT CATTCGTGTC TTCAGCTGAT 7320 

TTCAACCATT GTtGCACATC GTGGAATAAA GATAATTTGT TGAAATAAAC AAATTGTGAT 7380 

TTTGTAACAG CTTCAGCATG ATTGTAGAAT GTATCTAATT CTTGAACCAA TTGCTGGCGT 7440 

TGTTGATTTA AATCACTGAT ATGTTGATCT AATGCTTTAA TATTCGCCAT TGTAGAAATA 7500 

CTATCAACAA TTAAATGATT TGAAATTTTA GATGATAAGT ATAATTCATC CTTAACGTTC 7560 

TCAACTGTCG ATTGTAATTC ATCATGACGC CCTTTCGCAT CATTTAAACG ACCTTCAATA 7620 

TACTGACGTT TCTCTTCTAA AATATCTTTA TTTTTCAAAG CTTGTTGCCA GTGATCACGA 7680 

ATGCGATATT GCTCATCAAG ATCAAAATCT AAGTCATAAT TTTCATCTAA AATGGCTAGT 7740 

TGTGCTTTAA TTTCTTCGAT TTCATCTGTG ATGGCCTCGC TATAATCTAC TTCTTTTGAT 7800

TTAGACATGA TGATACCGAT AACAAATACT AAAGTTAATA CTGCGAAAAT AATACCAAAC 7860 

AACATGTTGT TTGAAATAAA TGAGAAGGCA GTTAAACCAA TACCTACTAA TGTTAAAAGr 7920 

ATAAACGTTG TTCGkAACAA TTTTTGACGT TTTTGttTTT CTT 7963

(2) INFORMATION FOR SEQ ID NO: 169: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3958 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 169:

ATATTGTCTT TACAATAGTT TGCTATGGAG GTAATTAACC AATAGGAGGA ATTTATAATG 60 

GCAGTAATTT CAATGAAACA ATTACTAGAA GCGGGTGTTC mCttCGGTCA CCAAACACGT 120 

CGTTGGAACC CAAAAATGAA AAAATATATC TTCACTGAGA GAAATGGTAT TTATATCATC 180 

GACTTACAAA AAACAGTGAA AAAAGTAGAC GAGGCATACA ACTTCTTGAA ACAAGTTTCA 240

GAAGaTGGTG GACAAGTCTT ATTCGTAGGA nCTAAAAAAC AAGCACAAGA ATCAGTTAAA 300

TCTGAAGCAG AACGTGCTGG TCAATTCTAC ATTAACCAAA GATGGTTAGG TGGATTATTA 360 
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GAAGATGGTT TATTCGAAGT ATTACCTAAA AAAGAAGTAG TAGAACTTAA AAAAGAATAC 4 80 

QACCGTTTAA TCAAATTCTT AGGCGGAATT CGTGATATGA AATCAATGCC TCAAGCATTA 540 

TTCGTAGTTG ACCCACGTAA AGAGCGTAAT GCAATTGCTG AAGCTCGTAA ATTAAATATT 600

CCTATCGTAG GTATCGTTGA CACTAACTGT GATCCTGACG AAATTGACTA CGTTATCCCA 660 

GCAAACGACG ATGCTATCCG TGCGGTTAAA TTATTAACTG CTAAAATGGC AGATGCAATC 720 


TTAGAAGGTC AACAAGGCGT TTCTAATGAA GAAGTAGCTG CAGAACAAAA CATCGATTTA 780 

GATGAAAAAG AAAAATCAGA AGAAACAGAA GCAACTGAAG AATAATCAAC TGTTGAATCT 840 

GACTTAGATA TAGTTTAAAT GGGTGATAAG ATATTAATGC TTATCACCTT TTTTAAAAAG 900 


AAAATCGAGG CAAATTACAA ATATTCAATT AGAGTATTGG CAATCTTGCC TATAATAATG 960 

CTAAAATCAT AATATATAAn ATGATAACTT ATTGGAGGAA TAATGAATGG CAACTATTTC 1020 

AGCAAAACTT GTTAAAGAAT TACGTGAAAA AACTGGCGCG GGTATGATGG ATTGTAAAAA 1080

AGCGCTAACT GAAACTGATG GTGACATCGA TAAAGCGATT GACTACCTAC GTGAAAAAGG 1140 

TATTdCTAAA GCAGCTAAAA AAGCAGACCG TATTGCGGCT GAAGGTTTAG TACATGTAGA 1200 

AACTAAAGGT AACGACGCAt TATCGTTGAA ATCAACTCTG AAACAGACTT TGTTGCTCGT 1260

AACGAAGGTT TCCAAGAGTT AGTTAAAGAA ATCGCTAATC AAGTATTAGA TACAAAAGCT 1320 

GAAACTGTTG AAGCTTTAAT GGAAACAACT TTACCAAATG GTAAATCAGT TGATGAAAGA 1380

ATTAAAGAAG CAATTTCAAC AATCGGTGAA AAATTAAGTG TTCGTCGTTT TGCTATCAGA 1440 

ACTAAAACTG ATAACGATGC TTTCGGCGCT TACTTACACA TGGGTGGACG CATTGGTGTA 1500 

TTAACAGTTG TTGAAGGTTC AACTGACGAA GAAGCAGCAA GAGACGTTGC TATGCATATC 1560 


GCTGCAATCA ACCCTAAATA TGTTTCTTCT GAACAAGTTA GCGAAGAAGA AATCAACCAC 1620 

GAAAQAGAAG TTTTAAAACA ACAAGCATTA AATGAAGGTA AACCAGAAAA CATCGTTGAA 1680 

AAAATGGTGG AAGGACGTTT ACGTAAATAC TTACAAGAAA TTTGTGCTGT AGATCAAGmT 1740

TCGTTAAAAA CCCTGATGTA ACAGTTGAAG CTTTCTTAAA AACAAAAGGT GGAAAACTTG 1800 

TTGACTTCGT ACGCTATGAA GTAGGCGAAG GTATGGAAAA ACGCGAAGAA AACTTTGCGG 1860

ATGAAGTTAA AGGACAAATG AAATAATCTG TCATAAAGTA AAACAAGGAA GAAGACACCT 1920

TTAATGTTGC TTTATTAAAA TGTAAATCAT TCTAATAAAA CGACAACTGT GTCTTCTTTA 1980

CTTGTATATG TTACATATAT TCACGATAGA GAGGATAAGA AAATGGCTCA AATTTCTAAA 2040 

TATAAACGTG TAGTTTTGAA ACTAAGTGGT GAAGCGTTAG CTGGAGAAAA AGGATTTGGC 2100

ATAAATCCAG TAATTATTAA AAGTGTTGCT GAGCAAGTGG CTGAAGTTGC TAAAATGGAC 2160 
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TTAGGTATGG ACCGTGGAAC TGCTGATTAC ATGGGTATGC TTGCAACTGT AATGAATGCC 2280 

TTAGCATTAC AAGATAGTTT AGAACAATTG GATTGTGATA CACGAGTATT AACATCTATT 2340 

GAAATGAAGC AAGTGGCTGA ACCTTATATT CGTCGTCGTG CAATTAGACA CTTAGAAAAG 2400

AAACGCGTAG TTATTTTTGC TGCAGGTATT GGAAACCCAT ACTTCTCTAC AGATACTACA 2460 

GCGGCATTAC GTGCTGCAGA AGTTGAAGCA GATGTTATTT TAATGGGCAA AAATAATGTA 2520 

GATGGTGTAT ATTCTGCAGA TCCTAAAGTA AACAAAGATG CGGTAAAATA TGAACATTTA 2580

ACGCATATTC AAATGCTTCA AGAAGGTTTA CAAGTAATGG ATTCAACAOC ATCCTCATTC 2640 

TGTATGGATA ATAACATTCC GTTAACTGTT TTCTCTATTA TGGAAGAAGG AAATATTAAA 2700 


CGTGCTGTTA TGGGTGAAAA GATAGGTACG TTAATTACAA AATAAATTTA GAGGTGTAAA 2760 

ATAATGAGTG ACATTATTAA TGAAACTAAA TCAAGAATGC AAAAATCAAT CGAAAGCTTA 2820 

TCACGTGAAT TAGCTAACAT CAGTGCAGGA AGAGCTAATT CAAATTTATT AAACGGCGTA 2880

ACAGTTGATT ACTATGGTGC ACCAACACCT GTACAACAAT TAGCAAGCAT CAATGTTCCA 2940

GAAGCACGTT TACTTGTTAT TTCTCCATAC GACAAAACTT CTGTAGCTGA CATCGAAAAA 3000 

GCGA^AATAG CAGCTAACTT AGGTGTTAAC CCAACAAGTG ATGGTGAAGT GATACGTATT 3060
GCTGTACCTG CCTTAACAGA AGAACGTAGA AAAGAGCGCG TTAAAGATGT TAAGAAAATT 3120

GGTGAAGAAG CTAAAGTATC TGTTCGAAAT ATTCGTCGTG ATATGAATGA TCAGTTGAAA 3180

AAAGATGAAA AAAATGGCGA CATTACTGAA GATGAGTTGA GAAGTGGCAC TGAAGATGTT 3240

CAGAAAGCAA CAGACAATTC AATAAAAGAA ATTGATCAAA TGATTGCTGA TAAAGAAAAA 3300 

GATATTATGT CAGTATAAAA CTAATATACA ATGACATATT AAAATGCCAG TATTAAACGA 3360 


TAATGTAACA TTTAAAATGG GCATGTTTAA TTAAATCAAA GATGCATGTG ATAATTTAAA 3420 

TTCAGAATGA GCATAAAAAT GGTGTTTAAA CAAGTTAATT AAACATATAC TTTATAAATA 3480

ATAGGCATTA GGTATATTGC TATAATAAAG TTATGTAATT TTTAACCTCA GTATGTATGT 3540 


CACATTTCTG GTGTAAACTG TACCGAGTCA GACTTTGGTA CAGTTTTTTT ATTTGCTTAT 3600 

TCAATGCATT AAATGAGTAT GATAAAATGA TAATGATTGT TTAGTAACTT ATACTATATG 3660 

ACAGAGATGA TCAGGCTCGG AGGAAAGACC ATGTTTAAAA AGCTAATAAA TAAAAAGAAC 3720

ACTATAAATA ATTATAATGA AGAATTAGAC TCGTCTAATA TACCTGAACA TATCGCTATT 3780 

ATTATGGATG GTAATGGGCG ATGGGCTAAG AAGCGAAAAA TGCCTAGAAT TAAAGGTCAT 3840 

TACGAAGtAT GCAAACAATA AAAAAAATTA CTAGGGTAGC TAGTGATATT GGTGTTAAGT 3900

ACTTAACTTT ATACGCCTTT TCCACTGAAA ATTGGTCAAG ACCTGAAAGT GAAGTAAA 3958 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 170:

ATTAAAACAA CTTAATATAC CTATTTATGG TGGTCCTTTA GCATTAGGTT TAATCCGTAA 60 

TAAACTTGAA GAACATCATT TATTACGTAC TGCTAAACTA AATGAAATCA ATGAGGACAG 120 

TGTGATTAAA TCTAAGCACT TTACGATTTC TTTCTACTTA ACTACACATA GTATTCCTGA 180 

AACTTATGGC GTCATCGTAG ATACACCTGA AGGAAAAGTA GTTCATACCG GTGACTTTAA 240 

ATTTGATTTT ACACCTGTAG GCAAACCAGC AAACATTGCT AAAATGGCTC AATTAGGCGA 300

AGAAGGCGTT CTATGTTTAC TTTCAGACTC AACAAATTCA CTTGTGCCTG ATTTTACTTT 360

AAGCGAACGT GAAGTTGGTC AAAACGTAGA TAAGATCTTC CGTAATTGTA AAGGTCGTAT 420 

TATATTTGCT ACCTTCGCTT CTAATATTTA CCGAGTTCAA CAAGCAGTTG AAGCTGCTAT 480 

CAAAAATAAC CGTAAAATTG TTACGTTCGG TCGTTCGATG GAAAACAATA TTAAAATAGG 540

TATGGAACTT GGTTATATTA AAGCACCACC TGAAACATTT ATTGAACCTA ATAAAATTAA 600 

TACCGTACCG AAGCATGAGT TATTGATACT ATGTACTGGT TCACAAGGTG AACCAATGGC 660 

AGCATTATCT AGAATTGCTA ATGGTACTCA TAAGCAAATT AAAATTATAC CTGAAGATAC 720 

CGTTGTATTT AGTTCATCAC CTATCCCAGG TAATACAAAA AGTATTAACA GAACTATTAA 780 

TTCCTTGTAT AAAGCTGGTG CAGATGTTAT CCATAGCAAG ATTTCTAACA TCCATACTTC 840 

AGGGCATGGT TCTCAAGGTG ATCAACAATT AATGCTTCGA TTAATCAAGC CGAAATATTT 900 

CTTASCTATT CATGGTGAAT ACCGTATGTT AAAAGCACAT GGTGAGACTG GTGTTGAATG 960 

CGGCGTTGAA GAAGATAATG TCTTCATCTT TGATATTGGA GATGTCTTAG CTTTAACACA 1020 

CGATTCAGCA CGTAAAGCTG GTCGCATTCC ATCTGGTAAT GTACTTGTTG ATGGTAGTGG 1080 

TATCGGTGAT ATCGGTAATG TTGTAATAAG AGACCGTAAG CTATTATCTG AAGAAGGTTT 1140 

AGTTATCGTT GTTGTTAGTA TTGaTTTTAA TACAAATAAA TTACTTTCTG GTCCAGACAT 1200

TATTTCTCGA GGATTTGTAT ATATGAGGGA ATCAGGTCAA TTAATTTATG ATGCACAACG 1260 

CAAAATCAAA ACTGATGTTA TTAGTAAGTT AAATCAAAAT AAAGATATTC AATGGCATCA 1320 

GATTAAATCT TCTATCATTG AAACATTACA ACCTTATTTA TTTGAAAAAA CAGCTAGAAA 1380

ACCAATGATT TTACCAGTCA TTATGAAGGT AAACGAACAA AAAGAATCAA ACAATAAATA 1440
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GCTTTTTCTT TATATATGAT GAGCTTGAGA CATAAATCAA TGTTCAATGC TCTACAAAGT 1560

TATATTGGCA GTAGTTGACT GAACGAAAAT GCGCTTGTAA CAAGCTTTTT TCAATTCTAG 1620

TCAGGGGCCC CAACATAGAG AATTTCGAAA AGAAATTCTA CAGGCAATGC GAGTTGGGGT 1680

GTGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGTA AGTTGGGGTG 1740

GGACGACGAA ATAAATTTTG AGAAAATATC ATTTCTGTCC CACTCCCGAT TATCTCGTCG 1800

CAATATTTTT TTCAAAGCGA TTTAAATCAT TATCATGTCC AATCATGATT AAAATATCAC 1860

CTATTTCTAA ATTAATATTT GGATTTGGTG AAATGATGAA CTCTTTGCCT CGTTTAATTG 1920

CAATAATGTT AATTCCATAT TGTGCTCTTA TATCTAAATC AATGATAGAC TGCCCCGCCA 1980

TCTTTTCAGT TGCTTTCAAT TCTACAATAG AATGCTCGTC TGCCAACTCA AGATAATCAA 2040

GTACACTTGC ACTCGCAACA TTATGCGCAA TACGTCTACC CATATCACGC TCAGGGTGCA 2100

CAACCGTATC TGCTCCAATT TTATTTAAAA TCTTTGCATG ATAATCATTT TGTGCTTTAG 2160

CAGTTACTTT TTTTACACCT AACTCTTTTA AAATTAAAGT CGTCAACGTA CTTGATTGAA 2220

TATTTTCACC AATTGCCACA ATGACATGAT CAAAGTTACG GATACCTAAA CTTTTCATAA 2280

CTGCTTCATC TGTAGTGTCT GCAACAACCG CATGAGTAGC GATATCACTA TATTCATTCA 2340

CTCTATTTTC ATCATGGTCG ATGGCCATTA CATCCATGTC TAATGCATTC AACTCACGAA 2400

CGATACTACC TCCAAAACGA CCTAGACCGA TGACTACATA TTCTTTACCC ATACTCGCCC 2460

TCCATTAAAT GATTTTCATC AATTCATTGA AAATATAAAT TTAAAATTAT TATAAATGAG 2520

TACCCCAACT AAATTATCTA AATGCAGTAA TGCAAGTAAA TGAAAGTTGG GGTATCGTCT 2580

CAACTTATGA TTTCTTTCCT TCAACATATT CTTTGTCGAA AACAAATAAT CTTAATAATA 2640

ATATTAACGA TGGAAGTAAT AAAAGTAAAC CTAAAATAAA GACAATCACT AATGTCCAGC 2700

CCAITTCTGG ATTAACATAT GCATCTGTAA TTTTTACAAA CGGATATAAA AGGTATGGCA 2760

ATTTACTAAT TCCATAGCCA AAGAACGCGA ACATCATTTG TAAAATAACA AATACAAAAG 2820

CCAAACCATG TTTTTTCTTA AAGAATGTTA ACAATGAAGC TAATGCAAAG AATAAGAAAC 2880

TTATACCAAA CATCCACCAA TAGTCAAAAA CAGCTGAATA AAAATGTTCA GAATTTTGAA 2940

TGCGTAATGA TAGAAATACG AATAAACAAA TGATAATCAT CGGCGGCCCT AAAAATATGT 3000

GCCATTGTCT TGTTAAATTA TATGCTGGTT CGTCATTTGC TTTTTTAGCA TAATATGTCA 3060

AAAATCCTGA TGAAATATAT AAAACTGAAA TAATTGCCAA GAATACTACA GACCAAGCAA 3120

ATGGGCTTAA TAATAACTGC ACCCAATCTA GATCGATAAC ATTGTTTCGA ACATTAATAT 3180

AGCCACCTTC TGTAATAGTT AAAGCAGTAG ATAATGAAGC TGGAATTAAT AATCCACTTA 3240
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AACTGTTTCT CAACGATATC ATAATCAGTG CTATTGAACC TGGTATTAAC AATACCGTGC 3360 

CTAAATATTT GATTGACTCT GGAAAGAAAC CTACGAATCC TACGAAGAAG AAAACAAAGA 3420 

ATACATTCGT AACTTCCCAA ACTGGGTTTA AATAACGTGA AATTAAGTGA TTAATTTTCT 3480

TTTCATCACC AGTTAACTTT GAATGCAATG CGAAGAAACC TGCCCCAAAA TCTATAGAAG 3540 

CAATAATGAT ATAGCAAAAT AAAAACAACC ATAACACTGT TATACCTATA AATGCATAAA 3600 

TCATTTTTCT ATTTCTCCTC CTTGCTTCTT GGCTAAACGA TTTACATCTT CATACGCCGG 3660 

TTTATTTTTA AACATACGAA TTAATACGTA TGCACATGTA TACATTAAAA TGATGTACAA 3720 

TATGCCAAAT AAAATTGTAA CGAaGGTTAT TCCGCCTGCT TGTGTTGCTG CTTCTGCCAC 3780 

GCGCATATAA CCACGAACAA TCCAAGGCTG TCTACCCATC TCTGTTAAGA ACCATCCAAA 3840

TTCTATAGCT AGCATTGAAG CTGGGCCTGT TAATAATATT CCATAAAGCA TCCATTTATG 3900 

AGTAGAAAAC TTTCTAAGCT TTTTAAACAT TAAAGTTAAG ACATAAACAC CTGAAATGAC 3960 

AAAACATAAA ATTCCCATCG TTACCATTAA ATCAAAGAAA TAATGGACGA TCATAGGCGG 4020 

ATGTAAACTT TTTGGAAAAT CATTTAACCC TTGTACTTTA GTTTTGACAC TATTATCTGC 4080

TAAGAAACTC AATAGTCCAG GTAATTCAAT CGCACCTTTA ACTTGCTGAG TCTTTTCATC 4140

TAACACACCA AATAATAATA ATTTGGCATG GGAAGATGTA TCGAAATGCC ATTCATAAGC 4200

TGCTAATTTT TCAGGTTGGA ATTTATGCAA AAATTTTGCA GATAAATCCC CTGCCAACAT 4260 

AGAAAGTAAT GTTGAAAAGA ATCCAACTAT CATAGACATT TTCAAAGCTT TCTTATGGTA 4320

GACAGTATCT TTAGGTTGAC GATTACGCAA TAATTTAAAA GCTGCTATTG ATGCAATAAC 4380

AAATGCCATC GTCATACCGG CTGTAGTAAT TACGTGAAAT GATCGAACTA TAAACGATGG 4440

GTTAAACATC GCTTCTATAG GTTGAACATT GACCATCTTT CCATTCTTCA ACTCAAAACC 4500 

TGCAGGCGTA TTCATAAATG AATTCACTGA AGTAATGAAG AATGCTGAGA AAGAGCCACC 4560 

AATAATTACT GGTATACTAA TTAAGAAATG TGTCCATTTA TTTTTAAAAC GATCCCAAGT 4620 

ATATAAATAT ATACTTAAGA AAATAGCTTC AAAGAAGAAC GCAAATGTTT CCATAAATAA 4680

TGGAAGTGCA ATAACGTGTC CACCCATTTC CATAAATGTA GGCCAAATCA ATGATAATTG 4740

AAGTCCTATA ATTGTACCTG TAACAACTCC CACTGCTACA GTAATTGTAT AAGCTTTAGC 4800

CCATCTTTTG GCCATAGCTA TATATTGAAG ATCATTTTTG CGAATACCTA AAAATTCTGC 4860

AATTGCGAAC ATTAAAGGCA TACCAACACC AATCGTTGCA AAAATGATAT GAACTGCTAA 4920

AGTCATAGCT GTCAAAAACC GACTGATTTC AACTGTATCC ATTTAAAAAC ATCACCTTTT 4980

TCTTTTTTTG ATGACAACAC AATGAACTTA ATTATAATTG CTATAATGTG TATTTTTAAA 5040 
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GAATTTCAAT GTATAATTGT GTATATTACA TTAGAATAAA GCACGAAGGA GCATGATACA 5160 

TGTCAGAAAT AATCGTTTAT ACGCAGAATG ATTGTCCACC TTGTACATTT GTAAAAAATT 5220 

ATCTAAATGA GCATCACATT GATTTTGAAG AGAGAAATAT CAACAATCAA CAATATCGAA 5280 

ACGAAATGAT AGATTTTGAT GCTTTTTCAA CTCCGTTTAT TTTGTTGAAT GGC 5333 
(2) INFORMATION FOR SEQ ID NO: 171: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 11126 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 171: 

ATACGTGACC CTTTATCCGA AAATTTCTTT TCATATTCTG TTAAAATATT ACTGCCATCG 60 

TCTTCTTGAT GTAAATTTAG ATTTATTTTT GTAAAATACA TTCCAAATTG AGACATACTT 120 

TCTAAACTGT AGGCAAATAG TCCTCTGTTA TCAGTTTTAA AATGTAAATC TCCTTCATCA 180 

TTTAAGATTT GTTGATACAA CGCTAAAAAC GTATGATACG TTAAACGTCG TTTTGCATGA 240

CGATTTTTTG GCCATGGATC TGAAAAGTTC AAATAAATAC GCGAAACTTC GCCGTCTTTA 300 

AAATATTCAT TTAATTCAAT GGCGTCATTA CAAATAATCT TTAAATTTGT TAAACCCATC 360 

TCTTTAACTT TATCCAATAC TTTATAAACG ATACTTTTCT CACGTTCCAT TGAAATATAG 420

TTAATATGAG GATTTTGAGC AGCTAATGTT GTAATAAACT GCCCCATACC CGAACCAATT 480

TCAATGTGTA TCGGTTGCGT TTTaTCAAAC CATTCAGTCA TTTTCCCTGc ATGTTGACCG 540 

TCCATGTCAA CCAATTCAGG ATGATCTTTT AAATAATCTT CAGCCCATGG TTTGTATCGA 600

ACTCTCATAT TTTATTCTCC TCTTAAATAA ACATGTTACT ATTCATAACT TCATTTAGGA 660 

ATTTAAGCCA AGTGTTCATA TCCTTATATC TTTTTTGCTC TTCATACCAT TGAACAAGAC 720 

CTATAGATTG AATTACCGTA TACCATTTCA TACGTTTATT TAAATTCAAG CTCTCTTGAA 780 

CACCATATGT TTCAAGCCAT TCAGACCATT GTTGTTGTGG AACATAGTTG TAAAGCAGCA 840 

TTCCGATATC AATTGCCGGG TCTGCAATCA TTGCACCTTC CCAATCAACT AAAAATAGTT 900 

CATCTCGATC GGATAATAAC CAATTATTAT GATTCACATC ACCATGTACA ACAGTGAAAA 960 

AACGCGAATC TAAACTCGGT ATATGCTCTT CTAAATAGGT TAATGATTTT CTCACAATAT 1020

GATGTGTTAA AACTTCTCTT GATAAAGAGG CATTAATTTT ATTAAGCATA ATCTCAGGAG 1080

TAATAGGTTC CATTTCCATA CGCTTTAACA TACTTAATAA AGGTCTAGAA TTGTGTATCT 1140 
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TTTTCCAATG TTGTGCTGTA ACAACCTCGC CTGTTTCTAT GCGTTTCGTC CATACTAATT 1260 
TGGGCACAAT ACCTTCTGCT GATAATGCCG CAATAAATGG ATTTGAATTT CGTTTTAAAA 1320 
ACAACTTTTG TCCATCTTGT TCAGCCATAT ATGCTTCACC AGATGCACCA CCTGCTGAAT 1380

CAAGTGTCCA CCCTAATTGA TAAAACTGCT CCAACTCGTC CACCTCACTT TCAATTAGAA 1440 

AATGGCTCTA GAAATAGGTT TTTCAAGAGC CATATATTCT AATTTATAAC ACCATACTGG 1500 

TACAAATATT ATGTCCAGAT AATTATTGTA AATCCTCAAC CAATGCCTAC ATTACACGAC 1560 

TAAATTTAAA TCGTAATGTC TGTCATTGAC ACCATACATT CTATAGTCAC TTACTTGAGA 1620 

TATAATGTTA CCGTGTCTAA AACTACATGT TTTTGAATCT CTGTAGGCGA TAAACTcTAG 1680

TTTTCAAAAT AATTGCTATC CCATTTTCAT GGTTAGCATA AATTTATGAA CTGTAACATT 1740 

TACGTACTTA GTAAAATATG ATGCACATCA TATTTGTrAC TCATAGAAAA TTTTATAAtT 1800

TTTATCATTA TATTTCAACT GAAAATGAGA AACAAAATGG CACTTTTTAC TAATATGTGT 1860

TTTCTAAACA ACACTTTTAA GCTTCGTTTT AAATTATAAC ATAATTCACT TACGAAAGTT 1920 

GATAAATTTA AGTAATTTAA TCTAAAAATA TGATGAAAGA ATTTTAAATA CTGTGTGACT 1980 

CTATATACTT TTCAAATCCT TCTTGTAGTT GACGTGTAAT TGGGCCAACT TTACCATCAT 2040

TAACTGGTTC ACCATCTAAT TTAATAACAG GTGTAACCTC AGCTGAAGTA CTTGAAACAA 2100 

TAACTTCATC TGCGTTTTTC AAGAAATCTA CAGTAAACGT TTCTTCTTTA AATGGGATGT 2160

TATAGTCTTC GGCAATTTTT TTAATTACAA TTCGTGTAAT ACCATTAAGA ATATAGTTGT 2220 

TAATCGGATG TGTATAAATC ACACCGTCTT TAATTGCATA AGCATTACTT GAAGATCCTT 2280 

CAGTTACAGT TtCACCTCGA TGTTGAATTG CTTCAACTGC ATTATATTTC ACAGCATATT 2340

CTTTTGCTAA TACATTcTCC TAATAAGTTC AAGCTTTTAA TGTCGCAACG TAACCATCGG 2400 

ATATCTTCAA CGGTAACACC ATTCACACCA TTTTCTAAAT GATCATAAGG ACGATCATAA 2460 

CTCTTTGTAT AAGCAACAAT TGCTGGTTCT ACTTCAGGTG TCGGGAAGCT ATGATTCCTT 2520 

TCAGCTACAC CACGCGTTGC TyGAATATAA ATTGCCCCAG TTTCAATTTG ATTCATATCA 2580 

ACTAATTTAC GAGATAGTTC AATTAATTCT TCTACAGAAT AATTTAAATC TAAACCAATC 2640 

TCATTGGCAC TACGTwAAAw TCTTTCATAA TGTTCTGTTA CTGTAAATAA CTTACCATTA 2700

TATACTCGAA TGTATTCATA AATACCATCG CCAAATACGT ATCCTCTGTC GTTGTATGAA 2760 

ACCTTTGCTT CACTTGGACT TACAAACTCA CCATTTAAAA AAATTTTTTC CATATATTAT 2820 

TCCTCCACGC ATAATGAATA AATTGCTTCT AAGTAAATAC TAGTTGCGTT AAATAACTGT 2880

TTTTTAGTGA TATATTCATT TTTCTGATGC ATTAAATCTT CAGAATCACT AAACATTGCG 2940 
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TCAGTCATAT CATTTGTTTG ATTTCTATAT GCAGTAACTA ACTTTTGTAC AAAAGGATCA 3060

TTTTTATCAA CATAATGTGG TGGTTGGACT TTACCTAATT TCACTTCAAA GCCATATTGT 3120

TGAATCTCAT TTGCAAAACG ATCCATAGCT TTTTCAAATT CAAATCCTTC TGGGTAGCGT 3180

AAGTTGATAC CGAAAAGACC TGCGTTTTCA TTATCATATG TAATAACACC AATGTTAGTT 3240

GTCACGTCAC CCATGACATC TGTATGGAAT TTCATTCCCA TCTTTTCACC AAAATCTGAA 3300 

TTAAATAAGT AGCGATTACT AAATGCTACA AACGCTTGTG CATTATTATC AAGATTTAAT 3360

GATGCTAAGA ATTTTAGTAA GTAAAGACCC GCATTCACAC CGATAGATGG ATCCATACCA 3420 

TGAACCGCTT TACCTTCAAC TGTTAAAACT AGAATGCCAC TATCAACAGT ACTATCACCT 3480 

TGTAAATGAT TTTGTTCTAA AAAGTACTCA AAGTCTTGAA TAACATCTGT CATATTTTCT 3540 

TTAACAAGCA CTCTTGCTTC TGCATGATCA GGTACCATGT TGTAACGTTC ACCAGATTTA 3600

AAAGTTATTA ATTCATAATC AGGTTCATCT TGATCTTCAG TAAGTTTATT TTGAACTAAA 3660

TCAAATGTTG TAATGCCTTT TTCACCATGA ATACATGGAA ATTCTGCATC TGGTGCAAAA 3720 

CCTAATGTTG GCATTTCTTC TGTTTTAAAA TAGCGATCCG TACATTTCCA ATCAGATTCT 3780 

TCATCCGTAC CAATAATCAT ATGAATACGT TTCTTCCAAT CCACATTCAT ATCTTCTAAT 3840

ATCTTAATTG CATAATAAGC AGCAATTGTT GGACCTTTGT CATCAAGTGT ACCTCTAGCT 3900

ATGATAGCAT CTTCTGTTAC AACCGGCTCG AACGGATTAC TATCCCATCC ATCACCAGCA 3960

GGAACAACGT CAACATGACA TAAGATACCT AATACGTCAT TTCCTTTACC TGCCTCAATT 4020

CTTCCTGCAA TATGATCCAC ATCATGTGTT GTAAATCCAT CTCTATGTGC AATTTCATAC 4080

ATGTAGTCTA ATGCCTTACG AGGACCTGGA CCAACTGGTG CGTCTTCTGA TGCTTTTGCA 4140

TCATCTCTCA CACTTTCAAT TGCTAATAAT CCTTTTAAGT CATTAATGAT TTGATCTTCG 4200 

TATTGTTGAA CTTTTTCTTT CCACATTCGA AATCGACTTC CTTTTTTCTA TAAGTTAAAT 4260

TCTATTTTAC ATGAAAAGAT ATAAAAACTA CAATAAGATG TCAGAAAATA ATAAAAAGGA 4320

ACAAAACGAT GCTATTGATA TGACACAAAT CATAAATAGC TGCTTTGTTC CTTTTTTAAT 4380

TTATATATTT AAAATACACA TATTCAAGAG CTCGAGATAT AAGTCAATGT ACTAGGCACA 4440 

CAATTTAATA TTGACAGTAA TTAACCGAAC GAAAATGCGC CCCGGGGCCC CAACATAGAG 4500

AATTTCGAAA AGAAATTCTA CAGACAATGC AAGTTGGCGG GGCCCCAACA TAGAAGCTGG 4560 

CCAATAGTTA GCTTTCAATA ATGTGCAAGT TGGGGTAAGG GCCCCAACAC AGAAGCTGGC 4620 

CAATAGTCAG CTTTCAATAA TGTGCAAGTT GGGGTAAGGG CCCCAACACA GAGAATTTCG 4680

AAAAGAAATT CTACAGACAA TGCAAGTTGG CGGGGCCCCA ACACAGAAGC TGGCCAATAG 4740 
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TAAAGAAATA CGTTTTCTTT AGATATTAGT ATTTCTTATG AATGAGTTTC ACGCATGTAT 4860

TCTTCTTTCT ATATGCATAT TAGCTATGAC TAACGATAAA GAACCTGAAA CACTAATAAA 4920

TGTCCTATAG TTTACAATAT TATATTGGCA GTAGTTGACT GAATGAAAAT ACGCTTGTAA 4980

CAAGCTTTTT TCAATTCTAG TCAACCTTGC CGGGGTGGGA CGACGAAATA AATTTTGCTA 5040

AAATATGATT TCTGTCCCAC TGCCTTATCA TTTCTGTCCT ACTCACATCT TATTCTTTAT 5100 

CAGATAATGC ATTTTTATTC TTTTTTAAAT CTTCTTCAGT GACGATACGT AAATTATTAT 5160

TTGGTGTGCG CCACCTTCAT CATCAAATTT ACCTTTTTCA ATACTTTCGT CAGTCTTATT 5220 

GTCATATTCG GTAAATTTTG ATTTTTCTTC TTTGAAAAAT GCTTTTGGAT TATTTTTTAA 5280 

TCTATTAGCA TATTCTTTCG GATTTGTTTT TACTTCTTTA ATTGTTTCAT TAGCAATTGT 5340

TCCTAATTGC GTCGCTTTAT CCTTAGCATT ATCTTTATAG CTTTGAGGAT CTTGTTTATA 5400 

TTTATTATAT TCcTGcTTTC AGCTTGTCAC GACTATCTTT ACGTGTAACA AGTACAGCTG 5460 

CTACAGCGCC ACCTATACCT AAAATCGCTT TAAATAAATT ACCTTTTGCC ATATCAATCG 5520 

TCTCCCTTTT ATTTATAATT TAATTTGTCA AAATCATTTT CAGTTAATAA ACGATATTCT 5580 

CCTGAATCTA AATTGCTGTC CAATTCTAAA TCAGCAATTT TGATACGTCT TAAATGTAAT 5640

ACCTCATTTT GAATGCTATG AAACATTCGT TTAACTTGAT GATATTTTCC TTCATAAATT 5700 

GTTACGTGTG ACGTTTGATT ATCAATATAA GTTAATATTG CAGGCTTAAC CTTGCCATCA 5760 

GTCAGTGTtA CACCCTCTTT AAAAGCTTGA ATGTCGTCTT CAGTGATAGG ATTTGCTGAA 5820

ATAACTTCAT ATTTTTTAGA AACATGTTTG TTTGGACTCA TTAATTCATG ATTAAAATCA 5880

CCATCATTCG TTATCAATAA AAGCCCTTCT GTATCTTTAT CAAGACGACC AACCGGAAAA 5940

ATATTTAGAT GTTGGTATTC AGGTATTAAA TCAATAACGG TTTTTGAATG ATGATCTTCA 6000

GTTGCTGATA TATAACCTTT TGGCTTATTT AACATAATAT AGACATTTTC AATGTATTCT 6060

ATTAATTCTC CACGAACTGT TATCTTATCG TTTTCTGGTT CTATATGTGT TTTTGGTGAT 6120

TTAATTACTT GTTCGTTGAC ATTTACAAGG CCTTTTTTAA GTAACTGTTT GACCTCATTA 6180 

CGTGTACCGA CGCCCATATT TGCTAAAAAT TTATCTATTC TCATCGTAAA AACCTAACTC 6240

TACGTCTTAA TTTTTCAGGA ATTTCACCTA AGAATTCGTC CGCAAGACGC GTTTTAATTG 6300

TGATTGTACC GTAAATTAGA ATACCTACTG TAACACCTAA AATAATAATG ATTAAGTAAC 6360

CAAGTTTAGT AGGTTCTAAG AATAGATTTG CAAGGAAAAA TACTAATTCT ACACCTAGCA 6420

TCATAATAAA TGAATACAAG AATATTTTTG CAAAATGAAT CCAACTATAG CTGAATTTAA 6480

ACTTCGCATA TTTTTTAAGA ATATAGAAAT TACATCCAAT TGCAAATAAT AATGCGATAC 6540 
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ACTTGATAAC TACAGAAGCT AAAATAACAT AAACTGTTAA TTTCTGTTTA TCTATACCTT 6660 

GTAACATTGA TGCCGTTACA CTTAATAGTG AAATTAGTAT TGCTACAGGC GCATAATAGA 6720 

ATAATAAGCG ACTACCATCA TGGTTAGGGT CATGACCTAA AACAATTGGA TCGTAACCAT 6780 

AGAAAACTGT GAATAATGGT TGTGCCAAGG CCATAATTCC AATACTAGCT GGAACAGTTA 6840 

TAAACATTAA TACACCAATA GATGTTCTAA TTTGATGATG CATTTCATGT AAGCGACCTT 6900 

CTGCAAATGT TTTTGTAATA TAAGGAATTA AACTCACTGC AAAAGCAGCA CTTAATGATG 6960 

TCGGAATCAT TACAATTTTA TTAGTTGACA TATTTAGCAT ATTAAAGAAT ATATCTTGTA 7020 

ACTGTGAAGG TATACCAACT AAAGATAAAG CACCGTTATG TGTAAATTGA TCTACTAAGT 7080 

TAAATAATGG ATAATTCAAA CTTACAATAA CGAACGGTAT ACTATAAGCA ATAATTTCTT 7140 

TATACATCTT GCCATATGAC ACATCTATAT CTGTGTAATC AGATTCGACC ATACGATCAA 7200 

TATTATGCTT ACGCTTTCTC CAGTAATACC AGAGTGTGaA TATACCAATA ATCGCACCAA 7260 

CTGCTGCTGC AAAAGTAGCA ATACCATTGG CTAATAAAAT AGAGCCATCA AAGACATTTA 7320 

GTACTAAATA ACTTCCGATT AATATGAAAA TCACGCGTGC AATTTGCTCA GTTACTTCTG 7380 

ACACTGCTGT TGGCCCCATA GATTTATAAC CTTGGAATAT CCCTCTCCAT GTCGCTAATA 7440

CAGGAATAAA GATAACAACC ATACTAATGA TTCTTATAAT CCAAGTAATA TCATCGACTG 7500 

ACCAACCGTT TTTATCATGA ATGTTTCTAG CTAATGTTAA TTCAGAAATA TAAGGTGCTA 7560

AGAAATACAG TACCAAGAAA CCTAAAACAC CGGTAATACT CATTACAATA AAACTCGATT 7620

TATAAAATTT CTGACTTACT TTATATGCCC CAATAGCATT ATATTTCGCA ACATATTTCG 7680 

AAGCTGCTAA TGGTACACCT GCTGTCGCAA CTGCAATTGC AATATTATAT GGTGCATAAG 7740 

CGTATGTGAA CGGCGCCATA TTTTCTTGTC CACCAATTAA ATAGTTGAAT GGAATGATaA 7800 

AAAGTACGCC CAATACCTTG GTAATTAATA TACTAATGGT AATTAAAAAG GTTCCACGCA 7860 

CCATTTCTTT ACTTTCACTC ATTACGAATC TCCCTATCTC ATGTTTATTA AAGTTTTGTA 7920 

AACTAAAAGC TGTTTCTCTG TAAAATCATT TTTCATTATT ATGAATATAT CACAAAACTT 7980 

TATTTCATTG TCGTATATTC AATGAATTAT CATAACAAAA TTATCAACAC ATTGTCATTG 8040

AATACTAGAT TTTGATTAGA ATATTACGAA ATTTCATATA AACATTATAC TACTATTTGA 8100 

GATGAACATC GCATAACAGT AGAAAAATCA TTCTTATCAT ACACATACAT CTTCATTTTT 8160 

TATGAAGTTC ACATTATAAA TATATTCAAC ATAATTGTCA TCTCATAACA CAAGAGATAT 8220 

AGCAAAGTTT AAAAAAGTAC TATAAAATAG CAATTGAATG TCCAGTAACA AATTTGGAGG 8280

AAGCGTATAT GTATCAAACA ATTATTATCG GAGGCGGACC TAGCGGCTTA ATGGCGGCAG 8340 
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GTAAACTCAA AATATCTGGT GGCGGTAGAT GTAACGTAAC TAATCGATTA CCATATGCTG 8460

AAATTATTAA GAACATTCCT GGaAATGGGA AATTTTTATA TAGTCCCTTT TCAATTTTTG 8520

ATAATGAATC CATCATAGAT TTTTTTGAGT CTAGGGGTGT TAAATTAAAA GAAGAAGATC 8580

ACGGGCGTAT GTTTCCAGTT TCCAACAAAG CACAAGACGT GGTTGATACA TTAGTGACAA 8640

CTATCGAACG CCAACATGTA ACGATTAAAG AAGAAGAAGC TGTTAGTAGA ATCGAAGTTA 8700

ATACAGACCA AACTTTCACT GTACATACTC AAAATAATAG TTATGAAAGC CATTCGCTAG 8760

TGATTGCTAC AGGTGGTACA AGTGTCCCTC AAACTGGTTC AACTGGTGAT GGTTATAAGT 8820

TCGCACAAGA TTTAGGTCAT ACCATTACTG AGTTATTCCC GACCGAAGTT CCAATTACAT 8880

CAGCTGAACC TTTCATCAAA TCCAATCGTC TAAAAGGTTT AAGTTTAAAA GATGTTGAAT 8940

TGTCAGTACT TAAGAAAAAT GGTAAAAAAC GCATCAGTCA TCAAATGGAT ATGTTATTTA 9000

CTCATTTTGG TATCAGTGGT CCAGCTGCAT TAAGATGTAG TCAGTTTGTT TATAAAGAAC 9060

AAAAAAATCA AAAGACACAG CACATTTCTA TGGCAATCGA TGCATTTCCT GAATTAAACC 9120

ATGAACAATT AAAACAACAC ATCACATCAT TATTATCGGA CACACCAGAT AAAATCATTA 9180

AAAACAGTTT GCATGGTCTA ATTGAAGAGC GCTACTTACT GTTCATGCTG GAACAAGCAG 9240

GAATCGATGA AAATACCACA TCACATCACT TATCAAATCA ACAATTGAAC GACTTAGTAA 9300

ATATGTTTAA AGGGTTTGTA TTTAAGGTGA ACGGGACATT ACCTATAGAT AAGGCATTTG 9360

TCACAGGTGG TGGTGTGTCA CTTAAAGAAA TTCAACCTAA AACAATGATG TCTAAATTAG 9420

TTCCGGGATT ATTTTTATGT GGTGAAGTAT TAGATATACA TGGTTATACT GGTGGTTATA 9480

ATATTACAAG TGCACTCGTA ACAGGACATG TCGCTGGATT ATATGCCGGA CATTACTCAC 9540

ATGCATCAAT GGAATAATAG TATAAAATTT GGTTCGATTC TCTTTAGTAG ATCAACTTTT 9600

TCATXCAAAT AAAAATGACC TTAATATAAC TGAGTCACTA AAAAGTGTCG TTATATTAAG 9660

GTCATTTCGT TAATTATGAT TCTTTTTCGT TTTTAGTACG TCTTCTAGCT AACAAAGCCG 9720

CACCTGTAAT CAGTGCAAAT TCTTTCAATG GTAAATCCAT TCCTTCAGAA CCTGTATTTG 9780

GAAGTTCTTT TTCAACTTTG CGCGATTCAT GTGTCTCTTC TTTTTTAATA GGCGTACAAA 9840

CTTTTGGAGC TGGCTGAATT TCTTTTGGTG ATACTTTCGT CGCTTCAGCT GGTAATTTAA 9900

TTGCTAAAAT TTCATCAACA ATGAATTGCG TGTGTTGTTT GATGTCATTT AATGTCGCAT 9960

CTTCATCAAT CATTCTATTG CCATCTGCAA CATATTGATC AATTAATACT TTTACTTTAG 10020

CTAATTGTTC TGGTGTTGCG ATCGCTTTGA ATTTCGCATA TGTTTGTTGA GCAATGTTAT 10080

CAATTCGCAG TAAGCTATTT TCTTTTTCAG TAATTACTGC TTCTATATCG CTTAATGCAA 10140
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CATCCATTTG TAATTTTAAA GCAGTTATAG CTTTTAATGC ATCAGCCTTA TTACGATTAC 10260 

TTACTTTTCG ATAATTTTGC ACTAAAGCAG TGACGCGTGC AAGATCATCA TTAATCGTTT 10320 

TTTCAGCATC TGGCTTTTTA ATAGGATGTA CATCTAAATC ATGTATTGTT TGTAGATTTA 10380 

ATGATGCTGT TTTATCAACT TGTGCATTGC TACGATCTTG ATCAATTTGT CCAATAGCAG 10440

TGTCATAAAT ATTTTGTAAC TGTGCTAATA TACTATTTCT TTCTTCTACC GTTGCTTGAA 10500 

TATTCGCTTC AATTGCTTGT TTTTTATCGT TGAATAATGT TGTCAATTGT TCTCGAGCAG 10560 

ACGCCTTTCT GTTAATAACA GGTTCGATTT CACGAATTTC GTTTTTCTCA TCATGCAATA 10620 

AATATGCCAC ATCTGCATTA GTCACTGCAC TAGCAATTTG TTGTTTAGCT TTAATTAACT 10680 

CTTTTTCAAC TTGTGCTATT GCAATATTTT GTTCTTCATC TGTCGCTTCG TTATTTGCTT 10740 

TAATTAAATT AATTTTATTT GTAGCGATAT TTTGAATTTG TTGTAATGCT GTTGCTTTAA 10800 

CTGTTGTCGC TGGTTTAATT TTTGAAATAA TATTTTGAGC ATTTATACTA TCTTGATTAA 10860 

CTTGGGCAGT CTTATCTGCA TGATTGATCT GATCAATAGC CTGATTAAGT GCTTGTTCTA 10920 

CTAAATGTTT AGCAGCTAGT CTTTCTTCTT CAGTTGATAA ATCGCTTTGA TCGATTAGTG 10980 

CATTTTGAGC TTCGGCTTTT ACACCAACAG ATTGACGCGC TGCTGGTTTA ACTTGAACTT 11040

TAGGTAAAAT CACTTTGATG TTGTCGTTGC CATCAGTCnC AGTnCGATCC ACTTCTGCAT 11100 

TCGTTTTGTT TTGTGCAATG TCATTT 11126
(2) INFORMATION FOR SEQ ID NO: 172:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3660 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 172:

TTGCCCCGCA CGGCGGTGTG nTTCCTAGAA ATAATGAATA TAAAGaGAAA TATATAACAA 60 

CGATTTTGAA TTATGAACCT GGTGATATCG TTACAATCAA ACGTGTGAGA GATAAGACCG 120 

ATTTGCTAAT ATATTTGTCT AGTAAAGATA TTTCTATTGG TAATGAAGTG GAAATTGTAT 180 

CGAAAGATGA AATGAATAAA GTAATTATCA TTAAACGTAA TGATAATGTA ATTATTGTCA 240

GTTACGAAAA TGCAATGAAC ATGTTTGCTG AAAAATAAAA TAAAGAAGCC ATAAAGATAT 300

CCATGATTGA ACTGATAAAG ACATATGGAT AATTGCTTTA GGCTTCTTTT TTATTAGTTA 360

ATTTATCAAG TGAGTATATT TGAGTAAAAT ATTCACTGCA TAAAGATTGA AGATAATCCA 420
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CTGTGGACTC GGACGCTGGA AAGTCAATTT AGCAATCGTC CAACTAGATT GTAGAACTTC 540 

GCCTAATAAT ACACCTAAAA TATATTGATA ACTCATTGTG ACAAGTAGTT GAATTTCTAC 600 

TATATTTTCA TCTTTTAATA TAAAATACAA CATGATAGAA ATTAAAGTTA TAACAACAAT 660

GGGTGAGCCT TTTCtAGATG TTAAAATTAA AAAATAAATA AATATCAATA AATAGGTAAA 720 

TATAAAGAAA CTAGGTATCT GATAATGGCT CGACGCTAAA CCTATCAATA ACATAATAGG 780 

TGGCATAAAA TAACCACCAA TCGTTGTAAG CCATTGGCCT GCTAGATGTC TAGATTGTGT 840

AATTGCGAAT CCTTGTTGTA ATGTCTGTTG TCGCTCTCGT GGACTTGTTA CAATGACTAA 900 

ATCTTTTGCA CGGCCACCAG CGAGTTTATT AAACAGTACA TGACCAAATT CATGTGTTAA 960 

AACAGGGATA TAGTTTAAAA TGACATCTAA ATAGTTCAAA ACAGGCTTAT GTCTATATTG 1020 

ATGAATAGCA ATATAACAAG CTGCAACAAT AACGATAATG TATATATTAA GTTGAATTGT 1080 

CGTATTAAAA AAGTTTGATA AATAATTCAT TGTTAACCTC ATATAAGATA TTAATTTAAA 1140

GTTTGCTTAT CACTTATTAT AAATGATATT GGCATCAATA GCGTTAGACT TTAGACTTAC 1200 

CTTAGTTAAA CTAATTTTAA TTTTTGAAAA GGTGAATATG TGTTAAAATA AAGCAAAATC 1260 

ATTTCGATAT AAATAGGATG AATATAAATA CTGTTAATAT TGATTACACT AACATAATAA 1320

TGAAATAAGA TAGGAGATTC CTGTTATGAC TGTTGAAGAA AGATCCAATA CAGCCAAAGT 1380

TGACATTTTA GGGGTCGATT TTGATAATAC AACAATGTTG CAAATGGTTG AAAATATTAA 1440

AACCTTTTTT GCAAATCAAT CAACGAATAA TCTTTTTATA GTAACAGCCA ACCCTGAAAT 1500

AGTGAATTAC GCGACGACAC ATCAAGCGTA TTTAGAGTTA ATAAATCAAG CGAGCTATAT 1560 

TGTTGCTGAT GGGACAGGAG TAGTCAAAGC TTCGCATCGT TTAAAGCAAC CTCTAGCGCA 1620

TCGTATACCT GGTATTGAGT TGATGGATGA ATGTTTGAAA ATTGCTCATG TAAATCATCA 1680 

AAAAGTATTT TTGCTAGGGG CAACTAATGA AGTTGTAGAA GCGGCACAAT ATGCATTGCA 1740 

ACAAAGATAT CCAAACATAT CGTTTGCACA TCATCACGGT TATATTGATT TAGAAGATGA 1800 

GACAGTAGTG AAcGnAnTTA AACTGTTTAA ACCTGATTAC ATATTTGTAG GTATGGGATT 1860 

CCCTAAACAA GAAGAATGGA TTATGACACA TGAAAACCAA TTTGAATCTA CAGTGATGAT 1920 

GGGCGTAGGT GGTTCTCTTG AAGTATTTGC TGGGGCTAAA AAGAGAGCGC CTTATATCTT 1980 

TAGAAAATTA AACATTGAAT GGATATATAG AGCATTAATA GATTGGAAAC GTATTGGTAG 2040 

ATTAAAGAGT ATTCCAATAT TTATGTATAA AATAGCCAAA GCaAAAAGAA AAATAAAAAA 2100 

GGCGAAATAA TCATGATGAC AAAAATAAAA CCGAGGAAAT CCTTAAATGG AGATTCTCGG 2160

TTTTTTCGGT TTATTTAATA ACGAAGCGGG ACTCATCGAG TTTGTTTCTA AATTCTTTTT 2220
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CATCAAGTTC ACCGTAATCT TTTAACTTTC CGCCTTCAAT CCAAGCAATC TTAGTACAAA 2340

ATTGTCTCAC TTGTCCTAAG TTATGACTAA CGAAAAAGAT GGTTTTGTTT TGCTCTTTAA 2400 

ACTCGTAAAT TTTATCTAAA CATTTTTGTG CAAAAGTTTG GTCACCTACA GATAAAGCTT 2460

CGTCAATGAC TAAGATATCT GGATTAACTG TGATATTAAT TGAAAAACCA AGTTTTGCAC 2520

GCATACCACT TGAATACTTT TTAACTGGTT GATAAATAAA CTCACCAAGT TCACTAAATT 2580 

CAATAATCTT AGGTGTCATC GCTTTAATTT CTTTTCGCTT AAAGCCCATA CATAACATTT 2640 

TAAATTCGAT ATTTTCAATC CCTGTAAGTT GTCCACTCAA GCCAGCACTA ATTGCGATAA 2700 

CGCTGACTTC ACCATTACGA TCCACTTTGC CAACAGTAGG CGACAAAGAA CCGCCAATGA 2760 

TATTGCTCAA CGTTGATTTG CCGGAACCAT TGATGCCAAC AAGCCCTATG ACGTCACCTT 2820

CATATGCTTT TAAACTAATG TCATCTAAAG CGAAAAATGT TTTGTTTTTA TGTTTGGGAA 2880 

TGAGCGCATC TTTCATACGT TCTTTATTTG TACGATAAAT ACGATATTCT TTTGTTACAT 2940

TTTTAATGTT TACCGAAACG TTCATTTGTA GACCTTCCTT ATTCACATTT ATCTAGATTA 3000 

TAATATACTA CTCAACAGTT GTTAAATTTT AAAACCTGTT GTAAAGTGTA TAGAAGATTT 3060

TGTTATTATC AGAGTGGGTG TTTTGACACA AAATGTTAAT CATCAATGAT AACAATGATA 3120

TTTAAAAACT AAACTTATTT CAACTTACAT GATTGTATAC TATAATGTAT TTGTAATAAA 3180

CTAATATTTT AAAGAACTAG ACAATAATTT TGATAGCATC CATGTATAGT GATAGTATTT 3240

ACAACAATTA TTATAATACT ATTTAGTTAA GTAGAGAAAT AGTTAAACAT TTGAAAGTGT 3300

GGTTTAATGG AATGTCAGCA ATAGGAACAG TTTTTAAAGA ACATGTAAAG AACTTTTATT 3360

TAATTCAAAG ACTGGCTCAG TTTCAAGTTA AAATTATCAA TCATAGTAAC TATTTAGGTG 3420 

TGGCTTGGGA ATTAATTAAC CCTGTTATGC AAATTATGGT TTACTGGATG GTTTTTGGAT 3480 

TAGC5XATAAG AAGTAATGCA CCAATTCATG GTGTACCTTT TGTTTATTGG TTATTGGTTG 3540

GTATCAGTAT GTGGTTCTTC ATCAACCAAG GTATTTTAGA AGGTACTAAA GCAATTACAC 3600

AAAAGTTTAA TCAAGTATCG AAAATGAAcT TCCCGTTATC GATAtACCGA CATATATTGT 3660 
(2) INFORMATION FOR SEQ ID NO: 173: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 13868 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 173:
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ATTAATCACT TGTTGTGTAG AGTCTTGTCC GTTTTGGTTA TGATTGTTAG CCATGATATA 120 

CCTCCCTTAC AACACTCGTG GACCAGAAGT TTTCTGATCT CTCACATTAA CTTCTAACTT 180 

ACGTACTGGC ATTTCTGTGA AATATTCTAC ATTCTTTTTA ATATCCGAAC GAATTGCTTC 240

AGTTAAAGAT TGAACTTGAA CATTATTTGG TACGAAAAAG TCAGTTTTAA TGTCGATATA 300

AGATTTATTT TTTTTGTTAT ATAGTTTCGC AACTACATTT GGTTGTCTTA CTTGATCATA 360 

TTTTGCAACC GTATCGAATG CCGTCTTITC AACAGCTTTA CGAGATACGT AAACATGACC 420 

ATCATCGAAG TCTTTGTATA ATCCAGGTTT TCGATGCGTA GGTTTGAAGA TACTAAATAC 480 
TAATATAAGA CCTATTAATA TCAATAGTGC AGCAAGTGAA ATAAGTAATG GTTGGAACCA 540

TTCAAATTGA AGGAAGTAGT CTTGATATTC AGTTATACGT CCATCTTGGA TATACATGAA 600 

TAACAGGAAC CCCACGaTTA CTACTATTAA TAAGCCAAGG ATAAAGTTTT TAAGTCGTTT 660 

CACCCCTAAC GACACCTCCT TAGTTAAAGT TAATTTAAAA ACATATTAAA TATGTACCCA 720

TCAGTTTTTT TCTTAAACAT AATAAATTAA TAACTTTAAA TTTATTTTTA ATATATAAGA 780 

TGAAGTACCA TTTAGTAATA TATTCCCTAG TTTTTGTAAA TAAAACCTCA TTATTAATTA 840 

ATTyTCGTCA ATATGTTTTG AAGAACGATA TTCTAAAATA TCTGGGTCAC GATGTTTAAT 900

TAAAACCTTA TTACTATTTC TCGGTTTCTC CTCACTCAAA GATTTTATAA GCGACCATAT 960 

CATCGCTATA ATGACCACGG AAAATGGTAA CGCAGCAATG ATTAATAAAT TTTGAATTGC 1020 

TTGAGTACCA CCTGTGTAAA TCATGATGAT TGCAAATAAT GCCATAATGA TACCCCAACT 1080

CACTTTGACA AATGACTTCG GATTAATATC ACCACTTGAA CTCAACATAC CTAAAACATA 1140 

AGTTGCCGAA TCCGCTGATG TAACAAAGAA AATCATAATA ACAAGTAAAG TAATTAAGCT 1200 

TAATACAAAA CCTAGCGGAT AATGTTGTAG CGTCGCAAAA GTTGCTGTTT CTGTCGCAGC 1260 

TTTAGCAATA TCGGCAATAT GATTATCTTG TAAGTAAATT GCTGACGCGC CGAATACCGC 1320 

AAAGAATATA AAGCAAACTA ACGCCGGGAC AAAAAGTACA CCTAGAATAA ATTCTTTAAT 1380

CGTACGTCCT TTTGACACAC GTGCAATAAA TATACCTACA AATGGTGCCC AAGATATCCA 1440

CCATGCCCAG TAAAAGATTG TCCAATTTTG TAACCATTGG AATTTTTGAC CACCTGTCGG 1500 

AATGCGTAAA CTCATACTAA AGAAATTTGC AATATAATTA CCTAGACCAT TCGTAAATGT 1560

ATTTAAAATG TATAGCGTTG GCCCAACAAT AAAAAGACCA ATAAGTACTA CAAAAGCAAG 1620 

TAACATGTTG ATATTACTCA ACGTTTTGAT ACCTTTATCG ATACCTGACC ATGCTGACCA 1680 

AGTAAATAAT ATGGTTGCAA TGACAATCAA GATTACTTGC ATCGTGAAGT TACTCGGTAC 1740

ATTAAATAAA AAATGTAAAC CTTCGTTTAT TTGCAATGCA CCGAAACCTA ATGTTGCAGC 1800 
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CATTGCCTTT TCACCTAATA AAGGCGTCAA TGTAGCGCTG ACTAAGCCAG GATATCCTTT 1920 

ATGAAAGCTA AAATATGCAA ACACTAGCGC GACAATACCA TAGACTGCCC ATGCATGAAT 1980 

CCCCCAATGG AAAAATGAAA ACTGCATTGC ATCATTAATT GCAGATTGCG TGCCAGCTTT 2040

ATGAATAGGC GTTAATTTGA AGGCATGACT GATTGGTTCT GCCGTTGTCC AGAACACAAG 2100 

TCCTATTCCC ATACCAGCAC TAAATAACAT AGCAAACCAA GACGGCAATG AGAATTCAGG 2160 

ATCTTCGCCT TCTTCACCTA ATGTAATGTT TGCGTATCTC GAAAATAAAA TATACACACA 2220 

GACAAATAAA ATAACTAAAA CGAGCAATAA ATAATACCAA GAAAAATGTA GCGCAATAAA 2280 

TGTAGTAATG TTTTGCGTGA GTTTTTCTAA CTGTTTCGGA AATATTGCTC CAAAAGCAAC 2340

AAATATCGTA CATATCACTA AAGATACCCA AAACACTAGA CTTACTGATT TATTTTTCAT 2400 

AAATACAAAC CCTTTCTGTG TAATGGTAAG TTCATACCCA TAACTGCAAC ATTTTAATCA 2460 

TTTGTAATTT TATATAGACA CAATTAATAA TGCCTCATCT TTTAAAAATG ATATATAAAA 2520

CACACTCAAA TTATTTATCA TTGAGCAACA AAGTATTTTA TTGTATTTAA GTAATGCCTT 2580 

TCTAGTGCAT TATTGATTTG ATACCTGCAA AGTTGCCATA TTTCCGTTTA GAATCAATAG 2640

TCGCTAGACA CAAAAAATAA GTCGCCTATA CAGTATTTTC TGCATAAGGC GACTTTACTT 2700

ACTAATCTAT ATATTAATTA CTAATTTTCC AATCATTGAT TGTTTTTCCA ACAATTGATG 2760 

TGCTTGATAT AAGTTTTCAG GTGATAAACC TTCAAAAACT TGTGTCGTTG TTGGTTGGTA 2820

ATGCCCTGAT TCTATATTTT TCGTAATATC TTCTAAATAC TCATGTTGTT TAATCATATC 2880

AGGCGTTCGA TGAATTGGAC GCGCAAACAT AAATTCATGT GTAAATGTTA TACTTTTTAA 2940

TTTTAATGCA TTTAAATCTT GATCTTCATT AAAAGCTACG ATAGTCGTAA TATGCCCTAA 3000 

TGGTTTTATC AGTTCAATCA TAGTATTGTA ATACAAGTCT GTATTATAGG TGCAAAATAT 3060

ATAAXCTACT AATGGAATTT CTTTAAATTG ACGCACTAAA TCCTCTTTAT GATTCAATAC 3120 

GATATCTGCG CCCATCTTTT CACACCACTC TGTTGTTTCT TGTCGTGATG CTGTTGTAAT 3180 

GACAGTTAAA CCATACCGTT TAGCAATTTG AGTGGCTATA CTGCCTACAC CACCGGCACC 3240

ATTAATGATT AAGACAGACT TCCCTTGGTT TTCAGCAGGA TTCGTAGAAA TTTTAAATGT 3300 

ATCAAAAAAC GTTTCATATG CCGTAATACC AGTTAGCGGT AGACTAACCG CTTCATTAGC 3360

ACTTATGTTG TGTGGTGCTT TTGCAACTAT AGCTTCTGAC ACCAATTGAT ATGTCGCATT 3420

TGATCCTTGT CTATTTGGCG ATCCAGCATA AAATACAACG TCACCCGGAC TAAATAATGT 3480 

AACGTCTGGT CCGATAGCTT CAACAGTACC AATAGCATCA AACCCAAGTA CACGAGGTGC 3540

TTGAGTGACT TCCATTTGTC GTTGCTTTGT ATCTACAGGA TTTACACTAA TGCTATTTAC 3600 
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ATTTCCTTCT TCCAATTTAA AGGOCTTCTC AAATCCTATC ATTTTCATAT CGTTTCACCT 3720 

CATTTATGAA CTTATTTCTT ATTATACAAA ATAGAAGCCA TGTGTGCTTA TATCGCAGCA 3780 

TCATGACTCC TTTTTCATTT GAATATATAA ATAATTACAG ACGACTTTCG TATTAAATTT 3840

TAGACTTATT TCTACCATGT TGCTGAACAA ATTTACTTTA GATAAAAAAT TATTAAATTT 3900 

TGGTCAATTA ACAAAGTTAG TTTGTTAAAA CGTgATACTT TATTATTCCG TTACTTTAAT 3960 

AACTTGTTTA CCAAAGTTAT CGCCAGTaAA TAAATTTTTA AATGCATGTG GCGCATTTTC 4020 
AAAACCATCT TCAATGGTTA CTTGTGACTG AATTTTACCT TCTTGAACCC ATGTTGCAAG 4080

CTGTTCACTA GCTTCTTTAA AAGCATTAGC GAATTCACTT ACCAAGAAGC CTCTCATCAT 4140 

TACTTGCTTC TTAATAAGCG TACCTTGAAT ACGTGGTCCG ATATCGGCTT CAGGATGATT 4200 

ATATGACGAA ATTGCGCCAC ATACTGGTAC ACGTGCAAAA CGATTTAAAT GCTTAAATAC 4260 

TTCATCGCCA ACTGTTCCAC CAACATTTTC AAAATAAACA TCAATACCAT CTGGTACTGC 4320

TTGTGCTAAC GCTTCTGCAA AATCCTCTTT CTTATAATCA ATACCAGCGT CAAAGCCCAG 4380

TGTCTCTGTT AAATAATTTA CTTTTTTGTC GCCACCCGCA ATACCTACTA CACGGCAACC 4440 

TTTAATCTTA GCAATTTGAC CTACAACTGA ACCTACAGCA CCAGATGCAG CTGAAACCAC 4500

AACAGTATCA CCGGCTTTAG GTTGTCCAAT ATCAAGCAGA CCATGATATG CTGTTTGTCC 4560 

TGGCATTCCT AAAACACTTA AATATAAATC AAGTGGTACA TCTGTCGTTG GAACTTTAGT 4620

AATTTGATCC GCTTGGACAT GATTAATGAT TCGCCAAGGC AACATACCTA CAACGACATC 4680

TCCTTTTTTA TAATCTGCGA GTGTCGAATC AATTACTTTT GCAACGACAT GGCTAACAAT 4740

CGGTTTACCA ATTTCAAAAG GCTGTACATA CGAATCTGCC TTAGTCATAC GTCCTCTCAT 4800

ATATGGATCC ACTGAAATAT ACAGCGTTTG TACAAGTACA CCATCGCTCT CAAGTTTaGG 4860

CGTGFCAATC TCTTCaATTT TGAATGTATC CTCTTGAGGC ATGCCkTCTG GTATTTTGTT 4920 

AAAAAGAATT TGTTTATTTT GCATCATTAA TCACCTTTCT TTATTTGAAA CTTTTACTTA 4980 

TTTGTTACTT AAGCGTTAAG TTTGAATTGT GTCtTCGTGA TGTCTGTATG CAAATACATT 5040

CTTAGTTGTT ATATTTTGAC TTAAGCACTG ATTCATTCAT GTAACTTCAA CCACATTATA 5100 

TTTGCTATAA TCATAAATTT AAAATGTTAC GACTTAGACA TTTTATGGAA ACTCTCAAAC 5160

AATAGATAAT TTTTGAAAAG CTCTAATATT ACAAGCTTTT TTGCCCCAGA AAAACTAGCA 5220 

GTTGCTTTAT TTTTTCCATA AGAAGTCGAT TAACTCATTA GCAACATTTT CATTCTCATG 5280

AAGCTGACTA TGTTGTGCAG GCTCACCTTC ATATTTAGAT TCTCGATAAC TTTTCGGACT 5340

ATTTCCCAGT AAATATTTTA ATGATTTCGA AGAACTATTA GACACTCTGC CGTCTGAATG 5400 
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ATCTTTTAGC ACGCGTAATT GCTGATAAGG TTGATTCATT CGACTTGGTT TACCATCTTT 5520 

ATCAACTGTA ATTTCATTGA CATCTTCATT CATATTTAAA ACACCATTAA ATGTCCCTGC 5580 

AATATTCACT TGTTTGTTTA ACT6TGGCAG TGACTTGTCG TTACCATATG TCATCATATA 5640

TTGTGCAAAT GTTAAGTTCC CCATTGAGTG ACCGACAAAG TTGAATTTAT CGAAATTGTA 5700 

TTCAGATTGT AACTTAGTCA GTACATTTTT AAACCACGCA GCATTCTTAT CCAAATAGCC 5760

TTGTCTGTTA TTTTCAAGTT CAATTTTCAC AATAGGATTC ACTGCATCTT TTCTTAGTTT 5820

CCCTTTAAAT GTCACTGCAC CATCCTTTGA AACGTAAGCA GTGATGATAT CTTTAGTTAC 5880 

CCCTCTTTTT TCTGCTTGCT TCACCATAAA CTTTTCAGAA TTGGCACTAC CACCAAATCC 5940

ATGTAAGAAC AATGTTGGAA TTGGCTTTTT AACAAATTGC TGTTGTTGTA TTTTAAATGT 6000 

TTGTGCCTGT CGTTGACTAA ACACCACCAT AATAATAGAG CCTATAATAA TAGCGACCGC 6060 

TAACAATGTC GTAATAATTA CAAAAATTTT CTTCACACTT TTAACTCCCA TTCATGTCTT 6120 

TTATATAAGT ATAAAGGATG TGATTAAAAA TGTCCTTTAG TTGATTTTGA ATACATCATT 6180 

AACTTTTAAG ATGACTTTGG AAAGTTGTCC GTTAACGTTT GTTAATTGAT TGCTTCTTTA 6240

GCTTTCAATG GTGTGTCACC CATTGATTAA TATATAAATA TGTATATGCA TGTTTAATTT 6300

ATCTCTTTCT ATAAATAAAG ACCTACCAGC ACTCGACTGA TAGGCCTTTT AATATCTATA 6360 

ATTATTTAAT TTCTTTTGTT TCGGCTAACT CTTTGTACCA ATAAGCACTT TTCTTAGGAT 6420 

AACGTTCTTG AGTCTCAAAA TCGACATAGA ATAAACCATA TCGTTTTTCA TAACCATTTG 6480

ACCAAGAGAA CACATCCATT AATGACCAAA TAAAGTAACC TTTAACATTT GCACCATCTA 6540

TAATAGCATC TGCAATAACG TTCAAATGTT GTCTTACATA ATCAATACGT GCATCATCAT 6600 

GAACTGTTTT TTCAGATTCA ATAAATTCAT CTTTATATCC TAAACCATTT TCAGTGATAT 6660

AAAXCTTATG aTAGTTAGGA TAATCTTTAA CAACACGCAT GaTTTGATCA TATAAACCTT 6720 

GAGGATAGAT CATCCAGTCC CAGTCTGTGC GAGGTACGTC GACATCAAAT TCACGTTGTC 6780 

CGACACCTTT AAGTTGGTAT TTAGAACCGC CTTTATCACC TGTCGCATTA TGCGTGATTT 6840

CAGATTCTCC ATCGTAACCT CTCATCCAAT CACTCATGTA GTAATTGATA CCTAAGAAGT 6900 

CGTTTAAATC TTTGGCTGCA TCTAAAATGG CATAATCTTC ATCTGTAATG TTTAATTTAC 6960

CGCCATTAAC AGATAAGATA TGTTGCACAC CTTCCATCGT TTCACGAGAA TACTTACCTA 7020 

AATATGTTGC ATCTAAGATG AATTTATTAT GGATGATATC TTCTAATTCT GCTGCACGAA 7080 

CATCTTCAGG ATTTGATGGA TCGAACGGAT ATTTTGTTGG CAATGCGTGT ACAACACCAA 7140

TTTCTCCTTT GTATCCGCCA TCTTTAAATA ATTTTACTGC TCTAGCATGA GCCACCATCA 7200 
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CTACTAAATA TTGACCATCA CCAATAGGTC CAATTTCATT GAATGTAGTC CAATATTTTA 7320 

CTTCTGGGAA TTCTTTAAAA CAATATTCAG CATAATCTAC AAAGTAGTCA ATCGTTTTAC 7380

GATTTAGAAA ATCGCCATCT TTGTGTAaCA CTTCTGGTGT ATCAAAATGA TGCAATGTTA 7440

CAAATGGTTC AACATGACGT TTATGACACT CTGCAAATAA CTTATGGTAA TACTCAACAC 7500

CTTTAGGGTT AACTTCGCCA TATCCATTTG GGAAGATACG AGACCATGCA ATTGAAATTC 7560 

GGATACCATT AACACCGAAT TTTTCACTTA ATTCTAAATC CACTGGATAT CTGTTATAAA 7620 

AATCACTCGC TGGTTCTGCA GTGTACCAAT AGTTTTCTTC TAAATACGTA TCCCATGCTA 7680

CGCGACCTTT ACCATCCGTA TTTGTCGCAC CTTCTGCTTG ATATGCTGCT GTTGCTCCAC 7740 

CAAAAATAAA ATCTTCAGGT AATGTTTTAG TCATATGAAA AACTCCTATT CTTAATTTTC 7800 

AAATTGTTGT TGAACGAAAT CAAGGGCTGC TTGGCCATCT CGTGTCAATT TGATATATTC 7860 

AGCACCTTGA GTCTTCGCTA ATTTAATACC TAATCTATCT GTATCTTGCT TAATATCTTC 7920

ATAGTTAGAC GCAACTTGTG GCGCTAAAAT GATTAATTGG TACTCTTTCA TAATGTCCAT 7980 

ATGTGCGCCA TATCCGCCAG cTGCCGCTTT CACTGGCACA TGATATTCTT CAGCTGCTTT 8040 

ATTAAGTGCA TTGGCTAATA ATCCACTTGT ACCACCACCG GCACAAAGTA CTAAGACATT 8100

TGTTTGTTCT GTGATATTTG AAGCTTTAGC TGCATCGTCT GATACACCAC TTGCCGCTAA 8160 

AATTGAATCA GCTTTTTTCG TATCAAAGTT TGCTGCAACT TTTTCTTTTA AATCTGAATT 8220 

ACTTTCTTTA CGTCCTTCTT CTTCATCAAG AATTTCACTA TCATAAACTT TTAGGAATGG 8280 

GTAGTAAATA ATAATATCTA CAACAATCAA AGTAATAGCT AGTACGAATG ACCATAAACC 8340

AAAACCTGTA CCCATGATAA TGCCCAATGG ACCTGGTGTT GTCCAAGGTA AATTCACACT 8400

AAAACTATTC ATTCCTAACA CTTCAACGAA AAGTTTGAAA ATCCATACGT TAACAATTGG 8460 

TGCTAATACA AATGGAATAA AGAACACAGG ATTCAATACT AGTGGTGCAC CAAATAAAAT 8520 

TGGTTCGTTT ACACCAAAGA ATGTTGGTAC AACTGATGCA CGTCCAATCG CTTTGTTTCG 8580 

TTTAGATTTC GTCATCCACA TAAACATGAA CGGGACGACC AATGTTGCAC CCGTACCTCC 8640

AAATGTAACG ATAAACATTT GTGTACCTGA TGTAATAATT TTATCTGCGT GTTCTCCAGC 8700 

TTGAAGCAAC TTGAAGTTCG CTTCGATATT CGCATATGTA ATGGCTGCAA TTGCTGGCTC 8760

TACAATTGAC GGACCATGAA TACCTACAAA CCAGAATAAT GCAAAGGCAC CAAAGATAAT 8820

TGTGACACCA ATCCATCCAT CTGCTGCTGT AAATAATGGT TCGAATAATT TTAAAATACC 8880 

TTCCGCTACA TTTGATTTAA AGCTGTTGCG AATGACTAAA TCTAATGCAT AAAGAATGAT 8940

GATTACCGCT GAAAATGGAA TTAAGTCCTT AAATACTTGT GAAATATTCG GCGGTACTTC 9000 
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10 



AAATGCTGAT AAGAATGCTG TTAATAAACC TTTAGTTCCC ATAAATGCAC TTAAGAATCC 9120 

ACCATCTTTG GCTGGATCAG AAGCTAAGAA CAAGAATCCA CACATCGCTG CTAGCATTGT 9180 

AGAAATAAAG TTAATTTGAT TTGTACTTTC TAGCTTACGG TTAAATGAAT CTGTTAAAGA 9240

TTTCGCTGTC GTTCCTGCTA CTAAAAATGC TACAAGCCCC ATCGTATAGT TATATGGTTT 9300 

CATTAAAATG GCTTCCATGC CTTTATCCCA TTTAAAACCA AATATATTTG GCACATATGC 9360 

AATTAATAGA AAGATACTTG AGAATAAGAT GACAGGCATT GCAGAAATAA ATCCATCACG 9420 

GATGGCTCTT AAATATATGT TACGTGATAA TTTCTCGAAA AATGGCTTCC CTTTTTCAAT 9480 

TTGTGCGATC AATTTTTGCA TCATTGTCAT CACCCTCTTT TATAAAATTC TAATAAATGC 9540 

TTCATTAAAT CTTTCAGTAA AATTGTTGTC ATTAAATGAT CTTGACCATG CATCATCGTT 9600 

ACACTGTATG CAATATCATC ACCTTGCGCT TCTTTAGCCA ACAGGCTTGT TTGTGCTCTA 9660 

TGCGCTTCCG CAATGCAATT GTTTCCTTCT TCAATCAGTG CATCTGCTTT TGCAAAATCT 9720 

CCAGCTTGAG CTGCTGTTAA TGCTTCTAAA AACTTAGAAC GTGCATCCCC TGCAAATGCA 9780 

ACAATTTCAA AACCTAATAA TTGGACTTCT TCTCTATTCA TAGCATTAAT CCCCTTTTAA 9840 

ACTTATTTTC TTTGTTTCCA AGATGTCGCA GTATCTTTTA ATACTTTATT TAAGTCATCA 9900

ATATTTTTGA AACCAGTTGT ACGTAACCAT TCACGAGCAG CATCTTCACC TTGTTCAATG 9960 

TATACTTGAA CAGCACCAGA CCATGTAGCA CGGCCACAAA GTACCCCGTT GAATTTAGCA 10020 

CCAGCTTCGT GAGCAAATTT TAAAGTTTCT TGGAATAATT CCGCAGAAAC ACCAGCACTT 10080 

AAGTAAATGT ATGGTAAATG AGTTGCTGCA TCTTGATCTT TAAAGTGTTG TGCCGCTTCC 10140

TOTTTTGTAT AAACCACTTC ACCTTCAGCA AAGCCTTCTA CATATTTCAT GTTTACTGGT 10200 

ACTTCAACTT TCAATACATC AACGTTAAAG CGTGGTTCTG AGAATAATTT CATTGCTTCG 10260 

TTAACCTTTC TAGGCTTAAC TTTTGCGAAT TcAACAcTAC CGTTATcAGG AATGTTGTCA 10320 

TCGTATGTTA ATACTTCTAA AAAGAATGGA ATATCTTCTG CAACACATTC TGAACCGATT 10380

CTTTCAATGT ATGCTTTCTT TTGAATGTTA ATTTCTTCAG CATCATCAAC ATCATAGTAA 10440

AGTAAGAATT TAACAGCATT TGCGCCTTGT TCTTTTAAAC GTTTTGCAGA CCACTCTACT 10500 

AAACAGTCAG GTAAACGACC TTTAGCGTTT ACGTCATATC CAGTTTTTTC ATAAGCAAGT 10560

AATAATCCAC AATCTTTGTT ACGTGCATCT GAAGCTGGTA AACCATATTC AGGATCTAAT 10620 

AAAATTGAAG ATGCATATTG TGTTAATTCT TCCGCAACTA ATACTTTTAA TTGTTCAATT 10680 

TGAGCTACAG TTGGTTCTTC AGTTTGATGT TTTGCCATCA TGCGTTTTAA AGCACCACGT 10740 

TGGTCAAATG CTAATGCAGA AATGATACCT TCGTTGTTAC TTAATTGTTC AATTGATGCG 10800 
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TCATCATAAT TATTTAAATT GACATAACCT GTTTGTGCTT CTTGTGCATT CAGCATGCCT 10920 

AAAGTATTGG CTTTTTTTAG TAAATCGTGG TCGTTTTCAT GATTAAGAAT TGCTGAAGTA 10980 

ATTCCAGCAA CTGTAGAATC ACCTGAACCA ACCGGATTTA ATACACTTAT TGTCGGAATA 11040

TTCACTCTAT AGAATGTATG ATTGTGCTTA GCGAATGCAC CTTGTGCACC TAAAGACACA 11100 

ATAATCCACT CAATCCCTTC GAATAAGGGT TGTGACACTG CCTGTTTCAA ACTTTCTAAA 11160 

CTTTCATCAA GTGGCTGGTT AAGCAATTGA TATAGTTCAG AAATGTTTGG TTTAATGACT 11220 

GTAGGTTTGT ATGGATTTTC CAAAACTGTT TGCAAAGTtG CACCCGAGCA ATCTAATATC 11280 

ACAGGCACAC CTTTGTTTTG GCATCGTTCA ATGATTTGTG CATAATAATC TTGATTTAAT 11340 

CCTTTAGGTA AGCTACCTGA AATAGCAACT GCTTCAACTT TTTCTAATAA TTGTTCAAAA 11400

TGTTTAATAA ATCCTGCAGC CTCTTGATTA TCAATCTCCG GTCCCTGCTC TAAAATTTCT 11460 

GTTTGTTGCC CTTCATGTAA AATTGCAATG CAGTTTCGTG TTTCACCCTT AATGTTATAA 11520

AATGCATGCT TGATGTCGGC ATGATCTAAT TTTTTAGCAA TAAATTGACC TAATTCACCG 11580 

CCAATAAAAC CACTCGCAAG GACTGGCTCA CCTACTTGCG CAAGTACTCT TGTTACATTT 11640 

AAACCTTTAC CACCAGCTGT TTTACTTACT TCTTGAACAC GATTAACATC ATCTAATTTC 11700

AATGCTGTTA ATGGGTATGA AATATCAACG GATGGATTTA ATGTTAAAGT TAAAATCATA 11760 

TGTGTCGTCC CTTAATCGTG GTATTCGCCT CTGTCCCATT TTTCTAAGAA TTCATCAAAG 11820 

AAATGTGGAT CAGCTTGATC TGCATTGCTT GTTTCTAAAT GTTTAATTTT AGCGATTAAT 11880

TTTTTGTTCT CTTCAGTTGG TTTATATTCA GCATTAATAA ATGCATCGAT AATATCGCAC 11940

ATTAATAACT CACCTATAAT ACGTCCACCG AAGCCAATAA CGTTCGCATT TAATTCTTCT 12000

TTAGCGTATA ACGCTGATGT CATATCACGT ACTAGTGCTG AACGAACGCC AGGTACTTTA 12060 

tttXcagcat tgttaatacc aacacctgtt CCACAAATAC AAACACCTAA GTCTGCATTA 12120 

CCGCTAACAA CTTGTTGGCC AACTTTTTTA CCAAAAATTG GATAATGTGT TCTTGTGAAA 12180

TCGTATGTTC CTACGTCAAT GACTTCATGT CCTTTTGATT TTAAAAATTC AGATACACGC 12240

ATTTTTGTAT CTGTAACAAT ATGGTCGCAT CCTAATGCAA TCTTCATAGT AATTTTTCCT 12300 

CCTTAGCACA TTTTATTAAG CATATCTACG CGGATTTGGT GTCTACCACC ATCGTATTTA 12360

CCTTCAACAA AACCTTTAAC GACATTTTTC GCTAATGTGT CTCCAACAAT TTCAGATCCC 12420 

ATAGTGATCA TTCTTGAATT GTTATGGCCT CTAGTCATAT ATCCAGAGCG TTCATCTGAT 12480

ACTTCAGCAG CAATCATGCC TTTGATTTTT GTAGCAACCA TAAAGCTACC TGCACCAAAT 12540

GCATCGATAA CAATACCTAA GTTACCTTCT TGACTTTGAA CATCTTTTGC TACAGCCAAA 12600 
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5 



TCTAATAAGT ATGATTTGAT GACTTCTTTT AATCGTTTGC CAGCTTCATC TGAACCAATA 12720

ATAATCGCCA TAATAAGACT CCTTTTTACT TTAATTTTGA AATACCTTTC TTAAAATGTG 12780

ACATATTTAT TTGTAGGTTA TGAAAATCTT GAGAAAAGGC TTTCAATTTG ATTACGTTTA 12840

AATTA7AAAC ATAAACAAAC AATAAATCAA CATAATATGT TTATAATATG TTTGTTTATG 12900

ACGTATTTTC AAACAATAAG TGAACATTCA TATTGTGGTG TTGTTTTAAT TAGGTATTCG 12960

TCTGAAATTG TAGTAAAACT TTGTCGAGGT TCCCGTTGaC ATAAATTTGC ATAAAAAAtA 13020

GCCCATAAAT GAATGCAAAT TCACATTCAC TTATGAGCAT ATAGATACAT ATTTTAACAA 13080

TGCAGTTATA CTTTTAATTT AGTCGACTAC TTCAATATAT GTTTTAATCG TTTCTACTTT 13140

TTCTTCATCT TCATAGTCCA TGACCACTGC AGTCAATTCG TTTAACTGAC AAAATGATGT 13200

AAAATCTTCT TTGCCAACTT TCGTATGATC GATTAACAAG TATTTTTCAA TTGAATTACT 13260

TAGTGCCAGT TGTTGCGTAT AGGCTTCATC TAATGTAGAT GTCATCACAG CACCTTTATT 13320

TACTGCGTTA CTACTAAAGA ACATCTTGCT AAATCTTAGT TTTTCCAACA TGGCGTTCGC 13380

cATTTCACCT ACAAATGCTT CTGTAATATG GCGCATTTCA CCACCAATTA AATAGACACG 13440

AAAATGTGCT GTTTGTTTTT CTAACAAAAT TTTATACACC GGCAAACAAT TCGTAATAAT 13500

TGTGAGCGTA TGATGATTGA CTTCTTCTGC TAATAGTTCC            CTGGTCCGAA 13560

AAACAAAGTA TCCCCATCTT CAATTAATGA TGCAGCTTTT TTAGCTATAA ATCGTTTTTC 13620

TGCAATTTGA CGGGTATGTT TTTCTTTATG CGATATTTCT TTATACTGAA ATGTTGAATT 13680

ACTGCGTGCA CCACCATGAA TCTTCGTTAA AATCCCTTTA TTTTCCAATT CAATTAAATC 13740

TCTTCGAACT GTCATATCAG ACACATTTAA ACCTTCGACG ATTTCATTCG TTCTTATCGT 13800

GCCdTTTTA TTCACTAGTT TAGCAATTTC GTCCAAACGT TCATGTTTAT TCAATGTAAA 13860

ATTGC2TTC 13868



(2) INFORMATION FOR SEQ ID NO: 174:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4549 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 174:
TTAAGTCAAC TTTGTCTATA CGGTTTGGAT CtTCTaCCCA ATGTCTTATA AAAGACAATC 60
CCGCACCTGA AACATAACTC ATGAAATAAG AAAATGGTAT ACCATTAATT TGATCATTTT 120 
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5 



AATCTTTACC CATACGAAAC ATCAATTGAT AAAATGCGAT GTCTTTTTCT ATCATTTCTA 240
TTAAAACGGT CATAATTTGA TGTATGTTAT CCGTGGATAA CTTAACTGCT CCATTTAACT 300
TCTCATCATG AATGAAGTCT CTTATTTCCT CCAACTGCTG GTCCTCTAAT TTTTCAAGCA 360
AATCATACTT ATCATAATAA TGCGTATAAA ATGTACTACG GTTAACATCA GCTAAATCTG 420
CAATTTGTTG CACAGTAATC TCTTCTAATT GGTGTTGATG TAAAAGTTCA ATAAATGCAT 480
TTCTCATTGC AACTTGTGAT TTTCTAATAC GTCGATCTAT AGTCATTTAT ATCAAGTCCT 540
CCCCAATGAT TATAAACGTT ATGTTCATTA TCCCACAAAT CTCCAACATT GATGATTGGC 600
ACACAATGTT TACCTGTTTA ATATAGGTGA TACAAACAAA CAGAAAAAGG TGATAACAAT 660
GAACCAACAT TTACTAGGAA ATCCAAAATT AACTGTAACT CATGTCAATG AAGTTAAAGC 720
CGGTATTAAC CACATCGTTG TCGACAGTGT TCAATATGGA AATCAAGAAA TGATTATGGA 780
AAAAGATGTC ACTGTGGAAA TGCGCGATGG CGAAAAATTA TATATTAATA TTTTCAGACC 840
AAATAAAGAT GGCAAATTCC CTGTAGTTAT GTCTGCAGAT ACTTACGGTA AAGATAATAA 900
GCCTAAAATC ACAAATATGG GTGCCCTTTG GCCAACATTA GGTACCATTC CGACATCTAG 960
TTTTACACCT GAAGAATCAC CAGACCCAGG ATTTTGGGTG CCAAATGATT ATGTTGTAGT 1020
TAAAGTTGCA TTACGCGGTA GTGACAAATC CAAAGGCGTC TTATCTCCAT GGTCAAAAAG 1080
AGAAGCGGAA GATTATTACG ArTGATTGAA TGGGCAGCAA ATCAGTCATG GAGTAATGGA 1140
AATATCGGGA CAAATGGTGT TTCTTATCTT GCGGTGACTC AATGGTGGGT CGCATCATTA 1200
AATCCACCAC ATTTAAAAgC AAtGATTCCT TGGGAAGGCT TAAATGATAT GTATAgAGAA 1260
GTAGCCTTTC ACGGAGGTAT mCCAGATACT GGCTTTTATC GTTTCTGGAC TCAAGGTATT 1320
TTTGCGAGAT GGACAGATAA TCCAAATATC GAAGATTTGA TTCAAGCACA ACAAGAACAT 1380
CCTC7GTTCG ATGATTTTTG GAAACAGCGT CAAGTGCCAT TATCACAAAT TAAAACACCT 1440
CTACTAACAT GTGCTAGTTG GTCTACACAA GGTTTGCACA ACCGTGGCTC TTTTGAAGGA 1500
TTTAAACAAG CTGCATCTGA AGAAAAATGG CTATATGTGC ATGGACGTAA AGAGTGGGAA 1560
AGTTACTACG CTAGAGAAAA TCTCGAACGC CAAAAATCAT TCTTTGATTT TTACCTTAAA 1620
GAAGAAAATA ACGATTGGAA AGATACGCCT            ATGAAGTTAG AGATCAATTT 1680
TATAAAGGCG AATTCAAATC AGCGTCACGT GTCCCTTTAC CTAACGCAGA ATATACACCA 1740
TTGTATTTGA ATGCTGAAAA TCACACATTG AATCATGCAA AGATTAGTAG CGCGCATGTC 1800
GCACAATATG ACTCTGAAGA TAAACAACAA GATGTAAGTT TTAAATATAC GTTTGACAAA 1860
GATACTGAGT TAGTTGGAAA CATGAACTTA AAACTATGGG TAAGCACTAA AGACTCAGAT 1920
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10 



CCTGATTTTA ATCATATTGA AAATGGTCAA GTAGCTACTG GTTGGTTACG CGTATCACAT 2040
CGTGAATTAG ATCAAGAAAA ATCCTCAATC GCGCAACCTT GGCATAAACA TGAAACAGAA 2100
TTAAAGTTGT CACAAGATGA GATTGTACCT GTTGAAATCG AATTGTTACC TTCAGGCACG 2160
CTATTTAAAC AAGGCGAAAC ATTGGAAGTT GTTGTAAAGG GTAGTGAAAT TGTAATTGGT 2220
AATAGTACTC CTGGCATGAA AACACGTTAT GAACATGAAG AAACCGTAAA TAAAGGCATG 2280
CACATGATTT ATACTGGTGG TAAATATGAT TCACAATTAA TCATTCCTAT CGTTAATTGA 2340
TATGCAGCAA TTACGGTCGC TTTTGATTAA AAGTGACATA GTGATAGGAC TGTATAACAA 2400
GAGAAAGCCA CACGCTTGGA ATCTTAAACC AAGGTGTGGC CCTTTTTATT ATTGATGGCT 2460
ATTTAATTTT ATAACACTAT CGTATTTTCT TTTTCATGAA TCATTTCAAT AATGACATTA 2520
TCTTCATTCA TTACTGCTAC TTTAGGTGCA TGGTTTTTAA TTTCTTCTTC ATTCAACTGT 2580
GCATAAGTCA TGATTATGAG TACATCGCCT ACTTCAACAA GTCTTGACGC TGCACCGTTT 2640
AAACAAATTT TACCACTACC TCTTTCACCA GCTATTACGT ATGTTTCAAA ACGTGCACCA 2700
TTATTATTAT TCACGATGGC TACTTTTTCA TTTGGCAAGA TGTCTACCGC TTCCAATATA 2760
TCTGAATCAA TCGTAATGCT ACCTACATAA TTTAAATTTG ACTCAGTCAC TCTTGCTCTA 2820
TGAATTTTAG CATTCATCAT TGTTCTTATC ACTTTATTCA GCTCCAATTA TTATATTATC 2880
TATTAAACGC GCTTTTGAAA ATTTAACAGC TAAcGAGATA AATATGCGTC CAGTTATTTC 2940
GTGTTGTTCT ACTAATTGAG GATAACTATA AACAGCAACT TCTTCAATGC GTTCACTTAT 3000
ATGTGATTCA AGATATTCAG TAACCCTGTC TATAATTACT TTACTTTGAC GTTCACCGTC 3060
TTGATACAAC GCTTGTGCTA ATAGCAAACT TTTACTTAAA TGTACCGCTT CTTGTCGTTC 3120
TTGCTCCGTT AAATAAACAT TTCTTGAACT TTTCGCCAAA CCATCTGCTT CTCGAACGAT 3180
ATCA£TACCA ATAATTTCAA CGGCATGATT GAAGTCTTTT ACCmTTTGCT CGaCAATAGC 3240
CAATTGCTGG GCATCTTTTT TACCAAAATA AGCATAATCC GGCATAACAA TATTAAATAG 3300
CTTATTAACT ACTGTTACCA CCCCATCAAA ATGCCCTGGr CCGtTCGCTC CTTCTAACAC 3360
ATCAGCTAAT GGGCCTACTT TGACATCAAT ACCTAATTCA CCTGGATACA TATCTTCTAC 3420
TGCAGGATGA AAAACAATGT CCGCTCCTAC TTCTGATACT AATTCTAAAT CTTTATCAAT 3480
TTGTCTCGGA TAAGCATCGA AATCTTCGTT TGGACCAAAT TGTAATGGAT TAACAAATAC 3540
ACTCACAATT GTAATATCAT TTGTACTAAC TGATTCGCGT ACCATCGTTA AATGTCCATC 3600
ATGTAAGGCA CCCATTGTTG GGATAAAACC AATCGTTGTG CCTGAGCGTT TGGCTGCTTT 3660
AACAATGTGT TGCATCTCTT TTACCGTAGT AATCAGCTTA GTCATTGTTA TTAACCTCAT 3720
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GATCGTATTG TTTTAAACCA TCCACACCAA CACTAAAATC AGCAAATTGC TTCACAAATT 
TCGCTTTATG TTCAACACCA TAATTTAACA TATCGTGATA AACCAATACT TGACCATCTG 
TACCTTTTCC TGCACCAATA CCAATGACTG GAATTGTTAA GTGCTTGCTA ATTTCTTCTG 
CTAAATCATT TGGAATTGCT TCAAGTACTA ACGCAACTGC ACCAGCTTGT TCTACATTTT 
TCGCGTCTAA AATAAGTTGc TCCGCTGCTT CTTTCGTTGC ACCTTGTAAT TTATACCCCA 
TAACGCCAAC ACTTTGAGGT GTTAATCCTA AATGTGCAAC AACAGGAATA CCAATTGCCG 
TTGCTTTTTC AATAAATGGT GTAATATGCG CTCCTTCTGC TTTAATTGCA TTTGCATTCG 
TCTCCTGATA AAGCTTTAGA GCATGATTTA AGTCTTGTGT CATAGAGATG CCTACTGCAC 
CAATCGGCAT ATCAACAACT ACAAATGTAT TTGGTGCGCC TCTTCTTACT GCACGACCGT 
GATGAATCAT ATCTGCTAAC GTCACTTGTA CGGTACTTTC ATAACCTAAT ACAGTCATAC 
CAAGTGAATC CCCAACAAGA ATCATATCAA TACCCGCTGC TTCCACTTGT TTAGCACTTG 
GAAAATCATA AGCTGTTACC ATAGAAATTT TAGTTTGCTT TTGTTTCATA TCTATTAATT 
GACTTACTGT TTTCAATGTT ATTCAACCTC TTTTTGCAGT ATnATTAGA 
(2) INFORMATION FOR SEQ ID NO: 175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8339 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 175:
TTATCTTTTG TTGTTTCCTT AGACAAACGA CTAACCACAT TATAATGGAC TAATTTATTA 
ATTTTATTTA ATTCCATTAA GTTATCCGTA ACACTAAGTG AAGATGCGGA GTTCACTCTC 
GTTTGTACTC TTCGTTTTAA TAAAGCACCT CGTAATAATA CAATCATTCT TCTTATTAAT 
GATGCTTGTC TATATACCTG TGTTCTTTCA GCATAACGCA TATAGTTTTC AAGTACACTA 
TTCGTTATTT GTCCTTCATC TACTAAAGAC TCTAATGTTT TkGTTTCTAC ATTAAAAGCA 
ATTTTTTGTA GACGTTCTAA TTCTTTAGAG TTTTCATCAT CTTTCTCTAC AGTTTTTAAA
AATGCTAATT TATCATGATA TTCTTTAATC ACGTTACCAT ATTTAAAACT TGTTTCGAAA 
GTAGATTTTT GATTTAGATA ATCAATAACT TGTTCTAATA TATAAATTCT AGCAACTTTA 
AACGACATAT TGCCAATTAC TGTTTTAGGT GCAGGTTTCG TTAATAATGG CAATAATACT 
TGCGCAACTA CCAAACTAAT AATAACCATA CCAGATGCAA TAAATAATAA GTCGTTTCTA 
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ATTGTTCCAT GCACACCACA TAACGTCATA 
TTCTCAGTCG TTGGATTATC ATCATCATTT
GCTAAATAAA AATAAGGATA TAAGACATAA
AAAGCAACAA CAATAGTGAT GCCTATTAAA
ATTTTAATAA TAACTTCAGG TACTAAAAAT
ACATAACCTA GTATATTCCA TGTATGATTG
ATAATTCTGT CACGTTCGAA ACCATGTACA
GATGCGTGaA ACAATTCAGC AATTAAATAC
GTAAACATAT TAATGTTTTC ATATCCTCGA
GCCATACCTA TAAGTAAACC AACCACTGCG
ACAGCATCAA CAAGTGAAAA AGCACCTGTA
ATAATACCAG CAGCATCATT CAATAATGAC
GGCAAGACCT TTCCTTTAGT GATTGCTTGC
GCAGCAATTG CAAATGCTGC TCCAATAGGT
CCTACACCTA TCACAGTAGT AATGACTAAT
TATTTCCTTA AATGGACTCT AGAAACATTA
ATTGTTACCA TAAACAATTC AGAATCAAAA
AGTAACATGC CCAAGAAAAT TTGTATAAAT
ACAAACGAAC TTAGTATCAC AACAGCTATA
CTTTCACCTC TCTAAAAAGT ATTGTTTAAT
TATACTTTAG AGGATAAATT GAGTTAGCGA
CTACGATTGC AGTACTTAAA TTTGCAATTA
ATAAATAAAC AACTTGCTTT CACATAACAA
TTAAAATCGA CTAACCAATT TCaAAGTACT
TCaAAAGCAT TGTTATGCTT AACAATTTAG
TTACGATGCT CTTCTCGTTT TTCAGCACGT
TTTGAACTTA ATAATATTGA TGCATGTGTG
CCGTTGCGAT AAGCAGCGCG AGCGACTAAG
AAAACAAGTG ACAGTAATAA ACGCACACTG
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ATTAAAGCGT ATAAACTTCG 


CTTTGGTGGT 720
TTAGTCATCA TTTTTTGGAA TGGACTGATG 780
ACCCAAACAA ATCTAAATAG ATAGACAGCT 840
AAGATTAAAT TGTGCGGTTC TGTTTTGATA 900
CCTAATATTG AAAAAACAAA GCCATTTAAA 960
TAACTCATTT GCAGTTGTGT ACGTACTTGC 1020
AGTCCTGCAA CTACTGCTGC AATGATTCCT 1080
GTAACAAATG GTGTTAACAA TTGAATAATT 1140
CGCATCAATG TTAATCGGAA CCTTACTAAT 1200
CCACCAATTG ATGCAATTAA AAACAACTGA 1260
ACTAATACTC CAACAGCTAT TTTAAATGAA 1320
TCACCTTCAA GAATTGTCAT TGCTCCTTTT 1380
ACTGCTACTG CATCAGTAGG ACAAAGAATG 1440
AAATCTGGCC AAATCCAATG AATAAATAAA 1500
CCTAATGCCA TCATCATCAC TGGCTTAATA 1560
ACACCTTCTA CAAATAACAA AGGCGCAATC 1620
TTAAATTGAA CAGGGATTGG GGTAATAAAT 1680
GCTAGGGGTA CTTTAGGTAT GAAAGTATGG 1740
AATATAAGAA TTGTTTCAAA TATTTCCAAA 1800
TGAAAATTAA GTATCACATC TCGTTGTAAT 1860
CCACAAAAGC ACTTTAATAT AGATATATGT 1920
TTTAATTTTA TTTTATCACT AATTGTTTGT 1980
CATTAACTTA TAATACAAAA AATGAGCACC 2040
CTTTTAATGA TTAATTTTGA AAACAGATTT 2100
CCAACACTTC AATCGTTTTG ATACCATTTC 2160
AATTGTAATG CTTCTGTAGA GTTTTGTTCA 2220
TGAGCATCAT TTTTTCGATA CATATAAGCG 2280
TGCATGCCGA CTGGTGAAGT TAAATTAATA 2340
AAAAATCCTG TATTCACAAT AAAATAAATT 2400
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CTTAAGAAAA CATCTTGGAA TTTCACGATA CCTATTGCAC TAATAAGAGC AATAAAACTA 2520 

CCTAACAACA ACA7CACAGC AGCAATAAGA CTAAAGATTT CTTTTGTTAT TTCCATTAAA 2580 

CACATGCCCC CCACCAATAA AGCGTGATAT TGAAACAGAA CTTACAAAAG ATATAATGGC 2640 

AATGAGCATG ATTGAATCTA AGAAAGAAAC GGTGCCCATA AGTACACTTA ACACACCCAC 2700 

AATTGACATT ACGACAGCAC TTGTTGTATC AAATGTAACG ACACGATCTG CTGTTGTAGG 2760 

TCCCTTGATT AATCTAAATA AACAGATGAT TAATGCAATT CCAAAAATAA TGAGTGAACT 2820 

AATAATCATA ATATGTGTTA TTGTTTGTAT CATCGCGACA CCTCCAATAT TAAGTCTTCA 2880 

TAATGCTTAA TACTTCTTAA CAAACTATCT TTTTCTTTTT CTGACACGTC GATACTATGA 2940

ATAAAAAACT TTTTAGAGTC TTGAGAAATT CGTATTACTG TAGACCCTGG AGTTATAATA 3000

ATTAAAATTG TTAAAAATGT TATTGACCAA TCACTTGTTA GTCTTGTTTC ATATGAAAGT 3060 

AATCCAGGGT TCATATCTTT TGTTTTAAAA AGAATATAAT TAATCGTGCT AATGCTAGAT 3120

GTTATTAATT GATATAAATA AACACCTAAA AATTTAATAG CTACCCATAT TTTTCTAACA 3180 

TAAAAATCAT CGCTGAAAAA CCTGTGTAAT ATATAAATGA CAATTAAACC AATTAGATAT 3240 

CCAGAAAAGA AAGTCGAGAA TTTAAAATGA TCTTCATCTT GAAATAATAC CCATAAGAAT 3300

GCAATGATAA TATTTAAAAC TATTTGATTC ATTTAGTCCT CTCCTTTCAA ATGCGGATTT 3360 

ACAAGTTTTT GATATAATTG ATCACTCGTG TTCAACTCAG TTGCATCACT TGTAACATTT 3420 

AACACAACAG GTGCAGCAAT TCCGATTGCG ATAACCACAA CTACTAAAAT ACTTAAAATT 3480

CTTTTTCGAT ATAGCGGGAT TTTCTTAAAA TTAACTTCCT CCCCATCTTT ATCTCCAAAA 3540 

TACATATAAA AAAGTATCCT AAATAAACTG TACATTGCAA TTAGACTAGT AATAATCATT 3600

AACGCTAGTC CAATATAATT GCCATTTTGC AATGCACCTT GGAAAATAAG TACTTTCCCC 3660 

GGAfiAGCCAC TAAATGGAGG CACGCCGCCA ATAGCAAAAA TCATTATAAT AAACGCAACT 3720 

CCAAATAAAG GTTCTTTTTT AGCTAAGCCA TTCAAATATT GATATTGTCG ATAGCCTGTA 3780

ATGTAAACTA AACTACCAAT AATAAAAAAT AGCAATGTTT TTACAACAAT GTCATTTACC 3840 

AAATAAAATA TTGCACCATT AATACCTGCA AACGTGTTTG TTCCTAAACC TAA7ATGATA 3900

AATCCTATTG AGATTATGAC TTGGTAAGCT GCAATCTTTT TAATATCTTT ATAAGCAATG 3960

ACACCTATAG CGCCGATGAC CATAGTTATA GCAGCCATAG TTGCTAGCAA TGGATGTATG 4020

AGATCATTAT GTTGATCAAA TAGTAAAGTG AAGAATCGAA TTAATGCATA GGCCCCTACT 4080

TTGGTCATTA ACGCTGCAAA TAATGCTGCA AGCTcAGTAT TTAACACAGC GTAGGCTTTG 4140

GGTAGCCACA TAAAAAGGAC CAGCGCTGCT TTCGCACTAA ATGCGACTAA GAAGATTAAT 4200 

AAGTTTAATG TACCTACTGT TTTATAAAGT AAACCTATAC CTAATAAGAA TAGCCATGAA 4320

CCAATAATAT TCAAGACAAC ATAAATAATT GCAGCACGTA ATTGTTCTAC AGATTGTCCA 4380

AGTGTAATGA GTACAAATGA CGCTAGTAAC ATAATTTCAA ACATGACGTA TAAATTAAAT 4440

AAATCTGATG TTAGAAAAGA GCCTATCACG CCAACACTTA AAAATAATAT GAACGATGGC 4500

AAGTGATAAC GATTTGCTTT ATGTTCGCCA CGCCCAAATC CGTATGCCAT AATTAAAGTA 4560

ATCACAAACG AAGCGGTTGT AACCATAATT AAACTTAAAG AATCTCCTAA AAACTGTATA 4620

CCAAAGGGCG CTGACCATCC TCCAAAGTCT AGCGTAATTG GACGGTGACG CTGAACATAA 4680

ATTAATAGCA TTAATGAAAT AATTGTGGTG ATAGTCATTG TACCTAAGTA TAAATATTTA 4740

GAAATACGAT CATTATTTTT TAAAAATACA AGGATTAAGG CACAAAGGAA TGGTAATAAC 4800

ATTGGTAAAA TCAATAAGTT ACTTAGCATC ATCTTCCCCC CTTAGGCCTT CAATTTCATC 4860

TTCTTTTGTT ACTTTATAAG TTCTATAAAC AAGTACAAGT AAAAACGCAG TCATCCCAAA 4920

CCCTATAACT ATTGCAGTTA GTACAATAGC TTGTAACAAG GGATCAACAA ACAATTGGTT 4980

TCCACCAGTT ATTAGTGGTT CTGATCTACT AGAACCATAC GTTCCCATAC TCATAATAAT 5040

GAGATTACCA GCATGAGTAT ATATTGAAAT TCCGATTACA ATACGAATTA AATTGATTGA 5100

TAAAATCATA TATGTTCCTA TAAACACTAA AAATCCTATA ACTAGTAATA ATATTAAATT 5160

CATGATCGAC CTCCGCTAAG CGACAACATC ACTGTGACAA TAACACCAAC AACTGAGAAT 5220

AAAATACCTA ATTCAAAAAG TGTTATTGTA CTTACATGAA TTTGTCCTAA AATTGGAAGT 5280

ATCCAAGTTG TTTCATATTG AGACAAAAAT GGTTTTCCAA AAAACATAGG TATTATCGCA 5340

GTAATAGATG ATACCAATGC TCCAATAATC ATTAAAATTC TAAAATCAAT CGGTAAACTT 5400

TCTAAAACCT CTTCAACATT AAAAGCCAGA AACATTAAAA TAAACGCTGA ACTAAATATT 5460

AAACCACCAA TAAACCCACC ACCAGGATTA TTATGACCTG CGAAGAAGAC ATAGAATCCG 5520

AAAGTCAATA AAATAAATAC AACAAGTTTC GTGACCGTTC TTAACACGAC ATCATTCTCT 5580

TTCATCTTGT CCCCTCCGAT CTTGATAATT TAATAATGtg TAAATACCTA GCCCAGTAAT 5640

AATTAACACT AATCCTTCAA ATAATGTATC TAATGCTCTA AAGTCACCAA GTATCGCATT 5700

TACAATATTT TTACCACCTG TTAGTTTGTC AGCTTTTAAA TAAAAGTCTG ATATTGATGA 5760

TAAACCATCT GTTTGTTGTG TAATAAAAAT TAATGATACA ACAATAAGTG CCATCAAGAG 5820

TGATACAGAA ATTTTAATTA TTTCTCTTTT TTTGTTAGCG TTAGATCTTG GCACGTTTGG 5880

TAATCTTGAA AAACTGACAA TAAATAGTAT CGTCGTTATT GTTTCAACTA CTAGCTGAGT 5940

CAATGCTAGA TCAGGGGCTT TCATTGCTAT AAAGAATAAG GTCACAACAA ATCCGATGAC 6000
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GACAGTTACG ATTGCTAATA TAATTTCTAA TGCCCCAAAT TCAGAAACAT GTAACTGATG 6120 

TACTTTAGGA AGTCCaATTC GAATATAACC ATATCCAATG ATAATCATAA ATATGCCTAA 6180 

GGTCATAATA ATGTACTGGT TTAAACGATC TTGCATAACA CGTTTAAATC GCTTCGTAGC 6240

AAACTTTTCA AAATGTCGAT ATACCATCTC ATAGCTTTTT GAAACTGAAA TCTGTCTAAT 6300

TTTACCTGTG AACACTTTTT TCCAATCTAC TTTGATTGCT AGTAGACTAC CCAATAAAAT 6360

AATGATGATG GTTAAAAGAA GCGGTATGTT AAATCCATGC CATTGCGAAA CATGTGGTGC 6420 

CAATTGATCA ATTTGATGAT TACCACCTGA TACAGCTCTT AATGCnAGAA CGATAATCCC 6480

CTTCCCAAAT ATATnTGGTA CAAAAAAGAT TACAGGTACT AGCACCATTA aTATAAGAGA 6540

TGGTAAACTA aACAACCATG GTTCGTGGAT ATTTTTTTTA GTAAAAACCT TAGAATCATA 6600 

TTTTGtCCAA AATACTTCTT TTACCATGTA TAGTGCATAT GTGAATGTAA AAACACTCGC 6660 

AATAACACCA ACAAACACGA TAGCTATCAT TGAAATCAAA CTAAATTGGG ATAATTGTCC 6720 

AGTTTGTGTT AATGCATCTA AAAACATTTC TTTACTTAAA AATCCATTTA AAAATGGTAC 6780 

TCCAGCCATA GATAGAGCCG CTATCGTCAT GACTAGATTC ATTTTAGGAA ATAGTTGACG 6840

CATTCCACTT AAAATTCGTA TATCCCTTGA ACCTGCTTCA TGATCTAAAA TACCTACTCC 6900

CATGAAAAGC GCACATTTAA AGATGGCATG ATTCATTAGa TGAAATAGcG CACCArATAA 6960

TACmAATACA TAAATaGATG CTATTGCGTC TTGTTGGTGT TGAGCATATC CGCCACCTAT 7020 

ACCCACCATA GCCATAATCA TCCCAAGTTG ACTGATTGTA GAGTACGCTA GGATACCTTT 7080

TAAATCCCAT TGTTTTAAAG CTGTAATTGA ACCAAATAAC ATTGTTATTA AACCAACAAA 7140

CGTAACGATA TATACGTACA TATTGCTAnG ACCTAATAAT GGTGTAAATC GAAGTAATAG 7200 

AAnGATACCA GCTTTTACCA TCGTGGCTGA ATGTAAATAA GCACTTACAG GTGTAGGTGC 7260

AGCCATTGCT CTAGGTAGCC AGTATGAAAT GGAraTTGTG CTGATTTTGT AAATGCACCT 7320 

AATAAAAACA TAAAAATCAT AGGGATAAAC AATCCATGAT TCTTAATATG ATCTGCTTGT 7380

CCTAATATCT CTGTGATGTT ATTCGTTCCT GTCATGATAT ACAGCATAAT AAAACCAACT 7440 

AATAACGCCA ATCCACCAAA TACTGTAATC ATAAATGATT GAATCGCACC AAATTGACTG 7500 

TCACCATTGT TATACCAATA TGAnATCTAT AAAAATGATG ATtnCACTCGT TAATTCCCAA 7560 

AAaATGTACA TCmATATCGT ATTGTCTGAT AATACaaTAC CAATCATACT GAACATAAAT 7620 

AACGTTAAAT AAAAATAAAA CCTTGGTAAA TTGTCTTTTC GAGAGGATAA ATATTGAGTT 7680 

GCATAGAAGA ATACTGCAAT TCCAATAAGT GAAATAATAA GAGAAAACAT TAAACTTAAA 7740

CCATCTAAAC GTAAATCTAA ATTAATATCT AATGTCTTAA TCCATGGAAT AGAGGTAGAA 7800 

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10 




GGTGCAACCA ACGCTATGTA CCCGGCATAT TTAGCCAATG CTCTACGTTT AGACATTAGA 7920 

AGTATCATCG CCATAATCAC AAGTATAGCA ATTAATAAAT AAACCAAACT CATTATTAGC 7980 

CTCCTTTGTT TCTATAATTG TAATGAAATA TAAATACTAT GTTCACACTC ATTTTCTAAA 8040 

CCGATAAAAT TTAGTGTTTC AATAGCAGAT TGATGCCCTA AATACTTTTG AATGACTGGT 8100 

ATAAGTATAC CTTTTTGATA AGCATGATAT GCAAATGTCT TACGCAATGT CGTTAGTCCT 8160 

ACATTATCTA TACCAGCTTC AATTGATGCT TGGTGAATTA TTCGATATGC TTGCTGTCTA 8220 

GATAATACTT GATTTGTTCG TAGTGATTGA AAAAGAACGT CTTCATTCGA AAGACTCCTG 8280 

TCCTCTATAT ATTGAAGTAG TTCTTTCGAT AATGTTTCTG GTAACCTAAT TTTAATCAA 8339 
(2) INFORMATION FOR SEQ ID NO: 176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 176:

CCCGATTTTT TTACGTAATC TAATACATAC GGCAAAATCA ACTTTAATCA AAAAAGACTC 60 

ATACACAATG CCTTTAAAGC ACATGTATGA GTCCTTTTTA GTAGTTTATA TCAAAAAATA 120 
GTTTAATGTA TAAATTAGTT TTTGTTTACA GATGCGTCGT AGATTGATTC TACAGCATCA 180

CCTAAAGCTT TATCGAATTC TTCTTTAGAT TGATCAGCTC TTAAATCACT AGCTAATGCA 240 

CGTGAGAAAC TTGCGATAAG TTCAGCGTTA TCTTTAAGTA ATTCATTTGC TTTTTCTCTG 300 

CTGTAACCAC CTGATAATAC AACGACACGA ACAACATTAG GATGTTCAGC TAACTCTTTG 360 

TATAAGTTTG GTTCAGTAGG AATTGTTAAT TTCAACATTA CTAATTGATC AGCATTTAAG 420 

CTATCTAAAC CTTTTTTAAG TTCAGCTTTT AATACTTTTT CAATTTCAGC TTTGTCTTTT 480

GCATTAATAT TAACTTCTGG TTCGATAATT GGAACTAAAC CTTTAGCAAT AATTTGTTTA 540 

GCAACTTCAA ATTGTTGTTC AACAACGTCT TTGATACCTT GCTCATTT 588 
(2) INFORMATION FOR SEQ ID NO: 177:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2841 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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ATAGAGTnCT GGllACTTACT ATGACATATG GCGCTAGAAT 


GGCTGAGCCA GGTGAATTTA 60

CAAAACGTGC CTTTTTAAAT GGTCGTATTG ATTTATCTCA AGCTGAAGCA GTTATGGACT 120

TTATTCGCTC GAAGACAGAT AGAGCTTCTA AAGTTGCGAT GAATCAAATT GAAGGTCGTC 180

TAAGTGACTT AATCAAAAAA CAACGTCAAT CTATATTAGA GATACTCGCT CAAGTGGAAG 240

TGAATATTGA TTATCCTGAA TACGATGATG TTGAAGATGC GACTACTGAA TTTCTTTTAG 300

AGCAGTCTAA AGAAATCAAA CAGGAAATTA ATCGTTTATT AGATACCGGT GCGCAGGGTA 360

AAATTATGCG TGAAGGTTTA TCTACAGTTA TTGTTGGTAA ACCAAACGTA GGTAAATCAT 420

CGA7GTTAAA TAATTTAATA CAAGATAATA AAGCGATTGT AACTGAGGTA GCAGGTACTA 480

CTAGAGATGT CTTAGAAGAG TACGTCAATG TTCGTGGCGT GCCATTAAGA TTAGTTGATA 540

CTGCTGGTAT ACGTGAGACA GAAGATATAG TAGAGAAGAT TGGTGTTGAA CGCTCTAGAA 600

AGGCTCTTAG CCAAGCAGAC TTAATTTTAT TTGTATTAAA CAATAACGAA GCATTGACwC 660

AAGAAGATTA CACATTATAT GAAGTGGTTA AAAATGAAGA TGTAATCGTA ATTGTTAATA 720

AAATGGATTT AGAGCAAAAC ATAGATATTA ATGAAGTTAA AGATATGATA GGTGATACGC 780

CATTAATTCA AACTTCAATG TTAAAACAAG AAGGTATTGA TGaATTAGAA ATACAAATTC 840

gAGATTTGTT CTTTGGTGGa GAAGTACAAA ATCAAGATAT GACTTATGTT TCTAATTCAA 900

GACATATTTC ATTATTAAAA CAAGCAAGAC AAACGATACA AGATGCGATT GATGCAGCAG 960

AATCTGGTGT GCCTATGGaT ATGGTACAAA TTGATTTAAC TAGAACTTGG GAAATATTAG 1020

GAGAAATTAT TGGTGAGACT GCAAgTGATG AACTCATCGA TCAGTTATTC AGTCAATTCT 1080

GCTTAGGTAA ATAGTAATTG AAATAGACGG AATACCGTCT TAAGAAGGCT AGTAAGATAT 1140

CAAATAAGGA GGTTTATATT GTGGTTCAAG AATATGATGT AATCGTTATA GGTGCGGGAC 1200

ATGCAGGTGT AGAAGCAGGT TTAGCATCTG CAAGACGTGG TGCTAAAACA TTAATGCTAA 1260

CAATAAATTT AGATAATATT GCATTTATGC CATGTAACCC ATCTGTAGGT GGACCAGCTA 1320

AAGGTATCGT TGTTCGTGAA ATTGATGCTT TAGGTGGACA AATGGCAAAA ACAATCGATA 1380

AAACACACAT TCAAATGAGA ATGTTAAATA CAGGTAAAGG ACCTGCTGTA AGAGCACTAA 1440

GAGCGCAAgc AGaTAAAGTA CTTTATCAAC AAGAAATGAA ACGCGTGATT GAAGATGAAG 1500

AAAATTTGCA TATAATGCAA GGTATGGTAG ACGAACTTAT TATAGAAGAT AATGAAGTTA 1560

AAGGTGTACG TACAAATATT GGTACAGAGT ATTTATCTAA AGCAGTAATT ATTACAACGG 1620

GAACATTTTT ACGTGGTGAA ATCATTTTAG GTAATATGAA GTATTCAAGT GGACCAAATC 1680

ACCAATTACC ATCAATCACA TTATCAGACA ATTTAAGAGA ACTTGGTTTT GATATTGTTC 1740
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20 



AAATACAACC AGGTGACGAT GTAGGTCGTG CATTCAGCTT TGAAACAACA GAATATATAT 1860
TAGATCAATT GCCATGTTGG CTAACGTATA CTAATGCTGA AACACACAAA GTTATCGATG 1920
ATAATTTACA TCTATCTGCA ATGTATTCAG GGATGATTAA AGGAACCGGG CCACGTTATT 1980
GCCCTTCAAT TGAAGATAAA TTTGTTCGAT TTAATGATAA GCCGCGACAT CAACTTTTCT 2040
TAGAGCCTGA AGGTCGTAAT ACAAATGAAG TATATGTGCA AGGATTGTCT ACAAGTCTTC 2100
CTGAACATGT GCAcGTCAAA TGTTAGAGAC GATACCAGGT CTTGAAAAAG CAGATATGAT 2160
GCGTGCCGGC TACGCAATTG AATATGATGC GATTCTGCCA ACGCAGTTAT GGCCTACACT 2220
TGAAACGAAA ATGATTAAAA ACTTATATAC TGCAGGTCAA ATTAATGGTA CATCTGGTTA 2280
TGAAGAAGCA GCAGGACAAG GATTGATGGC AGGTATTAAC GCTGCAGGTA AAGTGTTAAA 2340
CACAGGCGAA AAGATATTAA GTCGTTCAGA TGCATATATT GGTGTCTTAA TCGATGATCT 2400
TGTAACTAAA GGTACTAATG AACCTTATCG TTTACTAACA TCACGTGCAG AATATCGTTT 2460
GTTAcTACGT CATGATAATG CTGATTTGAG ATTGACGGAT ATGGGATATG AACTTGGTAT 2520
GATTTCTGAA GAAAGATATG CACGTTTTAA TGAAAAACGT CAGCAAATTG ATGCGGAAAT 2580
TAAGCGTTTA TCAGATATTC GTATTAAACC AAACGAACAT ACGCAAGCGA TTATTGAACA 2640
ACATGGTGGT TCTCGCTTAA AAGATGGTAT TTTAGCTATC GATTTATTAC GCAGACCTGA 2700
AATGACTTAC GATATAATTT TAGAACTTTT AGAAGAAGAA CATCAATTGA ATGCAGATGT 2760
TGAAGAACAA GTAGAAATAC AAACAAAATA TGAAGGTTAT ATCAATAAAT CACTACAACA 2820
AGTTGAGAAA GTTAAGCGTA T 2841



(2) INFORMATION FOR SEQ ID NO: 178: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3025 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 178: 
ATCTAATTTC AAACCCGGTG ATAAATTGCC AAGCGTGACG CAATTAAAAG AACGTTATCA 60 
AGTAAGTAAG AGTACTATCA TTAAAGCATT AGGCTTATTG GAACAAGATG GTTTGATCTA 120 
TCAAGCACAA GGCAGTGGTA TTTATGTGAG AAATATTGCT GATGCCAATC GTATCAACGT 180 
CTTTAAGACT AATGGTTTCT CTAAAAGTTT AGGTGAACAC CGAATGACAA GTAAGGTACT 240 

TGTTTTTAAG GAGATTGCAA CGCCACCTAA ATCTGTACAA GATGAGCTCC AATTAAATGC 300
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CGAATATTCT TATTATCATA AAGAAATCGT GAAATATTTA AATGATGATA TTGCTAAGGG 420 

CTCTATCTTC GACTATTTAG AATCAAACAT GAAACTTCGT ATTGGTTTTT CAGATATTTT 480
CTTTAATGTA GATCAACTCA CTTCAAGTGA AGCTTCATTA CTACAATTGT CTACAGGTGA 540
ACCATGTTTA CGTTACCACC AGACTTTTTA TACAATGACT GGCAAACCCT TTGATTCATC 600
TGACATCGTA TTTCATTATC GTCATGCACA GTTTTATATT CCTAGTAAAA AGTAATAAAT 660
ACATAAAAAC GTCTATATCC CAGTTATAAA CTGGAGTATA GACGTTTTTT TACGATAATA 720
ACAATGGCTC AAATTGCTAT TATCTTGCTT AGGTTTTTCG TTTTAGAAGA ATATTGCTAC 780
AAAGACAGGC ACAACTGCTA CAACAACTAC ACCAACTAAC ACTAAAGCTA TACTTGCCAT 840
TGATTCTTCT ACAGGTCCTA ATTCTTTGGC TGGTGCTACA CCTAATGTGT GACCACTTGT 900
TCCAAGTGCT AATCCTCGGG CAATAGGGTT AGTAATTCGG AAAAGCTTTA AGAATTTATT 960
ACCTAGGGCA TAAATAATGA CACCATTTAA AATAACTGCT AATGATGTTA ATTCTTTTAT 1020
ACCACCGATA CCAGCTGATA CTGGTAACGC AATCGCTGTA GTTGCTGCTT GAGGTAACAT 1080
TGATAAAATA ACATCATTGG CAAATTGTGC TAACTTCGCA AAAGTTAAAA TAATTAATAA 1140
CGCTACAACT GTACCGATAC CAATACCTCC GATGATACGA TGCCAATGTT TAACAAGCAC 1200
TTCACGCTTT TTATATAACG GAATCGCAAA ACAGATTGTT GCCGGTTCTA AGAAGAAGTA 1260
AATAATGTCT CCACCTATTT TGTAAGTCTT ATACGGAATG CCTGTTAAAT AGAGGAAGGC 1320
CACACCAAAT ACCATACTGA CAAATAGCGG TGCGAATAAG AAGAAACGAT TAGTTTTTTC 1380
AAATAATATG GTCGCTAAGA AAAATGGTAT AACGGATAAC AGTATTCCGA AGTAAGGTGT 1440
GTTTaGTGCT AAGTGGTTAA TCaTGAGCTT GTGCCTCCTC TATTTTGATC TTTTTTGTGA 1500
CTTTGTCACC TTTAGATCTC GAAGTAACTT TCATAATAAT TTgTGTGACA TAGCCAGTAC 1560
AAAT&AGTAA TAGTATTGTT GAGACGATTA TTAGTCCAAT GATTAAAAAT GGTGCTTGGC 1620
TAATGACACC TAAAGAGTTA ACAACTGAGA TACCGGCTGG TACGAAGAGT AAGCCAATGT 1680
TATTTGTTAG TGTCGTTCCT ACTTTTTCGA CTTCGCCTAA CTTAACAGCA CCAGTACATA 1740
ATAATACAAA TAATAATACT AAACCGATTA CTGATGCAGG CATAGGAATT GGCATAAATG 1800
ATTCAATTAT TTTCGATACA AAGAGTACTA AAGCAATTAC AATGACTTGG TGAAAAAAGT 1860
GTGCTGGTTT TGATGCGTCT TTTTGTTGTT TCACGACCAT TGCCTCCTAC GTTTGATTTA 1920
ACTAAAGTAT AGATGGCTCA CTTCGATTTG CGTGATTTTT AGTCCGAAAT ACAAAATATC 1980
ATAGGTAAAA TGCATAAAAA AAAGGATTAC TGTTAAAGTA ATCCTATCGA CGCTTTAAAA 2040
TCTTTCATAA ATGAACGTCC AACTTGCATC TTGACACCAT TTGTCAATAT TACCATATAA 2100
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TGAATACGTA TAAAATAAGT GGGATTCAAT CGTTTTTCAT AACGATTCAA TGGCTCTGTT 2220
GTTTCGTATT TATGATTCGT TGTATGTATG GTTGTAATAC CATTATGTGT GCCAATCCCA 2280
ATAATATTTT GTTQCTTTAA CATGTGAATT TTATCGTCAA TTTCAACAGG TAAGCTTTGA 2340
TCAAAATTCG CCGACATATC ATTCGCAATT GCACTTGCGT TATTATCATC TTTGGCTTTA 2400
GTCGCACGCA CTTTATTGAC TGCTTGTTCA ATACGTTTTT GACCAAACGG TTTCAAAATA 2460
TAGTCTGTCG CATTTAATTC AAATGCCTGT ACTGCGTATT GGTCATGTGC AGTTGCAAAA 2520
ATAATCGCAG GTGGCTCTTT CATCTTTTGA ATCTTAGCTC CTAATTCGAT CCCATTTTCA 2580
TCCATTAAAT TGACATCTAA AAATATAATG TCATATTGAT TGATCAGTAG TGCTTCCAAT 2640
GTTTCTTTTA CATTTTCTGC CTCATTAATT TCTTCAAAAC CACCAATTTC ATTTAATAAA 2700
TATGTTAATT CATTACGTGC TAATGGCTCA TCATCTATGA TTAATGCTTT CATATTTATT 2760
CCTCCTCTTG TCTTTCATAA GGAAGTACAC ACCAAAAAGT GGTACCGCTC GATGTCGATT 2820
CAAATTGTAA TGCTGCGGAT TTTCCAAATA ATCCTTTTAG GCGTAAGTTT AAATTTTCTA 2880
AAGCACTACC AGTTCCAGAC TCTGATTCTA CAGATGTnTC TCCCaACAAA TGCATTTTAT 2940
CTTTAGAAAT ACCCTGACCA TTATCTTGTA CAATAATACG TACATGTGTT GCAGTTTCTT 3000
TAATCACTGA CACGTCAATA TCGTT 3025

(2) INFORMATION FOR SEQ ID NO: 179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 179:
ACAGAATTTC ACAGCATTTT TAGATGAAAA AATAAGCCAG TCATAGCGTT GATTTAACAA 60
ATGAATATCA AAATTTAGTG GCTTTATATC AATAAAGGGT TTGTGAATAA TTGATACTAA 120
ATCACTTTGC ATGTCATTTG TTTGTGTCAT AACTACAACT GGCTTCATAT TTAAACGTCA 180
CTCCATTATT TAATGTTGTT CATTTAAGCG TTTTATAATT TCATAAGCAC CTTGCTCTTT 240
TAATTTGTTA CTCACTGTTT TGCCTAACTC AACCGGATCT GTTCCGTTCA TTGTATATTC 300
AAATCGTTCT TTACCATCTG GGGTCATAAT TAAACCTGTA AATTCGATTT CGTTTTGATC 360
TGAGATTGTA GCATATCCTG CAATTGGCAC CTGACAACTA CCATCCATTT CTGCTAAAAA 420
CGTTCGTTCA GCAGTCACAC ATTTTGCAAC CTCATCATTA TGTACTTTGC TTAATAATGT 480
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TAAfAATVTTA 


^V^T^T'AHPOft ft 


unl/UVJAlO X 


Tri C AATA TPA 


TCTGACCAGC 

1V>1 WAV WrtVJ V. 


CCATTCTTCT 


600 




ta a a cr'A t^rr 


ftPftfifTft 71 Zl ft 




ATPAA'I'i "Vl'CA 




TTTCTAAT CG 


€60 


5 


HV-iT' ft T*r ™i"ft HP & 


R PV TV ^f'ft ft 


1 UU/il 1 1 ne\l 


"T'ft ft A'T"/ ! A 


CSfSATACTTAG 


ATAATATTTG 


720 




TYsr'A ffft rrift 




TAfJTACC'AAT 






GGGATAGTGG 


780 


10 




TTftflftft ftTftT 


ArYSCATPAAA 


TV PI/ 1 ' IV JA J F U F'r — | • 


TTATCAGGGA 

LlAi W4VJV7V3A 


TACAACCTAA 


840 






X 1 VJV7 1 


fiTr "rrriAfir 

UiLl 1 A/V\VJV» 


AtTPtTTA'] "I'G 


CCATATCGAT 


900 




ATTTTTTTCA 


ft 71 71 71 rTTP ft T 


HPTVT! HP ft HPHPHP f 
vj X X O XXL 


ftf A A A^P 
X X InALnAnl 


AAGCCTTTGC 


CTCCGACTTT 


960 


15 


AGACAATTG T 


TTATCTACTA 


iALuAl UlsLL. 


X XX LvjX\jcH-«H 


ATTTCTTTAA 


TTTCAATTTC 


1020 




TAGATTTGGC 




^^Tv TV HPHPHPft HPO 


ft ftT A A ATT rip 


TGGCTTTGTG 


TTAAAGCTAA 


1080 




TTTACyTCTT 


crnr* tv tv tv 




& P*PAO 21 'IV SHPTV^ 
/ILvwW V? 11V. 


AATTCCTCCT 


AGGAACGGAT 


1140 


20 


TGCTCTAGAT 


TATTTTCTCA 


ATxLALAAAA 


TGTGTTG CAA 


AAAATAAATT AATCATATTT 


i*uu 




« » r^j^y\ tv n T*ff»TV 

AAGCAAAATA 


IV TV Tl *T*TV TV T^'W 

AAATAATGTT 


ATAGTATATT 


»A TV TVT'TA'TTT'TY^ 

AAA1 AX L X 1\j 


AATTCAACCA 


TTTGTTGATT 


T 5C0 




f »»T»7\ TV /'•'!< T\ TV TV TV 

CTAAGTAAAA 


TATAACTTCC 


ft hp ft T7i ft t ft rr 
AX Al AAXAL X 


r*Tft ft TftftTTf2 


AAGAGAGTAT TACCTTCGGG 


1**20 

X J jt u 


do 


Tl/^TV TV IT 1 /"' TV TV T»7\ 

TCAAXGAATA 


TACGTTCACC 


AACTGAAATT 


ta r*ft rvvvft rr* 


GTGTACCTAA 


AATAATACTA 


1380 




7\ 7» rri7\ T , /""» TV O TV TV 

AATATGAGAA 


TTATCCAL.L.L 


ACTxAAHjX X 


r*ftr"Tft ft ft ftr*ft 


CAATTGATTC 


AAGTGTAGCA 


144 0 




ACGCTACCAA TTCTAAAGTA TTTTTGATCA AAACGTTTTT CCTTCAAATT ACGGTATTGC 1500
ATGATATACA GTAATGCATT GACAAAAGCT AAGGCAAAGA AGACATAACT TAACACAGCT 1560
AGACCGATAT GGACTAACAG TAACTCGTCT ACAACAGCAA TTTTCTGAAC CTTATTAGTA 1620
TAATGTGTCG GTTGAAATGT ATTCATCCCT AAnAGTGTTA ACCCTATTAA ATTCCAAGGA 1680
AAAACACAG 1689



(2) INFORMATION FOR SEQ ID NO: 180:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 180: 

nTGGnTGGCT TTTCCTATTG GACCAAATGG ACCnTTTACC TGGCCnTTCC CAGGACACCC 60 

CGCTTGTGCC CACATTCCAA TCGGAAAAGG TGTATGTGGT ACAGCCGTTT CAGAACGTCG 120 

TACACAAATT GTAGCTGATG TTCATCAATT CGAAGGACAT ATCGCTTGTG ATGCTAATAG 180 
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CGATGCCCCT ATAACGGATC GATTTGATGA CAATGACAAa GAaCATCTTG AaGCAATTGT 300 

TAAAATTATT GAAAaGCAAC TCGCATAAAA GGACATCAGC ACTTTCAATA AAGTGTTGAC 360 

AGTTAGCAGG AAAATGTTAC AATAATCTTT GTGTGAATTA ACGAAAGTAG CAGTTGTATA 420 

TTATTGAGCG CTATGTTGTT CCCAATGCGG ACGTGTCACG TAACTGTCGC TATAAGGTGA 480 

AGACACATAA AACAATATAT CTTAGTAAGC ATGCAACACT CTTTTTTGTT TATTCATAAC 540

AACAAAAAAG AATTAAAGGA GGAGTCTTAT TATGGCTCGA TTCAGAGGTT CAAACTGGAA 600 

AAAATCTCGT CGTTTAGGTA TCTCTTTAAG CGGTACTGGT AAAGAATTAG AAAAACGTCC 660 

TTACGCACCA GGACAACATG GTCCAAACCA ACGTAAAAAA TTATCAGAAT ATGGTTTACA 720 

ATTACGTGAA AAACAAAAAT TACGTTACTT ATATGGAATG ACTGAAAGAC AATTCCGTAA 780 

CACATTTGAC ATCGCTGGTA AAAAATTCGG TGTACACGGT GAAAACTTCA TGATCTTATT 840 

AGCAAGTCGT TTAGACGCTG TTGTTTATTC ATTAGGTTTA GCTCGTACTC GTCGTCAAGC 900 

ACGTCAATTA GTTAACCACG GTCATATCTT AGTAGATGGT AAACGTGTTG ATATTCCATC 960 

TTATTCTGTT AAACCTGGTC AAACAATTTC AGTTCGTGAA AAATCTCAAA AATTAAACAT 1020 

CATCGTTGAA TCAGTTGAAA TCAACAATTT CGTACCTGAG TACTTAAACT TTGATGCTGA 1080

CAGCTTAACT GGTACTTTCG TACGTTTACC AGAACGTAGC GAATTACCTG CTGAAATTAA 1140

CGAACAATTA ATCCGTTGAG TACTACTCAA GATAATACGG TCAATACCAA CACCCACAAT 1200

TGTGGGTGT 1209
(2) INFORMATION FOR SEQ ID NO: 181: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 698 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 181: 

AAATCCCTTt GTtaAAgTsC AAAtTTTTCc AACrgCTTTA AtArGACCCA TATTACCtTC 60 

TTGGATTAAA tCmAGGaATG AcATACCACG ACCaCGTATC TTTTAGCAAT ACTTACAACT 120 

AAACGTAAGT TCGCTTCTGC AAGTCTTGAT TTTGCTACTT CATCACCTTG TTCAATACGT 180 

TTGGCTAATT CGATTTCTTC TTGTGCACTT AATAAGTTAA CACGCCCAAT TTCTTTAAGG 240 

TACATACGAA CTGGGTCATT TATTTTAACA CCTGGAGGGG CACTAAGATC ACTTGGATTC 300 

AGTTTCTCGT CAGTATCTGA ACTATCTTTT TCATTAACTA GTGAAATATC ATTATCATTT 360 
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GCAATTTCTT CATGACTTAA ATGACCCTCT TTTTTACCTT TTTCAATTAA TTGCTTCTTA 480

ACATCTTCTA ATGTTAATGT CGGATCAATT GTTTGTTTTT TAATTTTAAC TGTGTTATCA 540 


GACATGAAAC GGCCTCCCGA TTTTAAATAT GAACATTCGA AATTTATTCA ATATTGCTAT 600 

TTTAAACGAA ATTCTTAATT AATTCCATCC ATATTTTnAA TTTTATTTTA CAAATTGGGA 660

ACTAAATCCC CAATATTTAT TTTTCAATAG TGGTGGTT 698

(2) INFORMATION FOR SEQ ID NO: 182:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5147 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 182: 
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ACTTGATGAT GTATACAATG TATTTCAAGA ATATTATLAA AAAACATCTA ACATTAAGTT 60
TTGTAGAATT CACAATTCTA GCTATTATCA CTTCTCAAAA TAAAAACATC GTTCTTCTTA 120
AAGATTTAAT TGAAACAATC CACCATAAAT ACCCTCAAAC TGTTAGAGCT CTCAATAATT 180
TAAAAAAGCA AGGCTATCTA ATAAAAGAAC GCTCAACTGA AGATGAAAGA AAAATTTTAA 240
TTCATATGGA TGACGCGCAG CAAGACCATG CTGAACAATT ATTAGCTCAA GTGAATCAAT 300
TATTAGCAGA TAAAGATCAT TTACATCTTG rrrn'GAATA ATATCTCTAT TACGCAAGTG 360
TGCTGTATTC TAAAGTGCAC TTGTGTTTTC TATTITTTAA TAAAACCTCA GCACATAATG 420
AACAACTTTC TATTTTCTAT ATCACTTAAA ACCATTTCCG AAATTAAACC TCAGCACATT 480
CAAAGCCCCA CTTTATTCTT AAAAATATTT TTTAACTCAT ATGTATTAAA CCGCTTTCAT 540
TATAAAAAAT ATCTCTATAT TtTATCTGtT TtTATTAATC GAAATAGCGT GATTTTGCGG 600
TTTTAAGCCT TTTACTTCCT GAATAAATCT TTCAGCAAAA TATTTATTTT ATAAGTTGTA 660
AAACTTACCT TTAAATTTAA TTATAAATAT AGATTTTAGT ATTGCAATAC ATAATTCGTT 720
ATATTATGAT GACTTTACAA ATACATACAG GGGGTATTAA TkTGAAAAAG AAAAACATtT 780
ATTCAATTCG TAAACTAGGT GTAGGTATtG CATCTGTAAC TTTAGGTACA TTACTTATAT 840
CTGGTGGCGT AACACCTGCT GCAAAtgctG CGCAACACGA TGAAGCTCAA CAAAATGCTT 900
TTTATCAAGT CTTAAATATG CCTAACTTAA ATGCTGATCA ACGCAATGGT TTTATCCAAA 960
GCCTTAAAGA TGATCCAAGC CAAAGTGCTA ACGTTTTAGG TGAAGCTCAA AAACTTAATG 1020
ACTCTCAAGC TCCAAAAGCT GATGCGCAAC AAAATAACTT CAACAAAGAT CAACAAAGCG 1080
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AAAGTCTTAA AGACGACCCA AGCCAAAGCA CTAACGTTTT AGGTGAAGCT AAAAAATTAA 1200 

ACGAATCTCA AGCACCGAAA GCTGATAACA ATTTCAACAA AGAACAACAA AATGCTTTCT 1260
ATGAAATCTT GAATATGCCT AACTTAAACG AAGAACAACG CAATGGTTTC ATCCAAAGCT 1320
TAAAAGATGA CCCAAGCCAA AGTGCTAACC TATTGTCAGA AGCTAAAAAG TTAAATGAAT 1380
CTCAAGCACC GAAAGCGGAT AACAAATTCA ACAAAGAACA ACAAAATGCT TTCTATGAAA 1440
TCTTACATTT ACCTAACTTA AACGAAGAAC AACGCAATGG TTTCATCCAA AGCCTAAAAG 1500
ATGACCCAAG CCAAAGCGCT AACCTTTTAG CAGAAGCTAA AAAGCTAAAT GATGCTCAAG 1560
CACCAAAAGC TGACAACAAA TTCAACAAAG AACAACAAAA TGCTTTCTAT GAAATTTTAC 1620
ATTTACCTAA CTTAACTGAA GAACAACGTA ACGGCTTCAT CCAAAGCCTT AAAGACGATC 1680
CTTCAGTGAG CAAAGAAATT TTAGCAGAAG CTAAAAAGCT AAACGATGCT CAAGCACCAA 1740
AAGAGGAAGA CAATAACAAG CCTGGCAAAG AAGACAATAA CAAGCCTGGC AAAGAAGACA 1800
ACAACAAGCC TGGTAAAGAA GACAACAACA AGCCTGGTAA AGAAGACAAC AACAAGCCTG 1860
GCAAAGAAGA CGGCAACAAG CCTGGTAAAG AAGACAACAA AAAACCTGGT AAAGAAGATG 1920
GCAACAAGCC TGGTAAAGAA GACAACAAAA AACCTGGTAA AGAAGACGGC AACAAGCCTG 1980
GCAAAGAAGA TGGCAACAAA CCTGGTAAAG AAGATGGTAA CGGAGTACAT GTCGTTAAAC 2040
CTGGTGATAC AGTAAATGAC ATTGCAAAAG CAAACGGCAC TACTGCTGAC AAAATTGCTG 2100
CAGATAACAA ATTAGCTGAT AAAAACATGA TCAAACCTGG TCAAGAACTT GTTGTTGATA 2160
AGAAGCAACC AGCAAACCAT GCAGATGCTA ACAAAGCTCA AGCATTACCA GAAACTGGTG 2220
AAGAAAATCC ATTCATCGGT ACAACTGTAT TTGGTGGATT ATCATTAGCC TTAGGTGCAG 2280
CGTTATTAGC TGGACGTCGT CGCGAACTAT AAAAACAAAC AATACACAAC GATAGATATC 2340
ATTTTATCCA AACCAATTTT AACTTATATA CGTTGATTAA CACATTCTTA TTTGAAATGA 2400
TAAGAATCAT CTAAATGCAC GAGCAACATC TTTTGTTGCT CAGTGCATTT TTTATTTTAC 2460
TTACTTTTCT AAACAACTTC TGAAACGCCT CAACACTTTC TACTCTGATT ACATATATGA 2520
CATTTTTAGG CATTAAAAAA TCGAACTAGA CAAGATGCTC ATTGCATTTC GTACTAGTTC 2580
GATTCATGAA TAATTAGATT TAAAATGTCA TTTGAATCCA AGTGACAACA TTATTTATAT 2640
TTAGAATATT AACGTTAGTA TAAACGTCCA AACACAAATA AAAGCAACAA ATATAATACT 2700
GTATTTTAAC GTCATTTTTA ATAATGCAGA TTCTTCACCA ACTTTTTTAA CAGCTGCAGT 2760
CGCAATGGCA ATTGATTGTG GTGAAATAAG TTTCGCTGCT ACACCACCTG CAGTGTTAGC 2820
TGCCACAAGT AATGAACCGC TTGTTGAAAT TTGTTGTGCC ACTGTCGCTT GAATAGGTGC 2880
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TGGAGAGAAT AATGGGAAAA TTGCTCCCGC TTTAGCAATA CCTTGTCCAA TTGCTACAGT 3000
CAAACCACCG TATGTCATAA CTTTAGCAAT AGCTAGGATA GCTGAAATTG TAAGGATCGG 3060
TAACCATAAT TCTTTAATTG CTTCGACCAA TAAAGCACCT GCACTTTTCC ATTTTAACTT 3120
CGTAATTAAA ATTGTAATAA TTACTGTTAA TAAAATCGCT GTCCCAGTTG CACCAATTAA 3180
ATCGAGACGC AACGCAATTC CT7TAGGCGA TAAATCACTC ACAGTATTTG GAATTGGCAA 3240
TTTTATTACT AAACTTTCAA GTGCACCTCC AGGTTGGAAT AATTTTTTGA AGAATGGTGC 3300
ACTCCATACT AATACAAAGG CAGTTAAAAT TACGAACGGA CTCCAAGCAA AGACAATTTC 3360
TTTAGGCGTT CGTTTTTGAA TTTTATGTTC AGACGCTTCC AATCTGAAAA TGTTTTTCGG 3420
TTTAAATTTA CGACAAACAA ATGCTAACAC CACCATTGTT GCTAGTGATG GAATAATGTC 3480
TGCTAGTTCT GGACCATGGA ATATTGTTAA TAATAATTGT AATCCAGTAT ATGTACCACT 3540
CACTGTTAAA ATGACAGGTA AAATTTCTTT AATACCTTTC ATACCATCTA CAATGAATAC 3600
TAAAACAAAT GGAATAATAA AGTTTAAAAT TGGAAGTGTT AATGCTGAGT ATCTCGCAAC 3660
ATCTAATGTT GTAACGCCTC CACTTAAGTT AAACGTATCA ATAATACTAA CTGGTAAACC 3720
AATTGCACCA AAGGCACCCG CCGCACCATT AGCAATTAAA CATAACATCG CTGCTTTTAA 3780
TGGTTCAAAT CCAAGTTGAA TTAATAATAC TGCACAAATC GCAATTGGCA CACCAAATCC 3840
TGCTGCACCT TCTAAAAATG CGTTGAAACA AAATCCAATT AATAATAGTT GGATTCTTTG 3900
GTCCACTGAA ATACTTGCAA TACTATCTTG AATAATAGAA AATTGTCCTG TTTTAATAGA 3960
AACTTTATAT AACCAAACTG CCATTAAAAC GATATATCCT ATTGGGAAAA TACCGGCAAC 4020
AACGCCTTCT GTAATCGCAC CTGCTGATAC ACGCGCTGGT AATTCAAATA CAAATAAAGC 4080
CACAATCAAT GTAACAACCA AAGTTGTCAA TGCTGCATAA ATGCCTTTCA TTTTAAAAAC 4140
GGTTfiAGCAT AATAAAAATA AAATAATAGG TACTGCTGCA ACTAAGGCTG ATAATCCGAC 4200
ATTATCGAAT GGATTTACAG TAAGTAGTGT CATAATGACT CCCTCTCTTT ATATAAAATA 4260
TTTATCATTC TGATTAATCT ACAACCTATT TCAACTTATA TTTTGCGATG ATCACATATT 4320
TAAAATGTAA CACTCCTATA TGTGACAGGC AATCGAATTT TTACAAAAAG TTCACAAAAT 4380
ATACACAATA TTTAACTATA ATAmATAATA TATCaTntTA ATTATAAATA CTAGATATTA 4440
TTTATAATAA TCTCAGGAAT TCGCTTCAAA ACTGCATCAT GAGAGTTTAT ATTTTTATTG 4500
AGAATCTCTC ATTTTATGAA TTGTAGGAAG TAAACAAAAT ATGACAAGCG TCAAACCAAT 4560
GATAATGATA AATATCATAT TAAACCATAG TAAATTGAAT TGATGATGGT GTTGTATTTG 4620
CCAAATTTCT AATACTGTGA AGATAGACAT ATAGCTCATA ATCTCTAAAT TTAACGTACT 4680
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AAATCGTTCA TAGTATCTAC CTGCAATGAA AAATATAAGC CAAATCACTA TAAATGCGCT 4800
ATTAATCAAA AGCAGCACCC ATTTATCAGC AAAATTATCA GCATCCCCTG CTAAATTATA 4860
ATGAATAGGC ACTTTGGTTG GTAATTTTGG ATAGGTCACT ACTGTATAGC ACATCATAGC 4920
TAAGTAAATA AGTAGACTTA ATATTGTAAA AGACCTGATT TTAGACATTC TATCGCCTcT 4980
TcTTTACATT TTATGTATAA CACTCTGCCT ATTTTACCTT TTAATaCATT ACCCCAACGA 5040
TtAAaCAATA tGTAaTGATA CTATAATTGC GTCAGGAGTA TCCGCTTGTT AAATGTGCAT 5100
AGCTTATATT TAGCTGTTTA ACATGCCACA TAATGATTCG AATTATT 5147

(2) INFORMATION FOR SEQ ID NO: 183:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1312 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 183:
CACTTACTTC CACCATTATC ATAACTTTAA AATGGATATA nTTCATCAAA CATTATCTAA 60
AGGCGTCGCA CCTACACCAA CACCATCCAA CAATTAACTT ACAACTCTGC GATTACTTCT 120
TCAGCAGCAA CTTTCACnTG CGTAATACAA TCAGGTAGTC CAACCGCTTC AAAAGATGCA 180
CCAGTTACTC TAAGTCGTGG ATATGTTTGT TTAATATGTG CTTGAATCTG TCTAATTTGT 240
TGAATATGAC CGACATGGTA CTGTGGCATA CTTTTCGGCA AACGATTGAC AATTGTAAAT 300
TCAGGATCAC CTTTAAATGT CATCATTTGA CTTAAATCTC TACGTACAAT CGATACTAAT 360
TCATTATCTG TATGATCATC AACCACAGTA TCACCTGGTT TACCTACATA CGCACGAATC 420
AAASCCTTAC CTTCCGGTGT AGTAAATGGC CATTTTTTCG ATGTCCAAGT ACATGCGGTA 480
ATGTCTGTAT CACTCGTTCT CGCAATTACG AAGCCAGTAC CATCATGGGT ATTTTCAATG 540
TCTTTTTCAT CAAATGCCAA TACAACAGTT GCAACAGTCG TACTATCCAT CGTTTTAAAG 600
TAATCAAATG CTGGATCTTG TCCGAACCAA TTTAAAAACA CTTGATGTGG TGTCGTTACT 660
AATACGCCAT CATACACTTC TTCTAGTTGA TCATTGTAAA CAATTTTATA TTGTTTTTGA 720
GATGTAATTA TATCATCCAC TGACGTATTG TAGCGTATTG TCACACCTTT ATTTTTAACA 780
TCTTGTTCTA ATGCTTCAAT AAATGAGCTT AAACCATGCT TAAATTGTTT GAATTGTCCT 840
TTCGGTGCGC CAGGATATAA TTGTCTTTGT TTCAGACGCT TATTTTTCTC ATCCTTCATA 900
CCTTTTATCA GACTTCCGAA TGCCTCTTCT TTTTCTTTAA AATTAGGAAA CGTACTCATC 960
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TCAAGTACCT CATTACCTAA TCTTGCTCTG AAAAATGCAC CAACAGAAAT GTCACCATCC 1080

TGCATTTGAG TAGGTTTTTT TAATAAATCA AACCCTGCTC TTAATTTACC AAGTGGCGAT 1140

ATTAATTTTG TAGTAACAAA TGGTTTAATA TCTGTTGGAA TACCCATAAT TGAACCACCT 1200 

GGAATCGGAT ATAATTTATT TTTCGCAAAA ATATATGATT GTCCAGTCGT ATTTGTAACA 1260 

ATATCTTGTT CTAATCCAAT ATCTTTCGCT AATTCTGTCA TAATCGTTTT TC 1312 
(2) INFORMATION FOR SEQ ID NO: 184: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 184:

TTTTACAATA AAAATATGAT ATACTACTTG TCGTATATAA GGAACGGAGG ACAATTTATG 60 

CATACATTTT TAATCGTATT ATTAATCATT GATTGTATTG CATTAATAAC TGTTGTACTA 120 

CTCCAAGAAG GTAAAAGCAG TGGACTTTCA GGTGCCATCA GTGGTGGTGC TGAGCAGTTA 180 

TTCGGTAAAC AAAAACAACG TGGCGTCGAT TTATTCTTAA ATAGATTAAC AATTATTTTA 240 

TCAATATTAT TTTTTGTACT TATGATTTGC ATAAGTTATC TTGGTATGTA AGGTCCGGCG 300 

ATGTAAATGT CGGGCTTTTT TATTTATAAT TAAGAATGTA ATAGTTTAAC AATAAGCTAT 360 

GTAAAATATA TAGCCTAGTT AAGTATGCAA AGGGAGCGTT AGATTTATGC AGATAAAATT 420 

ACCAAAACCT TTCTTTTTTG AGGAAGGTAA ACGTGCCGTG TTATTACTAC ATGGTTTTAC 480 

AGGCAATTCG TCTGATGTTC GTCAATTAGG TCGATTTTTA CAAAAGAAAG GTTATACATC 540 

ATATDCACCG CAATATGAAG GCCACGCGGC ACCACCAGAT GAAATACTGA AATCTAGTCC 600 

TTTCGTTTGG TTTAAAGATG CGTTAGATGG TTATGATTAT CTTGTTGAAC AAGGTTATGA 660

TGAAATTGTT GTTGCTGGTC TATCATTAGG TGGGGATTTT GCTTTAAAAT TAAGCTTAAA 720 

TAGAGATGTA AAGGGTATTG TAACGATGTG TGCTCCTATG GGTGGCAAAA CTGAAGGTGC 780 

CATTTATGAA GGCTTTTTAG AATATGCACG CAATTTTAAA AAGTATGAAG GTAAAGATCA 840 

AGAGACTATT GATAATGAAA TGGATCATTT TAAACCAACT GAAACTTTAA AAGAACTAAG 900 

TGAAGCATTA GATACGATTA AAGAGCAAGT TGATGAAGTG TTGGATCCTA TTTTAGTGAT 960 

TCAAGCAGAA AACGACAATA TGATTGATCC ACAATCCGCA AATTATATAT ATGACCATGT 1020 

AGATTCTGAT GACAAAAATA TCAAGTGGTA CAGTGAATCT GGACATGTTA TTACGATTGA 1080 
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AGAATAAAAA GAGATTTTAA CATTAGAAAG GAGGGGCATA ATGAATTTAA AGCAATCTAT 1200
AGAAGAGATT ATTAATCAAC CTGAATATGA ACCTATGTCA GTGTCAGATT TTCAAGATGC 1260
ATTAGGTTTA AGCAGTGCCG ACTCGTTTAG AGATTTAATT AAGGTGCTTG TGGAGTTAGA 1320
ACAATCAGGA TTAATCGAAC GTACAAAAAC AGACAGATAC CAAAAAAAGC ATAGTTATAG 1380
AGGTCAATCA AAATTGATAA AAGGAACGTT AAGTCAAAAT AAAAAAGGCT TTGCATTCTT 1440
AAGACCTGAA GATGAGGATA TGGAAGATAT ATTTATTCCC CCGACGAAAA TTAATCGTGC 1500
CTTGGATGGA GATACTGTTA TTGTAGAAAT CCATCAATCA AAAGGTGAAC ATAAAGGTAA 1560
AATCGAAGGG GAAGTTAAGT CGATTGAGAA GCATTCTGTA ACTCAAGTTG TTGGTACGTA 1620
TAGTGAAGCT AGACATTTTG GCTTTGTTAT TCCGGATGAT AAACGTATTA TGCAAGATAT 1680
TTTCATTCCT AAAGGTCAAA GTTTAGGCGC AGTCGATGGT CATAAGGTAC TTGTACAAAT 1740
TACTAAGTAT GCTGATGGTT CAGATAATCC AGAAGGACAT ATTTCTGCTA TTTTAGGACA 1800
TAAAAATGAT CCTGGCGTAG ATATTTTATC TATTATCTAT CAACATGGCA TAGAAATTGA 1860
ATTTCCTGAT GAAGTGTTAC AAGAAGCTGA AGCAGTACCT GATCATATTG AAAATACTGA 1920
AATTAAAGGC CGTCATGATT TACGTGATGA ATTGACAATC ACAATTGATG GTGCTGATGC 1980
TAAAGACTTA GATGACGCAA TTAGTGTTAA AAAGTTAGCG AACGGTAATA CGCAATTAAC 2040
TGTAAGTATT GCTGATGTCA GCTATTATGT AACAGAAGGT TCTGCATTGG ATAAAGAGGC 2100
ATATGATAGA GCGACAAGTG TATATCTTGT TGACCGTGTA ATTCCAATGA TTCCACATCG 2160
ATTAAGTAAT GGTATTTGTT CATTGAATCC TAATGTTGAT CGTTTAACTC TAAGCTGTCG 2220
CATGGAAATC GATGCTAGTG GTCGCGTTGT TAAACATGAA ATTTTTGATA GTGTTATACA 2280
TTCTGATTAT CGAATGACGT ATGATGCGGT AAATCAGATT ATTACTGAAA AGGATCCTAA 2340
CATTCGCGAA CAATATAATG AAATTACGCC TATGCTAGAT TTAGCACAAG ATTTATCTAA 2400
TCGTTTGATT CAAATGAGAA AACGACGTGG TGAAATCGAT TTTGATATTA GTGAAGCAAA 2460
AGTATTAGTT AACGAAGACG GTATACCAAC AGATGTTCAA TTAAGACAAC GTGGCGAGGG 2520
TGAACGTCTA ATTGAATCAT TTATGTTAAT TGCAAATGAA ACAGTTGCTG AACATTTTAG 2580
TAAGTTAGAT GTACCTTTTA TTTACCGAGT GCATGAGCAA CCTAAATCAG ATCGCTTAAG 2640
ACAATTCTTT GATTTTATTA CAAACTTTGG CATCATGATT AAGGGTACTG GCGAAGATAT 2700
TCATCCAACA ACACTTCAAA AGGTTCAAGA AGAAGTAGAA GGTCGACCTG AACAAATGGT 2760
CATTTCAACA ATGATGTTGC GTTCAATGCA ACAAGCGCAT TATGATGATG TGAACTTGGG 2820
ACATTTTGGC TTATCAGCTG AATATTATAC GCATTTTACA TCACCAATTA GACGTTATCC 2880
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AGAAGTGAAG CGTTGGGAAG ACAAATTGCC TGAGTTAGCT GAACATACTT CTAAACGTGA 3000 

ACGTCGTGCT ATTGAGGCAG AACGTGATAC TGATGaATTG AAAAAAGCAG AATATATGAT 3060
TCAACATATT GGTGATGAAT TTGAAGGTAT TGTCAGCTCA GTAGCTAACT TCGGTATGTT 3120
CATTGAATTG CCAAATACGA TAGAAGGTAT GGTTCATATT GCGAATATGA CTGATGATTA 3180
TTACCGTTTT GAAGAGCGTC AAATGGCATT AATTGGTGAG CGTCAAGCTA AAGTATTTAG 3240
AATTGGTGAC ACAGTTAAGG TTAAAGTGAC GCATGTTGAT GTAGATGAAC GATTAATTGA 3300
TTTTCAAATT GTAGGTATGC CTTTACCGAA AAATGATCGA TCACAGCGCC CAGCGCGAGG 3360
TAAGACAATT CAAGCCAAAA CGCGTGGTAA ATCATTAGAT AAATCAAAAT CTGATGATAA 3420
GGGTCGTAAG AAAAAAGGTA AGCAACGTAA AGGTAAAAAC CAACGTAATA ATGATAAATC 3480
AGGTAATAGT AAGCATAAGC CATTTTATAA AGATAAAAGT GTGAAAAAGA AAGCACGTCG 3540
TAAGAAAAAA TAAGCAGCAA TGAGGTGAGT ATGAATGGCT AAGAAGAAAT CACCAGGTAC 3600
ATTAGCGGAA AATCGTAAGG CAAGACATGA TTATAATATT GAAGATACGA TTGAAGCGGG 3660
AATTGTATTG CAAGGCACAG AAATAAAATC AATTCGCCGA GGTAGTGCTA ACCTTAAAGA 3720
TAGTTATGCG CAAGTTAAAA ACGGTGAAAT GTATTTGAAT AATATGCATA TAGCACCATA 3780
CGAAGAAGGG AATCGTTTTA ATCACGATCC TCTTCGTTCT CGAAAATTAT TATTGCACAA 3840
GCGTGAAATC ATTAAATTGG GTGATCAAAC ACGTGAGATT GGTTATTCGA TTGTGCCGTT 3900
AAAGCTTTAT TTGAAGCATG GACATTGTAA AGTATTACTT GGTGTtGCAC GAGGTAAGAA 3960
AAAATATGAT AAACGTCAAG CTTTGAAAGA AAAAGCAGTC AAACGAGATG TTGCGCGCGA 4020
TATGAAAGCC CGTTATTAAG CGATTTAGTT GCTTAATCGG GCTATATTTG ATATAGTTAT 4080
ATGTGCTTTT GTAAATTACA AAAGTATGAT TTGTTTGATT TATTATTTCG GGGACGTTCA 4140
TGGATTCGAC AGGGGTCCCC CGAGCTCATT AAGCGTGTCG GAGGGTTGTC TTCGTCATCA 4200
ACACACACAG TTTATAATAA CTGGCAAATC AAACAATAAT TTCGCAGTAG CTGCCTAATC 4260
GCACTCTGCA TCGCCTAACA GCATTTCCTA TGTGCTGTTA ACGCGATTCA ACCTTAATAG 4320
GATATGCTAA ACACTGCCGT TTGAAGTCTG TTTAGAAGAA ACTTAATCAA ACTAGCATCA 4380
TGTTGGTTGT TTATCACTTT TCATGATGCG AAACCTATCG ATAAACTACA CACGTAGAAA 4440
GATGTGTATC AGGACCTTTG GACGCGGGTT CAAATCCCGC CGTCTCCATA TTTGTAGCCT 4500
ACAGCCTTTG TGGTTGTGGG CTTTTTTATT TTGTGTTTTT CAGGGGATAA TGCATTGCAG 4560
AATTTGTTGT GAGTATTGAT ATAGCAGTGT TTGTATAGGT GTTTATTTGA TGGAGGAAAG 4620
AGTAATAAGT GATTATGAAT TAGTTTTTGA GATATAAGGG GACAGTGATG TGTGTCAAAT 4680
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TTATACGCAA AAAATTCTCC ATGTTATATA TGTCAATATA AAAATGTGAA TCGTCTACAC 4800
TTAATTGGAT AAATGGCTAC TGAAAAAGAA CTTTTCATTT TTGTTACGTC ACTAAQj 1 4860
TGTAGTTATA AAGAGATGAG CCGAGTTTTG ATATTTTCAT TAGAATCAAT ATGCCTATTA 4920
ACACAATCAG CAATAGTTGA CGAGACGGAA ATAAAAGAAG TCGTAGTTAA GAAATGCATT 4980
TCACAACATA CCATTGTAGC CATTTTTATT GTTTTGGATG ATAAACTCTT TTTGGAATTT 5040
TTAGTTTTTA TAATTTGGAA CTACACTACT TCTTTTACTA ATATTAATGT CTAAGTAATC 5100
GATAAAAAAT TTTCCATTGA ATAAATGAGA AGTTAAAAAC TTTACTTAAC CTTTCycATT 5160
GCATTTTCCT ATTCACGATT TTAAGAACCC AACATACTAC AAACGAATTT TAAAAGGCGA 5220
GAGTAAAGCT TACTTGTTTA TTATACATAT TTAAAATCCA AGAGTCAGAA CAGACTACTC 5280
CTCTTTATAA CTATAAAAAA TAGCTATGAA AAAATCTATC GTCATAGATT CCTTCATAGC 5340
TAATCTTAGT ATGTTTATTT TTATTTTAGG ATGCTATTTA TCAACTCAAC ATATAACTCA 5400
CTATTTTTAT AACCTTCTAA TATATCATTA ACTTGTCTAA TAGGTATTTC TGGTACTTCT 5460
CTAATGTTTT CCAATTTTGT TTTAAATTGT TTTTTTGTTA TTTGCTCTTT ATTTGTAGCC 5520
AATTGGAACA AGTAAGAATC TAGCATATTA ATTTCTTTAT ATGAATACAT ATATCTTAAT 5580
AACACTAAAT CTCTAGTTTT TAAGTTAGGC GCTAGTTCTT CTTGTAATTG TTCTATTGAT 5640
TGTyTCATTA ATAACAATCT CATTTCTAAT TCTTCATTAT TCATTTTATC ACACTCTTtT 5700
TATATTAATG CTTGACCAAC TTGGGAAACC CAAAACCCTA TGCTTCTTGC AGTAGAATCT 5760
TTAATACCAG TTCCCATCAA TGCTTGTGAA ACTTGACCTT GTACATTTCC CCATGTAGCC 5820
TTAATGCATT ATTCAATGCG GGATTTACAA ATTTATCCCA TCTTTTTTTT 5880
ATGATTTTCC GGCACGGGGA CTGATTTCTT TAACACCATT AAACACAGAT TTTTTATTTT 5940
TAAT£ATAGC TTTATAGTAT CATGTTGGCT AAGCTATAAA TAAGTCAGTT TCTCTAAAAA 6000


40 


TTAAATAACT 


GAATGTAAGA 


CAATCAACAA 


wCCAAATTTA 
TCTCTCGTTC 


TACTTCATCT 
TTGTTCTTTT 


AAACCACTGT 
TTGTACTCTT 


6060 
6120 




GGTCGTCATC 


TTTTTGCTTT 






CTTCAAATTC 


TTTTTCTTTC 


TTTTCTACTT 


CTTCTCT 






6157 



45 

(2) INFORMATION FOR SEQ ID NO: 185: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 884 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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AAGATGAACA AACGTCTGAA CAAACGTATC 60 

TGGCAACACT TGTTCATGAT GACGTTATTG 120 

CCATATCAAA GAAATGGGAT CAGACAACTG 180 

TAGGACTTGA ACACTTAATG GCCGTTAAAG 240 

CTATCGTTGA TGTTTGTAGA GGGGAACTTT 300 

AGACAATTAT TAATTATTTA CGACGTATCA 360 

CAACTGAAGT TGGTGCAATT ACTTCTCAAT 420 

TGATTGGTCA TTATATAGGT ATGAGCTTCC 480 

GTACCGAAAA GAAATTAGGT AAGCCGGTCG 540 

TACCGATtTT ATTAGAAATG CGTAAAAATC 600 

GTCGTGATAG TGAACGCAAA GAATTTGAAG 660 

GCATCGATGA GGCTAAGGCA GTAAGTTCGA 720 

CyGaGTTACC aGATGGACaT CCGaGAtCAC 780 

CAAnAAACAC GTAGTATTTA TGnAAAAGTA 840

TAATAGTAAT ATAC 884
(2) INFORMATION FOR SEQ ID NO: 186: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6876 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 186:



AATTTCATCT GCTCGTGCAA AATCTTTGTT TTTCCTTGCT TCATTACGCT CTTCGATTAA 60
TTTTTCAACA TCTTCATCCA ATAATTCATC TGCATTTTTA GATTTTAACG GTACACCTAA 120
AACATCGCTG AAAATTTGAT AAACTGCTTT AAATTTATCA ATTACTTCTG TTGATGTTGT 180
GTTCTCTAGT ACATATTTAT TCGCAAGTkT TGCTAAATCA TACCAAGCTG TAATTGCATT 240
AGCTGTATTA AAATCATCAT TCATAACTGT TTCAAAACGA TTTAAAATCG CATCAATTTG 300
ATCAATATAT GTCTGTTGAT TTTCAATATT AGTAGCAATT TGTGCGCGCT CTTCAATTAA 360
TTGATAACTA TTGCGAATAC GCTCTAGTcC aCTACGTGCT GATTCTACCA ATTCTAGATT 420
ATAGTTAATT GGGCTTCTAT AATGTACGCT AATCATAAAG AATCTTAGTA CATCTGGATC 480




CATTTGTTAT TCTGAGTAGC CAATTTGGCA
AAGTTGCAGT CGCATTAGAG TTAATTCATA
ATAAAAGCGA CAAGCGTCGA GGCAAGTTAA
CTATTTTAAC TGGGAATTTT TTATTGGCAT
ATAATCGTGT ACATCAATTG ATATCTGAAT
TCCAATTTCA AGACCAATTT AACAGTCAAC
ATCGCAAAAC AGCACTGTTA ATTCAAATAT
CTGATAAAGA GACTGTACGA AAATTGAAAA
AAATCATTGA TGATGTATTA GACTTCACAA
GAAGTGATTT GCTTAATGGT CATATTACGT
CAGACTTCAA ATTGAAAATC GAACAGTTAC
AATGTATCCA AATCATTAGA AAATCTGACA
AGTATTTAAG TAAAGCyTTG AATTTGATTT
TACyTTTAAG TTTGACGAAA AAAATGGGTT
TTGAAAGCGC TTTACCAACC TGTTAATATA
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ATTATCAATA TTAATGAAAC CATTATGCAT CCAATAATTA GCAAATGGCG CATGATTATG 600
TGCTTCTGAT TGTGCTATTT CATTTTCATG ATGTGGAAAT TGTAAATCTG AACCACCCGC 660
ATGTATATCA ATTGTAGGTC CTAGCTCATG AAATGCCATT ACAGAACATT CTATATGCCA 720
TCCTGGTCTA CCTTCACCAA ATGGGCTATC CCAACTAATC TCGCCAGGTt CGCTITTTTC 780
CACAATGTAA AATCAAGTGC ATCTTCTTTA TGCTCTCCTG CATCTATACG AGCACCCACT 840
TTTAAGTCAT CTATGGATTG ATGACTTAAT TTACCATAAC CTTCAAATTT ACGTGTTCTA 900
AAGTAAACAT CGCCACCACT TTCATATGCA TAACCTTGAT CCACCAAATC TTTAATAAAT 960
TGAATAATGT CATCCATATG GTCCATTACC CTTGGATTTG AAGTCGCTTT TCTAACATTT 1020
AACGCACCAA CATCTTCATG AAAAGCAGCG ATATATTTTT CTGCAATTTC GGGAACAGAC 1080
TGATTTAATT CTTGAGAACG TTTAATTAAT TTATCATCTA CGTCTGTAAA ATTTGATACA 1140
TATTCTACAT TATATCCTTG GTATTCAAAG TAACGTCTCA CTACGTCATA ATTAATTGCw 1200
GGTCTTGCGT TACCAATATG AATGTAGTTA TATACAGTAG GACCACATAC ATACATTTTT 1260
ACTTTCCCTG GTTCTATAGG CTTGAACACT TCTTTTTGAC GTGTAAGCGT ATTATATAAT 1320
GTAATCATCT TGAATCTCTC CATTCCTAGT CTTTTCAAGT TGTCGTTCTA AATGCTTAAT 1380
TTGTTCATAA ATTGGATCAG GTAGATGGCG ATGATCAAAT GrrrrrccAA CTCGAACACC 1440
ATCTTGCTTA ACAATATGTC CTGGTATACC AACAACCGTT GAATAACTTG GAACTGATTG 1500
TAAAACAACT GAATTTGCAC CAATATTTAC ATTTGAATTT ATTTTAATAT TTCCTAAAAC 1560
TTTCGCACCG GCTGCTATTA AAACATTGTC TCCTATATCT GGGTGTCTTT TCCCTCTTTC 1620
TTTCCCTGTC CCACCAAGTG TCACGCCTTG ATAGATTGTC ACATTATCAC CAATTGTACA 1680
TGTTTCTCCT ATTACAACGC CCATACCATG ATCTATAAAT AGACGCTTTC CAATTTTAGC 1740
ACCTGGATGG ATTTCTATAC CTGTGAAAAA TCTTGAAATT TGAGATATCG CGCGTGCTGC 1800
AACATATTTT TTTTGGTTGT ATAACTTATG TGCAATCAAA TGACTCCAAA CTGCATGTAA 1860
ACCTGCATAC GTTGTAATGA CTTCTAATGT TGAACGTGCC GCTGGATCCT GCTCAAATAC 1920
CATTTTTATA TCGTCTCTCA TTCTTTTTAA CAAGATCATT TCCTCCTCAA TGATTGAACT 1980


ACGTAAATAC 


ATAATTGAAG 


TACCTGCGAA 


ATTAAATATC 


AAAAAAGCAC 


CACTAACATA 


2040 




CAAATTGTAT 


TGTTAGAGGC 


GCTTCCGCAC 


GGTTCCACTC 


TGAATTTAGC 


GAATAACATT 


2100 


50 


AATAATATTG 


CGGGCGCTTC 


CAAATTATCA AGGAAACTAA 


GTCAACTTAA 


TGCTCATCAC 


2160 


TCTCATTATA 


TATTTAATTC ATTTTACGAA 


GGTGCATTCA 


TTAATTTCTA 


CGTTGTACTC 


2220 




ACAGCAACCG 


TACACTCTCT 


GCATCGTATA AATTTAATTA 


CTAATCCTTC 


GTTTTATATA 


2280 
ATAAAATTCA AGTATATACT ACCTTGATCT TGTCTATTTC ATTACTTATA TTGTTTTAAA 2400

CGGTTTAGCA CTTTTTCTTT ACCAAGTACT TCAATTGTAT TTGGTAATTC AGGACCATGC 2460

ATTTGGCCTG TTACAGCAAC ACGAATAGGC ATAAATAATT GCTTGCCTTT TATTCCTGTT 2520

TCTTTTTGAA CTTCTTTAAT TGTCTTTTTA ATTTCAGCCG CTTCAAATGG TTCAAGTGCT 2580

TCTAATTTAC TGAATAAGTG CGTCATTAAC TCTGGTACTT GCTCTCCATT AATCACTTGT 2640

TGTTCTTCTT CACCAAGAGC TGGCATTTCT TTAAAGAACA TTTCTGATAA AGGTACAATT 2700

TCACCGGCAT AACTCATTTC TTTTTGATAA AGCGCAATTA ATTTGCGTCC CCAAGATAAA 2760

TCCTCTTCTG ACGGCACCTC AGGAATCAAA TTTGCTTTAA TTAAATGAGG TAATGCTAAT 2820

TGGAATACTG TTTCAGTATC TTTTTGTTTC ATATATTGGT TATTAACCCA TGCTAATTTT 2880

TGCTTATCGA AAAATGCTGG TGATTTTGAC AAACGCTTTT CATCAAAGAT TTTGATAAAT 2940

TCTTCTTTAG AAAAGATTTC TTCTTCACCT TCAGGAGACC AACCTAATAA CGCAATAAAA 3000

TTAAATAACG CTTCAGGTAA ATAACCTAAG TCACGATATT GCTCAATAAA TTGTAAAATT 3060

TGCCCATCAC GTTTACTTAA CTTTTTACGT TCTTCATTAA CAATTAATGA CATATGACCA 3120

AAACGAGGTG GCTCCCAGCC AAATGCTTCA TAAATCATAA TTTGTTTAGG CGTGTTTGAA 3180

ATATGATCAT CACCACGAAT TACATCTGAA ATTTGCATGT AATGATCATC TATAGCTACT 3240

GCAAAATTGT ACGTTGGAAT GCCATCTTTT TTTACGATAA CCCAGTCACC AATACCATTT 3300

GAATCAAATG AAATATTTCC TTTTACCATA TCATCAAATG AATACGTTTG GTTTTGAGGT 3360

ACTCGGAAAC GAATTGATGG TTGGCGTCCT TCTGCTTCAA ATTGTTGACG TTGTTCTTCA 3420

GTCAAATGCG CATGTTGACC ACCATAGCGA GGCATTTCAC CACGAGCGAT TTGCGCTTCA 3480

CGTTCAGCTT CTAATTCTTC TTCTGTCATA TAGCATTTAT ATGCTTTATC TTCTGCTAGT 3540

AACTGATCTA TTAATGGTTG GTAGATATGT TGACGTTCAG ATTGACGATA TGGTCCGTAG 3600

CCATTGTCTT TATCTACAGA CTCATCCCAA TCTAATCCTA ACCATTTAAG ATTATCAAAT 3660

TGTGATGTTT CTCCATCTTC TAAATTACGT TTTTTATCAG TATCTTCAAT TCGAATCACA 3720

AAATCTCCGT TGTAATGTTT AGCATACAAG TAATTGAATA ATGCTGTTCT TGCATTACCA 3780

ATATGAAGAT ACCCAGTTGG ACTTGGTGCA TATCTTACTC TTATACGATC GCTCATTTTT 3840

TTCACTCCTA AATTAAATAT CAGATTTTCA AGTTAGTTCA TATAAATTGT TCATTTGCTA 3900

TCTTCGACCG TCATAACAAA TGTCTAACTC GTCTTATTGT TAAAACGAAA CAATGCTTTT 3960

TAACATGACC TTAAAATAAT TTCATTGTTT AATCATAACA TAATTCCCTG GGTAATATGC 4020

TTAAATTTTA AATAGAAAGC TGTTGTTTTT TCAACACTTT AAAAAAGCTA TCCCTAAGAA 4080






TTAAACTTCA AATTAACTAT TCAAATACGT TAAAATTGAT TCTAATTTTG TATGTCTTGA 4200

TTGCTATAAG AATAACTTTA TTAATATCTA AAATTTAACA CTTAATGAAC TTGTTTCAAT 4260

GATATATTAG CACTATTTGT ATTTTTTGAT AACTAATATG TTTTGCATTT ATTTATAGTT 4320

ATACTTCAAA TTACAAACTt CGCCATTTCA TATACCTTTT AATATCTATT TTGTTTTCGT 4380

CAACTACAGT TTTTATAATG ATACTGTATC TTCGATTTTT TTAGCAAAAA CAATTCTTCC 4440

TGAAGATGTT TGCAATAAGC TGACTACTTC TAAATTGACA TGACTGCCAA TAAGATTTTT 4500

AGCATTATCA ACAACTACCA TCGTACCATC ATCTAGATAT CCTACTGCCT GACCAGGCtC 4560

CTTACCCATT TTTGTCAGTA AAATATGCAG TTGATCACCT TGATGTACAT TAGGTTTGAT 4620

TGCTTCTGAT AAATCATTAA CATTTAATGC TTTGATACCA TGTACATGAC AAACTTTATT 4680

TAGGTTGAAA TCTGTCGTTA TAATACTTGC ATGATATTGT TTTGCTAATT TTAATAACAT 4740

CGTATCAATA TCACTATGTG TTTTAGTTGG ATGTATAACC TTTGTAGGAT AGTCTAAATC 4800

ATACAATTCA TTTAAAATAT CTAAGCCTCT TTTACCCTTT TCaCGTTTAA CACTGTCATT 4860

TGAATCTGCA ACAATTTGTA ATTCATTAAT AACACCTTGT GGAATTAAAA TATTGCCATC 4920

GATAAAACCG CAACGAATGA CTTCTAAAAT ACGACCATCA ATAATTGCGC TTGTGTCGAT 4980

AATTTTTGGC GTAgcaCTTT TaGTATGTTG TGACATGGAA CGCGCTATAT TCTCAGGTAA 5040

AAACATTAAC ATTTCATCTC GTTTTTTAAG GCCAAATTGG AAACCGAAAT AACATAGTAA 5100

TATCGTAATT ATGACAGGAA TGAAATGATT AAAAATAGAG TTGCCAATTG ATTCTAATAT 5160

AAACGACACC ATAACAGAAA TAAGTAATCC GATTATTAAA CCTATTGTTG CGAATAGTAT 5220

TTCAACAGCA CTTCTACGCA TAATAAAATG TTCTAAACCT TTTATAGCGT TAGTAACTCG 5280

TCTAATAAAT ACACCAAAAA TTAAGAACAT AAAAATACTA CCGATAATGC CATCTACATA 5340

GTGA3TTTTT AAAAAGCTGG AGTTTTGTAA TCCAAGATCA TTTGCAATTT CAGGAATAAT 5400

AATTATTCCT AATGCGCTCC CAATAATTAA GTAAATAATA ATAACCATTA GTTTAACGAT 5460

ATTCAGACAA TGTCCTCCTT TCTTGATGTT TTATGAATGA AGAGCAAATG ACAATACTTC 5520

ATGTACAGTA GTTACACCTA TTACTTGTAT ACCTTCAGGA TATGTCCATC CGCCTATATT 5580

ATTTTTAGGA ATAATTACAC GTTTGAAACC TAGTTTTGCA GCCTCTTGCA CGCGTTGTTC 5640

TATCCGAGAT ACACGACGTA CCTCACCCGT TAAACCAACT TCTCCAATAT AGCAATCTAA 5700

TCCGTCGACA GCTTTATCTT TAAAGCTAGA TGCAGTTGCT ACAATTACAC TTAAATCAAC 5760

TGCTGGCTCC GTTAACTTTA CACCGCCAGC TACTTTGATA TAAGCATCTT GTTGTTGTAA 5820

TAGATAATTT TCTTTCTTTT CCAAAACAGC CATCAACAAA CTTAATCGAT TATGATCAAT 5880






TATTAAAAGT GGTCTGGTTC CCTCCATGGT TGCAACAATT GTTGAACCTG GAACATTTGT 6000

TGAACGTTCT TCTAAAAACA TTTCAGATGG ATTATTTACA CCTTTTAATC CACTTTGCTT 6060

CATTTCGAAG ATTCcCATTT CATTCGTTGA ACCAAAACGG TTTTTAACAG CTCGCAAAAT 6120

TCGATATGCG TGGTGTTCAT CGCCTTCAAA ATAAAGCACA GTATCaACCA TGTGTTCTAG 6180

CAATCTTGGG CCCAGCAATT TGACCTTCTT TCGTTACATG ACCCACTATA AAAGTTGCaA 6240

TGTTCATTTG TTTAGCAATA TTCATTAAAC TTTGTGTACT TTCACGAACT TGTGAAACAG 6300

AACCTGGCGC AGAGCTGATT TCAGGATGAT ATATTGTTTG AATCGAATCC ACTACTAATA 6360

AATCAGGTTG TTCTTCTTTT ACTGTTTGAT AAATAACTTC AAGATCTGTT TCAGCTAATA 6420

CTTGCAATTC ACTTGAATCT TCATCTAATC GCTCTGCACG TAATTTAGTC TGACTAAGCG 6480

ATTCTTCTCC AGTAATATAT AGTACTTTTT TCTTTTGAGA TAACGATGCA CAAATTTGTA 6540

AAAGTAACGT TGACTTACCA ATACCTGGAT CCCCACCAAT AAGTACTAAC GATCCGCTCA 6600

LAAIAC-CTCC ACCTAATACA CGGTTGAATT CTGCTGAATC TGTTAACACT CTCGGCGTTG 6660

TTTCATGTTT AATACTATTT AATTTTTGTA CTTTACCTGC TAATTCCTTG GTTTTAACTC 6720

CATGTTTAGG ATTGGCTGCT TTTTCAACAA TTTCCTCCAT TTGATTCCAA GCGCCACAAT 6780

TAGGACATTT CCCCATCCAT TTAGGAGATT GATAACCACA AGCCATACAT TCAAAAATCA 6840

CTTTTTTCTT GGCCArAATT GCAcCTCCAC TTTCTT 6876

(2) INFORMATION FOR SEQ ID NO: 187: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1193 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 187:

CAACTCAAAC AGCAGAACAA CGTCGTGAGT TGATTAATGG TGTATTTACT GACATTAATC 60 

CCATACATTA AAAATATGAT GTACGTGTTA GCAGATAATA GACATATCTC ATTAATAGCT 120 

GACGTATTCA AGGCGTTCCA AAGCTTATAT AACGGACACT ACAATCAAGA TTTTGCAACA 180

ATTGAGTCAA CATATGAATT GAGTCAAGAA GAGTTAGATA AGATTGTCAA ACTAGTAACT 240 

CAACAAACGA AGTTATCTAA AGTTATTGTA GATACAAAAA TTAATCCAGA TTTAATTGGT 300 


GGATTTAGAG TTAAAGTCGG CACAACTGTA TTAGATGGTA GTGTTAGAAA TGATCTTGTC 360 

CAATTACAAA GAAAATTTAG AAGAGTTAAT TAATTATAAA GAGGAGTGAC ATAGATGGCC 420 

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ATGTCCGTAA CTGATGTAGG TACTGTATTA CAAATTGGTG ATGGTATTGC ATTAATTCAC 540 

GGATTAAATG ACGTTATGGC TGGTGAGCTA GTAGAATTCC ATAACGGCGT ACTTGGTTTA 600 

GCCCAAAACC TTGAAGAGTC AAACGTGGGT GTGGTTATTT TAGGACCATA CACAGGTATT 660 

ACTGAAGGTG ACGAAGTTAA ACGTACTGGT CGTATCATGG AAGTACCAGT AGGTGAAGAA 720 

CTAATCGGAA GAGTTGTTAA TCCATTAGGA CAACCTATTG ATGGAcAAGG ACCGATTAAC 780 

ACAACTAAAA CACGTCCaGT AGAGAAAAAA GCTACTGGTG TAATGGATCg TAAATCAGTA 840 

GATGAGCCAT TACAAACAGG TATCaAAGCA ATTGATGCTT TAGTACCAAT TGGTAGAGGT 900 

CAACGTGAGT TAATCATCGG TGACCGTCAA ACAGGTAAAA CAACAATTGC AATTGACACA 960 

ATTTTGAACC AAAAAGATCA AGGTACGATT TGTATCTATG TTGCTATTGG TCAAAAAGAT 1020 

TCAACAGTAA GAGCAAATGT TGAAAAGTTA AGACAAGCAG GCGCTTTAGA CTACACTATT 1080 

GTTGTAGCAG CATCAGCTTC TGAACCTTCT CCATTATTAT ATATTGCACC ATATTCAGGT 1140

GTAACAATGG GTGAAGAATT CATGTTTAAC GGTAAACATG TTTTAATCGT TTA 1193 
(2) INFORMATION FOR SEQ ID NO: 188: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 188: 

TGCTAAGAAG TCAAAATAAA CTAACTATnA AACATCTAGT ACGATTATTA AAGTGACAGA 60 

TnATAAAATT GAATTATTnA GAGAAGGAGA TATAAAGTTT GAAGAAATAA AAGAAAGACT 120 

AGGTACAGGT ATTATTTATG AATAAGTTAA TACTTGGGAT TTATTTATAC CGAATTTTTT 180 

CACGAGCATA CTTTTATTTA CCGTTTTTAT TAATTTACTT TTTGATTCAA GGTTATTCCA 240

TAATACAATT AGAAATATTA ATGGCGTCTT ATGGCATTGC AGCATTTTTA TTCTCTCTAT 300 

ACAAAGAGAA GTGTTTTAAA ATTTGTAACT TAAAAGATTC TAATAAATTA GTTGTTAGTG 360 

AAATATTCAA AATCATCGGT TTATTGTTGT TATTATATCA AAATCAATAT TTAATTTTAG 420 

TAGTGGCACA AATATTATTA GGGTTAAGTT ACTCAATGAT GGCGGGTGTT GATACCGCAA 480 

TAATTAAAAG AAATATAACA AATGAGAAAT ACGTACAAAA TAAGTCAAAT AGCTATATGT 540 

TCCTATCATT ATTAATTTCA GGGATTATAG GTAGTTATCT TTATGGAATA AATATTAAAT 600 

GGCCTATAAT AATGACTGGT ATATTTTCAA TTCTAACAAT TATAATTATT CGATGCACAT 660 

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TACCAGAAGA GAAGTTTTGG ATATTGCATT ATTCTTTTTT AAGAGCGTTA ATATTAGGAT 780 

TTTTTATAGG ATTTATTCCA ATTAATATAT ATAATGATTT AAAACTGAAT AATTTACAAT 840 


TTATTTCAGT ATTAACTTGT TACACAGTTA TGGGTTTTGT ATCTTCACGT TATTTAACTA 900 

AATACTTGAA TTATAAGTTT GTGTCAGAAA TTTGTlTAGT AATATTTTTA ATAATATATA 960 

CATATCAAAG TTTCATAGCA GTTACTATTT CTATGATATT TTTAGGTATT TCTTCAGGGT 1020 


TAACTCGTCC ACAAACTATA AATAAACTTT CTAGCAGTAG TAACTTAAGA GTGATGCTTA 1080 

ATTATGCAGA AACGTTATAT TTTATTTTTA ATATCGCATT TTTACTTATG GGTGGTTACT 1140 

TATATACAAT AGGAACTATT CAATACTTAA TATTATTTAT TTCGTTATTA ATTTTTATAT 1200

ATTTAATAAT AATATTTyAT TTTACAAGGA GAGAGCAACA TGAAAATAAA AACTGAATTT 1260

AAAGGGAACA ATATACCATA TGAATACGCA GCAGGTGCAG ATGTGAGTGA TTCTATTAAC 1320 

GGGAATCCAA TTAAGTCATT TCCATTTGAA GTAATTGAAT TACCGGAAGG gACTAAATAT 1380

CTTGCTTGGT CTTTAATTGA CTATGATGCA ATTCCTGTAT GTGGCTTTGC TTGGATTCAT 1440

TGGAGTGTAG CTAATGTAAG TGTTAGTGGC AATTCAATTT CTATAAAAGC AGATTTATCA 1500 


AGAACAAAGG GCGACTATGT ACAAGGTAAA AATAGCTTTA CTAGTGGGTT GTTGGCTGAA 1560 

GATTTTTCAG AAATAGAAAA TCACTATGTA GGACCTACAC CACCTGATCA AGATCATCAA 1620 

TATGAATTAA CAGTTTATGC GTTAGATCAT TCTTTAAATT TGAAGAATGG GTTCTACTTG 1680 


AATGAATTTT TAAAAGAAGT AAATCAACAT AAAATTGATC AAACAAGTAT TAACCTTATA 1740 

GGAAGAAAAA TTTAATACTA AATATCTCAT CAATATAAAA TTGTTCAATT AAAAGTACAA 1800

AGAAACAAAG GTTTTAATTT ATATATTAGG TACGGCGTTC GCTATAATGC AAAGAAGTAA 1860

TTAAATTTAA GAAATGTAAA CTTAGTTATT GTAATGTGAA TTTATTTGAA AAAATAGAAA 1920 

GTATTAACAA TTATAGCTTT TACATTAATT AAAATTTATT TTTAAAAACA AGTAAACAAT 1980 

TTACATACTT ATAATTTTTG AAAATTTTCA ATTTGTGTTA TATTGATTTT GTAAGATACT 2040

TTAACTCACA AAGGAGAGAG AGTATATGAA ATTAAAATCA TTTATAACTG TAACTTTGGC 2100 

ACTGGGCATG ATCGCAACGA CTGGCGCTAC TGTGGCAGGT AATGAGGTAT CTGCAGCAGA 2160 

AAAGGACAAA CTACCGGCAA CTCAAAAAGC TAAAGAAATG CAAAATGTTC CATATACAAT 2220

TGCAGTAGAT GGCATTATGG CTTTCAATCA ATCTTACTTA AATTTACCAA AAGATAGCCA 2280 

ATTATCATAT TTAGATTTAG GAAATAAAGT TAAAGCTTTG TTATATGATG AACGCGGTGT 2340 


AACACCTGAG AAGATTCGAA ATGCAAAATC TGCCGTTTAC ACGATTACTT GGAAAGATGG 2400 

TAGTAAAAAA GAAGTGGATC TTAAGAAAGA TAGCTACACA GCAAACTTGT TTGATTCAAA 2460 
CAACATGAAG CATTTAATTT TACAGTGATG ATTATAAAAT AATTGCCTTG ATACAAAGAT 2580

TACTCGTAAA TGACATCTTT GTATTAAGGC TTTTTCTAAA TTTAAAAGTG ATGGGTTAGA 2640

GGTCATTGAG CTTTAAAATA TTCAAAATAC AAAACATTAA TGGCCAAAAA TAAAAGCCGC 2700

CTTTATCTGG GCAGCTTCAA TAATAAGAAA GACATATTTC ATTTTATACT AAATAGTTAT 2760

TGTGATGAAT CTTTCGGCGG TTTAATTACT GCAGCAAAAA TTGCTGTGAA AATCGTGAAC 2820

AATACTGCCA TGATAATTGG ATTCACTACA TTTAAGCTGT CTCCACCTAC TAGGCTATTA 2880

AGTACAAAGT TAACCATTTG CATTAATAAT AATGCCCAAA AGAATGTTAC GAGGTGTTTC 2940

ATGTCATTCT ACCTCCACTT TAATTATATA TATTTTATTT TAAGTGAAAG TTAGAAATTT 3000

GTATAGTAAC ATCTCATATA TTTTGACCAT ATTATACAGT TTAAATAAAT GATTTTATCT 3060

GAATGGCTAT TCTAAATTAA GCGCATTAAA ACCAATTTCA TACTGAAATT TGACGATAAT 3120

AAAGCATTAA AATTTTATTA ACTAGTCAAT ATTCCTACCT CTGACTTGAG TTTAAAAAGT 3180

AATCTATGTT AAATTAATAC CTGGTATTAA AAATTTTATT AAGAAGGTGT TCAACTATGA 3240

ACGTGGGTAT TAAAGGTTTT GGTGCATATG CGCCAGAAAA GATTATTGAC AATGCCTATT 3300

TTGAGCAATT TTTAGATACA TGTGATGAAT GGATTTCTAA GATGACTGGA ATTAAAGAAA 3360

GACATTGGGC AGATGATGAT CAAGATACTT CAGATTTAGC ATATGAAGCA AGTTTAAAAG 3420

CAATCGCTGA CGCTGGTATT CAGCCCGAAG ATATAGATAT GATAATTGTT GCCACAGCAa 3480

CTGGaGATAT GCCATTTCCA ACTGTCGCAA ATATGTTGCA AGAACGTTTA GGGACGGGCA 3540

AAGTTGCCTC TATGGATCAA CTTGCAGCAT GTTCTGGATT TATGTATTCA ATGATTACAG 3600

CTAAACAATA TGTTCAATCT GGAGATTATC ATAACATTTT AGTTGTCGGT GCAGATAAAT 3660

TATCTAAAAT AACAGATTTA ACTGACCGTT CTACTGCAGT TCTATTTGGA GATGGTGCAG 3720

GTGCGGTTAT CATCGGTGAA GTTTCAGATG GCAGAGGTAT TATAAGTTAT GAAATGGGTT 3780

CTGATGGCAC AGGTGGTAAA CATTTATATT TAGATAAAGA TACTGGTAAA CTGAAAATGA 3840

ATGGTCGAGA AGTATTTAAA TTTGCTGTTA GAATTATGGG TGATGCATCA ACACGTGTAG 3900

TTGAAAAAGC GAATTTAACA TCAGATGATA TAGATTTATT TATTCCTCAT CAAGCTAATA 3960

TTAGAATTAT GGAATCAGCT AGAGAACGCT TAGGTATTTC AAAAGACAAA ATGAGTGTTT 4020

CTGTAAATAA ATATGGAAAT ACTTCAGCTG CGTCAATACC TTTAAGTATC GATCAAGAAT 4080

TAAAAAATGG TAAAATCAAA GATGATGATA CAATTGTTCT TGTCGGATTC GGTGGCGGCC 4140

TAACTTGGGG CGCAATGACA ATAAAATGGG GAAAATAGGA GGATAACGAA TGAGTCAAAA 4200

TAAAAGAGTA GTTATTACAG GTATGGGAGC CCTTTCTCCA ATCGGTAATG ATGTCAAAAC 4260



TGAACCTTAT AGCGTTCACT TAGCAGGAGA ACTTAAAAAC TTTAATATTG AAGATCATAT 4380

CGACAAAAAA GAAGCGCGTC GTATGGATAG ATTTACTCAA TATGCAATTG TAGCAGCTAG 4440

AGAGGCTGTT AAAGATGCGC AATTAGATAT CAATGAAAAT ACTGCAGATC GAATCGGTGT 4500

ATGGATTGGT TCTGGTATCG GTGGTATGGA AACATTTGAA ATTGCACATA AACAATTAAT 4560

GGATAAAGGC CCAAGACGTG TGAGTCCATT TTTCGTACCA ATGTTAATTC CTGATATGGC 4620

AACTGGGCAA GTATCAATTG ACTTAGGTGC AAAAGGACCA AATGGTGCAA CAGTTACAGC 4680

ATGTGCAACA GGTACAAATT CAATCGGAGA AGCATTTAAA ATTGTGCAAC GCGGTGATGC 4740

AGATGCAATG ATTACTGGTG GTACAGAAGC ACCAATTACT CATATGGCAA TTGCTGGTTT 4800

CAGTGCAAGT CGAGCGCTTT CTACAAATGA TGACATTGAA ACAGCATGTC GTCCATTCCA 4860

AGAAGGTAGA GATGGTTTTG TTATGGGTGA AGGTGCTGGT ATTTTAGTAA TTGAATCTTT 4920

AGAATCAGCA CAAGCTCGAG GTGCCAATAT TTATGCTGAG ATAGTTGGCT ATGGTACTAC 4980

AGGTGATGCT TATCATATTA CAGCGCCAGC TCCAGAAGGT GAAGGTGGTT CTAGAGCAAT 5040

GCAAGCAGCT ATGGATGATG CTGGTATTGA ACCTAAAGAT GTACAATACT TAAATGCCCA 5100

TGGTACAAGT ACTCCTGTTG GTGACTTAAA TGAAGTTAAA GCTATTAAAA ATACATTTGG 5160

TGAAGCAGCT AAACACTTAA AAGTTAGCTC AACAAAATCA ATGACTGGTC ACTTACTTGG 5220

TGCAACAGGT GGAATTGAAG CAATCTTCTC AGCGCTTTCA ATTAAAGACT CTAAAGTCGC 5280

ACCGACAATT CATGCGGTAA CACCAGATCC AGAATGTGAT TTGGATATTG TTCCAAATGA 5340

AGCGCAAGAC CTTGATATTA CTTATGCAAT GAGTAATAGC TTAGGATTCG GTGGACATAA 5400

CGCAGTATTA GTATTCAAGA AATTTGAAGC ATAACTATAA nAATCTTCAG TAACGTTGTT 5460

TTAGTTACTG AAGATTTTTT CaGTTTCTTT ATACTAAGAT GAGGGACACA CAATCGTCAT 5520

AATAAAATAT GAATATTTAT TAATAATAA 5549



(2) INFORMATION FOR SEQ ID NO: 189:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4832 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 189:

AGATTATAGT AAGATTGATA GTTTGGCGAC TGaAGCgCGa GaAAAATTAT CAGaAGTAAA 60 
mCCTTTAAAT ATTGCACAAG CTTCTAGAAT ATCAGGGGTA AATCCAGCAG ACATATCTAT 120 
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TGGTTAGCAG AACAATTAAA AGAACATAAT ATTCAATTAA CTGAGACTCA AAAACAACAG 24 0 

TTTCAAACAT ATTATCGTTT ACTTGTTGAA TGGAATGAAA AGATGAATTT GACAAGTATT 300 

ACAGATGAAC ACGATGTATA TTTGAAACAT TTTTATGATT CCATTGCACC TAGTTTTTAT 360 

TTTGATTTTA ATCAGCCTAT AAGTATATGT GATGTAGGCG CTGGAGCTGG TTTTCCAAGT 420 

ATTCCGTTAA AAATAATGTT TCCGCAGTTA AAAGTGACGA TTGTTGATTC ATTAAATAAG 480 

CGTATTCAAT TTTTAAAGCA TTTAGCGTCA GAATTACAAT TACAGGATGT CAGCTTTATA 540 

CACGATAGAG CAGAAACATT TGGTAAGGGT GTCTACAGGG AGTCTTATGA TGTTGTTACT 600 

GCAAGAGCAg TAGCTAGATT ATCCGTGTTA AGTGAATTGT GTTTACCGCT AGTTAAAAAA 660

GGTGGACAGT TTGTTGCATT AAAATCTTCA AAAGGTGAAG AAGAATTAGA AGAAGCAAAA 720 

TTTGCAATTA GTGTGTTAGG TGGTAATGTT ACAGAAACAC ATACCTTTGA ATTGCCAGAA 780 

GATGCTGGAG AGCGCCAGAT GTTCATTATT GATAAAAAAA GACAGACGCC GAAAAAGTAT 840

CCAAGAAAAC CAGGGACGCC TAATAAGACT CCTTTACTTG AAAAATAATG CATAATCCTT 900 

TACAACTAAC ATAAAAGGAG CGAATGGATA ATGAAAAAAC CTTTTTCAAA ATTATTTGGT 960 

TTGAAAAACA AAGATGACAT CATTGGACAT ATTGAAGAAG ATCGCAATAG TAATGTTGAA 1020

TCCATTCAAA TTGAACGTAT CGTTCCCAAC CGTTATCAAC CAAGACAGGT GTTTGAACCA 1080 

AATAAAATTA AAGAACTTGC TGAATCAATA CATGAACATG GTTTACTACA ACCTATTGTT 1140

GTAAGACCGA TTGAAGAAGA TATGTTTGAA ATTATTGCTG GAGAGCGCCG ATTTAGAGCA 1200 

ATACAATCAC TAAATTTACC TCAAGCAGAC GTTATTATTC GTGATATGGA TGATGAAGAG 1260 

ACGGCTGTTG TTGCATTAAT TGAGAATATT CAAAGAGAAA ATTTGTCTGT TGTTGAAGAA 1320 

GCGGAAGCCT ATAAGAAATT ATTGGAAATT GGTGATACAA CGCAAAGTGA ATTGGCAAAA 1380 

AGTTTAGGTA AAAGTCAAAG CTTTATTGCA AATAAGTTGC GTTTATTGAA GTTGGCGCCG 1440

AAAGTACTAC TTCGCTTAAG AGAAGGTAAA ATTACTGAAC GTCATGCGAG AgcGGtATTA 1500

TCATTGTCTG ATAGCGAACA AGAAGCGTTG ATTGAGCAAG TCATTGCACA AAAGCTAAAT 1560 

GTGAAcAGAc TGAAGATAGA GTACGCCAAA AAACGGGGCC CGAAAAAGTC AAAGCACAAA 1620 

ACCTTCGCTT TGCACAAGAT GTCACTCAAG CACGAGATGA GGTAGGCAAA AGTATCCAAG 1680 

CGATTCAACA AACAGGATTA CATGTTGAGC ATAAAGACAA AGATCATGAA GATTATTATG 1740 

AAATAAAAAT TCGAATATAT AAACGTTaGT AGTAGGATGT CGTATACATG ATGACTAACA 1800 

CATAAAAGAC AAAGCTAAGA TCATAACAGC TTTGTCTTTT TTTTTTGTTT TACGTGAAAC 1860 

ATAAAAATTT ATATTTATAT GTTGATCAGG CTGGTACATA AATCAATGTT CTATGCTCTA 1920 

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TTCTAGTCAA CCTTGCTGGG GTGGGACGAC GAAATAAATT TTGCGAAAAT ATCATTTCTG 2040 

TCCCACTCCC TAATTTGAGC TGGATATACT TTCATTTGAA CCCTTTATTG CTAGTTTATG 2100

AAAGTATCAT GAAAGCTTTA TGAACATCGC TTGAGTTGCC TTTACAGTAG AAAATTTAAG 2160

TTTTACACTT TGTGTGAATG ATACGTTTTG TATTGAATTA ATTATAGAAA GGTACGTTGA 2220 

AGATGTTTTC AATTGGAAGT GCAATTCTTC ATTTTGTCAT TGGTGGTATC GCTGTTGCAT 2280 

TAGCTTCAAT TATTGCTGAT AAGGTAGGTG GTAAGTTAGG AGGTATTATA GCTACTATGC 2340

CGGCAGTCTT TCTTGCGGCT ATTATCGCAT TAGCTTTAGA TCATCGTGGT ACGCAATTAG 2400 

TGGAGATGTC GATGAATCTT AGTACTGGAG CAATTGTCGG TATTCTGTCT TGTATATTAA 2460 


CTGTATTTTT GACATCTCTC TACATTAAGC ATAAAGGTTA TCGGAAAGGC GCAATATTCA 2520 

CAGTTGTTTG TTGGTTTGTC ATTTCCCTCG CAATATTCAG TATTAGACAT TTATAGTTTG 2580 

GAAAATGCGT GATAATTAGT TGTATTCAGT TATTAAGTAA TAAATTATTG GAGGCAGAAC 2640

ATCATGAAAT TAACATTAAT GAAATTTTTT GTGGGGGGAT TTGCAGTATT ATTAAGTTAT 2700 

ATTGTATCTG TAACACTACC TTGGAAAGAA TTTGGCGGTA TATTTGCaAC GTTTCCGGCA 2760 

GTATTTTTAG TGTCTATGTT TATTACAGGT ATGCAATATG GTGATAAAGT CGCTGTGCAT 2820

GTAAGTCGTG GCGCAGTGTT TGGTATGACA GGGGTATTAG TTTGTATTTT AGTTACATGG 2880 

ATGATGTTAC ATATGACGCA CATGTGGTTG ATTAGCATTG TTGTTGGTTT CCTAAGCTGG 2940 


TTCATCAGTG CAGTATGTAT TTTTGAAGCG GTAGAATTTA TAGCACAAAA AAGATTAGAA 3000 

AAGCATAGTT GGAAAGCTGG AAAATCGAAT AGTAAATAGT GTGAACGTAA TCTCTTAACT 3060 

AGGACTAACT TTGCAAGCAT TGAATAGCAT GGAAAAGTTG CATCATTAAT AAGTGAAATT 3120

CAAGTTGGCA TTGAGAAAAT TACAAGCGCG TAATCATACa GGTCTGTCTT AAGGGAGTCT 3180 

TCGAACCCCG ATGTTGTCGT ATGTCAAAAC ATTTAGTCAA TCATAAAGGT GACTTGATTT 3240 

AACTTTATCT GATAGTCTGA TTGTAATGAT TGTACTAATT GACTGGAGGC GTATGTAATT 3300

GAATCTGAGT AAACAAATTA AAAAGTATAG GGAACGAGAT GGTTATTCAC AAGAATATCT 3360 

TGCTGAAAAG TTATATGTAT CTAGGCAGAG TATTTCTAAT TGGGAAAATG ACAAAAGCTT 3420 

ACCAGACATA CATAACTTAT TAATGAyGTG TGAATTGTTC AATGTAACTT TAGATGATTT 3480

AGTAAAAGGG ACCATTCCAT TTGTACCTGA TATTAAAGCG CAACGAAGTC TTAACTTATG 3540 

GACATATGTG ATGCTTATTT TCATGACATT AGCTGCAATT TTAATGGGAC CTTTAGTTGT 3600 


TTATTGGAAT TGGACTTGGG GTGTAACGGT GGCAATCATT TTGGGAATAG GTTTTTATGC 3660 

ATCTATGAAA ATAGAAGATT TAAAAAAAGT GCATAAAATG GACAACTACG ATCGAATTGT 3720 
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GACAAATGCG CTTTCTATTA TATCAGTAAT TGGTATACTC AGCCTCATAA TTTTCCTTAG 3840 

TGTGTATTTG GCAAATAAGT TTTTATAAAT CATCGTGGTA TCGTCTCATA TTATTTATAT 3900

TATCCAAAAT AGCATAAAAA AATACCAACA AGATTTAGAA CCTTGTTGGT AATCAAAGCG 3960

aTTCATTTAT AATGAGTCGT TTTATGTTGT AAGATTAAAC AGTTTGTACG TTAACTGCTT 4020

GGTCTCCACG TTGACCTTCA GTGATTTCGA AAGTAACTTT TTGACCTTCT TCTAAAGTTT 4080 

TGTAGCCATC GCTAGCGATA CCTGAGAAAT GTACGAATAC GTCTCCGCCA TTTTCTTGTT 4140 

CGATGAAACC AAAACCTTTT TCTGCrTTAA ACCATTTwAC TGTACCGTTA TTCATATwGA 4200 

AwACCTCCGT gTGCTTTTGC ACTTAATATT TGTAACAAAT TCATAACTAA AAAAGAGGAT 4260 

ATTCTAAACA AATACACTAC AATTTAATTC ACGAGCTTTT ATTACGTAAG ACCAACTATA 4320 

CGCTCATATT GGCATAATGT ACAGTGTTTT TTGAAAATAA ATTAAAAAAG ATTTTTAAAA 4380 

ACCTTAGAAA CGTTGATTTA AAGGGGTTTA TAAAAATwAw AAAATTGTAG TCTTTTATGG 4440

TGTTTGCTAG TTTTCAAAGT GACATATCGT TTAAACATGA TGATTTTATA AGCAATCCAT 4500 

AAAAAACAAG CAGCGATAAA CGCTACTTGT TGATATTAAA ATCTGACTTG AAAGGTCATA 4560 

GCAATGTTCT ATACCGATGG AATGTGCTTA CTTGCCTTTT TCTTCACGAC GTTTTAAATA 4620

ATAAGAGCCA CCTAATAAAC CAGCTGGAAT GCCTATCATT GGTGTTGTGA ATGAGCTTAA 4680 

TACAATAACA AGTATTGTTA AAGCAATGAC GTTATACCAA GTTACAGTCA AATTTTTCAA 4740 

ATCCTCATAT GATTGTTTTA CTAATTCTCT AAATTTCATG ATTCAATCTC TCCTTTTTTA 4800

TAAATCTTTA GATTGTCAAA TTAAGCTGGA CA 4832

(2) INFORMATION FOR SEQ ID NO: 190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5727 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 190:

CAAAGCTGTT CAAAAGGCTT ATAATTTAAA TTTAGATAAC ATACGTACAA TGGAACCTAA 60

GTTGAGATAT CAAGCGATCA ATAAAGGTAA TATTAATTTA ATAGATGCAT ATTCAACTGA 120 

CGCTGAATTA AAACAATATG ATATGGTTGT GTTAAAAGAT GATAAGCACG TATTTCCACC 180 


ATATCAAGGA GCACCATTAT TTAAAGAAAG CTTTTTAAAG AAACATCCAG AAATTAAGAA 240 

ACCGTTAAAC AAACTAGAAA ACAAAATATC TGATGAAGAT ATGCAAATGA TGAACTATAA 300 
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GTTAATCAAA TAACGACCAA CGCCACATAA GATGCGTAAC ACCAAATTAT ATCTTATGTG 420 

GCGTTGTTAT ATTTAAATCT ATAATTATGT TCAATTTAAA CATGCAATAA TGATTAAAAA 480

ATATGACATG TTAAACACAA TGTAAGCTAT TATGATGTGA AAATAGTAGC ATTGCATTTT 540

AGAAACATAG AGCGATATAA TGAATATAAG TTTTTTGAAA TTTCAGTTAA TTCTAAGGAG 600 

GTTGTTTTTA TTATGAAAGA ACAACTTAAT CAACTATCAG CATATCAGCC TGGTTTATCT 660 


CCAAGGgCAT TGAAAGAAAA GTATGGCATT GAAGGAGATT TATATAAACT TGCATCAAAT 720 

GAAAATTTGT ATGGACCATC GCCTAAAGTT AAAGAAGCGA TATCAGCACA CTTAGATGAG 780 

TTATATTATT ATCCTGAAAC AGGATCACCG ACATTAAAAG CGGCGATTAG TAAACATTTA 840

AATGTAGATC AATCACGCAT TTTATTTGGT GCGGGATTAG ATGAAGTTAT ATTAATGATT 900 

TCTAGAGCTG TATTAACGCC AGGGGATACT ATTGTTACAA GTGAAGCGAC ATTCGGTCAA 960 

TATTATCACA ATGCGATTGT TGAATCAGCT AATGTGATAC AAGTACCTTT AAAAGATGGT 1020

GGCTTCGATT TAGAAGGTAT TTTAAAAGAA GTTAATGAAG ATACGTCATT GGTATGGTTA 1080 

TGTAATCCAA ATAATCCTAC AGGTACATAT TTTAATCATG AGAGCTTAGA TTCGTTTTTA 1140 

TCTCAAGTAC CTCCACATGT ACCAGTAATT ATAGATGAAG CTTATTTTGA ATTTGTGACA 1200

GCAGAGGACT ACCCGGATAC ACTTGCTTTG CAACAAAAAT ATGACAATGC TTTCTTATTA 1260

CGTACATTTT CAAAGGCGTA 7GGATTAGCG GGTTTACGTG TAGGATATGT GGTAGCAAGT 1320

GAACATGCGA TTGAAAAATG GAACATCATT AGACCACCAT TTAATGTGAC ACGTATATCT 1380

GAATACGCAG CAGTTGCAGC ACTTGAAGAT CAACAATATT TAAAAGAGGT AACACATAAA 1440 

AATAGTGTTG AACGCGAAAG ATTTTATCAA TTACCTCAAA GTGAGTATTT CTTGCCAAGT 1500 


CAAACGAATT TTATATTTGT AAAAACmAAG CGGGTAAATG AAGTTTATGA AGCACTTTTA 1560 

AATGTAGGGT GTATTACGCG ACCATTTCCA ACTGGTGTTA GAATTACAAT TGGTTTTAAA 1620

GAACAAAATG ATAAAATGTT AGAAGTTTTA TCAAACTTTA AATACGAATA GTAAGTGGGG 1680 


AGTGGGACAG AAATGATATT TTCGCAAAAT TTATTTCGtC GTCCCACCCC AACTTGcATT 1740 

GTCTGTAGAA ATTGGGAATC CAATTTCtCT TTGTTGGGGC CCCGCCGGCA AGGTTGACTA 1800 

GAATTGAAAA AAGCTTGTTA CAAGCGCATT TTCGTTCAGT CAACTACTGC CAATATAACT 1860

TTGTAGAGCA TTGAACATTG ATTTATGTCT CAAGCTCAAT GCAGTGTGAA TGATGAGGTG 1920 

AGAGTATTCA GTGTAAAAAG CAACAATAGA TGATATTGTT TTGTATCAAT TGCTTTTTTG 1980 

CTATACTGAA TCAATACTGA TATTTTCAGG AGAAGATTAA AATGACCCGT AAATCAATCG 2040

CGATTGATAT GGATGAAGTA TTGGCAGATA CATTAGGAGA AATCATTGAT GCTGTCAATT 2100 



TTCCTGAACA TGATGGATTA ATTACAGAAG TATTGAGAGA ACCAGGCTTC TTCAGACATC 2220 

TTAAAGTGAT GCCGTATGCA CAAGAAGTTG TGAAAAAATT AACTGAACAT TATGATGTAT 2280 

ATATTGCTAC AGCAGCAATG GATGTACCAA CATCATTTAG TGATAAATAT GAATGGTTAC 2340

TAGAGTTCTT TCCATTTTTA GATCCTCAGC ATTTTGTTTT TTGTGGTAGA AAAAACATCG 2400 

TTAAAGCTGA TTATTTAATA GATGACAATC CTAGACAGCT TGAAATTTTT ACTGGTACAC 2460 

CGATTATGTT TACAGCAGTG CATAATATTA ATGATGATCG ATTTGAACGC GTAAATAGCT 2520 

GGAAAGATGT AGAACAGTAT TTTTTAGATA ATATTGAGAA ATAAAATATA TCACTTGAAA 2580

AATTTCATGT AGAAAAGATG ATGGATAGGC TATAAAGTAA TTGTGACTGA GATGAACTTT 2640 

TATGTCTTAG ACACTACAAC ACTATATTGG CAGTAGTTGA CTGCGGGGCC CCAACATAGA 2700 
GAAATTGGAT TCCCAATTTC TACAGACAAT GCAAGTTGGG GTGGsCCCCA ACATAAAGAA 2760

ATACTTTTTC TTTAGAAATT AGTATTTCTT ATGCATGAGT GTAACTCATG CATTCATATT 2820 

TTTAAGTACA CATTAGCTGT GACTAATGAT AAAGAATCGC TACATAATCA ATCATTAGTC 2880

GTTCTTTATC ATTTCCGTCC CGCTCTCAAT AAATGTTAGT CTATCTTATT ATTATAAATC 2940 

GGATGAATGT GTTAATCTAT GGCAGATTAC ACGTCATCCG ATTTTTTATA GAATTTGAAA 3000 

AAGACGCATA AACCACTATG ATTTAAAATA CAACATCAAT CATTTTAGTG gCATGCGCCA 3060

AAATTATATG TCTGTTTTTG AAACAGGGTA ATAGCTTAAA GCTAATAAAA ACGAATATAA 3120 

GGTGCGTTGA ATCTTATGAT TACACTCCAA ACCTAATATA ATATCGGGTT AAGATCATTC 3180 

CGGATGCTTA CAAATCATTG ACAGTAAGTA ACTGAATGGC ATTTGGTATA ACCTCAATAT 3240 

CAATAGGTGT TTCTAATGAA ATTTCGCCAT CAATATCAAC TTTCATTGCT GGATCTGTTG 3300 

TAAGTGAAAT CTTTTTACCA GGTATATGCT CAATACCTTG AGTAATTTCA TTCCaATTCA 3360 

TGCTATCACG CTTTTTAAAA ATATCATTTA AAATACTGAA ACTTTGTTCA TTAAAAATGA 3420 

AAGTGTTCAG TTCACCATCT TGAGGAGACA AATCAGTCaA TGGTATACGA CTACCACCAA 3480

TGAATGGACC ATTTGCTGTT AGTATCATGG TCGTTTCGCC AGAATATGTC TTATCATCTA 3540

TTGATAATTG ATAATTAAAT TGTGTTGGAT TTAGCAGTGT TTTGACAGTT GATCCAATAT 3600 

AACTCAATTT ACCAAATATA TCTTTTGAAC CATCTTGTAC GTTTTCAGCG TTTTGAACAA 3660 

TGAGACCTAA GCCAACAAAG TTGAGTGCAT ATTGATTATT TATTTTAATT ACATCGTATG 3720 

TACCAACTTG TGCAGAAATC ATTTGTTCAC TAGCTTGTTT ATGATTAGGT GCTATATTTA 3780

GCGTTTTTGT AAAATCATTA AAAGTACCGC CTGGTAAAAT GCCAATAGGG AGTTGAAGGT 3840

CATGTGTCAT AACACCGTTT ATAAGTTCGT TAACCGTGCC ATCACCGCCA AGAATAAATA 3900 
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CACCTTCGTT TTCACTCAAT TGAATAGAAA GATGCTTACA AATTGAACTT AATGCTGTTG 4020 

TAACTTCCCC AATACCTTGA TTAATATTTT TTAATCCACT GTGTTCATGG TAAAAGAGGA 4080 

CACCATGTGT ATATTTATTT TCCATAGTTT AGCCTACTTT CTAAAAATTG GTTCATTAAA 4140 

TATATATACC CACTTTTAAT TGTTAATACC AAAAATATGT TTTTAAATAG AGAAAATGGT 4200 

AATAAATGAA ATTGATTTCT ATAGAGTGGG ACGAGAAAAT ATAGTTATAG CTGTCTATAA 4260 

TGAGCATATT AAGTTTTTAT TTATACTGAT ATCTTGAATT TAATTAATAG AAACCTATAA 4320 

AAAAACAGTA AGCCATTTAA ATGACTTACT GTTTTTTGAA TTAGGCCAAC AATATTAACG 4380

TATACCTTTC ATCGCTTTGA TGATTAAAGG TGAGAATGCT AATACAATTG TTGTAACAAT 4440

AATTGCAACA ACACCTAGGA AAATAAAGTA ATTTGTTTGA CCTAGTGGTT CTATTAACTT 4500 

AACTAAAGTA CCATTGATTG CTTGTGCAGA AGCGTTAGTT AAGTACCAAA TACTCATCAT 4560 

TTGGGCATTA AATGCTTTAG GTGCTAACTT AACAGCAGCA CTATTACCCG TTGGTGATAA 4620 

GCATAGCTCA CCGATAACAC AAATAATGTA CGATAAAATA ACCCAGTTAA CTGAAAAGTT 4680 

TGATGAACCT GATGCATAAC CTACAATACC AATTAGTATG TATGACGCAC CTGCTAAGAA 4740 

CGTACCAATT GCAAATTTTA CTGGCAGGCT AGGTTGTTTA GTTCCAAGCT TTTGCCATAA 4800

AAGTGAAATA ATTGGAGCTA GTAATAAAAT AAATAATGGG TTAATTGATT GGAAGATCGC 4860

TTCACCAAAG TTTGTTTTCC AACCAAATAA GTTTAATTTC ATATCTGAAT GTTCAATTCC 4920 

ATATATGTTT AATACATTAG ACCCTTGTTC TTGAATAGCC CAGAACACCA TTCCAAGAAT 4980 

AAATAATGGA ATAAATGCTT TAACACGAGA ACGTTCAGTA TCAGTGACAT CTTTACTTCT 5040 

AATAATTAAA GTGAAGTAAA TGAnTGGTAA TGCAATACCT AATACTAAAA CAGTATTACT 5100 

AACTAAGTTA AATGATAATG AGTTAGTTAA TGCACCAATA ACGATAATTA ATACAATTGC 5160 

TAAfiACAACA CTTCCGATAA TAAGACCATA CTTTTTCTTT TCAGCTGGTG TCAATGGGTT 5220 

AGTAGGTTTC ATACCAACGC TACCTAAGTT TTTGCGGTTG AAAAGTACAT ACCATACTAA 5280 

ACCTAATGCC ATACCAACTG CTGCAATCAA GAATCCGCCG TGGAAGTTTT TAACATTAAC 5340 

AAAGTGTTGC AAAATAATAG GTGATAATAA TGCACCCATA TTAACTGACA TATAGAAAAT 5400 

AACAAAACCT GCATCCATAC GTCTATCATT TTCAGGATAT AAACGGCCAA CGATATTTGA 5460

AATGTTTGGC TTCATTAAAC CTGAACCAAT AATGATGAAG AACATTGATG TGAATAAGCC 5520 

GATTAATGCA AATGGTAAGC TTAAACAAAT ATGTCCGATA ATAATAAAGA CTGCACCTAA 5580 

TAAAGTAGCG CCTCTAGTGC CTGTAATTCT GTCAGCAATC CATCCGCCTG GTATTGATGT 5640

CATATAGATT AATGAACCAT ArACTGACAT AATTGACATA GCTGTTGTTT TATCAATTCC 5700 
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(2) INFORMATION FOR SEQ ID NO: 191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14078 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 191: 

TGGACTATTA ACGGCGaAGA AGATTTAACG AAATACTTAC AAACCAATGT TGATGGTATT 60 

ATCACAGATG ACCCAGCATT AGCTGATCAG ATTAAAGAAG AAAAGAAAGA CGAAACATAC 120 


TTCGATCGTT CTATAAGAAT TTTGTTTGAA TAATATAAAC AAAGACCTCT AAAGTTATCA 180 

AGATGATACC TTCAGAGGTC TTTTTAATGT TGCCATCTAT GGGATAGGCA ATCGTTTCAT 240 

TCGTTTATAT TCATATGACA AGTATTTGTA TGGCAATTTG GCGTCACAAA CACTTACATG 300

ATTTATTGGT GAATTATTAA TTGTTTTGTG AATGCAAAGG GTTAGAAATT GAATTGTAAA 360 

TACTTTCTAA TCTTTGTTTC GCTTTAGTCA TTTGATCCAA ATTTTTAGTG CGTATAGCGG 420 

ATTTTGCAAT ATAGTGCGCA cTAAAATATC GCGTTTTTGA AACGCATCTA AATTTAGGTA 480

CGATAATTTA TTTAAGTCAG TGTTTGCTAT TAATTCATGT AATTGATCTA CAAGCGCTTG 540 

ATGTTGATAC GTATGTGATG TAGTTTCAGA TTTGCTTGCT AATTTAATAC CAGTCGTATC 600 

AAGGAGCGCC GCTTTAATAC CAGCAACTAA ATATGTTTTG ATTTTCATTT GTGTTGTCAT 660

GCTTTGTTAC TCCTTTGATG TACATTAATC AAAAAAATTA TACACTATTG TATATTGCAA 720 

AGCTAATTAA CTATAACAAA AAGATAGTTA ATGCTTTGTT TATTCTAGTT AATATATAGT 780 


TAATGTCTTT TAATATTTTG TTTCTTTAAT GTAGATTGGG CAATTACATT TTGGAGGAAT 840

TAAAAAATTA TGAAAAAGCA AATAATTTCG CTAGGCGCAT TAGCAGTTGC ATCTAGCTTA 900 

TTTACATGGG ATAACAAAGC AGATGCGATA GTAACAAAGG ATTATAGTGG GAAATCACAA 960 


GTTAATGCTG GGAGTAAAAA TGGGACATTA ATAGATAGCA GATATTTAAA TTCAGCTCTA 1020 

TATTATTTGG AAGACTATAT AATTTATGCT ATAGGATTAA CTAATAAATA TGAATATGGA 1080 

GATAATATTT ATAAAGAAGC TAAAGATAGG TTGTTGGAAA AGGTATTAAG GGAAGATCAA 1140

TATCTTTTGG AGAGAAAGAA ATCTCAATAT GAAGATTATA AACAATGGTA TGCAAATTAT 1200 

AAAAAAGAAA ATCCTCGTAC AGATTTAAAA ATGGCTAATT TTCATAAATA TAATTTAGAA 1260 

GAACTTTCGA TGAAAGAATA CAATGAACTA CAGGATGCAT TAAAGAGAGC ACTGGATGAT 1320

TTTCACAGAG AAGTTAAAGA TATTAAGGAT AAGAATTCAG ACTTGAAAAC TTTTAATGCA 1380 
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GTTGTATCAT ATTATGGTGA TAAGGATTAT GGGGAGCACG CGAAAGAGTT ACGAGCAAAA 1500 

CTGGACTTAA TCCTTGGAGA TACAGACAAT CCACATAAAA TTACAAATGA ACGTATTAAA 1560 

AAAGAAATGA TTGATGACTT AAATTCAATT ATTGATGATT TCTTTATGGA AACTAAACAA 1620

AATAGACCGA AATCTATAAC GAAATATAAT CCTACAACAC ATAACTATAA AACAAATAGT 1680 

GATAATAAAC CTAATTTTGA TAAATTAGTT GAAGAAACGA AAAAAGCAGT TAAAGAAGCA 1740 

GATGATTCTT GGAAAAAGAA AACTGTCAAA AAATACGGAG AAACTGAAAC AAAATCGCCA 1800 

GTAGTAAAAG AAGAGAAGAA AGTTGAAGAA CCTCAAGCAC CTAAAGTTGA TAACCAACAA 1860 

GAGGTTAAAA CTACGGCTGG TAAAGCTGAA GAAACAACAC AACCAGTTGC ACAACCATTA 1920

GTTAAAATTC CACAGGGCAC AATTACAGGT GAAATTGTAA AAGGTCCGGA ATATCCAACG 1980 

ATGGAAAATA AAACGGTACA AGGTGAAATC GTTCAAGGTC CCGATTTTCT AACAATGGAA 2040 

CAAAGCGGCC CATCATTAAG CAATAATTAT ACAAACCCAC CGTTAACGAA CCCTATTTTA 2100 

GAAGGTCTTG AAGGTAGCTC ATCTAAACTT GAAATAAAAC CACAAGGTAC TGAaTCAACG 2160 

TTAAAAGGTA CTCAAGGAGA ATCAAGTGAT ATTGAAGTTA AACCTCAAGC AACTGAAACA 2220 

ACAGAAGCTT CTCAATATGG TCCGAGAGCG CAATTTAACA AAACACCTAA ATATGTTAAA 2280

TATAGAGATG CTGQTACAGG TATCCGTGAA TACAACGATG GAACATTTGG ATATGAAGCG 2340 

AGACCAAGAT TCAATAAGCC ATCAGAAACA AATGCATATA ACGTAACAAC ACATGCAAAT 2400 

GGTCAAGTAT CATACGGAGC TCGTCCGACA TACAAGAAGC CAAGCGAAAC GAATGCATAC 2460

AATGTAACAA CACATGCAAA CGGCCAAGTA TCATACGGAG CTCGTCCGAC ACAAAACAAG 2520 

CCAAGCAAAA CAAACGCATA TAACGTAACA ACACATGGAA ACGGCCAAGT ATCATATGGC 2580 

GCTCGCCCAA CACAAAACAA GCCAAGCAAA ACAAATGCAT ACAACGTAAC AACACATGCA 2640 

AACGGTCAAG TGTCATACGG AGCTCGCCCG ACATACAAGA AGCCAAGTAA AACAAATGCA 2700 

TACAATGTAA CAACACATGC AGATGGTACT GCGACATATG GGCCTAGAGT AACAAAATAA 2760 

GTTTGTAACT CTATCCAAAG ACATACAGTC AATACAAAAC ATTACGTATC TTTACAACAG 2820 

TAATCATGCA TTCTATGATG CTTCTAACTG AATTAAAGCA TCGAACAATC GGAAGCATAT 2880 

TTCTAAATTA TTTATTCATT ATAGTCTTAA ACATAACATG ACCTAATATA TTACTAACCT 2940 

ATTAAAATAA ACCACGCACA TCTAAGTGAT ATACGACAAT CACAGCAATA ATAATTGCTT 3000

TAGAAAGTCG TGCCGAACTG GAACTTACAA GTCTAGTTCG AACACACACT GATGTGAGTG 3060

GTTTTCTTTA TTTTAAACAT GAACAATCAG ATAAGTTACT AGCATTAGCA AATATTATTA 3120

AATCAAAGGG CTTCGATTCA TAAAATTTAA AACAATGATT AAAATTAGAC GTGTAAATGT 3180 
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TATTTCACAC AGCTTCATTA ATAAAACGAA ATTGCTTCAA 


CCCGCTTCAA CTTCAACTGG 3300

CTTCAACTTC AGCCTACTTC ATTCAATAAC AAAACGAATC CGCTTCATCC AAAATCAACC 3360

ATTCTAACGC ACATATTCAA ATATAGCAGC TGCACCCATG CCGACACCAA TACACATCGT 3420

AACCATGCCG TAACGGCTAT CGGGACGTCT ACCCATTTCA TTAAGTAAAC GCGCGGTTAA 3480

CATTGCGCCT GTAGCACCTA ATGGATGACC TAAAGCAATA GCGCCACCAT TCACATTCGT 3540

ACGTGATATA TCTAGACCTA CTTCTTTAAT AGATGCAATC GTTTGAGAAG CAAATGCTTC 3600

GTTCAATTCG ATCAAATCAA TGTCTTCAAC AGATAGATTG CTGAGTGACA ATACTTCAGG 3660

AATCGCATAT GCAGGCCCAA TACCCATAAT TTTCGGGTCA ACGCCTACTG CCTTAAAACC 3720

AACGAATCGT GCAATAGGTG TCACGCCGAG TTCTTTCACT TTATCTCCAG ACATTAAAAC 3780

TACAAATCCT GCACCATCAG AAAGTGGGGC AGATGTTCCT GCAGTCATAG TGCCGTCAGC 3840

TTTAAATACT GTACGTAATT TGGCTAATGC CTCCATCGTG GTGTCAGGGC GTATAAATTC 3900

ATCTTGGTCA AAGATATTTG TGTGTACTTT TGGTCCTGCG TTTGTATATT CAACTGAGTT 3960

TACTTGTATT GGAATAATTT CATCTTTGAA CCGACCATCA CGTTGTGCGT CATAGGCACG 4020

TTGATGACTT CTGACAGCAT AAGCATCTTG ATCTTCGCGT GATACGTCAA ATTGGGATGC 4080

TACATTTTCA GCAGTTAAAC CCATAGGATA TGACGCACCT ATATCATCAT ATTGTAAGGT 4140

TGGATTGTTT GTGGGCTCGT TGCCACCCAT TGGTACGGCA CTCATCAATT CAACGCCACC 4200

AGCTACAAGT ATATCTCCTT GACCAGCCAT AATTTGATTG GCTGCAATCG CGATGGTTTG 4260

TAATCCTGAT GAGCAGTAGC GATTCACTGT TTGACCCGGT ACCGTGTCAG ATAATCCCGC 4320

ACGCAATGCA ATCGTTCGTG CAATGTTTTG GCCTTGTAAT CCTTCTGGAA AAGCCGTACC 4380

AACAATGACA TCTTCAATCA TATTCTTATT GAATTTTCCG TCAATACGTT TCAATACGCC 4440

TTGTAATACT TTGGCTGCGA CATCATCAGG TCTTTCGTGG AATAATGCGC CTTGCTTTGC 4500

TTTCGCTGCG GCTGAACGCC CATAAGCTAC AATGTATGCT TCTTGCATGG TTATCATCCT 4560

CTCTTAATGA CTATCTTTTA ATTACGTAAT GGCTTACCAG TTTTTAACAT ATGTGCAATT 4620

CTTTCATATG ATTTTTTAGA TTTTAGTAAG TCAATAAAGC CAATTTTCTC CAACGATTGA 4680

ATGTAACGTT GATTGATAAA TGTATTTCTT GGTAAATCAC CACCCGCTAA AATTGTGGCG 4740

ATATTTAAGG CAATATGATA ATCATGGTCG CTAATAAAAT GACCCCGTCT TTGCGCATCT 4800

AATTGTCCTT GGATCAATGC TTTGAAGTCT TCACCTAAAG CGATATATTG ATGTCTAGGA 4860

TTCGGAATAT AGTTTGTTTC TGCTTCATAT TTCGCACGTT TGAGCGCAAC TTCGACACGT 4920

TGTGCTGTAT TGAAAATAAT CGTATCTGTA TCACGTAAAT AACCATAACG ACGTGCCTCA 4980
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TGTTTGTCAT CAAACTTATG CGATGTGCGT AATATGCGAT CAGCCATTTC TGCAAGGCCA 5100 

CCGCCACTCG GTAATAAGCC AACACCTGCT TCAACAAGAC CGATATATGT TTCACTTGCA 5160 

GCGACAACAA TAGGTGAGTA AAGTACAAGC TCACAGCCAC CGCCTAAGGC ACGACCTTGA 5220 

ACAGCTGTGA CTACTGGTTT CAAACTATAC TTCAAACGAT TAAAGCTATA ATGTAATTTA 5280

TCAATTGATT GTGCAACGAC ATCATCTACA AGACCGTCTT CATGCGCCTT TTTCATTAAG 5340 

AAAAGGTTAG CACCCACACT GAAATTGTTA CCATCTGCAT AAATAACCAT ACTTGTGTAA 5400 

TGGTCATTTT CCAGTAAATC AATCGCATCA ACTAACGCAT CG7TGAATTC ATCGGTAATG 5460 

ACATTATTTT TACTTTGTAA TTTCAGTAAC AGTTGATCAT CATGAGTTAC GGAAAGTTTG 5520 

GCATCACCTT TATCCCAAAG TTCATCTTTT ACGAAGTGAG AAATAGGTGT TGCATATTCA 5580 

ATGGTCTCAT CTTGTTTATA AAAGCCACCA TCTAAATCAC TAATCCATTG TGGTAAGTCT 5640

CCAAGTTCGT CTTCCATACG TGTTTTAACA CGTTCGTATC CCATTGCATC CCATAATTGG 5700 

AATGGACCAA GTTTCCAGTT GAACCCCCAG ACAAGCGCAC GGTCTATGTC TCGGAAATCA 5760 

TCGGTAGCTT TAGGTACATT GATAGCAGAG TAATAGAAAT TATTACGTAA TGTCTCCCAT 5820 

AAAAATAGTC CCGCTTCGTC TTGCGCATTG AATATGGTAT CAAGGTTATG CACTAAGTCT 5880 

TTATTAAATT CATTTAAAAT TGGTAATTGT GGTTGCGATA CAGGTACATA ATCTTGTTTT 5940 

TCAACATCGT AAACAAGTCG AGCTTTAGTT TCTTTATCCT TTTTGTAAAA TCCTTGTTTC 6000 

GTTTTACGTC CGAGTGCGCC ATTGTCAAAC AACGTATTTA CAATTTTGAC ATCATGAAAA 6060

TAAGGTGTTT CTTCAGGTAC TTGTTGCATG CCTTTAATTA CAGACACTGC AATATCTAAA 6120 

CCGACTAGGT CAGATAGCGC ATATGTACCT GTTTTAGGAC GACCAATCGC TTGCCCAGTT 6180 

AAAGCATCCA CATCTACAAT GCTTATCTTG TGTTGCTCGG CGCGATACAT AATATCATTC 6240

ATTQTTTGCG TGCCGACTCT ATTTGCGACA AAGCCAGGCA CATCATTGAC GACAATGACA 6300 

CCTTTACCTA ACACATTTTG CGCGAAATTT TTTACATCTA ATATAATAGA TTCCTTCGTG 6360 

TGTGACGTAG GTATTAACTC CACTAATTTC ATAATACGTG GTGGGTTAAA GAAATGTAGA 6420 

CCAAAGAATC GTTCTTGATC CTTCTCGTTA AATGCTTGAG CAATCGCATT AATTGGAATA 6480

CCTGATGTAT TTGTAGCGAA TAAAGCATCT TCTTTAGCAT GTTGTAGAAC TTGTTGCCAA 6540 

ACAGCATGCT TAATTTCAAT ATCTTCTTTG ACTGCTTCGA TATATAAATC AGCATCATCA 6600 

TTTACCAAGT CATCATCAAA ATTACCATAT GTTAAATGAC TCGCTAGATT TAAGTCGAAT 6660 

AGTAGCGGCC GTTTCTTATC TGTAATTTTA TCGTAAGATT TTTTCGCAAT GAGATTTGGA 6720 

TCGTTTTTGT CCACTACAAT ATCTAATAGT TTTACTTTAA GTCCAGCATT CACAAAAAGT 6780 
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GTGATTCCTC CAATTTAGTT GAGGATAAGA TAACCATTAA GATAATTGGA ATAACGTTGC 690 0 

TATTTTATAA AATTAATTAA GTATCTTTGA CAGTCATCTT AGCCTCTTAT TTAAGGAAAA 6960 

AGCTTTATGC TTAAAATAAG TCTTTTTTAG TGAAATTAAT GCATCTCATA TAATTATTTG 7020

CTATTTATAC GAAAGCAGAA TCTCCAGTCA AAGCGCGTCC AATTACTAAG GCATTAATTT 7080 

CATGTGTACC TTCGTACGTG TAAATCGCTT CTGCATCAGA GAAGAAACGT GCAATATCAT 7140

AATCGTCAGC TAGTATGCCA TTACCACCTG TAATACCGCG GCCCATAGCT ACTGTCTCAC 7200

GCAAACGTAA GGCATTCATC ATCTTCGCCG TTGAAGTTGC AACCTCGTCA TATTCACCAT 7260 

GTGCTTGCAT ATTAGCTAAT TGAGCACATG TTGCCATTGC TTGAGCTAAA TTACCTTGCA 7320 


TCATTGCTAG CTTTTCTTGT ATTAACTGAT ATTTACTAAT TGGTTTGCCG AATTGcTTAC 7380 

GCTCAGTGAC ATAATCTAAT GTGGCACGTA AAGCGCCAGC CATACCACCT GTAGCCATAT 7440 

AAGCAACGCC TGCTCTCGTT GAATAAAGAA TTTTGGCAAT ATCTTTAAAG CTTGTTATGT 7500 


TTTGTAAGCG ATCCGCTTCA TCTACTTTGA CATTAGTTAA TTTAATTAGG GCGTTAGGAA 7560 

CAATGCGAAG TGCGATTTTA TTATCAATGA CTTCAATATC GACGCCATCT TGTTCTGGTC 7620 

TGACTACAAA GCAATGGGGT TTGCCAGTTT CTTTATTTAC TGCGAATACT GGAATGACAT 7680

CAGATACATG TGCACCACCA ATCCATTTCT TTTCACCATT GATAACCCAA GTATCGCCTT 7740 

GGCGTTCAGC GACTGTTTCA AGACCTCCCG CAACGTCCGA ACCGTGTTCT GGTTCAGTTA 7800 

AAGCAAAGCA TGTACGCAGT TCATGTGACT GTAATTTAGG TACATATTTC GCAATTTGTT 7860

CTTTGCTACC TCCGAAATAG AAAGTGTTAT GCCCTAAACC TTGGTGAACA CCGAGTAGGG 7920 

TAGCTAAGGA AATATCAAAT CGCGCGAGTA GGTAAGACAT GAAAAACTGA AATAGTTGAC 7980 


TAGGCATTTT GGCGTTTGGA CGATCCTTGT AAAGTAATGG ATTGTTAAAA TAATTTAATT 8040 

CTC&AGATC TTTAAAATAG TCCTCGGGTA CAGTAGCGTC TATCCAATGT TGATTAATAT 8100 

TTTCACGGTA CTTACTTTCT AGCAATGAAT CTACTTGTTG TAAAAATTCG ACTTCACCGT 8160 


CTGTTAAACC TTTAGCAATA CTAAGTACAT CTTCAGGAAA TAATGTTTTT AAGACCGTTT 8220 

CTTTTTCAAA TGTCATATAA ATTCCTCCTA AAAATAATAT GAATACTAAT GTGAAATGCA 8280

TTTAATTCAA AAACAACACG CTTTATTTGT AAACGCTTAC ACTAAATGTC AAAAATTTTT 8340 


ATCACCTTTA AAGTGTTTGC GAGACTTTGT CATTCATCAT TTGTCGAATC GCAAGTTTAT 8400 

CTGGTTTCTG CGTACTGTTT AACGGCATAT GTGTCACTGG TACATACATT CTTGGGACTT 8460 

TATAACCTGC TAAACGACTt CGCATATGTT GATTTAAAAT TTCAGCGTAA TGAGGTTCAT 8520

CTTCGCGAAG TATAATGGCT GCAGCAATTG ATTCACCATA TTTTGGATGA TCATAGCCAA 8580 
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AGACATTTTC GCCACCAGTT ATGATTAATT CTT TTT T G CG GTCAATAATA AATATATCGC 8700 

CATCGTTGTC CATCTTCGCT AAGTCACCAG TTAATAAATA TCGACCATGA AATGCTTTGG 8760 

CAGTCTCTGC TGGTTTATTC CAATATCCTG GCGTGACATT TTTAGCCTTA ATTGCAAGTT 8820 

CGCCAATCTC ACCAGTAGGT ACTTCCTCAC CGTTATCATC AAGGATACGT GCATCAACGA 8880 

ACATGACTGC TTTACCAATA CTCATTGGCT TACGTTTTGA ATTTTCCGGT GTATTAACAA 8940 

GTACAAGAGG TGCTTCAGTT AAACCATAGC CGTTAATAAT GTTTATGCCA TATTGTTTAA 9000 

AAGCTGCTTG GATACTTGGT AATGGTTGTG AACCACCTTG GATGATATAA TCCATAGCTC 9060 

TAAAATTTTC AGGATTAAAA TTACTAGCAC GTAGCGTACT ATAATACATT GTCGGAATCA 9120 

TGATAATAAA TGTAGGGTGA TATTGTGCAA TCATGTCATT CAATTCTTCG CCGTTAAAGT 9180 

AACGTTGAAG AATAAGTGTG CCACCTGACA TTAATACTGG TAATACAGTA TCGTTAAACC 9240 

CTAAAACATG GAACATTGGT GTTGATACAA TCGTAATATA GTTTGAATTG AACTTATACG 9300 

TCAGCTCTAA GTTTGCACCG TTATGAACAA ATGATTCATA TGAGAACATC ACACCTTTAG 9360 

GTGATCCGGT TGTACCACTT GTATAAATTA ATGCTGCAAG ATCTTGTGGT TCAACAGGTG 9420 

TTGCTTGAAA AGGTTGGTGA TAATCTGGAT TTACGATTTC ATCATATTGC GCCACATCAA 9480 

TATCCATATG CAATAAGTTT TGGTCAATAT CGGTGAGTGA ACTTAAATGT TTTTCAGCAT 9540

AGAAGAGCAG TTTTAATTGT GCATCTTCCA CAATGGCTGC AATTTCTTTT GGGTTAAGCC 9600

GCCAATTCAA TGGTAAAAAA ACCGCACCTG TTTTAAAACA AGCAAACAAT AAATCTAATA 9660 

TTGCAATATC ATTTGGCGCA AAAATACCGA TAACATCGCC TTTTTTAACA CCTTGAGATG 9720 

TTAAATAATG TGCCATATTA TCAGCGCGTG CATTGAGTTG TTGGTATGTC CAAGATGTTT 9780 

GTTTTGCGTG ATCAATAAGG GCAGGCTTGT CATCATCGAA GTCTGAACGC GTTTTTATCC 9840

AATCGAAATT CATTAGTATA CCCCCTTTAG CTTCACTTTC ATACTTTATG AATTGATTGT 9900 

TTAAGTTGTC CCCATTTTTC TTTGTAAATG CTGGTATCAA TTAATTTTAA ATGATCAGCA 9960 

ATAATTGGTT TAAAAGCCAT TTGATTCAAA ATATCTTTAT GCAAATCAAG ACCTGGTGCA 10020 

ATTTCAATTA GTTTCAAGCC TTGATTGGTG AGTTCGAATA CTGCACGATC AGTAACAAAA 10080 

TAGATTTCTT GCTCGAGTGA TTGTGAATAT TGTGCATTAA AGTCGATATG GCTCACATCT 10140 

GATACAAATT TCTGGTTTTG TCCTTCAGTT TCAATGTTTA ATCGTTGATT ATGGCATGAG 10200 

ACATGACTGC CAGCTACAAA AGTACCTGAA AAGATAATTT TATTTACAGA TTGCGTAATG 10260 

TCTATAAAGC CACCACATCC ATTTAGTCGG TCATTGAAGT AAGACACGTT GACATTGCCG 10320 

TATTGATCAA CCTCAGCAAA GCTAAGATAG GCAACTGATA CACCATTGTT ATAAATAAAA 10380 
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CGACTCCCAA CGAATCCACC GAAAATGCCA ACATCTAAAA TCGGTTGCAC ATCATGTTCA 10500 

ACACATTCTT CATGCAATAA ATTAGAGAGT TCATTATTGA TGCCATAACC GATGCTAATT 10560 

GTATCGCCAT AAGTTAAAAA CTGAGCAGCA CGTCGGAGAA TCAATTTGCG ACTATTAAAA 10620

GGTAATGCGG GTTCAGGTAT TCCATCAATT CGTTCTTCTC CAGACAAGGC TGGTAAATAA 10680 

TGACTTTGAA TTACTTGGCG GTGATTCTTT TCATCTTCTG TGACGTATAC ATAATCGACA 10740

AGATTTCCTG GGATAACAAC TTCATTCGGT TTTAGTTGAT AGTCGTCAAC TAAAGCTTTA 10800

ACTTGTACAA TAACTTTCCC ATGATTGGCT TTCGCGTTTA ATGCGACATG ATAACACTCG 10860 

CTCAAGTACG CTTCTTGAGT TAAATAAATG TTACCTTGTT GATCTGCGTA TGTTCCTCTC 10920 

AGTAGTGCCA CATCAACGCT AGGGAATGTG TAATGTAAGT ATGTTTCATC GTTGATGGTT 10980

ACTAATGAAA CTAAATCATC CGTTGTTCGT GTATTTACTT TACCGCCACC GTATCTAGGA 11040

TCAACAGCTG TGTTTAATCC GATTTTAGTA ATAACTCCAG GTAATAATTG ATTACTCTGA 11100

CGATAATGAG TTGCAATGAT ACCTTGTGGT AAAAAATAAG CTTCAATGTC ATTATTTTTC 11160 

ATTGcTTGTG CCGTTTTGGA AGAAGCCGTT AAAATACTCA TAATGACACG TTTAATCATG 11220 

CGACGTTCTA TAAAATCATC TAAATCCGGT GCGGCACCTA AACTATGAAT ATCATTCGCT 11280

AATATAAACG TTAAATCATT GGGCGTATGA TATGTGTCAT GTTGCGCTAA CACAGCACGT 11340

AGAACTTCGG CGGGTAAGTT GGCTACAGCT AATGCTGGTA AACCAATCAC ATCACCATCT 11400

TTAATGATAT GTTGTAAGTC GTGCCATGTG ATTTGTTTCA AGCAAGTCAC CTCCATCACA 11460

TTTGATAAAA TATAGCGTTT TTACACTTTG TGTAAACCCT TaCAAGAAAT ATAACATAAC 11520 

GACGTTTAAA ATCAATTAGA AATATCTTTT TATTCTGATA ATAGACACAG TATAGACACA 11580 


TTTTGATGGT CGATAACAAT TGTAATATCA AGGGTTTGTA ATGAATTGAA TATCATTAAA 11640 

ATACTTATAT AAAAATATTG TTCGGAATAT AAAAAGTTAA ATAGGTTTTG ATTTTTAAAT 11700 

ATGAAATACA AAGTGCCCAA TCGAACAAAG TATTTATATT AAAATATGGA AAATCCATCA 11760 


ATATTAAATT AAAATAGTTT TATTATGAAA AGTGAAAGTA GGTAAGTCTA TGGAAGGTCT 11820

TAATCATCGA AGAAATACAG AAAAAGAAGA GACAACACAA ACGCAATCaG TTGCACCTAA 11880

TACAGGTGAA GAGGGGATGT CATCAGCAAG TACACAATCA ACTAAGACGT CCGACATACA 11940 


TAATGAATCT ATCGATAAAC AAATGGAAGC TAAAGCGCAT GAAACAGCGC AAAATACAGA 12000 

TTTAAAAAAC GAAGCAAGAA GTTTATTTGA TAATGCAACC AAATCAATCG GTAGACTAGC 12060 

GGGCAATGAT GAAAGCTTAA ATCTTAATTT AAAAGATATG CTTTCTGAAG TATTTAAGCC 12120

GCATACTAAA AACGAAGCAG ATGAAATATT TATAGCGGGT ACTGCTAAAA CTACGCCAGC 12180 
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TTTCACAGTA 


ACATTTATTG GATTATGGGT CATGGCAGCA ATTTTTAATA ACACTAACGC 12300

GATTCCGGGT CTCATTTTTA TAGGGGCTTT AACAGTACCA TTATCGGGTT TGTTCTTCTT 12360

TTATGAATCA AATGCGTTTA AAAATATTAG CATTTTTGAA GTTATTATCA TGTTCTTTAT 12420

TGGCGGCGTA TTTTCATTAC TAAGTACGAT GGTATTATAT AGATTTGTCG TTTTTAGTGA 12480

TCAATTCGAA AGGTTTGGTT CTTTAACATT TTTCGATGCA TTTTTAGTAG GATTAGTTGA 12540

AGAAACTGGA AAAGCACTCA TTATTGTTTA TTTCGTCAAT AAATTGAAAA CAAATAAGAT 12600

TTTGAATGGA TTATTAATCG GTGCTGCTAT TGGTGCAGGG TTCGCAGTTT TTGAATCAGC 12660

AGGTTATATT TTGAATTTCG CTTTAGGAGA AAATGTCCCA TTATTAGATA TTGTCTTCAC 12720

ACGTGCGTGG ACTGCGATTG GTGGTCATTT AGTTTGGTCA kCGATTGTTG GTGCTGCAAT 12780

AGTTATTGCG AAAGAACAGC ATGGCTTTGA ATTCAAAGAT ATTTTTGATA AACGCTTTTT 12840

AATATTCTTT TTATCAGCCG TTGTTTTACA TGGCATTTGG GATACATCTT TAACTGTACT 12900

TGGCAGTGAT ACGTTGAAAA TATTTATTTT AATCGTTATT GTGTGGATAC TTGTATTCaT 12960

TTTAATGGGG GCAGGTTTAA AACAAGTGAA TTTACTGCAG AAAGAATTTA AAGAACAACA 13020

GAAAAAAGTA GACGAATAAT AATTAAAGCT TATGTTGCTC ATATGTTTGT GACATAAGCT 13080

ATTTTTATAA TTTGTCTTTA AAAGAGTGGA ATAGGAATAC TTTTTGGAGT TAAAAAAGTG 13140

TTtCACGTTA AACAAATAGT GACAATTAGA TTTATATAAA ATGAACATGA TTCACTGAAA 13200

GTATGTAATA ATCATTTTAT TGAAATTCAT CAAACAGAAA TTAATACAAT CATATAAGCA 13260

AATTAAACCA CGCCATAATC ATATTGGATG ACTTCGGCGT GGTTTTTATA GTTGAAGCAG 13320

GGCTGAGACA TAAATCAATG TCCCACACTC CCTTATCGTT CAATCGTTGT TCGATAATCG 13380

ATTAAATAGA TACCTTCAGG TGTTACTTTA TAATTTTTAA CCTTAGAGTT AGCAGCGACT 13440

ATTD5ATCGT TGTAAGCAAT ATAACTGTTT GGTACATCTC GACTTGATAA TTTAATAATA 13500

TCAtTAGAAA TATTGTGACG TTCCTTAACA TCTACAGTAT GATTCAATTG ATTAATTAAA 13560

TCATCGACGT TGCTATTATT GTAGTCTCCT TTATTAATAG CACCATCTTT TTTATATGCT 13620

TGATTAAAGA AATAACCTGT ATCTCCACGA GGAATTGTTC CGAAACTATA CATCGTTGCA 13680

TCCCATGCAG AACGGTCTTT TAAGTAACCT TCTATGTCAT CAACACTTTT AATGTCGATT 13740

TCAATATTTG CTTTTTTAGC ATCTGATTGT AATACTTGCG CAATTTTCGA TAGCTCTGGA 13800

CGACCGTCAT ACGTAATTAA CTTAATTTTT AAAGGGTGTT CTTTTGTATA ACCATCTTTA 13860

GCTAATAACA TTTTTGCTTG TTCGATATTT TGTTTGGTTA ACTTAGGTTC TTTAATATAT 13920

GGAATTTTAT CATTAAATGG ACTCGTTGCA GGTTTCGCAT AACCTTGATA AATATGATCT 13980
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TTATtAGTAT GATTATACAT AAGTaAGAAG TTCTAAAn 14078 
(2) INFORMATION FOR SEQ ID NO: 192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 486 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 192: 

TGAAAACTAA AGTGTTTCTA ATGCGTGACT AAAATTAGTA ATAATTAAGT TCTCATGATA 60 

ATAGGTATTT TTGAAAAATG GAGGAGTCTA TAAATGGGTA AAAAAATGGG TCTAGGTTTA 120 

TCTATTGCAT TGGTTGTTAT TGGTATTGCC GTTGTATGTT TAATGATTTT TTCTAGTCAA 180 

AAAACGACTT ATTTTGGTTA TATGAATAGT AATACAAATG CAGAAAAAGT TGTCAGTGAA 240 

AAAGATGGAT TAGTCAAACA TAATATCAAA GTAGAACCAT CTAATGATTT CAAGCCGAAA 300 

AAAGGAGACT TTGTAAAATT AGTTTCTAAA GATGATGGGA AGACATTTTA TAAACAAGAG 360 

ATTGTTAAAC ATGATGACGT CCCACACGGT TTAATGATGA AAATTCACGA CATGCATATG 420

AATTAATAAA AAAGCATCTA TAACGTAATT TTGAAGAAGT AGAGTTATCT TCTTATGCGT 480

TTTAGA 486
(2) INFORMATION FOR SEQ ID NO: 193: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1626 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 193: 


GAGGTCTATA TACAATTATG GTTGTTCCAG TTAAACGAAC TGATGGCTTT ATTACTAAGT 60 

TTAATAGATT AATTGAAAGA CGATTATTAC GTCATTTCAG TAAAAAAGGT TATATCACAT 120 

GGGAGGAAAA TTGATTGTCT GACATTTTAA AATGTATCGG TTGTGGTGCG CCACTTCAAT 180

CTGAAGATAA AAATAAACCT GGTTTTGTAC CAGAGCATAA TATGTTTCGT GATGACGTGA 240 

TTTGCAGACG TTGTTTCCGC TTGAAAAATT ATAACGAATT CAAGATGTAG GATTAGAAAG 300 

TGAAGACTTT TTAAAATTAT TATCAGGACT TGCGGATAAA AAGGGTATTG TCGTCAATGT 360

CGTGGATGTA TTTGACTTTG AAGGATCATT TATTAATGCA GTTAAACGTA TTGTCGGAAA 420 
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TCGAGTTAAA GAATGGTTAA AACGAACAGC AAGAAAATAT GGTTTGGAAG CTGACGATGT 540 

CGTATTAATT TCAGCTGAAA AAGGCTGGGG CATAGACGAC TTATTATCAT CAATTGCGAA 600 

TATTCGAGAA AATGAAGATG TGTATATTGT AGGGACAACG AATGTTGGGA AATCTACATT 660 

GATTAATAAA CTGATTGAAG CTAGTGTTGG TGAAAAAGAT GTAGTAACAA CTTCAAGATT 720 

CCCTGGAACA ACTTTAGATA TGATAGATAT TCCTTTAGAT GAAACATCAT TTATGTATGA 780 

TACACCAGGT ATTATTCAAG ATCACCAAAT GACGCATTTA GTTAGTGAAA AAGAATTGAA 840

AATTATTATG CCTAAGAAAG AAATAAAACA ACGCGTATAT CAATTAAATG AGGCGCAGAC 900 

ATTATTCTTC GGCGGTCTAG CGCGCATAGA TTATGTATGA GGTGGTAAAC GTCCGTTAGT 960 

TTGTTTCTTT TCTAATGACT TGAATATACA TCGTACTAAm ACGGAGAAGG CTAATGATTT 1020 

ATGGCGTAAT CAACTTGGCG ATTTATTAAC GCCACCTGGA AATCCACAAA ATTTTGATCT 1080 

TAATGAGGTA AAGGCTGTTA GACTTGAAAC AGGCAAAGAG AAACGCGATG TTATGATCTC 1140 

TGGTCTAGGC TTTATAACTA TAGGACCAGG GGCTAAAGTA ATCGTTCGTG TTCCTAAAAA 1200 

TGTTGAkGTT GTATTAAGAA ATTCTATTTT ATAAGGTGaT TAAAAAAATG AAATTTGcAG 1260

TTATAGGAAA TCCTATTTCA CATTCCTTGT CGCCCGTTAT GCATAGAGCA AATTTTAATT 1320 

CTTTAGGATT AGATGATACT TATGAAGCTT TAAATATTnC CAATTGAAGA TTTTCATTTA 1380 

ATTAAAGAAA TTATTTCGAA AAAAGAATTa GAAGGCTTTA ATATCACAAT TCCTCATAAA 1440 

GAACGTATCA TACCGTATTT AGATTATGTT GATGAACAAG CGATTAATGC AGGTGCAGTT 1500 

AACACTGTTT tGATAAAAGA TGGCAAGTGG ATAGGGTATA ATACAGATGG TATTGGTTAT 1560 

GTTAAAGGAT TGCACAGCGT TTAnCCAGAT TTAGAAAATG CATACATTTT AATTTTGGGC 1620 

GCAGGT 1626 

(2) INFORMATION FOR SEQ ID NO: 194:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 635 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 194: 

AGGGTTAATT GTCGGTTTAA TTGCAATGAA TAAGTTCCAT GTATTAGCTG GCTATAGAGC 60 

GAAATTCATC TTAATGGTGA TTTTAACTAT GATGGTCTTC GTACTTATTA ATACGTATTT 120 

ACTAAGACAG GTAAAATCTA TCGGTATGTT CTTAATGATT GCTGCATTGG GTCTATACTT 180 
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GTCTTATATC GATAACATGT TCTTCAATTA TTTAAATGCA GAGCATCCTA TAGGCTTGGT 300 

GCTAGTAATA TTAACAGTAC TTGTGATTAT TGGCTTTGTA CTGAACATGT TTATAAAACA 360 

CTTTAAGAAA GAGAGATTAA TCTAATGTTG ATGAATAGCG TGATTGCTTT AACTTTTTTA 420 

ACAGCATCTA GCAATAATGG CGGACTTAAT ATTGATGTGC AACAAGAAGA GGAAAAGCGA 480 

ATCAATAATG ATTTAAATCA ATATGATACA ACGCTATTTA ATAAAGACAG CAAAGCGGTT 540

AATGATGCGA TTGCTAAGCA GAAAAAAGAA CGACAACAAC AAATAAAAAA TGATATGTTT 600 

CAAAATCAAG CGAGTCACTC GACTCGCTTG AATGA 635 
(2) INFORMATION FOR SEQ ID NO: 195: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13715 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 195:



CTGAAATGGG TATTATTTGT CTTCTTCATC ATAAAGTAAT AAAGATTGTT CATCATTGCG 60

ACGTTGCCAA TTTTCATTTG GCGATCTTGG AACGTATAAT AATTGCCTAG TAGCATACGC 120

TTTGATTGAA ACATACAAGT CATTCCTTGA ACTTGCACCT CAATTTCCAC ATTTGAATTT 180

TCTTTTATAT TAATAATTTC ATCCAAATTC AGCTCACGTG CTAAGACAGC TCTTGATGCG 240

CCTCTTTTAC CCCAGTAATT ACATTGAAAA TGATTAGTTA CTAACGTCTC TGCATTCCAA 300

TGAAGTGGTA TTGGATTTTC TTGCGCCTTC ACATACATTA CTACTGCTGG ATCCCCGAAA 360

ATAATTCTGT CAACTCGTAT TTCATGTAAA AAATTAATAT AATCTTCTAC AGCATCTAAA 420

TGAT&ATTAT GAAATAATCC ATTCACTGCC GCATATACTT TTTTATCGTT TTTGTGAGCT 480

AATGCGACAG CCTCTGTCAT TTGTTGTCTA TTGAATTCCC CTGGAAGTCT TAAACCAAAC 540

TTTTGCTCGC CAATTACAAA AGCATCTGCA CCTAAATCAA TAAGTGTTTC CATATGGCTT 600

AATGACTTGG GTGTGACAAG TAATTCTGTC ATAGTCATTC TCCTTTAATT GAAATCGCTA 660

ATCCATCGTC TATATTTAAA AAATTCGTTG TATATCCTGG TTGCTTTATT AACCACTCAT 720

TATAATCTTG AACCTTTTTA ACCATTTGTC TTACATTTCT CGATCTAACA ATCCCAATAT 780

CCGATACAAA ACCGTGATAT AAAACATTAT CTGTAATTAC GAGACCTTGG TGCTTTAAAA 840

GTGGTGTATA TATTTCAAAA AATTTCTTTG ATTGCGCTTT TGCTGCATCA ATAAATATCA 900

TATCATAAAC TTTGTCATTT ACATTTTCAA ATTGCTCTAA AGCATTACCT TCAATAATTC 960
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CATTACGCTC TATCGTTGTG ACATGAATGT CATCAGATAT AGAAGCGAAT TGCATAGAAC 1080 

TATAGCCGAT TGCTGTACCA ATTTCTAAAA TATTTTTAAC ATTATTCATA CGAATTAATT 1140 

GCTTAATTAA ATCTAATGTT AAACGATCTA CAATTGGCAC TTCATTTACC TCGGCAAATT 1200 

CACGCAAAAC TTCGATTGAA CTATTTTGAT GTTGATGTAA ATCTATTAAA TATTTTTTAT 1260 

TTAGGTCATC CATGTTTTAA ACTTCCTTTA TGTAAAATAA GTCAATATGA TTATGACAAT 1320 

AAAATAAATC AGCCTTCACA ATTGATTATA ATTTTGCCAA CCAATTAAAT GACTGATTTC 1380 

GTGTTAGACG CAAAGCTATT TTATTTATAG AAGCGAATCA TTCATATAAA ATTTAACTTT 1440

AGATATTTTA CCATATTTTC AATAAAATTA TAAGCGTTAA TTATTTATAC ATTGCTTGAC 1500 

TTAAAAAATA CTCTTGCCTC CCCATCTTTA AGGTTAGCAA GAGTAAAATC TTTTTAATTA 1560 

TTCTTCCATT TCAGTATTTA CAACTTCTTC AATCATGTCC CATTCTTCAT CAGTTTCGAT 1620 

TGGTACTAAC TTACCACCGT CACCTGACTC ATCTGGTTCA TTGATCATTG GTACAAGCTC 1680 

AATCATATCG TCTTCATCTG ATTGAGCACC TTCTTCAGCT AAGATAACAT ACTCTTTTTT 1740 

GAATTCAGGA TGATAAAATT CTAAAACTTT TCGGTATAAA ACTTCATTTC CCTCTTCATC 1800 

GAATAAAGTT AATAATTCTT CTTCGTTATT AATTTCTAGT TGTGAATCAT GATTATGTTC 1860

AGTCATAGTA AAATCTCCTT TTAATGTAGT GAATCTAAAT AGCCTTGTAA AATAAATACC 1920 

GCTGCCATTT TATCAATCAC TTGTTTTCTT TTTTGTCTTG AAACATCTGC TTCTAATAAT 1980

GATCGTTCAG CAGCCATTGT GCTTAATCTT TCATCCCACA TCACAATCTC AATAGAAGGA 2040

TAAGCTTCTA ATAATTTTTC TTTATATGTT AACGAAGCTT CGCCTCGAAA TCCTATTGAA 2100

TTATTCATGT TTTTAGGTAG TCCTATTACG ACTGTACCCA CATTATGTTT TTTAATAATG 2160 

TCTACTAATT GGTCAATACC TAATTCATTA TTTTCTTCAT TGATTCGGAG TGTGTCTAAT 2220

CCTT5TGCCG TCCAACCCAT TATATCACTA ATTGCAATTC CTACCGTTCT ACTACCGACA 2280 

TCGAGTCCTA AAATTTTATG TTGTAACATA AATTATTTAT TTTGCTCTTT TAAATAGTAA 2340 

GAAACAAGCT CTTCCATAAT AACATCTCTA TCAATATGAC GAATTTGATT TCTTGCTTCA 2400 

TTTTGGCGTG GAATATACGC AGGGTCACCT GATAATAAAT AACCTACAAT TTGGTTTACG 2460 

GCATTATATC CTCGTTCATC TAATGTTCGA TAAACATTAT TTAAAACATC TCTTACATCT 2520 

TGCGTTGGAA GTTCTTCATA GTCGAATTTC ATTGTTTTAT CAAAGTTTTC CATTTGCGAC 2580 

ACTCCTTTAA TTACAAATAT AACTCACTAT CATCATACAA TATTATGGCT TTAAATTATA 2640 

GATTTTTAAT GTAATCTTTA ATAAAGCTTA ATGATTTTGA GATATTTTCA GGTTGTGTAC 2700 

CGCCACCTTG AGCCATATCT GGACGACCGC CACCTTTACC ACCAACGATT GGTGCCATTT 2760 
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TCGATACTTT ATCATCAACA TTACTTGCAA GAATGATAAT TGTATCTTGT AGTTTAGATT 2 8 80 

TAAAATCGTC CATTGTCGAG CGAATTGCTT TCGCATTTGG TACATCCACT TCAGTAACCA 2940 

ATACTTTATA GCCATTGATT TCTTCAACTT GATCTTCAAT ATTACCCATT TTAAGTGATG 3000

TGATTTCTTT GTCACGTTGC TCTAATTGTT TTAATAATGC TTTTTCTTCA TCTTGTAATT 3060 

GTGTTAACTT ATCGACTACT TGATCATCAG ATTTCACTTT CAGCTGTGAT TTCATCGTAT 3120 

TAAATTTCTC TTGAATATCT TCTAAATATA AGAAAGCTGC TTTACCTGTT AATGCTTCAA 3180

TAOGACGCAC ACCAGCTCCT GTACCTGACT CACTTACTAT TTTGAATAAG CCAATTTCAG 3240 

AAGTATTGCG GACATGAATA CCACCACATA ATTCAATTGA AAATGGTGCC ATATTTACTA 3300 

CACGCACAAC ATCACCATAT TTTTCACCGA ATAATGCCAT TGCGCCCATT TCTTTAGcTG 3360 

AAGCAATATC CATTTCTTGA ATGTTAACGT CAATACCTTT CCAAATTTCT TCATTTACTA 3420 

AGCGTTCAAC TTGATCAATT TCATCATTAG TCATTGGACC AAAATGAGAG AAATCAAAAC 3480 

GTAAACGATC TGCTTCTACT AGTGAACCAG CTTGGTTAAC ATGATCACCC AGTACTGATT 3540 

TCAACGCTGC ATGTAATAAA TGTGTTGCAC TATGGTTCTT TTGAATGTCA CGTCGATCAT 3600 

TTTGGTTCAC TTCAGCAGAC ACTGTAGCGC CAACATTTAC TTGGCCAAAT TGTACTACTC 3660 

CTTTATGCAA GTTTTGACCA TTTGGTGCTT TGGTTACTTC ACTAACAGCA ATTTCAAAAT 3720 

TGTCATTATA AACAATACCT GTATCCGCAA CTTGTCCACC ACTGATTGCA TAAAATGGTG 3780 

TTTCCGTTAA CATGAAGTAT ACTGTTTCAC CCGCTTCAAC TTGTGAAACT TCTTCACCAT 3840

TGTATATCAA GTGTGTTAGT GTTGTTTGAG CtGTCGCAGT ATCATAACCA ACAAAAGTAC 3900

TTGCAGATGT AATATTTTTC AATACTTCAC TTTGAACTTG CATTGATTGA GAATTTTGAC 3960 

GTGCTTGACG TGCACGATCA CGTTGTTGTT GCATTTCTGA CTCGAATGTT GTCATATCAA 4020

CTTXCAATCC TGCTTGCACT GCTATTTCTT CAGTTAATTC AATTGGGAAC CCATACGTAT 4080 

CATACAATTT AAATGCATCT TTCCCATTAA TTTCATTTGT TGTCGCTTTA GCTTTTTTAA 4140 

TTAATTCATT TAAAATCGCT AAACCATCTT CTAATGTTTC ATGGAATCGT TCTTCTTCAG 4200 

ACTTTATAAC ACGCTTAATG AAATCTGCTT TTTCCTTAAC ATTTGGATAA TATGGTTCCA 4260 

TAATGTCTGC AACAATATCA ACAAGTTTGT ACATAAATGG CTCATTGATT CCTAACGTTT 4320 

GACTAAAACG AACGGCACGA CGTAACAATC GACGTAATAC ATACCCTCTA CCTTCATTGG 4380

CAGGTAATGC ACCATCAGAA ATTGCAAATG CAATCGTACG AATGTGGTCA GCAATTACTT 4440 

TAAATGCCAC ATCTTGTTCG TTGTTTACTA AATATTGTTT ACCTGATACT TTTTCGATTT 4500 

CATTCATTAT AGGCATAAAT AAATCTGTTT CATAGTTAGT ACGTACATTT TGAGAAACTG 4560 
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TATGATCTTT ATTATGATTG AATTCACTAA ATACTAAGTT CCATACTTCA AGATAGCGTT 4680 

CATTTTCTCC ACCTGGATAC ATTTCTTCTG CCGGATCGTC TTGTCCATAT GCTTCTCCGC 4740 

GATCATAGAA AATCTCAGTG TTCGGTCCTG AAGGCCCTTC ACCAATATCC CAGAAGTTAC 4800 

CTTCAATGCG AATAATACGA CTTTCTTCAA GCCCAATATC TTTATGCCAA ATGTTGTATG 4860

CTTCCATATC TTCCGGATGA ATCGTAACGT ACAATTTATC TGGCTGCATA CCCATCCATT 4920

TATCACTCGT TAAAAATTCC CAAGCAAATT CAATCGCTTC TTGTTTAAAA TAATCACCAA 4980 

TTGAGAAGTT ACCTAACATT TCAAAGAATG TATGGTGACG CGCTGTGAAA CCAACATTTT 5040 

CAATATCATT TGTACGAATA GCTTTTTGAG AGTTTACAAT TCTTGGCTTT TTAGGTGTTt 5100 

CACGTCCATC AAAATATTTC TTTAATGTTG CTACACCTGA ATTAATCCAT AATAATGTAT 5160 

CATCATCAAT TGGCACTAAT GGTGCAGAAG GTTCAACCAT ATGTCCTTTT TCAACAAAGA 5220 

AATCTAGATA TTTTTGTCTA ATTTCACTCG CTTTTAACTT TTTCATCATT TACACATCCT 5280 

ATTTACTGTT TTTAAATTAC CATTCCATAA AAATTGATGA CACAGATAGT CGATTTGCAA 5340 

AACTAGTATA AATCAATATC ATTTTTTATT ATTAAAAAAT AAAAAACGCC CATCCTCAAA 5400

AGGGACGAAC GTTATCGCGG TACCACCCTA GTTATAAATG CAATTCAACA CATTTATCAC 5460

TTTAATTCGA CTATACAGTT GTGCATAAAG TAGCGTTCAC TAATGTTTGT tGTACTTTTC 5520 

ACCAACCAGT ACATCTCTGA TAAACAAATC aTTAACTACT CATCTTTATA CGAATTTAAT 5580 

TCTATTTTAG TTACATTTAC GCTTGTTGTC AACGTTCTAT AAAGTCATAC GGCGTGATTT 5640

CTCCCATATT AATCATTGGG TCAATTTTAA ACATTGTAGC TTCCGTTAAT ACATTTGTAT 5700 

CTGTTTTTGT TGAATCAGAC ATAACTTCTT CACTATCATT CGATGACATT GGCGCTTCTA 5760 

CTTGATCATC TATTGTCGTT TGTGAAGCTC CTGTATCATT AgTTGCTGTG TTTTCCAgCA 5820 

TTTCSTCATC TTCTGAATTA AAATAATTTT TCAACAATGT ACATAATTGT GTTAAACGCG 5880 

CTTGACCATT TGTTTTCAAT CCAATATCAA ATGCTTCCGG ATCACCAAGT AAAACTAAAC 5940 

TCGTTTTCGC TCTAGTTAAA CCAGTATATA ATATCGGTCT TTGTAACATT CTAAAATACT 6000 

GTTTAACAAT AGGCATGATA ACAATAGGAA ATTCTGAACC TTGTGATTTA TGGATTGATG 6060 

TACAATAAGC ATGTGTTAAT TCCATCATAT CTTGTTTCGT AAATGTAATT TCATTACCTT 6120 

CAAAATCCAC AACAAGTACA TCTTTATTAA GGGCATTTTC TTTCGCCCAA AAAATACCAA 6180 

CAATAACTCC TATGTCACCA TTGAATATGT TATCATTTGG CCTATTAACA AGTTGTAATA 6240

CTTTGTCACC TTTTCTAAAG ACTACATCAC CAAACTCAAT TTCTCGTGTG TCTTTCTTTT 6300

TAGGGTTTAA AATATCTTGT AAAACTTGAT TTAAACGTTT AATACCGGCA TTTCCTTTAT 6360 
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CTACCTTCTC 


AACAACTGTT GGTATTTGGT TTGCCTGACA GTTAATAAAA CTTCTATCAT 6480

GAAAAGGCTG TGTAATATCA ATTTTCTGAC CCAACTTCAT TCGATGTGCT AATTCTATAA 6540

TGCTTGAACC ATCTTGTTGA CGATATACTT CAGTCAGATT TACTCGTGGT ATAGCTTTCG 6600

ATTCAATTAA ATCTTTAAAT ACTTGACCAG GACCTACAGA AGGCAATTGG TCCTCATCAC 6660

CTACAAATAT CAATTGTGCA TCTAAAGGAA CTGCACTTAA AAATTGGTGG AACAACCAAG 6720

TATCTACCAT AGACATCTCA TCAATGATTA TGAGTCGTGC GTTTATTTCA TTTTCTAATA 6780

TATCCTCTGG CTTTGTGTCT TGATTCCAAC CTATTAAACG ATGAATCGTC ATTGCTTCTA 6840

ATCCAGTTGA CTCTTGTAGT CTCTTAGACG CTCTTCCTGT TGGCGCTGCT AATACAACTG 6900

GATAATCATC ATTGACATAA TCATCATAAT CTAATGATAA GCCATGAATC TCAGCATATA 6960

ATTCAACAAT ACCTTTAATT ACTGTCGTTT TTCCTGTTCC CGGTCCACCG GTTAATAGCA 7020

TCACCTTAGA ATTGATAGCC GTTTGCAAAG CTTCTTTTTG TGAAGCTGCA TAGTTCACTT 7080

GATTCGCATC TTCTATTTCA CCAATATGCA TTTGTAAATC TGACTGTTCA ATTTCTGTAA 7140

GTTTATTTGT ATGCGTCTTT ATTCTGAATA AGTTTTGAAC ACTTTTGATT TCaGAATAAT 7200

ACAAACTTGG AATTGCAACT TGTTCaTTGT CAATAATTAG TCGTTTTTCC TCATTTAAGT 7260

ATTGCAACAT TTCGTCTAAT TTTTCAGGTT CGATGACCTC TTCATCTTGa TAATTTAATA 7320

CATCAACCGT TAAATCTATA ACAACATTGA TAGGCAAATA TGTATGTCCC TGTTTAATAC 7380

ATTCTTCTTC TAACGTATAG AGCAACGCAG CTTTTAATCG TTCATTATCG TTATAAGCGA 7440

TACCAATATT TCTAGCAAGT TGATCTGCTT TATTAAAACC AATACCTTTA ATATCATAAA 7500

TCAATTGATA TGGATTTCGA TCTAAAATAG TCAGTGTATC GCCGAGATAA AACTGATAAA 7560

TTGCCATTGA AAGTTTAGGA CCAAACCCTA AATCATGTAA ACGAATCATT ATTTTTTCAG 7620

ATTdTTGATT TGCTGAAATT TGTTCTGCAA TTTGTTTCTG TTTCTTTTTA GATAATCCCG 7680

AAACTTTTTC TAGCACTGAA TGGTCATCTA ATATATCATT TATCGCATTG TCACCTAATG 7740

TATTAACAAT ATTTTGAGCT GTCTTTTTAC CTACACCTTT AAACAAATCA CTAGATAAAT 7800

AACTTATAAT TGCTTCTTTC GTTTGTGGCA TTTCTTTTTC AAAAGTCTCT GCTTTTAATT 7860

GTTTACCATA ACGTGGATGA TCAACAACTT GCCCTTTAAA TGTGTAGACA TCGCCTTCAA 7920

CAATATTCGG AAGAAACCCT ACAACAGTTG GCATTGTATC AAAGTCTTCA TTTGTTTCAA 7980

TAGTATCTAC TTTAAGCACT GTATAAAAAT TATCACTGTT TTGAAACAAT ATCGCTTCAA 8040

CAGTACCTTT GATCATTGAA TAATCAAATA GTGTAGGGTC TGACATGTTA CTCCTCCTCT 8100

TTCATTTTAG TGAATGTTTT CAGCGCATGC TGACTTAATA AGTGTTTAGG GTCGATAGTC 8160
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AAGCCCAAAT TGTATCTTGC ATCAACATGA TTTTTATCAA TCGTTAATAC ATGTTTAAGT 8280 

TGAGTTATGG CTTCATTAAA CATTTCTAAT TGACATAATA CAAGACCATA TTGAAATTGA 8340 

ACTTCTGCAT CTTTGTCTTT ATCTAGTTCC GCAGCAGTCA TTAAATACGG CAATGCCAAG 8400

CTTAAATGAT TCTAACTGAT TAAACGCCAT ACCGATCATA TAATTACAAT CAACTTGTTC 8460 

AATCTCTGTT TGTAATGCTT GTTGATATAA TTTAATAGCT TCTTGATAAC GTTGCTGATT 8520 

ATAATATACA TTTGCTAGAT TAAAAAATAC GACGCCATTC TTCGGATCTA TTGTnAAAGC 8580 

TTTTTGGAAA AAACGCTCTG CCTTTyCAAy CyCATTCgCA TCAGCAAGTA CGATmCCaGC 8640 

ATTAATATAA TTTTCAATAA TTGTAGGATT TTCTTCGATA TTTCCGAACA ATGCTTGTAA 8700 

CGCTTCTTCT ATTTTTCCAT TTTGTATGTA TTGATAAATT GTTTGTTGAT CTATCATTTA 8760 

CGAACCTCAT TTCTCATCAA TTATAACATC TTGATAAATT GTATGTCTCG AATCACTTAA 8820

CAACGAATAA AATATAATCT AATATCATCT TCATTCATGA AAAAGCGGGA ATGGAATAGA 8880

AATGCTTAAG AACCATTAAC GGTTTATTAT GTAATGGTTC TTCCACATTA GCCACCACTA 8940

TTATGTACTT AAAAATAAGA ATACATAATT AGATTCATGC ATAGGGAGTG GGACAGAAAT 9000 

GATATTTTAA CAAAATTAAA TTCGTTATCC CCAACTGGCA TTGCCTGTAG AATTTCTTTA 9060

CGAAATTCTC TATGTTGTGG TCCCGCCAAT ATAACATTGT AGAGCCTAGG ACATTGTGAT 9120 

GTCCCAGACT CTATCCTCAT GAATTATTCT CATCAAAAAC TGTCTTTCGT CATTTTCAAC 9180 

GTTGAAACTT CAAATAAGTA ATTTATTGTT GCCATTGTTT ATACAACATA ATTTAATTGA 9240

CCTTCATTTT TGAACACATC GTCAATTGTT GCACCACCAA GACACACATC ACCTTGATAA 9300 

AAAACAACTG CTTGTCCAGG TGTGATTGCT CTTACTGGCT CAGCAAAAGT AACACGTAGg 9360 

CAtGGtCGTT TTCACGTTTC ACAAAAACTT TCGTATCTTT TTGGCGATAT CTAAATTTAG 9420 

CTGtACATTC AAAACCTTGA TCTAAGTCAT TATCTTCTGG ATTTACAAAT GAATAGTCTG 9480 

AAGCAATTAA GTAATCACTG TATAATGCAT CGTGATGGAA TCCTTGTTCT ACATATAAAA 9540 

CATTATCTTT TAGGTTTTTA CCGACAACAA ACCAAGGATC GCCATCTCCA CCTATACCTA 9600 

ATCCATGTCT TTGTCCTATT GTGTAATACA TCAAACCACT ATGTTTACCC ATTTTCTTAC 9660 

CATCAAGTGT TATCATATCA CCCGGTTGTG CAGGTAAATA TTGTGATAAA AATGTTTTAA 9720 

AGTTTTTTTC GCCGATAAAA CAAATGCCTG TAGAATCTTT TTTCTTAGCA GTAACAAGTC 9780

CTTGTTCTTC AGCAATTCGA CGCACTTCAC TCTTTTCGAT GTCGCCAATT GGGaACATCA 9840

CTTTTGAAAG TTGTTGTTGA GATAATTGAT TCAAGAAGTA TGTTTGATCT TTATTATTAT 9900

CTACACCACG TAACATTTCA ACATGACCAT CTTCATGACG ATGTATGCGT GCGTAATGTC 9960 
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w 






TTTCTTTATT 
AATACGTAAA 
CAATTTGATT
TTTCGTCAGT 
TTAAGACGTG 
TATCTTTATT 
CTACAATTGC 
GATTTGATCG 
CTGCTGTACA 
TCGTTTCAAC 
TTGAACCATT 
TTAAATTCAT 
TAGCCATCCC 
CACCGCCAAG 
CTTTTGGTCC 
ACTTAACATC 
CAGCAATAAT 
ACATraTAGa 
TAGCACCTGT 
CAAAAACATG 
GTTC&TTTGC 
TAAATATAAT 
CTAAATATTT 
AATTGTAAAT 
AATCTGCATA 
GCACCTTACA 
TATTTTACCA 
TTGTCTTTTT 
CTATTATTTT 



ACACATAACG 
GACTTTATCC 
ACACACTTCA 
GTCATCCCAG 
GGCTGTTACA 
TGACAATTAT 
ATTAATTTCA 
CTCCTCATCT 
TGCAGATCCA 
TTCAACAAAC 
TAATTCAAAT 
TAAATGAATA 
AACAATTTGC 
TTGAGGATAA 
GCCAAACTTA 
AAGATGTCCA 
ATCTTGAATA 
TACTAAAATC 
ATCATCAACA 
TAACACAGAA 
TTTTACTATG 
TTCATGTGTA 
ACGCGCATCT 
CGTCATCATC 
AATTTCCATG 
TACTATTTTT 
TTTTCTACAC 
TATAAATTTT 
GGTTATGAGC 



TCTGGATTTG 
CAATATTCTT 
ATAACATCGT 
TTTTTCATAA 
GAACTATCTA 
GACTCCTCCT 
TTTTCAGTAG 
TCGAACATTG 
GACGACACAT 
GGAAAATATA 
GGAATCGCTC 
TTGTTATCGT 
GCAAGATTTT 
TCTAGTGTAA 
TGAGCAGTAA 
ATTGCTTGAA 
TCATAAATTT 
GTCTTATCTG 
TCTAGATATG 
TGATGTTCAA 
CCTTTAATTG 
TCTGCACCAA 
CTTCCCTTAG 
GCATCTACTA 
TTTGGACACT 
CTACTTTTCT 
ATGCTTTTCG 
AGTATTTGCA 
AATTTAATAT 



GAGTACGACC 
TTTCAAAATT 
TGTAATCTTC 
ATATGCCAAT 
CACCGCCTGA 
TAAATTTAAA 
TCAATTCGTT 
CATCTAAAAC 
AGATTTGTGC 
GATTTACAAT 
TTTCTTGTAA 
CTCGATTCTT 
CAGTGCCTGC 
CATGGTCTTT 
TACTCATTGC 
CCGCATCAAC 
GTtGCACTGT 
tAATTGTTTC 
TTACATCAAA 
TCTTCGATGT 
CCGTATTATT 
GTAATTGTGC 
CATGTATTGA 
CTTCAGGTTT 
CCTCACAATT 
GTTTAACTTT 
ATAGGCTTTT 
GATATTTTTT 
TTATCTGGTT 



TTTTTTGTAT 
AACAGCGTAA 
AGTTGCAGTA 
GACATCATAA 
CATACCAACG 
ATATATTTTA 
AAAACTAAAT 
ATGCGACGGT 
CATATCCAAC 
ATGGCCTGTA 
TTTAACTAAA 
TTCTGCTAAT 
ACGGCGTTTC 
AACTAGTAAT 
GTCGATCTCA 
ATGGAAATAT 
GCCAaCTTCA 
TTCAAGTTGA 
ACCTTCTCGC 
GATAATGTGA 
CGATTCTGTT 
AATTTGACGT 
TGATGGATTA 
TACTGGTGTG 
TTATCAATGT 
ATTTATAATG 
TTAAGTTTAT 
TATTTGTAAA 
ATTCGATTGG 



TCATCTAAGA 
TACGGAATGC 
CATACGC CAT 



ACAACACGTA 
TGAATTTCAG 
CGAATCGAAT 
TGTGTAGAGC 
AATGTTAACA 
GCATCCGTCA 
AATTGTTCTT 
TGTAATGCTT 
AATTCTTGTT 
GCACCGACAC 
AATTCGTCAA 
GCATTTGTCT 
TTATTTACAA 
TCTAAATCAA 
TCTAATTGTT 
TTACCCAATT 
GCGCCACTCG 
CTTGACTCAT 
CCATAATGCG 
GTCGCAGCAT 
TCCAATAATA 
TTTTTAATTA 
CGCTTTATTC 
ATGTAACGTA 
TATACTTCTT 



10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
10560 
10620 
10680
10740 
10800 
10860 
10920 
10980 
11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
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GTCCCTATTC GAGAAGGTGA AGATGAACAA ACAGCAATTA ATAATATGGT TAATCTCGCA 11880 

CAACATTTAG ACGAATTATC ATATGAAAGA TATTGGATTG CTGAACACCA TAACGCTCCC 11940 

AACCTAGTAA GTTCAGCAAC TGCTTTATTA ATTCAACATA CGTTAGAACA TACGAAACAC 12000

ATACGTGTAG GTTCTGGAGG CATCATGTTA CCTAATCATG CTCCATTAAT CGTTGCGGAA 12060 

CAATTTGGCA CGATGGCAAC ATTATTTCCA AATCGTGTCG ATTTAGGATT AGGACGTGCA 12120 

CCTGGAACAG ATATGATGAC CGcAAGTGCA TTAAGACGAG ATCAACATGA TGGTGTTTAT 12180

AAATTTCCAG AAGAGGTTTC ATTATTACAA CAATATTTCG GCCCTGCTCA CCAACAAGCA 12240 

TATGTTCGTG CTTATCCAGC AGTAGGTAAA AATGTGCCTT TATACATTCT TGGTTCTTCA 12300 

ACAGATTCTG CACATTTAGC TGCTCGCAAA GGGCTTCCAT ATGTGTTCGC TGGACATTTT 12360 

GCACCTCAAC AAATGAAAGA AGCTATCGAA ATTTACAAAA CGTTATTTGA ACCTTCTGAT 12420 

GTATTAGACG AACCTTATGT TATTGTATGT TTAAATACAA TCGTTGCTGA AAATGATGAC 12480 

GAAGCACAAT ATTTAGCTTC ATCTATGGCA CAAGTAATGG TTAGTATCAC TCGTGGCAGA 12540 

ATGCAGCCCG TTCAACCGCC AACACATGAA CTACAAAATA TATTAACGCC GAGAGAATAC 12600 

GCGATGGCTA TGGAAAGACA GAAAATATCA TTAATAGGTT CAGAAAATAC TGTTCAACAA 12660

AAAATTCAAG ATTTTATGGA AACTTATGGT GAAGTCAACG AAATTATGGC AATAAGTTAT 12720 

ATTTATGATA AAGATATGCA ATTAGACTCT TATCGTCGGT TCAAGAATGT TATAAATCAG 12780 

ATAAATGAAA AAAACACTTT ATAATGTGAT AAATAAACTA AGTGAAAGTA TGTATCCATA 12840

ATATTAATAA AAATATACAG TAACAGCATT TTGAATGAAA GATGTCTTTA TTGTTCAATC 12900

ATTTATTTTA GTAATGATTC AAATTCACTT AAAATyCTAA tGCAAATATG AAAGCGCCCC 12960

TTCAcTTTAC ACTGTGTAAG TGTTTATTTG ATGGGGCGCT TTCAAAATAT TGAAAAGCAT 13020

ATCCfiAAATT TAAAGAAATT TATTTCTCTT TATCTTCATT TTCTTTTTTC TCTTCGTTAT 13080 

TCGATCCTGT ATATTCATTT ATCTTATCTT TTACATTTTT AACTTGTTCA TTATCGCTAT 13140 

TTTTAAATTT TTCTACGCGT CTTTAGCTTT ATCCATAAAA CTCATATTAA TCGCTCCTCT 13200 

TATATTTGAT TAGTTTAATT GAACTTATTT TTTAAGTTTA TCAATTGCAT CAGTTATTTT 13260 

GTTTTTAGCA TTTTCAACAA CTTCTTTTGC TTTaCCAGTC GCTTTATCTT GCTGACCTTC 13320 

TTTTTCTAAT TCTTTGTTAT CAGTAACGTT ACCTACTGTT TCTTTAACAT TTCCTTTAAA 13380 

TTGATCGAAC TtACTTTCGT CTGCCATAGT GAAACCTCCT TGGATGTATA TATTTATATA 13440

CCACTAAGGA GGTTCGCTmm mCAyymyAAT ATGAAGTTTT TATGTTATAG TATAGTATTT 13500

ATACGATTAA ATATAAAACA TGTATCCGTC TAAATCTTCA CTTGTATCTA CATATTCCGC 13560 
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TAGTTGTTTT TGCGCAGGTG GTTCTGATTC AATACTTTCA ACAAATGTAA TTGGACCTTC 13680 
TAACAGTCTT ATAATATCCC CTGCTGAGAT TTCTT 13715 
(2) INFORMATION FOR SEQ ID NO: 196:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 873 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 196:

AAATCCATAA TGTCATGATA ATCTGCATAT GCTTCATATA ATTCAATCAT TGTGAATTCA 60 

GGGTTATGTC TAGTTGATAC ACCTTCATTA CGGAATACTC TACCAATTTC ATATACTTTT 120 

TCAAGTCCAC CGACAATTAm ACGTTTTAAA TGCAACyCAA TAGCAATACG CATGTATAaC 180

GTTGCATCTA ATGCATTATG ATGTGTTACA AATGGTCTAG CAGCTGCTCC ACCAGCAATT 240 

TGGTGCATCA TAGGTGTTTC TACTTCCAAG AAACCTTTAT TATTTAAATA ATTACGCATT 300 

TCTTGAATGA TTTTACTACG ATTAATAAAT GTACGAGTGC TATCTTCGTT CGTAATTAAA 360

TCTAAATATC TTTGACGATA tCTCTGTTCA ATATCCTGTA AACCGTGGAA TTTATCCGGT 420 

AATGGTCGCA ATGATTTAGT TAGTAGCGTG AATTTCTTCG CTTTAACCGA TAATTCGCCA 480 

GTATTTGTTT TGAACATTAC ACCTTCAACA CCAACGATAT CGCCTAAATC AGCATTTTTC 540

CATAAATCAA ATTCGTCATC GCCAACTTGA TCTTTACGAA CGTAAATTTG AATTTGTCCA 600

GCTAAGTCCT GAACGTGTGC AAATCCTGCT TTACCTTTAC CACGCTTAGT CATTAATCGT 660 

CCAGCTATAG CGACATGACT ATCCGCTTCT TTTTCTACCA ATTCTTCTTT AGAATACTGG 720 

TCCeACTCTT CTTTCAAATC ACTAGATAAA CCTGAACGGT CAAATTTAGA ACCAAACGGG 780 

TCTATACCAA GATCATATAA TTCTTGTAAT TtTTGACGTC GAACCAACAT TTGGTCATTC 840 

ATTTCTTCTG ACATAACTtT CTCTCCTTTA ACT 873 
(2) INFORMATION FOR SEQ ID NO: 197:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 452 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 197:
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ACCATAATAT GAATGGCTTC AGGATCAAAA TAAAGACCAA CTTCACTGCC TACTTCAGCT 120 
TTTTTAGTCG TTTGTATTAC CCATTCATAA CCTTTATTGT CTATACAACA TATTTCATAG 180 
TGGACCCCTC TAAATAACAT AGAATCAACA GTTGCTTTAA ATAATCCTTC TTCAGCTTTG 240

ATTAATGATA TATCTTCTGG TCGAATAACG ACTTCTACTT TTTTATTTTC AGGAATACCC 300 
ATATCGACAC ATTCGAAATC TTGCCCATAA ATATTCACGA CATAATCTCT AACCATGCGC 360 

CCTTCAACAA TATTAGATTC TCCAATAAAA TCAGCTACAA ATCGATTCAC TGGTTCGTCA 420 
TaTATATCTG TTGGTGTGCC AAATTGTTGA AT 452 
(2) INFORMATION FOR SEQ ID NO: 198:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2308 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 198:



TAGGTTGGGT TCTAACATAC GATAAAGCTC AACAAATCAA CACAGCTTTC TTTGTAAAAT 60

TGTTTAATAC TGCATTAGCA GAACGTGATT ATTATTTTAA TATAGATGGA ACAAATGCTT 120

TTAGATTATT TAATGCTGAA GGTGATGGTG TTGGGGGATT AACAATCGAC AATTACGATG 180

GTCATTTGTT GATTCAATGG TACTCAAAAG GTATTTATAA ATTTAAATAT GCCATTCTTG 240

AAGCGGTTAG AAAAGTATTT GATTATAAAT CTATTTACGA AAAAGTAAGA TTTAAAGACA 300

GCGAATATAG TGGTGGTTTT GTTGAAGGAG ATGCACCTGa GTTTCCAATT GTTATCGAAG 360

AAAACTTCAC ATTTTATAAT GTAGACCTTG AAGATGGTTT GATGACAGGT ATCTTTTTAG 420

ATCfiAAAAGA AGTGCGCAAG AaATTAAGGG ATCAATATGC CAAAGAACGC CATGTTTTAA 480

ACTTATTTAG TTATACAGGT GCTTTTTCTG CAATAGCAGC AAGTGAGGCA TCTTCAACAA 540

CAAGTGTAGA TTTGGCTAAT CGTTCTCGTA GTTTAACTGA AGAAAATTTT GGATTAAATG 600

CTATTGATCC TAAATCCCAA TATATTTATG TCATGGACAC TTTTGATTTC TATAAATATG 660

CTGCACGACA TGGACATAGT TATGACACGA TCGTGATTGA TCCACCTAGC TTTGCGCGTA 720

ACAAAAAACG TACATTTTCA GTGCAAAAAG ATTATGACAA ATTAATTAAT GGCGCCTTAA 780

ATATCTTATC ATCTGAAGGA ACATTATTGT TATGTACAAA CGCAAGTGTA TATCCATTAA 840

AGCAATTTAA AAATACTATT AAAAAGACGC TTGAAGAGAG TGGCGTTGAT TATGAATTAA 900

CTGAAGTTAT GGGATTACCA AAAGATTTTA AAACGCATCC ACATTATAAG CCATCTAAAT 960
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TATTGAGAAA 


AAGAAGGGTG ATAATATTAT GGGATTCAAA AACAATTTAA CATCAAATTT 1080

AACAAATAAA ATCGGTAATT CAGTCTTTAA AATAGAAAAT GTTGACGGAA AAGGTGCAAT 1140

GCCAACGACG ATTCAAGAAT TGAGAGAAAG ACGACAACGT GCTGAAGCAA TTGTAAAGAG 1200

AAAGTCTTTA ATGTCATCAA CAATGAGCGT TGTTCCAATT CCGGGTTTAG ATTTTGGTGT 1260

TGATTTAAAA TTAATGAAAG ATATTATCGA AGATGTTAAT AAAATTTATG GTTTAGATCA 1320

TAAGCAAGTT AATAGCCTTG GGGATGATGT GAAAGAAAGA ATTATGTCTG CAGCAGCAAT 1380

TCAAGGTAGT CAATTTATTG GTAAAAGAAT TTCAAATGCA TTTTTAAAAA TTGTAATTAG 1440

AGATGTAGCT AAACGTACTG CTGCAAAACa ACTATTTTAT AACAAAATGG TTTCCTGTTG 1500

TAGGACAAGC TGTGTCTGCA TCTATTAGTT GAATAAAATT GGAAAAGATC ACATTCAAAA 1560

ATGCGAAAAT GTTATTAAAA ATGTCATGTA GGTGCTATAA TAGTTTTGCA ATTTGCAAAT 1620

TTTACTGAAA CCGGTTTTAA ACGAATTGAA TTTAAAGcAT GGTTTTGGTA AAGTTAATGT 1680

ATAAAACTAA GTTAGyATTG TAATAATATk GAAGATTCTA ACTATACGAA GGAGAAATGT 1740

AATTATGGAA CAAAATTCAT ATGTAATCAT CGACGAGAmT GGTATTCACG CTAGACCAGC 1800

AACAATGTTA GTACAAACAG CTTCAAAATT CGATTCTGAT ATTCAATTAG AATATAACGG 1860

TAAGAAAGTA AACTTAAAAT CAATCATGGG TGTTATGAGC CTTGGTGTTG GTAAAGATGC 1920

TGAAATTACA ATTTATGCTG ACGGTAGTGA TGAATCTGAC GCCATTCAAG CAATCAGTGA 1980

CGTCTTATCA AAAGAAGGAT TGACTAAATA ATCATGTCTA AATTAATTAA AGGTATTGCC 2040

GCATCTGATG GTGTCGCAAT TGCTAAAGCT TATTTATTAG TTGAGCCAGA CTTAACATTC 2100

GACAAAAATG AAAAAGTCAC TGATGTTGAA GGAGAAGTTG CAAAGTTCAA TAGCGCTATC 2160

GAAGCTTCTA AAGTTGAGTT AACTAAAATT AGAAATAATG CAGAGGTTCA ACTAGGTGCT 2220

GATAAAGCTG CTATCTTTGA TGCAcaTTGG GGGGTGGTAG ATGACCCTGA ATTAATTCAA 2280

CCAATCCAAG ATAAGATTAA AAATGAAA 2308



(2) INFORMATION FOR SEQ ID NO: 199:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5559 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 199:

AAGTAATAAA TCGTCTCATT TGGCAACTGA CGCATAATTT CTTTAGCTAC TGTCAAACCT 60 
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TTTAATTTTT AGTTTATCAT AACTAAGCAT TGGATTTTAG TATTATGCAC TGTGTTTACC . 180 

ATTTTTGTCA TTATAATATT TATTTTAAAT CAGCCCACTA TCATATTGTC ATGTAATCTG 240 

CTTATTAAAA AAATCCCTTC CAAGTTATTG TGTATCTCCA TTCAATTTAA TTTTGAAAGG 300 

AACATAACwT TTTAACTCAA AAGGGATTAA TTTnTAnTCT ACTTCATGGT CTGAACCAAA 360

GAATGATTTA AACATGTGGA ATGTTGTTTC TCTGTTCATT GCTGCAATGG ATGTTGTTAA 420 

TGGAATACCT TTAGGGCAAG CATTAACACA GTTTTGTGAA TTACCACACT GCTGTAAGCC 480 

ACCAGTACCC ATTAATCCAT TTAAACGTTC ATCTTTAGTC ATAGATCCTG TTGGGTGCAA 540 

ATTAAACAAA CGAACTTGCG AGATTGCTTG TGCACCAACG AaTTTATTAT TTTCAGTAAC 600 

ATTAGGACAA ACCTCTAAAC ATACACCACA TGTCATACAT TTAGATAATT CATAAGCTGT 660 

TTGACGTTTT TTCTCTGGCA TACGTGGTCC CGGACCTAAA TCATACGTTC CATCAATTGG 720 

GATCCATGCT TTCATACGTT TTAAGTTATC GAACATTCTA GAACGATCAA CTTGTAAGTC 780 

ACGGATAACT GGGAAAGTAT TCATTGGCTC TAAACGAATA GGTTGTTCTA ATTGATCAAC 840 

AATCGCAGAA CAAGATTGTC TTGCACGACC ATTGATAACC ATAGAACATG CTCCACATAC 900 

TTCTTCTAAG CAGTTCATAT CCCAGACAAC AGGTGTTGTT TTTTCACCTT TAATATTAAC 960

TGGGTTACGT CTAATTTCCA TTAAACAAGC AATGACGTTT AAATTTTCAC GATATGGAAT 1020 

TTCAAATGTT TCTTCATAAG GCTTAGAATC ACTTGTATCT TGTCGTTTAA TAATTAATTT 1080

TACTGTTTTT TGTTTCGGTT TAGATTGTGT TTCATGTTGT GGAGTGTTTT TCACTGATTG 1140

TTCAGTCATT ATTTTTTACC CCCTTTAGAC TTACTTGTGT AATCACGTTT ACGAGGTGGT 1200 

ATTAAACTCA CATCGACGTC ATCATAAGTA AACTGCGGTT TTTCAAATGC GCCTTGGAAT 1260 

GAGGCCATTG TCGTTTTTAA CCACTCTTCA TCATTACGCT CTGGGAATTC TGGTTTATAA 1320

TGGGCACCGC GTGATTCGTT ACGGTTATAT GCACCAATCG TAATAACACG TGCAAGTACT 1380 

AACATGTTCC ATAGTTGACG qq^j^qj^j ACCGCTTGGT TACTCCAAGT TTGAGTATCT 1440

TCCATATCAA TATCTTCATA ACGTTTCATC AATTCAACAA TCTTTTTATC TGTTTCTAAC 1500 

AGTTTTTCAT TTTCACGAAC AACAGTTACA TTTGCTGTCA TAATTTCACC AAGTTCACGG 1560 

TGTAATTTAT ATGCATTTTC TGTACCGCGC ATAGCTAATA ATTTATCAAA ACGTTCTTGC 1620 

TCTTCAGCTT TACGCTTTTC AAAAATACTT TCGTCCATAT CAGTATATGA TCGATCAATA 1680

TTTGAAATAT AATCAATCGC GTTTGGACCT GCTACTGTAC CACCATAAAT CGCTGATAAC 1740 

AATGAATTGG CACCTAAGCG GTTACCACCA TGTTGAGAGA AGTCACATTC TCCAGCTGCA 1800

AATAACCCTT TAATATTTGT CATTTGATCA TAATCTACAT ATAGACCACC CATTGAATAG 1860 
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TAAATCTCAA TGATACCACC TAGTTTTACA TCTAACTCAT GTGGATCTTT ATGTGACAAA 1980 

TCAAGATATA CCATGTTTTC GCCATTTATA CCTAATTTTT GGTTAATACA TACATCGAAA 2040 

ATTTCACGCG TTGCGATATC ACGAGGTACT AAGTTACCAT AATCAGGATA TTTCTCTTCT 2100 

AAGAAGTACC AAGGCTTACC ATCTTTATAT GTCCAAATTC GTCCACCTTC ACCACGTGCT 2160 

GATTCACTCA TTAGTCGCAG TTTATCATCA CCAGGGATTG CAGTAGGATG AATTTGAATG 2220 

AACTCACCAT TAGCATAAAT AGCGCCTTGT TGGTAAACAA TGGAAGCCGC TGATCCTGTA 2280 

TTAATCATTG AGTTTGTTGT TTTACCGAAA ATAATACCAG GGCCACCCGT TGCCATAATA 2340 

ACTGCATCTG AACCAAATGT TTCAATCTCA GCAGTTGTCA TATTTTGTGC AnCGATACCT 2400 

CTTGCACTAT CATCGTCACC TTTAACTATG CCAAGGAATT CCCATCCTTC ATACTTCGTA 2460

ACTAATCCAT CTACTTCATA TGCACGAACT TGTTCATCCA ATGCATATAA TAATTGTTGT 2520 

CCAGTTGTTG CCCCTGCATA TGCTGTTCTG TGATGTAATG TACCACCGAA ACGTCTAAAA 2580 

TCTAATAGAC CTTCATTTGT TCTATTGAAC ATTACGCCCA TACGGTCTAA TAAATGAATA 2640 

ATTTTAGGTG CTGCCTCTGT CATCGCTTTA ACAGGTGGTT GGTTTGCAAG GAAATCGCCA 2700 

CCATACACTG TATCATCAAA GTGAATCCAA GGAGAATCGC CTTCCCCTTT AGTATTGACC 2760 

GCACCATTAA TGCCACCTTG GGCACAAACA GAGTGCGAAC GCTTTACTGG TACAACTGAG 2820 

AACAAATCTA CATGTGCACC TTTTTCTGCC GCTTTAATTG TTGACATTAA GCCCGCTAGG 2880

CCACCTCCGA CAACAATAAG ATGTTTCTCT GCCATAAAAA TTTCACTCCC CTAAATTTTC 2940

AATCTATATT TGTTAAATGC GATGTATTAC ATAAAGGCAA TAATTGCAGT AACACCAATA 3000 

TACGAAATAA CTAAAAATAC GATTAATGAA ACCCATGTAA ATACTCGTTG TGATTTTGGA 3060 

GATTGAAGTC CACCCCAAGT AACTAAGAAT GACCATAAGC CATTTGCAAA GTGGAACACA 3120

ACAGCAATAA TACAAATAAT ATAAAATATT GCCCATCCAG GATGTTGCAA TGTTTCGTGC 3180 

ATTAAATOGT AATTCACTTC TTTGCCGTAA AATGCTTTTT GTAAACGTGT TTGCGATAAA 3240 

TGGATACCAA TAAAGATAAA TGTTAAGATA CCACTCACTC TTTGGAAGAA GAACATCCAG 3300 

TTTCTAAAAA TCGAGTAATG TCCAACATTT TCTTTTGCTG TAAATGCAAT GTGTATACCA 3360 

AACAAACCGT GATATAACAA CGGAATGTAT ATAAATAAAA ATTCTACAAT AATTAGAAAT 3420 

GGTAATGATT CCATAAAGTT AGATGCCTTA TTAAACGCTT CAGCACCTTG TGTTGCTTGG 3480

TGATTCACTA ATAAATGAAC GACCAAAAAT GCACCTATTG GGATAATACC TAATAACGAG 3540

TGAATACGTC TTAGATAAAA TTCATTTTTT GATTGAGCCA AAAGGAGTCC CCCCTGTGAA 3600

CGAATATTTA ATTTATTGAG CTATTTATAT TAAACGTACG CTTAACCCCC TAAAGTGATA 3660 
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CGATCACCAA ACTGCATGTC GAACAATGTA ACATTTGGAT TCGATATTTA AAATTGCTTG 3780 

TGATGATAAA CTTTCTCATT TAGAAAACGC TTCCACGTAC ATTCAAAAAA ATAACTTTGT 3840 

TAACCATATT GTAACATTAT TTCATATATT TTGGGGCATG AGAATGATTC TCACGCCCAG 3900 

TAATTTATTT ATGCAATTGT TCATGTAGGT TCTTTGCGAC GTTTTCAGGA ATACCTATAT 3960 

TTTTAAAATC TTCAAGTGTA GCTTCCTTCA TTTTCTTGAT TGAACCGAAT GAACGCAATA 4020 

ATAATGTTTT ACGTTTGTTA CCGATACCAT CTATATCATC AAGTATTGAT TTCAAGCCTG 4080 

TCTTTTGACG TGTTTGTCTA TGAAATGTGA TTGCGAATCT GTGAACCTCA TCTTGGATAC 4140 

GGTGCAACAA ATAAAATGCC TGGCTATTTT TCTTCAGTGG TACAATTTCT GCACTAGCGC 4200 

CATATAATAA TTCAGATGTT TGGTGTTTAT CATTTTTCTG CAAACCTGCA ACAGGGATAT 4260 

CAAGACCTAA TTCGTTTTGT AGCACATCAA TAACCCCGTT CATATGTCCT TTACCACCAT 4320 

CTACTATTAT TAAATCAGGT AATGGTAATC CTTCGTTTAA AACGCGAGAA TATCGTCGTC 4380 

TTACTACTTC TCTCATTGAT TTGTAATCAT CTGGACCTTT AACCGTTTTG ATTTTATACT 4440 

TTCTATAATT TTTCTTATCT GGTTTACCGT CGACAAATGT AACCATTGCT GACACTGGAT 4500

CCACACCTTG AATATTAGAA TTATCGAATG CTTCAATTCT AATTGGTGTT TGAATTCCCA 4560

TTTGTGTTCC AAGTTCTTCA ATAGCTTTAA TCGTTCTGGA CTCATCACGT GATATTAATT 4620

CAAATTTATT ATTTAAGGAT ACTTTAGCGT TATGTGCAGC TAGGTCAACC ATATCTTTTT 4680

TGGGACCTCG CGCGGGTTGA ACGATTTTAG TGTCCACAAC AGATTGAATC ATTTCTTTAT 4740

CCAAATTACG TGGTACATGA ACTTCCTTAG GTAAAATATG TTGGTTTAAG CTATAAAATT 4800

GTCCAATAAA TGTATAAAAT TCTTCTTcTT CTGTTTGCTG TAATGGAATC ATCGTTGTAT 4860

CTCGCTTTAT CATATTACCT TGTCGTATAA AGAAAACTTG GATACACATC CATCCTTTAT 4920

CAACACTATA ACCAAAGACA TCACGAATCG TTTTATCTGA TGACATAATT TTTTGTTTGT 4980

TTGTCAGATT TTGAATATGT TGAATTAAAT CTCTATATTC TTTAGCCCGT TCAAAATCAA 5040 

GTGATTCACT TGCAGTTAAC ATTCGCTCTT CTAAACTTTT TAAAATTGTT TTGTCTTCCC 5100 

CATTCAGAAA ATCAGTAATT TCCTTCGTCA TTTGTGCGTA TTTACTCAAA TCAACGTCAT 5160 

ATACACATGG TCCTAAACAT TGTCCAATAT GGTAATAAAG ACATAATTTA TCTGGCATCT 5220 

TATCACATTT GCGATATGGA TATATTCTGT CTAATAACTT TTTAGTTTCT TGAGCAGAAT 5280

ATGCATTCGG ATACGGTCCG AAATATTTGC CAGTACCTTG TTTTACAGTT CTCGTCACTA 5340

GTAGTCTAGG ATATTTCTCC TTCGTAATTT TAATAAATGG ATAACTTTTA TCATCCTTTA 5400

ATAATATATT ATATCTTGGT TGATATTGTT TAATCAGATT CAATTCCAGT AAAAGTGATT 5460 
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TTTTAGCATC ATGAGCACCC GTAAAATATG ATCGCAATC 
(2) INFORMATION FOR SEQ ID NO: 200:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4594 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 200:



AAATCAATCG AGTGGCATGT CAAGGTCATA TCAATATTTT AGAATCTGCG ACTATGAGAG 60

AGGAAATAAA TGAAATTGCG CGACGTATCA TCGTTGATAT TCGTGATAAG CAATTACGAT 120

ATCAAGATAT TGCTATTTTA TATCGTGATG AATCTTATGC TTATTTATTT GATTCCATAT 180

TACCGCTTTA TAATATTCCT TATAATATTG ATACAAAGCG TTCGATGACA CATCATCCGG 240

TCATGGAAAT GATTCGTTCA TTGATTGAAG TTATTCAATC TAATTGGCAA GTGAATCCAA 300

TGCTACGCTT ATTGAAGACT GATGTGTTAA CGGCATCATA TCTAAAAAGT GCATACTTAG 360

TTGATTTACT TGAAAATTTT GTACTTGAAC GTGGTATATA CGGTAAACGT TGGTTAGATG 420

ATGAGCTATT TAATGTCGAA CATTTTAGCA AAATGGGGCG TAAAGCGCAT AAACTGACCG 480

AAGATGAACG TAACACATTT GAACAAGTCG TTAAGTTAAA GAAAGATGTC ATTGATAAAA 540

TTTTACATTT TGAAAAGCAA ATGTCACAAG CGGAAACTGT AAAAGATTTT GCAACTGCTT 600

TTTATGAAAG TATGGAATAT TTCGAACTGC CAAATCAATT GATGACAGAG CGAGATGAAC 660

TTGATTTAAA TGGTAATCAT GAAAAGGCGG AGGAAATTGA TCAAATATGG AATGGCTTAA 720

TTCAAATCCT TGATGACTTA GTTCTAGTAT TTGGAGATGA ACCAATGTCG ATGGAACGTT 780

TCTTAGAAGT ATTTGATATT GGTTTAGAAC AATTAGAATT TGTTATGATT CCGCAAACAT 840

TGGACCAAGT AAGTATTGGT ACGATGGATT TGGCTAAAGT CGATAATAAG CAACATGTTT 900

ACTTAGTAGG TATGAATGAT GGAACGATGC CACAACCAGT AmTGCGTCAA GCTTGATTAC 960

AGATGAAGAA AAGAAATACT TTGAACAGCA GGCTAATGTC GAGTTAAGTC CAACATCAGA 1020

TATTTTACAG ATGGATGAAG CATTTGTTTG TTATGTTGCT ATGACTAGAG CTAAGGGAGA 1080

TGTTACATTT TCTTACAGTC TAATGGGATC AAGTGGTGAT GATAAGGAGA TCAGCCCATT 1140

TTTAAATCAA ATTCAATCAT TGTTCAACCA ATTGGAAATT ACTAACATTC CTCAATACCA 1200

TGAAGTTAAC CCATTGTCAC TAATGCAACA TGCTAAGCAA ACCAAAATTA CATTATTTGA 1260

AGCATTGCGT GCTTGGTTAT ATGATGAAAT TGTGGCTGAT AGTTGGTTAG ATGCTTATCA 1320
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GTTTGACAAT GAAACTGTAA AATTAGGTGA 
AACGTTGTCT AAAGATTTAT ATGGTAAGGA 1440

AATCAATGCC AGTGTATCCC GTTTTGAAGG TTATCAACAA TGCCCATTTA AACACTATGC 1500

GTCACATGGT CTGAAACTAA ATGAGCGAAC GAAGTATGAA CTTCAAAACT TTGATTTAGG 1560

TGATATTTTC CATTCTGTTT TAAAATATAT ATCTGAACGT ATTAATGGCG ATTTTAAACA 1620

ATTAGACCTG AAAAAAATAA GACAATTAAC GAATGAAGCA TTGGAAGAAA TTTTACCTAA 1680

AGTTCAGTTT AATTTATTAA ATTCTTCAGC TTACTATCGT TATTTATCAA GACGCATTGG 1740

CGCTATTGTA GAAACAACAC TAAGCGCATT AAAATATCAA GGCACGTATT CAAAGTTTAT 1800

GCCAAAACAT TTTGAGACAA GTTTTAGAAG GAAACCAAGA ACAAATGACG AATTAATTGC 1860

ACAAACATTA ACGACAACTC AAGGTATTCC AATTAATATT AGAGGGCAAA TTGACCGTAT 1920

CGATACGTAT ACAAAGAATG ATACAAGTTT TGTTAATATC ATTGACTATA AATCCTCTGA 1980

AGGTAGTGCG ACACTTGATT TAACGAAAGT ATATTATGGT ATGCAAATGC AAATGATGAC 2040

ATACATGGAT ATCGTTTTAC AAAATAAACA ACGCCTTGGA TTAACAGATA TTGTGAAcCA 2100

GGTGGaTTAT TATACTTCCA TGTACATGAA CCTAGAATTA AATTTAAATC ATGGTCTGAT 2160

ATTGATGAAG ATAAACTAGA ACAAGATTTA ATTAAAAAGT TTAAGTTGAG TGGTTTAGTT 2220

AATGCAGACC AAACTGTTAT TGATGCATTG GATATTCGTT TAGAACCTAA ATTCACTTCA 2280

GATATTGTAC CAGTTGGTTT GAATAAAGAT GGCTCTTTGA GTAAACGAGG CAGCCAAGTG 2340

GCAGATGAAG CAACGATTTA TAAATTCATC CAACATAACA AAGAGAATTT TATAGAAACA 2400

GCTTCAAATA TTATGGATGG ACATACTGAA GTTGCACCAT TAAAGTACAA ACAAAAATTG 2460

CCATGTGCTT TTTGTAGTTA TCAATCGGTA TGTCATGTAG ATGGCATGAT TGATAGTAAG 2520

CGATATCGAA CTGTAGATGA AACAATAAAT CCAATTGAAG CAATTCAAAA TATTAACATT 2580

AATGATGAAT TTGGGGGTGA GCAATAGATG ACAATTCCAG AGAAACCACA AGGCGTGATT 2640

TGGACTGACG CGCAATGGCA AAGTATTTAC GCAACTGGAC AAGATGTACT TGTTGCAGCC 2700

GCGGCAGGTT CAGGTAAAAC AGCTGTACTA GTTGAGCGTA TTATCCAAAA GATTTTACGT 2760

GATGGCATTG ATGTCGATCG ACTTTTAGTC GTAACGTTTA CAAACTTAAG CGCACGTGAA 2820

ATGAAGCATC GTGTAGACCA ACGTATTCAA GAGGCATCGA TTGCTGATCC TGCAAATGCA 2880

CACTTGAAAA ACCAACGCAT CAAAATTCAT CAAGCACAAA TATCTACACT CCATAGTTTT 2940

TGCTTGAAAT TAATTCAACA GCATTATGAT GTATTAAATA TTGACCCGAA CTTTAGAACA 3000

AGCAGTGAAG CTGAAAATAT TTTATTATTA GAACAAACGA TAGATGAGGT CATAGAACAA 3060

CATTACGATA TCCTTGATCC TGCTTTTATT GAATTAACAG AGCAATTGTC TTCAGATAGA 3120
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AATCCTACAA ATTGGTTGGA TCAATTGGTG ACACCATACG AAGAAGAAGC ACAACAAGCG 324 0 

CAACTTATTC AACTACTAAC AGACTTATCT AAAGTATTTA TCACAGCTGC TTATGATGCT 3300 

TTAAATAAGG CGTATGATTT GTTTAGTATG ATGGATAGCG TCGATAAACA TTTAGCTGTT 3360

ATAGAAGATG AACGACGTTT AATGGGGCGT GTTTTAGAAG GTGGCTTTAT TGATATACCT 3420 

TATTTAACTG GTCACGAATT TGGCGCGCGT TTGCCTAATG TAACAGCGAA AATTAAAGAA 3480 

GCAAATGAAA TGATGGTCGA TGCCTTAGAA GATGCTAAAC TTCAGTATAA AAAATATAAA 3540

TCATTAATTG ATAAAGTGAA GAGTGATTAC TTTTCAAGAG AAGCTGATGA TTTGAAAGCT 3600 

GATATGCAAC AATTGGCGCC ACGAGTAAAG TACCTTGCGC GTATTGTGAA AGATGTTATG 3660 

TCAGAATTCA ATCGAAAAAA GCGTAGCAAA AATATTTTGG ATTTTTCTGA TTATGAACAT 3720 

TTTGCATTAC AAATTTTAAC TAATGAGGAT GGTTCGCCTT CAGAAATTGC CGAATCATAC 3780 

CGTCAACACT TCCAAGAAAT ATTGGTCGAT GAGTATCAAG ATACGAACCG AGTTCAAGAG 3840 

AAAATACTAT CTTGCATCAA AACGGGTGAT GAACATAATG GTAATTTATT TATGGTTGGA 3900 

GATGTTAAGC AATCCATTTA TAAATTTAGA CAAGCTGATC CAAGTTTATT TATTGAAAAG 3960 

TATCAACGCT TTACTATAGA TGGAGATGGC ACTGGACGTC GAATTGATTT GTCGCAAAAC 4020

TTCCGTTCTC GAAAAGAAGT ACTGTCAACG ACTAACTATA TATTCAAACA TATGATGGAT 4080 

GAACAAGTCG GTGAAGTAAA ATATGATGAA GCGGCACAGT TGTATTATGG TGCACCATAT 4140 

GATGAATCGG ACCATCCaGT AAACTTAAAA GTCCTTGTTG AAGCGGATCA AGAACATAGT 4200

GATTTAACTG GTAGTGAACA AGAAGCGCAT TTTATAGTAG AACAAGTTAA AGATATCTTA 4260 

GAACATCAAA AAGTTTATGA TATGAAAACA GGAAGCTATA GAAGTGCGAC ATACAAGGAT 4320 

ATCGTTATTC TAGAACGCAG CTTTGGACAA GCTCGCAATT TACAACAAGC CTTTAAAAAT 4380

GAAGATATTC CATTCCATGT GAATAGTCGT GAAGGTTACT TTGAACAAAC AGAAGTCCGC 4440 

TTAGTATTAT CATTTTTAAG AGCGATAGAT AATCCATTAC AAGATATTTA TTTAGTTGGG 4500 

TTAATGCGCT CCGTTATATA TCAGTTCAAA GAAGACGAAT TAGCTCAAAT TAGAATATTG 4560 

AGTCCAAATG ATGACTACTT CTATCAATCG ATTG 4594 
(2) INFORMATION FOR SEQ ID NO: 201:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6313 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GGTTTTCnTG GAAAGATAGT GAAAATCTCG TGTTTTTTGG TTTTgAGGTG TTGTTTGTAT 60 

TTTaTAAAAT GGCTTACATA TATGAAGCGT TGATTAAGTA TGGAATTGTT AATTAATTGA 120 

ACCTATTTAG CTTTAAGAAG GCATAACAAG ATGACCTTAT TTTATGCTAT AATATTTCTA 180

TTATGCGAAG ATTAAGGTGA GTAGTAAATT GGATAAAAAA GTAAGTATTC AAACAAAGCA 240 

AGTGTTGAAA CAGCACAACG AAAAAGAAAA ATTTGAATTT ACTACTGAAG GAACTTGGCA 300 

ACAAAGGCAA TCTAACTTTA TTCGGTATGT AGAACAAATT GAGGATGCAA CAGTTAATGT 360 

TACAATAAAA GTGGATGATG ATAGCGTTAA GTTGATTCGT AAAGGCGACA TTAATATGAA 420 

TTTGCATTTT GTTGAAGGAC AAACGACAAC AACTTTTTAC GATATATCGG CTGGACGAAT 480

TCCACTAGAA GTTAAAACAT TACGCATTTT ACATTTCGTA AGTGGAGACG GTGGCAAGCT 540

AAAGATTCAT TATGAATTAT ATCAAGATAA TGAAAAAATG GGTTCTTATC AATATGAAAT 600

TAACTATAAG GAGATAGGCG AATGAATATT ATTGATCAAG TGAAACAAAC ATTAGTAGAA 660

GAAATTGCAG CAAGTATTAA CAAAGCAGGA TTAGCAGATG AGATTCCTGA TATTAAAATT 720 

GAAGTTCCTA AAGATACAAA AAATGGAGAT TATGCTACTA ATATTGCGAT GGTACTGACT 780

AAGATTGCAA AGCGTAATCC TCGTGAAATT GCTCAAGCGA TTGTTGATAA CTTAGATACT 840

GAAAAAGCAC ATGTAAAACA AATTGACATT GCTGGTCCAG GATTCATTAA TTTTTACTTA 900

GATAATCAGT ATTTAACAGC AATTATTCCT GAAGCAATTG AAAAAGGTGA TCAATTTGGA 960

CATGTAAATG AATCAAAAGG TCAAAATGTA TTGCTTGAGT ATGTTTCAGC TAACCCTACA 1020

GGAGATTTAC ATATTGGTCA TGCTAGAAAT GCAGCAGTTG GTGATGCTTT AgcTAAtATT 1080 

TTAACTGCAG CTGGCTATAA TGTAACACGT GAATATTATA TTAATGATGC TGGTAATCAA 1140 

ATTACTAACT TAGCGCGTTC GATTGAAACA CGTTTCTTTG AAGCTTTAGG TGACAATAGT 1200

TATTCAATGC CAGAAGATGG CTATAATGGA AAAGATATTA TTGAAATAGG TAAAGATTTA 1260 

GCAGAGAAAC ACCCTGAAAT TAAAGATTAT TCTGAAGAAG CACGTTTGAA AGAATTTAGA 1320 

AAATTAGGCG TAGAATACGA AATGGCTAAA TTGAAAAATG ATTTAGCAGA GTTCAATACG 1380 

CATTTTGATA ATTGGTTTAG TGAAaCATCT TTATATGAAA AAGGAGAAAT TCTTGAAGTT 1440 

TTAGCAAAAA TGAAAGAATT AGGTTATACG TATGAAGCTG ATGGCGCTAC ATGGTTACGT 1500 

ACAACTGATT TTAAAGACGA CAAAGACAGA GTATTAATTA AAAATGACGG TACATATACG 1560 

TATTTCTTAC CAGATATTGC GTACCACTTC GATAAAGTAA AACGTGGTAA TGACATTTTA 1620 

ATCGATTTAT TTGGTGCTGA TCATCATGGT TATATTAATC GTTTGAAAGC ATCTCTTGAA 1680

ACGTTTGGTG TAGATAGTAA TCGTTTAGAA ATTCAAATCA TGCAAATGGT TCGTTTAATG 1740 
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ATTATGGACG AaGTTGGCGT TGACGCTGCA CGTTATTTCT TAACTATGCG TAgTCCTGAT 1860 

AGTCACTTTG ATTTTGATAT GGAATTAGCG AAAGAGCAAT CTCAAGACAA TCCAGTTTAC 1920 

TATGCTCAAT ATGCACATGC GCGTATTTGT TCAATTTTAA AACAAGCGAA AGAGCAAGGT 1980

ATTGAAGTGA CTGCTGCGAA TGATTTTACA ACGATTACTA ATGAAAAAGC GATTGAATTG 2040 

TTGAAAAAAG TAGCTGATTT CGAACCTACA ATTGAAAGTG CTGCTGAGCA TAGATCGGCA 2100 

CATAGAATTA CTAATTATAT TCAAGATTTA GCTTCTCATT TCCATAAATT CTATAATGCT 2160 

GAAAAAGTGT TAACAGATGA TATTGAAAAA ACAAAAGCAC ATGTTGCTAT GATTGAAGCG 2220 

GTCAGAATTA CATTGAAAAA TGCATTGGCA ATGGTCGGTG TAAGCGCACC TGAATCAATG 2280 

TAAGAACATT TATATACACT CCAACGTAGA GTTTCTCGAA AGATACTTTG TGTTGGAGTG 2340 

TTTTTTTTAG GTATGTGACA TATTGGGGAA TGCTTAGTAT GTGAATAAGG TTAAGAGGAA 2400 

CACA6TTGGA TGCTCTGCAC AACTGCATAA GAGAGCCTGA GACATAAATC AATGTTCTAT 2460

GCTCTACAAA GTTATAATGG CAGTAGTTGA CTGAACGAAA ATTCGCTTGT AACAAGCTTT 2520

TTTCAATTCT AGTCAACCTT GCCGGCGGGG CCCCAACAAA GAGAAATTGG ATTCCCAATT 2580

TCTACAGACA ATGCAAGTTG GGGTGGGACG ACGAAATAAA TTTTACGATA ATATCATTTC 2640

TGTCCCACTC CCTCTAAAAT GGAGGGTGTA AATGTTAGGA ACTGATGAAT TATATAAAGT 2700 

TTTATATGAA CATCTCGGAC CACAATTTTG GTGGCCTGCT GATAATGACA TTGAAATGAT 2760 

GTTAGGTGCA ATTTTAGTTC AAAATACTAG ATGGCGAAAT GCAGAAATTG CATTGAATCA 2820

GATTAAAGAA CATACGCATT TTAATCCAAA TCATATATTA GAACTACCTA TTGAAACGTT 2880

ACAATCATTG ATACATTCAA GTGGCTTTTA TAAAAGTAAA TCACTGACGA TTAAAACATT 2940

ATTAACATGG TTAGCACGAC ATCATTTCAA TTATCAAGAG ATTAATGAGC GATATAAAGG 3000

TGGATTAAGA AAAGAATTAT TATCTTTGAA AGGTATTGGA AGTGAAACAG CAGATGTCTT 3060 

ACTTGTTTAT ATATTCGGAC GTATTGAATT TATTCCAGAT AGCTATACAA GAAAAATATA 3120 

TGATAAATTA GGATATGAAA ACACTAAAAA TTATGATCAA TTAAAAAAAG TAGTCaCATT 3180 

ACCAAATCAT TTTACAAATC AAGATGCTAA TGAATTTCAT GCTCTGTTAG ATGTATTTGG 3240 

TAAACATTAC TTTAGAGACA AAGATATAAA GAATTATGAT TTTTTAGAAC CTTACTTTAA 3300 

AAAGTAAACG CTGTGAAGTT AGATAGATGA GTTTATATGA AATATAAAAA ATAATTTACT 3360 

ATTTTCTTTT AGTATGTGGA CTTATATAAT AAATAGAAGC ATATAAAGAA AAAAACAGTT 3420 

GTTTGTTTGT GCAGCAACTG CATAAGAGCC CCTAATCGCT AAAGCTCAAG GGGAGTAAAG 3480

GAATACAGTT GTTTGTGCAG CAACTGCATA AAAGCCTCTA ATCACTAAAG GTGAAGAGGA 3540 
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AACGCAGTTG GATgCTACCG CACAACTGCA TAAATCCCTC TaATCgcTAA AGCGAAAAGT 3660 

GGGATTAAAA AGGAGATGTG ATAGTGTGAA GAAATCGTTA ATTGCTTTTA TTTTGATTTT 3720 

TATGCTTGTC CTGAGTGGCT GTGGTATGAA AGATAATGAT AAACAAGGTA GCAATGATAA 3780

TGGCTCGTCT AAATCGCCGT ACCATAGAAT TGTTTCGTTA ATGCCTAGTA ATACTGAAAT 3840

TTTATATGAA TTAGGATTAG GTAAATACAT AGTTGGTGTT TCAACGGTTG ATGATTATCC 3900

AAAAGATGTG AAAAAGGGTA AGAAACAATT TGATGCTTTG AATCTAAATA AAGAGGAACT 3960

TTTAAAGGCA AAGCCAGATC TAATTCTTGC GCATGAGTCG CAAAAGGCAA CTGCTAATAA 4020

AGTATTGTCA TCATTAGAGA AACAAGGCAT CAAAGTAGTG TATGTTAAAG ATGCACAATC 4080 

75 

AATTGATGAA ACTTACAACA CATTTAAGCA AATTGGGAAA TTAACGCATC ATGATAAGCA 4140 

GGCTGAACAA CTTGTTGAGG AAACTAAAGA TAATATCGAT AAAGTCATAG ATTCAATTCC 4200 

TGCTCATCAT AAAAAATCAA AAGTATTTAT TGAGGTTTCA TCAAAGCCTG AAATATATAC 4260 

20 

AGCAGGGAAG CATACATTTT TTAATGATAT GTTAGAAAAA TTAGAAGCCC AAAATGTGTA 4320 

TAGTGACATT AATGGTTGGA ACCCTGTAAC GAAGGAAAGT ATTATTAAAA AGAACCCAGA 4380 

25 TATATTAATT TCGACGGAAG CTAAGACAAG ATCAGATTAT ATGGATATCA TCAAAAAAAG 4440 

AGGTGGATTC AATAAAATTA ATGCTGTCAA GAATACACGT ATTGAAGTTG TAAATGGTGA 4500

TGAAGTATCA AGACCAGGTC CACGTATTGA TGAAGGATTA AAAGAATTAA GAGATGCAAT 4560

TTATAGAAAA TAAACCATTC TAATTATGCC CCTTATTGCT ACATGTAAAA AATACATGT? 4620

TGAGATAAGG GGTTTTTaAA ATATATTTAG TGAATGATAG CAACGCGAGT ATGTGATTGC 4680

TATAATGAAT GTAATTATCG ATGAAcaaAA GAGAATGCTA TGACATTTAA TAAAGTATTA 4740

TTGAGCTGGa TAGTCmTATT GATTATAACA ACTAGCATAT ATCTATTTTG GCAGTTGGGC 4800

GATATCAATG ATGTATTTAA CCAGTCTATT TTAATCAATG TTAGATTACC GAGATTATTA 4860

GAAGCATTGT TGACAGGTAT GATATTAACT GTTGCAGGCC TTATATTTCA AACAGTTTTA 4920

AATAATGCAT TGGCAGATAG CTTTACATTA GGATTGGCAA GCGGCGCTAC ATTTGGTTCA 4980

GGATTAGCAT TATTTTTAGG TTTAACAACG TTATGGATTC CTGTATTTTC AATAACATTT 5040 

AGTTTGATAA CATTAATAAC TGTATTAGTC ATTACGTCGG TATTGAGCCA AGGCTATCCA 5100 

45 

GTTAGAATCT TAATATTAAG TGGTTTAATG ATTGGTGCGT TATTCAATTC ACTTCTATAT 5160 

TTTTTGATTT TATTAAAACC TCGCAAATTA AATACAATTG CCAATTATCT GTTTGGTGGT 5220 

50 TTTGGTGATG CAGAATACTC AAATGTATCT ATAATAGCAA TCACATTTAT CATTGCATTG 5280 

TTTGGTATAT TTATCATTCT TAATCAACTA AAGTTATTGC AATTAGGAGA ACTAAAAAGT 5340 
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ATAACGGCGA TAAATGTCGC ATATGTTGGC ATCATTGGAT TCATTGGTAT GGTGATACCG 5460 

CAACTCATTA GAAAATGGCA GTGGAAACAA TCATTAGGAA GACAATTGGC TTTAAATATT 5520 

5 GTAACTGGAG GACAAATAAT GGTTATGGCA GATTTTATTG GTAGCCATAT ATTGTCACCA 5580 

GTACAAATAC CGGCAAGTAT TATCATTGCA TTAATTGGTA TACCAGTGTT AtTTTACaTG 5640 

CkAAwAtCtC aGTCgAAAcG GTTACaCTAG CACACGACaT TTGCTAAAAT AAAAATAACT 5700 

10 ATAAACATAA AGAGGGCATA AGCGATGGAT TTGAATCAAA TTAAAGCAGT TGTATTTGAT 5760 

TTAGAAGGTA CGTTGTTGGA CAGAGTTAAA TCTCGAGAGA AATTTATCGA AGAGCAATAT 5820 

GAACGATTTC ATGACTACTT AATTCATGTT CAACTGGCAG ATTTTAAAAA AgCATTTATT 5880

15 

GAGCTAGATG ACGATGAAGA TAATGATAAA CCTGATTTAT ATAAAGAAAT CATTAAACGT 5940 

TTCCATGTAG ATAGGTTAAC TTGGAAAGAC TTATTTAATG ATTTTGAAAT GCATTTTTAT 6000 

CGTTATGTAT TTCCTTATTA CGATACTTTG TATACACTAG AAAAgCTATC GCAAAAAGGC 6060

20 

TTTCAAATTG GTGTTATCGC AAATGGTAAA TCTAAGATTA AACAATTTCG ATTACATTCA 6120 

CTTGGTTTGA TGCATGTTAT TAATTATTTA TCAACATCAG AAACAGTTGG TTTTCGTAAA 6180

25 CCACATCCTA AAATTTTTGA AGATATGATT GATCAACTAG GGGTATTACC TGAGCAAATT 6240 

ATGTATGTTG GCGATGATGC GTTAAATGAT GTAGCTCCAG CACGAGCTAT GGGCATGGTT 6300 

AGTGTATGGT ATA 6313 

(2) INFORMATION FOR SEQ ID NO: 202:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2174 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 202:

40 

• CCGTAAACAC ATCAACAAAA GAAGGCTATA TTACAAAAGA AGACTTGGAC TTATGCTGCA 60 

CGTCGCTCTA ATTCAGCTGG AATGCAAGTC ACCGGACGAC TGGCTTACAT TGAACCTTAT 120 

GGGGCAACAA GTCGCACAAA ATAAACGCGC GAGAAGCaAG AATAGGAAGT GATATCTATG 180 

45 

AAATGGTTAT CACGAATATT AACAGTAATA GTGACCATGT CtATGGcGTG TGGTGCATTG 240 

ATATTTAATC GTAGACATCA GCTAAAGGCG AAAACGCTGA ACTTCAATCA TAAAGCATTA 300 

50 ACAATTATTA TTCCGGCTAG AAACGAAGAA AAAAGAATAG GTCATTTACT ACATTCGATA 360 

ATACAACAGC AAGTTCCAGT AGATGTCATT GTTATGAATG ACGGATCGAC AGATGAAACA 420 
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AAATGGTATG GGAAATCACA TGCTTGTTAT CAAGGTGTGA CGCATGCATG TACGAATCGC 540 

ATTGCCTTTG TAGATGCTGA TGTAACTTTC TTAAGGAAAG ATGCTGTTQA AACGTTGATT 600 

AATCAGTATC AATTACAAGG TGAAAAAGGA TTGTTAAGCG TACAGCCTTA TCATATAACA 660 

AAGCGTTTCT ACGAAGGGTT TTCAGCGATA TTTAATTTAA TGACAGTCGT TGGTATGAAT 720 

GTATTTTCTA CCTTAGACGA CGGTCGGACT AACCAGCATG CATTTGGACC GGTGACATTA 780 

10 

ACAAATAAAG AAGATTATTA TGCAACTGGA GGTCATAAAA GTGCAAACCG TCATATTATT 840 

. GAAGGATTTG CTTTAGGAAG TGCATATACT TCACAATCAT TGCCCGTAAC AGTTTATGAA 900 

GGGTTTCCAT TTGTTGCATT TCGCATGTAT CAAGAAGGAT TTCAGTCATT ACAAGAAGGA 960 

IS 

TGGACAAAGC ATTTGTCAAC TGGGGCAGGT GGCACAAAGC CTAAGATCAT GACAGCAATT 1020 

GTGTTGTGGT TGTTTGGTTC TATAGCGAGT ATTTTAGGGC TATGTCTTAG TTTAAAATAT 1080 

20 CGCCAAATGT CTGTAAGAAA AATGGTAGCA CTTTACTTGA GCTATACTAC ACAATTTATT 1140 

TATCTGCATC GAAGGGTCGG CCAATTTTCT AATTTATTAA TGGTATGTCA TCCATTGTTA 1200 

' TTTATGTTTT TTACTAAAAT TTTCATCCAA TCTTGGAAAC AAACGCATCG TTATGGTGTA 1260 

25 GTTGAATGGA AAGGTCGTCA ATATTCTATA TCTAAAGAAC AATAAATCAA GGTAATGGCA 1320 

TTTCAATATA GGAGGACTAG TATGACAATG ATT3GATATGA ATTTTAAATA TTGTCATAAA 13 80 

ATCATGAAGA AACATTCAAA AAGCTTTTCT TACGCTTTTG ACTTGTTACC AGAAGATCAA 1440 

30 AGAAAAGCGG TTTGGGCAAT TTATGCTGTG TGTCGTAAAA TTGATGACAG TATAGATGTT 1500 

TATGGCGATA TTCAATTTTT AAATCAAATA AAAGAAGATA TACAATCTAT TGAAAAATAC 1560 

CCATATGAAC ATCATCACTT TCAAAGTGAT CGTAGAATCA TGATGGCGCT TCAgCATGTT 1620 

35 

GCACAACATA AAAATATCGC CTTTCAATCT TTTTATAATC TCATTGATAC TGTATATAAA 1680 

GATCAACATT TTACAATGTT TGAAACGGAC GCTGAATTAT TCGGATATTG TTATGGTGTT 1740 

GCTGGTACAg TAGGTGAAGT ATTGACGCCG ATTTTAAGTG ATCATGAAAC ACATCAGACA 1800 

40 

TACGATGTCG CAAGAAGACT TGGTGAATCG TTGCAATTGA TTAATATATT AAGAGATGTC 1860 

GGTGAAGATT TTGACAATGA ACGGATATAT TTTAGTAAGC AACGATTAAA GCAATATGAA 1920 

45 GTTGATATTG CTGAAGTGTA CCAAAATGGT GTTAATAATC ATTATATTGA CTTATGGGAA 1980 

TATTATGCAG CTATCGCAGA AAAAGATTTT CAAGATGTTA TGGATCAAAT CAAAGTATTT 2040 

AGTATTGAAG CACAACCAAT CATAGAATTA GCAGCACGTA TATATATTGA AATACTGGAC 2100 

50 GAaGTGAGaC AGGCTAACTA TACATTACAT GAACGTGTTT TTGTGGaTAA GAGGAAAAAG 2160 

GCAAAGTTGT TTCA 2174 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4715 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 203:



GAAnCAGnTA GACAAATTAT GGaAAmCGGT GTGAATCaAG GATTCtTTGG TGTAGCTGGT 60

TTTGACCTAC TCGTCGATGA GGATGATAAC GTTTATGCGA TTGATTTAAA CTTTAGACAA 120

AATGGTTCaA CGAGCATGTT ATTACTTGCT AACGAGTTGA ATTCAGGATA TCAAAAGTTT 180

TATAGTTATC ATTCAAAAGG TGATAACACA CATTTCTTCA ATACGATTTT GAAATATGTC 240

AAAGAAGGTA GTTTATACCC GTTATCTTAT TATGATGGTG ATTGGTACGG TGAAGATAAA 300

GTTAAATCAA GGTTTGGCTG TATTTGGCAT GGTGATTCAA AAGAAACAGT ACTGGAGAAT 360

GAACGCGCAT TTTTAGCTGA ACTTGAACAC TATTAGAGTT CGGAACATAA GGCGCTACAA 420

CCAGTAGTTG ACTGAATATG CGTTTGTAAC AAGCTTTTTT CGATTCTAGT 480

CAACAGTAAT TAAATTTATG ATATGGCAAT ACTTTGTAAT ACTAATATTA AATGGCGACT 540

TTTATTTCAC TATGTTATAA GAGTTGCCAT TTTGTTGATA AAGGTATACT AAAGGTTATC 600

GTTTTGAAAT TTTTAGTAAC TAGATATGTT TCGTGTTATA GACCGAATTT GTGTATACGT 660

AAAATTTAAT GCTATTGAAT TTTTAAAATG AAAAACATGA CATTAAATTG AATTCATAAT 720

ATGTCTAATT GACTAACTTG TTGGAGTCAT TTACTATTTT ATGTATGACA TATTTTAAAA 780

AGTGAGGGTC AAGCATGTCT TATAAAGCAT ATCCATTCTT TAGAGATATA TTAATAAATG 840

AATGTATTTA TTTCGCCTCT AAAAATAAAA AACTAGTACG CCTAAATTAT AAAAGTGAAG 900

CGnATGTAGG CGTTTGGACA GAAGAAAGTG TGGCCGTATC ATTTTTAACA AGTCGTGATA 960

TTCCATTTGA TAAAGTTGTA AAAATGGACG TTGATCGTTT TGCTACTTAT GAATTAGATG 1020

AATTGTTTGA TGAACAAGAC CATATTATTA TGAATCAAAC AATGGAAGAw GAAGGGCATC 1080

TACTAAACGT TGTAGCTGTT ACACAAGAAG TGATGACGGA ATTAGATAAA ATTAGAATCA 1140

AAGAATTTGT CCAAGATGTA GCGAAATATG ATGAAGTATA CGGCTTAACT AAAAAAGGTA 1200

GTAAGCAGTT TATTCTCATT AGTGAAAATG ATAGCGACGA AAAAAAGCCG CATATTATGC 1260

CTGTATGGAG TATTAAAAAC AGAGCGTTAA AAGTTCGAGA TGAAGATTTT GAAGAGTGTG 1320

ATTTAATTAC GATTGAAGGT TCTGTTTTCG GAGAATGGCT AGATGAACTT AGAGATGATC 1380

ATAAAGCCGT TGCGATAGAT TTAAAAACTG GCGTGGTTGG TACAATTGTT TCAGCGCAAA 1440

ATGGAACAAT ACGTATTCAA AACACTTAGA CCATAAAATA AAAGGCCATT TATATAGCGT 1560

TTATTTAAAA CAACGCGCAT ATAAATGGTC TTTTTCTATT TTTCTAAATA TAATGCACCA 1620

ATAGCACCTG XlAAAATGCGC CGTTTTCAAC ATAGTACGGT TTGCAACCGC GTAACACAGT 1680

ATAATCTTCC ACAACTTTGC GTAATAAAGC GTTATTATGA AATGAAGAAC CGATATAAAC 1740

GATATTTTCA GTTTTAAATT CACGTGCAAC AGTAATGGCC ATTGTCGTAA CAACTTCGCC 1800

AACGACACCA ATAACGGCTG CTAATTTATT GCTAGGTGTA AAATCAGCAT CTAAATGATG 1860

TAGTACATGA CCAAAATTAG CTGCTGTTAA ATCACCGGGA ATGGGTGGTT CGGTATCTTT 1920

ATAAATATGT CTAACCTTTA AATCGATAGT GTTACGATCA CCGTGTTGTG CCATGTCAGT 1980

TAACTGTTTA TAATCAGTGA TTTGACTTAG TAAATAACCG AGTCCTTGAA TCATGCCTCC 2040

ACCTGTACCG ATACCGCCTA CACGACGTTG TGATTGGCCG TCGAAATAAT GTAGTGACGT 2100

ACCGGTACCA ACATTTGCAA AAATATAATC TGCTAAGTCA TGGCCTTGCT CTTTTAACAA 2160

AATACCTAGT CCTTGAGATG CAGCATCAAA CTCTACAAAA ATTTGTGCAG GAATGTTGAT 2220

GTTTTCAGCA ATGACACCTG CATTACCTCC AGTTAAGCAT AATTTTTCAA TTTGCTGTTG 2280

GTTTAACCAT TCCACAACTT GATCAATATT TTTAGTTAAT TCAGTTTTAA AAGTACGTTG 2340

GTTATCTTGC TCTTGAACGA TTTTAATTAG TGTACCGCCA GCGTCAATGC CAACTTTCAT 2400

AAGATTCCCA CCTCATTATT AATGTCTATC CTTAAATAAT AGTATAGTAA AATGACTAAA 2460

AAACAAGTAA TAATAGTAAT TATTAACAAA TTTGATGCCa TTGCATTTCA ACATTGTAAG 2520

cGTATCGCAA TTAAtGTTTT ACAAACGTGG ACGTTAAGTt ATATATATTA TTTTCTAGGA 2580

ATTTTGAAGT TGTATAGGAT TGTTAGTTAG TGACGCAATA TTAAAAGTAG TTCGTACGCA 2640

GTGTATTTGT AAGTCTCTGA TTAAAATGAT AAGTAATGAG GAATAGTACA TTAATTTTGA 2700

AATTTAAAAA ATATAAATAA GTAATTTATT TAACTTAGAG CAAATAATGG TATCGTAGTG 2760

AAATAATAGG TAAAATAATA TGGGGATTCA TGCTTCATAT ATAAAAAGAT AGGGGTTAAA 2820

TATATGGCTA AAGAACTTTG TTTTGAAGGT ATCACTTTAA AAGCATTTGA TGAACAATAT 2880

CGTTCAGCAA TTAATGATTT TGACTTGAAT GAAAGACAAC AAATATATTC ATCTTTACCT 2940

AAAGAAGTTA TTGATGATGC AATTAATGAT GCTGATAGGA TTGCTAACGT AGCAwTAAmC 3000

GATAAAAATG AAGTGGTGGG CTTTTTTGTA TTACATCGTT ACTATCAGCA TGAAGGTTAT 3060

GATACACCTG AAAATGTCGT TTATATTCGT TCATTATCGA TTAATGAAAA ATATCAAGGT 3120

TTTGGATATG GCACGAAAAT AATGATGTCA TTGCCGCAAT ATGTTCAAGG TGTATTTCCT 3180

GATTTTAATC ATCTATATCT AGTAGTAGAT GCGGAAAATG ACAATGCTTG GAACCTATAC 3240
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10 



15 



CTATATTACT TGGACTTAGA TTCAAAACAT GTTTCATCAT TAAAGCTTGA AGAAGAAAGT 3360 

CGTTCAGAAG TGACCAATGT ACATATCATT AATTTAATGA TTGATGGCCA AAAGGTTGGC 3420 

TTTATCGCAT TGGAGCAGAT TGGTGAACGC ATGAACATTG CTGCTATTGA AGTGGATAAA 3480 

TCATATCGCT TTAATGGTAT TGGTTCAAGT GCTCTGCGAC AATTGCCAAC TTACTTAAGA 3540 

AAAAACTATG ACAACCTTAA TGTGATTACG ATGATTCTGT TTGGAGAGAA TAATGATTTT 36 00 

AAACCATTAT GTTTAAATAG TAATTTCGTT GAAATCGAAC AAACTGATGA TTATGTCGTT 3660 

TTCGAAAAAT ATTTAAATTA CTAACAGTGA TTGCGAAATA TGATATTGTC ATTTATAATT • 3720 

- TAGTTTTGTT ACTATATATA AATGAATTCA GACGTATAAA TTTAGATTAT ATCCTTCGAA 3780 

AGGAAGTATT GGGCAATGAA AATTCAAGAT TATACAAAAC AAATGGTTGA TGAAAAATCA 3840 

TTTATTGATA TGGCTTATAC ATTATTGAAT GATAAAGGCG AAACAATGAm mTTATATGAT 3 900 

20 ATyATCGATG AATTTAGAGC GTTAGGTGAT TATGAGTACG AAGAAATTGA AAATCGTGTT 3 960 

GTACAATTTT ACACGGATTT AAACACAGAT GGTCGTTTTT TAAATGTTGG AGAAAATTTA 4020 

TGGGGATTAC GTGATTGGTA TTCGGTAGAT GATATTGAAG AGAAAATCGC ACCAACTATT 4080 

25 CAAAAATTCG ATATTCTGGA TGCAGATGAT GAAGAAGATC AAAACTTAAA ATTATTGGGC 4140 

GAAGATGAAA TGGATGACGA CGATGATATT CCAGCTCAAA CAGATGATCA AGAAGAACTA 4200 

AATGATCCAG AAGATGAGCA GGTTGAAGAA GAAATCAATC ATTCGGATAT AGTCATTGAA 4260 

GAAGATGAAG ATGAACTAGA CGAAGACGAA GAAGTGTTTG AAGACGAAGA AGACTTCAAC 4320 

GATTAATTTT TTGTTTGACT TTTAGTTGAA AGATGATAAA ATTTTATTCG GGCTCCTTTA 4380 

AATAGGACAC GTGTATAAAA TTTATACGCT CCCCTTACAG AATTTGTGAG AGGGAGCGTT 4440 

TTTTLATTTA ATTGAGTAAA TCAAGAAATG ATAACGCAAA AATCAAAGTT GTAAATGATA 4500 

TACAJAGTGA CATAGCAGTA TGGAAACGGT AAGTAAACAG AATTTAATTT TGTCGAtTCG 4560 

ACAAtAAaCA aCTtGAaTGA GCTTGCTTTA ATGTTATGTn nTACGTAATT TTTACAATTG 4620 

ATGAGGAAGC ATTCCCTTTA ATAATTAGGA GGTCAAGACA TGACAAAATT TATTTTTGTA 4680 

ACAGGTGGCG TAGTTTCATC CATTAGGGGA AGGGT 4715 

(2) INFORMATION FOR SEQ ID NO: 204:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 918 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



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ATAATAACTG AAATTAAAAT TGCTAAATmG TGTTaAgCTA TCGCmACAAT GAAAATwCCG 60 

ATTTTGCGTT GTTGAAAATA TCTTTCCAAA CCAAGAATCG ATAATGGCAA TAAATATAAT 120 

5 

AAATTTCCAT AAAATGACCA AGTAAAATTA AAGTATATAA CGACAGTTGA CATGCCGTAT 180 

AAAATCGTAG CGATCATATT TGCTGAGCGT TTAAAGTGTA ATATTTTAAA TAAGTAGAAG 24 0 

GTCACGAGAA ATGTTATGAT AGCTCGTATC ATAGCCATAA TAAGTTGGTT TGTCGGCCAA 3 00 

AAATGTATTG TCGTCGGATT AAATATACCA ACCGTTTCTC CTATTTTAAT GAAkAGAAAA 360 

TTTAGCCACA TTAAAGGTGA CAGCGAATAA TAATnTGATA GTCCTTTCAT ATAATCGCCA 420 

15 CCTAmTCCAA ACGATGCATC ATrTAAACTA GAAnAACTAC GTAGATGTTC ATACAnATAC 480 

ATTTGAAATG GCATCATTTG ACGGAATCCA TCTCCAGCCC CGCTAAAAAC AGTACCATTC 540 

ACAATATAAT CATAGATATG AGTAGAAAAT AAAATAAGCG TTAATATTAC ACTAATGAAA . 600 

20 GTTATAACAA AGAATTGTTT GACGTTTGAA TTTAGCCACT TTTTTAACAC AACATTATCC 660 

TCAACTTTCA AATTTAAAAT TAAGTTTAAC TGAAACTAAA GTTAATGAGG TTCTTGATAG 720 

GTAAAGACGA AGATGACTGT GGAACAGATA CCTTATCATA GTTACTTAAA CTTTGGATCA 780 

25 TTTTCAGTTT ATCATTAAAC AAATATATTG AATAATAAAa aTGTCATACT GATAAAGATG 84 0 

AATGTCACTT AATAAGTAAC TTAGaTTTAA CAAATGATGA TTTTTAATTG TAGAAAACTT 900 

GAAATAATCA OcTATACC 918 

30 

(2) INFORMATION FOR SEQ ID NO: 205: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 16397 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 205:

40 

TCGCCCnATA ATCAATTTAT TTTTCATGTG CCACTCCTAT ACAAGCTnAC AATGCTTCTT 60 

CAGTTAAGGC AATATCTTTT AATTTTGTTT GATATTTTTG TTCAAAGTCA TATTGTAACT 120 

45 GAACAATTTC TGGCAAACCA ATATGCCAAT CCGCCAATTT TTTTTTAyCT TtGAAGAGCT 180 

CTTTTGGTGA TGkTTGcGAC ACTATACTAC CTTCTTTCAT AACGATGACT TCATCTGCAT 24 0 

AACGCGCGAC TTCATTCATA TCATGTGAAA TTAGGATAAT TGCCTTATTT TCATCTGTTT 300 

50 GTAGTGACTT TAGTAATCTC ATTACTTGTC GTTTACTTTG TGGATCAAGT CCTGCTGTAG - 360 

GTTCATCAAC CACGATAATA TCAGGATTCA TTGCCAATAT CGATACAATC GCTATTTTAC 420 
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AATCCATCAA CAGACGATGG GCATAGTTTT TGGCTTCATC TAAATTCATT TTAAAGTTTT 540 

TAGGTCCAAA TATCATTTCA CGCTCTACTG TGTCCTCAAA TAATTGAGAT TCGGGAAATT 600 

5 

GAAATACCAT TCCAATTCTT TTTCTTACAG GTCTAATATA TTTATCTTTG GTCTTATGTG 660 

TAATAGTAAT GTCATCAACT GTAACTGTCC CAGTAGTCGG CTTTAACAGC GCATTAATAT 720 

TTTGTATCAA CGTTGATTTA CCACTACCCG TTTGTCCAAC GATGGCGTAA TATTTACCTT 780 

10 

GTTCAAATTC TGTATTAACA TCATGAATAG CTTGATGCTG ATATGGTGTC CCTTTTTGAT 840 

AGGTATAACT TACATTGTCA AACCGTATAG TCATAGTTGA TCCACCAGCC CTTCATAAGT 900 

TAAGAATGAT GTTTGGTGTC CCAGCATTTG ATTTATTTTG ATTGGGAATG GCAAATCTAG 960 

is 

ACCTATTCTT GTTAACTCTT CTGCATTGTC GAAAATTTCA GTCGCTGTGC CTTCTTTATA 1020 

GACAGTCCCT TTATTCATAA CGATAACATG ATCTGCTTCC ATCGCCTCAG ATAAATCATG 1080

2Q CGTAATAGAA ATGATTGTAA TATTATGTTC TGATTTAACT TTTCTCACTA AATCCAATAA 1140 

ATTTTGACGT GCATCAGGAT CTAACATAGA AGTCGCCTCA TCTAATATAA TGACAGAGGG 1200 

GTTAAGTGCT AATACACTTG CTATAGCCAC ACGCTGCTTC TGTCCCCCCG ATAATGCATT 1260 

25 AGGTTCATAA TCTGCACGTT CTAACATATC AACTTGTTTA AGTGCTTCGC TGACTCTTCT 1320 

ATGCATTTCG TCATATGGAA CCGCATGATT TTCGAGTCCA AATGCCACAT CGTATTTTAC 1380 

AATTGAACCA ACAAATTGAT TATCCGGATT CTGAAATACA ATTCCTATGT CTTTTCTTAA 1440

30 CTTTTCAAAA TTATCATCAG TTATAGCTTG ATTATTATAA AAAATTTCTC CAGATTTAAC 1500 

TTTCTCTATG CCAATCATTA ACTTGGcAAT TGTAGATTTT CCAGAACCGT TATGACCAAC 1560 

AATAGATGTC CACTGACCTT TAGGTATATT AAAAGAAACA TCTTTCAATG TGAAGGATGC 1620 

35 

ATCACTTTGA TATTGAAATG AAACATTTTT AAATACAATA ACTGAATTCT TATCCTCCAC 1680 

TTGTCTCTCT CCTTTACGAT TCGTGTATCT ATCATATTTT ACAATATTTA TAAATCGCTG 1740 

TATATGACAT TGACTGGGTT CTCTATATAT TACTAGTATT TTCTGACTCA TTTCTAGTCT 1800

40 

TTAAAGTGTT GTTTAACAAC TAATGATAAG GACTTTTATT CCTCTCTAAC AATTATGTAT 1860 

AAACGTTAAT AAAATAAATG ATTTACTAAT ATAGGGGTGG TCGCGTTTGA TTCAACGATA 1920 

ATACTTTCAC TTCATTCAGT TCTAGTGAAA TTGATCAAAC TAGCTTCATC ATATTTTTAG 1980 

45 

ATTCGCACTC AAAAAAGTAA ATATAAAGAA ATCGGACTTA AAAACATTTC TGTTCATAAG 2040 

TCCGATATTT TATTCAATAA AAAAGCGCGC ACCCCATCAT AAGTTTGTTG AGTTCACGCT 2100 

so TTAAATCTTT ATTTAGTTGA TGGGGTACTC TGAGCTAGAC AATATTTGTA TGTGGCAAAC 2160 

ATTAtCGTTG CACTCATTTG CTTTATATAA AAGTAGTTAG TGTATTTATA TAAATTCTTA 2220 
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ACGAGTGTAA CCACCTTGAC GTTCTGTGTA AcGCTCTGCG ATTTCACCAA ATAATTTTTG 2340 

AAGTGCAGTT TGTGTAGTTT CATCTTCGTT TAAGATTTCA ACATTACGTA AAGTTTTAGC 24 00 

TGCATTACGA CGAGAAGCTA AATCTCCTTT TTTACCTAAA GTGATTAATT TCTCAACAAC 24 60 

ACTGCGAACT TCTTTTGCAC GAGCTTCTGT AGTTTCAATA CGTTCACTAA TAATAAGTGA 2520 

TGTAGCTAAG TCACGTAACA TAGCTTTACG TTGATCAGAA GTACGACCTA ATTTTCTGTA 2580

10 

ACCCATGAGT TAACCTCCTT TATCAATCTT CTTTTCTTAA TCCTAATCCT AAATCTTCTA 2640 

ATTTGTATTT AACTTCTTCT AAAGATTTAC GACCTAAATT ACGCACTTTC ATCATGTCAG 2700 

CTTCAGATTT GTCAGCTAAC TCTTGAACAG AATTGATTCC TGCGCGTTTT AAGCAGTTAT 2760 

15 

ATGAACGTAC AGATAAGTCT AATTCTTCAA TAGACATTTC TAATACTTTT TCTTTTTGAT 2820 

CTTCTTCTTT TTCAATCATG ATTTCAGCGT TTTGCGCTTC ATCAGTAAGA CCAAGGAAGA 2880 

TATTCAAGTG TTCAGTCATT ATTTTTGCTG CTAATGAAAC TGATTCTTGT GGTGTGATTG 2940 

20 

AACCATTAGT CCAAACATCC AATGTTAATT TATCAAAATC ACTGCTTTGA CCTACACGTG 3000 

TATTTTCAAC AGTATAGTTC ACACGTTCAA CAGGTGAATA CAATGAATCA ACAGGGATTA 3060 

CACCAATTGG TAAATCACTA GTATTATTTT GTTCTGCTAA TGCGTAACCT CTACCCTTGT 3120

TAGCAACTAG ACGAATTTTT AAGTGACCAC CTTTAGATAC TGTTGCAATT TTAAGCTCTG 3180 

GGTTTAAAAT TTCAACATCA CTATCATGTG TAATGTCGCT TGCTGTTACT TCGCCTTCAT 3240 

30 CACGTACATC AATTTCTAAA GTTTTATCTT CTTCAGAGTA AATTTTCAAT GCTAATTGTT 33 00 

TAATGTTCAT AATAATTGTA GAAACATCTT CAACTACATT GTCTACTGCT GAGAATTCAT 3360 

GTAAAACTCC CTCAATTTCA ATATACTTAA cGGCTGCACC TGGTAATGAA GATAGTAGGA 3420 

35 TACGACGTAA GGAGTTTCCT AGTGTAGTAC CGTAGCCACG TTCTAGTGGT TCAACAACGA 3480 

ACTEACCGAA TTTAGCATCT TCACTAATTT CAATTGTCTC AATTCTAGGT TTTTCGATTT 3540 

CTATCATTTA AATATCCTCC TTATATACGT CGACTTAATT TAAAATGTTT GCTCAGTGAC 3600 

40 

CTGTAACAAT ACCATCATAA ATTATACACG ACGACGTTTT GGTGGACGAC AACCGTTATG 3660 

AGGTACTGGA GTAACGTCTC TGATCGCAGT TACTTCTAAA CCTGCAGATT GTAATGCACG 3720 

AATAGCTGAT TCACGACCTG GACCAGGTCC TTTAACTGTT ACTTCAACTG TTTTTAAACC 3780 

45 

ATGCTCCATA GCTGATTTAG ATGCAGTTTC AGAAGCCATT TGTGCTGCAA ATGGTGTTGA 3840 

TTTTTTAGAT CCTTTGAATC CTAATGCACC AGCTGATGAC CATGATAAAG CATTACCGAA 3900 

CTCATCAGTG ATAGTTACAA TAGTGTTGTT GAATGTTGAA CGGATGTGTG CTACACCATT 3960 

50 

TTCAATATTC TTTTTCACTC TACGTTTACG AGATACTTGT TTACGTGCCA TTTAAAATTT 4020 
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* • 


CGCGCGTkGT TTTTCGTTTT TTGACCACGA ACTGGTAAAC CACGACGGTG ACGGATACCc 4140

ACGGTATGAT GAAATTnCCA TTAAAcGTTT GATATTTAAG TTAGTTTCAC GACGTAAGTC 4200

ACCTTCGACT TTATAACCGT CTACAACTTC ACGGATGCGA CCTAATTCGT CATCAGTTAA 4260

ATCTTTCACA CGAGTATCAG CTGATACGTT AGCTTCTTCA AGAATTTTTT GAGCAGTTGA 4320

CGTACCGATA CCGTATATAT AAGTTAATGA GATAACTACG CGTTTTTCAC GTGGAATATC 4380

TACTCCTGCA ATACGTGCCA TATTAATTTA CACCTCTCTT TTATTAACCT TGTCTTTGTT 4440

TGTGTTTTGG ATTTTCACAA ATTACCATTA CTTTACCTTT ACX3TTTAATG ACTTTACATT 4500

TTTCGCAAAT AGGTTTTACT GATGGTCTTA CTTTCATTTT TATACCTCCC TATATTATGG 4560

AGTGACGATT ATTTATAACG ATAAGTAATT CTTCCGCGTG TTAAATCGTA CGGAGACATC 4620

TCAACAGTTA CTTTGTCGCC AGGTAGAATA CGAATGTAAT TCATTCTGAT TTTACCACTT 4680

ACGTGAGCnA AAATCTCATG ACCATTTTCT AATTCTACTT TAAACATTGC GTTCGGTAAA 4740

GTATCTAATA CAGTACCTTC TAATTCAATT ACATCTTGTT TAGCCATTGA TTAACTTCCC 4800

CCTTTTTGCA ATAGTAAGGT AATCGTCAAT AGACAACTTT ATTGTTACGA ATCTATCAGT 4860

GATTAATTTT ATAAGTTAAA CAAAAATTAC GGGAATTAAT TATCGTTAAT TGCCACTCTC 4920

ATCTATCTAA TATGATTAAA TCATGCCTCA CTTAAAATAG ACCGCTAAAA GTTGATCTAT 4980

TACAAATGAT CTAAAATATC AATGACATCT TTGGTAACGT CGCTAATATC TTTTGAACCA 5040

TCAATATTTT TCAATACACC TTTTTGATCA TAGAAATCTA AAATAGGCTT AGATTGTTTA 5100

ATATTAACAC TCAAACGATT AGCTACCGTT TCAGGATTAT CATCTTCTCG TTGATACAAT 5160

TTACCACCAT CGATATCACA AATACCTTCG ACTTCGGAGG ATTAAATACA AGATGATACG 5220

TTGTACCACA TGACTCACAG ATTCGACGAC CTGTAAGACG GTTCATTAAT TCTTCTTCCG 5280

GAAC3TCGAT ATTGATGACA gcatcaatgt ttctgtcaag ctcagagata atattattta 5340

ATGCCTCAGC TTGCTCGATT GTTCTTGGGA AGCCATCTAA TAAAAAGCCT TTTTTTGCAT 5400

CGTCTTCAGA AATTCTTTCC TTAACGATAC CTACAGTCAC TTCATCAGGA ACTAATTCGC 5460

CACGGTCCAT ATAAGACTTA GCTTCTTTAC CTAATTCAGT TTCTTCTTTT ATAGCTTTTC 5520

TGAACATGTC ACCAGTTGAA ATGTGGGGTA TTGGGAATTT CTTGaCAATT TCACTTGCTT 5580

GAGTTCCTTT ACCTGCGCCA GGTAAACCCA TCAAAATGAT ATTCATAAGT GCCCTCCTAA 5640

AATTATCTAC CACCAAAGCC TTTATATTCT TTTTGAGATA CTTGAGCTTC TAAAGATTTC 5700

ATTGTTTCAA TCGCTACACC AATAACGATA AGTAAACTTG TACCACCAAT CTGAATTGAT 5760

TGTGGTAATC CCATAAACTT AGTTGCTAAT ATCGGTAGAA TTGAAATAAC GGCTAAGAAG 5820
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CCAGGTCTAA TACCTGGAAC ATAGCTACCT 


TGTTTCTTAA GGTTATCAGC CATTTTTTCC 5940

GGATTAACTT GTACAAATGC ATAGAAGTAT GTGAATAGTA TAATTAGTAC AATATATACA 6000

ACCATACCAA CATTACTTGA AGGATTTGCA GCATTCGCAA TGTTTTGTGC CCATTCTTTA 6060

TCTGGATAGA ACAACGTTAA TGTTCTAGGC AGTAAGAAGA ACGCCATTGC AAAGATTACA 6120

GGAATAACAC CGGCTGAGTT CACTTTTAAA GGTAGATAAG TTGCCTGTGA ACCTAATCTT 6180

TGAGCAGTTT GTTTCTTAGC ATATTGAATC GGAATTTTAC GAACGGCTTC AAGTACATAA 6240

ATAGCACCTA CAGTTAATAG TATCAGTGAC ACTAAAAGTC CTAATACTTT CAACCATGCT 6300

AATGATGTAT CTTCTTGCCC AACGAACGCA tTTGTcCAAA TTGAATTAGA CTGGCTGGCA 6360

ACGTTGATAA AATACCCGCA AATATGATAA TAGAAATACC ATTACCAACA CCGAACTGAG 6420

TGATTTGATC ACCAAGCCAT ATTAAGAAAG CAGTTCCTGC TGTnCAAAAC TAGTGCTATT 6480

AATAAATAAC TCATAATTGA CTGATTGATA ATCAGCGCAC CTTTGAGATA ATTATTAAAT 6540

TGGAATGCCA TACCTATAGA TTGGATAAAT GCTAAAGAAA TTGCTAAATA ACGAGTAACG 6600

TTATTTAACT TTCTTCTACC TACTTCACCT TGTTTTGCCC ATTCTGAGAA TTTAGGGACA 6660

ATATCCATTT GTAATAATTG CATTACGATT GATGCAGTGA TGTAGGGTAC AATACCCATT 6720

GCAAAAATAG AAAATCGTTT cAAGGCTCCG CCACCAAAAG TATTTAATAA CTCAGTGGCA 6780

CCTTGAGAAC CTTGGGGATT ATCAAAAGCT GCAGGATTTA CTCCTGGAGC TGGTATATAA 6840

GTCCCTATTT TAAAAATTAC TAACATTGCT AGTGTGAAGA AAATCTTGTT ACGAACCTCT 6900

TTTGTTCTAA AGAAGTTCAC AAGGGTTTGA ATCATTAGAT CACCTCGTGT GcTCCACCTT 6960

TAGCATCAAT AGCTTCTGCT GCTGAAGCTG AGAATTTATG AGCTTTCACT GTCAATTTCT 7020

TATCAAGTGA ACCATTACCT AGTATTTTGA TACCAGATTT TTCATTCTTA ACAACACCAG 7080

ATTCTACTAA TAAAGCTGGA GTTACTTCAG TACCATCTTC AAATTTATTA AGTTGGTCTA 7140

AGTTAACAAT AGCATATTCT TTACGATTTA TGTTAGTAAA ACCACGTTTT GGTAAACGAC 7200

GGAATAATGG TAATTGACCA CCTTCAAATC CTGGTCTTAC ACCACCGCCT GAACGAGCTT 7260

TTTGACCTTT GTGTCCGCGA CCACTTGTTT TACCGTTACC TGTCGCAACA CCACGTCCAA 7320

CACGATTGCG TTCTTTACGT GAACCTTCTG CCGGTTTTAA CTCATGTAAT TTCATTTCGG 7380

CACCTCCTTG ATTATTTTTC TTCTACTGTT ACTAAGTGCT TAACTTTGTT GATTTGCCCA 7440

CGAATAGCAG GGTTATCTTC AACAACTACT GAACTGTTAG TCTTTTTAAG ACCTAAAGCT 7500

TCAACAGTTT TACGTTGTGT TTCAGGACGA CCAATAACAC TACGAGTGAG GGTAATTTGT 7560

AATTTAGCCA TAACTAGTTT TCCCTCCTTA ATTGTATAAT TCTTCTACTG TTTTGCCACG 7620
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CATGTTGATT GGTGTGTTTG ATCCTAATGA TTTACTTAAG ATATCAGTGA TACCTGCTAA 7740 

TTCAAGTACG GCACGAACAG GACCACCAGC GATAACTCCT GTACCAGGTG CAGCCGGTTT 7800 

5 CATAAATACG CTTCCTGAAC CGTAACGGCC AGTAATTGTG TGTGGAGTTG TACCTTCAAC 7860 

ACGTGGAACA ACTACTAAAT CTTtTTTAGC TGCTTCAACA GCTTTTTTGA TTGCTTCTGG 7920 

TACCTCTTGA GCTTTACCAG TACCGAAACC TACACGACCA TTTTTGTCTC CAACTACAAC 7980 

TAATGCAGTG AAACGGAAAC GACGACCACC TTTTACAACT TTTGCTACAC GGTTGATTGT 8040 

AACAACGCGT TCTTCAAATT CTTTCGTCTC TTCTTCtCTA CGAGCCATGT ATTTGTCCCT 8100 

CCTTTAAATT AAAATTCTAA TCCGCTTTCT CTTGCTGCTT CAGCTAATGC TTTAAcACGT 8160 

75 • 

CCGTGATATA AATATCCTCC ACGGTCAAAT ACGATTTCTT TAATGCCTTT GTCAGCAGCT 8220 

TTTTTAGCAA TTGCTTCACC GACTTTAGTT GCTAATTCAA CTTTAGTTGC TGTAGTAGCA 8280 

ATGTCGCTGT CTTTTGAAGA AGCTTGAGCT AATGTTACGC CTTTATTATC ATCAATAATT 8340

TGAGCGTAGA TATGCTTGTT TGAACGATAT ACGTTTAAAC GTGGCTTTTC AGCTGTACCT 8400 

GATAAGTTAG TACGAACACG AGCATGTCTT TTTAAACGCA CTTTATTTTT ATCAATTTTA 8460 

25 CTGATCATTT CAATACTCCT TTCTTTAGAG TTTATCTATT ATTTACCAGT TTTACCTTCT 8520 

TTACGGCGAA CGTATTCACC TTGGTAACGA ATACCTTTAC CTTTGTAAGG CTCTGGAGGT 8580 

CTTACTGAAC GGATGTTAGA TGCTAATGCT CCAACTTGTT CTTTTGAAAT ACCTTCAACT 8640 

30 TTAACGACTG TGTTTTTCTC AACTGAGAAA GTAATGTTTT CTTCAGCTTT AATTTCTACT 8700 

GGGTGAGAAT AACCAACGTT AAGGATTAAG TCTTTACCTT GCATTTGAGC ACGGTAACCT 8760 

ACACCAACAA GTTCAAGTAC TTTTACGTAT CCTTGAGAAA CACCTTGTAC CATATTGTTT 8820 

35 

AATAAAGCAC GAGTTGTACC ATGGTTTGTT CTATCTTCTT TAGAATCAGA TGGTCTTACA 8880 

ACTTCAATTG TGTTTTCTTC TTGTTTGAAT GTCATTCTTT CATTTAAAGT TCTTGATAAT 8940 

TCACCTTTAG GACCTTTAAC AGTTACATGA TTTCCATCAA AAGTTACTGT TACGTCACTA 9000 

40 

GGGATGTCAA TAATTTTCTT ACCAACACGA CTCATGTTAT GGCACCTCCT TATTTTTTAT 9060 

TACCAAACGT ATGCGATAAT TTCTCCACCA ACATTACGTT TTCTTGCTTC TTTGTCAGTG 9120 

45 ATTACACCTT CAGAAGTTGA TACTAATGCA ATACCTAAAC CATTTAATAC TTTAGGCATT 9180 

TCGCTAGCTT TTGCATAAAC ACGTAAACCT GGTTTTGAAA TACGTTTTAA TCCTGTGATA 9240 

ACACGCTCAT CGTTTTGACC ATATTTTAAG AATAAACGAA GTACACCTTG TTTATCATCT 9300 

TCTACGTATT CAACATTTTT AATGAAACCT TCACTCTTTA AGATTTCAGC AATTTCTTTT 9360 

TTAATATTTG ATGCAGGTAA TTCTAACTTC TCGTGACGCA CCATGTTTGC GTTTCTTACA 9420 
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TCTTTTTTAT TACCAGCTAG CTTTACGAAC 
GCCAGGGATT TGGCCTTTGT AAGCTAATTC 9540

ACGGAAACAA ATACGGCATA ATTTAAATTT ACGATATACA GAATGTGGAC GGCCACAACG 9600

TTCACAACGA GTGTATTCAC GAACTGCATA TTTTTGTTTT TTTTGTTGCT TAGCAACCAT 9660

TGAAGTTTTA GCCACTTAAT TAGCCTCCTT TAAATAATTA TTTACGGAAT GGCATACCGA 9720

AGTTAGCTAA CAATTCACGA GCTTCTTCAT CAGTGTTAGC AGTCGTTACG ATAACAATAT 9780

CCATTCCTCT AACTTTACTT ACTTTATCAT AGTCGATTTC TGGGAAAATT AATTGTTCTT 9840

TAACACCTAA AGTGTAGTTA CCGCGTCCGT CAAATGCTTT TTTAGAAACA CCTTGGAAGT 9900

CACGTACACG TGGTaATGAT ACTGAAATTA ATTTGTCTAA GAATTCATAC ATTCTTTCAC 9960

CGCGAAGTGT TACTTTCGCA CCGATTGGCA TACCTTCACG TAAACGGAAA GTCGCGATTG 10020

aTTTTTTAGC TTTAGTTACT AATGGtTTTT GACCAGTGAT CAATTCTAAT TCTTCAACAG 10080

CATTGTCTAA TACTTTAGAA TTTTGTACTG CGTCACCTAC ACCCATGTTC ACAACGATTT 10140

TATCTATTTT TGGTACTTCC ATTACTGAAC TATAATTGAA TTTTTTCATT AAGTTTTCAG 10200

TAACTTCAGT GTTaAACTTT TCtTTTaAAC GGTTCaAAGT GGGATCCTCC TTTCaACTTG 10260

TtATTAATTA TTAGAkTTAA TTTCTTCGCC AGATTTTTTA GCGATACGAA CTTTTTTACC 10320

ATCAACAAAT TTGTAACCTA CACGAGTTGg TTCGTTTGTT TTAGGGTCCA ATAATTGTAC 10380

ATTAGAAACA TGGATTGCTG CCTCTGTTTC TAAGATTCCA CCTTCAGGAT TTAATTGAGT 10440

TGGTTTTTGG TGTTTTTTCA TAATGTTAAC ACCTTCCACA ACGACACGGT CTTTTTTAGG 10500

TAGAGTAGCA ATTACTTTAC CTTCTTTACC TTTGTCTTTA CCTGCGATAA CTTTAACGTT 10560

GTCACCTTTT TTGATATGCA TGTGGGCACC TCCTTATTTG TATTGGTTGT TATTAATTAA 10620

AGTACTTCTG GTGCTAATGA TACGATTTTC ATGAAGTTAC CTTCACGTAA TTCACGAGCA 10680

ACAOTTCCGA AGATACGAGT ACCACGTGGG CCTTTGTCAT CACGGATGAT AACACATGCA 10740

TTTTCATCAA ATTTGATGTA TGAACCGTCA TTACGACGAA CACCTGACTT AGTACGTACG 10800

ATTACAGCTT TGACAACGTC ACCTTTTTTA ACAACGCCAC CTGGTGTTGC ATTTTTAACA 10860

GTACATACGA TAACATCGCC GATGTTTGCT GTTTTACGAC CAGATCCACC TAATACTTTG 10920

ATTGTAAGAA CTTCACGAGC ACCAGAGTTG TCTGCTACTT TCAAGCGTGT TTCTTGTTGG 10980

ATCATTAGTT AAACCTCCCT TATCTCTAAA CTTGTATTAA ATAATTACTG ACTCTTCAAC 11040

AATCTCTACT AAACGAAAAC GTTTTGTTGC TGATAAAGGA CGAGTTTCTT GAATTTTAAC 11100

AATGTCTCCT AATTTAGCTG AATTGTTTTC ATCATGAGTT TTGTATTTTT TAGAGTATTT 11160

TACTCGTTTA CCGTATAATT TGTGTGTTTT GTAAGTTTCA ACAAGTACTG TAATAGTCTT 11220
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TTTTGTAACC TCCTCTTACT TAATTATTGA TTAGCCTTAC TTTGTTCAAT TTCTCTTTCA 11340 

CGAGCAACAG TTTTTAGACG TGCAATCGTT TTTCTTACTG TACGAATACG TGCAGTTTCT 11400 

5 TCTAATTGAC CTGTAGCTAA CTGAAAGCGT AGGTTAAAAA GCTCTTCTTT TGAAGATTTG 11460 

ATTTGTTCTT CGATTTCTGA AGTGGTTAAG TCTCTAATTT CCTTAGCTTT CATTTGTTTC 11520 

ACCACCCAAT TCCTCACGTT TTACAAACTT AGTTTTTACT GGAAGTTTGT GACTTGCTAA 11580 

10 

ACGTAGTGcT TCACGCGCAA CTTCTTCAGA AACGCCAGCA ACTTCGAATA AAATTCTACC 11640 

TGGTTTAACA ACTGCGATCC AGCCTTCAAC CGCACCTTTA CCAGCACCCA TACGTACTTC 11700 

. TAAAGGTTTT TTAGTATATG GTGTATGTGG GAAGATTTTA ATCCAAACTT TCCCGCCACG 11760 

15 

TTTCATGTAA CGTGTCATTG CTATACGAGC AGATTCGATT TGACGAGATG TGATCCAAGA .11820 

CGTTGTTGTA GCTTGTAAAC CAAACTCACC AAATGTTACG TAtTACCGCC TTTAGAACGA 11880 

2 0 CCAGTTGTTT TAGGACGATG TTGACGACGA TATTTTACAC GTTTTGGTAG TAACATTATT 11940 

ATTTTCCTCC TCCACTAGTG TTCTTAGTAG GAAGAACTTC TCCACGATAA ATCCATACTT 12000 

TAACGCCTAA TTTACCGTAA GTAGTGTCAG CTTCAGCGTG tGCATAATCG ATGTCAGCAC 12060 

25 GTAACGTATG AAGTGGAACA GTTCCTTCTG AATATTGTTC AGCACGAGCG ATGTCAGCTC 12120 

CGCCTAAACG ACCAGATACT TGaGTTTTGA TACCTTTAGC ACCAAGTTTC ATAGCTCTAG 12180 

TGATTGCTTG TTTTTGTACA CGACGGAATG AAGCACGGTT TTCTAATTGA CGTGCGATGT 12240

30 

TTTCAGCTAC TAAACGAGCG TCAAGATCAA CTTTTTTGAT TTCAATTACG TTGATGTGTA 12300 

CTTTTTTATC AGTTAACGCA TTTAATTTGT TGCGTAATTT TTCGATTTCT GAACCGCCTT 12360 

TACCAATTAC CATACCAGGT TTACCAGTAT GAATTGCAAT GTTGATACGG TTTGCAGCAC 12420 

35 

GTTCAATCTC TACGTGAGAA ACTGATGCTT CTTTTAATTC ATTATCAATA AATTTACGGA 12480 

TTTTTAAATC TTCGTGTAAA AGTGAAGCGA AGTCTTTTTC AGCATACCAT TTAGCTTCCC 12540 

AATCACGGAT AATACCAACA CGAAGTCCGA TTGGATTAAT TTTTTGACCC ACAGTATTCC 12600 

40 

CTCCTTAAAA GTTAATTAAG CTTCTTTAGC TTCTTCTTTA CCGTCACTTA CGACGATTGT 12660 

AATGTGGCTT GTACGTTTGT TAATCGCACT TGCACGACCT TGCGCACGTG GACGGAAACG 12720 

45 TTTTAATGTT GGTCCTTCGT TAGCATATGC TTCTTTAACT ACTAATTCAT CTGTGTTCAT 12780 

• GTCATAGTTA TGTTCAGCAT TAGCTAAAGC GGACATTAAT ACTTTTTCAA TTACTGGTGA 12840 

TGAAGCTTTG TTTGTTAATT TTAAAATTGC AATAGCTTCA GCAGCATTTT TACCTCTGAT 12900 

50 TAAGTCAAGA ACTAGTCTTA CTTTACGAGG TGCGATTCTT ATTGTTCTAG CAACCGCTTT 12960 

TGCTTCCATT AGGATGTCCT CCTCTACTTA ATAGATATTA TCTTCTTGTT TTCTTGTCGT 13020 
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TATCTTCAGT TACATATACA GGTACGTGTT TACGTCCGTC GTATACTGCA AAAGTATGTC 13140 

CGATGAAATT AGGGAAAATT GTAGAACGAC GTGACCATGT TTTGATTACT TGTTTCTTTT 13200 

CGCTTCCTTC TTGAGCTTCA ACTTTTTTCA TTAAATGCTC ATCGACGAAA GGTCCTTTTT 13260 

TAATACTACG AGCCATTTGG GCGCCTCCCT TCTTATTATG TGCGTGCAGC TTTAAGCCGC 13320 

ACACCCAAAT AAGTTGATTA TATTATTTTT TCTTACGTCC ACGAACGATA AGTTTGTCTG 13380 

10 

ATGATTTTTT ACCACGACGA GTTTTCTTAC CAAGCGTAGG TTTACCCCAT GGTGACATTG 13440 

GAGATGGTCT ACCGATAGGA GCACGACCTT CACCACCACC GTGTGGGTGA TCGTTAGGGT 13500 

75 TCATTACAGA ACCACGAACT GTTGGACGGA TACCTTTCCA TCTTGAACGT CCGGCTTTAC 13560 

CAACGTTAAC TAATTCGTGT TGTAGGTTAC CAACTTGACC GATTGTAGCA CGGCAAGTAG 13620 

ATAAGATCAT ACGAACTTCA CCAGATCTTA ATCTGATTAA TACGTATTTA CCTTCTTTAC 13680

20 CAAGTACTTG AGCACTTGCA CCAGCTGAAC GAGCGATTTG TCCACCTTTA CCAGGTTTAA 13740 

GCTCGATGTT GTGTACTACT GTACCAACTG GAATGTTTTG TAATGGTAAT GCGTTACCAA 13800 

CTTTGATGTC AGCTTCAGCA CCACTTTCAA CGATTTGACC TACTTCTAAT CCTTTAGGAG 13860 

25 CAATGATATA TCGTTTTTCA CCGTCTGCAT ATACAACTAA AGCGATGTTT GCTGAGCGGT 13920 

TTGGATCATA TTGAATAGAA TCAACTTTTG CATTGATACC ATCTTTGTTA CGTTTGAAAT 13980 

CGATAACACG GTATTGACGT TTGTGTCCAC CACCATGGTG TGTTACAGTC AATTTACCTT 14040

30 

GGTTGTTACG TCCCGCTTTT TTCGGTAGCG GTTTTAATAA TGACTTTTCA GGTGTAGTTT 14100 

TCGTGATTTC TGCGAAATCT AACGAaGTCA TATTACGACG ACCATTTGTT ATTGGCTTAT 14160 

ACTTTTTAAT AGCCATTGTC GCTTACCTCC TTAATGGTAA TTGTTTTATT AGTTAAATAA 14220

35 

GTCGATTGAT CCTTCTTTAA GAGTTACAAT CGCTTTTCTT CTTTTGTTTG TATAGCCTTG 14280 

GTAACGGCCC ATACGTTTTT TCTTAGGTTT GTAATTCATG ATATTAACAC TTGCAACTTT 14340 

40 TACGTTGAAG ATTTCTTCAA CTGCCATTTT TACTTGTGTT TTGTTAACAC GAGTATCAAC 14400

GTCGAAAGTG TATTTGTCTT CAGCCATTGC TTCAGAAGAT TTCTCAGTGA TTACGGGGCG 14460

CTTAAGAATA TCTCTTGCTT CCATTATCCG AGCACCTCCT CAACTTTTTT AGCAGCAGCT 14520

45 TCAGTAATTA CTAAGCTGTC AGCATTAGTG ATATCTAAAA CATTTAAACC TTGAGCAGTT 14580

GTCACTTGAA CGCCAGGGAT GTTGCGTGCT GATAATTCAA CATTTACATC TTCGTTTTCA 14640

GTAACTACTA ATACTTTTTT AGGTTGTTCT AATGTAGATA ATACATTTTT GAATTCTTTA 14700 

50 

GTTTTTGGAG CTTCGAAGTT GAATGCGTCA ACTACAGTTA AGCCATTCTC TTGAGCTTTG 14760 

AAAGATAATG CTGAGCGTAA AGCTAAACGA CGCATTTTCT TAGGCATTTT GTATGCATAA 14820 
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CCTTGACGAG CACGACCTGT fCCTTTTTGC TTCCATGGTT TACGTCCGCC ACCGCTTACT 14940

GCTGAACGAT TCTTAACAGC ATGCGTACCT TGACGTAATG AAGCACGTTG TAAATTAATA 15000 

5 

GCTTCGAATA AAACGCTATT ATTTGGCTCA ATACCGAATA CTGCATCGCT TAATTCGATT 15060 

GAACCTGATT TAGTTCCGTC TAATTTTAAA ACATCATAAT TAGCCATTAT GCATTTCCTC 15120 

CTTTCACTTC TTATTATTTA TTACCTTTTT TAATTGAAGT TCTGATTTCT ACTAAACCTT 15180 

10 

TTTTAGGTCC AGGTACGTTA CCTTTTACTA AGATAACTTT GTTTTCTGTG TCAACTTGAA 15240 

CTACTTCTAA GTTTTGAACA GTTACAGTGT TTCCACCCAT ACGTGCTGGC ATTTTTTGGC 15300 

15 CTTTAAATAC TCTAGAAGCA TCTGAAGCCA TACCTACAGA ACCTGGTGCT CTGTGGAAAT 15360

GAGAACCGTG TGACATAGGT CCACGAGATT GTCCGTGGCG TTTAATTGCA CCTTGGAAAC 15420

CTTTACCTTT TGATACGCCT GTTACGTCAA TAACGTCGCC AGCTACAAAA GTATCTACTG 15480

20 AGACTTCTTG AaCCTAcTcG TAAGCATCCA CGTCTACATT GCGGAATTCA CGAATGAAGC 15540 

GCTTAGGTGC TGCGTCAGCT TTTTTAGCGT GACCTTCAGC TGGTTTATTA GCATATTTAT 15600 

TAGATTTTGC ATCTTTTTTG TATGCTTTTT TGTCTTCAAA TCCAACTTGG ATTGCGTTGT 15660 

25 ATCCATCAAC TTCTACAGTT TTCTTTTGTA ATACAACATT TTCTTTAGCT TCTACTACTG 15720 

TTACAGGGAT TAATTCACCG TTTTCTCCGA ATACTTGTGT CATCCCAATT TTTCTTCCTA 15780 

AGATTCCTTT GGTCATCGAA AGTCCACCTC CTAAAATTGT CTATTATAAT TTGATTTCGA 15840 

30 

TGTCTACACC AGATGGTAAG TTTAAGCCCA TTAAAGCGTC AACTGTTTTT GGTGTTGGGT 15900 

TTACAATATC GATTAAACGT TTGTGTGTAC GTTGTTCGAA TTGTTCACGT GAATCTTTAT 15960

ACTTATGCAC GGCACGGATG ATTGTGTAAA CTGATTTCTC AGTTGGTAAC GGAATTGGTC 16020 

35 

CAGAAACATC TGCACCAGAA CGTTTCGCTG TTTCTACAAT CTTCTCTGCT GATTGATCAA 16080 

TTAC5CGGTG ATCATAAGCT TTTAATCTGA TTCTGATTTT TTGTTTTGCC ATAATTTTCC 16140 

40 CTCCTTATTC GTCTACATTT AGTGATAGAC TTCTCCACGA AAACTATCTT ACACAGCGCC 16200 

ATGGCAAAGC GGCCGGGTGT GTCAGTAACC TTTCGCTTCA TCGCTTTTCT TAAAGTCCAA 16260 

CGTTAGTTAT ATTACACGAA AAACATCGAT AAATCAAGGC TTTTCACATA ATTTTTCTAT 16320 

45 CTGTCTAACA CATACTTTTA TATTTnACTT TATATACTTA GTCAGTTCAA CTATTTTCGA 16380

GATATTTTnA ATTTCCn 16397 
(2) INFORMATION FOR SEQ ID NO: 206: 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 29555 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 206: 




TnAGTTGTTT CTGCCACGAA AGATTCAATG GCTTTTCTTG CTTTACGCTT TTCTTTCAAT 60

GGCAAATCAC CAATCATTTT TTTAAGGTGA TGTGGGTTTA CAACACCACT ATACTGGTAG 120

TCATTTGAAn TTGTTTTTAG GGCTTGTTCA TCGATAGATC TCTCTCCAGC AAATCCTTTG 180

AACTCCGCTT CTTTTTTAAT ACTTTCGAAA TTAACATATT CTTGATCGAT ATCATCATCC 240

TTATTTAAAG AAGGTACAAC ATTGTCGATG AATTCTCTAA TTAGATCTCG TTTTAACCTC 300

AATGtCGGAT CATCTGCATG ATCTAAAATG CGTCTAATTT GTTCTTGGTT ACGACGTTGT 360

TCCGCTTTGT CTTCAAGATC AATTTGTCTC AATATATTCA TAATATAATT CACATTAATC 420

GTATCATTAC GCATCATTTC TATTTCGAAA TCAATATCAT TTAAAATGGA TACTTTATTT 480

TTCTCAGCCG TCGCTCTTTT TACTTGATCG TACACAGCTA AATATTTACT TTTATAGTCT 540

TCATTCTCTT GTTCATCCAT TCCAATTTCA TCAATTGTAA ACTCAAACTC GTCAAATGCT 600

TTTAAACGTA ATATTATTTT AGCTAATAAA CGATAAGCtT CAACAAAGCG CTTTAGCTCT 660

TCTTCATCtT GaATGtCATC AACCATGTGT GGTGTCGGCA CAATCATTTT AAGCTCACGA 720

TAAGCGTCCA TAAATTCTTT TTTATACTCT TCATAACTGC GCATTAAAAT TGTATCCGTA 780

TCATTTGTTT GTGAGAATAC TCTCAGTGCA TCGTCTGTCT CTTTTTTCAA GTCACGATAG 840

TTTACAATTT TACCAAATGG CTTTGATTCT TTTTCAACCC TATTTGTACG TGAATACGCT 900

TGAATTAAAT CATGATACAT TAAATTCTTA TCAACATATA AAGTGTTCAG TACTTTACTA 960

TCAAAACCAG TTAAGAACAT ATTAACAACG ATTAAGATAT CAATTTTACT ATCTTTAACG 1020

cccrrriTAA CGTTTTTtGA AATATGATTA AAATACTCAT TAGTTGtGGC TGnTGaAAAA 1080

TTCGTCTCGA ACTTTTTATT ATAATCACTA ATCATTATCT CTAATTTTTC ACGTGAATGA 1140

TATGGCACTT CACCATCACG ATCATCTTCA TTAGGTTTAA ACGTAAATAT ACCAGCTATC 1200

GTTAACGGTT GTTCCAACTT TTTGTTAAGT CGCTTAAATG TCTCATAATA TTTAATAAGC 1260

GCGTGAATAC TTTGGACTGT AAATATACTT GAATATTGAC GATTACGTGT ATATTTATCA 1320

TGATTATTGA TGATATGTCG TGTTACTAAT TCCACACGTT TATCCGCTAA CCATACTTCT 1380

TCCGTATCAA TTGCTTCAAC CATGCtGTTA TCTTCTGCTT TTAAAGCTTT ATTTTTAAAA 1440

GTATTAATAT AGTCAACTGA GAAACCAAGT ACATTACCAT CATGAATGGC ATCTCTAATT 1500

AAATACGTAT GTAAGCATCT ACCGAAAATA TCTGCAGTTG TTCTACCATC TTGACTACTA 1560

TTTTCTGGAA AACGTGGCGT ACCAGTGAAT CCAAAGTATT GGGCATTTTT GAAATGTTGT 1620

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ACTTTATTCG TTTTATACTG TTCTAATAAA GGGGCATTCC CTTGAATCGC TTTAGCCATT 1740

TTTTGAATCG TCGTTACAAT AAGTGGCAAA CTTTTATCAT TTAGTTGGCG TACCAGTTGC 1800

GAGGTATTAA AAGTTTTGTC TACAGCACCC TTAGCAAATT TATTAAATTC CTCTTCTGTT 1860

TGACTATCCA AGTCTTTACG GTCAACCAAA AAGATAACTT TCTTAATGTC ATCTTGCTGT 1920 

GATAAAATCT GACTCGCTTT AAAAGAAGTC AACGTCTTAC CACTTCCAGT TGTATGCCAT 1980 

ACATATCCAT TATTCCCTGT CTCAGTCGCT TGTTGAATAA GTGCTTCTAC CGCATACACT 2040

TGATACGGAC GCATTGCCAT CAGTATTCTA TCTGTTTCAT TAATAATCAT ATAGCGCGAT 2100 

ATCATCTTAG CTAATTGACA AGGTCTCATA AATGACTCAG CAAACGATTG CAATGTATTG 2160 

ATACGGTTAT TCTGTTTATC ACTCCAATAA AACATGTGAC TCTTCAATAG TTCGCTATCA 2220 

TTATTAGAAA AGTATCGCGT TTCAACACCA TTACTAATGA TAAACATTTG TATGTAGCGG 2280 

AATAAGCCTG TGTAATTTTG TTTGCGGTAA CGTTTTACTT GGTTAAACGC CTCATTAATA 2340 

TCAATACCTC GACGTTTCAA TTCAACTTGG ACAAGGGGTA GTCCGTTGAT TAATATCGTT 2400

ACATCATAAC GTGCTTTATA TGTATCCTCG ACAGATACTT GATTCGTCAC TTGAAACTTA 2460 

25 TTTTTACACC AACTTTTCGT ATCTAAAAAC GACAAATAAA TCTCAGACTC ATCATCACGT 2520 

CTAAGTGGTA ATTTATCACG TAAAATACGG GCACTCTCGA AAATACTTTT TCCATCAATC 2580 

ATCGTTAACA GACGTTGaAA TTCTTTATCT GTTAaGGGAT TGCCTTCTAA TTTGTCCGCA 2640 

TGACGCTCAT TTAAAATCGT TCTAAAATTA TCAAGCAATT GCTTATTATC ACGTATCGTT 2700 

ACTCTTTCGT AACCCAATTG TTCAAGTTGA TTCATCATTT CATTTTCTAA TGCGTATTCA 2760 

CTTTGGTATG CCATTCATAT CCCCTTCCAT ACACTTTCTA TTGCTCTAAA TATATCATAA 2820 

ACTTTAATGA AAAATGTTTG TTTTTTATCT TCAAACGTAA ATTTATTCTA ATTTTATTGT 2880

CTTATCTTTT AATATTTGTC TTTGAGGTAA GTCGTATACT AAAATTTGAA TACAAATAAT 2940 

CAAATCATTG ATAAATTTTT TGTCTACGAT TAATGGAGGG ACTTGAATGG TGTTAATTAC 3000 

CTATCAAATC ATTTTATTTT TTATTATTAG TCTAAGTTAC TATTTAACTT TAAATCATTA 3060 

CATGGCAGTC ACTGTAGGTA ACTTCACTTC AATATTCGGC ATGTTCGCAG CCATACTCTT 3120 

45 TATGTACTAC TACCTACTCT ATAAAAGTCC CGAATACAAT CAACGCAAAC GATTTAAACA 3180 

TTTCATTCAT ATCACTAATT TGATAATAAT TGCTTTTAGC ACCTTCGTAT TAGTTCATTT 3240

AGCATTAAAA TTATTCTTCA GCATTTAATT TCCATCTATG AAAAAAGCAA AGCTCAAATC 3300 

50 

TGAACTTTGC TTTAATTTGT CACGCCTTTA TCATTTTCAA AATAGCCTCT ATGCCAGTTT 3360 

TACAAACTTG TAGCAACAAT TTTTCATCAA GCAACTGAAT CACATCAAAA ACTTCAATTG 3420 
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GTCGCAAGAT GCTTCCTGTA ATTATCAAGT GCCATTTTCG ATTGGGTTAT ACAATCTAGA 3540

ATCGCATGAT AATTTAATGC TACAAATCGA TAGTACAATA TATCTACCGT GAATAACTGT 3600

GCAAATAGTG ACGTTGTAGC CGCCATACGC ATTTCATTTT CATCAGTTCT GCCATAAATC 3660

AATGCATAGT CTGCAATTTG AGCCACTGGA TTATTAGCTG TACTAGATAT AGTTATGATG 3720

GGAATACTGT AATGTGTGGC CACCTGTGCA ATTGACTGCA ATTCACTATG ACTACCTTGA 3780

TTCGTCACAA AAATCATGCA ATCTCTATCA TCATGCGTCG CAAATGTTGA CACAAGTAAA 3840

TGCGTTTCAT GTAATAACCT GACATTTAAG CCAATACGAG ATAACTTTTG AAAAAGATCA 3900

CCAATAGTCA AACTCGATGC GCCAAATCCA AATAAAAATA TTGTCCTGGc ATTTTTCAAC 3960

ACATCACAAA TTGCATCAAT TTGCGCATCC ATAATATTAG TAGCTACAAA TCGCATCGTA 4020

TTCGTTGCTC TAGCAATCAT TTTATTTTTC AAAGTTTCTA CAGATTCATT TTCAATCAAT 4080

TCTAAATGTG GATTGGTTGC AATATCTTCG GGTAAGTATC GAGATATCGC AATCTTTAGC 4140

TCTTGAAAAC CTTGATGTGT CATTTTCCGA CTAAATCTAA CAATTGATGC TGTACTAACA 4200

TTCGTAACAT CTGCCAAATC ATTCACAGTC ATATCAATGA TTTTATGTGG ATTCTTTAAA 4260

ATGTAATCAG CGATTATCTT TTCTGTCTTC GTAAAATCAC TCAACTGCTT ATCAATGCGA 4320

TATAAAATAT TTGTCATCAT TAATCACCCA ACAAATCTGT CTGTCGCATC GCCTTTGTCG 4380

TTCCAAATAA ATATGTACAA ACGAATCCAC CAGCATACGC AGCAAGTAAT CCTGCAATAT 4440

AACCTAAATA CATATTATCT GAGATTAATG GTAATAGTGA CACACCACTT GGGCCTATTG 4500

CTTTGGCACC AATATGTCCA ATTCCACCTA TTACAGCGCC ACCAATACCA CCACCAATAC 4560

AAGCAGTTAA GAAAGGTCGA CCTAATGGCA AAGTCACACC ATAGATTAAT GGTTCTCCGA 4620

TACCTAGGAA ACCAACTGGC AATGCACCTT TTAAAGTATT ACGTAATGTT GTGTTGCGTT 4680

TACATCTTAC CCAAAGTGCT AATGCGGCAC CTACTTGTCC AGCACCAGCC ATCGCTGCAA 4740

TTGGCAATAA GTAAGTAGCA CCTGATTGGT TAATCATTTC TATATGAATT GGCGTAAAAA 4800

TATGATGAAG CCCTAACATA ACTAACGGTA GGAAGCTTGC ACCAATGATA AATCCACTAA 4860

ATACGCCACC AATACTAATA ATTCCGTTAA CTACTGAAAC TAAACTGTCT GAAACAAAAC 4920

CTGCTAATGG CATAAAGATA AAGATAGTTA ATAGTCCTAC AATCAACAAT GCAATAGTCG 4980

GCGTTACAAT AATATCAATC GCATTTGGCA CAATTTTATG TAATCTCTTT TCGACAATAC 5040

TTAAAATCCA AACGGCAAAA ATAACGCCAA TAATCCCACC TTGTCCAGGT TGCAATGGTT 5100

CTCCAGTGAA GACATTCATT AAAATATTTT TACCAGCAAT ACCCGTTAAT AACGTTGTAC 5160

CACCAATCAC GCCACCAAGT CCTGGTGTCG CACCAAATTC TTTAGCCGCA TTAATACCAG 5220

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GCGTAATCCA AGCACCTGAA ATATAGCCTG CCACCATTAA GTTACTCAGT ACTGCTGCAA 5340

TACCACCAAT TAATCCAGCT CCAATAAATG CAGGAATCAA CGGTATAAAG ATATTGGCAA 5400

TTGATTTCAA TACTTTATTC AACTTACCAT TCTTTTGTTT TGCTTTATGC GCTTCCTTAT 5460

TCGCCTTTGC TTTATCAGCT GCATATGATT TATAGTCCAT TTTTTCACTA TCATTGTGAT 5520

GGTGTGGTAT TGGGTCACCT AGTTTAACAC CACTTAATTC CGCCATATGA TTAGCCACTT 5580

TATTGatGTA CCAGGTCCAA CCACAACTTG AATGCGTTCA TCGTGTATAA CACCCATGAC 5640

ACCATCAATA TGCCTTAGTT CTTGGTCATC TACTTTATTC TCATCTAATA CTTTAATACG 5700

CACACGTGTC ATACAGTTCA TGACACTATC TATATTATCC ATACCACCTA CTGCAGCAAT 5760

AATTCGTTCT GCAAGTTGTT GTTCTTTGGT CATTTAAATC CCTCCTAAGG TTGTCTATCT 5820

CTGATTGCTC GTTTAAaATG TCACCATTGT TTAATAACCG TCTTGTTGCT TCTTCCTTAG 5880

AAATGCCACA CATACCCATA ACTGTCGCAA CTTTCACATC ATGCTCAGAT ACCTGATATA 5940

ACGCCATTGC TTCATCATAT GTGATAGCAC ATATTTCTTG AATAATACGC ACTGAACGGT 6000

CGATCAGTTT TTGATTGGTT GCTTTAACAT CAATCATGAG GTTATCGTAA ACTTTTCCGA 6060

CACCAACCAT TGTGATGGTT GAAATCATAT TTAAAATTAA CTTTTGTGCt GTACCAGACT 6120

TTAAACGTGT TGAACCAGTT AATACTTCTG GACCAACTTT AACTTCTACT GGATAcTGCG 6180

CAATTTCACT TATAACTGCA TGTTCATTGC ATGAAATAGA TACTGTTGTA GCACCGATTG 6240

TGTTAGCAAA TGTTAAACCG CCTATAACAT ATGGCGTTTT GCCACTCGCG GCAATTCCTA 6300

TAACGACATC TTTTGATGTT AAATCTATAT TTTTCAAATC TTCTTCCGCT AATTTTTTGT 6360

GATCTTCCGC ACCTTCTACA GCCATCGTCA TAGCATGTTG TCCACCAGCA ATAATACCTA 6420

TAATTTCATG AGGGTCAGTA TTGAATGTAG GTACACACTC CGCTGCATCT AAGACACCCA 6480

ACCTTCCACT TGTACCTGCA CCGATATAAA TCAATCGTCC ACCCTTTTTA TACTGTGCAA 6540

TTGTTTTTTT AATTACTTTT GTCAATTGTG GTATTGCCTT TCGAACTGCT AACGGGACTT 6600

GCTGATCTTC TTTATTCATC GTAATTAAAG CCTCTTCCAC AGTCATTTCA TCAAGATGCA 6660

TCGTCGCTTC ATTACGCGCT TCGGTCGTAC TATTTTCCAT CACTTCTTAC ACTCCCTAGT 6720

TTTTTGAAAA TCAAATGTAT CATTCGGCTC GATACAACTT AACAGTGGTA AGTCTTCTTT 6780

AATAATTTGT GCAaCAACAT TCACATTGTC ATGTGCACTA AGCGTTTGTC TCACAATTTG 6840

CATTTCGCCT TGATAACGTC CGTTATTCAA ATTATCAACG GTTACTGAAC CAATGCGTCG 6900

TTGCGTCGTA AACTGTGGTT GAATCGAATG TGGACATATT TGTCTTGACG TTTCCGAACG 6960

AATGACATTT TCCGGATTAT CCGGGCGTAC TTTATGACAC ATATCGAAAA GGTAAGTCAC 7020

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AAGTTGTTTT GCCTGCCTCA TTTCAATCAA TGAGTCTCCA ACTAACACTT CAGATACACC 7140

AGTTTCTTGT AATAATTTAG CTGCAACGAC AGGATGACTA TGTCTCGTTG CTTCAATTGT 7200

TGGCAAGCCT TTATGCAAAG GACCTCGCAA ATCACTCCCT ACAATAAAAC CATATATTTG 7260

TGCCTTTGGA TTAAATTGAT AAATGAGTTC ATTTTTCTTA TTGACCAAGT CAACAGATAA 7320

TCCCGTATCT GGTCTTGGAT AATAGTTATG ACAAAATGAA AGTAATGTAA AATCATTCAA 7380

TTGTTGATGT AAGCTTGTTA ACAATTCCCG GGAAATAATA CTTGCATTCA AACAGCACTT 7440

TAAACCCTGT GCCATTATCG CTTCGATTGC CTCAATTGAT GTACTATGAT CGATACGAAT 7500

CATAAATTGT GCATCATATT GTCGAAGATG GTCATAAAAA GATGGTGTTA AAATAGATGG 7560

ATTAGCATCT ATGAGGTAAG TCACTTGTTC ATGTTTTAAT AAATTGAGTA GTTTTGTGAA 7620

ATAATGATAT TTTGTCTCGT CATCTTCTTC TGGTATTTGT ACAGATGTAA AAATCATTTG 7680

GTAACCTTGT TTAATCATTC GCTTAATATA CGCTTCATCT AAAGGTTGTC CTAAATACAC 7740

TGAAAAGCCT GTCAAAGTAG CCCTCCTTAA CAATATAATT ATTAGGAAAA TATAGTTGAT 7800

TTGTGTAATC GCTTACATTT TACTATAAGA GAAAACACAT TACAATATTA ATCAGTTAAA 7860

GCCTGTTCAT TGTAATAATC TTACATATTT CTGTCACAAG TTAATTATTA CACCATCAAA 7920

GATTATCCTT TCTTTTAAGT GCTGATAATA GCTGCTACTG CTGGATTATT ACAATAACTT 7980

TTATACATTT TATTCAGGAT TATCTTATAT TATGTTTTAA TAATAATCTG TGAACAATTA 8040

AGAGATTTGA AATTGAATTT AATAATTGTA TTGAAAACGC ATACTTCACC ATGCTAAAAT 8100

AGGAGTCGCA AACAAATAAG ATTCAATAAG ATGTGATGGT TACCAACACA GTCTATTTGC 8160

TCGTGTCTTT TTTTATTGAA TCTTAAATAA TAAATACAAC TTTGGAGGTT GGACAAGTGA 8220

GGAAGAAACT TTTCGGTCAA TTGCAACGTA TTGGTAAAGC GCTAATGTTA CCTGTTGCGA 8280

TTTIACCAGC AGCTGGTCTG TTATTAGCTA TCGGTACAGC TATGCAAGGT GAATCATTAC 8340

AACACTACTT GCCGTTTATA CAAAATGGTG GCGTACAAAC TGTCGCTAAA TTAATGACAG 8400

GTGCTGGTGG TATCATTTTT GATAACTTGC CTATGATTTT CGCATTAGGT GTCGCAATCG 8460

GATTAGCTGG CGGTGATGGC GTAgcAGCTA TCGCAGCATT CGTCGGTTAC ATAATCATGA 8520

ACAAAACAAT GGGCGACTTT TTACAAGTTA CACCTAAGAA TATTGGTGAT CCAGCGAGTG 8580

GTTACGCTAG CATTTTAGGT ATCCCAACAT TACAAACAGG TGTGTTCGGC GGTATTATAA 8640

TCGGGGCCCT GGCAGCTTGG TGTTATAACA AGTTCTATAA CATTAACTTA CCATCTTATT 8700

TAGGTTTCTT CGCTGGTAAG CGTTTCGTAC CTATTATGAT GGCTACAACA TCATTTATTT 8760

TAGCATTCCC AATGGCATTA ATTTGGCCAA CGATTCAATC AGGATTAAAT GCATTCAGTA 8820

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TATTAATTCC ATTCGGTCTA CATCACATTT TCCACGCACC GTTCTGGTTC GAGTTTGGTT 8940

CATGGAAAAA TGCAGCTGGT GAAATTATTC ACGGTGACCA ACGTATCTTT ATCGAACAAA 9000

TTCGTGAAGG CGCACATTTG ACAGCTGGTA AATTCATGCA AGGTGAATTC CCTGTTATGA 9060

TGTTCGGTTT ACCTGCAGCA GCTTTAGCAA TTTATCACAC AGCTAAACCT GAAAATAAGA 9120

AAGTAGTAGC AGGTTTAATG GGTTCTGCTG CTTTAACATC ATTCTTAACT GGTATTACAG 9180

AACCATTAGA ATTCTCATTC TTATTTGTAG CACCATTATT ATTCTTTATT CACGCaGTAC 9240

TTGATGGTTT ATCATTCTTA ACATTGTACT TATTAGATCT TCATCTAGGT JATACATTCT 9300

CAGGTGGTTT CATCGACTAC TTCTTACTCG GTATACTACC TAATAAGACA CAATGGTGGT 9360

TAGTCATTCC TGTAGGTCTT GTATACGCAG TTATTTACTA CTTCGTATTC CGATTCTTAA 9420

TTGTAAAATT AAAATACAAA ACACCAGGTC GTGAAGATAA ACAATCACAA GCGGCTACTG 9480

CTTCAGCAAC TGAATTACCA TATGCAGTAT TAGAAGCTAT GGGTGGCAAA GCAAACATTA 9540

AACATTTAGA CGCTTGTATC ACACGTCTAC GTGTTGAAGT TAACGACAAA TCTAAAGTTG 9600

ATGTTCCTGG TTTGAAAGAT TTAGGCGCAT CTGGTGTATT AGAAGTCGGC AATAATATGC 9660

AAGCAATTTT TGGTCCTAAA TCTGACCAAA TCAAACATGA AATGCAACAG ATTATGAATG 9720

GTCAAGTAGT AGAAAATCCT ACTACTATGG AAGACGATAA AGACGAAACT GTTGTTGTTG 9780

CAGAAGATAA ATCTGCAACA AGCGAATTGA GCCATATCGT GCATGCACCA TTAACTGGTG 9840

AAGTAACACC ATTATCAGAA GTGCCTGATC AAGTGTTCAG CGAAAAAATG ATGGGTGACG 9900

GTATCGCTAT CAAACCTTCA CAAGGTGAAG TTCGTGCACC ATTCAACGGT AAAGTACAAA 9960

TGATTTTCCC AACAAAACAT GCAATTGGTC TTGTATCAGA TAGTGGTTTA GAACTATTAA 10020

TCCACATCGG TTTAGACACT GTTAAATTAA ACGGAGAAGG CTTTACTTTA CATGTTGAGG 10080

AAGGTCAAGA AGTTAAACAA GGTGATTTAT TAATCAACTT TGATTTAGAC TACATCCGCA 10140

ATCATGCAAA GAGTGATATT ACGCCTATTA TCGTGACACA AGGAAACATT ACAAACCTTG 10200

ATTTTAAACA AGGTGAACAT GGCAACATTT CATTTGGCGA TCAATTATTT GAAGCTAAAT 10260

AATGCTTACT ATAAACAGGT GCGTATACCT TCATAAGGTG ACGCGCCTGT TTTTTCTTTG 10320

CTATTGTATT TTGCAGCATC ATTGATAGTT CGCTCTCCCC TTAAATTTTG AATTTTAAGA 10380

TCATCAATTA AAGCCCCCCT TCATACTCAT TTCCTAAAAA ATATTAATTG TTCACTATTG 10440

TTAGCGTTTT CACAACAAAG TCAACTTCCT TGACCTTACA CTATATTCGA GGCTATCATT 10500

TTAAGTGTAA ATATAGAGAA AAGGTGGCTT TTTTTATGAA ACAACGCATT GGAGCTTACT 10560

TAATTGACGC TATTCATCGA GCAGGCGTCG ATAAAATTTT TGGTGTTCCT GGTGATTTTA 10620

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10 






ATGAATTAAA CGCAAGTTAC GCAGCGGACG GTTATGCCCG TCTTAATGGA CTCGCTGCAT 10740 

TAGTTACTAC ATTTGGTGTT GGCGAATTAA GTGCCGTCAA CGGTATCGCA GGTTCATATG 10800

CTGAACGCAT ACCTGTCATT GCGATTACAG GTGCGCCGAC ACGTGCTGTT GAACAAGGGG 10860 

GTAAATATGT ACATCACTCA CTTGGTGAAG GTACATTTGA CGACTATCGA AAAATGTTTG 10920 

CACATATAAC CGTTGCACAA GGTTATATCA CACCTGAAAA TGCAACAACC GAAATACCAC 10980 

GTTTAATTAA TACAGCAATC GCCGAAAGAC GCCCAGTTCA TTTACATTTA CCAATCGATG 11040 

TCGCAATCTC TGAAATTGAG ATACCGACAC CATTTGAAGT GACGGCAACT AAATATACGG 11100 

ATGCATCAAC ATATATAGAG TTATTAGCAA CTAAACTGCA TCAAGCGAAG CAGCCTATCA 11160 

TCATTACTGG ACATGAAATT AACAGTTTTC ACCTCCATCA AGAATTAGAA GATTTTGTAA 11220 

ATCAAACACA GATACCAGTA GCACAACTTT CATTAGGAAA AGGTGCTTTT AATGAGGAAA 11280 

ATCCATATTA TATGGGTATT TACGATGGGA AAATTGCCGA AGATAAAATA CGAGATTATG 11340 

TGGACAACAG CGATTTAATT TTAAATATTG GAGCCAAATT AACAGATTCA GCAACAGCAG 11400 

GTTTTTCATA CCAATTCAAT ATCGATGATG TCGTTATGTT AAATCATCAC AATATCAAAA 11460 

TTGACGATGT TACAAATGAT GAAATATCTC TACCATCATT GTTAAkACAG TTATCCAATA 11520 

TTTCATATAC GAATAACGCA ACGTTCCCTG CGTATCATCG TCCAACATCA CCCGATTATA 11580 
CTGTTGGCAC AGAACCATTA ACACAACAAA CTTATTTTAA AATGATGCAA AATTTCTTAA 11640

AACCAAATGA TGTCATCATT GCTGATCAAG GTACATCATT CTTTGGTGCT TATGATTTAG 11700 

CATTATACAA AAACAATACT TTTATAGGGC AACCGTTATG GGGTTCTATC GGCTATACAT 11760 

TACCTGCAAC ATTAGGTTCA CAATTAGCAG ACAAAGATCG TCGTAACTTA TTATTAATTG 11820 

GTGATGGCTC ATTGCAACTA ACTGTTCAAG CTATTTCAAC TATGATTAGA CAGCATATTA 11880

AACC9GTATT ATTTGTGATt AATAATGACG GCTATACGGT AGAACGACTT ATTCACGGCA 11940

TGTATGAACC TTATAATGAA ATTCACATGT GGGATTATAA AGCTTTACCA GCTGTATTTG 12000 

GTGGTAAAAA TGTTGAAATT CATGACGTTG AATCATCAAA AGATTTACAA GACACGTTTA 12060

ATGCAATTAA TGGTCATCCC GATGTGATGC ATTTTGTCGA AGTCAAAATG GCTGTCGAAG 12120 

aCGCACCGAA GAAACTCATC GATAtCGCTA AAGCTTTTTC ACAACAAAAT AAATAATTTC 12180 

ATCGTATACA GGGTATAAGT TTAAGCGAAT ACTTTATTAA ACGAATAGGA CTCTGATATA 12240 

AGATGATTAA TTTTAATAAA ACCGCTTTAG TGTTAATCGA CCTGCAAGAA GGTATTCTTA 12300

AAATGGATTA TGCCCCATAT ACAGCTGAAA ATGTCGTTCA AAACGCTAAT AAATTAATAG 12360 

ATGTTTTTAG AAAAAACAAT GGCTTTATCG CTTTTGTTCG CGTGAATTTC TATGATGGTA 12420 
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AGTCGTTTCC ATCATTTATT AGACAAGAGA GATGACGATT TTGTCATAGA CAAACGACAT 12540

TTTAGTGCAT TTGTAGGAAC AGATTTGGAC TTACAATTGC GACGTCGAGG AATTGATACG 12600

ATTGTTCTTG GTGGTGTCGC AACGCATATT GGCGTAGATA CGACAGCGCG AGATGCCTAT 12660

CAATTAAACT ACAATCAGTT TTTTGTTACA GATATGATGA GTGCACAAAA GGAAACGCTA 12720

CATCAATTTC CAATAGATAA TGTATTCCCA TTGATGGGAC AAACAATAAC TACAAACGAC 12780

TTTCTAAATA TATTGAACTA AACATATACT TCCCCCCTTC GATCATGTTG AGGGGGATCT 12840

TTATTTCACA AAGTATTAAT ACGTCGGGTT GTCTAACCTT CTATATTTAA CATATTCTAT 12900 

ATCTGTTAAA TCGTTCTTAA CTTACGCCCC TACTACATAA AAAACAGTAT TTATTCCGGA 12960 

ATTTTCAAAA AATTTAGTAT TTATTGCAAA ATTATGTATC ACTTTATGTT TAATTTTTGA 13020

TATTATCTTA ATTAAGTAGA TTTTTATAAG TTCTAAAAAG GAGAACAAAT ACATATATGA 13080

AGAAGAAACT AACATTTAAA GAAAACATGT TTATAGGTTC TATGTTATTT GGTTTATTCT 13140

TTGGTGCCGG CAATCTTATC TTCCCAATAC ACTTGGGTCA AGCTGCTGGT TCTAACGTTT 13200

TTATCGCTAA CTTAGGATTT TTAATTACAG CAATTGGCTT ACCATTTCTA GGTATCATTG 13260 

CTATTGGCAT TTCAAAGACA TCTGGTTTAT TTGAAATTGC ATCGCGTGTT AATAAAACAT 13320 

ATGCTTACAT TTTCACGATT GCCTTATATC TAGTTATCGG ACCATTTTTC GCCTTACCTA 13380 

GACTGGCAAC GACATCATTT GAAATTGCAT TTTCGCCATT TTTATCACCA AAGCAAATCA 13440 

CTTTATATTT ATTTATTTTT AGCTTCGTCT TCTTTGTGAT TGCATGGTTT TTTGCGAGAA 13500

AGCCATCAAG AATTTTAGAA TATATCGGTA AATTTTTAAA TCCGGTATTC TTAGTATTAT 13560 

TAGCAATTAT TTTATTATTT GCTTTTATCC ATCCATTAGG TGGCATATCT GATGCACCTA 13620 

TTAGTAAACA ATATCAATCA CATGCCTTAT TTAACGGCTT TTTAGATGGA TACAATACCT 13680 

TAGATGCGCT AGCGTCATTG GCATTTGGTA TTATCATTGT TGCAACGATT AAAAAGTTAG 13740

GTATCGAAAA TCCAACTGAT ATCGCTAAAG AAACAATTAA GTCTGGTACT ATCAGTATCA 13800

TTATGATGGG GATCATTTAT ACCCTACTAG CAATCATGGG TACATTAAGT ATTGGTCATT 13860

TCAAACTTAG TGAAAATGGT GGTATTGCCT TAGCGCAAAT TACTCAATAC TACTTAGGTA 13920 

ACTACGGTAT CGTCCTGTTG TCACTTATCG TTATGGTTGC TTGTTTAAAA ACAGCCATCG 13980 

GTTTGATTAC GGCATTTTCA GAAACATTCG AACACCTTTT CCCTAAAATG AATTACCTAG 14040

CGATTGCAAC AGTTGTAAGC TTTATTTCGT TCTTATTCGC GAATGTTGGT TTAACTAAGA 14100 

TTATTATGTA CTCAGTCCCA GTGTTAATGT TCTTATATCC ATTAOCAATT GCCTTGATTG 14160 

TACTAACATT ATTTAGTAGC AAATTCCATC ATTCAAAACT TATTTATCAA TGTACCATTT 14220 
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GCACATCATT CTCACAAACT TTGATTAATT TCAGCCAAAA ATATTTACCA TTATCAGACA 14340 

TTGGTATGGG CTGGGTTGTT CTCAGTTTGA TTGGTTTCAT TATCGGCTTC ATTATTTATA 14400 

AAATTAAGCA TCGTAAAATT CCACAAGCAT AATACTATGC CACAGTCATA TGTTAAACAT 14460 

ATGCTTGTGG CATTTTTTAT TCATACTACA TTAAACTGCA ATCGTATACA TACATATCAA 14520

TGATTATCCA CAAAAAATAT TAGTACTTTC ATTTTACAAA TCACATTAAT ACAAACACAA 14580

CCTTATCTTT ATATTATTAA ATTTATATTT GACACTTATA TTGAACAACT GTAATATATT 14640

AATATTAATT CTTTAAAATG TATAAATATA AAGGAGGGAG ACCGATGaAT TCAATCATTG 14700

aATTAACTGA TTATTATAGC TCTAATAATT ATGCACCACT TAAGCTTGTC ATTTCTAAAG 14760

GTAAAGGTGT GAAAGTTTGG GATACTGATG GCAAACAATA TATAGATTGC ATTTCGGGTT 14820

TTTCAGTTGC AAACCAAGGC CATTGTCATC CAACAATTGT TAAAGCGATG ACAGAACAAG 14880 

CTTCAAAGTT GTCTATCATT TCACGTGTCC TTTATAGTGA CAATCTCGGG AAATGGGAAG 14940 

AAAAAATTTG TCATCTTGCT AAGAAAGACA AAGTACTCCC CCTTAACTCT GGTACTGAAG 15000 

CTGTTGAAGC AGCCATTAAA ATTGCTAGAA AATGGGGCTC TGAAGTTAAA GGCATTACTG 15060 
ACGGACAAGT TGAAATCATC GCTATGAATA ACAATTTTCA CGGTCGTACA CTTGGCTCAT 15120

TATCACTATC TAACCACGAC GCATATAAAG CAGGATTTCA CCCCCTACTT CAAGGCACTA 15180 

CAACAGTAGA TTTTGGAGAC ATTGAACAAT TAACACAAGC TATTTCACCG AATACAGCAG 15240 

CAATTATTTT GGAACCAATT CAAGGTGAAG GTGGCGTTAA TATACCACCG AAAGGATATA 15300 

TTCAAGCTGT GCGTCAACTA TGTGATAAAC ATCAAATATT ATTGATTGCA GATGAAATTC 15360 

AAGTTGGTCT TGGTAGAACT GGGAAATGGT TTGCTATGGa ATGGGAGCAA GTCGTTCCAG 15420 

ACATTTATAT TTTAGGTAAG GCATTGGGTG GCGGCTTATA CCCTGTATCT GCTGTACTTG 15480 

CAAATAATGA TGTCATGCGT GTTCTAACAC CAGGTACACA TGGTTCAACA TTTGGTGGTA 15540 

ACCCTTTAGC CATTGCAATA TCGACGGCAG CGCTTGATGT ACTTAAAGAT GAACAACTGG 15600 

TTGAACGATC AGAACGCTTA GGTTCATTTT TATTAAAAGC GTTGCTACAA CTTAAACATC 15660

CTAGTATTAA AGAAATTAGA GGTCGTGGTT TATTTATAGG CATAGAGCTT AACACAGATG 15720 

CTGCACCTTT TGTGGATCAA CTGATTCAAC GTGGAATCTT ATGCAAAGAC ACGCATCGTA 15780 

CTATCATTCG ATTGTCTCCA CCTCTAGTCA TTGATAAAGA GGAAATCCAT CAAATTGTTG 15840
CAGCTTTTCA AGACGTTTTT AAAAATTAAC AATTAATCAT TTATATATGA CATAGGAGGG 15900

ATTCATGATG ATTAAAGTAG GTATCGTTGG CGGTAgcGGT TATGGCGCAA TTGAATTAAT 15960 

TCGATTGTTA CAAACACATC CTCATGTAAC GATTGCACAC ATCTACTCAC ATTCAAAAGT 16020 
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ACTTACAGTG GaTAATAATG ACTGTGATGT AATTTTCTTT GCGACACCAG CACCCGTAAG 16140 

TAAAACATGT ATCCCTCCCT TAGTAGAAAA AGGTATTCAT GTTATCGATT TATCTGGCGC 16200 

5 

ATTTAGAATT AAGAATCGTG AAATATATGA AGCATATTAC AAAGAAACTG CTGCAGCACA 16260

AGATGATTTG AATCATGCTA TTTACAGCAT TTCAGAATGG CAATCGTTTG ATAACAATGG 16320

AACGAAGCTC ATTTCTAATC CTGGCTGTTT CCCTACAGCA ACATTATTAG CATTACATCC 16380

10 

ACTTATTAGC GAAAAAATAG TAGATTTGTC ATCTATTATT ATTGATGCTA AGACCGGCGT 16440 

GTCAGGTGCT GGTCGTTCAT TATCACAACG gTTcATTTTT CAGAAATGAA TGAAAATCTA 16500

15 AGCGCTTATG CAATCGGAAA CCATAAACAC AAACCGGAAA TCGAGCAATA TTTATCTATC 16560

ATTGCGGGTC AAGATGTATC AGTCATATTT ACACCACATC TCGTACCAAT GACACGAGGT 16620 

ATTTTATCAA CAATATATGT CAAATTATCA TCTGAATATA CGACTGAATC ATTACATAAA 16680 

20 TTAATGACCT CTTATTATGC TAATCAGCCA TTTGTCAGAA TTAGAGATAT TGGGACTTTT 16740 

CCAACCACAA AAGAAGTACT CGGTAGTAAC TACTGCGATA TCGGCATCTA TGTAGATGAA 16800

ACAACGCAAA CAGCAATTTT AGTATCAGTG ATTGATAACC TTGTCAAAGG CGCAAGTGGG 16860 

25 CAAGCCATTC AAAATTTAAA TATATTATAT GATTTTGAAG TGACGACTGG CCTAAATCAA 16920 

TCACCAGTTT ATCCATAAGG GGTGTTAGAA TGAAACATCA AGAAACGACA TCACAACAAT 16980 

ATAACTTTTC AATTATTAAA CATGGCGATA TCAGTACACC TCAAGGCTTC ACGGCTGGTG 17040 

30 

GTATGCACAT CGGTTTACGC GCTAACAAAA AAGACTTTGG GTGGATTTAC TCATCGTCTT 17100 

TGGCAAGTGC AGCTGCCGTA TATACTTTAA ATCAGTTTAA AGCTGCACCA CTTATTGTCA 17160 

CTGAAGACAC TTTACAAAAG TCTAAAGGAA AATTACAAGC ACTTGTTGTT AATTCAGCTA 17220 

35 

ATGCAAATTC TTGTACCGGT CAACAAGGCA TAGATGATGC ACGACAAACA CAAACATGGG 17280 

TTGCTCAACA ACTTCAAATA CCATCTGAGC ATGTTGCTGT TGCTTCAACT GGGGTCATTG 17340 

GTGAATATTT GCCTATGGAT AAAATTAAGA CTGGGACCGA ACATATTAAG GATGCTAATT 17400 

40 

TTGCAACGCC AGGTGCGTTT AACGAGGCAA TTTTAACAAC TGATACCTGT ACAAAACATA 17460 

TCGCTGTATC ACTAAAAATC GATGGTAAAA CCGTTACAAT TGGTGGTAGC ACCAAAGGTT 17520 

45 CAGGTATGAT TCACCCAAAT ATGGCTACCA TGCTTGCTTT TATAACAACC GATGCATCGA 17580 

TTGAATCGAA TACACTTCAT CAATTATTAA AATCTTCGAC TGACCATACA TTTAATATGA 17640 

TTACTGTTGA TGGCGATACA AGTACAAATG ACATGGTATT AGTCATGGCA AATCACCAAG 17700 

50 TTGAACACCA AATACTTAGT CAAGACCATC CACAATGGGA AACATTTGTT GATGCATTCA 17760 

ATTTTGTCTG TACATTTTTA GCTAAAGCTA TAGCCAGAGA TGGCGAAGGC GCAACAAAGT 17820 
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CTATCGTAAG TTCAAATCTA GTAAAATCAG CTATTTTTGG CGAAGATGCC AATTTTGGTC 17940 

GAATCATTAC AGCTATTGGC TACAGCGGAT GTGAAATTGA TCCTAACTGC ACATATGTTC 18000 

AACTGAACCA AATACCTGTC GTTGATAAAG GTATGGCTGT ACTATTTGAT GAGCAAGCTA 18060 

TGTCGAATAC ATTAACTCAT GAAAATGTCA CAATTGACGT TCAGCTTGGT TTAGGTAACG 18120 

CTGCAGCGAC TGCATACGGT TGTGATTTAT CCTATGATTA TGTGCGTATC AACGCATCAT 18180 

10 

ATCGAACATA AGGTGGTGTT GGTTAGATGA AATTTATTGT CATTAAAATT GGTGGCAGTA 18240 

CACTTAGTGA CATGCATCCA TCAATTATTA ACAACATTAA GCATTTACGA TCAAACAACA 18300 

TCTACCCCAT TATCGTTCAT GGCGGTGGCC CATTTATTAA TGAAGCATTA TCAAACCAGC 18360 

75 

AAATCGAGCC ACACTTTGTT AATGGCCTAA GAGTGACTGA TAAAGCAACC ATGACCATTA 18420 

CTAAACACAC GCTCATTGCA GACGTTAACA CTGCATTAGT AGCTCAATTT AACCAGCACC 18480 

20 AATGTTCTGC AATAGGCTTA TGTGGTTTGG ATGCACAGCT GTTTGAAATT ACATCTTTTG 18540

ATCAACAATA TGGATATGTC GGTGTTCCGA CCGCTTTAAA TAAGGATGCT TTACAGTATT 18600 

TATGTACTAA ATTTGTACCT ATCATCAATT CGATTGGTTT CAATAACCAT GATGGAGAAT 18660 

25 TTTACAATAT TAATGCTGAC ACGCTTGCCT ATTTTATTGC ATCATCATTA AAAGCGCCTA 18720

TTTATGTATT AAGTAATATT GCAGGTGTAC TCATCAATGA TGTTGTTATA CCTCAATTGC 18780 

CATTAGTCGA TATTCATCAA TATATTGAAC ATGGTGATAT TTATGGAGGT ATGATTCCCA 18840 

30 

AAGTGCTAGA TGCCAAAAAT GCGATTGAAA ATGGCTGTCC TAAAGTTATC ATTGCATCAG 18900 

GAAACAAGCC AAATATCATT GAATCTATTT ACAATAATGA TTTTGTTGGC ACAACAATCC 18960 

TTAATTCATA ACTATGAAAT TAAGGCCTAA CAAGTTTTGA CACGCGAGAT GATTCCAGTT 19020 

35 

CGATTATCCA TTGCGCTAAA ACATTTATTT ACCGTTCATC TCGTTAACAA TTTTGAATAC 19080 

AGTfidGATAC AATATGAGAT GTAAAAAACT AATAACCTTT TACAAATTTG TTTATCAAAA 19140

TATTTTAAGT TTTGCAAAGC TTTTTATTGT GATTATTTTC ACAAAATACT ATAATGAGGA 19200 

40 

TAGTAAATAG AGAGGAGTCC TTAAGTTGAC GAAACGACAA ATGGGTATAT TCATTTATGC 19260 

TGGAATTATC GGTGGCTTGT TATCTGGAAT TGTAAAATTA GGTTGGGAGG TCATGTTTCC 19320

45 ACCTCGCACA CCAGAACGTA ATGCAACGAA CCCACCTCAA GAGTTATTGC AACAATTAGG 19380 

ATTTAGTAGT GAGTTTACGC ATCAAACATA TACATTTTCA AATATGGAAT TGCCTTGGGT 19440

AAGCTTTATT GTCCACTTTA GTTTTTCTAT CGTCATTGCA ATTATTTACT GCATATTAGT 19500 

50 TAAAAAATAC GCTTACTTAG CAATGGGACA AGGTGCTGTT TTTGGTATTG CTATTTGGGT 19560 

ATTATTCCAC CTTATCATTA TGCCAATCAT GCATACTGTA CCTGCTGTGT GGGATCAACC 19620 
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AGTGCGACAA CATTTTGTCT ATCGCTATAA 
TGAGTTTtAA ATCATCGTTT GAGTATGATG 
5 ATGATATGAT TCCAGCCAAC TTACGTGAGC 

ATTATTTTCG GTAATTTCAA TATTGCTCAG 
TCGTCATGAT TTTGAATGAT GCCATCGTGT 

10 

ATCTAACAGT AAACATTTAG GCAGTATAGT 
CGTCATATTA ACTACTCAAT AACTGAAATA 
GTCCTAAGAA ACATACCTGT ACCATAAACT 

15 

CACGTGATTG ATAAAAGTGT GATTGTGTCA 
CTTCACCAAT CAATTTTGTA AAACTTTCTT 

20 AACTACATAT TTGATGTGCT TCGTTTGTAG
AACGTTGTGA TATCTTTTCG ATATCAATAC 
TATAAGGATA ACTATATGAT AAGCTCACAT 

25 CAATATCTGC CTTACCTCGT GGCGAAATGT 
TGTCATGTTG AATTCCATAT TGCACTAAAA 
GATTGTATCT ATAGTTGACT GTACGCGGTT 
ATTCTTCAAT ACTTTTCAAG TTACTCTGTA 
CTACTTACTT ATTGAATATT GTTTTGATAT 
r TTATCGTTGG CCATTTTTCA CT<^ATGTGAT 

35 

TCATTGCTAG TTGTTGTTCA CGGTCAACAC 
GTTTCTGTAA AATTTCATTA AATGATTCAT 
CGCATTCTAA CAAAGATTTC ACCGGCATTT 

40 

GTGGTGTGTT GACCTGTGCT AATGCGACAA 
AATCTACAGG CATTTCAGCC ATGCTAACCC 

45 TTACCATTGA AAAACGGTTA GTCTTTATAT
TCGTCAAATT ACCAACACGT ACAATCCGAC 
CTTTTAATTC ACTATAAAAT TTGCTCCGTG 

50 AGACATCCGC TTsTGAAAAT GTCACATCTT 
TTATCGTAGA CACATATATT AACCTTGCAT 
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TACATGTCTT 


20880


CGCTGACGAG 


TTCAATTTCT 
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20940 
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TAGCACATGG 


TAAATGATTT 


21000 
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21060 


CGATACAATC 


CAGTTGTAAC 


AAATCATTCA 


21120 
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CCATCTTCCA TTGTAAGGAT 
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CATCTAAGCC ATTATTTACA GCTTCTAATA 
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21300 
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21360 


GATGTTGTTG 


TGCCAAACGT 


ATGACATCAA 


21420 
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CACCTGCATG AATAATCGTA TCCATGTTTT CTGGTAAAAC AACATCATCC ATACACTCGA 21540 

AATCACCAAC AATGACTTCA ATGTTTGATA ACATTATTTC AACCGTCTCT TCTGAAAAAT 21600 

5 

AATCATTTAA ATTCGTCATC AACTTATACC ATGCTATTTC CTCATTATCA GCACGTATGA 21660 

AACAATAAAT GCGATGACTG TATCCTTGTA GTACTTCAAT CAGATAAGCA CCTAAAAAAC 21720 

CTGTCGCGCC AGTCAATAGT GTATTTCCTA GAGGTCGATG ACTTAGACTA TCCTCTAAAA 21780 

10 

TACCCAAGTT ATAACGAGAC ATAACAATCT TTTGTAATTC CGAAAGATTA TCCGGTAATG 21840 

CAACTAATGA TTGTTGATTT TGGTACATAT AATTAACAAT CTGTCGCACG GTTTTATATT 21900

GGTATAATGT CTGCATTGAA ATATGATGGC CAAATCGTTT TAAATGCGAG ACAACTAACA 21960 

TCGCCTCTAA TGAGTTACCA CCAAGTTCAA AGAAATCATC GTCAACACCG ACATCATTTT 22020 

GTTTCAATAC CTCTCCAAAT ACATCAACAA ATGTCTGCTC AATTTCATTA GAGGGTTCGC 22080 

TATACACTTT ATTAGACTGT TGTATAGGTG ATGGATTTGG CAAACGCGTA GTATCCACCT 22140

TGTCATTCGT GGTTAATGGC ATACAATCGA TATGCGTTAT AGTCTTAGGA ATCATATACT 22200

TAGGCAGCTG ATCATTTAAA TATTGCTTCA AATCCTGTTC CACTTGTTGC TCTCCGACAT 22260

AATAAGCATT CAATATATCA TGCGTATCAA AGTGACTTAC TGTTACAACA CAATCAGATA 22320

TACCACGAAT AGCTAATATT GCATTTTCAA TTTCATCAAG TTCAATACGG TACCCGTTAA 22380 

CTTTCACTTG TTTATCTATT CTTCCTAAAA ATTCAATTTG ACCATCAGAT GTATAACGTG 22440 


CTAAATCACC ACTATGATAC AACTTTCCTT TACCAAATGG ATTATTTTGC CATTTATCAG 22500 

CCATTAATTC TGGACGATTA ATATATCCTA TCGCTAAACT ATCACCTGCA ATACACAACT 22560 

CGCCTGGCAT ACCAATACCG CATAACAAAC CATCTGACAT AATATACACT TGGATGTTAG 22620 

ATAAGGGTTT GCCAATTGGA ATCGTCTCAG GTATCAAATC ACCACAATGA TGTGACCAAT 22680

ACGATGTGAT GACTGTTGAC TCAGATGGTC CATAGGCATT GAAATACGTG CCACAATGCT 22740

TCTCAATATA TTTAACAAAG GATGCCGTAC TAGTTGCCCC GCCTGTAATC AACTTTTCAA 22800

TATAAAAGTC TTCCATAACA CTACACATCT GTAACGGAAT CGACGCAACC GTCACACGAT 22860 

GCTTATTAAT GAGTTGTTGT AACTGTTCTG GATTAACACG TTCCTCTCTA TCTGGAATCA 22920 

CAAGCGTATG ACCATTTAAC AAACAACAAT AAATCTCCAT AACTGATGCA TCAAAAACAA 22980

TATTTGCATG TTGCAAAAAT ACTTCATTGT CGCCTAATTG CAATTCAGTT GACCATGCAT 23040 

GCACTAAATT CAACAAATTT CGTTGTCGTA TGGCAACCCC TTTAGGCATC CCGGTCGTAC 23100 

CAGATGTGTA AATAGCATAC ATCTCATTAT CTAACATCGC TGTGTTTTCA AGTTGATTGC 23160

CATGTAAATC ATCATATTGT TCATTTTCCT TTGATTCAAC AAAGCCTTTA GCATTTTCCA 23220 
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TAGCATCCTC CAAAATTGCA CCTTGTCGTT 
AAGATGCACC TACTTTAACT GTCGCCAACA 
TAAACAAGGC AACCCGTTGA CCATTGCCCA

CATCCACATA GTTGCGTAAT GTTTCATACG 
CATGATTACC TTGTCGTGAG ACAACTTCAT 


GGACATTAAG CATTCGATCG TTAACATGCG 
CGTTTGGTAT ATCACAAATT TGTAGTGTAT 
TCATACATTG ATTACCCATG TGACGAACTG 


CGATATTGAT TGTATAGTCA TCGCGATCTT 
CCGTCACTGA TTTGGGTTGA ATGTGTGTTA 
TTTCATTGTT TTGTAGTACT AACATGACAT 


ATTGATCTAA GTCATTTACT AAACATTCGA 
AACTCATTTC CTTAACCTCT TGTAAAAACT 

CTCTATATAC CAACGTATTA GCAAACATGC

GCGCACTCAT CACACTACCG ACAACAACAT 
TCATGACCAC ACTCATAAAG AACATAAAAT 

GAAGTAGCTG TCTCATTTGT TGATTCATTG

TAATATTTGG TCTAACATAG TCTGTCGGTA 
GAGATAACCA ATATTGTCTA TGTTTCGTCA 


CTTTATATTG CAATTTAAGT GGTAACAATA 
TCATTAATTG TATATTACTC ATACCGTCAT 
ATGCATGTAA GGGACTTCTA ATGTATCTCA 


AAGGTGCTAC AAATTGGCGC ATGATTTCTT 
CAAAGTCAAC TGCAACATCT GCCACAATAC 
ATTGTGTTCG TAAAATCTCA TGTCGCGCTA 


CTACATTAAG TTCTGATGAT AACCGCCATA 
GGTTTGATTT CCATAATAAA TACATACGCT 
TAACTATAGT TTCTGGAATC ACTTCATAGT

CTAGTTCAAA TACAGTTGGC TTTTGTAATA 



TATTCGGAAA ATCAATATCG ATAGGTATAT 23340 

TCGCCGCAAT CATTTCAAAA CTACGTTCTG 23400 

CACCATTTGA TAGGAGCATG TGCGCAATGG 23460 

TCATTGTCAA ATCATTCATG ACTAGCGCAA 23520 

TAAAGTAACT TATGATAGAT TTATTTCCCG 23580 

TATTGACCCA ATTTAGAAGT TCCTCCGTGC 23640 

CTTGATGCTT CAAAATATAA TCAATCATAA 23700 

TTTCTGAGTG ATATAAATCG GTATTATACT 23760 

CTTCAATGAT GAAAGATAAA TCAAATTTCG 23820 

ATTTACTATG CCCAAAATGA GCATGATTCG 23880

CAAATAATGG ATTCCGTGAG GCATCATGTG 23940

ATGGGTATTC TTGATGCTCG TATGCCTCCA 24000

GTGTCCACAT TTTATCAGGT GACGGTTGCC 24060

CTAGCATTTG CTCCGTGCCT TTATGCATAC 24120 

CATCTTTTCG AGCATATCTA CTTAACAACG 24180 

CAGTAATTTG ATGCTTTTCT ACATACTTTT 24240 

TAAATGACAT CATTGCTCCA TTTGTCGTTT 24300

AGCTTAAAAT AGGTACTTCA TCTTTGAATT 24360 

TATCACGATG CGACATCCAC TCACTATAGT 24420 

ATTTATGTTG ATAAAGTGCG TTAAGATCAT 24480

TAATGATATG ATGCGTATCT ATAAAGAGGT 24540 

CTCTAATTTG ACTTGGCTTT TCCAAATTAA 24600 

GTTCATCCGT AAAATGCGTG TTAACTTCTT 24660 

GTTGTCGAAC CTCATCATCT ACAACAATAT 24720 

TCAAACGCTG CACTGCTTGT CGCAATTGAG 24780 

AAAAAGGTAC GTTATACACC GTATCTTTAT 24840

TTTGTGCAGA GCTCAGCACA TAATCATCTT 24900

TTTGTTCTTG AACCTTAGCA ATCGCTTGTG 24960

AATCACCAAT TTGTAATCGT TTCCCAGTAG 25020 
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AATTATCATG AATACCTACT 


TGATTCACAT 


GTAAAATATC 


TGCAAAAATT 


TGGCATAGCA 


25140 




AGTGTTCGGT ATCTGTACTC GGTGCTACAT AGGCATCCGT ATCGACATAG TCCATGATAG 25200

GCAATGCCTT CTTATCTAAT TTCCCATTAA TAGTAATAGG AATTTGCTCA ATATGCATGA 25260

AATTAACTGG TATCATGTAC TCCGGTAAGG TCATACGTAA TTGTGATTTA ATCTTATTAT 25320

GTGATAATGT ATGCATCGCT TCATAATAAG CAACGATATA CTGATCTTGA TCATGATTTT 25380

GAACAATAAC AACTGCTTTA TTAATACCTT GTATACGCTC GAGCGCATGC TCAACCTCTG 25440

ACAACTCAAT CCTAAACCCT CGAATCTTAA CTTGTTTGTC CTTTCGATAT AAATAATCTA 25500

TGTTGCCATC GGGTAACAAA CGAACGATAT CACCACTTCT ATACATCAGC TGaTTTATAT 25560

TTGAATCTTT GATAAATTTA TCTGCTGTCA ATTCTGGCTG ATTTAAATAA CCTGCAGCTA 25620

ACCCAAAGCC ACTTGTACAT AATTCTCCAG GAATACCAAC GCCACACCGA CGCTCGCCTT 25680

GCATGATATA AACATGAGTA CCCAGAATCG GTTTACCAAT AGGAATACGA TTTGGAACTT 25740

TGTTAGGTAT ATTATACGTC GTTGTAAATG TTGTATTTTC AGTTGGTCCA TAACCATTAA 25800

TAATTTGAGG ATGCTTCGGT TTTTGATTAA GCAAATCCAC CCACTTAGCA TTCAATACTT 25860

CTCCACCAAT TAATAAATAC TTTAACGGTA CCAATACTTC TATTCGTTCA CTAGCAATCT 25920

GATTAAATAA TGAGGAGGTT AACCACATAG TATTAACGTC ATTTTCATTG ATTAATTGTT 25980

CTACCGCTAT TGGATTTAAT AATTGTTCTT TTTTAGCAAC AATCAGCTTT CCACCATTGA 26040

GCAATGCACC ATATATTTCA AATGTTGCAG CATCAAAGGC TATAGTTCCT GATAACAAAA 26100

TCGTCGTCTC TTCATTTAAT GGTACATAAT GATTTTGATG GACCAAGCGA ACAATACCTC 26160

GGTGCGGAAT TAGTGTCCCT TTAGGGTTAC CAGTTGTCCC CGACGTGTAA ATAACATAAG 26220

CATGATCTTC TAACGTGTTA CATTTAGAAA GATTATCAAT ATTTTTCCAC GCTATCTTAT 26280

TCAAS.TCAAT GTGATTAATA TTTTGTTTAC CATTTTCATA TAAAGCTTGG TACGTTATTA 26340

CAACTTTAGG CGTTACATCT TTTAAAATGT ACTCCTGACG ATCACTTGGA TAGTTCGGAT 26400

CAATTGGCAC GTAAGCCCCA CCAGCTTTCA ACACACCTAT CATCGCTATT ATCATCTCAA 26460

TACTTTTTTC AGCTATGACA GCGACACGAT CATTAGGTTC AACACCATAC TGGTTTCTCA 26520




AACGGTGTGC TAAATCATTC 






21 T* A TY2TT 21 T 1 & 


nnUiL i. UWU J. 






CAAATTGCAC AGCGACATGA TTCGGCGTTG CTTCAACTTG TTGCTCAAAT AAGGTAACAA 26640

CTGTTTGCGC ATCATCTATC TCAGGCAAAC TTAAATTGAT ATCGTCATAT AATTGAATAT 26700

CACGTTCTGT CATCAAATTA AGTTCATCTA CAGTTGTTCG TTTATTTCCA TTTTCTTCAG 26760

TAATTTGCAA ATAAATATTT CGAACTAAGT CACTCAGCGT CTCGATTGAG AGCAAATCAT 26820
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15 






CTAAAGATGT ATGTGCATCA TGTATTTGAT GTACATCCTC AATAACATCA TTACAACAAG 26940

ACATCATATG ATGATAACAA TGAAAAATAG TCTCTAGTGA AAGCGAAGAC TTCGCGCACT 27000

GTAATTGCGA CATATTTTGC AACACACATT TATTAAAATC TGTTGTAAAA CGTTGACATA 27060

CATCTTTTGC ATCGATTGTT AACGTTAACG GCACAATATT TCCGTGTAAA TCATTTGGTA 27120

AATGTGATGG TACATGTATA CCTAATGTGA CATCATGTTG TTGACTCATT ATATGATTAG 27180

CTAAATACAC ACTAACAGCC AACGATGCCA TATCTATTGA TGTCATATCA TCAATCAAAT 27240

ACGTTTGATA TAAAGCTTGT TCAAATGGAT GCTTAATTGG AAAATAACTA TCAATATGGA 27300

TGTCAGAGTT ATTCTCTAAC CGAAAATAGT TTGAGTCTAA TGCTATATGC GATGCATCTT 27360

GATTGTCTTT ATCATCATTT CTATTTATAT GTGCATGCTG TCGAGTATTG TTAATAACAG 27420

TATTGCCACG ATATGCATTG CATAAATCAT CAAGAAnAAT ATCAATTTGA CTATCATCga 27480

AAATGGaCAC ATGAAAATCT aATAGTATAT ATGcAGCATC AGCGAACTGra AACAATTTAA 27540

CTTTGAATAA AGGTGAATCA TTAAAATGGT AAGTACTTAA TTCTTGCTTA AAAAAAGCTT 27600

CTAAATCATA GTTTGCGGAA GAAGATGGAA CTTGTTTTAT CTCAATAAAA GGCAGAAATT 27660

CATGAAGTAT CATTGTTAAA TTGTCATCGG TAGTAACATC AAAAAAATGT CTTATAGATG 27720

CATGTTGTgC ACAATTGTCG ATAATGCATA CATCATTTTA GTAGCTTCAA CATTTTTAGC 27780

GAGTTTAACC CAATACGCAT TACGGTGTGT CGTTGATTCT GTATTATTTT TGTATATACG 27840

AAAATATTCC TGTTGAAATC TCAAATTACC CATAATCATA AAAAGTCCTT CTTTCATATC 27900

ATAATACTCA TTACTTACTG AAATTGCATG ATGATATGAT AACCGACGAA ATGTTAATTA 27960

ACTCGTTATG TAATGaTTAA TATaAAACAC CATTCGCAAC ATATGAGCGA TATATTCTAC 28020

CCTAAAATAC ATCTTGTATC ATCGTTACAA TTGGTATATT TTTCAATGTA AATTACATAC 28080

ATCTTCGATA AATAGCACAC TACAAATCGT TAATCACTTT CTGTTGTTCA CATCTCATTG 28140

LnnnL X Cnni r\X X\3 X lw* 1A CAAAATATCC ATGAAGCAAG TTTATATTAA AGAAACAACT 28200

CGCATAAAAC AATTGTTATC CTTAAATTTT AACAAATTCT TAATAAATTT ATCTCTATTT 28260

TAATTACGAC CAAATTAATA GGTTTTCCAT ATAAAAAGAT GCATAAAATA AATATTTAAA 28320

TAAATTCAAT TTGTATTCAC TTGTTTTTGT CCCCCAAATA CACCAGCAAC AAGCATGCTA 28380

GCACCAATTG TTAAAACGAT AAACATATAC AGTCCCATTT GTAATGACGT TAAGAAAACA 28440



CCCAACACAA TCCCTAACCT AGCTAGTGTT TCTGAAAAAT GAATACCTAA TGCATTAACT 28500 

GCACTATATG TTCCTCTTTT AGCTTTAGGA ATAATTTTAA AGCGTTGTTC TGAAACTATA 28560

GGCGAATAAA TAATTTCACC TACAGTCGCA ATTATCATAA AAACAACTAA TAAGCCAAAC 28620 
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TTCGAGATTG AATACGTGAG TAaAATGACG 28740 

AACATCTTAG CACCTGTAAT ATCATATGAA 28800

AGTCTAATAG CAATATATGA GGAGATTGAA 28860

GAAATCAATA ACATATAATT ACGGTCTTTC 28920

ATTATTTGTA ATTTTTGATA ACGACTTGCA 28980

TTTCGGTCTT GAGGTAACCA AATATATAAA 29040 

GCTATTAAGA AAAGTAGTAA CATACTGTAG 29100 

ATAGCTACCG ATAAGTTTGT CATCCAATAG 29160 

TCGGTAATTG CATCCATAAT TAATGTGTCC 29220 

GTAAATATGG CATATGCAAC ACAAAAACCA 29280 

ACGCCCATGA ACACAAGCAT TATCACCATC 29340 

tTCGGAAATA TCTCAATAAG GTAACCAGAT 29400 

AAACCAACAA GATATATACC GACAATTGAT 29460 

ATAAACGGTA TAAATGCTGT CGTAATAATT 29520 

TTCAA 29555 
(2) INFORMATION FOR SEQ ID NO: 207: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1539 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 207:

AAAAAAAAAA AAAAAnGGTG AATCTTTAAT TAAACACTAA TATTGTAAAA GATGTTAAGT 60 


AAACGCTTAA TGACACTTAT TTTTTGAAAA TAATAGTAAT ATCATTTTGT TAAATGAAAG 120 

AATAAAGCTA TAATmATTAT AGAATAACTA TTTAAAGGAG ATTATAAACA TGCCAATTAT 180 

TACAGATGTT TACGCTCGCG AAGTCTTAGA CTCTCGTGGT AACCCAACTG TTGAAGTAGA 240

AGTATTAACT GAAAGTGGCG CATTTGGTCG TGCATTAGTA CCATCAGGTG CTTCAACTGG 300

TGAACACGAA GCTGTTGAAT TACGTGATGG AGACAAATCA CGTTATTTAG GTAAAGGTGT 360

TACTAAAGCA GTTGAAAACG TTAATGAAAT CATCGCACCA GAAATTATTG AAGGTGAATT 420

TTCAGTATTA GATCAAGTAT CTATTGATAA AATGATGATC GCATTAGACG GTACTCCAAA 480



GCTTTTTTAA AATCTATTTT CAATACAACT 
ACGACCGTAT TAATCATTAG CAAGATTGCT
CCTATACTTA TTGTTTCAAA CTGATCCTTT 
AATTCACCCA TCATGATGAT ACTGAACCCC 
AAAACTAATT TATAACTGCG AAATATATTC 
TGCCTCTTGT CATCACTTTG CTTTACTTGA 
ATAAAGAGTA CAATTAAAAA TATACAAGCT 
CCATACATCA AGCCACCTAA CAATGCCCCA 
CTAATCTTGT AAATATAATG TTCCACGTCT 
ATAACTGGAA ATTGTAATCC CCAAACGATT 
ATAATTTGCC ACAATTGATG TGACCCAAAT 
GTCGCTTGAT AAATAAGTAC TAGCAACTTT 
ATAATGGACA ATGGAAATTT nAGAACCACT 
TGACTTAACA TATCTGTTAA ATATAGTGCT 
AGCTGTAAAA nATTGCTAAT CAATCGTACT 
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AGCTGACTTA TTAGGTCAAC CACTTTACAA ATATTTAGGT GGATTTAATG GTAAGCAGTT 600 

ACCAGTACCA ATGATGAACA TCGTTAATGG TGGTTCTCAC TCAGATGCTC CAATTGCATT 660

CCAAGAATTC ATGATTTTAC CTGTAGGTGC TACAACGTTC AAAGAATCAT TACGTTGGGG 720 

TACTGAAATT TTCCACAACT TAAAATCAAT TTTAAGCAAA CGTGGTTTAG AAACTGCAGT 780 

AGGTGACGAA GGTGGTTTCG CTCCTAAATT TGAAGGTACT GAAGATGCTG TTGAAACAAT 840 

TATCCAAGCA ATCGAAGCAG CTGGTTACAA ACCAGGTGAA GAAGTATTCT TAGGATTTGA 900

CTGTGCATCA TCAGAATTCT ATGAAAATGG TGTATATGAC TACAGTAAGT TCGAAGGCGA 960 

ACACGGTGCA AAACGTACAG CTGCAGAACA AGTTGACTAC TTAGAACAAT TAGTAGACAA 1020 

ATATCCTATC ATTACAATTG AAGACGGTAT GGACGAAAAC GACTGGGATG GTTGGAAACA 1080 

ACTTACAGAA CGTATCGGTG ACCGTGTACA ATTAGTAGGT GACGATTTAT TCGTAACAAA 1140 

CACTGAAATT TTAGCAAAAG GTATTGAAAA CGGAATTGGT AACTCAATCT TAATTAAAGT 1200 

TAACCAAATC GGTACATTAA CTGAAACATT TGATGCAATC GAAATGGCTC AAAAAGCTGG 1260 

TTACACAGCA GTAGTTTCTC ACCGTTCAGg aAACAGAAGA TACAACAATT GCTGATATTG 1320 

CTGTTGCTAC AAACGCTGGT cAAATTAAAA CTGGTTCATT ATCACGTACT GACCgTATTG 1380

CTAAATACAA TCAATTATTA CGTATCGAgA TGAATTATTT GAAACTGCTA AATATGACGG 1440 

TATCAAATCA TTCTATAACT TAGATAAATA ATTTTCTnTA TAATCAAATG CTGACATAAT 1500 

TTTAGTTGAG GATTATTATG ACGGTATAAA TAAATAAAG 1539
(2) INFORMATION FOR SEQ ID NO: 208: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 208:

CAATTTCTAT CTATCAATGA TGTGCATACT TCCAnTTAAA TTAAtCGAAA TGaATCAAGG 60 

TATATCATTC CTGCCTCTTT ATATAACaAC AAATAGTGAT TACAATATTT CGGTTATTAA 120 

CACGAAAATT TTACAAGCAC CTATTTCATT TACATATATA TACAGCAAAA AAGAAAGCCC 180

AGAAATATTG GTGTTTATTA AATCATTTAA AAAGTATATT GCCAATGAAC AATTATAATA 240

AATTTCAAAT CTAAAAAACC AAGAATGCGA TTAATCATCA CATTCTTGGT TCAATTTTAT 300

TCATGAATTT TTTCAACATT AAACGTTAAG TTATTGTCTG AATTTAAATT AACTTTAATC 360 
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CGTTGTACAA AACGTTTTAA TGGTCTTGCA CCGTATTGAG GTTCATAAGC TTCTTGACCT 4 80 

AGCCAAGCTT TAGCATCATC AGAAACTTCA ATTGAGATTC GTTGTTCTAA TAATCTTATA 540 

TTTAATTGCG TTAAGATTTT ATCTACAATC ATACTCATGT CATCAATAGA TAATGGTTTA 600

AATAATACGA TATCATCCAT ACGATTCAAA ATTTCTGGTT TGAAATATGC ATTTAAACTT 660

GTCATAACAG CTTTTTCTGT TGATTCTGTA ATTTCACCAG TCTCTTTTAC GTTTTCTAAT 720

AAAACTTGAG ATCCAATATT ACTTGTCATA ATAATAATAG TATTTTTAAA ATCAACGCTA 780

CGTCCTTTAG AATCAGTTAA ACGGCtTCAT GTAAAATTTG CAATAATACA TTAAAGACGT 840

CAGTAT 846

(2) INFORMATION FOR SEQ ID NO: 209: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1674 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 209:






nTGGGAACAG TAAGCCAGTA TTTTTAGAAA GTTGCCATAC ATGAGCGTCG ATTTTTCCAA 60

TATGGCTATG ACACTAGAAC AATGGGAATT TGGAGGAAAA GTAAATGATT AAACCTAAAA 120

TAGCATTAAC CATTGCAGGT ACTGATcCaA CAGGTGGTGC CGGCGTAATG GCTGATTTAA 180

AATCATTTCA TTCATGTGGT GTATATGGTA TGGGCGTCGT TACAAGTmTT GTTGCTCAAA 240

ATACATTGGG CGTACAACAT ATTCATAATT TAAATCATCA ATGGGTAGAT GAACAACTTG 300

ATAGTGTCTT CAATGATACC TTACCTCATG CTATTAAAAC GGGGATGATT GCTACAGCAG 360

ATACTATGGA AACGATTCGT CATTATTTAA TGCAACATGA ATCTATTCCA TATGTAATtG 420

ATCCTGTTAT GTTGGCGAAA rCggTGATTC ATaATGGwTA ATGACaCAAg CaAAACTTGC 480

AGCATaCGTT ATTGCCATTA GCTGACGTAG TAACACCGAA TTTACCAGAA GCTGAAGAAA 540

TAACGGGACT AACCATTGAT AGTGAAGAAA AAATTATGCA GGCTGGCCGC ATCTTTATTA 600

ATGAGATTGG TAGTAAAGGT GTCATCATTA AAGGCGGTCA TTCAAATGAT ACTGATATAG 660

CAAAAGATTA TTTATTTACT AACGAAGGTG TTCAAACATT TGAAAATGAA CGATTTAAAA 720

CAAnACATAC GCATGGAACA GGGTGTACAT TTTCAGCAGT TATAACGGCA GAACTTGCAA 780

AAGGTAGACC ATTATTTGAG GCTGTACACA AGGCTAAAAA GTTTATTTCA ATGAGTATAC 840

AATATACGCC TGAAATCGGC CGTGGTAGAG GTCCAGTGAA TCATTTTGCA TATTTAAAGA 900
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TGTTATACAA ACGATGTAGT TAAAAATTTT ACAGCGAATG GTTTATTAAG. TATTGGTGCT 102 0 

AGCCCTGCAA TGAGTGAAGC TCCCGAAGAA GCTGAAGAAT TTTACAAAGT TGCACAAGCG 1080 

CTATTAATCA ATATCGGTAC TTTAACAGCA GAAAATGAAC AAGATATTAT TGCGATTGCT 1140

CAAACGGCAA ATGAGGCAGG CTTACCTATT GTATTTGACC CTGTAGCTGT TGGTGCTTCT 1200 

ACATATCGAA AGCAATTTTG TAAATTATTA TTGAAATCAG CGAAAGTATC AGTAATTAAA 1260 

GGCAATGCAT CTGAAATATT AGCGTTGATT GATGATACAG CAACTATGAA AGGTACAGAT 1320 

AGTGATGCTA ATCTTGATGC GGTTGCAATA GCGAAAAAGG tTACGCAACA TATAAAACTG 1380

CAATAGTAAT CACAGGTAAA GAGGACGTTA TTGtTCmAGA TAATAAAGCC TTCGTATTAG 1440

CTAATGGATC TCCATTATTA GCACGAGTAA CTGGAGCTGG TTGTTTATTA GGAGGCGTTA 1500 

TTGCTGGATT TTTATTTAGA GAAACAGAAC CAGACATAGA AGCGTTAATT GAAGCGGTAA 1560 

GCgkATTTAA TATTGCTGCT GAGGTAGCTG CTGAAAATGA AAATTGTGGT GGTCCTGGTA 1620 

CGTTTTCACC ATTGTTGCTT GATACGTTAT ATCATTTAAA TGAAACAACC TATC 1674 
(2) INFORMATION FOR SEQ ID NO: 210: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2232 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 210:

ATGAGTTGCC GATGAATTTA GCACCACCAA CGATTGCnTT TGATACTGTG TCCCAACCAG 60

CTTGTTTAGC ATATTTAATA CCTTCACGTA AAGGATCGTT ATCATATGCA GCAATACCAA 120 

ATACGTTATG GTATTTCGTG TTTGAGTTAG TTACAACTTT GTnTTGCACT ACATCTGCAC 180 

CTTTtGCTAA TTGAGAAGTA CCGTTACCTG TTTCTAATAG GGCATGTGAG ATAAGATAAA 240

CTTCATTAAT GCCATACATT TGAGCAGCTT TGTTAAATGC AGCACCTTGG TTTTCTAATA 300 

CACCTTTACC TTTTAAGAAT TGATTAATTT TATCAATAGA AATATTTTGT GGTTGGTCTA 360 

AGCGTAAGAA TTGATATTTT AATGCTGGAT CTTGAGCTAA ACGCTTCGTA TCCATTGCAT 420 

GCTTAACATC ATTAAATTTA GCATCTGTCC ACTTACCTGG TACACGTTGT ACTTGTGGTT 480 

TATATTGTAA ACCAGCTTGT ATTTGAGCAA CTTGGTTTAA TGTCATACCT GTTTGATTAT 540

ACTTAATTAA TTCTTTAGCT AAATCAGTTG ATTTAATCCA TGCtAATTTA CCGTTAGATA 600 

ATTTACCATA GTACCAAGTT TGTCCATTAA TGACTTGTTC TTTAACAACT GCGAATGGTT 660 
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AACCATTACC ATTTTTAATT ACATAAGTGT AGTTATAATC TTTGGcAGCT GATGTAGTTG 780 

GTTTCACAGC AGTTGGTGCA GTTAAATCTT TTGCATTTAC CCAACCAGTG CGGTTATTAA 840

TAGTACCGTA TAAATAAACA TCTTTGCCTA CAGATACTTG TTTCGTTGCA TTAAATGTAC 900 

CTTGAGCAAT GTTATTGCCT GTTAAAATGA CTTGGTTTTT AGTACCCCAA GGAACCATTG 960 

ATAAGCCGTT ATTTGATTTA TTAACAGTAT ATTTTTGAGT CGTTTTAACT TCTTTGCCTA 1020

AGTTTTGAAC ATTTAAGTCT TTTACATTGA ACCAACCTAA TGGGATGTTA TGGCTTGTAT 1080

TGTTTAATAA TACATACGTT TCATTACCAT GAGCACGCTC TTTTGTTACA TAGAACGTAC 1140

GGTCTGCATA TTTCGCACCG TTTTTCGCTG TTTTTTCATA AACAGAAGCA CGAATACCAG 1200

TGTTGTTTGG TTTAACTTGA GCAATCTTGC TAACTGTTTG AGTCGTTTGT GGTTTAGTAA 1260 

CAGTATAAGC TTTTACAGCT GTTTTTGGTT GTGCTACTGC TTTTTTAGGT GCAGCAGGTA 1320 

CAGCTAAATA TGCTTTACTT ACCCAACCAG ATTTACCATT TACAGTTCCA AATAAATAGA 1380 

TAGATTTATC AATTTGTTGT TGCTTAGTCG CTTTAAAAGT TTGGTTACCT GTACCAGAAA 1440

CTGCACCAGC TTCTTGTTTA TAAGTGCCCC AAGGTACTGA ATATAATTTA GTGCCTGGgT 1500 

TTACTGTATA TGTTTGCATT ACATTTACAG GTGATTTTGC ATtGtTATAA ATACGTCACC 1560 

TTGTTTAACC CAACCAATTA AAGTTGGACT ATTGTAATCT TTAACTAAGT AGAATTTGTT 1620 

TCCACCTAAA CTTGCTTCTT TTGTTACAGC AAATGTTTTT TGAACTTCTT TCGTTGGCTT 1680

ACCAGTTTTG TCATAAACTG TAGTGAATAA GCCATTGTTT TTAGCATTAA TTTGAGCAAC 1740 

ACCGTTTAAT GATGAAACTG TTAATTTATT ATTTGTTGTA GGTGTTGATG GCTTAGGTGT 1800 

TGGTGTAGGC GTAGGTTTAG CAGTATCAAC TAAATATGCT TTACTTACCC AACCAGATTT 1860 

ACCATTCACA GAGCCATATA AATAAATTGA TTTATCAATT TGTTGTTGCT TTGAAGCCTT 1920 

AAATGTTTGG TTTCCAGAGC CAGACACACT ACCAGCAACT TGTTTAGATG TACCCCAAGG 1980 

TACTGTATAA AGTTTCGTAC CAGGTTTGAT TGAATATGAT TGATTTACAT TTACAGGTGA 2040

TTTAGCTGTG TTGTAAACCA CATCGCCTTC TTTAACCCAA CCAAATTTAT TACCAGAATT 2100

GTAATCTTGA ACAAGATAGA ATTTTTGATT ACCTAATGTA GCTGTTTTAG ATACAGCAAA 2160

TGTTTTTTGA ACTTCATTAG TTGCTTTACC AGTTTTGTCG TATACAGTAG TATATAAACC 2220 

ACTATTTGTT GG 2232 
(2) INFORMATION FOR SEQ ID NO: 211: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2082 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 211: 

GATTTAAATA AAATTAATGG ATATCGTGAT CGTACGATGT TAGAACTTCT GTACGCAACG 60 

GGAATGCGTG TATCTGAATT GATACATTTA GAGTTAGAAA ACGTGAACTT AATAATGGGA 120

TTTGTACGCG TATTTGGTAA AGGCGATAAA GAAAGAATTG TACCATTAGG CGACGCAGTC 180

ATTGAGTACT TAACTACTTA TATTGAAACG ATTAGACCGC AACTTTTAAA AAAGACTGTT 240

ACTGAAGTCT TATTTTTAAA TATGCATGGT AAACCTTTAT CACGACAAGC AATATGGAAA 300 

ATGATTAAAC AAAATGGTGT AAAGGCAAAC ATTAAAAAGA CGTTAACGCC ACATACGTTA 360 

CGCCACTCTT TTGCGACACA TTTATTGGAA AATGGCGCAG ATTTAAGAGC AGTGCAAGAG 420

ATGTtAGGtC ACTCTGaCmT ATCTACTACC CmaCTCTATA CmCATGTTTC GrAATCTCAA 480

ATTAGAAAAA TGTATAACCA ATTTCATCCT AGAGCATAAA GTGAACAATA ACTCAAAAGT 540

CACAATACAC ATGACTAAAA ATGTCTGTGC TATTGTGGCT TTTTTAAATT GGTTGATTAA 600 

TTACGTCTAT GTTTTCTTAA TTGAATCGCT TCTTCTTTTG CTGCAATCAC TTCTGAACGA 660 

TCACGGCGCA TGTGATGGTC TACAATAAAA GGATCTGTTG CTGTTTCCTG ATTATAATCA 720

TAGTCTGGAT AGTTGGCCTT GATGATGCGT TCAAAGACTG GAGTTATTGG TAATATAACA 780 

GATGAAAAAG GCTTTGCTGC ATTCAATTTT GCAATCTGTT GCTCAATTAA CAACTGATAA 840 

TCATTTAAAT TAAGGTATAA CGCATCTCTA TCTTTAGCAT TTTGTATTAT TTCTTTAGAT 900

TTATTAAAAG ACTTATAGGC GCCTTTTAAA TTATTGCGGC GATAATGGTA ACAAGCAGTT 960 

GCAAACAAGA TTAAACTAAC AACTGCATCT TGCTTACTGT AGTTATTTTC AGCTTTCCAT 1020 


GCATCTTCTA AAATGTCATG ACATAGGAAA TAATGTTGCT TAGTATGAAA TTGATAATAG 1080 

AAAOTTATCA GTGCCTGTTG CATTTTGTTA TCACCCCAAT TTAAAAGTAA GTTATTTTCA 1140 

TGCTATAATA TTTTAGAGAA TTATGCACAT ATGACGCAAT ACGAGGTAGA TATTATGTAT 1200

GAAGTTAAAT TAGATGCTTT CAATGGACCA TTAGATTTAT TGCTGCATCT TATCCAAAAA 1260 

TTTGAAATAG ATATTTATGA TATTCCTATG CAAGCATTAA CAGAGCAGTA TATGCAGTAC 1320 

GTTCATGCAA TGAAACAGCT TGAAATTAAT ATTGCAAGTG AATACCTAGT ATTAGCGTCA 1380

GAACTCTTAA TGATTAAAAG TAAGATGCTA TTACCACAAT CAACATCAGA TATGGATGTT 1440

GATGATGACC CACGGGAAGA TTTAGTtGGG CGTTTAATAG rATATCaAAA TTATArAGAA 1500

TATACTGCtA TTTTAAATGA CATGAAAGAA GAAAGAGATT TTTATTTTAC CAAAAAGACC 1560 

GACAGATTTA TCtCATTTGG AAAcAGATGA ATCyTGGGAT CCaAATCATA CGATTGATTT 1620 
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ATCTGTTGAA ATCCGAAAAG AGACATTTAC CATTCAACAA GCTACAGAAC AAGTGACATC 174 0 

GAGATTGAAA GATAAAGATC ATTTTAACTT CTTTAGTCTG TTTACGTTTT CTGAGCCAAT 1800 


TGAACAAGTA GTCACTCACT TTTTAGCTAT TTTAGAGATG TCAAAAGCAG GAATAATTAA 1860 

TATTGAGCAA CAACGTAATT TTGAAGATAT TAACATTATT AGAGGAGTGA ACTACCATTT 1920 

TGGATAATCA TGGTATATTA GAGTCGCTTT TATTTACAGC TGGCGATGAA GGTTTAGATG 1980

AAAAACAACT ATTAGAAATA TTAGATATGT CGAAAGACCA ACTCGTTGAA TTAATTGAAA 2040 

ATTATTCATC ACATGGATTA ATGATACAAC GATTTGGAAT GA 2082 

(2) INFORMATION FOR SEQ ID NO: 212:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4219 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 212:

TCTATTCTCG TTCTTCCAAG ACCCTGaATT AGAAGTTAAG AAAATCGAAG AAGATGAGAA 60

AGAATCTATT AAAAAAGCTC AAAAAGGTAT TTATAAAGAC CCTAGAGACA TCAATGATGA 120 

CGAACAAGAT GATGATACAA AAGATACTGT TGATAAAAAG GAATGATTGT AATTGCCTAA 180

CAAAAACACT CAAGAATATT GGGAAGAACG CGGACGCAAA GCAATCGAGA ATGAGTTGAA 240

GCGTGATAAA ACTAAAGCTG AAGAAATAGA ACGTATATTG AATATGATGA TTAAGCGCAT 300

TGAAAAAGAG ATCaATGCGT TTATTGTCAA GTACGGAGAT TTTGCAGGCG TTACATTACA 360

AGAAGCACAA AAGATTATTG ATGAGTTCGA TGTAAAAGCG TTTCAAGAAG AAGCAAAAAG 420 

ATTGOCGAA AACAAGGAGT TTAGCGATAG AGCAAATGAA GAATTAAAGA AGTATAACAC 480

GAAAATGTAT GTATCTAGAG AACAGATGTT AAAGATTCAA ATAGAATTCT TAATTGCTTA 540

TGCAACAGCT CAAACAGAAT TATCGATGAG GGAATATTTC GAATCAACAG CTTATCGTGT 600

GTTCAGTGAT CAAGCGGGTA TTTTAGGTGA AGGTGTACAA GTAGCTAAAG AAGTTATAGA 660

TACAATCGTT GATACACAAT TTCATGGTGT CGTTTGGTCA GAGCGATTAT GGACTAATAC 720 

CGAAGCAATG AAACAAGAAG TAGAAGAAAT AATTGCTAAT GTAGTTATTA GAGGTCGACA 780 

TCCTAATGAA TATGTTAAAG ATATGCGCAA CACTTAAATA AATTCGAAGG CACAGCACGA 840

CAAAAGACCG CAGCAATTAA ATCATTGCTT TATACGGAAT CGGCACGTGT TCACGCACAA 900 

TCAAGCATTG ACAGCATGAA AGAAATTTCA CCGGAAGgAT ATTATATGTA TATTGCAAAA 960 
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GACGCTAAAA TTGGTGTTAA TTTCTATCCT ATGCATATCA ATTGTCGTTC AGATTGCGCT 


1080 




TTACTACCTA AATCTATGTG GCCGAAAAAA CCAAGCAAGA AACGAAAAAC AAAATACTTC 1140

GGAGGGAAAG TGAAAAGCGG TGATTGATTT AAAAGTGAAG TTTTTTAAAG GCAAGTTAGT 1200

TTTGTATGAC AGTAAATTAA ATGTTTGGAG GATACTAATA TGAGTAATAC TGACAAATAC 1260

CTTAGAGACA TAGCAAGAGA ATTAAAAGGT ATACGTAAAG AGTTACAAAA GCGAAACGAA 1320

ACAGTTATTA TTGATGCAAA CTTAGACAGT TTAAGGTCGG CAGTATTAGC CGATAAAGAA 1380

AAATCGAAAT ATAATGAACC TCTCTTTTAA TAGCTAGCAC TTAATTGTGT TGGCTATTTT 1440

TTATGTCCAA AACGTGCTGA TGACATAAAA AGCACGCATG GAAAAACAGT CGACAGACTA 1500

TAAATGGAGG TATATCTCAT GGAAGAAAAT AAACTTAAGT TTAATTTGCA aTTTTTTGCA 1560

GACCAATCAG ATGATCCGGA CGAACCAGGC GGAGATGGTA AAAAAGGAAA TCCTGATAAG 1620

AAAGAAAATG ACGAAGGTAC TGAAATAACT TTCACGCCAG AGCAACAAAA GAAAGTTGAT 1680

GAAATACTTG AACGTCGTGT AGCCCACGAA AAGAAAAAAG CTGATGAGTA TGCAAAAGAA 1740

AAAGCAGCAG AAGCTGCTAA AGAAGCTGCT AAATTAGCGA AAATGAACAA GGATCAAAAA 1800

GATGAATATG AACGCGAACA AATGGAAAAA GAACTGGAAC AATTACGTTC AGAAAAACAA 1860

TTAAACGAAA TGCGTTCAGA AGCACGAAAA ATGTTGAGTG AAGCGGaAGT TGATTCATCA 1920

GATGrGGTTG TCAATTTAGT TGTAACAGAT ACTGCTGAAC AAACTAAATT GAATGTTGAA 1980

GCTTTTTCTA ATGCAGTAAA AAAAGCGGTT AATGAAGCGG TTAAGGTTAA CGCTAGACAA 2040

TCGCCATTGA CTGGTGGAGA TTCATTTAAT CACTCGACTA AAAATAAACC GCAAAACTTA 2100

GCTGAAATAG CTAGACAAAA AaGAATTATT AAAAATTAAC GGAGGCATTT AAATGGAACA 2160

AACACAAAAA TTAAAATTAA ATTTGCAACA TTTTGCAAGT AACAATGTTA AACCACAAGT 2220

ATTTAACCCT GACAATGTAA TGATGCATGA AAAGAAAGAT GGCACGTTGT TAAACGACTT 2280

TACAACACCT ATCTTACAAG AGGTTATGGA AAACTCTAAA ATCATGCAAT TAGGTAAGTA 2340

CGAACCAATG GAAGGTACTG AGAAGAAGTT TACTTTTTGG GCTGATAAAC CAGGTGCTTA 2400

CTGGGTAGGT GAAGGTCAAA AAATCGAAAC GTCTAAGGCT ACTTGGGTTA ATGCTACAAT 2460

GAGAGCGTTT AAATTAGGGG TTATCTTACC AGTAACAAAA GAATTCTTGA ATTACACTTA 2520

TTCACAATTC TTTGAAGAAA TGAAACCTAT GATTGCTGAA GCTTTCTATA AAAAGTTTGA 2580

CGAGGCAGGT ATTTTGAATC AAGGTAACAA TCCGTTCGGT AAATCAATTG CACAATCAAT 2640

TGAAAAAACT AATAAGGTTA TTAAAGGTGA CTTCACACAA GATAACATTA TTGATTTAGA 2700

GGCATTGCTT GAAGATGACG AATTAGAAGC AAATGCATTT ATCTCAAAAA CACAAAACAG 2760
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TGATTCGTTA 


GACGGTCTAC 


CTGTGGTTAA CCTTAAATCA 


AGCAACTTAA 


AACGTGGTGA 


2860 




ATTAATCACT GGTGACTTCG ACAAATTGAT TTATGGTATC CCTCAATTAA TCGAATACAA 2940

AATCGATGAA ACTGCACAAT TATCTACAGT TAAAAACGAA GATGGCACAC CTGTAAACTT 3000

GTTTGAACAA GACATGGTGG CATTACGTGC AACTATGCAT GTAGCATTGC ATATTGCTGA 3060

TGATAAAGCG TTTGCTAAGT TAGTTCCTGC TGACAAAAGA ACAGATTCAG TTCCAGGAGA 3120

AGTTTAATAA ATAATTAGGA GTGGTAACAT GCCCGAAATC ATTGGAATTG TTAAAGTAGA 3180

TTTTACAGAT TTAGAAGATA ACAGACATGT CTATATGAAA GGGCATGTCT ACCCTCGTAA 3240

AGGTTATAAT CCTACAGATG AACGTATCAA AGCTTTAGCT AGTGTTGAAA ATAAACGCAA 3300

CAAACAAATG ATTTACATTG TAAATGACAA ATTAACCAAA AAAGAACTTG TCGAAATAGC 3360

AAGTGTTGCT GGCTTACAAG TTGATGAAAA ACAAACAAAA GCTGAAATTA TCAATGCTTT 3420

TGAGTCACTA GAGTAGGTGG TTATATGACT ACGCTAGCTG ATGTAAAAAA ACGTATTGGT 3480

CTTAAAGATG AAAAGCAAGA TGAACAATTA GAAGAAATCA TAAAAAGTTG TGAAAGCCAG 3540

TTGTTATCAA TGTTACCTAT TGAAGTTGAA CAAATACCGG AAAGgTTTAG TTACATGATT 3600

AAAGAAGTTG CAGTTAAACG CTACAACAGG ATTGGTGCTG AAGtATGACA TCAGAAGCGG 3660

TTGACGGACG TAGCAATGCG TATGAATTGA ACGATTtCAA GGAGTATGAA GCTATTATTG 3720

ATAATTACTT TAATGCTAGA ACGAGAACTA AAAAAGGAAG GGCTGTGTTC TTTTGAGATA 3780

TGAAGATAGA GTTATTTTTC AATTAGAACA AGTAGCAACT TACAATCCTA AAACTAGCAA 3840

AAAAGAAAAC ACACTAATCA CTTATGATGC GATACCATGC AATATTAACC CCATTTCTAG 3900

AGCAAGAAAG CAACTTGAAT TTGGTGATGT AAAAAACGAT GTAAGTGTTC TGAGGATAAA 3960

AGAATCAATA TCTTACCCTG TTAGCCACGT GTTGGTTAAT GGCATTCGCT ACAAGATAGT 4020

TGAT&CAAGG ATATACAGAC ACGAAACGTC ATATTATATC GAAGAGGTCA ATTGATGAAT 4080

ATAGATGGAT TAGACGCACT GTTAAACCAA TTTCACGATA TGAAAACCAA CATTGATGAT 4140

GATGTAGATG ATATTTTACA GGAAAACGCC AAAGAATATG TAGTACGAGC TAAATTGAAA 4200

GCTAGAGAAG TAATGAATA 4219




(2) INFORMATION FOR SEQ ID NO: 213: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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GCTTACAAGT ATATTCATAA TTACATATTC AAGGTCCTTG CATGTGGTAT TTTGCTATGG 60 

yCtTTaACTA CAACGGGGTC TAAGACTGCG TTTATCATAT TAATCGTCTT AGCCATTtAT 120 

TyCTTTATkA AAAAGTTATT TAGTAGAAAT GCGGTAAGTG TTGTGAGTAT GTCAGTGATT 180 

ATGCTGATAT TACTTTGTTT TACCTTTTAT AATATCAACT ACTATTTATT CCAATTAAGC 240 

GACCTTGATG CCTTACCGTC ATTAGATCGA ATGGCGTCTA TTTTTGAAGA GGGCTTTGCA 300 

TCATTAAATG ATAGTGGGTC TGAGCGAAGT GTTGTATGGA TAAATGCCAT TTCAGTAATT 360 

AAATATACAC TAGGTTTTGG TGTCGGATTA GTGGATTATG TACATATTGG CTCGCAAATT 420 

AATGGTATTT TACTTGTTGC CCATAATACA TATTTGCAGA TCTTTGCGGA ATGGGGCATT 480 

TTATTCGGTG CATTATTTAT CATATTTATG CTTTATTTAC TGTTTGAATT ATTTAGATTT 540 

AACATTTCTG GGAAAAATGT AACAGCAATT GTTGTAATGT TGACGATGCT GATTTACTTT 600 

TTAACAGTAT CATTTAATAA CTCAAGATAT GTCGCTTTTA TTTTAGGAAT TATCGTCTTT 660

ATTGTTCAAT ATGAAAAGAT GGAAAGGGAT CGTAATGAAG AGTGATTCAC TAAAAGAAAA 720

TATTATTTAT CAAGGGCTAT ACCAATTGAT TAGAACGATG ACACCACTGA TTACAATACC 780

CATTATTTCA CGTGCATTTG GTCCCAGTGG TGTGGGTATT GTTTCATTTT CTTTCAATAT 840 

CGTGCAATAC TTTTTGATGA TTGCAAGTGT TGGCGTTCAG TTATATTTTA ATAGAGTTAT 900 

CGCGAAGTCC GTTAACGACA AACGGCAATT GTCACAGCAG TTTTGGGATA TCTTTGTCAG 960 

TAAATTATTT TTAGCGTTAA CAGTTTTTGC GATGTATATG GTCGTAATTA CTATATTTAT 1020 

TGATGATTAC TATCTTATTT TCCTACTACA AGGAATCTAT ATTATAGGTG CAGCACTCGA 1080 

TATTTCATGG TTTTATGCTG GAACTGAAAA GTTTAAAATT CCTAGCCTCA GTAATATTGT 1140

TGCGTCTGGT ATTGTATTAA GTGTAGTTGT TATTTTTGTC AAAGATCAAT CAGATTTATC 1200 

ATTGTATGTA TTTACTATTG CTATTGTGAC GGTATTAAAC CAATTACCTT TGTTTATCTA 1260 

TTTAAAACGA TACATTAGCT TTGTTTCGGT TAATTGGATA CACGTCTGGC AATTGTTTCG 1320

TTCGTCATTt AGCATACTTA TTACCAAATG GACAGGTCAA CTTATATACT AGTATTTCTT 1380

GCGTTGTTCT TGGTTTAGTA GGTACATACC AACAAGTTGG TATCTTTTCT AACGCATTTA 1440

ATATTTTAAC GGTCGCAATC ATAATGATTA ATACATTTGA TCTTGTAATG ATTCCGCGTA 1500 

TTACCAAAAT GTCTATCCAG CAATCACATA GTTTAACTAA AACGTTAGCT AATAATATGA 1560 

ATATTCAATT GATATTAaCA ATACCTATGG TCTTTgGTTT AATTGCaATT ATGCCATCAT 1620

TTTATTTATG GTTCtTTGGT GAGGAATTCG CATCAACTGT CCCATTGATG ACCATTTTAG 1680

CGATACTTGT ATTAATCATT CCTTTAAATA tGTTGaTAAg CaGGCAATAT TTAtTAAtAG 1740 
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TATGTAyTAT TTTGATATAT TTTTATGGAA TTTACGGTGC TGCTATTGCG CGTTTAATTA 



1860 



CAGAGTTTTT CTTGCTCATT TGGCGATTTA TTGATATTAC TAAAATCAAT GTGAAGTTGA 1920

ATATTGTAAG TACGATTCAA TGTGTCATTG CTGCTGTTAT GATGTTTATT GTGCTTGGTG 1980

TGGTCAATCA TTATTTGCC 1999






(2) INFORMATION FOR SEQ ID NO: 214: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7769 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 214:

TCATTATTAA GACTATTATA TATAATGAAT TTTAACTGGT TTATTAAACG AGAACGTCGG 60 

GAATTAAGTA ACTACAATAA AAATAAGATA TGACAATAAG GAGACTACAC GCGTGATCAT 120 

TGCCATAATT ATATTGATAT TTATTTCGTT TTTCTTTTCA GGAAGCGAGA CGGCATTAAC 180 

GGCTC3CCAAT AAAACAAAAT TTAAAACTGA AGCTGACAAA GGTGATAAAA AAGCAAAAGG 240 

CATTGTAAAG TTACTTGAAA AACCAAGTGA GTTTATTACA ACGATTCTAA TTGGGAATAA 300 

TGTCGCGAAT ATTTTATTAC CAACACTTGT TACAATTATG GCTTTACGTT GGGGGATTAG 360 

CGTTGGTATT GCATCAGCTG TTTTAACAGT TGTTATCATT TTGATCTCCG AAGTGATTCC 420 

CAAGTCTGTC GCTGCAACAT TTCCAGATAA AATAACAAGG CTTGTATATC CAATTATTAA 480 

TATTTGTGTC ATTGTGTTCC GTCCTATCAC ATTACTTTTA AATAAGTTGA CGGACAGTAT 540

TAATCGAAGT TTATCTAAGG GCCAACCTCA AGAACATCAA TTTTCAAAAG AAGAATTTAA 600 

AACAATGTTA GCAATTGCTG GACATGAAGG TGCTTTAAAT GAAATTGAGA CGAGTAGGTT 660 

GGAAGGTGTC ATTAATTTTG AAAATTTAAA AGTAAAAGAT GTAGATACAA CACCTAGAAT 720 

TAATGTGACG GCATTTGCTT CAAATGCGaC ATACGAAGAA GTTTATGAAA CGGTTATGAA 780 

TAAGCCATAC ACTAGATATC CAGTGTACGA GGGAGATATT GATAACATTA TTGGGGTGTT 840 

TCATTCTAAA TATCTGTTGG CTTGGAGTAA TAAAAAAGAA AATCAAATTA CAAACTATTC 900 

AGCTAAGCCA TTATTTGTGA ATGAACACAA TAAAGCTGAA TGGGTATTAC GTAAGATGAC 960 

TATTTCTAGA AAACATTTAG CAATTGTGTT GGACGAATTT GGTGGTACTG AAGCGATAGT 1020 

GTCACATGAA GACTTAATTG AAGAATTATT AGGTATGGAA ATTGAAGATG AGATGGATAA 1080 

AAAGGAAAAA GAAAAACTTT CTCAACAGCA AATTCAATTT CAACAACGGA AAAATCGCAA 1140

GTATTGAATA TCCAATTATA CAAGCAGGTA TGGCAGGAAG TACGACACCG AAATTAGTTG 1260

CATCAGTAAG TAACAGTGGT GGGTTAGGCA CAATAGGCGC AGGTTACTTT AATACGCAGC 1320

AATTGGAAGA TGAAATAGAT TATGTACGCC AATTAACGTC AAATTCTTTT GGCGTAAATG 1380

TCTTTGTACC AAGTCAACAA TCATATACCA GTAGTCAAAT TGAAAATATG AATGCATGGT 1440 

TAAAACCTTA TCGACGCGCA TTACATTTAG AAGAGCCGGT TGTAAAAATT ACCGAAGAAC 1500 

AACAATTTAA GTGTCATATT GATACGATAA TTAAAAAGCA AGTGCCTGTA TGTTGTTTTA 1560 

CTTTTGGAAT TCCAAGCGAA CAGATTATAA GCAGGTTGAA AGCAGCGAAT GTCAAACTTA 1620 

TAGGTACAGC AACAAGTGTT GATGAAGCTA TTGCGAATGA AAAAGCGGGT ATGGATGCTA 1680

TCGTTGCTCA AGGTAGTGAA GCAGGTGGAC ATCGTGGTTC ATTTTTAAAA CCTAAAAATC 1740 

AATTACCTAT GGTTGGAACA ATATCTTTAG TGCCACAAAT TGTAGATGTC GTTTCAATTC 1800 

CGGTCATTGC CGCTGGTGGA ATTATGGATG GTAGAGGAGT TTTGGCAAGT ATTGTCTTAG 1860

GTGCAGAAGG GGTACAAATG GGCACCGCAT TTTTAACATC ACAAGACAGT AATGCATCAG 1920 

AACTACTGCG AGATGCAATT ATAAATAGTA AAGAAACAGA TACAGTCATT ACAAAAGCGT 1980 

TTAGTGGAAA GCTTGCACGC GGTATCAACA ATAGGTTTAT CGAAGAAATG TCCCAATACG 2040 

AAGGCGATAT CCCAGATTAT GCAATACAAA ATGAGCTAAC AAGTAGCATA AGAAAAGCCG 2100 

CAGCAAACAT CGGCGACAAA GAGTTAATAC ATATGTGGAG TGGACAAAGC CCGCGACTAG 2160 

CAACAACGCA TCCCGCCAAC ACCATCATGT CCAATATAAT CAATCAAATT AATCAAATCA 2220 

TGCAATATAA ATAATCGACC GCAATCCACA AAAGCACAAG CACCCCCAAA CATTATTTTA 2280

GTGCTTGCCA TTTTTGTGGA TTGCGTTTCT ATTTTACCAA TTTAATCAAA CGAAAACATC 2340

AAGCTGAAGA TCGCCGAAAG ATTTTAATCA AGCAAAAACA TCAAACTAAA GTTCGCTGAA 2400

ATGATTATGA TAAAAGTTAT ATGGTATGAT GACATTGGTG ATATATATGA TAAACATCGG 2460

ATTAACAGGT TGGGGTGATC ACTATTCATT ATATGAAGAT TTAGAACGCC AAACCGATAA 2520 

ACTTAAAACA TATGCTGGAC ATTTTCCGGT TGTCGAATTA GATGCGACAT ACTATGCGAT 2580 

ACAACCGGAA AGAAATATAT TGAAATGGAT AAAAGAAACG CCTGATACAT TTGAATTTGT 2640 

GGTCAAAATT CATCAAGCAC TcACATTGCA TGCAGACTAC AAAACATTTG CAGATACAAG 2700 

GCAAGAACTA TTTGATCAAT TTAAGAATAT GTTAGAGCCC TTACATACAC AGAAAAAATT 2760 

AGCAATGGTA TTGGTTCAAT TTCCGCCATG GTTTGACTGC AATGCACAAA ATATCAAATA 2820

TATTTTGTAT GTAAGACAGC AATTACAAGC ATTTCCAATG TGTGTAGAAT TTAGGCATCA 2880 

ATCATGGTTT AGTGATGCAT TTAAAGAACA AACATTGGCA TTTTTAACAG AACATCAAAT 2940 




AATCACAAAT GAAATTGCGT TTGTACGTTA TCATGGACGT AATCATTACG GTTGGACTAA 3060

GAAAGATATG TCAGATCAAG AATGGCGCGA TGTACGCTAT TTATATGATT ATAATGAGCA 3120

AGAATTAATA GACTTGGCAC AAAAGGCACA AATATTAGCA CAAAAAGCTA AGAAAGTTTA 3180

CGTCATATTT AACAATAATT CTGGTGGTCA TGCAGCAAAT AATGCCAAAA CATATCAGCG 3240

ATTATTGAAT ATAGAATATG AAGGGTTAGC ACCACAACAA TTAAAATTAT TTTAAGAGGC 3300

GACGACTATG TTATTAACAA TTACATTATT AGTTTTAATC GGAGGTTTGT CAGCGATTAT 3360

AGGGTCTATC GTAGGCATTG GAGGCGGTAT TATTATCGTT CCAACAATGG TTTACCTCGG 3420

TGTTGAACAT GGATTACTAC ATAATATTAC AACACAAGTA GCGATAGGGA CGTCTTCAGT 3480

CATTCTAATT GTGACAGGAC TTTCTTCATC ACTTGGATAT TTAAAAACAA AACAAGTTGA 3540

TATTAAAAAT GGTTCCATCT TTTTATTTGG ACTATTACCA GGTTCATTGC TTGGGTCCTT 3600

CATTAGTAGA TATTTAACAT TTGAGTCATT TAATTTATAT TTTGGTATCT TTTTAATTTT 3660

CGTAGCCATT TTATTAATGG TAAGAAATAA GATTAAACCG TTTAAAATTT TCGATAAACC 3720

CAAGTATGAA AAGACTTATG TAGACGCTAA AGGTAAAACA TATCATTATA gTGTTCCACC 3780

ATTGTTTGCT TTTATTACAA CGTTTTTAAT TGGTATATTG ACAGGTTTAT TTGGTATTGG 3840

AGGTGGCGCA CTAATGACGC CACTAATGCT TATTGTATTT AGATTTCCAC CTCATGTAGC 3900

TGTTGGAACA AGTATGATGA TGATTTTCTT TTCAAGTGTC ATGAGTTCTA TAGGGCACAT 3960

TGCTCAAGGT CACGTAGCTT GGGGTTATGC AATCATtlTTA ATTATTTCTA GTTATTTTGG 4020

TGCGAAAATC GGTGTCAAAG TGAATCAATC AATTAAGTCA GATACGGTAG TAACATTATT 4080

GAGAACAGTA ATGTTGTTAA TGGGTATATA TTTAATTATT CGTGCGTTGA TTTAATACAA 4140

CTTTAAAAGG AGGACGTCAA TTTGAGGCTT ACAATTTATC ATACGAACGA TATTCATAGT 4200

CATTTACATG AATACGAACG CATTAAAGCA TATATGGCAG AACATCGGCC ACGACTTAAT 4260

CATCCTTCTT TATATGTTGA TCTAGGTGAT CATGTAGATT TATCCGCACC TATAACTGAA 4320

GCAACTTTAG GTAAAAAGAA TGTGGCATTA CTAAATGAAG CAAAATGTGA TGTTGCAACA 4380

ATCGGTAATA ATGAAGGGAT GACCATTTCA TACGAAGCTT TAAATCACCT TTACGACGAA 4440

GCAAAATTTA TAGTGACATG TAGCAATGTT ATAGATGAAT CAGGTCATTT ACCAAATAAT 4500

ATCGTTTCTT CTTATATTAA GGACATAGAC GGTGTGAAAA TACTATTCGT TGCAGCGACA 4560

GCACCTTTTA CCCCATTTTA TCGTGCACTA AATTGGATTG TTACCGATCC ACTTGAATCT 4620

ATAAAAGAAG AAATTGAACT TCAACGAGGT AAATTTGATG TATTAATCGT GCTAAGTCAT 4680

TGTGGCATTT TCTTCGATGA AACATTATGC CAAGAATTGC CTGAAATTGA TGTCATTTTT 4740

GCAgCTGGAA AGTATGGTAA TTATCTTGGA GAGGTTAATT TAACTTTTGA GGCACATAAA 4860

GTAGTACATA AAACTGCAAA GATTATTCCT TTAGAAACAT TACCTGAAGT TGAAACTTCA 4920 

TTTGAAGAAG AAGGAAAAAC GTTAATGTCC AATTCAGTAA TTCAACATCC AGTAGTGCTT 4980

AAGCGTAGTA TGAATCACAT AACTGAAGCT GCATACTTAT TAGCTCAAAG TGTTTGTGAG 5040 

TATACACATG CACAATGTGC CATCATCAAT GCTGGCTTAC TCGTTAAAGA TATTGTAAAA 5100 

GATGAAGTGA CAGAATATGA CATTCATCAA ATGTTACCGC ATCCGATTAA TATGGTAAGG 5160 

GTTAGACTTT TTGGTGTGAA ATTAAAAGAG ATTATAGCTA AAAGTAATAA ACAAGAATAT 5220 

ATGTATGAAC ATGCACAAGG TTTGGGTTTC AGAGGGAATA TATTTGGAGG ATATATTCTT 5280 

TATAATTTAG GGTACATTCA TTCTACAGGG CGTTACTATC TGAATGGAGA AGAAATCGAA 5340 

GACGACAAAG AATATGTACT AGGTACGATA GATATGTATA CGTTCGGTCG TTATTTCCCA 5400 

ACATTGAAAG AATTACCAAA AGAGTATTTA ATGCCAGAGT TTTTAAGAGA TATATTTAAA 5460

GAAAAATTAT TGGAATATTA AAAAGTAAGA TTATTGGATT TTCATTTGTC ATGAATTTCG 5520 

ATATAATGTT TAAAGATACA CTTAACAGGA GGGTATGTGT TGTTATGGCG ACAAAAAACG 5580 

AGGAAATATT ACGTAAACCG GATTGGTTGA AAATAAAATT AAATACCAAC GAAAACTATA 5640 

CAGGACTTAA GAAGATGATG AGGGAAAAAA ATCTTAATAC TGTATGTGAA GAAGCTAAAT 5700 
GTCCTAATAT ACATGAATGT TGGGGTGCAC GTCGTACAGC GACATTTATG ATTTTAGGTG 5760

CCGTATGTAC AAGAGCTTGT CGTTTTTGTG CGGTTAAGAC AGGTTTACCT AATGAACTTG 5820 

ATTTAAATGA GCCTGAACGT GTAGCTGAAT CAGTTGAATT AATGAATTTG AAACACGTTG 5880 

TTATCACTGC TGTTGCGCGT GATGATTTAA GAGATGCTGG TTCAAATGTT TATGCTGAGA 5940

CAGTACGTAA AGTTAGAGAA AGAAATCCAT TTACAACGAT TGAAATTTTA CCATCAGATA 6000 

TGGGCGGGGA CTATGATGCG TTAGAAACAT TAATGGCGTC AAGACCTGAC ATTTTAAACC 6060 

ATAATATTGA AACTGTTCGT CGCTTAACAC CGAGAGTTCG TGCGCGTGCG ACTTACGACA 6120 

GAACATTAGA GTTTTTACGT CGTTCAAAAG AATTACAACC GGATATCCCA ACTAAATCAA 6180

GTATTATGGT TGGATTAGGT GAAACTATAG AAGAAATTTA TGAAACGATG GATGATTTAC 6240 

GTGCGAATGA TGTAGATATT TTAACGATTG GTCAATATTT ACAACCTTCA CGTAAACATT 6300 

TAAAGGTTCA AAAATATTAC ACGCCTTTAG AGTTTGGTAA ATTAAGAAAA GTGGCAATGG 6360 

ATAAAGGGTT TAAACATTGC CAAGCTGGAC CTTTAGTACG TAGTTCTTAT CATGCGGATG 6420 

AGCAAGTAAA TGAAGCTGCT AAAGAAAAGC AACGCCAAGG TGAGGCACAG TTAAATAGTT 6480

AATATTTAAC CATTAATAAG GCATAAAGGC TTAGTTTGTA CAAAACGAAC GTGTCATAGA 6540

AGGTGAAGAA TTTGATAAAA GTAGATCAAC ATTACTTTGA ATTAATAGAA AATTATCGCG 6660

AATGTTTTAA TGAAGAACAA TTTATTGCTA GGTATTCAGA TATTTTAGAT AAATATGATT 6720 

ACATAGTTGG TGACTATGGT TACGATCAAT TACGATTAAA AGGTTTTTAC AAAGATTCTA 6780 

ATAAAAAAGC AGAGATGAGT AAACGTTTTT CAAATATTCA AGATTACATA TTTGAATATT 6840 

GTAACTTTGG TTGTCCTTAC TTTGTATTAA GACATTTGTC TAAACAAGAG GTTAAAAAGT 6900 

TAATCGAAGA AGTTCATCCG TCTGATGTGA TAGATGACGA CAATAAACTT CAAGATGTGA 6960 

AGATTAAGCC AACCATTCAA GATACTGAAC ATTAATAAAA CCCTTAGCTA GATTGAAAAT 7020 

GGGAATCATG CAATTCAAGC ATGGACCTGT AATCTAGTTA GGGGTTTTTA TCTTTAATGA 7080 

ATGACTTCAT TTAAATACTC AGTAATTTCA TCGCCTTCTT CAGCATTTAC ACCTAAAATA 7140 

TGAGCGATAT AGCCTTCTTC TTTTAAATCA TCAGTACCGA TAATACCGAA TTTATTTGTT 7200 

TGCATATTAA GTACGAGTGT CTTACCATAA TGTCTATTTG TATGGACTAA CATCAAATCA 7260

TATCGACTAT GCTCGCCAAC AAAACCAACA AACTGAACTT GACTCTCTTC GTTGTCATCA 7320 

TATAAATACA TATCAATCAT TTTGTAGCGA CTCCTTTTAA AAGTAGTAAA GTTAGTATAA 7380 

CGACAAATGA AGTATACTGC AAAATTATGA TAATATATAA GTGAGAGGTG ACAAGGAATG 7440

TATTTTGTAG ACAAAGATAA ACTAACTCAG AAATTAGCCT ATTTACAAGC ATTAACTGAT 7500 

GATTATCATG AGAGCAAGCA CAATCATTAT GCATTTGAAC GCATTGCTCA AATGTTGATA 7560 

GAATCATCGG TAGATATAGG GAATATGATT ATCGATGCAT TTATTTTAAG GGATCCTGGT 7620 

AATTATAAAG ATGTGATTGA TATATTAGAA CTAGAAAATG TTATTACTAA AGAAACACAG 7680

CAGGCGATTA ATAAAACTGT CGGTATTCGT AAACAATTTA CATATGATTA CACAGCCTTA 7740

GATGTTGAGA TTATCATGCC AATGTTTGA 7769 

(2) INFORMATION FOR SEQ ID NO: 215:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 215:

ACCGCCACCC ATTAATGATT GCTTAAAATC AATAGTCGTA CCATTTAATA CGGGTGCATC 60 

TTTTTTGTCT ACTAATACTT TTAATCCAAA GTATTCTAAG ACTTCATCAT TTTCACCAGG 120

CGCTTCTTCT GCACCCATAC CGTATGTTAA ACCAGTGCAC CCGCCACCAT TCACTTTAAT 180

TOCTTCTGTT AATATAACTG TTGGCATGAT AACTCCTCCT TAAAAAATCC AAGTTTCTTT 300

TATATGTGCA TATATATTTT GTAATAATTC TTCCGGCGAA TCACCTTCAA CAATATCACC 360 

ATTTACTAAA GCATACAACC CGGCTGAACA TATACCACAA TGTGTCAGGC AACCATACTC 420

TAACACATCG ACATCTGGGT CATTTTCCAG TTGATTAAAA ACATAATCTC CACCTTTTGC 480

CATGTTAGAG AGACAAAATT CTACGATCGG ATTCATACTT CACCTTCTTA TTTCATTTGT 540 

TACAATATTA TAGCATTTTA AAACTGGTAT TTTAACATGA TGTGCTCAAT TAGCAACAAC 600 

TGATGTTTCT TATCCCAGTT ATGTAATAGT GCCTTAGTTA GTAC 644
(2) INFORMATION FOR SEQ ID NO: 216:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1578 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS : double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 216: 

GAATGATGAA AGGAATAGAA AAGAAAAGAT AAATAATGTA ATAGATTTAT CCGAGAAAAT 60 

TGAAAGAACA AAAGATATGC CAATCAAGAA TACTATAACT ACTCAATTAG GAAATAAACT 120 

TATTGGCACA AAAAAAGCTC GTTTTGATGA TAAGAAAGTA GTGTCGTTTG GAGCATTTGA 180

AGATGAATAA AATAAATGAT AGAGATTTAA CAGAATTGAG TAGCTATAGG GTTTATCAAG 240

ACATCAATAA AGATAATGAC TTTACAGTTA ACGAAAAACG ATTTAAGCAG GCAGATGTAT 300

TTGAAGATTT ATATAGAGAG AAACTAAAAG ACACAAATAA ATTAAGAGAG TATAATTATT 360

TACAAAATGA AACTTTTAAA AGCGCATAAA TAGGTGATGA GATATGCTTA AAAAAGCAAA 420 

ATTTSTCTTA ATGGCAACGA TACTACTATC AGGATGTTCA ACTACCAATA ACGAATCCAA 480

CAAAGAAACA AAATCTGTAC CAGAAGAAAT GGATGCTTCA AAATATGTAG GACAAGGATT 540 

CCAACCACCT GCAGAAAAAG ATGCGATTGA ATTTGCAAAG AAGCATAAAG ATAAAATTGC 600 

TAAGCGAGGC GAACAATTTT TTATGGATAA CTTCGGTCTA AAAGTTAAAG CTACAAATGT 660 

TATAGGTAGT GGCGATGGTG TAGAAGTATT CGTGCATTGT GATGACCACG AyATCGTATT 720 

TAATGCGAGT ATTCCATTTG ATAAATCAAT wATTGAsAGT GATAGCTCAT TAAGAAGTrA 780 

GGAyAAAGGy GATGATATGA GTACTTTAGT TGGTGCAGTA CTCAGTGGGT TTGAATATCG 840

AGCACAAAAA GAAAAATATG ATAAATTATA TAAATTTTTC AAAGATAATG AAGAGAAATA 900 

TCAATATACA GGATTTACAA AAGAAGCAAT TAATAAGACG CAAAATAGTG GTTATGAAAA 960

ACCATTGTTA AACAAAAGTG ACAGTGAATT TTCAAAAGAA TTGTCAAATG TTAAGAAGCA 1080

ATTAAAAGAT AAGTCTAAAG TTTCGGTAAC TACTACTCTA TTTAGTAAAA AAAAGAACTA 1140

TACTAAAAAA AGTAACAGTG AAAATGTAAT AAAAATGGCA GAAGAAATAA AAAAAGATAA 1200 

AGAGATACCA AACGGTATAG AGCTTAGTAT AAAATTTTCG GACAATAAAA TAAATACGGT 1260 

TAAACCAAAT TTTAACGGTG aAAGCACTTC AGAATATGGT GTGTTTGATC AAGAATAAAA 1320

TTAATGATGa AAATTTAACG GAGAATAGTG TATATTGAGT AGATCmAGAA TAAAAAGATA 1380 

ATTCTACTAT TGTTGTGAAG GCAAATAAGT AGAAGATTTT AAGTGTAATT TCTGGTGATT 1440

TAAATAATAA TATAnATGGn AGTACTGATA TAAnACTTTT TAACCTACTA GATTCTTATA 1500 

ATTTGCTTTC CATTTTATGA CGATTTTTAC TCCAATTGAG TGATAGAATC CAAAAAAGCC 1560 

ATCTGCAAAA ATTAATCC 1578 
(2) INFORMATION FOR SEQ ID NO: 217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5137 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 217:

TGTTTTCCTT GGGTTAAAAC ATGCTTGCTA TGCGTTTGTA AATATGACTT GCTGTTTTnA 60 

CCTGnATACC CGTCACACCA TGGAAGTAAA AATGTTTCTT GCTCTTGGCT TACAATTTTA 120 

GCTTTAATCG CTTCATATGC TTTATATTGG TCTTCTGTTA ATTGCTGTTT TGATTCTTGT 180

TCGAAAACAC GATCTTTAAA TGGGTCTCTT TCAACAACCG CGTCATATTT TTCAACATAA 240

CCTTCTTTGA TAAGTCCATC TAAACTGGAT TTTGAAAAGC CCATATCCTC AATATCAGTT 300

AAAAATATTG TTTTATGTTG TTCTTCAGAC AAGTAAGCAT ACAAATCGTA TTGTTTAATA 360

ACTTTCTCCA ACTTAGCTAA TACTTCATCA GGATGATACC CTTCAATGAC ACGAACAGCA 420

CGCTTGGTTT TTTTAGTTAT ATTTTGTGTG AGAATCGTTT TTTCTTCAAC GATATCATCT 480

TTTAACAACT TCATAAGCAA TTGAATATCA TTATTTTTTT GCGCATCTTT ATAATAATAG 540

TAACCATGCT TATCAAATTT TTGTAATAAA GCTGAAGGTA GCTCTATGTC ATCTTTCATC 600

TTAAATGCTT TTTTATACTT CGCTTTAATA GCACTCGGAA GCATCACTTC TAGCATAGAA 660

ATACGTTTAA TGACATGAGT TGAACCCATC CACTCACTTA AAGCTATTAA TTCTGATGTT 720 

AATTCTGGTT GTATATCTTT CACTTCTATG ATTTTTTTTA ACTTCGAAAC GTCAAGTTGT 780

ACAATTACAC GCACACCAGG TTGGATGACA GATTCGAGTT GTTCGGGAAT TATATAATCA 900

AATTTATAGT CAACGCTCTT CGACGCGACA TCGACTATGA CTTTCGCTAT CATTATTGCC 960 

ACCTAGTTTC TAGTTCATCT AAAATTTGTG CAGCTAATAC TACTTTTTTT CCTTTCTTGA 1020 

TATTTACTTT TTCATTATTT TTAAAATGCA TTGTCAATTC ATTATCATCA GAACTAAATC 1080 

CGATAGACAT ATCGCCAACA TTA7TTGAAA TAATCACATC TGCATTTTTC TTGCGTAATT 1140

TTTGTTGTGC ATAATTTTCA ATATCTTCAG TCTCTGCTGC AAAGCCTATT AAATACTGTG 1200 

ATGTTTTATG TTCACCTAAA TATTTAAGAA TGTCTTTAGT ACGTTTAAAA GATACTGACA 1260 

AATCACCATC CTGCTTTTTC ATCTTATGTT CTAATACATC AACCGGTGTA TAGTCAGATA 1320

CGGCTGCTGC TTTTACAACA ATATCTTGTT CGTCAAATCG GCTTGTCACT TGTTCAAACA 1380 

TTTCTTCAGC ACTTTGAACA TGAATAACTT CAATATCTTT TGGATCCTCT AGTGTTGTAG 1440

GACCAGCAAC TAACGTCACG ATAGCTCCTC GATTTCGCAA TGCTTCAGCT ATTGCATAGC 1500 

CCATTTTTCC AGAAGAACGA TTGGATACAA ATCTGACTGG ATCGATAACT TCAATAGTTG 1560 

GTCCTGCTGT AACCAATGCG CGTTTATCTT GAAATGAACT ATTAGCTAAA CGATTACTAT 1620 

TTTGAAAATG AGCATCAATT ACAGAAACGA TTTGAAGCGG TTCTTCCATA CGTCCTTTAG 1680 

CAACATAACC ACATGCTAGA AATCCGCTTC CTGGTTCGAT AAAATGATAC CCATCTTCTT 1740 

TTAAAATATT AATATTTTGC TGCGTACGTT TATTTTCATA CATATGCACA TTCATAGCAG 1800

GCGCAATAAA TTTCGGTGTC TCTGTTGCTA GCAACGTTGA TGTCACCAAA TCATCAGCAA 1860 

TACCTACACT CAATTTTGCA ATTGTATTTG CCGTTGCAGG TGCAACAATG ATTGCATCTG 1920 

CCCAATCACC TAATGCAATA TGCTGTATTT CTGAAGGATT TTCTTCTATA AAAGTATCTG 1980

TATAAACAGC ATTTCGACTT ATTGCTTGAA ATGCTAATGG TGTCACAAAT TTTTGTGCGT 2040

GATTSGTTAA CATAACGCGA ACTTCATACC CAGATTGTGT TAACTTACTT GTCAAATCAA 2100 

TTGCTTTATA TGCCGCAATG CCACCTGTAA CGGCTAATAA TATTTTCTTC ATATTCAATC 2160 

TCCCTTAAAT ATCACTATGA CATTTACGCT TTACATCATC ATATGCGCAC AAATGCTCAT 2220 

TACTTTTTTA TAGATACAAA TTTAGTATTA TTATAACATC AATCATTGGA TAAACTAAAA 2280

AAACACACCT ACATAGGTGC GTTTGATTTG GATATGCCTT GACGTATTTG ATGTACGTCT 2340 

AGCTTCACAT ATTTTTAATG GTCGAAACTA TTCTTTACCA TAATAATCAC TTGAAATAAC 2400 

AGGGCGAATT TTACCGTCAG CAATTTCTTC TAACGCTCTA CCAACTGGTT TAAATGAATG 2460

ATATTCACTT AATAATTCAG TTTCAGGTTG TTCATCAATT TCACGCGCTC TTTTCGCTGC 2520 

AGTTGTTGCA ATTAAATACT TTGATTTAAT TTGTGaCGTT aATTGGTTtA AAgGTGGATT 2580 

TTTArGTGcT CAGCTTCTAC AATACATTGA ATTCtATTCy TCGcAAGtTC TACTTCAtCA 2700

TTAACTACAA cGTAAyCGTA TAAATTCATC ATTTCrACTT CTkTACGCGC yTCGTTAATA 2760

CGACTTTGTA TTTTCTCATC AGATTCTGTT CCTCTACCTA CTAATCGCTC TCTCAAGTGT 2820

TCTAAACTTG GAGGTGCTAA GAAAATAAAT AGCGCATCTG GAAATTTCTT TCTAACTTGC 2880

TTTGCACCTT CTACTTCAAT TTCTAAAAAT ACATCATGAC CTtCGTCCAT TGTATCTTTA 2940

ACATATTGAA CTGGTGTACC ATAATAGTTG CCTACATATT CAGCATATTC TATAAATTGG 3000

TCATCTTTGA TTAAAGCTTC AAACGCATCC CTAGTTTTAA AAAAGTAATC TACGCCATCA 3060

ACTTCACCTT CACGCATTTG ACGTGTTGTC ATTGAAATAG AATACTTATA TGATGTACTT 3120

GGATCTTCAA ATATnCGTnT TCTAACAGTA CCTTTACCTA CTCCAGATGG TCCTGATAAA 3180

ACGATTAACA ATCCTTTTTC ATTATCCATG CCTTACGACC TCTCTAAGCT AATCTTCTAT 3240

TATTTAAATA TGATATCACA TTGTTCTTTA TATTGTATAG CATATTTGAA ATTGCATGCC 3300

ATAATTTCTA TTAAGTCTAA CAATATCGTT ATATTGCACG ATTAATTTTA ATTAAATAAA 3360

TTGAATTGCA AACTTTTAGA TAATGTAAAA TGTATGGCAT AATGTATGGT TCAATAACTA 3420

TACTGAAAAG TTACAATCAT GTTAAAATGA AACGAATGAT ATGAAGAAGG TGGAAGATAA 3480

ATTATGGCTT ATGATGGCTT ATTTACAAAG AAAATGGTTG AGTCTCTACA ATTTTTAACA 3540

ACAGGACGTG TTCAGAAAAT CAATCAACCT GATAATGACA CGATACTAAT GGTTGTACGT 3600

CAAAATAGAC AAAACCATCA ATTGTTATTG TCAATCCATC CAAACTTTTC AAGATTACAA 3660

TTGACTACTA AAAAATATGA TAATCCATTT AATCCACCCA TGTTTGCGCG TGTTTTTAGA 3720

AAACACTTAG AAGGTGGTAT TATCGAATCG ATTAAGCAAA TTGGTAATGA TCGTCGCATT 3780

GAAATCGATA TAAAGAGTAA AGATGAAATT GGCGATACTA TTTACCGCAC TGTCATCCTT 3840

GAGATTATGG GTAAACATAG TAACTTAATT TTAGTAGATG AAAATCGCAA AATAATTGAA 3900

GGATTTAAAC ACTTAACACC AAATACGAAT CACTATCGTA CAGTAATGCC AGGATTTAAT 3960

TATGAAGCAC CACCTACTCA GCACAAAATA AATCCGTATG ATATTACAGG TGCAGAGGTG 4020

TTGAAATATA TCGATTTTAA CGCAGGTAAT ATTGCTAAAC AATTATTGAA TCAGTTTGAA 4080

GGATTTAGCC CTTTAATTAC GAATGAAATC GTTAGTCGTC GTCAATTTAT GACTTCATCA 4140

ACATTACCAG AAGCATTTGA CGAAGTAATG GCAGAAACCA AGTTACCACC TACTCCTATT 4200

TTTCATAAAA ATCATGAAAC AGGTAAAGAG GATTTCTATT TTATAAAGTT AAATCAATTT 4260

AATGATGATA CAGTTACATA CGATTCATTA AATGATTTGC TTGATCGTTT TTATGATGCG 4320

CGTGGCGAAC GTGAACGCGT TAAACAACGT GCGAATGATT TAGTTCGATT TGTTCAACAG 4380

ATAAAGATAC TGAACAGTTA TATGGTGAAT TGATCACTGC TAATATATAT CGAATTAAGC 4500

AAGGCGATAA AGAAGTGACG GCATTGAATT ATTATACGAA TGAAGAAGTT GTCATTCCTT 4560

TAAATCCTAC AAAATCCCCA TCAGCAAATG CTCAATATTA TTATAAACAA TATAAyCGTA 4620

TGAAAACGAG AGAmCGTGAA TTACAACATC AAATTCAATT GACGAAAGAC AATATAGATT 4680

ATTTTTCAAC AATCGAACAA CAATTACATC ATATTTCTGT CCATGACATT GATGAAATTA 4740

GAGATGAATT AGCAGAACAA GGCTTTATGA AACAGCGTAA AAATCAAACT AAGAAAAAGA 4800

AAGCGCAGAT TCAATTACAA CATTATGTAT CAACTGATGG CGACGATATA TATGTTGGTA 4860

AGAATAACAA GCAAAATGAT TATTTAACAA ATAAAAAAGC TAAAAAAACT CACACATGGT 4920

tACACACAAA AGATATTCCT GGTTCACATG TCGTTATATT TAATGATGCA CCAAGTGATA 4980

CGACAATCAA GGAAGCGGCT ATGTTAGCAG GATACTTTTC AAAAGCTGGT AATTCTGGAC 5040

AAATACCTGT TGATTATACA TTAATTAAAA ATGTGCATAA ACCATCaGGT GCAAAGCCTG 5100

GGTTTGTAAC ATATGACAAT CAAAAAACTT TGTATGC 5137
(2) INFORMATION FOR SEQ ID NO: 218:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 218: 

GTTTTATCGC AGCAGTAAAG CTATCAATCG GCGGTTCAAT TGATGATGCA TTAGCAGAAA 60

TCAnACAATC ATTTTAGTTA AAATTTACTA ATAATGAaAA ATGTAAACCT TTTTCAAATG 120

AAACTTTATa AaAAATATGA TAGTATATAT GTAAATGTTT AATAAAATCT GGAGAAATAG 180 

GAGGACATTG CCATGCAACA CCTTATAAAA AAACATGTAT TGAATGGCGA GTTTGATTTA 240

GTACGACAAT TGATGTCCGA AACAGATTTT ATGGAATTTG AAGAAGCATA TATTTCAAGT 300

GCGCATGAAG TAGAAAGTAT GATGTTTTAT ACATGTATTT TAGATATGAT TAAGTACGAA 360

GAATCATCTG AAATGCATGA CTTAGCATTT TTATTGCTTG TGTATCCACT AAGTGAATAT 420

GAAGGTGCTT TGGATTCTGC TTATTATCAT GCAGACGCTT CCATAAAACT TACTGACGGC 480

AAAGAAGTTA AAAGTTTGTT ACAAATGTTA TTATTGCATG CGATACCAAC ACCTGTTATT 540

TCAGATAAGA AGGCTTTTGA TATCGCCAAG CAAATTTTAA AATTAGATCC TAATAATAAT 600 

GTTGCTCGTA ACGTCTTAAA AGACACTGCC AAACGTATGc gACAaCGTTG TTGTTGATAT 660

AGTTTTAACA TTTGGTTGGG TTGGGCATAT GTTCCAGCCT TTTTTAATAC TTAAAAACTA 780

ACGAAgTATA CTTGTGTGCA CAAATGGTTT TTATACAACA TTTTATAAAT TTATACATTT 840

TAATAAAGAA CATACGATAG ATGGTTTAAA CCTTGTTAAC TGAGAAATTT TGATATGTAT 900 

TCTTCGAAAT TTAACTAAAT ATACGAAATT CAAGAAGCAC AATAATTAAT CATTTTTCCT 960 

ATACAAAAGT TCGTATGACT GCATTATAAA AGCATAAATT TATAATTTTT TTAAATGTCA 1020

TTGAACGTGA TAATGTGAAT GGATTGAGCA ATTTTGAAAA AGTGAAAAAT AACCTATGCG 1080

ACTTGCAATT AATTTTCAGT ACGTTATAAT GCACACTGTG CAAAATTAAG GAGGTCTATT 1140

ATTCACATGA TGATGAaTAA AGAAGCAACA AAAATTGGAT TTGCCTACGT CGGCATTGTA 1200

GTGGGCGCAG gATTTTCAAC TGGACAAGAA GTTATGCAAT TTTTCACTAA ATATGGCTTG 1260 

TGGGCTTATT TAGGTGTTAT TATATCTGGT TTTATTTTAG CTTTTATTGG GCGCCAAGTA 1320 

GCAAAAATTG GTACTGCCTT TGAAGCGACA AATCATGAAT CAACATTACA ATACGTATTC 1380 

GGTGAAAAGT TTAGTAAAGT CTTTGaTTAT ATTTTAATCT TCTTCTTATT TGGTATAGCT 1440

GTAACCATGC tAGCTGGTGC AGGCGCAACA TTTGAAGAAA GTTATAACAT ACCTACATGG 1500

CTAGGTGCTT TaATTATGaC ATTAGCGATT TATATTACGT TGCkATTAGA CTTTAATAAA 1560 

ATAGTACGTG CACTAGGTAT CGTTACACCA TTTTTAATTG TTTTAGTTGT ATTAATCGCT 1620 

GGCGTTTATT tATTTAAAGG TCATGtTTCA TTAGCAGAAG TTAACCAAGT AGTGCCtGAA 1680

GCAAGTATTT GGAAGGGAAT CTGGTTTGGT ACAATATATG GTGGATTAGC TTTTTCTGTA 1740 

GGTTTTAGTA CCATCGTAGC AATCnGTGGG GATACTGAAA AGCGTACAGT GTCAGGTGCA 1800 

GGCGCGATGT ATGGTGGTAT TATCTATACT GTATTACTAG CATTGATCAA CTTTGcATTG 1860

CAAGTGaATA TCCAACTATT AAAAATGCCT CAATTCCTAC ATTGACGTTA GCAAATAATA 1920 

TCCATCCTTT AATAGCAACA GTGkTATCTG TTATTATGCT GGCGGkTATG TATAATACTA 1980 

TTCTAGGACT AATGTATTCA TTTGCAGCAC GTTTTACAGA ACCATACAGT AAAAATTATC 2040 

ATATCTTTAT TATTATAATG ATGGTAGCAG GTTATTTATT AAGTTnCGTA GGATTTGCTG 2100

AATTAATTAA TAAGTTATAT ACnATTTATG GGATATGTAG GCTTATTnTA TTGTAGTAGC 2160

TGTAATTATn AAATATTTCC AAACGTAAAA ATGGCGGATA AAAAACATAT TGCTTTAATA 2220 

TCATATGGAG GGGATATCCG AAACTTTACA ATTTGAATCA CTTTGGT 2267 
(2) INFORMATION FOR SEQ ID NO: 219:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6336 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 219:

GTCAATGTAA CCTAATAGTT TATGTCTATC TTGTGTACCA ACTACTACAT CGACACCAGG 60 

AATTTCCATA ATTTCAGCTG ATGAAGTTTG CGCATAACAA CCTGTTACAC AGATTACAGC 120 

ATCAGGATTT TGTCTTATTG CACGTCTAAT TATTTGACGA CTTTTTTTAT CACCCGTATT 180 

CGTTACTGTA CAAGTATTAA TAACAAATAC ATCAGCATTC GCTTCAAAGT CAACGCGCTC 240 

ATAGTTTGCT TCTTTAAATA ATTGCCAGAT TGCTTCAGTT TCATAATGGT TTACTTTACA 300 

ACCTAATGTG TGaACGCAAC TGTTGACATA AATATTCACC CCATTAATTC TTTTTCATAA 360

CTTATTGCAC TTAACGCATA CAATGGCGCA GTTTCTGCCC GTAAAATTCT CGGCCCAAGA 420 

CCAACAACTG TACTAGTATT ACTAAATAAT GAAATTTCAT TTTCTGACAA ACCACCCTCA 480

GGaCCAAAAA TCATCAACAC TTTATCCTGA GCATTGAATT GTTGTAAAGT TTGCTTGAAA 540 

TTGCTTAACT CACCATCTTT TGCTTCCTCT TCATATGCAA TAAGAATATA GTCATAATTA 600 

TCAATAGTAT CACAAATTAA TTTTAAATTC GACTCGAATT GAATAGATGG AATCACTAAA 660 

CGATAGCTTT GTTCAGCAGC TTCTTTAATT ATTTTTTGCC AACGCTCTAT CTTTTTGGCA 720 

ACTTTTGCCT CGTTTAATTT AACAATTGAA CGTTCCATGC TCACAGCTAT AAATGATGAA 780 

GCACCCAATT CAGTAGCTTT TTGTAGCAAC CACTCATATT TGTCAGCTTT GATTAGTCCA 840 

CTGCAAATCG TAACATCAAC TGGCAATTCT GTATTAATAT TTTGTTTTTC TTTTAAATCA 900 

ACTTCAATTT TATCACTTGT TATGTCAGCA ATTTCACATA AATAAACTGT TTGATCATTA 960 

AAAGTTAAAA TAATTTTACT ACCAACATCA TATCTCATTA CATTTGTTAT ATGATGAATA 1020

TCTTCTTTTT TTGTAATAAA AAAACGCTGA CTTACATCAG CGTTTTGGnT CTATGAAATA 1080 

ACGTTGCACA TTATTCACTC ACTTTCTGGC CAACAAGACA AACCCAACCG TTGTCATGTT 1140 

GTTCTGAAAT AATTTTAAAA CCTACACGCT CCATATGTGA CTGTATACCT TCATACTTCT 1200 

CTTTTATAAT ACCAGAAGTA ATAAAATAAC CGCCTTCATT TAGAGTATTA TAAGCATCTT 1260 

CAATCATTTC ATCAATAATA TGCGCTAAAA TATTTGCTAT TACAATATCA AATTTTTCTG 1320 

TTTCGTCTTT CAATAAGTTA CCTGGAACAG CTTCAATTAA CGTTTCACAA TGATTTCTTC 1380 

TGAAGTTTTC TTTAGCTACA CTCACTGCCA TTTCATCAAT ATCCAACGCT TTAATACGTT 1440 

TTACACCGAT TAGATGACTT GCAATACTTA ATATACCTGA GCCAGTACCA ACATCAATTA 1500

CTGAATGCTG TGGCAATACA TATGTTTCTA TTGCCTTCAA ACACATACTT GTAGTCGGAT 1560 

GATCACCTGT TCCAAAAGCC ATACCTGGGT CGAGCTCAAT GCAAAGCTCT TCATCCGCTT 1620

GGAAATAGTT TTTCCATTCA TTTTCCCAAT CCGTCTCTGC AATAATTTGC TCACTGAATT 1740

GAACGTTATG TTGATCAAGT TCATCTAAAT TTAATAACTC ATCTTTAATT TGCTGTCGCA 1800

ACTTATCATC ATAAGTCATT TCATTAAAAT AGGCTTTCAA TCTTACTCCC TTATCTGGAT 1860

AATCCTCTTT TTTCAAAGCG TAAATTTCAC CGTATTTATC TTCTGGTTGG TTAATTAAAT 1920 

CATCTGAATC TTCTATCACG ACACCATTTG ATCCATGATT TTCAAGTATA TTGGTAGCCA 1980 

ATTCTACTGC TTCATGATTA ATAATAATTG AAAGCTCTGT CCAGTTCATA CTTTATTCTC 2040 

CCTTAAAGAA TCTTTTTGCT CTATCTTTAA AATTCGAAGG TTGTTCATTA ATTTCTTCAC 2100 

CATTTAATTG GGCAAATTCT TTCATTAGTT CTTTTTGTCT ATCTGTTAAT TTAGTAGGCG 2160

TTACTACTTT AATATCAACA TATAAATCTC CGTATCCATA GCCATGAACA TTTTTTATAC 2220 

CCTTTTCTTT TAAGCGGAAT TGCTTACCTG TTTGTGTACC AGCAGGGATT GTTAACATAA 2280 

CTTCATTATT TAATGTTGGT ATTTTTATTT CATCGCCTAA AGCTGCTTGT GGGAAGCTAA 2340

CATTTAATTT GTAATAAATA TCATCACCAT CACGTTTAAA TGTTTCAGAT GGTTTAACTC 2400 

TAAATACTAC GTATAAATCA CCAGCAGGTC CTCCATTCAC GCCTGGAGAG CCTTCACCAG 2460 

CTAATCTAAT TTGTTGTTCA TTGTCGACAC CTTCAGGTAC TTTCACTTCT AATTTAACTG 2520 

TTTTATTTTC AGTACCTTTT CCGTGACATG TTGGACAAGC TTCTTCAAAT TCTTGACCAC 2580 

TTCCATTACA TTTAGGACAA ACTTGTTCAG TACGAACTCT ACCTAAAATT GTGTTTTGTT 2640 

CTACAGCTAC ATGACCAGCG CCATTACAGT AACTACAAGT CTTTTTACTT GTTCCAGGCT 2700

TTGCACCATC ACCATGACAT GTTTCGCATG TTACATCTTT ACGGATTGAA ATTTCTTTTG 2760

TTGTACCAAA TACCGCTTCT TCAAATGTTA ATGTCATTGT ATACTGAAGA TCATCACCTT 2820

TTTGCGGTGC ATTTGGATCT CTTTGTCTGC CGCCACCGAA GAAAGAGCTA AAGATATCTT 2880 

CAAA5CCGCC GCCACCGAAG CCACTAAAAC CGCCAAAGTC AGAGCCATTG AATCCTTGTC 2940 

CACCAAAACC TTGTGGACCA TCATGTCCAA ATTGATCATA GcTTGCGCGT TTATTATCAT 3000 

CACTTAAAAC TTCATAGGCT TCAGAAATTT CTTTAAACTT TTCATCTGCA CCTTCTTCTT 3060 

TGTTAATATC TGGATGATAT TTTTTCGAAA GCTTTCGATA CGCTTTTTTG ATTTCATCTT 3120 

TTGAAGCATC CTTACTAATG CCTAAAACTT CATAATAATC TCTTTTGGCC ACAGCTATCT 3180

CTCCTTTTCT TAATTAACTC ATATAGTTTA ACGTAATATG TCATACTATC CAAATAAAAA 3240

GCCAAAGCCA ATGTTCTATT GACTTTGACT TTTCAGATCA TGACAACATT CTAATTGTAT 3300

TGTTTAATTA TTTTTTGTCG TCGTCTTTTA CTTCTTTAAA TTCAGCATCT TCTACAGTAC 3360

TATCATTGTT TTGACCAGCA TTAGCACCTT GTGCTTGTTG TTGCTGTTGA GCCGCTTGCT 3420

TATCTTCTAT ATCTTGACCT TCTAAAGCAG TTTTAAGAGC GTCTTTTTTC TCTTCAGCAG 3540

ATTTTTTATC TTCTTCACCG ATATTTTCGC CTAAATCAGT TAAAGTTTTT TCAACTTGGA 3600

ATACTAGACT GTCAGCTTCG TTTCTTAAGT CTACTTCTTC ACGACGTTTT TTATCTGCTT 3660

CAGCGTTAAC TTCAGCATCT TTTACCATAC GGTCGATTTC TTCGTCTGAT AATGAAGAAC 3720

TTGATTGAAT TGTAATTCTT TGTTCTTTAT TTGTACCTAA GTCTTTTGCA GTTACATTTA 3780

CAATACCGTT TTTATCGATA TCAAACGTTA CTTCAATTTG AGGTTTACCA CGTTCAGCTG 3840

GTGGAATATC AGTCAATTGG AATCTACCAA GTGTTTTATT ATCCGCAGCC ATTGGACGTT 3900

CACCTTGTAA TACGTGTACA TCTACTGATG GTTGATTATC TACTGCTGTT GAATAGATTT 3960

GAGATTTAGA TGTAGGAATC GTAGTGTTAC GTTCAATTAA CGTATTCATA CGTCCACCTA 4020

AAATTTCAAT ACCTAAAGAT AGTGGTGTTA CGTCTAATAA TACTACGTCT TTAACGTCAC 4080

CTGTGATAAC GCCACCTTGG ATTGCAGCTC CCATTGCCAC TACTTCGTCC GGGTTTACTC 4140

CTTTGTTAGG CTCTTTACCG ATTTCTTTTT TGACAGCTTC TTGTACTGCT GGAATACGAG 4200

TTGATCCACC AACTAAGATA ACTTCATCGA TATCTGAGTT TGTTAAGCCA GCGTCTTTCA 4260

TTGCTTGGCG TGTAGGTTCC ATTGTTCTTC TAATTAATGA ATCTGATAAT TCTTCAAATT 4320

TAGAACGAGT TAAGTTTACT TCTAAGTGTA ATGGACCGTT TTCACCAGCT GAGATAAATG 4380

GTAATGAGAT TTGAGTTTGT GATACACCTG ATAAGTCTTT TTTAGCTTTT TCAGCAGCAT 4440

CTTTCAAACG TTGTAATGCC ATTTTATCTT GAGATAAGTC TACGCCATTT TCTTTTTTGA 4500

ATTCTGCAAC TAGGTAGTCA ATAATTACTT GGTCAAAATC ATCACCGCCA AGTTTGTTGT 4560

CACCGGCTGT TGATAGTACT TCGAATACAC CGTCACCTAA TTCTAGGATA GATACGTCAA 4620

ATGTACCGCC ACCTAAGTCA AAAACAAGAA CTTTTTCATC TTTATCAGTT TTGTCTAAAC 4680

CATATGCTAA TGCTGCAGCT GTTGGTTCAT TAATGATACG CTCAACTTCT AAACCAGCAA 4740

TTTTACCAGC ATCTTTAGTT GCTTGACGTT CAGCATCGTT AAAGTATGCA GGTACTGTAA 4800

TTACAGCTTT GTCAACTTTC TCACCTAAaA TAgTTTCAGC TGTATTTTTT AAGTTTTGTA 4860

AAATCATAGC TGAGATTTCT TGTGGTGTGT ATGATTTACC TTCAATATCT ACTTTATAAT 4920

CAGTACCCAT ATGACGTTTA ATAGATTGAA CAGTGTTTGG GTTTGTAATA GCTTGACGTT 4980

TTGCTACTTC aCCAACTTGA GTTTCTCCAT TTTTGAAAGC TACAACAGAT GGTGTTGTAC 5040

GTGAACcTTC AGGGTTTTGA ATTACTTTTG GCTCATCGCC TTCTAATAcT GTnACACATG 5100

AATTTGTTGT ACCTAAGTCT ATACCAATAA TTTTACTCAT AATAAAATTC CTCCATTTAA 5160

TCATTAAATT AATTTAATTT TAAACAATGT CTTTTCGCCA AATTTAAGTT ATTGGTTTAC 5220

AGTGATTTCG CCAGATTCAA AATCAGGGTT ATCATCTTGA ACTACAGCTT GGTGAATATT 5340

TGGATCAAAT GCTTCACCTT CAGTTTTAAT AACTTCAAGA CCATTATCTT TTAGTGCGTT 5400 

AATCAAACTT TCATGCACCA TTTGTACACC TTTTTGAAGA GATTTAAAAG TCTCATCATC 5460

ACCTTCAATT TGAAGTGCAC GTTCTATATT GTCTATTGCT GGTAAAATAT CTGTTAACAC 5520 

ACGTTGTGCT TGATATGTTT TGTTTATTTC ATTTTCTTTT TGAATTCTAC GCTTATAATT 5580

TTCAAACTCA GCGTAGAGCC TTAAATATTT CTCTTCGTTT TCATCTGCTA ATTGTTGAAG 5640

TTCATTAATT TTTTGATCTT TTGGATCTAT TTCTTCAATA ACATTCTCGT CAGACGTTTC 5700 

TTCTATTGCT TCATCTTGTA AATGACCTTT ACTTTCTTCA GCTTGTTCAA CTGAATCATC 5760

AATATTTTGT TTGACGTTTG TTTCTTCAAC TGTTGATTCA GTGTTTTTTT CAACTGATTC 5820 

GTCTTTATTT GTCATTTTCT GTCCTCCAAT ACTTTCTAAT CCATCATTAC CAAATTCTAT 5880 

TTAATAATTG AATGACATTT TGATAATGCA TAGCTGTAGG TCCAATCACA GCGATTTGAC 5940 

CTTTTAACGT TTCATCAAAA TGATATTGAC TTGTTACAAT TGAAATATCA CTTAAGCTGT 6000 

CATCAATTTC ATTACCAATT TTTACATTAA TATTTGGTGA AGATATATCT TGTAATAATT 6060 

CTGCAATTCT ATTTGATTCT ATATATTGTA GAATGGGCTG AATTGAAGAT ACATTACTTT 6120 

CATTCAATGC ATCAATAAGT TTAACCTTTC CACCCATATA AATGCTATTA CTTTGATTAG 6180 

AAATATGATT ATTCATCGTA TTTAACAATT TATTGATAAA AATTTCTTCC TGCTCTGATT 6240

GAaCAAAAGA GACAATATCA TCTTGTAAAT TCTGATTAAA CTCAGTTAGT TTGTTTGTAA 6300 

CAAAATTTGA TATTGTATTT AGTTTGTCAT TATTAA 6336 

(2) INFORMATION FOR SEQ ID NO: 220:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 13059 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 220:

TTCATGATTA TTATCTGTTG TAGACACTGC TGGATCTTCC GATGTATCTT TCGATGCATC 60 

TTTCGATTTG TGTATTTGCT GATTCAAATG GTCTAGGTCT TCTAACGCCT TATTTACCAT 120 

TGCTTCATCA TTTTTATCAT CTTTTTCTCC ATGTTTTGTT GTAGCCGTTT GTGACATATC 180

ATTTTTCATT GCATTAAGAT CGTCCTCGCC ACTTTGTTGA CCCCTATCAA CATTTGAAGA 240 

AACCTCATTT AAATCTTTAA GCAATTGATC TAATTTACTG TCTATATCAC TTTGACCGTT 300 
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5 



10 



15 



20 



25 



40 



45 



50 



TTCATCTATT TGCGATGCTG TTTTCGCTTC ATTTAGTTGT GCTTTATAAT GTTCTTTAGA 420

TGAAGCCGAT AACTGTTTTA ATTGCTCAAT TTGACGAATT GCCTTGTCAA CTTTGTCTAA 480

TAAATCTTGC TTAGATAATA TCTCTTTTGT AATTTGAGTA TCCTTTTCAG ATGCAGCTTG 540

GGCATCGTAC GGCAAGATAT TCGTTAAAAT GATACTTGTC GCCATCATTG TCGAACACGA 600

TAACTTTACA TATAATTGAA ACGGTTTCCC TCGATATTTA GCCATCAACA TACTCCTTCC 660

TCACTTACTT CCTTCAAAGA ATTACATACT ATTATATACC TGTTTACAAG AAATTTACAC 720

TTATCTATCT AGTTATTGTT GTTAGTAATT ATCTACTTAT TACTTAGCTT ATATTTAAGT 780

AAACAAAACA AGCATGACGT AATATCATAT TGTCCATGTC GCTAACATCA TATTACGTCA 840

AATCTTTTAT ATTAAATGAT GTTTTATTTT AGACTGCTTT TTCCTTTTAG CTTTCGAGCG 900

CCTGTTTAAA AACTTGCTCG AATTGTTCAC GCGAGATTTC GTGTGCATGT GCTTTTTGTG 960

CTAATAAAGC ATCTCGAAAC TGTTGTTGAT CTTTCAAACT TTCTAACATT TGTATTAATT 1020

GGTCTTTACT TTCGATTGTT ATCTCATCAT TATGCTCAAA TAAGTGCTCT GATAATGTTA 1080

CTTTAGCATG GTGTGCGGTT TGACGATAAC CTAAAATCAA CAACTCATAG TCAAACGCTT 1140

GTTCCACCGC ATTTAAAATT TCATTACCCT CATTGATATC AAGATAAATA TCACATAACT 1200

GGTATAGTTC ATTTACCCTG TCAATATTAA TAGATGGGTA TAAATGCACA TTAGCATATT 1260

GATCAAGTTG CATTAGCTTA TCAGACATCT CTGTAATAGC AGCGATGTGn AACTTAAAAT 1320

CTGGTAAAGt TyCAACCAAT ACCTTGATGT TACGAatTGa TCCgAGTTAG TTAATATTAC 1380

AATTTCTTTA GTATATCTAT TACGACTACG ATAGTTATAT AGATATCCGC CTTGTAAAAT 1440

ACGAGATTGA ACCTTTGCGT CTGCTATATT GAGCATCGTT TCATATTCGT TTTTATCTGG 1500

AATAATAATA TTACAATGTC GTTTCATATC ACCTTTACAC ATCAATTGCA TATTTCCCGG 1560

GACATTACCA TTACAGTGTT CTTGCCATAC CAAAACATCA CTACCTTTTG ATGGCAAATT 1620

ATATAACACT GAAAATGGTA GGGCTAGTGA GTTAATAACG AAATGATGTT CCGTAATTTC 1680

AAGTTGCTTG ATAAAAAATA ATGCGAATGC GAGCTTTGAA GGGAAAAAGT AAGACTTCCC 1740

TTGCCAATCC AATATGACAT CAGATGTTAC AAAATTTTCA TAAATCACTT CTTTACCTTC 1800

TGCTGTCATA TATTTCTTCA AGATCGCTTT ACGATTTAAA TCGTAAACAG TTTGTGCAAA 1860

TTTAATACCA TTCTTAGAAT AATAATCGAC AAATCGGACA CGTTGTTGGT CATCAAACCA 1920

TTCGACACGA CTAACAATTC TAGGGCGCTC TCCACTTTGA yAAAATATTT TACCTCGyAG 1980

ACGTCCCATA TCATTaATTG TAGCCGAATT GTTGTTACCT TTAATTTCCC AAAAAGCTGG 2040

TACAGTAACC TGATTAAAAA ATCGTGGTTT CATATTTTCT GTATTATGAT TATCTGCAAA 2100
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TAAATCTTCT TCCAACTTAC TGGCTTTAAA AG ACT CAT AT AACTTTCGTG AATGATCGTT 2220 

AAAGTAATCA AATAATTTAA TCATGTAGCA CCTCTTGaAC TAATGTTTCC CATTTTAAAA 2280

TAATATCTTG AGTCATAAAT TGCTGTGCCA CTTCATAAGA GATGTCATGT GGTGTCTGGG 2340 

GACCATTGTT AAAATACATT ACAATGGcAT GAGCTAGTTT TGCGATAACA TCATCCACAC 2400

TATCTTCGTC GGTATCAAAA GGTACCAAGT AGCCATTTTC CCCATCTCGA ATAAAGGTTG 2460

GGTTACCATA ATTCACATTT AATCCAATCA TACCTAGTCC TGAGCCTACC GCTTCCATTA 2520 

GTGTTAACCC AAAACCTTCG CTAGTTGATG CAGAAAGAAA TAACTCATAA TCATTATAAA 2580

TTTCATCAAG TTTAACATGC CCTAGTAAAC GAATATAATC TTGTGCGCGG TGTGTATCAA 2640

TAATTTTACG CAGTCGCGTC TTCTCGCTAC CTTCTCCATA AATATCAAAT GTTAATTCTG 2700 

GCACTTGTCG TTTAGCCACG ATAACCGCTT TGACAAGCCA ATCAATATGT TTCTCATTCG 2760 

CTAAACGAGA TGCACTAATC ATCGCATATG GCTTTCTTGA TAATGTTGGA TATGATAATG 2820

CATCAATGCT TCCCACAGGA ATAGTATAAA CACGTGGGCG ATAACCTTGA TATTGCTCAA 2880

ATTGTCGACA AACCATATGA TTTTGAATAT CTGTTGCTGT AATAAAGAAA TCAATGTATT 2940

TAGCTTTTGA AAATTGATAT TCATAATAAT TGTTCCATAG TATATGCTGC TCACTCATCA 3000 

TATTATTACT ATAATGATCA GCATGAATCA CAACACCAAC TTTACTATCA CCTTTATGCT 3060 

GCAAAACAGC CTGACCAATA TCAGAAGCGC GGTCTAATAT GACAATATCG TCTCGGGTTA 3120

AATTCAATCG TTGTAAAAAG TATGCAATAA ATTCCGTTTT GTTATACAAC ACCGCATCTT 3180

CAAACACATA TATAGAGCTG TCTCCATCAA TATATTCGTT ATAAGCGATG GAAGCATCTT 3240 

GATTATAAAA TTGTCGCATA TATAATTTCG CTTTATTATC AGCTGGTGCA TAATACTCAG 3300

AAAATATGCG CGTATAACTA TAAAAATCTT TACGTACTAA CATACTATTA ATTACAAATT 3360 

CTGCftCGATC CACAATATCT TTTTGTTCAT TTTGCAGATA ACATGTTACA AATGATGATT 3420 

TCCCATTAAA ATATAGGCGG ACTATCTTAC CATTTCTTTC TCTAAAACTA ATGTCATGAC 3480

CAAGCTCACG TTCAATGTCA TCTAACGTGT ACGTTGTTGG TGCTAAAGAA ATATCACTAA 3540

AATACTGATA CAACCAAATA ACTTCTTGAT CTTTAAACCC AATGTTTTGC GTTAATGTCT 3600 

GTATGTTCTC TGACTGTATA AAATCTAAAA ACACAAATTT AGTGTCTTGA TTTGTACGTC 3660 

TCAATAATTT AGCACGGTAA GCTTGTGCAT ATTCAACACC GCTACTCGCC CAGCCTATAC 3720 

CAAAGTTTAT ATTATATATT GTCATGCGCT ACCCCTTTTC ATTTATGGAA AATGTATAAC 3780

TGGCATACCC TCTTTATCAA ATGTAATCAT GCTTTGACAA ATATTTTTCA CCATTCTTTT 3840

TTTGATATTT CGTGTCATAA CTTCAAATGA ATCTAAGGCA ACTCTATGGT ATTCAAAAAT 3900 
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GACTTGTTCT AACCAACATG AATCAATTGC TTTCAAAAAG ACTTTTTGAA CGAAAATATT 4020 

ATAATAATAT GCACTTTGCA TGTTTTTACG ATTCAAAGCT AATTGCTTTT CAAATTGCTC 4080 

TAATAAAAAT GTCACTACTG CTTGCTTATC TTTAAAATTA ACACAAGCCA CATCTTTATT 4140 

AAATTGGAAA CTTAAATTTT GATAAATATA CTCGACAACA CGCGATTTTG TTAGCACCTT 4200 

TTCCTCATTT ACAAACATTT CAAATACATC TTTAGCTAAC GCTTTAAAAT CTTGATTCTC 4260

AGCATCATCT ATTTCTAAAA CTCGATTGCG TTCCTCGTAT ACAAGATCTC GCTGTATACT 4320 

AATGCTTTTT TCAAATTCAT TAGCCATTTC ACGAGCTTTA ACCCCTTGTT CTTCCGAGAt 4380 

aCGcTGCGCT TTAACTACAA TTTGCTTAAC TTTGCGATTA AACAAATTAC TTTGCGATAA 4440

TCGTTGTGCA TCTAATGAAT ATAATTGATT ATTTTCCGCT AAATTACTAT CGCTCCATCG 4500 

CTTAACTAAA TAATCATCTA GTGAAATATA TATACAAGAT GATCCCGGAT CCCCTTGTCT 4560 

ACCAGAACGA CCACGTAATT GCCTGTCTAC ACGGCTATTT TCCATATGTT CATGAATAAT 4620 

AACAGCTAAT CCACCTAATG CTTCGACACC TTCACCAAGT TTAATATCTG TGCCTCGACC 4680 

TGCCATACTA GTCGCAACAG TCATGGAACC AATTTGCCCT GCTTCAGCTA TCATCTGCGC 4740

TTCTTTTGCA ACATTTTGCG CAATGAGTAA ATTATTAGGA ATATCCATTT GGAATAATAC 4800

TTTCGAAAAG TATTCAGCCG CTTCAGCAGT TCTCGTTATG AGTAAAACCG GTCGCCCCGT 4860

TTCATGAAGT TCAACTATAT CATGAATCAT CGCGATGTTT TTCTCATCAA CTGAACGAAA 4920

CACTTTATCT GGTTCATCGA TACGTTGAAT CGCTTTATCA GTTGGTACTT GTACGACTAT 4980 

TTTTGAATAC AAATCAAAGA ACTCTGATTC GCCTAATTTT CCTGTAGCTG TCATACCTGA 5040

AAATGATTCA AAAAGTTTAA ATAAATTCTG GAAGGTAATT GTTGCCATAA CACTTTTATC 5100 

TGTTGAAACC TCCATACCTT CTTTCGCTTC AATAGCTTGG TGAAGTCCAG CTTGCAACTT 5160 

AGTTCCCGGT AACATACGAC CTGTAATACG GTCAATTAAA ACAATATCAC CATTATATAC 5220 
AAAGTAATCG ACATTAGATT CAAACAAATA TTGTGCGCGC AGTGCTAAAT TAATATTACG 5280

CACTAGGACC ATCGCTTGTT CGCTATATAA ATCTTCAACA TTAAAGTATG ATTGTGCCGC 5340

TTCAATACCT TGATTTAACA GCCATATTTC TTTTTTGGTC TTCTTCATTT TAAAATGCAC 5400

GTCTTCAATC AATGTATCTA CAAACTCTTT CACAATATGA AATAGATTTG ATTGTAATCT 5460 

TGGTGCACCC GAAATAACTA ATGGTGTTTG AGCAGCATCT AAAATGATTG AATCCACTTC 5520 

ATCAATAATA CCGTAATTTA ATTGTGGTAA AAATTTCCCT TCCGCACTAT CAGCCAAATT 5580

ATCAATTAAA TAATCAAAAC CGAGACGTCC ATTAGTTGTA TATATAATAT CATGTTCATA 5640

TATATTACGT TTTTCCCCTT TTTGATACTC ATAATCCACA ATATCAACAA AACCTAATGA 5700 
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TAATCATTCG TTGTAATTAA ATATGTTCCT 
TTTCCCGAAA GAGCATTTAA ATATAAAGGC 5820

ATCGTTGCCG TTAATGTTTT ACCTTCGCCT GTTTGCATCT CCGCAATGTT ACCTTCATGC 5880

AATACAATCG CTCCGATTAA CTGAACTTCT TTAGGATACA TACCTAATAC TCTCCAGCTC 5940

GCTTCACGTG CCACTGCATA AGCTTCAGGT AACAATGTAT CTAGTGTATC AACTCCTGAT 6000

GCTAAACGTT CTTTAAATTC TATTGTCTTT TGTTTTAACG CATCATCAGA ATATGATTTA 6060

ACTTCATCGC TCCATGTATT GaTGsGTTcA CTATTTTTCT AATCGACTTT AGTCTTAATT 6120

CGTTTATCGT AACATCTAGT TTATGTTTCA TTTACTTCCC CACCATTCAG TTTCGATACA 6180

TCTAAGTAAT CTAAAAATCG TACTGGATTC ATTAAACGTG ACATATAATT TAGATGTTTG 6240

TCTTGCTCTT CTTTAAAATA AACCTCGACA TTTGTATCTT TTAGTTCATG ATTTCCTGGG 6300

ACATGTTCTG TAAGCCATCC TTTTAAATCA TCATCTTCAT GGCTTGTACG ATACACTTTG 6360

CAACCCAAAT GCTGAGCGAC ATAAGTTGCA AAAACATTTG ACTTTGACCC ATAACTAATC 6420

AAATTAATAG CCTTTAGGGT ATCTTGACTT TGCAAATCAT TCTTTAGTTG CTTAATATTT 6480

CCCTCGATAT TGTCGTCCAT CCAACGTTCA ACGAGCCAAA CATGACCAAA CAGTTTCAAA 6540

AAATCATTCG AAATAGTTGG ATAGGTGTCA GATGGTTCTG CAATAATGAC ATTGATCATA 6600

TCATTTCCAT ATTGGTCATC GCCTATCTTC GTCACCCGCA TGCTTTTATA CTCTAAATCA 6660

TATTGATGCG TCATCTCTGT GATTGTTAAA CATCTAAATA tAAGACTCGT CGATGCTGCA 6720

TTCATCATTT TTATTTTATA AGCATAGGCk TCATCAGGAT ATTGAATCGT AATACTATTT 6780

GACTTTACAA TCTCAGTACT TAGTTTTGTG CCATTTTTAT TATAAAAAAT GATGATAAAA 6840

TACACTGAAC CAGCAGGCGT TGCATCAAAA TCAAAATGCA ATTTATAATG CTGTCCTCTA 6900

CGCAAAATTG GkAAACTTGG CGcACTTTTA TATTTTGAAA ATTGCTTTAA CATCAACCAC 6960

TCATGAATCG GTAATCCAGA GGGCATCAAA GGATTTATAA AAGTCACTTC ACCATTTGAA 7020

AATGATACTT TAGAGCCATA CATAAATGTA GTTTGTGAAA TATAATTCCA AGTAACTTTA 7080

AATGTTTTGT TTTTCAGCAT GTTGAACTCT CCCAAACTTG TCTTCCAAAA TAATGTTGTA 7140

AAAATTAACA AACCAACTTG CAATGGTAGG TGAATCATCA TTATGTCGCC CAGGAATACT 7200

GCGATTCATC ACTCTTGCTT GGTGTGCTGT CAATACAGGT AATAGCTCTT GAAATGCATG 7260

TGGATCATAA TCATCATGTT GCATATATGC TATGGCAAAA ACAGTTTGTG ACAATGATTy 7320

CTTTTGAAAT GTTTGCCAAA ATTTTTGATT TAATGCCTGT ATCGACGCTT GAGATGTATC 7380

ACCTTCATTA GACACCAGGA CGTCTAATGC TGTACCGAAC TCTTCTGGTC TAAGTAATCG 7440

CATATGTTCA GCAATCGTTC CAATATTAAC AAGTGGTTTA CCAACAATAA TTGCCTGAGG 7500
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TAATTCATGT GATTTAAAAT TCAGCTTTTC TAATGTCTCG TCAATAACAT TGATAATACC 7620 

TTGTTCATAT TCAGATGAAC CGATATAAAA ACTACCACCT TCAACACGAG GATCGCCGAT 7680 

AAGTAAAAAC GGTGCATTCA TACGTTTCAT CATATAATAT CCTTCGAAAC CTTCCGCTGT 7740 

TCGATAACCA CTAAAATATA CGTTTAGTGG CGGTTTCATA TCACCAGGGT GGAAATAATA 7800 

AATAAATTCC TGTCGTTGAC TATCTACGAA ACGACTACCA CCAAGTAAAA ATTGACCCAT 7860 

GTCTAATCTA GACCATCGTT TGTGTATAGG TCCTAAATGT ACCGTCCCGT TCCCACGCGC 7920 

CTTAACAGTT ACACTTATAT AAGCATCAAA TGGTTTCGCA GGTATCTCTA AAGGACTGTC 7980 

TAACATATCA TCAGTCAATA CGATTTGTTC AATTAATGCA CCATCAGCGC CAGTCTGAAT 8040

CAATCTAAAT GTATATTGCA ACTCGACCGC ACCATCAATA TCAAATTCTG GCCATATTTG 8100 

AATGACTTTA TCTTTATCGT AAACGAGATT ATTTTGCCAA GATGCGATAG GTTTAAATTC 8160 

TTTCCCAAAT TCTCCACTCA ATGTGAGCTC TGAATTACCT TGGTAAACGA CATCTCCTTT 8220

AAAATTCGGA TGCACAAGTG CTAACTTAGG AGAAACCTTA TCTCCATACT GTCCTGAGAA 8280 

GCTAACTGCC TCTAATTTAT TATTACGTTC TTCAATATTC CGGTAATGTA ATGGTTGAAC 8340

AACGTATTTT TGGACATTTT CGTCTTGTTC ATATTCAACT GACCAAAATG ATTCATCAAC 8400

ATACGTATTG TATGGTTCGC TTATCATTTG TAATAAATTC GTTAATGTCT CCGAGTATGG 8460 

TGCTTGAATA TAGATAAAAT CAAAGCGCCC TTCTGCTTCA ACAATCGCTT CAATAGCCTC 8520 

TACATAACCA CTATCAAATT CAAACAATCC AATATCGAAG TAATCCCAAC TCACACCTTT 8580 

TTTGTGTTGA AAAATAGGTT CTAAATCGTC TCCTCCAATT TGCAAAACTC TAAATTTACG 8640

TGGCATCATT TTCACCTTCT ATTAACTCAT CGAGCTGATT AATAATATTC TTAGAAGCAT 8700

ATGCATCTAT TAATTTTAAA GAATAGGCGT ACGCATAATT CCAATTTTTC AAATAAAATA 8760 

AATAATAATT TAACGCATCA TCTAATTCAT CAACTGTATT TATAATACGG CCATTGTCAT 8820 

AATCAGAGAC GTAATCTGTT TGTTGACCAT TAATTTGTGG AATCCCAGCG CTAATTGCAC 8880

TAATTTGTAA ATACAAGTCA GGTTCTTTTG ACATATCTAT CACAAGTCGC AACGTCCGCA 8940

ATGCTTCTAC AACATCATGT TCAGCATGTA TCGTCTTAAC AGCAATGATG TCATCTTGAT 9000 

CTTCAGGTGT CATTAATGCT GAAACATTAA CATCCGCATT CTGTTTAGCT TGGTATTCCT 9060 

CATTTACCGA CGTAATACAT TCACGAAGCC ACATCGGTAT GTCATTTTGA TGGCGCGATA 9120 

ATAAAATTAA ACGGTAATAA TCTTCCTGTG CGATATAATC CACAAGTCGT TGCATCATTT 9180 

GTTGCAAATC AGCGTCACTC ATACCATCTA TCCATACACC TATAAATGTT TCCATCAATT 9240

GACTACTTAT ATTAGGTGAT TGTCTCGTTT CAAATGGTGT GATTCGAATC ATTGTATTCT 9300 
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TTAAATGGGC ATTCTTTACG ATAGATTGAT ATTCCTCATC TGACACAGTT TCATTTCTAT 9420 

TTTTAAAAAA TGAATAACTT AATGATTTCG CTGGAATATG ATTGGCTATT TGTCGATTGT 9480 

GCCTAGCATC TGAAGCCACA ATCACATGAT CATCTTCATG TATTTGTTGT GCAATCATTG 9540

CTTGAAATTT TTCTTCAATT AGTTGAGCCA TATTGTTATA TTCTGTTTGT TGATAGTGAT 9600

GTTGATATCT TTTTGAAACA GTGACTCTGC CATTTTTCAA ATCTTCATGA AGTACACAAT 9660 

CTCCATTAAT CGTTAAATAT TCTTGGTAAG AAGCCTCTCC CTGATCATCA AAATAACGTA 9720 

TCGCTGATAA ATAACCTCTG TCATCAAAAA TATAACGCCG TTGTAACTGA TCTCTTTCAA 9780 

ATTCTTCAAA CCAAATTGAA TACCCTTCTT GACTAAAATA AATATTTGTA TAGGTCTGTT 9840

CACTCGTCAC ACATTTTAAT AAATACGGTG TGTACACAAA CTCAACATCA TCCGGCCATT 9900 

TTAAGTGATG ATAATTAATC GCTTGTGGCG CATGGTGACT GAATCCTTGA ATTTCATCAA 9960 

ACACAGACGA ATACTTTGTC TCATATAAGT CATATCGATG TAAAAATGTT CTTAAATTTG 10020 

GTGCATGATT GAGAACAATC AGTTGATAAT CTAAGTCATT TTCAAGGTGC ATTCCCATTA 10080 

AACTAATCAT ATCGTCAAAT TCCGTCTTAT TTTGTAGTTG ATAATACGGC ACAGTCGTGT 10140 

CTTGCCACCA TCGTTGGTCA TCGTACCAAG CTGGAATAAA GTATTTCATA ATTACCTCCT 10200 

TACCAATACT GGTTTAAAAA TGGCTTATAT TTATCAAAAT ATAAATATGT ACGAATTGTT 10260 

TCTGCAATAT TAATACTGAT GTAAACTAAT ACAATCAGTT GTACTGAGaA ATAAATTTCA 10320 

GTAGATAAAT GCGGTACAAA CAATGTGAAA TAAAGCGGTA TACCAATAAT GACTGTAACT 10380 

AATGCCAATC CAAACCAACA TACGCGTCGT GCTTGATAAT TTAAATAACG TTCTGTATCC 10440 

TTACCAGGTT TaACTCCTGA AAAATAATTG CCACTCTTTA AGAAATCTTT GGATTTTTGT 10500 

TTAGTATTGA TTAAAAATCT CGATAAAAAA TAACCCAATA ACATTTGAAT CACTAAATAT 10560 

ACTGAAATAC CTACTGGACT ATCAAATGTC AGCATTGGCA TGTCATCTGA TATGCTTTTA 10620 

TTAAACATAG ATAAAATAAA ATGAATGCCA CTTTTTAAGA AAACAAAAGC TGAAATACTC 10680 

ATCATTAAAG TAATACTGCC TGCAGGGTTA ACTTTCCAAG ATAAATAAGA TTTCATATTT 10740 

GTTGCGGAAA CGTTCATTAA ATCGATATAT GGTATTCTCA CTTCTACTAA TTCAATAAAT 10800 

AATAAGATAA ACAATGTGAT TATCACAAGG ATGATTAACA ACGCAATCAC AATATGACTT 10860 

GCATCTATAT ATTCCATTTT TTGATGCATC ATTGATTTAA TAATACTAAC CATTACAATC 10920 

GGCATTGGTC CTGCGATGCC GTAGCGACTA TTTTTGTCAG CTAACCAAAC TAATAACATC 10980 

GTTCCAGTAA CCAAAATCAA TATTGTTAAG TAAATATTGT CTTGATGAAC ACGTTCTTTC 11040 

GAAACATATT CATGAATCAC AAAATAACTT TGAATAACAC TTAAAATTAA TGTTAAGATG 11100 



EP0 786 519 A2 



GAAATCAGCA TCAAGATAAT CATTGATGTT AACCACGGAC CTAAtCCTAA AGTGAAAATG 11220 

TTTAAAGTAT TAACGTCTCC ACCCATATTA GAAATAGCTA TTTTAAAAAA TGACTCATGT 11280 

TTTACTTGCA TATCGTTaTA GGAAACGATG GAAATGTTTG TGCCTAATAT ATAAATAaAC 11340 

AAGATAAAAC ATGTGTATAG CATACGTTTA TATATAATTT TATATTCGTA TTGTTGTAAA 11400 

AGTTTTAACA TGTTGCACCT CTTTTATATC AAAAACATTA AAAAGACTAA GGGTTCATCA 11460 

CTAATTATTA AAATCCTATA TCGATTTTTC TAGTGATTGG TGCCTCAGTC TTTTTAATTT 11520 

TAGCCAGCTA TAAATTCAAT TTATGCTTGA GAATCATCTT GATCATTTTC ATCTTTCTTT 11580 

TTCTTTCTCT TCATTAAACC TAAACCAACT AATAATGTCA TAACGCCACC TAGTAATCCA 11640

TTTTGTTTTA TTGAGTCACC TGTATCTGGC AATCTTTTTT CACTTTGTGC TGGTGTGCCA 11700 

TTATGTTTAG TCACTTCAGA TGTTGCACTT AATGTAGACT GAGATTCACT CGTGCTCGTT 11760 

GTTGCTTCAC TTGATAAGCG AGATGTGCTC GTGCTGTGAG TATGATGCAT ACTCATTGAG 11820 

TCTGACGGAT GCATTGAGTT AGATTCAGAT GTACTTGTTG AGCCGGACAT ACTTGTTGAT 11880 

GTTGAGTCAG AAATGCTTTG TGAACCAGAC ATAGATGTAC TCAGTGATTC GGATGTGCTT 11940 

GTCGAATCGG ATGTGCTCAA TGACGTTGAT GTGCTTGTTG ACACTGATTC TGAGTCACTA 12000 

ATTGATGTTG AGTCGGATTT GTCTTGTGAC ATTGAAACAC TCGATGAATT AGATTCACTC 12060 

ATTGATGTTG AGTCAGATAC GCTCGTTGAA CCTGAACCAG ACGTACTTAA TGATTCAGAT 12120 
ATGCTTGTTG AAGTTGAACC ACTTGTTGAG TCCGATGTAC TTGTCGATGT CGAGTCTGAA 12180

TCTGATGTAC TCAATGATTC TGAGTCACTG ATAGAAGTTG AATCACTTGT AGATTCTGAT 12240

TCTACTGTAC TTTGTGAACC ACTGATACTT ATTGAAGTAG AATCACTGAT ACTGTCTGAT 12300 

GTTGATAATG ATGTCGACAC CGATGTGCTT TGTGATGACG ATGTACTAGC ACTCATTGAC 12360 

ATTGATGTTG ATATCGATGT ACTTAAGGAA CCAGATGCAC TTGTACTTGT TGACTGGCTT 12420 

TGTGACATTG AATCACTTAA TGATGTAGAT GTGCTTGTTG AGCTCGAGTC ACTTACACTT 12480

GTTGAACCTG ATATTGAGTC ACTTAAACTT GTCGATGTTG AAACTGAtwC GcTTCCGCTC 12540

ATTGAGTCAG ATGTTGAAAG TGATGTACTC GTTGAATTTG ATCCACTGAT GCTAGACGAA 12600 

TCACTTGTAG ACATTGAGTC GCTTTCTGAT GCACTGATGC TCATAGAGTC AAATTGACTA 12660 

TTACTTGTTG AGCTTGACTG CGAATCGCTC ACACTTGTTG ACGTTGATTC TGATCCACTC 12720 

ATACTTTGCG AGCTACTCAA TGATTTTGAA TCACTTAATG AATCCGAAGT GCTAAGACTT 12780

GTGGAACCAC TTAAAGATAT TGATCCACTT AATGAGTCGG AGTCACTTGT ACTAGTAGAA 12840 

TCACTCATTG ATATTGAATC ACTTAGcGAG GTAGACTyGC tTACGCTTTC TGAACCACTT 12900 
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TTTGAATCAC TTAATGAATC .AGATTCACTC ACGCTTTCTG AACTTC T T A G TGACGTCGAT 13020 
ACACTTAATG ATGACGAATC GCTTGTGCTT ACTGAATCG 13059 
(2) INFORMATION FOR SEQ ID NO: 221: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10758 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 221: 

AGGGATGGCC TTACCTAAAA AACCGGGnAA ACCCTCCAAA ACCCATTAAA AGGnTGGnTA 60 

CCCTTTAAAA TGGTAGCATT TAACCGCCAC CCGCCAAGGT GGGTGGTTTA TTCTTCCGTT 120 

ATTTAAATTA GTACACCATG CAGATTCTGT AGTTGAGGGA TATTTTAACG AAAGCTTATT 180

AGCAACTGAT AAAAAAATAC GTCCTAAGGC ATATATTGCT TCATGGAAGG ACATCGAGCC 240

GGCTAAGAAA ATAGAATTTA AAATTAAAAA AGGTATTAAA TGGCATGATG GTAATGAATT 300 

GAAAATTGAT GATTGGATTT ATTCAATTGA AGTCTTAGCT AACAAGGACT ACGAAGGTGC 360 

TTATTATCCA AGTGTAGAAA ATATCCAAGG TGCGAAAGAT TATCATGAAG GAAAAACTGA 420 

TCATATTAGC GGATTGAAGA AAATAGATGA CTACACTATG CAGGTTACAT TTGATAAAAA 480

ACAAGAAAAT TACTTAACAG GATTTATTAC TGGACCTTTA TTAAGTAAAA AATATTTATC 540 

AGATGTACCA ATTAAAGATT TAGCGAAATC AGATAAAATC CGAAAATATC CTATTGGTAT 600 

TGGACCGTAT AAAGTTAAGA AAATCGTTCC AGGTGAGGCT GTTCAACTCG TTAAATTTGA 660 

TGATTATTGG CAAGGTAAGC CTGCACTAGA CAAAATCAAT TTAAAAGTTA TTGATCAAGC 720 

GCAAATTATT AAGGCAATGG AAAAAGGCGA TATTGATGTT GCGAATGATG CTACCGGTGC 780 

AATGGCAAAA GATGCTAAGT CATCTAATGC TGGTCTCAAG GTATTATCTG CGCCAAGCTT 840

AGACTACGGT TTAATAGGtT CGTATCTCAT GATTACGATA AAAAAGCTAA TAAAACTGGT 900 

AAAGTGAGAC CAAAATATGA AGACAAAGAA TTACGTAAAG CAATGCTTTA TGCAATTGAT 960 

AGAGAAaAAT GGATCAAAGC GTTTTTCAAT GGTTACGCTA GTGAAATCaA TAGTTTTGTA 1020 

CCATCTATGC ATTGGATAGC AGCCAATCCT AAGGACCTAA ATGATTACAA ATATGATCCT 1080 

GAAAAAGCTA AAAAAATCTT AGATAAGTTA GGTTATAAAG ATAGAGATGG TGACGGATTT 1140 

AGAGAAGATC CTAAAGGTAA TAAATTTGAG ATTAACTTTA AACATAATTC AGGTTCTAAT 1200 

CCTACTTTTG AACCAAGAAC TGCTGCGATA AAAGATTTCT GGGAAAAAGT TGGCTTGAAA 1260 
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AATACGATTC CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC 1380 

GACAGACCTT TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA 1440 

GAAACAGAGC GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT 1500 

GTTTATCAAT TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT 1560 

CTTCGCGTCT TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT 1620 

ACTAAAGATT CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 1680 

ACGTACATTG AGTCATTATT AAAAGAACAC AAATTGTTAT AACGAAAACC ATTAATAGAT 1740 
TTTTATTTGG TGATTTCAAA TCATGAGACT GGGACAGAAA TGATGTTTTC ATAAAAATTA 1800

TTTCGTTGTT CCACTCTCAT GATTTTTTTG ATGAAACATA ATTACATGAT TGATTGCATC 1860 

ATTTTGTTAA ACAAGTGATT GCAAACCTGC CATTTCACAC TGAAAATTTA CATAATAAGT 1920 

GACGATATTT TACAAGTCAT ATACAAATAA CATATATTGT TAAATAATTT TACCTAATCT 1980

TAACATTAAA TTTACAATTA TAAGCGATAA TCTAAATATA AAGCTTATTT GAGGTGAAAT 2040 

AATGGAAATG TCGGTTACAG AAGTCATTTT CTCCTTTTTA GGTGGTTTAG GTATTTTCCT 2100 

TTACGGCTTA AAAATCATGG GAGACGGGCT TCAAGCATCA GCAGGAGACA GGCTACGAGA 2160 

TATTTTAAAC AAATTTACAT CAAATCCAGT ATTAGGTGTT ATTGCAGGTA TCGTTGTAAC 2220

TATTTTAATA CAAAGTAGTT CAGGTACGAC AGTTATCACA ATCGGACTGG TAACAGCTGG 2280

ATTTATGACA TTGAAACAAG CCATTGGAGT GATAATGGGT GCTAATATCG GAACAACGGT 2340

AACTGCATTT ATTATCGGTA TAGATTTAGG CGAATATGCA ATGCCAATTT TAGCATTAGG 2400

TGCATTCTTA ATCTTTTTCT TTAAACGCTC TAAAATCAAT AACATTGGCC GCATACTATT 2460

CGGTTTCGGT TCACTATTCT TCGGTCTAGA ATTTATGGGT GATGCCGTTA AACCTTTAGC 2520 

ATCASTAGAT GGATTTAAGC AATTAATGCT TGATATGTCT ACAAATCCAA TACTCGCTGT 2580 

CATTGTCGGC GCAGGGTTAA CAGCACTAGT TCAAAGTTCA AGTGCGACGA TTGGTATTTT 2640

ACAAGAATTT TATCAACAAG ATTTAATTAG CTTAAACGCA GCAATCCCTG TGTTACTAGG 2700 

CGATAACATT GGTACCACGA TTACAGCTAT CTTAGCTAGT TTAGCCGGCT CAATCGCTGC 2760 

AAAACGTGCG GCGCTTGTAC ACGTCATCTT TAACTTAATC GGGGTAATTA TCTTCACAAT 2820 

TTTCTTGCCA GTTGTGATTC ATTTGATTAG TTTGTTACAA GATTTATGGC ACTTAAAACC 2880 

AGCGATGACG ATTGCAGTAT CACATGGTAT CTTCAACATA ACAAATACTT TGATTCAATT 2940 

ACCATTTGTA GCAGGTTTAG CATGGATTGT TACAAAGCTT GTCCCAGGTA AAGATATTGC 3000 

TGATGACTAT AAACCTCAGC ACTTAAACAA AGATCTTGTT TATCACGCAC CTGGTGTTGC 3060 
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AGACATTCGC GAAATTACAA AAGACGATAA 
AAAATTGATC AAAAAGCTTG AACAAAAGCA 3180

TCAAGCTGTT GAAACAATCA ATGATAGCAT TCGAAATTAT TTAGTTAGAA TTTCTACAAA 3240

AGCCATTACG AAGGCAGACG TTGAGCGTTT AGCAGTTATG TTTGATGTCA ATCGCTCTAT 3300

TTTAAAAGTA GCAGAGCTAA CAGAAGAGTA TGTCGCTCAA TTAAAACGCC AACATGATGA 3360

AGATATTCGC ATTACAGAAG ATGCACAACG CGGTATGGAT AAATTATTCA ACCATGTTGC 3420

TGAGTCATTT GATAAAGCCA TCGACATGTT AGATGTTTAT GACAAAACGA AAAAAGATGA 3480

AATTGTAGAA CGTAGTAGAG AATCATTTAA TATTGAACAT AAACTACGCA AAGGTCATAT 3540

TAAACGCCTT AATCGTGGTG AATGTACAAC AAAAGGCGGA TTACTATATA TCGATATGAT 3600

TGGTGTTCTT GAACGTATCG GTTATCATTC ACGAAATGTT TCTGAAGCAC TTGTTGGCCT 3660

TAACGATGAT GTACCTACAG ATGAAGAAAT TGCAACAACT GAAATTTAAT TTTTACTGTC 3720

TTATTTATAT TCATATTTTT TTAAAATTAG AGATTCAGAT GCATGTAAAA AGCCAATCCA 3780

ACATTCATGG GTTGGCTTTT TTGTTTAGCA AAATTTATTA TCTTAAATCG GCTATAAACA 3840

CTGATATAAT AATGCTTCAT TAGTATGCGG TAAGCATGAC GGACACTGTT CTCGGAGTCT 3900

GACCCCGAAA CGTTTAATAT ACACTTTTAC ACGTCGCCTT CATTGAAGCG AATTGCCATA 3960

ACCTTCACAT TATATATAGT TCTTTCCATA TAAATGTCCA AATTTTTAGA ACAACGCAAT 4020

AAATAACCAT CCACCTAACT TATCAAAAAT TTAAGTGGAT GGTTTTTCAT TTTCATTTAT 4080

ATTTATATTA GTGTTAATCC AATCATAGAT TTATCTATAT GCACTGCTCT ATACATTTCC 4140

TCATTTAATT TGCTTTACTT TCATTTATAT CATTATCAAA ACACTTGGCG TGTCATCGTT 4200

ATTATTTCGC ATCTTTGACA CGTTTATCAT CATTAGGAAT CGCGAATAAA ATTGCGATAA 4260

ATGCCATGAT TCCCATTAAT ACGTTAACCC AAAGTGCAAT CATCGCACCT GTATGAATGC 4320

TCGTTGCAGC AACTGCACCA GCATATACAG CACCACTAAT TGCGACACCG AATGCGCCAC 4380

CAAGTGATGA AGCCATTTTA TAAATACCTG AAGCAACGCC AACTTTATCT AACGGTGCAT 4440

TCGAAATAGC TGTATCTGTA GAAGGTGTTG CATAAATACC TAAGCCTAGT CCGAAACATA 4500

AATATCCTAC GACACAACTG ATAACATAAA ATATGCCTGG TAAGAATACT AATGAAATAA 4560

GTGCAATACC AATGACCACA ATGAATGTAC CTAATAACAT TGGTCGCTTA GAACCCATTT 4620

TTTGTAATAA TTTTTCACCA ACTCGAATCA TCAATAACAC CATGATTAAA TAAGTAATTG 4680

ATAAGTATCC TGCCTGCAAT GCTGTATAAC CTAAACCTTG TTGCACGAAT GTATTCGCTA 4740

CAATTAATGT ACCTGCAAAA CCGTTTAATA AGAAGTTCGA AATCGTTGCA CCTGTATATG 4800

GTTTATTTTC AAATAATTTA AAATCAATAA GTGGATTATC TACTTTTTTC TCAACATTTA 4860
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AACCAAGTGC TGCACCTTTA GTAATGACAA 
CGTTTAAACT TAGCAACATA ACTACTAGAA 4980

CAATTAGCCC TGCAACGTCA AATTTATGTG TATTGGTAAT TTCTGATTTC GTTTCAGGCG 5040

TCCCTTTGAT GAGTAACATT GAAAGTACGG CAACGATAAT TGAGAAGATG AAAATCCATC 5100

TCCAACCCAT AGTTGTCGCA ACTGCACCAC CGAAGAGTGA ACAGATACCA CTGCCACCCC 5160

AAGAACCGAT AGACCAATAA CTTAAGGCAC GCTGACGTTC AGCACCCTGA TAATAAGTTT 5220

TCATAATGGC CAATGTAGAA GGCATAATAC ACGCTGCTGA TACACCTTGT ATAACACGAC 5280

CTAAAATTAA TAATGCCGGT AAATTCGTAA TAATAATTAA TGCTGAACCA ATAATACTTA 5340

ATAATAAACC GATATTCGTC ATTTTCACGC GCCCAATTTT ATCTGCCAGA CCACCTGCTC 5400

CAACAACAAA CATGCCTGAA AATAGTGCAG TTAGACTGAC CGCAATACTA ATTGTCCCCA 5460

TGTCTGTACC AAAACTTTGT TGTAAATTCG GTACAACATT TACAAGTGAT TGTGCAAACA 5520

ACCAAAATGT AATAACACCT AATACAATAC CTAAGATTAA CTTGTTGCCC CCGCGATACG 5580

TTTCATTCAT GTTAGTTATC TCCTTTAAGG TAATCTAAAA CAACTGTCCC TACTGCTTCT 5640

GCAGAAATAA GTAATGATTT TTCTGAAATG TTAAATTTAG GATGATGATG TGGGTAAATT 5700

TCACCATTTT CCACCGCTGC ACCTGTATAA ATAAAGGCAC TTGGGCGTTC TTTAGCATAA 5760

TATGCAAAGT CTTCTGAAGG TGGTTGTGGT TCACACATTT CAACACCAAA ATCAAGGTTT 5820

GCTTCTTTCA ACGTCTTAGC CACGTACTCA GTAAACTCTG GATCATTATA TAATGCTGGA 5880

TAATCATCGT TATATTCTAA GGTGCAAGTt ACACCATACA TATCCTCTAA TCCTTTTGAT 5940

AAACGTTTAA TTTCTTTTTC AATTGTTGCT TTTGTAGCAT CTGTTAATCC ACGTACATCA 6000

CCTTCAATTT CAACAACATC TTTAATGACA TTGAATTGAC CTTTACCGTC AAATGAACCG 6060

ATTGTGACAA CACCGGTTTC AAATGGACTT AGTCGTCTAG ATACAACTGT TTGTAACGCT 6120

GTGACGAAGT AGCTACCTGC AACAATGGCA TCATTGGCCA TATGTGGTGA TGAACCATGA 6180

CCACCTTTAC CTTGAACTTT CAATTTGAAG AATGCGCGTC CTGTTTGAAC ATAACCAGGT 6240

CTGTAATACA CTTTACCTGT TTTCATTGTG CTCATGACGT GTACACCTAA TACATGATCA 6300

ACACCGTCTA ATACACCATT TTCAATCATT GTTTTAGCAC CACCTGGTGG TACTTCTTCA 6360

GCTGGTTGAT GTATCACAAC GACTTTTCCT GTAAAACTAT CTTTCATTTC AGCAAGCGTC 6420

TCTGCTAATA CAAGCATGTA TGCTGTATGT GCATCGTGAC CACATGCGTG CATAACACCT 6480

TTATTTTGTG ATGCAAAAGA TAATCCTGTA TCTTCAGTAA TGGGTAATGC GTCAAAGTCT 6540

GCACGGATTG CTAATGTTTT ACCAGGTTTC CCTGAATCAA TCGTTACTTT AATTCCACGT 6600

GGTCCGACAT TCGTTTCTAC TTCCACATCT TTACCnTGT AAAATTCAGC GATGTATTTC 6660
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ATCATTTTGC CTTCTTTAGA TTTTAAAGTT -TCAATTAATT GTTGATTCAT ATCCTTCATG 6780 

TCCTTAGTTA CATCATAAAT GATTAATCAT TATTTATATT GCCAACAACA GAQATGTTAA 6840

CCATTAATTT TTTGCAATTT TAGCTTTGAA TATAAAAAAT CACAAATTAT GTATATCAAA 6900 

ATTTGTGATT TGTGATCATT TTATGAACTT GGGTAACGTT TTACTTCAAT TAAGTGAATC 6960 

CCATTCGTAA TCATTTTAAT GTTTAATGCC AGTGTGTCCG TGATATCTAT ATCATATACT 7020 

TCTAATTTCG GAAAACTCAT TCGATTAACG TAATCTATAG AGTCCTTGTC CATGCCATGT 7080 

ATCGTATGAT GTTTGCGCCA AAGATTAAAT AACGCACCAT TTTCTTTATC TAAGGTAAAA 7140 

TGTTTAATCT TATACATACC TTCTTCCAGG GCATTAATGT TCAAATGAAT CATTTCCGTC 7200

GCACGCATAT TCATTTGATT GTCCAACGCT AAGTACGGAT TAAAATGCTT TGCATCATAT 7260 

AACAATATTT GAAAATTTGA ATCAGTCCCC GTGACAATAC ATGTATCATC AGAATACAAA 7320 

ATATTGCTTG TTAATTTATT AAATAGCAAT GCCGTGAAAT AGACCGGACG TTTTCCATTA 7380

TATTGATGAA ATAGTTCAAT AGAATTCATA TAATCCCGTT CATTTTTACA ATGACTGACG 7440 

TGCAAATCAT AATTCAACCA ATACCCGATA CCCTCTACTT TAGAACTTAA TTTTAATAAT 7500 

TGCTCAATGA TGATACCACC TCTAAAATAT TCGCCGTTTG TAATAAATGT ATCACCCGTC 7560 

AATGTATTCC AATTGAGTAA AATGAGTGGA CGCTTTAGGC GATGACGATG CATTAAGTCG 7620 

ATAAGGTAAT TCGTTTTATT AATAATCATT TGACTCGCGG TTTTAAATTC ATCATCATTC 7680 

ATTTTATTAA AATCAACAGC GTCATTTGAA TTGGCATCAA ATACAAAATG GTCGATGTGT 7740 

GGCTCAAGTC GTTTCAATAA TGGTAGATGT CTTTCCGTAG CTTGATCTAA GTGAATGTAC 7800

AAGCCACCAT TAGGGAATAA TGCTTTAAAA TAATCAATCA TTTCAATCAA AGACGTGTGC 7860

AATGTCGTCA CATACAAGTT GAACTTCAAA TCTTTTCTAT GACTGACATG CAGGGCAACG 7920 

TGATSGATAA AAATTTTAAA TGCATCGATA TAATCACGTG AGTCATACTG ATCCAAATGC 7980 

ATGGTCAAAC TAAAGTTATG ATCTAATAAA AAGTCTAAAC ACAAATCAAT ATCATAAAAT 8040

ATATTCGAAA TTTCTGCATC ATACGTGAAT GGCGCATTGA GCTTTTTCAT GATATATGGA 8100 

ATCACATCAT ATGCTAATAC TTCATTGACT TGAAAATCAT GATGACATGT AAGCAACTGT 8160 



GATTGATACT GTGTATTGAG CAAATTCCTC AAATAGCCCA CTTGAATAAT ATGATTAAAT 8220 

TGATTTAGTT GGTGATTGGT TGGTTGAAAG GCAATCTCTT TATAGTTCAT CTTTTCAATA 8280 

TCTTCAATAA AATGATTCAT TTCTTCAATG TAGTCATTTA AAAGTAATAT CAATTCACGG 8340

TCGTGATAAT CATGTTGTGC CGATTGCTGG TTTTCAGTGA TTGCTGGACG ATCACCTCGA 8400 

TATTGTTTAG GTGTTTGATG CGTAAAGTGT TTAAATGTTC TCGCAAAGCT CGCTGCACTT 8460 
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TTCGCATGCT CAATTCGCGT CGTATTTAAG AAATGATGGA ATCCTACACC TAGCGATTCT 8580 

GTAAACTTTT TAGACAGATG GCTCTCTGAC CACCCAACGT ATTCGCTTAA TTCTGAAAGG 8640 

CTTAAATCTT CATGAAAATG TAACTCGATA TAGTCGCATA CTTGATTCAC TTTATCATCA 8700 

TTTAAGATAC TTTGGTTCGA ATGATATGTA CGCGGGACAT AATGAATCAT ATGCATAAGC 8760 

AACTGAATCA CAAGTTGTTG CTCAGTCAAT TTAGACAACT CATTATGTCG GATATGTGTT 8820 

GAAACCAGTC TTGCCATTAT ATTTCTCAGT TGATGTATAT TCTTTGTTGT GGTCGCATCT 8880 

GTTAAGTGAA AATATAGACA ATGCACATCA TCAAACTTGT CTGCTAAATA TTTCATTTGG 8940 

AATTGGATAT AACATATGAT GCCATCTTGT TGAAGTTGAA ATCGATACAA GTCGCGGTGG 9000

TTAATGATGA AAATGTCGCC ACTGTTGCAT TGCGTCATAT TATTTTCATC ATAAATGTGT 9060 

GCCTCnCCTT TAATAACAAA ACCAATCATT AAACTATTGA GCCTTTTGAA ATCTGACATA 9120

CTCTCAGTTT CTACTCGAAT TAAATAATCA CGTTGCATAC TATCCCTGAA TTCAGTAATA 9180

TGAATACGTT TATTTTACAT TATTTTACAG CAACATATTT GAATTTCATA TTGAATCGTG 9240 

TGTGTGGATG ATTATTTATC CTCACTCGGT TCAAGATGTA GACTATCAGT AAAAAAAGTA 9300

TTTTCACCTT TTTTCTCCAC AAAAGTAAAT TCAATGTCTT TATATCCAAC TGrTGaACCT 9360

TTTAAGTCTC CCgAACCTTT CaACaATAAC TTTGGTGCTT TATTCGTTGG TATTTTATAT 9420

CTTTTTCGTA ATTGTTTTAC ATTATAGTCA TCATTAGTTA ATTGATATTT TGCTGAATAA 9480

CTCGGTACCT CTGGATTATA TGATATATCG CCGTCTTTGT ACTTCGACAA ATCTTTAAAG 9540

CTGCCATATT GCGCGAAGAA CTTAAAATTC TCGATTTCTT TTTTTATATT TTCGTCTTTG 9600 

ATACCTTTAG TTGGAATGAT TTTATTGTCT ACCATTTTAA CGGGATATTC TTTATCTTTA 9660

CTCTTAGGTC TACCATCTTC ATCATGAAGT GTTTCACTCA CTATATACTT CCCGGTTGTA 9720 

GTCTTAGTGT TTCTATTCAT ATATAGAACC ATACCTTTTG ATTTCATACG TTCCCCTTTA 9780 

GGTTGAACAA CCATTTCAGA ACCAATAATC CATGTACCTT TATCATTTTT ATCAAATTCG 9840

TCATCACGAT AACCTTCTTT ATCGTATAAA TCCTCTAGAT TTTTAATCGG ATACATACTC 9900 

AATGTTTTTT CAAAGCTTTT CTTAACTTCC GCTTCTTTAC CTATGCCACA ACCAGCAGTG 9960 

AAACTAATGA CTAATATCAA AAAACTAATA TACAATACCA ATTTGTTTAA TCGTTTCATA 10020 

ATTTCACAAT CCTATTCTTC TTATTATCTT TCCTGGATTG ATTTCATATT TTGATCGAGT 10080 

CATGATTATT TATCCTCACT TGGTTTAAAA ATTAACCCAT CACTAAAGTA AATGTTCTCT 10140 

TCTTTTTTCT CTACAAACGT AAATTCAATG TCTTTATATC CAACTGATGA ACCTTTTAAA 10200 

TTCCCTGTAC CyTTCAACAA CArCTTCGGy GCTTTATTTG TTGGTATGTC ATATCTTTTA 10260 
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ACCTCTGGAT TATATGATAT ATCTCCATCT TTATAATTCA TTAAATCTTT AAAATTGCTA 10380 

TATTGCGCAA AAAACTTAAA GTTTTCGATT TCTTTTTTTA TGTtTTCTTC TTTAACTTCC 10440

TCAGTAGAAA TGAATTTATT ATTAATCATT TTAACTGGAT ATTTTTTTTG ATTATCCTGA 10500 

GCTACTTCGT ATTTCTCCGT CTTTAtTTCA TTAGTATAGT AAAAtCCTTT TGCACTTCTT 10560 

GTATTTCTAT CTATCTTCAA AAGCATGCCT TTTATTTTTA GAGCTTCTCC TTTATTTTGA 10620 

ATTGCCATTT GAGAATTTAC AATCCATGTT CCCTTATCAT TTTTATCAAA TTGATCATCA 10680 

CGATATCCTT CTTTATCGTA TAAATCCTCT AGATCTTTAA TCGGATACAT ACTCAATGTT 10740 

TTTTCAAAAC TTTTCTTT 10758 
(2) INFORMATION FOR SEQ ID NO: 222: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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■ (xi) SEQUENCE DESCRIPTION: SEQ ID NO: 222: 

nTaTCaACTT TGGaATTTAA AgTCAATAAC TTTTTTAAAA ACTTTTTGTG TTCACAACCC 60 

GCTTCTTTTT CAACGCGTTT ATTGCTTAAC ACAAGAACTT ATTTTACCAG CATTCCAAAA 120 

CAAATCAACA TAAAAACGTA CAAAATAAAA GTAATTTTGT ACGTTTAGCA TATATTATAC 180

CTATTTATTT GTAGCAGCTA TAACTTTTTG TGCAATCGAG CTATAAATTT TACCTAGACG 240 

ATCATCTGAT TGATATATTG ACGGTGCAAA ATCTTTTGGA TTCCAAGATG GTTGCTCTAA 300 

AGGTAATTCC CCAAGTAATT GAGTATTAAG TTCATCAGCT AACTTAGTAC CGCCACCTTT 360 

GCCAAAGACA TATTCTTTAT TACCCGTCTC TTTACTTTCA AAATAACTCA TGTTTTCAAT 420 

TACGCCAAGA ATAGAATGAT CCGTATGTTT TGCCATCGCA CCTGCGCGAg CTGCAACAAA 480

TGCTGCTGTA GGATGAGGTG TCGTTACAAT AATTTCCTTA CTTGAAGGTA ACATCGTATG 540

AACATCTAAA GCTACATCTC CTGTTCCAGG TGGAAGATCG AGTATTAAAT ATTCAATGTC 600 

TCCCCATTTA ACTTCTGTAA AGAAATTCGT CAACATTTTA CCTAACATTG GCCCTCTCCA 660 

TATAACTGGC GCATTTTCTT CCACAAAAAA GGCCATTGAT ATAACTTTAA CGCCATGACG 720

TTCAACTGGA ATTACTTCCT TCCCTTTAAT TCCAGGCTTT TCATCAATAC CCATCATATC 780 

TGGTACACTA AATCCATATA TATCGGCATC TACTAATCCG ACTTTTTTCC CTTCACGAGC 840 

TAAGGCAACG GCTAAATTTA CTGCAACAGT AGATTTACCG ACACCACCTT TACCGGAGGC 900 
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ATTTTCTTCT TTTGGTTTAA ATTGATTTAC TTTTTCTT CC GGCAATGTTT CAAATCGTAT 1020 

ACCGACCGTT TTCGCACCGT TTTCTTTTAA TGCATTAACA ACAGCCATCT GTAAATCTAA 1080 

aTTGCGtGCA CCACCTAATT GTGCCATTG 1109

(2) INFORMATION FOR SEQ ID NO: 223: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 3997 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 223: 

TCTTTATTTA AAAAAATGAT TGTCTAGTTT GTATCTCTCT GAAGATTTGG CAATAAATAA 60 

AAGCCGATAA CCGTATAATG ATTATCGACT TAAAGTTTAT GTGGCATTTT TTACTTTTGT 120

AATTTCAGGT GAGTTAGATG ATTATTATCA GATAGATTAT TGCTTATAAT CATATGATGT 180

TTGAATGATA TCTTTGATTT CACTGATTAG TGCTTCTTTA GGATTAGCAG TTGTACATTG 240

ATCTTCAAAT GCGAGCTCTG CCATTCTATC AATTGACTCA TTTAATTCTT CTTCAGACAC 300 

ACCTTGTGAT TTCAAATTCA TTTCAATTCC GACTGATTGA CCTAATTCGT AGACAGCTTT 360 

AgCTAATGAT TCTACGAGTG CTTCTGTCGT ATTACCTTTT AATCCTAAGA ATTTGGCAAT 420 

ATCTGCATAA TCTGTATCTG CTCTGAAGAA CTCATATTTA GGGAATAATG CATGTTTTTG 480 

CGGGTCTTTG GCATTATAAC GGATAATATG CGGTAGTAAT ATCGCATTCG CTCTACCATG 540

CGGAATACCA TATTCGCCAC CAATTTTATG CGCAATTGAG TGTGCAATGC CTAAGAATGC 600

ATTTGCAAAT GCCATACCAG CCAAAGTTGA TGCGTTATGC ATTTTCTCTC TTGAAACTTT 660 

ATCACCCTTT TCAACAGATG ATTTTAAATA TTCGAACGTC AATTTAATCG CTTGTAGACT 720 

CAAACCTCTT GTGTAGTCTG AAGCCATTAC AGATACATAT GATTCCATTG CATGCGTTAG 780

TACATCCATT CCTGTATCTG CTGTAACGCT TTTTGGCACA CTCATCACAA ATTGAGGGTC 840 

AATAATTGCA ACGTCAGGTG TTAAAGCAAA ATCAGCCAAC GGATATTTTA CATTTGTTTC 900 

ACTATCTGTG ATAACTGCAA ATGGTGTTAC TTCTGAACCT GTACCTGATG TCGTAGGGAT 960

ACAAATGAAC GTCGCATTTT CAGGCATGCC TATTTTATAA GTACGTTTAC CGATGTCTAG 1020 

GAACTTTTGT TTAGCACCGA AGAATGATGT CTCAGGGTGT TCAAAGAACA TCCACATTGC 1080

TTTTGCAGCA TCCATCGCTG AACCACCACC AAGTGCAATG ATTGTATCCG GTTGGAAATC 1140 

AACCATCATT TCCAGACCTT TATATACTGT ATTAGTTGAT GGGTTCGGTT CGACTTCGCT 1200 
ATAACCGAAT TCTACCATAC CAGGGTCACA GACAATCATC ACTTTTTCAA TCTTGTCCAT 1320

TGTTGTTAGA CTCATGATTG CATTTTCTTC AAAATAAATT TGAGCAGGCA CCTTGAAAAT 1380 

TTGAGTATTA TTACGTCGTT TAGCAATCGT TTTAATGTTT AATAAATCTG TCGCACTAAC 1440 

ATTATGTGAA ATTGAGTTTC TACCGTAGaA CCACAACCTA ATGTTAAAGA CGGAATCAAT 1500 

TCGTTATACA TATCACCAAT ACCTCCAACC GCTGATGGTG TATTTACAAG TACACGACAA 1560 

GCTTTCATTC TTAGTCCAAA ATCTTTTTGT AATGTTTCAT CTTCTGTATG GATAACGGCT 1620

GTGTGTCCTA ATCCACCAAA ATGTAGTGTG TCTTCACAAA TTTGAAATGC TTGTTTTGTA 1680 

GATTGGGCTT TTACTAAGGC TAATACTGGA GATAATTTTT CACGAGATAA CGGATAGTCT 1740 

GAACCTACAC CGCTAATTTC GGCTATGATA AGTTTTGTAT TTTCGGGGAC AGGTATACCT 1800 

GCTAATTCAG CTATTTCAAC TGCAGATTTA CCGACAATAT CAGGCTTAAT ACCTGTTTTT 1860 

TGTTCATTcA TAATTGCATT TTCTAAGCGT TGTAATTCAT CTTTTTTAAC AAAGTATGCT 1920

TGATGTGCTT TAAATTCATT AGTAACATCT TTATAAATTT CTTTATCAAT GACTACAACT 1980

TGTTCAGAAG CACAAATCAT ACCATTATCA AATGTTTTTG AACCAATGAT ATCATTTACT 2040 

GCACGTTTAA TGTGTGCTGT TTTTTCAATG TAAGACGGCA CGTTACCTGG TCCCACACCT 2100 

AATGCCGGTT TGCCAGTTGA ATATGCAGAC TTAACCATGC CCGAACCACC TGTTGCTAGA 2160

ACTAATGCAA TACCTTTGTG ATTCATTAAT TGTTTTGTTG CTTCGATAGA AGGCACTTCA 2220 

ATCCACTGAA TAATATCTTT AGGTGCACCT GCCTTCATTG CCGCTTCTAA TACAACTTCT 2280 

GCTGCACGCT TCGACGATTC TTGTGCACTT GGATGGAATG CAAAAATGAT TGGATTTCCT 2340

GTCTTAATTG CAATCATCGC TTTAAAAATA GTTGTCGACG TAGGATTTGT TGTTGGCGTA 2400

ACACCACAAA TAACACCAAT TGGTTCCGCT ACATACGTTA ATCCTTTTTC TTTATCTTCA 2460 

CCAATAATCC CTACTGTCTT ATTGTCTTTT ATTGAATTCC ATATATATTC AGAAGCGTAT 2520 

AAATTTTTAA TCGCTTTATC TTCGTATATA CCTCTTCCAG TTTCTTCATG TGCTAATTTT 2580

GCTAGCACCA TATGTTGATC AACAGCTGCT AAGCTCaTTT GATGAACAAT ATGATCAATT 2640 

TCTTCTTGTG ACTTTTTAGA TAATGCTTCT AATGCTTTTT TCCCTTTGTC AGCTAGAGCA 2700 

TCAATCATAA TTGCCACTTC TTGTTCTTTC GATCCACGAT TTTCTTTTTC AGGTATAGTT 2760

AACATATACA ACCACTCCTT TATACTTTGT GAATTATTTC ACAAACATTA TAGTACATGT 2820

CTCTCAGGAT ATAAAGAAAA TTCTATACAA AAAAGTTTAA TTTCGAATAT TATTTGAACA 2880 

AATATCAAAT TTTAAAATAA ATGTTTTCAT GAAATCATTG TTATTTCGGT GTTTTTAGAA 2940 

TGATTTTATA ATCATAATTT TTTCAATGAC ATAATTTATT CATAATTATA TATTTAATTC 3000

TCCTTGTCGA TACCTATCAA CAGATGTTAC AAATAAAAAC CaCCCGTGTG AACGGGTGGT 3120

TTGTTCTGCG gCTATAAGCC TTCCTTACTG GCCaGCCCTA AAAGGGCACT GACAAGTCAG 3180 

CCAACTGCAC TACTATTCCA GCAATCCTAA AGGTTTACTC TTTTTTCTTT CTTTTTTTAT 3240

TTTTCTCTCC AGTGAAAGGA TCTAAATATT CTTCCATTGA AATTTCATCT GCAACGATAT 3300 

CCTCTTGTAA TTGATTACGA ATATAATTTT CAATCACTTT TTTATTTCTA CCTACTGTAT 3360

CCACATAAAA TCCTTTACAC CAAAACTTTC TATTTCCATA TCTATACTTT AAGTTAGCAT 3420 

GTCTATCAAA TATCATTAAA TTACTTTTCC TTTTAAATAG CCAACAAATG ATGATACCCC 3480 

AAGTTTGGGT GGTATACTTA CTAACATATG GATATGATCT TTACATACCT CTGCTTCAAT 3540

TATCTCTACA CCTTTTCTTT CATATAATTG ACGTAATATA ATCCCTATAT CTTTTTTTAT 3600

TTTTCCATAT ATCGCTTGTC TTCTGTATTT AGGTGCAAAG ACAATATGTT ACTTACAATT 3660

CCATTTCGTA TGTGCTAAAC TGTTTGTGTC AGATGACATT AAATAGCATC TCCTCGTGTT 3720

GATTATTTTG GTTGGCTGAC CAATATTTAC TCTAACATGT AGAGATGCAT TTTTTTGACA 3780

ATGGTAGAAC CTTTTCTGGG GAGTGGGACA GAAATGATAT TTTCGCAAAA TTTATTTCGC 3840 

CGTCCCACCC CAACTTGCAT TGTCTGTAGA AATTGGGAAT CCAATTTCTC TTTGTTGGGG 3900 

CCCCGCCCCA ACTCGCATTG CCTGTAGAAT TTCTTTTCGA AATTCTCTGT GTTGGGGCCC 3960

CTGACTaGAA TTGAAAAAAG CTTaTTaCAA GCGCATT 3997 

(2) INFORMATION FOR SEQ ID NO: 224: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1391 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 224:

GnGCGAGACA AACACACtTA TTGGTGCCAT TATmCcTAGA ATGaATTCaT ATGCAGTAGA 60 

TGAaaCAATC AAAGGATTGG CAAAACAATG CCAAAAATAT GAATCaCAAT TAATTTTAAA 120 

TTACACAGGT TTAAATATCG AAGCAGAAAT ACAAGCGCTT GAAACATTAG CACGCaGTAA 180

AGTAGATGGT ATTGTTTTAA TGGCTACAGA CATAACAGAG AGACATATTG AAGTCATTAA 240 

TAAAATGAAT GTACCAATCG TTATTGTTGG TCAACAACAT GAACAACTTC ATAGTATTGT 300 

GCATGATGAT TATAAAGCAG GTCAAATTAT AGGCGAATGG ATTGGTCAAC AGGGATATCA 360 

ACAAGTTGAA GTGTTTAGTG TAAGTGAAAA AGATATTGCA GTTGGTATAC ATAGAAAACG 420

TACTTATGTG GAAGCACAAA AAGATGTTGC AAATGTTTTG GAAAATGTGG AGCAAGTAGA 540

TGCGGTTGTT GGAGCAACTG ATACGATTGC ATTAGCTGCC TATAAATATT ATTCTGATAA 600 

AAAAGATGTT ATGAAACCAC ATCAAATATA TGGTTTTGGT GGTGACCCAA TGACACAATT 660 

AGTGTCTCCA TCGATAAAAA CAATTCATTA TAATTATTTT GAAGCTGGCC AATGCGCGAT 720 

GGaAGAGATA CAACAGATGC TTAAAAAGCA AGATATGCCA TATAGCGTCA CAGTAGATGT 780 

TAATATTTAG ACGCTGTATT TTTTAAAATA AATGTGGAAC CGATACCATA TAACTATAAA 840

TGGATAGGTT AAAAGTTAAA GAACGTAGGT AAAATTTGCT ATAATAGAAT ATAAATTGTT 900 

AACAGCATAA ATTATAAAAG GAGGACTGGG TAAATATTAT GACCGAATGG ACTAGAGAAG 960 

AACGTTATCA ACGAATCGAG GACGTTGATA CTGAGTATTT TAAAACATTA AAACAACAAG 1020 

TTGATCAATC AAAATTTCGT CAACAATTTC ATATACAACC AGAAACAGGC TTATTAAATG 1080 

ACCCCAACGG ACTTATTTTT TATAAAGGGA AGTATTATGT TTCACATCAA TGGTTCCCAT 1140

TAGGCGCAGT ACATGGCTTA AAGTATTGGT ATAACTACAC GAGTGATGAC TTAATAAACT 1200 

TTAAAGCTGA AGGGCCAATT TTAAATCCAG ATACTAAATA TGACAGCCAT GGTGTATATA 1260 

GCGGTAGCGC TTTTGAATAT AACGGGCATT TATATTATAT GTACACAGGA AATCATCGAG 1320 

ATAATCATTG GCAACGACAT GCGAGTACAG ATGATCGCAC GATTGAAAGA AGACGGTTnC 1380 

AGTTGGnAAA A 1391 

(2) INFORMATION FOR SEQ ID NO: 225: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 930 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 225:

ATTTATTTTA ATGTTTATAT TTTCTAACAC TTTTTTATGA TCATAGTAGT AATTGACATT 60 

TTTCAATTCA AAGACTGGTG TCATCGTATC TCACCTCGCA TTCAACTATA CAACTCCTAG 120 

TAACATATGT AAACAGTAAT GTTTACGACT CAAAATTAGA CAAAATAAAG AGATATGCCC 180 

CCTTCAAGTT TTATTTATCG CATTTCTTGA AGAGAGCATT ATCATTTTAT TGTTGCATAA 240

CCTTATTTTT TAATTCTGGG TCAAATTGCT GTTGTTTTAA CATTTCAATT TCAAGTTTAT 300 

ATGGCGGTTT TTTATTTTTC TTATCTTCAC CAACATAAGG TGTTTCTAAG ATTTTCGGAA 360 

TATCTTTAAA ACTATCATGA TGCACAATGT AATTTAATGC ATCAAAACCA ATGTAACCGA 420 
GAACAACTTT GATTCTGTCG ACTCCAATGA TTTTATCAAA TTCATTTAAT ACGCCATCAA 540

AGTCCTCTTT AACATTATAT CCAGCATCAT GCGTATGACA TGTATCAAAA CATACTGATA 600 

AACGTTCGTT ATTATGAACT CCATCAATAA TACGTGCTAA CTCTTCAAAT GAGCGACCAA 660 

TCTCTGTACC TTTACCTGCC ATCGTTTCAA GCGCAATACG TACATTATTG TCATTCGTTA 720 

AAACTTCATT TAATCCTTCA ATAATCTTAT TAATTCCGGC ATCAACACCA GCTCCAACAT 780 

GCGCACCTGG ATGTAATACa ATATCTTTAG CCCCTATAGC TTGCGTTCTk TCaATTTCTT 840

GTTGCAAGAA ATCTACACCA AGATTAAACG TTTCTGGTTT GGTTGTAttG CAATaTTaAT 900 

GATGTATGGT GGCATGAACA ACAATATTAG 930 
(2) INFORMATION FOR SEQ ID NO: 226: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1984 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 226:

TGACGCACCA ATTTATAACG CAATTGACAA AACAATTAGA TATACCTGTG AAATTTGTAC 60 

CTGGAAACCA TGATTTATGG GAAGTTGAAA GTATGACTAC GCAAGACATT TGGAATAATT 120 

ATAAGAGTAT GTCACAGTGC TTGGTAGGAA AACCATTTAT AGTAAATGAA GAATGGGCAA 180 

TCATAGGACA TACTGGCTGG TATGATTATA GCTTTGCAGC ACAACGATTT TCATTAGATG 240

AGTTACAAAA AGGAAAACAT TATGGTGCGA CTTGGCAAGA TAAAGAACGA ATATCTTGGG 300 

GCATATCAGA TCAAAATTTA TCTAAAATAG CGGCTGAACA AGTGAAGAAA GATATATTAG 360 

AAGTAGGAAA TAGACGAGTG ATTTTAGTCA CACATGTTGT GACGCACCCT GATTTCATTG 420 

TTCCTATGCC GCATCGTATA TTCGATTTTT ATAATGCATT TATTGGGACA AGTGATTTCA 480

ATCCTTTGTA TGCGATGTTC GATATACCAT ATAGTATTAT GGGCCATGTT CATTTTCGTA 540 

AAAGTGTGAT AGATGATGGC AGATGTTATC TCTGTCCGTG TCTAGGCTAT CCAAGACAAT 600 

GGCGTTCAGA AGATATTTAC CAGGAAATAA ATGAGACGAT ACAAATAATA GAAATTTAAA 660 

ATGCGCAAAC CTGACCCAGT TTGCGCATTT TATGTTTTAC ACACGCGAGT AATGTGTTTA 720 

CTTACGTGTG TTTATTTTGT TGCTGATTTT CAATTGTATA TGAATGTGGT TGCACATAAA 780 

TGCACTTTCT TCCTGGTGAA TTAAAGCTGT ATTCCATTTT CTCTTTACGA ATTTTAATAA 840 

TTTGTTTGCG ATTTGGAATG ATGGCAGGTA AAACTAGGCC ACGACGAATA TGACTCCAAA 900 
TTGAAACTTG TTTCGCTGGC TTGTTATCAA AGCGGAAAAC ACGTAGTAAT GGTTTAGAAC 1020

CAAGATTAGT ATGGTATATT AACACAGGTT GACCTTGATC GATAATACCT TTAAGATCTT 1080

CTAACGATTT ACCAGTGCCG TCTACGATAT TAGGATTGTA TTTTTGTAAA AATGGTACAT 1140 

ATGCTTCTGG AAATATCGTT TGATGATAAT TGCCAAGCTT AATGAATAAG TGATGTCCAA 1200 

CATAACCTTT ATGTGGATTG TTCGGATGTG TCGGCCAATG TCTCATAATT TCTGTAGCAG 1260 

GGATATGTTG GTTGTTGTAT TGCAACATCA TGGCTGCGGA AACACCTTCA CACCCCATGA 1320 

CCATAGGGAT AGGAAATAGC TGACTGATAG GTTTAACTGG TAATATTTTT CGGTTCATAA 1380 

TATAGTCCTC GCATTGATTC AATAAATATT TAATATAATT ATATAGCGTC AATGCAAAAT 1440 

GTCCTAAACA TATGTTTTAC ATGAGTGAAT AAAATTAATG GAGTGATAAA ATGGAATATC 1500 

AATTACAACA ACTTGCGTCG TTAACGTTAG TAGGTATTAA AGAAACGTAT GAAAATGGAC 1560

GACAGGCTCA GCAACATATA GCAGGGTTTT GGCAAAGATG TTATCAAGAG GGAGTAATTG 1620

CGGATTTACA GTTAAAAAAT AATGGTGATT TAGCCGGGAT ACTTGGCTTA TGTATACCTG 1680

AATTAGACGG TAAGATGTCA TATATGATTG CAGTTACCGG AGATAATAGT GCTGATATTG 1740

AAAAATATGA TGTCATAACA TTAGCAAGTT CAAAGTATAT GGTATTTGAA GCACAGGGCG 1800 

CAGTACCTAA AGCAGTTCAA CAAAAAATGG AAGAGGTTCA TCACTACATA CATCAATATC 1860 

AAGCAGATAC GGTAAAATCA GCACCATTTT TTGAGTTGTA TCAGGATGGT GATACTACAA 1920 

GTGGAAAATT AATATTACCA GAAATTTGGG ATnCCTGTTA AAGGGGTGAT TGAAATAnGA 1980 

AnTG 1984 
(2) INFORMATION FOR SEQ ID NO: 227: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6373 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 227:

GATTCCACGT GTGTTAAAAG AAGTTACAcC TTCAATGATG GTATTTACTA ATTTCTTTAG 60 

AGATCAAATG GATCGCTTCG GTGAAATTGA TATTATGGTT AATAACATTG CAGAGACAAT 120 

TAGTAATAAA GGCATCAAAT TATTGCTAAA TGCTGATGAT CCATTTGTGA GTCGTTTGAA 180 

AATCGCAAGT GATACGATTG TGTACTATGG TATGAAAGCA CATGCCCATG AATTTGAACA 240

AAGTACGATG AATGAAAGTA GATATTGTCC AAACTGTGGT CGCTTATTGC AATACGATTA 300 
AAAATATGAA ATATCAAGTT TTGATGTGGC ACCGTTTTTA TATTTAAATA TCAATGATGA 420

AAAATATGAT ATGAAAATTG CAGGTGACTT TAACGCTTAT AACGCGTTAC AGCATATACT 480

GTTTTAAGAG AGCTAGGGTT AAATGAACAA ACAATTAAAA ATGGCTTTGA AACGTATACA 540

TCAGACAATG GTCGTATGCA GTACTTTAAA AAAGAACGAA AAGAAGCGAT GATCAATTTA 600 

GCTAAAAATC CTGCAGGAAT GAATGCAAGT TTATCAGTTG GTGAACAATT AGAAGGCGAA 660

AAAGTGTATG TTATTTCGCT AAATGATAAC GCTGCAGATG GTCGAGATAC TTCATGGATT 720

TATGATGCAG ATTTTGAAAA ATTATCTAAG CAACAAATTG AAGCTATCAT CGTGACAGGT 780 

ACACGAGCAG AAGAACTTCA ATTGCGATTG AAGTTAGCAG AGGTTGAAGT ACCAATTATA 840

GTTGAGCGTG ATATTTATAA AGCAACGGCA AAGACTATGG ATTATAAAGG TTTCACAGTT 900 

GCAATACCAA ACTATACATC ATTAGCGCCT ATGCTTGAAC AATTAAACCG TTCGTTTGAA 960 

GGAGGTCAAT CATAATATGC ATGAATTGAC TATTTATCAT TTTATGTCAG ATAAATTGAA 1020

TTTATACAGT GATATAGGAA ATATTATTGC TTTAAGACAA CGTGCTAAAA AACGAAATAT 1080 

TAAAGTTAAT GTCGTAGAAA TCAATGAAAC AGAAGGTATT ACCTTTGATG AATGTGATAT 1140 

TTTCTTTATC GGTGGTGGAA GTGATAGAGA ACAAGCATTA GCAACAAAAG AATTAAGTAA 1200 

AATTAAGACA CCACTTAAAG AAGCGATTGA AGATGGTATG CCGGGATTAA CGATTTGTGG 1260 

AGGCTATCAA TTTTTAGGGA AAAAATATAT CACGCCTGAT GGTACAGAAT TAGAAGGGTT 1320 

AGGTATTTTA GATTTTTATA CTGAATCAAA GACAAACCGA TTAACAGGAG ATATTGTTAT 1380

CGAAAGTGAT ACTTTTGGAA CTATTGTAGG TTTTGAAAAT CACGGTGGTA GAACATATCA 1440

TGATTTCGGT ACACTTGGTC ATGTTACTTT TGGTTATGGT AATAATGATG AAGATAAAAA 1500

AGAAGGCATT CATTATAAAA ATTTATTAGG TACTTATTTA CATGGACCAA TTTTACCTAA 1560 

AAATTACGAA ATCACTGATT ATCTGTTAGA AAAAGCTTGT GAACGTAAGG GTATTCCGTT 1620 

TGAGCCTAAA GAAATAGATA ATGAAGCGGA AATACAAGCG AAACAAGTAT TAATAGACAG 1680

AGCAAATAGA CAGAAGAAAT CTCGTTAACT CTGAACATCG CATCAATGGA TTTAATATTG 1740 

ATAAACGATG AAGTTTAGTA ATTAATCATA TATGTATAAA CACACACATT ATTTTGGATG 1800 

GAAACAACCA AATTGATGTG TGTTTTtTTG TTCTAGTGAA TAATTATTAT ACAATGAGTA 1860 

TCTATCCTAG AATTATCAAT AGTAATGGTG ATTATGCAAC ATGAAAAAAT GAATGATGAA 1920 

AGGAATTTGA CGATGAAGCC TACTAAAGTG ATATTAAAAG ATGCATCTTA TTTACATAGC 1980 

AAAACATCGA TAACATTTAT TTTAAAAGAT GTAGTTATCG AAGAAGATAA TAAAATTTAT 2040 

TATTTCGACA CTAGTGCAcT TTCGAAGATC AAGAAGTTAA ATTTGAATTT GCACTCTTTG 2100 

TTATAGAACC TGATTTACAT TTTACAATTA TTGATTTTAA TCAAGAACTG CTTTGTATTT 2220

ATATTGATTT TGATTCTGGT TTAAGGCATT CAAACATGGC AACAGAATCT GGTATTTCAT 2280

TAAGGATAAA TGTTGCTAAA TCAGATTTTA CTAAATTTAT TAATGAATTA GCCTCTTTAC 2340

ATTAATGATT TAAATCTGAT ATGTAATTAC AATCAAAAAA GACAGCCACA TCCCTCCGTA 2400

GTTTAGGCGT GTGGCTATAT TTGAGTCTGA ATATTTATGC TTGTAATTTT AAAAAGGGAC 2460

ATGCTATATA CGATAAAAAG AGGCGGGGAC ATAAATCAAT GTTCTATGCT CTACGAAGTT 2520

ATATTGGCAG TAGTTGACTG AACGAAAATG CGCTTGTAAC AAGCTTTTTT CAATTCTAGT 2580

CAGGGGCCCC AACAAAGAGA AATTGGATTC CCAATTTCTA CAGACAATGC AAGTTGGGGT 2640

GTGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA GGCAAAGCGA GTTGGGGTGG 2700

GACGACGAAA TAAATTTTAT GAAAATATCA TTTCTGTCCC ACTCCCATGG TGCCAATTAG 2760

CATAAGGTAC TTAAATTAAG CATATCTGCT GTCTAGCAGT CGATAAATCA TTAGAACTTC 2820

GTATAGTATA TGACTTTTAA TTTGATTTTC ACCACTAATT TCAAGTGCTT TTATAGTCGA 2880

ACGTAAAGTT TCTACAGAAT CATCTTCTCT CTTAAAAGAA CCATCATAAA ATATATCTTT 2940

GATGCTACTA CTAATTTTTA GCAATGCCAT TTTTTCGTCA CCTGAAAAGT TAACACGAGT 3000

ATTTTTAGGC AAGTAAATGA TATTTGATAA ATGAGTGATA AACAAACGAT TCGTATATGC 3060

ACGTTTAGTT AATTGATTGA GTAATTTCCA ATCACATTCT TTTTTCTTAT GATAGCTTAA 3120

TTCATCACGT TGATAACTTA TTAACGTTTC AACTTGATTA TTTAAATTGA AAATATTTTT 3180

ATATGCTTTT TCGCTTTTAT CAGATTGCAG TCTTGATAAG ATAAGTTCTT GGCAGCGATT 3240

GTAAAATAAT TTATACATCA AGGCATCTGT CTTACTTAAT TTTTCTTOGA CCTGACCATA 3300

ATACTTAGGT GGAAACACCA TGAAGTTAAT TAAACCTGAT GTCACGAGTC CAATAATTGC 3360

TGTCAATGTT CGAGACAAAA AGTTGAATAT GTAGGCATCA TGAATACCTG GAATCATAGC 3420

TAATGATGTT AGTACAGCGA CATTCGTACC AACTTGCAAT TTGAGTTTTG TACAGAATAA 3480

AATCGTGAAC GTTGCACTCA ATGCATATGT AAAAGGTGAT TGATCGCCGA ATAAATATGT 3540

AAATAATACT GCAAAGCCTG CACCAATTAC CGTAGCAGGT AATCTACGAT AACCTTTAAT 3600

AAGTGATGCC TTGGCAGTTG GTTCAATTGT GACTACAGCT GTTAAAATGG CATAGATGGG 3660

TGTTAAATCT AGTGCCATAC AAAAGACAGC TGTTAAAAAA ATGGCAATAC CAGTTTTAAT 3720

TGTTCTGGCA CCAATTAAAT GTTTATACCA TTGATCGTTC ATTTTTTAAC CTCTAATCAT 3780

CGTAAAATCT TAGCGAGCGC TTTATAATAA TAGTATCGTA CATTGGAAAA GTTCATGTAT 3840

GTAAAATATT TGAAATAATC ATACATAAGC ATTACTTTGA TTTTCATATA CATTAATCAA 3900

CAAGCATTTT TCAATTATAG TCCGGGGCCC CAACATAGAG AATTTCAAAA AAGAAATTCT 4020

ACAGGCAATG CAGGTTGGCG GGGCCCCAAC ACAGAAGCTG ACGAAAAGTC AGCTTACgAT 4080

AATGTGCAGG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCTACAATTT CTACAGGCAA 4140

TGCAAGTTGG GGTACAACGA TAAAGAAATA TTTTTTCTTT ATCACACTAT GTCTCACTCA 4200

CTTTCCAAAA TACTAAAGTA ACATCTTTAG TATATCAAAG AATTTTTGCT ATAATAAGTT 4260

ATAATTATAT AAAAAAGGAA CGGGATAAAA TGATTGTAAA AACAGAAGAA GAATTACAAG 4320

CGTTAAAAGA AATTGGATAC ATATGCGCTA AAGTGCGCAA TACAATGCAA GCTGCAACCA 4380

AACCAGGTAT CACTACGAAA GAGCTTGATA ATATTGCGAA AGAGTTATTT GAAGAATACG 4440

GTGCTATTTC TGCGCCAATT CATGATGAAA ATTTTCCTGG TCAAACGTGT ATTAGTGTCA 4500

ATGAAGAGGT GGCACATGGG ATTCCAAGTA AGCGTGTCAT TCGTGAAGGA GATTTAGTAA 4560

ATATTGATGT ATCGGCTTTG AAGAATGGCT ATTATGCAGA TACAGGCATT TCATTTGTCG 4620

TTGGAGAATC AGATGATCCA ATGAAACAAA AAGTATGTGA CGTAGCAACG ATGGCATTTG 4680

AGAATGCAAT TGCAAAAGTA AAACCGGGTA CTAAGTTAAG TAACATTGGT AAAGCGGTGC 4740

ATAATACAGC TAGACAAAAT GATTTGAAAG TCATTAAAAA CTTAACAGGT CATGGTGTTG 4800

GTTTATCATT ACATGAAGCA CCAGCACATG TACTTAATTA CTTTGATCCA AAAGACAAAA 4860

CATTATTAAC TGAAGGTATG GTATTAGCTA TTGAACCGTT TATCTCATCA AATGCATCAT 4920

TTGTTACAGA AGGTAAAAAT GAATGGGCTT TTGAAACGAG CGATAAAAGT TTTGTTGCTC 4980

AAATTGAGCA TACGGTTATC GTGACTAAGG ATGGTCCGAT TTTAACGACA AAGATTGAAG 5040

AAGAATAGTT CAACATATAC TAAGACTAAA GTATGAACAT CATTTAGTTC CGGAGCCTAT 5100

TCATATTGGT TTCGGAACTG TTTTATAATA ATTAAGAACA CAATCAATGC GTCATTTCAA 5160

AAATATGTTG TAACAAAGTA GTTTTTAAGC AAACATATCA TCGACATCAA CGAAGATACA 5220

TAGCGCATTT GGTATTTTAA AACTTATTAT AAAAGGTGAT AGTTATGAAC TATGTTGAAC 5280

GTTATATTGA ACAGTTTTTG AGAGCAACAG TAAGAAATAA TATCAAGCAC TACCTTTTAA 5340

TGCTAGATGA AAAAATGAAA AATTTAGATG ATTATATGCG TTATTTAATT ACTAAAAAAG 5400

AACAACTTAG CAAGTTAATT GACAGTCTAA TGCTAACATT AGAAAATAAA TATATTGATA 5460

TTGCTGAAGC ATTTCAAATT CAATGTGCAA GAGAAATCAA TAATCAAGAA ATTGAAAATA 5520

TTAAATCAGA GTTGAATAAA GTTGAAGCAT ATTATGCACA AATTGAAACT CAAATTCAAC 5580

AAACTTCAAC TGAAAAAATA GCAACAGAAA AAACATCGTA TCTAATAAAT TATATGAACG 5640

CTGTGGCATA GAAAGGCGGC GAAACATGAC ACACAAATAT ATATCAACGC AAATGTTGAT 5700

CTTTTTACTC GTTCTATTAT TGGGATGTGT ATTAGTTTAT GTAGGATATC TTTATTTTCA 5820

TAAAATACGT GGCCTTTTGG CGTTTTGGAT AGGCGCGCTA TTAATTGCAT TCACATTATT 5880 

GTCTAATAAG TATACAATCA TCATCTTGTT CGTCTTTTTA TTATTACTTA TTGTGCGTTA 5940

TTTAATACAC AAGTTTAAAC CAAAAAAAGT AGTTGCGACG GATGAGGTTA TGACTTCACC 6000 

ATCTTTTATT AAACAAAAGT GGTTTGGTGA GCAACGTACA CCAGTTTATG TATATAAGTG 6060 

GGAAGATGTA CAAATTCAAC ATGGAATTGG CGACCTACAT ATTGACTTAA CAAAAGCTGC 6120 

AAATATTAAG GAAAATAATA CCATTGTTGT TAGACACATT TTAGGTAAAG TGCAGGTTAT 6180 

ATTGCCGGTT AATTACAATA TTAATTTACA TGTAGCTGCT TTTTATGGAA GTACTTACGT 6240

GAATGAAAAA TCATATAAAG TTGAAAATAA CAATATTCAT ATTGAAGAAA TGATGAAACC 6300 

GGATAACTAT ACAGTTAATA TCTACGTATC AACGTTTATC GGAGACGTAG AGGTGATTyA 6360 

TCGATGAAyC ACT 6373

(2) INFORMATION FOR SEQ ID NO: 228: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4488 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 228:
ATAGnGAAAG CGTTTTACAC TTAATAACTC CCTCTTAAAT GCATCCAGGT TCTATGTAGT 60 
AAATCATGAA nATAACATAT AAATnTAGAG GAGATTTACC TTTGAATACA GAGAACAACA 120

AGAATCAAAA CCAATCTGTT AAAAATTCTG AAAGaCGCGG CATGTTAAAA GGATGCGGCG 180 
GTTGCCTTAT TTCTTTTATT TTATTAATAA TCTTATTATC AGCCTGTTCA ATGATGTTTA 240 
GTAATAATGA CAATTCCACT AATAATCAAT CATCAAAAAC GCAATTAACT CAAAAAGATG 300

AAAATAAAAA TGAAGATAAG CCTGAGGAAA AATCAGAAAC AGCAACAGAT GAGGATTTAC 360 
AATCAACCGA AGAAGTACCT GCAAATGAAA ATACTGAAAA TAATCAACAT GAAATTGATG 420 
AAATAACAAC AAAAGATCAA TCAGACGATG ATATTAACAC ACCAAACGTT GCAGAAGATA 480 
AATCACAAGA CGACTTGAAA GATGATTTAA AAGAAAAGCA ACAATCAAGT AACCATCATC 540 
AATCCACGCA ACCTAAGACC TCACCATCAA CTGAAACAAA CACGCAACAA TCATTTGCTA 600 
ATTGTAAGCA ACTTAGACAA GTATATCCGA ATGGTGTCAC TGCCGATCAT CCAGCATATC 660 
GACCACATTT AGATAGAGAT AAAGATAAAC GTGCATGTGA ACCTGATAAA TATTAAACAA 720 

GGGAGATTTT TTAGGCATGA GCAATCAATT CAAAAGCGAA GAAGAGCGAA GACAATGGGA 840

ACAATTCCAA GCTTTCCAAA ATCAACAAAA CCAACAGAAC CAGCAATACG GACAAAAGAA 900

ATCTAAAAAA GGATGGTTCT GGGGCTGTGG TGGTTGTCTA GTATTATTTA TTTTAATTAT 960

CATCGGTATT TCAGCTTGTA CAGCTGGTAT TACAGGTAAC CTTGGCGGAA ATAGTTCTAA 1020

AGAAACGAAC AAAACCCATA AAATCGGTGa AACTGTTAAA AATGGCGACC TTGAAGTCAC 1080

TGTAAATTCA GTGGAAACTA TGAAATCTGT AGGACCATCT CTTGCACCAA CAAACGCTAA 1140

AGGTATATTT GTCGTTGCTG ATGTGACGAT TAAAAACAAA GGTAAAGAAG CGTTAACAAT 1200

TGATAGTTCA ATGTTTAAGC TAAAATCCGG TGATAAAACA TTTGAAGCAG ATAATACAGG 1260

TTCAATGTCT GCTAATCAAA GTGACAATGG TAGTATAGAA AATTCATTTT TCTTACAGCG 1320

TATAAATCCA GATAGCACTG CTCAAGGTAA AATTGTTTcG ATGTGTCAGA AAACATAGCC 1380

AACGCAAAaG ATAAAAAATT AGAAGTTATT TCTAGTTTAT TTAGCGTCAA GAAGATTACA 1440

TTTGATTTAT CCGATGCTAA AAAAACATCA AAAGCTAAAA AAGACAAGCA AGATACAGAA 1500

GTAGCTGTTG CGAGTTCAAA TAGCGATAAT GTAAGTTATG AAGCTTCGGC TACTACACCT 1560

GcTACAACTT CTAGTGCGGA TACTGATTCT GAAGATAGCG AAAAGTCTAG TAAAGATGAG 1620

GATAAGCAGA ATGCGTCTAA AAgTGATAAA TCTAGTGTAG AAAAAAGTGA ATCTAATGAG 1680

GAAACTGCTC CTGTAGAGCC CATGCCCCAT AGCAAACCTA CCACTAGTGA aGCACCACCT 1740

AGCCAAAATA TTCACAaTGa AGATAGCmTG TACGACGCTT CAACAGAATA AAATtnyCAG 1800

tAGCTCGGCT ACCCTTCTTT TACGGAAAAA TTAATTATAC ATAATCaAAT CaAGGAGATA 1860

AAAAAATGAA ATTCAAAGCT ATCGTTGCAA TCACATTATC aTTGTCACTA TTAACTGCCT 1920

GTGGTGCTAA TCAACATAAA GAAAATAGTA GTAAATCAAA TGACACTAAT AAAAAGACGC 1980

AACAAACTGA CAACACTACA CAGTCAAATA CAGAAAAGCA AATGACACCA CAAGAAGCCG 2040

AAGATATAGT TCGAAACGAT TACAAAGCAA GAGGCGTTAA TGAATATCAA ACATTAAATT 2100

ATAAAACAAA TCTTGAACGA AGCAATGAAC ATGAATATTA TGTTGAACAT CTAGTCCGCG 2160

ATGCAGTTGG CACACCATTA AAACGTTGTG CTATTGTTAA TCGACACAAT GGCACAATTA 2220

TTAATATTTT TGATGATATG TCAGAAAAAG ACAAAGAAGA ATTTGAAGCA TTTAAAAAGA 2280

GAAGCCCTAA ATACAATCCA GGTATGAACA ATCATGATGA AACAGATGGT GAGTCAGAAG 2340

ACATTCAACA TCATGACATT GATAATAACA AAGCAATTCA AAATGACATA CCAGATCAAA 2400

AAGTCGACGA TAAAAATGAT AAAAATGCTG TTAATAAAGA AGAAAAACAT GATAATGGGG 2460

CAAATAATTC TGAAGAAACT AAAGTTAAAT AATGGCATAC TTTGATTAAT CGTAATTTTT 2520

ACTATGCATG GTCTTTTTAA TCAACTTAAA CTCGGCATTA TTTCAATCGA AAACGCAGAG 2640

CATACGCTTT TTACACCTTA TATGTTGGAA ACGCTCTCTT CCCTAGGCGT GAAAGACAGC 2700 

ATTGTCGATT TAATTCATAA AGGGACTGAA TTAGAAGACT TTGCGGCATT TAATTTATCA 2760 

ATTGAAGACA CAGTTACAGT CTGTTTACAA AGAACTGAAG AACTATTAAA ACAATACAAA 2820 

AATGTGGAAT TCAATGACAA AATATTAATC AATTGGCGTA TTATACAAGA GAAATAGACA 2880 

TATAAAAGTC GAATGTAACt ACGTGAGTAT TGATTTTATT CTTTGTAAAT TACAAGCATT 2940 

TCATATTATA AAGTTTGAAA AGAGGTATAT TGAAATGGAG AAAAATGAAT ATATAGCTAA 3000 

ATATAATGAA TATAGTCAAT TATTAGACGC TACATACTCG CAAGCTGTAG CATmCCTTTT 3060

AAGtAAATaT GGCGCTGTAA CCGATGATTA TTATAAAGaA AAATCATACA CGCGATTTTT 3120 

AAAtGGAGnA ATCAAAAGTA TTTCAAAAGG AAAATACACT AGAGCTAGCG AaGGATTATA 3180 

TTGCCATCAT ATAAGCGAGG ACAAATTCCA AAATCTATCT GATCTAAGAT TCATtTCCAA 3240

ATTTAAGTAC TCATACGACG TTCAAAAGAA AGAAAACTTA GTGTACTGTG ATCTAATCGA 3300 

GCATTTAATT TTACATGCAA TTATTACAAA AGAATCCCAT GGCCAATTTG GTGTAGCTGG 3360 

ATTATGTCAA ATGATCAAAC CAACAGTCAT TGATTGGTAC ATTGGCGAAT ATAATCCAAA 3420 

ACCAGCATGG ATGCAAGCCA CCAAAGCACG TGCCTATTTG CCTGGAATAT TAGTAGAGAA 3480

ATTACTCATT AAAATTGACG ATATGTTAAA AGGAATAGAA ATATAAGATT TCCTTGAGTC 3540

TAGATAAATG ATTAATGTAG ATTTATTTTT TGCTGTTGAG ATTTTGTTAT AGATGTTTAA 3600 

ACCTGTAATT AAATATATTT TATAAAATAG ACCACGCATA CCTATCTATA AACGGrCAAT 3660 

GTTTATAAAT GAGTTTGCAT GGtCTTGAAT TGTATTAAAT TTCTTTTGGT TTTAATAAAT 3720

CGACTAGATT TTCACAATAT TTATCAAATA TGTATTCCTA AATTATACAG CCTTAATCCA 3780 

GCACjtTACTT TCGAAACTTC CAACTTAGTT GATATAAGGT TCAATAGTTT GTTTCGTTCT 3840

TTTTCAGATA AACCAGAACT TAAATTGATA TTATTGACTT CATAAAAATT ATAGACTAAT 3900

GCCTCTATTT GCTTTTTAGG CATAAGTAAG TCGACTGAAA ACTGATTTAC GTCGCTTTCA 3960

TAAATCATTT CATGTAAATT CTTTAGACTA TTATCGTTAC TATCTCTCAT TAAGTCTGTA 4020 

TTTTTAAATA AATAACGGCC CAATTCACGA GCTATTGAAA ATCTTGTATT ATTAATCGAG 4080

TGATTATTAT TAATATAGAT TGTTCTTCCA CTTAAATAAC CCGAAGTATT ACCCTCCATT 4140 

TTAATATATC TAACATTTAA ATTAAGTTGA AATAATAGCT TGTCTATGTC AATAGCAAAG 4200 

TGTTCAGAAG TAATAAAAAG TTGATCCATT TTGTCCTTTA TAAATGCCTG AAATAATCGA 4260 

ACTATTTTTG ATTCTAAAAT ATCTTCATAA TGAACTTTCT CAATAACTTT CAATTGATTC 4320 
AACATTATTA AAATAAAACC CCTCTACTAC TATATGTAAC GAAGGGACAT GATTTCAAAA 4440

TAAAATACCT TTTTTATAAA TnTATTATAA TATCCCCCAC TATACnAC 4488

(2) INFORMATION FOR SEQ ID NO: 229: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 229:

TATGGCGCCA TATTAGTTGT AACTGGTTTA AGAGGTCcAa GAAaTATCaA ATAAAGTTGT 60

tCCTGGGCTT GGTACTGTTA TCTCaATATT GmwTGCaTTT GGTGGTCTAG CTTTTAATAT 120

TGGTAATATT GCTGGTGCCG GTTTAGGTTT AAATGCAATT TTTGGATTAG ATGTAAAATG 180

GGGCGCAGCT ATTACTGCAA TCTTTGCAAT ATTAATCTTT GTAAGTAAAA GTGGCCAAAA 240

AATTATGGAC GTTGTTTCAA TGATTCTTGG TATTGTGATG ATTTTAGTTG TGGCATATGT 300

GATGTTTGTT TCTAATCCAC CTTATGGTGA TGCTTTTGTG CATACATTTG CGCCAGAACA 360

TCCAATGAAA TTAGTCTTGC CCATCATTAC GTTAGTTGGT GGAAcTGTar GTGGTTATAT 420

TACCTTTGCA GGTGCACATC GTATATTAGA CTCTGGCATT AAAGGTAAGC AATATTTACC 480

ATTTGTAAAT CAATCAGCAA TTGCTGGTAT TTTAACTACA GGTATTATGA GAACGTTACT 540

ATTCCTAGCG GTATTAGGAG TTGTTGTAAC AGGTGTGACA CTAAGTTCTG AAAATCCACC 600

AGCGTCAGTT TTTGAACACG CAATTGGACC AATTGGAAAG AATATTTTTG GTATTGTGTT 660

ATTTGCTGCA GCTATGTCAT CAGTAATTGG CTCAGCATAC ACAAGCGCAA CATTTTTAAA 720

AACACTTCAT AAATCACTTA ACGAAAGAAG TAATTTAATT GTGATTGTGT TTATCGTTAT 780

TTCAACAATG ATTTTCTTAT TTATTGGAAA ACCAATCAGC CTTTTAATTA TAGCAGGCGC 840

GATAAA 846

(2) INFORMATION FOR SEQ ID NO: 230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2072 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 230:

TCTTTTTAAA AGGTACTAAT ATTTCTTTAG TGAAAATTGA ATCACGGTCG TTTATTGGTG 120

CCTTGAGTAT ATTATTATAG ACGGAATCTG ATCTAATAAT ATTGATTTTA TACATGATAA 180

ACCTCCTTAT GTTGTCAGCA TAAAGGATAA CGTAACGTGA TTTTCAAGCA GTAATTGTAA 240

CTAATTGAmA AAAATTAAGA AAAGTATGTG AGTGTTCCTA AwTAATATGa TTAAAATGAT 300

GGCGAATAAG TGTCTaAAAG CATCTTAAAG GGACATTGTA TAGGGTAAAT CACTTCATAA 360

ATAAGGGaAA ATCCTTATGT TCACTTTTTC ACAATCATnA TAAAATATAT ATGTAGTCAA 420

TACTTTGTCT ATATTGAATG TTTTCATATA AATGAAAGCA TTTTTAAATA ACATTGACCT 480

CTAATATATA GGCAGAGTAT TGATATCTAT TAAAAAATAA ATGATTTTGA TGAAGGTGAA 540

ACGTATGTAC AAAACAAAAG GTGGCTTTCA ACTTACATTA CAAACATTAA GTTTAGTGGT 600

TGGGTTTATG GCTTGGAGTA TAATTGCGCC ATTAATGCCC TTTATTAAAC AAGATGTCAA 660

TGTTACTGAA GGTCAAATAT CAATCATTTT AGCGATACCA GTTATTTTGG GATCGGTGCT 720

CCGTGTGCCA TTTGGTTATT TAACAAACAT TGTTGGCGCT AAATGGGTAT TCTTTACTAG 780

TTTTATCGTA TTGTTATTCC CGATATTTTT CTTAAGCCAA GCACAAACAC CGGGTATGTT 840

AATGGCTTCA GGATTTTTCC TTGGTGTAGG TGGTGCAATT TTCTCAGTTG GTGTTACATC 900

AGTTCCTAAA TATTTCCCTA AAGAAAAAGT AGGTCTAGCA AATGGTATTT ATGGTATGGG 960

AAATATCGGT ACAGCAGTTT CTTCATTTTT AGCACCACCG ATAGCGGGTA TTATTGGTTG 1020

GCAAACAACA GTTAGAAGTT ACTTAATTAT TATCGCTTTA TTTGCATTAA TTATGTTCAT 1080

TTTTGGTGAC ACACAAGAAC GTAAAATTAA AGTACCATTA ATGGCtCAAA TGAAAmCATT 1140

ATCTAAAAAC TACAAATTAT ATTACTTAAG TTATTGGTAT TTTATTACTT TTGGTGCTTT 1200

TGTAGCATTT GGTATTTTCT TACCTAACTA CTTAGTAAAT CATTTTGGAA TTGACAAAGT 1260

AGATGCTGGT ATTCGATCAG GTGTATTCAT TGCGCTGGCA ACATTCTTAA GACCAATAGG 1320

TGGCATTTTA GGTGATAAAT TTAATGCAGT TAAAGTATTG ATGATTGATT TTGTTGTTAT 1380

GATTATCGGT GCCATTATTT TAGGTATTTC AGACCATATC GCATTATTCA CTGTAGGCTG 1440

TTTAACAATA AGTATTTGTG CAGGTATTGG TAACGGCTTA ATCTTCAAAT TAGTACCATC 1500

ATACTTCTTA AATGAAGCGG GATCCGCAAA TGGTATCGTA TCAATGATGG GTGGTTTAGG 1560

AGGATTCTTC CCACCACTAG TAATCACGTA CGTAGCTAAT TTAACAGGAT CAAGTCATTT 1620

AGCATTTATT TTCTTAGCGG TATTnGGAnG TATTGCATTA TTTACCATGC GTCATTTATA 1680

CCAAAAAGAA TATGGCTCAT TGAAaAACGG TTGATATGTA ATACATGCCA TTcATTTAGT 1740

TAAATACAAA GCCTTaATAT CATGCGCAAT ATTCGTAGCA TGACATTAAG GCTTTAGTAG 1800



CTTGGTTTGA TTTTAGGCAA GGTAATGGTT AATAACCCAT TTTCAAAACT AGCAGTAATA 1920 

TGTTGCTTAT CAACAGCTTC AAAATCAAAT TGACGCATTA ATGATTCGAA GTTACGCTCA 1980 

TCTAAAATGA GTTGTTCAGA TTTGTATTTT GCGCTTCTAG TAGCTTGAAT AGTGAGCGwA 2040 

TTAttATTGA AATCgATACT AATAtCTccC TG 2072 



(2) INFORMATION FOR SEQ ID NO: 231: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 231:

CGTCTTCTCT TGGTTATTCT CTGTGTTTTG TCTTTGTTCA AATTCGATTT TTTGTTGTTC 60

GAATGAATGT AATACTTTAT TTTTTTTTGG CACATAATCC AAGTnATTTT TAGGAATTAA 120

TATACGGTCT TTAAATGCTT CTTGTATTTT GCTCACAATC AATTGGTATA GTTCCTCTTC 180

TTTTGATAAA CGCACTTCTA GTTTTGTTGG ATGAACATTT ACGTCTACTA AGATTGGATC 240

CATTTCAATA TTAATATAAC AAATCGGGAA CCTACCTATT GTTAAGAGTG TATGATAGCC 300

TTCTAAAATC GCTTTATTTA GCATAAAGTT TTTAATGTAT CGTCCATTAA TAAAAATAGA 360

AATATAGTGC TTATTACTTC TAGAATGTTC AGGCTTTGCA ACAAAACCTT CAATGTGATA 420

ATCACTTGTA TCTCCAGATA TATGTACTAA ATCTCGTGCA ACTTTCATCC CATAAATCTC 480

TGCCATCACT TCATTAGTTC GTCCTGAACC ATTTGTACTT AACATTGTTT TGCCATCTGA 540

AATGAGTGCT ATTCGAATGT CCGGATGGCT CATTGCCATT CTGTTGACAA TATCTGTTAT 600

TTTACCTAGT TCAGTGTATA AACTTTTAAT ATATTTTAAA CGTGCTGGTG TATTATAAAA 660

TAATGATTCT ACAAGTATAT CTGTTCCTTT TTTCGCTTTT GCAGGCTTAT GATTTAATAT 720

TTCACCATTT TCTACATATA TTTCATTTCC ATTAGCATTA TCCGTGCAAG TCTTCAATGT 780

TACTTTAGCA ACTGATGAAA TACTGGCTAG TGCTTCACCA CGGAATCCTA ATGTCCTAAT 840

ATGAAATAAA TCTTCATCTT GATCTAATTT ACTAGTCGCA TGTCTATGAA ATACTAATCC 900

TAAGTCTTCC GCTTCAATTC CGCTTCCATT ATCGACTACG CGAATAGATT GGACGCCAGA 960

tTCCTCTACT TCAATGCTTA TTTCTGTAGC GCCTGCATCT ATAGCATTTT CCAACAGTTC 1020

TTTCACAACA GAACTCGGTC TTTCAACTAC TTCACCTGCT GCGATTTTAT TTGCTAATGA 1080

GGTTTGGAGT TCTTTAATTT TCCCCATTTT GCAACACCTC TATTTTAATT GATTTTGTAA 1140

TTGTAGTTCA ATCTCGCTTT TTTGATCATT TTCAAACAAA TcAAATGATG CyTGTTCAAA 1260

GTCTTTTTGA GATAAAGTAT CaGTTGTTTC TTCaACACTT aAGTTTAAAT TTTCTTGATT 1320

AATTTCCAGG TTCATTTTCG ACCATTTTTA AATTTGATAT CGATGATtTT TCACCAGCAG 1380

ACGCTTCAAA CTCGCTTAGA ATCACTTGTG CTCTGCTAAT AACTTTTTCA GGTAAATCAG 1440

CTAATTTCGC AACTTGAATA CCATAAtATC GTCAACTGCA CCATCTTTGA CTTTATGCAA 1500

GAATATAAGT TCACCTTTAT ATTCATTAGC AGCGACGTGA ACATTTTTTA GACTTGGTAA 1560

TCCTTGATCT AATGTTGTCA ATTCATGATA ATGTGTTGAA AATAACGTTT TAGCATGTGA 1620

TGTTTCAGCT ACATACTCTA TCATTGGCTG CGCTAAAGCT AAACCGTCAT ACGTTGAAGT 1680

ACCACGTCCA ATTTCATCGA AAATAATCAA ACTATCCTCT GTTGCATAAG TTAATGCCTT 1740

TTGTGCTTCT AGCATTTCTA CCATAAACGT ACTCTTACCT GAAACCAAAT CATCTGCCGC 1800

ACCTATTCTA GTGAATATTT GATCAAATAT AGGTAACACT GCCTCTTTAC AAGGGACATA 1860

AGCTCCCATT TGGGCCATTA TACTAA7TAT GGCAACTTGT CTCATATATG TCGATTTACC 1920

AGACATATTC GGACCTGTAA TTAAATATAT AAATGTTTCA TTATCTAATC GACAATTATT 1980

AGGCACATAG TCATTATAAT CCATTACTCT TTCCACTACT GGGTGCCTAG ATTCCACTAA 2040

TTCTAATGTT TTATTTTCAC TAAATGAAGG CCTAGTGTAA TTATATTTTT GAGCAATTTC 2100

TGCAAAGCTC TGTAAACAAT CTAGCTCTGA AATAATTTTA GCTTGTTGTT GTAAACGTTC 2160

agtatatttt ttaacttctt cacgtagctg AACAAATAAT TGATATTCTA ATTCGATGGC 2220

TTTGTCTTCC GCACCTAAAA TGATATCTTC TTTTTCTTTA AGTTCATCAG TTATAAAACG 2280

TTCAGCATTC GATAACGTTT GCTTCCTCAT ATAACCAAAT TCACTTGGTT CAAAATTTTG 2340

CAAGTTGGCA CGTGTTATTT CTATAAAATA ACCAAACACT TTATTAAAGC TTATTTTCAA 2400

TGATTTTATT CCTGTACGTT GTCTTTCTTT GGCTTGTAAT TCTGCTAACC ATGTTTTTCC 2460

GTTTTTTGAA GCTTCAAGAT ATTCATCTAA TTGCGTATTA AAACCAACTT TGAATAGTCC 2520

GCCATCTTTA ACTGAAATTG GTGGTTCTTC TACTAAACTC TGTTCTAATA TATCAAGTAA 2580

ATCATCaAGG GGTTCTAGTT GATTAACTTG TACAAGAGTA TTCTGATTCA TAGAATTTAG 2640

TAATGCTTTA ATATTCGGTA TTTCAGAAAT GGAATGTTTA AGTTGAATTA AATCTCTCGC 2700

ATTAACATTT CCGTAACTAA CACGCCCAAC AAGACGTTCA ATATCATACA CTTGATTAAG 2760

ATATGTTCTT AAGGTGTCTC TTTCTATGAA ATGAGCACTA AATTCATCAA CGATATCTAA 2820

TCGTGCTTCA ATTTGTTCTT TACTTATTAG TGGTCTATCT ATCCATTGTT TTAAGCGGCG 2880

TGCTCCCATT GGTGTTTTCG TTTCGTCCAT TAGCCAAAGT AGCGTTCCTT TTTTTGATTT 2940

ATCTATAGCT GCATATTGAA CAACATCCTC GATATGCGAT AAATCACGTT TTTGTGTATG 3060

ATGAATATAA TCTAGCAATA ATTGTGTCGC TTGATACaTT AATTTATGTT CAGTTTGATT 3120 

CACACTATAG ATTtCTGATG ATAACGTTTC CCTGACTGT 3159

(2) INFORMATION FOR SEQ ID NO: 232: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1238 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 232:

TAAGCGAGAA GCAATTGGTG TTATGTATGC TAGTGATAAA CCAACAGGTG AAAGTACAAG 60 

GTCATTTGCT GTTTATTTCT CTCCTGAAAT TAAGAAATTT ATTGCAGATA ATTTAGATAA 120

ATAAATCATC CATCCATACA TTGATAAATG ATTTTyAGAA ATTAACAACA AAATCAACAA 180 

TTTTAAACAT CTCTGTGATT CTATTTATTC GAAATGATTT AAAAAATAAA ACTTCAAAAA 240 

CCTAACCTTA TATTTATACG AATACTTAGA GGAGCACAAA AATGAATAAA AATATAATCA 300 

TCAAAAGTAT TGCAGCATTG ACGATTTTAA CATCAGTGAC TGGCGTCGGC ACAACAGTGG 360 

TTGAGGGTAT TCAACAAACG GCTAAAGCTG AACATAATGT GAAACTAATC AAAAATACTA 420 

ATGTAGCACC ATACAATGGT GTCGTTTCGA TAGGATCTGG AACAGGTTTC ATTGTCGGTA 480

AAAATACAAT TGTTACCAAC AAGCATGTCG TTGCAGGTAT GGAAATTGGT GCACATATTA 540 

TAGCGCATCC CAATGGTGAA TATAATAATG GCGGATTTTA TAAAGTTAAA AAAATTGTCC 600 

GTTATTCAGG TCAAGAAGAT ATTGCCATTC TACATGTGGA AGATAAAGCT GTTCATCCAA 660 

AAAACAGGAA TTTTAAAGAT TACACAGGCA TTTTAAAAAT AGCATCAGAA GCTAAAGAAA 720 

ATGAACGCAT TTCAATTGTT GGCTATCCAG AACCATATAT AAATAAATTT CAAATGTATG 780

AGTCAACAGG AAAAGTGCTG TCAGTTAAAG GCAACATGAT TATTACTGAT GCTTTCGTAG 840

AACCAGGCAA CTCAGGTTCA GCTGTATTTA ACAGTAAATA CGAaGTtGTA GGTGTTCACT 900 

TTGGTGGAAA CGGCCCTGGA AATAAAAGTA CAAAAGGATA TGGTGTTTAT TTCTCTCCTG 960 

AAATTAAGAA ATTCATTGCA GATAACACAG ATAAATAAAT CCTTACATAG ATAAATGATT 1020

TTAAAAATTA ACAACAAACT CAACaATTCA AATCATCTCT GTGATTCCAT TTATTCGAAA 1080 

TGATTAAAAA AAATAAAACT TCAAAAAGCT AACATTATAA TTATACAAAT ACTTAGAGGA 1140 

GCAGAAAAAT GaATAAAAAT ATAATCATCA AAAGTATTGC AGCATTGGAC GATTTTTAAC 1200 

(2) INFORMATION FOR SEQ ID NO: 233:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6444 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 233:





TGATAAGTCA TTTAAATTGT CACCTATTGA CATGACTTCT TTCATTTCAA TCCCTAATCT 60

TTCGGCAATT GTTTCTAGCG CAATACCTTT TTGTGCATCT GAATGCGTTA TTTCTATATT 120

TCCTCTCGAA GATGATGATA TAGCTAAATT CGGAGAkTCA GCTAAAATTT TACTAGCTTT 180

GTCAATTTTT TCTAAATTTC CATCAAATGC TAATATTTTC ATAATTAATT CACCAGGTAT 240

GTTTTCAATA GCATCATAAT TATCAACAAC TyTCAACGTA CCATTATCTA TGCGTCTTTG 300

AATACCATTT TTAATACGCT CAACGTTTGC ATGTTGACCT GCACGCTCAG CAATATCTAT 360

GTAAATGTCT AAATCTCTTT GTGGATCTTC AGTATAAATC GCACGACTCG TGTATACTTG 420

ATAATAAATA CCTGCATCTT TTAAAACATT TGTAATTTTG TGTACTAACG ATTTATTAAG 480

GTGTGAAGTG CTCATTACAT TGAAAGTTTC ATCACGTACT TCAGCACCAT TCAAACAAAT 540

ATATGGTACT GTTAAATCTG TGTCAGCAAC TGGTGCTTGk GCTTCATAAA ATGCTCGACC 600

TGTCGCGATA ACAACCGTTA TCCCTTGTTC TTGAGCGTAT TTAATCGCAT CAATATTAGG 660

TTGAGAAATT TCATGTGCTG CATTAAGTAG CGTGCCATCC ATATCAGTGG CTATTAGTTT 720

TATCATTATG TnACCTCGTT TCGTAAATnT AAAATCTTGT TCTTAAATAA GrATATATAC 780

TCAGCGCACA TACTTTtCTA TTAmCATTTA TATkGTCATT aATTTATCAT ATAATGTAAT 840

TCTaACAAAT nTTAAtTAGT ATGTACTATC GTCTAATTGG TGGATTTCTT ATTGGCTCTT 900

AAgTTTTTAA AAAATGTTGT TAATAATGTG CTACATGCTT CTTTAAGTAC ACCTTTATCA 960

ACAATTGCAC GATGATTAAA ATTAGATTGT TGCAATAAAT TCATTAAACT GCcACTACAA 1020

CCACCTTTAG GATCATCTGC GCCATAGACG ACTCTTGGAA TGCGACTCAT TACAATTGTT 1080

CCTGCGCACA TGACACATGG TTCTAAGGTT ACATATAATG TGCAACCTTC TAAACGCCAA 1140

CTACCTAACA CTTTGGCTGC ACGTTCAATT GCAATATGTT CAGCATGCGC CGTTGGTTGT 1200

TGTAGTGTTT CTCTTaAATT ATGTGCTCTA GCGATAACTT CATCATCTTT AGTGATGATA 1260

GCACCTATAG GTACTTCGCC TAGTTGAGCT GCTTTTTTAG CTTCTTCAAT CGCTAATGTC 1320

ATAAAATATA TATCATTTGT CATTTATGTC CAGATACCTC ACTTATGGTA CAATACTCAA 1380

CTATTGGCGT AGGTAAATCT TCACTTGCAC ACAAATTAAG TCAAACTTTA GATTTTTATG 1500

AAGAAAAAGA AATCATCACA GAAAATCCAT TTTTATCAGA CTTTTATGAA GATATCTCTA 1560

AATGGAGTTT TCAAACTGAA ATGTTCTTTT TATGCAATAG ATATAAGCAA TTTCAAGATG 1620

TAACACAACT AAATCAAGGT GTAGTTAGTG ATTATCATAT ACATAAAAAT AAGATATTTG 1680

CTAAAAATAC TTTGAGTTCT GTTGAATTTC AGAAATTCAG TAAAATTTAT GATATTTTAA 1740

CTGAAGATAT GATTATGCCG AATATGATTA TCTTTTTAGA TGCAGACCTT GATGTGTTAA 1800

AATCTAGAAT TGCTAAACGT AACCGTAGTT TTGAGCATCA AATAGAaGtG AtAcTGTaAg 1860

TTAAAAAAAG ATTATCGTGA GTATTATGAG TCCTTACAAA GTAATGGTTC AAATGTAGTT 1920

TTAATCGATA CnACTTCTAT TGATTTTCTT AAAAATGAAC AAGATTACGA AGATATATTA 1980

CATATTATAT TACCTATGAT AGGAGATATT ACCAATGAAT AATTACGGTA TTCCACAAAA 2040

TGCCATTATA ACCATTGCAG GTACAGTTGG TGTTGGAAAA TCAACACTAA SGCAAGCACT 2100

TGCAGATAAA TTAAACTTTA AAACGTCTTT TGAAAATGTC GAACATAATC CATATTTAGA 2160

TAAATTTTAC AGCGATTTTG AACGATGGAG TTTCCATTTG CAAATTTACT TCTTAGCTGA 2220

ACGTTTTAAA GAACAAAAGC GTATGTTTGA ATATGGTGGT GGCTTTGTCC AAGATCGATC 2280

AATTTATGAA GATGTTGATA TTTTTGCAAA AATGCATGAA GAAGAAGGCA CAATGAGTAA 2340

AGAAGATTTC AAAACATATT CAGACTTATT TAATGCCATG GTCATGACAC CTTATTTTCC 2400

TAAACCTGAT GTAATGATTT ATTTAgAATG TAACTATGAT GAGGTCATTG ATCGTATTAT 2460

TGAACGTGGT CGCGAAATGG AAATTAATAC AGACCCTGAA TACTGGAAAA AGCTATTTAA 2520

ACGCTATGAC GATTGGATTA ATAGCTTTAA TGCATGTCCA GTTGTACGTA TCAATATTAA 2580

TGAATATGAT ATCCATAAGG ACCCCGAATC TTTAAATCCT ATGATAAACA AAATTGCTCG 2640

AATTATTCAA ACATATCGAC AAGTAGATAC ACGATAAAAG ACTAAAGACA TAGCGTATAT 2700

GTTTATATTC AATGTATATT CCATAGATAT TATCGATTAT TTTATCAATT CTATCGAATA 2760

CATTAATTCA CATATACACT ATGTCTTTCT TTTTAATTTA AAGCTTCTAA AATATCTGCC 2820

GCACTATTTA AAATAATATC AGCTTCATGT AATTCTTCTT TTGTTGCAAT ACCTGTTAAT 2880

ACACCTATTG CCATACCTAA ATTTGCATTA CTTGCTGTCT TCATATCATT AGCAGTGTCT 2940

CCTACTATAG CTACTTTCTG AGGATCTACA TTATATTGCT CAAATAAAGG CGATAATACT 3000

TTAGGATTTG GCTTCTCATA GGCATCCGCT TCGGTAGAAA TGATCAAATC GAACAACGAG 3060

GTAGCATTGG TATGTGCTAA AAATTGTTCT ACACCTTTTT TAGTATCACT CGTAACAATA 3120

CCAAGTTGAT AGCCTTTTGC TTTCAAATCG ATAAGTGCTT CTTTAACACC TTCTACCCAA 3180

GTATCTTGTC CCGTCACATC ATTAAATGCC TGGATAATTT GTTGTAAAGA TCCTGAACCC 3300

ATCACTGATT TTGGATCAAT AGATTCTTTA ATGACACCGA GTTGTCTTAA AGCAGCTTCT 3360

TTATTATGTA CTGGGAAAGT CTCAAGCAAT GATTGTACAA ATCGTACCCC TATTTTTTCC 3420

CAACTTCTAT CAAATTCAAT TAACGTACCA TCTTTATCAA ATAATATCCA TTCCATTGaT 3480

ATCAATACTC CTATTTATTT ATTTCGTATT ATGCTGATTC TATGATATTC GTTATCCCCT 3540

GAAAATGAAC TCGTAGTATT GTTCTATTTA AATATTGaAT TAAATATAAT AATAAGTGAA 3600

ATCCCCTTCA ATACTTAACA ATAAACATTG TAAACTTAAT TTATTACCAT GCTTCGCTTC 3660

ATTGAAAGGG ATTTTAGTCA TGATTAACTT TTGCATATTG TTTTCATGAT TATATTCAAT 3720

TTTTATTAAT ATTTTGGTAC AACGACTCTC CAACCATTTT TATCTTCTAA AGTACdATTT 3780

TGAATACCAG TATAGACGTC GTATAATTTT TGAGTAATTT CACCAGTCTC ATTATTATTA 3840

ATAACGATTT CACGATCTTC GTATCTCAAT GTAGCCACAG GTGAAATAAC TGCTGCAGTA 3900

CCACTACCAA ATACTTCTGT TAACTCACCT TTATCATATG ATTCGAATAA TTCATCGATT 3960

GAAACGCGGC GCTCTTCGAC TTCATATCCT AAGTTTTTAG CTAATTCGAT AATAGATTTA 4020

CGTGTAATAC CAGGTAAAAT ACTGCCATTC AACTCTGGTG TAATTACTTT GCCATTTTCA 4080

ACGAAGAAAA TGTTCATGCT ACCAACTTCT TCGATATATT TCTGTTCAAC ACCATCAAGC 4140

CATAATACTT GGTCATAACC TAATTTATTT GCATTAGTTT GTGCTAATAA ACTTGcCGCA 4200

TAGTTACCTG CAACTTTTGC AAAGCCTACA CCGCCACGaA CAGCACGCAC ATATTCATCT 4260

TCTACATAGA TTTTAGTTGG TTTTAAAGTT TCACCACCAT AATATGCACC TGAAGGAGAT 4320

AAAATAATTA ATAATTTATA CTGATGTGAT GCACCAACGC CAAGTGCCCC TTCTGTTGCA 4380

AAAACAAATG GACGAATATA TAATGATTGA CCTTCCCCTT CAGGAATCCA ATCTCTTTCA 4440

ATAT€AACTA ATTGTTTTAG CCCCTCTAAC AATTCTGCTT CGTCTACTTG AGGCATTTCT 4500

AATCGTGCTA ACGAGTTATT AAGACGCTTA AAATTTTCTT CAGGACGGAA AAGTGCAACT 4560

TCCCCATCTC TTTTATATGC TTTTAATCCT TCGAATACCG ATTGACCATA ATGAACACCT 4620

TGTGCAGCAG GTGAAATTTC AATAGGACCA TAAGGTACTA TCTTCAAATC ATGCCATCCT 4680

TTATCTGCAT CATAATCATA ACTCAACATA TAATCAGTAA AATATTTACC AAAACCTAGT 4740

TGAGATGTAT TTGGTTTTTG TTTTAATGTT TCTCGTCGTT CAACTTTAAC TGCTTGTGAC 4800

ATGGTGATTG CCTCCTAATA ATATTGTATA AGAATTTGTT TAACTTAAAT TATAACAATC 4860

CaTATTTTGC TGTTCAACAA ATTTTCTAAA AATTCAAAAT TAATTAACAG ATTTCTAGAA 4920

AGACTATATC TTTTAGTATA AACGTATTAA TTTCACAGAG ACAAGTAATC TGTGTTTTAC 4980

TAAGTATAAT GAATAATATT AGAATTCATG CACTAGTTTA TTAAAATAAA GAGTAATTTA 5100

AAATATCATT CCGTGTATTA AAGTGAATGG AAATGATTAG TTATTATTTT TAACAGTATC 5160

TTTTTGTTCA ATAGCTTCTA ACATTAATTT AGTCATGCTC GCTAAATCAT ATTTAGGATC 5220

AAATCCCCAT TCGCCACGTG AACAGCTTGT ATCAATAGAA TCCGGCCAAC TATTAGCGAT 5280

ACCTTGTCTA ATAGGATCAA CATCGTAATC TAATGTAAAA TTGGGATAGT ATTCTTGAAT 5340

TGCTTCTTTT ACCATCTCTG GATCAAAACT CATTGCGCTC AAATTATAAC CATTTCTAGT 5400

TTCTAATTTA GCGTCGTCTG CTTCCATAAG TTTAATAATT GCTTCAATTG CATCATCCAT 5460

ATACATCATA TCCATATACG TGCCTTTATC TATGAAGCTT GTATAATGAC CCTCTCTTAC 5520

TGCTTTGAAG TATATTTCAA CAGCATAGTC TGTAGTACCG CCACCTGGCT CTTTAACATG 5580

CGAGATTAAA CCTGGGAATC TAACACTTCT TGTATCTACA CCAAAACGTT TGAAATAGTA 5640

TTGACACAAT AATTCTCCAG CTACTTTATT TACACCATAC ATTGTCGTAG GTTGCTGAAT 5700

CGTTACTTGT GGCGTATTAA CTTTAGGAGT TGAGTCTCCA AATGCACCAA TTGAACTTGG 5760

TGTGAAAAAG TGCAAATTAT AAGTTCTTGC AGCTTCTAAT GCATTCATTA ATCCACCCAT 5820

ATTTAAATCC CAAGCTAGAA TTGGATTTTT CTCAGCAGTT GCTGATAATA ATGCTGCCAT 5880

ATGCATTAGA CTATCCGCTT CAAAGTCCCT AACTAACTCA AACATACGGT CACGATCTGT 5940

TACGTCTAAG ATTTCAAATG GTCCATTTTG TACAGGTGAG TCTGCTTCAG GTTCCCTAAT 6000

ATCTGTAGCA AGAACATTAT CTGTCCCATA AATTTCTCTG CACTTAACAA CTAATTCTGT 6060

ACCAATTTGT CCTAATGCAC CAGTAATCAT AATTTTTTTC ATAGAAATAT CTCCTTTGtC 6120

TCTTCTATAT AGCTATAGTC CATCACAAGC GGaCATAATA TTCATTTTCA TAATAATTAT 6180

AATATAAAAG CGCTTTCTTG TATATATGAC ATGTACATGT TGCTGATATk TCTGTAAATG 6240

GAAATTCTAG TTGTATTAAT TGATTTTAGT AATTTATAGC GTTTATTATT GCTAATTACT 6300

GtCAAATTAA ATTTTTTATC CCTCAACTCT TAAACTCTGG ATATCTTTCA TTATATTAGC 6360

TTTTTTATAA CCATGGATAT CATGTAAAGC CTTATAAGCn TTAAATAATG TTTCATACCT 6420

TTGTACTTnT TCCGCTTCTG GATT 6444

(2) INFORMATION FOR SEQ ID NO: 234:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4721 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

EP 0 786 519 A2 

GCCCATGAGA CAATTTTACT TGCTTTTCCC ATTGGTTATC ACGTTCTTAT TACATAGATT 60 

TAAACCGAGA AATATTATTC AAACGCTATT TATTGTATCG TTGATTTCTT TAGGACTTAT 120 

GATAGTGATT CATTTCATCA CTGGAGATAA TTCACGTGTG TATTTTGGGA CAGATACACG 180 

ACTGCAAACT TTATTGCTTG GTTGTATATT AGCATTTATT TGGCCTCCGT TTGCTTTGAA 240 

AAAAGATATT TCTAAAAAGA TTGTCGTATC ATTAGATATT ATAGGGATAT CTGGTTTTGC 300 

GGTTCTAATG ACTTTGTTCT TTATAGTTGG AGACCAAGAT CAATGGATCT ATAATGGAGG 360 

ATTTTACATT ATATCATTTG CAACTTTATT CATTATTGCA ATTGCGGTAC ATCCTTCTAG 420 

TTTATTTGCT AAATTTTTAA GTATGAAACC TTTACTAATT ATAGGTAAAC GATCATATAG 480 

CTTATACTTA TGGCATTATC CTATCATTGT TTTTGTGAAC AGTTATTACG TACAAGGACA 540 

AATACCGGTA TACGTTTATA TTATAGAAAT TTTGTTAACA GCGTTAATGG CTGAAATTTC 600 

GTATCGCTTT ATTGAAACAC CTATACGTAA AAAAGGATTT AAAGCTTTTG CATTTTTACC 660 

TAAAAAGAAG GGGCAATTTG CTAGAACAGT GTTAGTTATC CTATTATTGG TTCCGTCTAT 720 

CGTTGTGCTC AGTGGACAGT TTGATGCACT TGGCAAACAA CATGAAGCCG AGAAGAAAGA 780 

GAAGAAGACG GAATTTAAAA CAACGAAGAA AAAAGTCGTT AAAAAAGATA AGCAAGAGGA 840

TAAGCAGACA GCGAATAGCA AAGAGGATAT TAAAAAGTCA TCACCACTAT TAATTGGTGA 900 

CTCGGTCATG GTGGATATTG GTAATGTCTT TACTAAGAAA ATACCAAATG CACAAATTGA 960 

TGGTAAAGTT GGACGGCAAC TCGTTGATGC TACACCAATT GTGAAATCGC AATATAAAGA 1020 

CTATGCTAAA AAAGGTCAAA AAGTTGTAGT AGAGCTTGGT ACAAATGGGG CATTTACGAA 1080

AGATCAATTA AATGAACTAT TGGATAGTTT TGGAAAAGCA GACATATATT TAGTTTCTAT 1140 

TAGAGTACCT AGAGATTATG AAGGTAGAAT AAATAAATTA ATTTATGAGG CAGCTGAAAA 1200 

GCGCTcTAAT GTACATCTAG TCGATTGGTA TAAAGCTTCT GCAGGTCATC CGGAATACTT 1260 

TGCATATGAC GGTATTCACT TAGAATATGC AGGTAGTAAA GCGCTGACTG ATTTGATTGT 1320 

AAAAACGATG GAAACACATG CTACAAATAA GAAATAATTT GATGCACTAA ACTTTTGAAA 1380 

TATTACATTA CTTCTGATAT TTATTATCAA AAATGATGTA TTTCATTAAA AGTTTAGTGC 1440 

TTTTTTATTT TCAAATCCCA TAGTAACGGT GCAGAAAAAG TGTTGTAAAC ATTCTAATTG 1500

GTATATTACA TTCAATGAAG CTTTATTAGG AACAGATTAC ATTATGATAA CAAAGCCCGC 1560 

AAGACACCTA ATCTCTGTTA TAGTTTGTTT TGTCGCAAAA CTATAAAAGT TATAATTGTT 1620 

TGCATACTAA AAAAATAAAA AATATAAAAT TTAAAATAAT TGAGTCGCTA ATGACTATAT 1680

GTATAGAGTG TTTTGATTAT TGGGAGGATA TTTAATTATG AAAAAAATCG TTACAGCTAC 1740 
ACAAAATAAC AATGGATATA ATTCTAATGA CGCTCAATCA TACAGCTATA CGTATACAAT 1860

TGATGCACAA GGTAATTATC ATTACACTTG GACAGGAAAT TGGAATCCAA GTCAATTAAC 1920

GCAAAACAAC ACATACTACT ACAACAACTA CAATACTTAT AGTTATAACA ATGCATCTTA 1980

CAATAACTAC TATAATCATT CATATCAATA CAATAACTAT ACAAACAATA GTCAAACAGC 2040

AACAAATAAC TATTATACTG GTGGTTCAGG TGCAAGTTAT AGCACAACAA GTAATAATGT 2100

TCATGTGACT ACAACTGCAG CGCCATCTTC AAATGGTCGT TCAATTTCTA ATGGTTATGC 2160

ATCAGGAAGT AACTTATATA CTTCAGGACA ATGTACTTAT TATGTATTTG ATCGTGTTGG 2220

TGGGAAAATT GGTTCAACAT GGGGTAACGC AAGTAATTGG GctAACGCAG CTGCATCATC 2280

TGGCTATACA GTGAACAATA CACCAAAAGT TGGTGCTATC ATGCAAACAA CACAAGGCTA 2340

TTACGGTCAT GTTGCTTACG TTGAAGGCGT TAACAGCAAC GGTTCTGTTC GTGTTTCAGA 2400

AATGAACTAT GGACATGGTG CTGGTGTGGT TACGTCTCGT ACAATTTCAG CAAACCAAGC 2460

AGGTTCATAT AATTTCATTC ATTAATCAAA TGTAAATCAA ATGACGTCAA TATTCTCTAA 2520

CATGAGAGTA TTGGCGTTTT TGTTTTATAT AAATATAAAT GAGAGCGGTT TATTCACTGA 2580

TCTTTAGGGA ACTAAGTAAT AAAGTGATAA TTTATACTAT GTCAGTATGA TTGAGAGTGA 2640

TTCAATTTAG ATGAAAACCA TGAAAAAATA TATTAAAACA GCATTTTTTT GTAGTATGTA 2700

TTGGTTAATT GTTCAACTAA ATATAGCAAA TTTAGGTACA AGAATTCCTG ATAAGTATTT 2760

TCGTCAGAAG TACATAATAT TTAAATCATT TAACTTTGAG AAGCATGGAA AATTTTGGAA 2820

CAAATGGTTT TACGTAAGAA AATGGAAACA TAAGATTTTA GATGGTCATC AGCTTAATCA 2880

AAATATATAT GATCAGCGTC ATTTAATGAC AATCAATACT GATGAAATTG AAAAAATGAT 2940

TATAGAGACA AAGAGGGCAG AGTTGATTCA TTGGATATCG ATACTTCCAG TCATCATATT 3000

CAATAAAGGC CCTCGTTTAG TAAAGTATAT AAATATTTTC TATGCAATGA TAGCTAATGT 3060

TCCAATCATT ATTGTGCAAC GCTATAATCG ACCGAGATTA ACGCAGTTAC TACGCATATT 3120

AAAACGAAGA GGTGAACGTC ATGACTAAAC ATATCATCGT TATTGGTGGT GGCTTAGGTG 3180

GGATTTCTGC AGCAATTCGA ATGGCACAAA GTGGCTATTC GGTCTCATTA TATGAACAAA 3240

ATAATCATAT AGGAGGCAAA GTGAATCGTC ATGAATCAGA TGGCTTTGGC TTTGATTTAG 3300

GTCCATCTAT TTTAACGATG CCTTATATTT TTGAAAAATT ATTCGAATAT AGCAAGAAGC 3360

AAATGTCAGA CTACGTTACA ATCAAGCGAT TGCCACATCA ATGGCGTAgC TTTTTTCCAG 3420

ATGGAACGAC TATCGATTTG TATGAAGGTA TTAAAGAAAC AGGTCAGCAT AATGCGATAT 3480

TGTCGAAACA GGATATAGAG GAACTGCAAA ATTATTTGAA TTATACAAGA CGAATCGATC 3540

TTCATGGGCC ATTAAATGCT CTTATTAATT ATGATTATGT ACATACTATG CAACAGGCCA 3660

TAGACAAGCG TATCTCGAAT ATCATACGAT CCATACTTGC GACAAATGTT AGGCTATTTT ATCAAATATG 3720

TAGGTTCTTC GCGCCAgCTG TATTATCTAT GTTATTCCAT ATGCAACAAG 3780

AGCAAGGCCT TTGGTATGTA GAAGGTGGAA TCCATCATTT AGCCAATGCC TTGGAAAAGc 3840

tAGCGCGTGA AGAAGGTGTC ACAATTCATA CAGGTGCACG TGTGGACAAT ATTAAAACAT 3900

ATCAAAGACG TGTGACGGGT GTCAGATTAG ATACAGGTGA GTTTGTAAAG GCAGATTATA 3960

TTATTTCAAA TATGGAAGTC ATACCTACTT ATAAATATTT AATTCACCTT GATACTCAAC 4020

GATTAAACAA ATTAGAGAGG GAATTTGAGC CGGCAAGCTC AGGATATGTG ATGCATTTAG 4080

GTGTTGCTTG CCAATACCCG CAATTAGCAC ATCATAATTT CTTTTTTACG GAAAATGCTT 4140

ATCTCAATTA TcAACAAGTT TTTCATGAAA AGGTATTGCC AGATGATCCG ACCATTTATC 4200

TAGTAAATAC GAATAAAACT GATCACACAC AAGCGCCAGT AGGTTATGAA AATATCAAAG 4260

TCTTACCACA TATTCCATAT ATTCAAGATC AGCCTTTTAC CACTGAAGAT TATGCGAAGT 4320

TTAGGGATAA AATTTTGGAT AAATTAGAAA AAATGGGACT TACTGATTTA AGAAAACACA 4380

TTATTTATGA AGATGTTTGG ACACCGGAGg ATATTGAAAA AAATTATCGT TCTAATCGTG 4440

GTGCAATATA TGGTGTTGTA GCAGATAAAA AGAAAAACAA AGGATTTAAA TTTCCTAAAG 4500

AAAGTCAGTA TTTTGAAAAC TTGTACTTTG TAGGTGGATC AGTAAATCCT GGTGGTGGCA 4560

TGCCAATGGT TACATTAAGT GGGCAACAAG TCGCAGcAAg ATAAACGCGC GAGAAGCGAA 4620

GAATAGGAAG TGATATCTAT GAAATGGTTA TCACGAATAT TAACAGTAAT AGTGACCATG 4680

TCTATGGCGT GTGGTGCaTT GATATTTaAT CgTAGACATC A 4721

(2) INFORMATION FOR SEQ ID NO: 235: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3516 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 235:

TATTCGTGCG CAATGGGCTA AATTAGGTCT AGGTTTAGAT TATAGTAGAG AACGTTTTAC 60 

TTTAGATGAA GGTTTAAGTA AAGCAGTTAA AAAAGTTTTT GTTGATTTAT ACAATAAAGG 120 

AATTATTTAT CGTGGCGAAC GTATTATAAA TtGGGATCCn AAAGCACGTA CAGCTTTATC 180

TGaTATTGAA GTAATACATG AAGATGTTCA AGGTGCGTTT TATCATTTTA AATATCCTTA 240 
TACAGCGATT GTTGTTAACC CTAATGACGA ACGATACAAA GATGTAATCG GTAAAACTGT 360

TATATTACCA ATCGTAGGAC GCGAACTGCC TATTTTAGCA GATGAGTATG TTGATATAGA 420

CTTCGGTTCT GGTGCTATGA AAGTGACACC AGCACATCAC CCTAATGATT TTGAAATTGG 480

I GAAAGACAT CAATTAGAAA ATATTATCGT TATGGATGAA AATGGTAAAA TGAACGACAA 540

AG GGGG X AAA TATGAAGGiA IWjALLAjIII TGATTGTCGT AAACAGCTAG TTAAAGATTT 600

AAAAGAAGAA GA III AG 11 A T UAAGA I IG A AG AT GATGTT CATTCTGTAG G 1 GATTCAGA 660

ACGATCTGGC GCTGTTGTTG AACCATATTT ATCAACACAA TGGTTTGTGC GCATGGAAGA 720

CTTAGCGAAA CGTTCATTAG ATAACCAAAA AACAGATGAT CGTATTGATT TTTATCCGCA 780

ACGTTTCGAA CATACATTTA ACCAATGGAT GGAAAATATT AGAGATTGGA CGATTTCAAG 840

ACAATTATGG TGGGGTCATC AAATTCCGGC TTGGTATCAT AAAGAAACAG GCGAAATATA 900

TGTTGGAGAA GAAGCGCCAA CTGATATTGA AAATTGGCAA CAAGATGAAG ATGTATTAGA 960

TACGTGGTTC TCaAGTGCTT TATGGCCtTT CTCyACGTTA GGTTGGCCTG ATTTAGAAAG 1020

TGAAGACTTT AAACGATACT ACCCAACAAA TGCCTTAGTT ACAGGTTACG ATATTATCTT 1080

TTTCTGGGTA GCACGCATGA TATTCCAAGG CTTAGAATTT ACAGATCGTC GTCCATTTAA 1140

TGATGTATTA TTACACGGTT TAGTTCGTGC TGAAGACGGG CGTAAGATGA GTAAATCATT 1200

AGGTAATGGT GTGGATCCAA TGGATGTTAT TGACGAATAC GGTGCTGATA GCTTGCGTTA 1260

CTTCTTAGCA ACAGGTTCAT CTCCAGGACA TGATTTAAGA TACTCAACTG AAAAAGTTGA 1320

GTCAGTGTGG AACTTTATCA ATAAAATCTG GAATGGGGCA CGTTTCAGTT TAATGAATAT 1380

CGGTGAAGAC TTTAAAGTTG AAGATATCGA TTTAAGTGGT AACTTATCAT TAGCAGATAA 1440

ATGGATTCTA ACACGTTTAA ATGAAACGAT TGCAACAGTT ACTGATTTAA GTGACAAATA 1500

TGAATTCGGC GAAGTTGGAC GTGCATTATA TAATTTCATT TGGGATGATT TCTGTGATTG 1560

GTACATTGAA ATGAGTAAAA TTCCAATGAA TAGTAATGAT GAAGAACAAA AACAAGTTAC 1620

ACGTTCAGTA TTGAGTTATA CTTTAGACAA TATTATGAGA ATGCTACATC CATTCATGCC 1680

ATTTGTAACA GAGAAAATAT GGCAAAGTTT ACCACATGAA GGTGACACAA TTGTTAAAGC 1740

TTCATGGCCA GAAGTGCGTG AATCATTGAT TTTTGAAGAA AGTAAACAAA CAATGCAACA 1800

ACTTGTTGAA ATCATTAAAT CTGTAAGACA ATCACGTGTA GAAGTAAATA CGCGATTGTC 1860

TAAAGAAATA CCTATTTTAA TTCAAGCTAA AGATAAAGAA ATTGAAACAA CTTTATCACA 1920

AAACAAAGAT TATTTAATCA AATTCTGTAA TCCTAGTACC TTAAATATTA gCtGACGTGG 1980

AAAwTCCTGA GAAAGCAATG ACaTCAGTTG TAATTGCAGG TAAAGTGGTA TTACCATTAG 2040

AAAGCGAATT AGATAGAGTA GATAAAAA6C TCTCTAATGA AAACTTTGTA AGTAAAGCAC 2160

CTGAAAAGGT TATAAATGAA GAAAAACGTA AAAAACAAGA TTATCAAGAA AAATATGATG 2220

GTGTGAAGGC AAGAATTGAA CAATTAAAAG CATAGGAGTT AGTAACAATG AATTACCTAG 2280

AGAGCTTGTA TTGGATACAC GAAAGAACTA AATTTGGCAT CAAACCAGGT GTTAAACGTA 2340

TGGAATGGAT GCTAGCACAA TTTAATAATC CTCAAAATAA CATTAAGGGT ATTCATGTAG 2400

GTGGCACAAA TGGTAAAGGC TCTACAGTTG CTTACCTTAG AACAGCTTTA GTTGAAAATG 2460

GTTATGAAGT AGGTACATTT ACGTCGCCGT TTATTGAAAC ATTTAATGAA CGAATTAGTC 2520

TAAATGGTGT GCCAATATCA AATGACGCTA TTGTAGAATT AGTATCACGT ATTAAACCAG 2580

TAAGTGAAAT GATGGAACGT GAAACAGATT TAGGTGTTGC AACTGAATTC GAAATAATCA 2640

CAGCGATGAT GTTTTTATAT TTTGGTGAAA TACATCCTGT TGATTTTGTC ATTGTTGAGG 2700

CTGGATTGGG TATAAAGAAC GATTCGACAA ATGTCTTTAC ACCGGTTTTA TCAATCTTAA 2760

CTAGTATCGG TCTAGACCAT ACAGATATTT TAGGTGGTAC TTATCTAGAT ATTGCTAGGG 2820

ATAAAGGCGC GATTATAAAG CCTAACGTTC CAGTGATATA TGCTGTTAAA AATGAAGATG 2880

CATTAAAATA TGTTCGTGAA CGCGCAATTG AACAACATGC AAAGCCAATT GAATTAGATA 2940

GAGAAATTGT TGTTGTATCG CAAAATGATG AATTTACTTA CCGTTATAAA GATTATGAAT 3000

TAGAAACAAT CATTTTAAGC ATGTTAGGTG AACATCAGAA ACAAAATGCT GCATTAGCCA 3060

TAACAGCTCT TATTGAATTA AATGAACAAG GATTAATTGA ATTAGATTTC AATAAGATGA 3120

TAGACGGTAT TGAATCAGTT CGTTGGACTG GACGTATTGA GCAGGTGCAT GACAAACCTT 3180

TAATCATTTT GGATGGCGCA CATAATTCAG AGAGTATAGA TGCTCTAATT GATACAATTA 3240

AACAGTACCA TGATAAAGAA AAAGTAGATA TTTTGTTCTC AGCAATAAAC GGAAAACCGA 3300

TTAA&AGAT GGTCAAACAT TTAAGTTTAA TTGCGCATAC GTTTTATGCA ACTGAATTTG 3360

ATTTTCCGAA AGCGTTACGC AAAGAAGAAA TTGTAGGTAG TATTGAAAAT GATGAAATAC 3420

AATTAGTAGA TGACTACGTT GAATTTATAA AAAATTATCA AGGTGATACA TTAGTAATTA 3480

CCGGTAGTCT GTATTTCATA AGTGAAGTTA AATCAA 3516

(2) INFORMATION FOR SEQ ID NO: 236:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7481 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

TGAGTGATAG AATCAAAAAA GCCATCTCAA AAATTAATCA AGCAAACAAC ATTCCAAACA 60 

ATGsTCGCAA ATCACCAATG TATCACTCTC CAATTACGTA ACTATGATTT AATTTAAGCA 120 

TAGTTATTGA GGTTTTGTGA TATATAGTAT AAAATTAATG AGAATTAAAT TTAATAATGT 180 

AAAATTCATm TTCgGGGTCG GGTGTAATTC CCAACCGGCA GTAAATAAAG CCTGCGACCT 240

GCTAGTATGT ATCATATTAG TGGCTGATCT AGTGAGATTC TAGAGCCGAC AGTATAGTCT 300 

GGATGGGAGA AGATGGAGGT TTTTTGTTGT GCAATAATCC TCCTATTCTT ACGAGATGAA 360 

TGGAAGGAGA AAATTGAATA TGCAACAAAA TAAACGTCTT ATCACAATAA GTATGTTGAG 420 

CGCGATTGCG TTTGTGTTAA CTTTTATCAA GTTTCCTATA CCATTTTTGC CACCATACTT 480

AACTTTAGAT TTTAGTGATG TACCGTCACT ACTAGCTACA TTTACGTTTG GACCAGTTGC 540 

CGGTATCGTA GTTGCACTGG TTAAAAATTT ATTGAACTAC TTATTTAGTA TGGGCGATCC 600 

AGTTGGACCA TTTGCTAACT TTTTAGCAGG CGCAAGTTTC TTATTAACTG CTTACGCCAT 660

CTATAAAAAT AAACGTTCAA CAAAATCTTT GATTACTGGA TTAATCATTG CAACAATCGT 720 

TATGACTATC GTGTTGAGTA TTTTGAACTA TTTCGTTCTA TTACCTTTGT ACGGTATGAT 780

ATTTAACTTA GCTGATATCG CAAATAATCT TAAAGTAATC ATTGTTTCAG GAATTATACC 840 

ATTCAATATT ATTAAAGGTA TCGTTATTTC TATTGTATTT ATTTTACTAT ATAGAAGGCT 900 

TGCGAATTTC TTGAAAAGAA TTTAATCAAA TTAAAGCAAA ATAATATACA CATAATAATA 960 

AAAAGCAGGT GACTATCAAT AAACGATAGC TTGCCTGCTT TTTCTATAGA ACATTTGTCT 1020 

AAAAAATCAA TTATTCaAAT TTTAAAGCGT CCCCATCAAA TGATTCGTCT GCAATTTTAA 1080 

TAGAATCTGt AGGGCATCCa TCAATTGCAT CTTCCATATC TTCATATAAT TCCtCAGGTA 1140

CTTCTGCAGT ACCTTGGTTA TCGTCAAGGA TTACGAAAGC AATACCTTCG TCGTCGTAAT 1200 

CATATATATC TGGCGCTGCT GCACCGcATG CACCACATGC AATACAAGTA TCCATATCAA 1260 

CGATTGTATA TTTTGCCAAT GTCTTCGCCT CCTTTGATAA AAATGCTAAA ATAGTAATGT 1320

GACTAAAATT TTAGACAGCA TCATTTTTAT TTTCAAATTA TCCGTTTTAC AGAGTGAGGG 1380

TTAAATTTGC AACACATTAT AAAAACAGCA TTACAACAAA CATTTAACTA TAAAACAAAT 1440

AAAAGTATTT ACAATATCTT AGTTGGTAAG AAATCTCACC AAACCTTTTT TGACGCTTGT 1500

AGTCAACAAC AGTTGTCATT ATATCACAGT TTACCACTAT TAAAATATCC GTCTTTTGAG 1560

CTATTTCTAG AAAAAATCAA TGAATTTAAT GCTGAAATGG AAATCATGTT GCATCCTAGA 1620

TATACATTTG aAAGCATGGG GCAAACATTT CAAGCAATTC AACTATTAGT GCAAaCCATG 1680

TCTAATACCA AACAACATGT TTTTCATTTT GTACCAATCT CTCAAAATAA TAAGATACAm 1740


AATGAACTAC ATAACTTGTT TAAAGCAATA ACTTTAAAAG GGCCATGTTA CTTACATTAT 1860

TATTTGCAAG GCTATGATGA ACCAATGTAT ACGAGACAGC AAGTTAGTTT AATAGAAAAG 1920 

CTATCTCAAC AGCAATTGTT TGAATACGAA ATGAATAATT TAGTGACAAT GATGTTTGAA 1980 

TTAGAAAGTG GAGAATATAC TATTTTATCA AAAATAATAA TGAAACCTAC ATTATTAAAT 2040 

CAAACTTATA TTACTTATAC AAAATTGCTT GAACAATTCA CGATGGAAGA TATAGCGGCT 2100

CAACAACAAG TTAAAATCAA TACTATCGAA GATCATGTAC TTGAAATCTT AATCAAAGGT 2160 

TACATGTCTA ATTACGATGA TTATGTTGAA CTAGAAGATC AACTCCAGTT TTTGAATTTT 2220 

TATCAACAGC ATCGTGGCGA ACGATTAAAA TTTTACAAAG AACAATTTGA CACGTTATCA 2280 

TATTTTCAAT TAAAAGTATT AATCGTTGGA TTTGAAAGAG GTGATCTGAA TGTTGCATGA 2340

TATTTTACGA AACAAATTTG GATTCGAGAG TTTTAAACCG GGACAACAGG AAATTATAGA 2400

AAGTATAATG TCTCAACAAC ACACTCTAGG TATACTTCCA ACTGGAAGTG GAAAGAGTTT 2460 

GTGTTATCAA ATACCTACGT ATTTATCAGG TAAGCCGACA TTAATTATCT CACCGTTAAT 2520 

ATCTTTAATG GATGACCAAG TTATGCAGTT GAAAATAAAT GGAGAAAAAC GTGTAACATG 2580 

TATTCACTCT GGTATGGATG AAATTGAGAA AAAGCATAAT ATTAAATGTT TACGACATAG 2640

CCGCTTCATC TTTCTAAGTC CAGAATTTCT CCTGCAACCG TCAAATTTTA AATTAATATC 2700 

TATGATAGAC TTTGGCATGA TTGTTCTAGA TGAAGCACAT TGCCTATCTG AATGGGGATA 2760 

TGATTTCAGA CCACATTATG CTCTAATAGG AAAAGTAACA AAGCATTTTA AAGAAGCGGT 2820 

TGTCTTAGCA TTGACAGCAA CTGCACCACC GCATTTACAA GATGATTTGA CGGAAATGTT 2880 

AGCGATTCAA TTCAATGTTA TTAAAACTAC AATGAATCGC CCAAATATAA GCTTTAAGCA 2940

TCTTAATTTT CATGATGATG AAGATAAAAT TGAATGGTTG CTGCCGTTTC TACAACAGTC 3000

GGGAGCAACG ATTATTTATG TCTCATCGAA AAAGATGTGT CTGAATTTAG CGCAACTTAT 3060

TTATGATTCA GGTTTTCTTA CAGGTATTTA TCATGGTGAT ATGAATTATC AAGAGCGACA 3120 

CACAGTTCAA CAACAATTTT TAAATAATGA TATTCCGATT ATAGTCGCAA CGAGTGCTTT 3180 

TGGTATGGGA ATTAATAAAA AAGATATTCG CACAATCATT CACTTTCATC TTTCAACAAG 3240

TCCTTCTAAC TACATTCAAG AAATTGGCCG TGCGGGTCGC GATGGTGAAC TAAGTCAGGC 3300 

AATTAGTTTA TTCCAACCGG ACGATAAATA TATTTTAGAA ACGTTATTAT TTGCAGATAT 3360

GATAACAGAA GAAGATGTAC AAAATTTCGA AATAGGAGAA TTTTTAGCTC CCGATAAACA 3420 

AGCCGTTTTG ACAACGTTGc AATCATTCTA TAGTATCGGC GCCTTGaAAC AGATATTTAA 3480 

GCAATCATTT AAACGAAAGC AATTAGGATT CTTTCGCATG ATTGGCTATT GCAAATTGGA 3540
ATGTTGTGAC AATGATTCTA ATATAACTGA TATCGCAATT TTAAATAAGA AGAAGGTAAT 3660

TAGAAGTATT GGATTTGATG AAAAGTTGCA AAATTTATTT CTCAGATAGT ATTACTTTAC 3720

TAAAAGAAAA TTGACAAGCT ATAATTAGTG TATACACAAT TGAAAAATGA TTGAAATAAT 3780

TTTGAAAAAT ATACATAAAC ATATGTCATG TGGGTATATT TTATGTAAAA TCATTGTAAT 3840

AGAATAGAAA GGAAGATGGC TATGTCTAAT AATTTTAAAG ATGACTTTGA AAAAAATCGT 3900

CAATCGATAG ACACAAATTC ACATCAAGAC CATACGGAAG ATGTTGAAAA AGACCAATCA 3960

GAATTAGAAC ATCAGGATAC AATAGAGAAT ACGGAGCAAC AGTTTCCGCC AAGAAATGCC 4020

CAAAGAAGAA AAAGACGCCG TGATTTAGCA ACGAATCATA ATAAACAAGT TCACAATGAA 4080

TCACAAACAT CTGAAGACAA TGTTCAAAAT GAGGCTGGCA CAATAGATGA TCGTCAAGTC 4140

GAATCATCAC ACAGTACTGA AaGTCAAGAA CCTAGCCATC AAGACAGTAC ACCTCAACAT 4200

GAAGAGGAAT ATTATAATAA GAATGCTTTT GCAATGGATA AATCACATCC AGAACCAATC 4260

GAAGACAATG ATAAACACGA TACTATTAAA AATGCAGAAA ATAACACTGA GCATTCAACA 4320

GTTTCTGATA AGAGTGAAGC TGAACAATCT CAGCAACCTA AACCATATTT TACAACAGGT 4380

GCTAACCAAT CAGAAACATC AAAAAATGAA CATGATAATG ATTCTGTAAA ACAAGATCAA 4440

GATGAACCTA AAGAACATCA TAATGGTAAA AAAGCAGCAG CTATTGGTGC TGGAACAGCA 4500

GGTGTTGCAG GTGCAGCTGG TGCAATGGCT GCTTCTAAAG CTAAGAAACA TTCAAATGAC 4560

GCTCAAAACA AAAGTAATTC TGGCAAGGCG AATAACTCGA CTGAGGATAA AGCGTCTCAA 4620

GATAAGTCTA AAGATCATCA TAATGGCAAA AAAGGTGCAG CGATCGGTGC TGGAACAGCA 4680

GGTTTGGCTG GAGGCGCAgC AAGTAAAAGT GCTTCTGCCG CTTCAAAACC ACATGCCTCT 4740

AATAATGCAA GCCAAAACCA TGATGAACAT GACAATCATG ACAGAGATAA AGAACGTAAA 4800

AAAGGTGGCA TGGCCAAAGT ATTGTTACCA TTAATTGCAG CTGTACTAAT TATCGGTGCA 4860

TTAGCGATAT TTGGAGGCAT GGCATTAAAC AATCATAATA ATGGTACAAA AGAAAATAAA 4920

ATCGCGAATA CAAATAAAAA TAATGCTGAT GAAAGTAAAG ACAAAGACAC ATCTAAAGAC 4980

GCTTCTAAAG ATAAATCAAA ATCTACAGAC AGTGATAAAT CAAAAGAGGA TCAAGACAAA 5040

GCGACTAAAG ATGAATCTGA TAATGATCAA AACAACGCTA ATCAAGCGAA CAATCAAGCA 5100

CAAAATAATC AAAATCAACA ACAAGCTAAT CAAAATCAAC AACAGCAACA ACAACGTCAA 5160

GGTGGTGGCC AAAGACATAC AGTGAATGGT CAAGAAAACT TATACCGTAT CGCAATTCAA 5220

TACTACGGTT CAGGTTCACC GGAAAATGTT GAAAAAATTA GACGTGCCAA TGGTTTAAGT 5280

GGTAACAATA TTAGAAACGG TCAACAAATC GTTATTCCAT AATATAACTA TATAAATTGT 5340

TAAATTGCGC TTATAAGTAT GTAGCGGTTT TTTCATTTTT CAAAGTTTGT TATTTAACAA 5460

GGTCTTGTCT CGAATATTGG CATATCAATT TAACTTTTTA AATAGTCATC AAAAAGATAA 5520

AACACCACAA TCAACAAATT TAACGAGGAA GAATAAAAAA TAAATCAACA TATTAAATTG 5580

TAGTGTTATT CAACTCCGTA GCTAACAATT CTCTATTCAC ATTAAACAAA TTGTCAAAAA 5640

TATATCATAA ATCTTCAAGC ACAGACTTAG CGCATCAATC ACTGAACTGT TATAATAGTT 5700

TGGGATTAAA GGAGGCCGAA ACAATGCAAA AAGTTGAAAG TATCATAATT GGTGGAGGGC 5760

CATGCGGATT AAGTGCGGCT ATTGAACAAA AAAGAAAAGG TATTGATACC TTAATTATTG 5820

AAAAGGGTAA TGTCGTTGAA TCAATCTACA ATTATCCTAC TCACCAAACA TTTTTCTCAT 5880

CAAGTGATAA ATTAAGTATT GGGcGAgTAC CGTTTATCGT TGAAGAAAGT AAACCAAGAC 5940

GTAATCAAGC GCTAGTTTAT TACCGAGAAG TTGTAAAACA TCATCAATTA AAAGTAAATG 6000

CATTTGAAGA AGTATTAACT GTTAAAAAAA TGAATAATAA ATTTACTATT ACTACGACGA 6060

AAGATGTTTA TGAATGTCGA TTTTTAACAA TCGCGACAGG CTATTATGGT CAGCATAATA 6120

CATTAGAAGT TGAAGGTGCG GATTTACCTA AAGTGTTCCA TTATTTTAAA GAGGCACATC 6180

CGTATTTTGA TCAAGATGTT GTAATTATCG GTGGTAAGAA TTCGGCTATC GATGCTGCTT 6240

TGGAGTTGGA AAAAGCTGGT GCTAACGTGA CGGTTCTATA TCGTGGTGGA GATTATTCGC 6300

CTTCAATTAA ACCGTGGATA CTTCCAAATT TCACAGCATT AGTAAATCAT GAAAAAATTG 6360

ACATGGAATT TAATGCTAAT GTTACCCAAA TAACTGAAGA TACTGTGACT TATGAAGTAA 6420

ATGGTGAAAG TAAAACGATA CACAATGATT ATGTATTTGC GATGATTGGT TATCATCCCG 6480

ATTATGAATT TTTAAAATCT GTAGGCATTC AAATTAATAC AAATGAATTT GGAACAGCGC 6540

CTATGTATAA TAAAGAAACA TACGAAACAA ATATCGAAAA TTGCTATATT GCAGGTGTAA 6600

TTGCTGCAGG GAACGATGCG AATACCATTT TTATTGAAAA TGGTAAATTC CACGGGGGCA 6660

TTATTGCTCA AAGCATGCTA GCTAAGAAAC AAACGCCCTT AGAATCATAA AAATAAAGGT 6720

CTATGTACTA AATAACTTAG TTTTACAACG ACTGACATTC ATGATATGTC AAATGAGGTT 6780

GATGACTATT GATTGTACAT AGACCTTTTT ATGTTACGTA TTCATTATAA TTCAAAATAT 6840

GATTTGATTT CAGCTTTATC TAAATTGTTG CTTAACGCGA CTAATAATTT TAATCTTGCT 6900

TTTGGACCAT TCAAGCCGTT AGAAAAAATA AAACCTTGTT GTGCGAGTTG GTAACCACCA 6960

CCATCGTATG CGTAAGTTGG ACTCACAATA CCATTAAAGG AACGTGAAAC TAGCACAATA 7020

GGTATATTTA AAGATACTAA TTGTTGAATG CCTTCTAATG CGCTTGGAGG TATGTTGCCT 7080

TGTCCTAACG CTTCAATAAC CATACGATCG ATACCTTCTC GACTATAAAA ACTAAAAATG 7140

TCCAATGCTT GTTGGCGATA TGGCATATGA TGGAATTGCA CACGATCCTT TGTCAATACA 7260

CCTAGCGGAC CATGATTTGG ACTTTGAAAT GTGTTTGTAT TAGACGTATG TGTTTTGGTA 7320

ACATTACGCG CCGTATGAAT TTCATCATTA AaTACAACCA TCACGCCTTT ATGACGGGCC 7380

TTTTCATCAG AGGCAACGCG AATAGCGGAA ATATAATTAT ATAATCCGTC AGAACCAATT 7440

TCATTAGACG AGCGCATTGG CCAAGAAnTA ACAACAGGTT G 7481

(2) INFORMATION FOR SEQ ID NO: 237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 237: 

ATGAATTATA CCTTCCACTA GAAGTGTCGG TATTTATGTG CCTGGTGGTA AAGCAAGTTA 60 

TCCTTCAACG GTATTAATGA CAGCGACGTT AGCACAAGTA GCAGGTGTGG AAAATATTGT 120 

CGTTGTGACA CCACCTCAAC CTAATGGTGT GTCTCAAGAA GTATTAGCAG CATGCTATAT 180 

TACGCAAGTT GATCAAGTAT TTCAAGTTGG TGGTGCTCAA AGTATTGCTG CGTTAACTTA 240

TGGCACAGAA ACGATACCTA AAGTTGATAA GATTGTAGGT CCAGGCAATC AATTTGTTGC 300 

ATATGCCAAG AAATATTTAT TTGGACAAGT AGGTATTGAC CAAATAGCTG GACCAACAGA 360 

AATAGCACTT ATTATTGATG AAACAGCAGA TTTAGATGCC ATCGTATATG ATGTTTTTGC 420 

ACAAGCAGAA CATGATGAAT TAGCACGTAC ATATGCCATT AGTGAAGATG CGCAAGTCCT 480

TAAAGATTTA GAATCACGTA TTGCTAAAGC ATTGCCTAAT GTGGACAGAT ACGACATTGT 540 

TTCTAAAAGT ATCGCTAATC AACACTACCT TATCCATGCT AGTAATTTTG ATGAAGCATG 600 

CCATGTCATG AATACAATCG CGCCTGAACA TGCGTCGATT CAAACAGTAA ATCCTCAACC 660 

ATATATTGAA AAAGTGAAAT ATGTGGGTGC ATTGTTTATT GGACATTATT CGCCAGAGGT 720 

CATAGGAGAT TACGTTGCAG GTCCAAGTCA TGTATTACCT ACAAATAGAA CAGCTAGATT 780 

TACCAATGGG TTATCGGTCA ATGATTTCTT AACACGGAAC ACGGTCATCC ATTTATCAAA 840 

AGATACGTTT GAACAAATTG CTGATTCAGC ACAACATATT GCTCATGTTG AAGCATTATA 900 

CAATCACCAG CAGTCTATTT TAATACGTCA GTCTTAGGGG AGTGTAATTG AAATGATTTA 960 

TATTGATAAA AATGAAAGTC CAGTTACGCC GTTGGATGAA AAAACAATGA CGTCTATTAT 1020 

TAGTGCAACg CnATATAATT TATATCCTGA TGCAGCATAT GAACAATTCA AGGAAGCTTA 1080

GATTCAAAAG TTAATGCTGA TCATGCCAGA AGGTCCGGCA TTAACGCTAA ATCCTGATTT 1200

TTTTATGTAT CAAGCATATG CGGCACAAGT AAATCGTGAA ATTGCATTTG TAGATGCAGG 1260

ATCAGATTTA ACGTTTGATT TGGAAACCAT TTTAACGAAA ATCGATGAAG TACAACCATC 1320

ATTTTTTATT ATGAGTAATC CACATAACCC TTCAGGCAAG CAATTTGATA CGGCATTTTT 1380

AACAGCTATT GCAGATAAGA TGAAAGCATT AAACGGATAC TTTGTCATTG ATGAAGCATA 1440

TTTAGATTAT GGTACGGCAT ATGACGTGGA ACTGGCACCA CACATCTTAA GAATGCGTAC 1500

ATTATCAAAG GCGTTTGGAA TTGCCGGCTT AAGATTAGGT GTCTTAATTA GTACTGCTGG 1560

AACGATAAAG CATATTCAAA AAATAGAACA TCCATATCCA TTAAATGTAT TTACGCTAAA 1620

TATTGCGACT TATATTTTTA GACATAGAGA AGAGACAAGA CAATTTTTAA CGATGCAACG 1680

ACAGTTAGCT GAGCAGTTAA AACAAATATT TGATACACAT GTTGCAGATA AAATGTCAGT 1740

GTTCCCATCA AATGCTAATT TTGTACTTAC TAAAGGCTCA GCAGCGGAAC AATTAGGACA 1800

ATACGTATAT GAACAAGGAT TTAAACCTCG CTTTTATGAT GAGCCGGTGA TGAAGGGCTA 1860

TGTAAGATAC TCAATTGCAA CAGCATCACA GTTAAAGCAA TTAGAAGAAA TTGTTAAAGA 1920

ATGGAGTGCA AAATATGATT TATCAAAAAC AACGAAACAC AGCTGAAACG CAACTAAATA 1980

TTTCAATATC TGATGATCAG TCACCATCGC ATATTAATAC AGGTGTGGGC TTTTTAAATC 2040

ATATGTTAAC CTTGTTTACA TTTCATAGCG GTCTGTCATT AAACATTGAG GCACAAGGTG 2100

ATATTGACGT AGATGATCAC CACGTAACTG AAGATATCGG CATTGTCATT GGCCAATTGT 2160

TACTTGAAAT GATTAAAGAT AAAAAGCATT TCGTTCGTTA TGGAACGATG TACATTCCAA 2220

TGGATGAAAC ATTAGCACGT GTCGTTGTGG ATATAAGTGG GCGCCCATAC CTATCATTCA 2280

ATGCATCATT AAGTAAAGAA AAAGTTGGTA CGTTTGATAC GGAGTTAGTA GAAGAATTTT 2340

TTAGAGCGGT CGTAATCAAT GCAAGATTAA CAACGCATAT TGATTTAATT CGTGGAGGTA 2400

ATACACACCA TGAAATTGAA GCTATATTCA AAGCGTTTTC CCGTGCATTA GGCATAGCGC 2460

TAACTGCAAC TGATGATCAG CGTGTGCCGT CATCGAAAGG TGTGATTGAA TGATTGTCAT 2520

CGTTGATTAT GGATTAGGGA ATATTAGTAA TGTAAAACGC GCTATTGAAC ATTTAGGGTA 2580

TGAGGTGGTT GTCTCAAATA CCTCAAAAAT AATCGATCAA GCAGAAACAA TCATATTGCC 2640

CGGTGTCGGC CATTTTAAAG ATGCGATGTC AGAGATAAAA CGATTAAATC TCAATGCAAT 2700

ATTGGCTAAG AATACTGATA AGAAGATGAT TGGTATTTGT TTAGGCATGC AATTAATGTA 2760

TGAGCATAGT GATGAAGGCG ATGCATCTGG ATTAGGGTTT ATCCCAGGAA ATATTTCGCG 2820

TATCCAAACA GAATACCCAG TGCCACACTT AGGCTGGAAT AATTTAGTGA GTAAGCACCC 2880

AATTGCATAT GCGCAGTATG GGGCTGATAT TCCGGCAATT GTTCAATTTA ACAATTATAT 3000

TGGTATTCAA TTCCATCCTG AAAAAAGCGG TACATATGGG TTACAAATTT TGCGTCAGGC 3060 

AATACAAGGG GGATTTATAA ATGATTGAAT TATGGCCAGC GATTGATTTG ATTGGGTCAA 3120 

CAAGTGTGAG GTTAACAGAG GGTAAATATG ATAGTGAAGA AAAAATGTCA CGCTCGGCTG 3180 

AAGAAAGTAT TGCTTACTAT AGTCAATTTG AATGTGTGAA TCGTATTCAT ATCGTCGACT 3240 

TGATAGGTGC TAAGGCACAG CATGCCCGAG AGTTTGATTA TATTAAGTCA TTGAGGAGAT 3300 

TAACAACCAA AGATATTGAA GTAGGTGGTG GCATTCGTAC GAAGTCACAA ATCATGGACT 3360 

ACTTTGCCGC AGGGATTAAT TATTGCATAG TTGGAACGAA AGGTATTCAA GATACTGATT 3420 

GGCTTAAAGA GATGGCACAT ACATTTCCAG GTCGCATTTA TTTATCTGTT GATGCCTATG 3480 

GAGAAGATAT TAAAGTGAAC GGATGGGAAG AGGACACAGA GTTAAATTTA TTTAGTTTTG 3540

TGAGACGGTT ATCGGATATA CCTCTTGGCG GCATTATATA TACTGATATT GCTAAAGATG 3600

GCAAAATGTC CGGACCTAAC TTTGAATTAA CTGGTCAATT AGTAAAGGCA ACGACGATTC 3660

CTGTCATTGC TTCCGGTGGT ATTAGACATC AGCAAGATAT TCAACGATTA GCGTCGCTAA 3720

ATGTTCACGC TGCTATTATA GGAAAGGCTG CACATCAAGC ATCTTTTTGG GAGGGGCTAA 3780

AATGATTAAA AAACGTATCA TTCCATGTTT AGATGTCAAA GATGGTCGTG TCGTTAAAGG 3840

GATTCAATTT AAAGGATTAA GGGATATTGG GAATCCTGTT GATTTAGCAA TGTATTACAA 3900

TGAAGCGGGT GCTGATGAAT TAGTATTTTT AGACATCTCT AAGACGGAAG AGGGTCATAG 3960

CTTAATGCTA GAAGTGATTG AACAGACAGC GTCACGCTTG TTTATCCCTC TTACTGTAGG 4020 

GGGTGGGATT CAAAGTCTCG ATGATATTAC CCAATTGCTA AATCATGGTG CAGATAAAGT 4080 

ATCATTAAAT TCAAGTGCTT TAAAAAATCC ACAGCTCATT AAACAAGCGA GTGATAAATT 4140

CGGTAGACAA TGCATCTGCA TAGCAATTGA TAGCTATTAT GATCCTGAAA GAAAAGCACA 4200 

TTATTGTTGT ACGACTGGTG GTAAAAAAAT GACAAATATT AAAGTATATG ACTGGGTACA 4260 

GCAAGTAGAA CAGTTAGGTG CAGGTGAGCT CCTCGTTACA AGTATGGGAC ATGATGGTAT 4320 

GAAACAAGGC TTTGATATTG AACACCTAGC AAATATTAAG TCTCTTGTAA ATATTCCAAT 4380 

CATTGCTTCT GGTGGTGGTG GCAATGCACA ACACTTTGTA GAATTATTTG ATCAGACGGA 4440

TGTTTCTGCA GGTTTAGCTG CAAGTATATT ACATGATCGA GAAACGACGG TTCAATCTAT 4500 

TAAAGAAGTG ATACGGCAAG GGGGTATAGC AGTAAGATGA CCAAATATAA AATTGATTTT 4560 

AGCAAAGGTT TAGTGCCAGC AATTTTACAA GATAATCAAA CAAAACAAGT ATTGATGTTG 4620

GGTTATATGA ACCAAGAAGC TTTTGATAAA ACGATAGAAG ATGGTGTGGT ATGTTTCTAT 4680 



AAAGATATTC ATGTAGATTG CGACAATGAC ACTATTTTAA TTGATGTCAT ACCAAATGGA

CCAACATGTC ATACAGGCAG TCAAAGTTGT TTCAACACAG AAGTTCCATT TTCAGTGCAA 

ACATTAGCGC AGACAGTTCA AGATAGTGCC CAATCCAATA ATGAAAAGTC ATATACAAAA

TATTTATTAA CAGAAGGTAT AGAAAAGATT ACAAAAAAAT ACGGTGAAGA AGCTTTTGAA 

GTCGTAATTG AAGCAATTAA AGGTGACAAA AAAGCATTTG TAAGTGAAGT AGCAGATGAA 

CTTTATCATT TATTTGTCTT GATGCATGCG CTTGGCGTCG ATTTTTCAGA AATTGAGGCG 

GAATTAGCGC GTAGACATCA TAAGCGCAAT AACTTTAAAG GTGAACGACA AAATATCGAA 

CAGTGGTAAA GCAAGTATGG ACTAAGATAT AAGGAAAAGG ATCATGGCTT ATACACTTAC 

AAATATTGTG GAAAACGTGA CATTTTCAAG TTTAAAATAC GACACCAACA TATTTTAACT 

ATGAATGCTG TGATGGTACT AAAGTTGCGA ACTCGTTATA GATAAGTAGT GGATAATCAC 

AATACGAAAT CAAAAATAAT TATAAAAAGT AAATTGAGCA ACTCAGGAAT AGATGTCACT 

GTTAAAGATG TCGAAAAGTA TATGAATCGA TATAATGAAG TTATGAAGGG AAAAAATGGC 

GAAAAAGCTA AAGAGTTATG TTTGTCGTTA CTACCTATTA ATATCATAGT TGTCTTTACA 

TTCTTTGTAT TTATACTATA AATACAAATA TATCTAGCCT GAAATAGAAA TGTCATAGCC 

TATTTAAAAG ACAATCTCCA TTAGAACTAA GATATGCATC CCGAAAGTTA GACTAAAAAA

CTAACTTTAT GGGATGTATT TTTATGCTAA TCATCATAAA TTCGAGATTA AGTTAAAGGT 

AGTTCAAGAG TAATTAAACA ATAAATwAAA AATAGTAGGA TACTTACTTT GAGGGAAGAA 

AATTAACTGT ATATATTTAG TTTAGGAACA AGTATTACGG TTTATCCTGA TACAATTATT 

GTGGATGGGA TGATATTTTT AGGTTTAAAA TACGACACCA GCAAACATAA TAACTGTAAT 

AGCTCATAAA TCTCCCCATA TAGCTAATCT AAAAAAATAA TACATCATTG GAATTAAGCC 

CCAAQCATGT AAATATTAAA AATCAAAAmA GATATmTGTA AAAaAGTTAC AATTtGCATA 

ATTAAATTGT GTCTAATTAT TGACTAATTA AATTTTGCCA AATATAATAT TAATTAATAA 

TTTGaAATGA TTAGCGTATA CACTTTAAAT TCTCTTTGGA GAATATATTT TTTAAATACA 

AATGTAAACG CTTTCTCGTC AAATTAAACA ATAGAAAGGA TGGTCATTAT GAGTGCTTGG 

TTAAGTAAAT TATTTGAGTT TATTCCTCGA ATAATTATCA ATTTGTTTAT CTAAAATAAA 

AAAATAGAGG TGCTGACAAT GATGAAAAGT CAAAATAAGT ATAGTATTCG TAAATTTAGT 

GTAGGTGCAT CTTCCATTTT AATAGCTACA TTACTATTTT TAAGTG 

(2) INFORMATION FOR SEQ ID NO: 238: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3775 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 238: 





TGATAATGAT TCTCATTGTC ATACATCACG AAGGAGGCTA ATTAGTCAAT GAATAAAGTA 60

ATTAAAATGC TTGTTGTTAC GCTTGCTTTC CTACTTGTTT TAGCAGGATG TAGTGGGAAT 120

TCAAATAAAC AATCATCTGA TAACAAAGAT AAGGAAACAA CTTCAATTAA [illegible] 180

GGtACAACTG AAATTAAAGG GAAACCAAAG CGTGTTGTTA CGCTATATCA [illegible] 240

GACGTCGCTG TATCTTTAGG TGTTAAACCT GTAGGTGCTG TAGAATCATG GACACAAAAA 300

CCGAAATTCG AATACATAAA AAATGATTTA AAAGATACTA AGATTGTAGG [illegible] 360

GCACCTAACT TAGAGGAAAT CTCTAAATTA AAACCGGACT TAATTGTCGC [illegible] 420

AGAAATGAAA AAGTTTACGA TCAATTATCT AAAATCGCAC CAACAGTTTC [illegible] 480

GTTTTCAAAT TCAAAGATAC AACTAAGTTA ATGGGGAAAG CTTTAGGGAA [illegible] 540

GCTGAAGATT TACTTAAAAA GTACGATGAT AAAGTAGCTG CATTCCAAAA [illegible] 600

GCAAAGTATA AAGATGCATG GCCATTGAAA GCTTCAGTTG TTAACTTCCG [illegible] 660

ACAAGAATTT ATGCTGGTGG ATATGCTGGT GAAATCTTAA ATGATTTAGG ATT H-AAACGT 720

AATAAAGACT TACAAAAACA AGTTGATAAT GGTAAAGATA TTATCCAACT [illegible] 780

GAAAGCATTC CATTAATGAA CGCTGATCAT ATTTTTGTAG TAAAATCAGA TCCAAATGGG 840

AAAGATGCTG CATTAGTTAA AAAGACTGAA AGCGAATGGA CTTCAAGTAA AGAGTGGAAA 900

AATTTAGACG CaGTTAAAAA CAACCAAGTA TCTGATGATT TAGATGAAAT CACTTGGAAC 960

TTAGCTGGCG GATATAAATC TTCATTAAAA CTTATTGACG ATTTATATGA [illegible] 1020

ATTGAAAAAC AATCAAAATA ATTAAGGAGT TTTACGATGC TACTTAAACC AAAATACCAA 1080

ATCGTTATTG CTGGTTTATG TCTTGCAATA GTAGCTATCT TAAGTTTAAT GATTGGAAAT 1140

ACGCTTGTGT CACCAGGTAC GGTGATACAG GCGTTATTCA ACTTTGATAG TGAAAACGAT 1200

TTACATGATG TTGTCACTGG TGCACGGGCG TCGAGAACAA TCATTGCGTT ATTGACTGGT 1260

GCTGCCCTTG CTGTCTCAGG TTTGTTGATG CAAGCACTTA CACGAAACCC AATAGCCTCA 1320

CCAGGGCTTT TCGGTGTCAA TGCAGGCGCA GTATTTTTTG TCATTTTTAG TATTACATTT 1380

ATCCAAATTC AATCTTTTAA AATGATTGTA GTTATTGCAT TTTTGGGGGC TATTGTTGTT 1440

ACTGTATTAG TTGTTGCACT AGGTATGTTT AGACAAACAC TATTCTCACC TCACCGTGTC 1500

ATTTTGGCAG GTGCTGCGAT TGCGATGCTA TTTACAGCCT TTACTCAAGG CATACTTATT 1560

AATATTTGGG ATATCCCATG GATTATTCCG CTTGTATTGA TACTTATTTT AATTGCATTT 1680 

AGCATGGCTG CACACATCAA CATCTTGATG ACAAGTGACG ACATTGCAAC CGGCCTCGGT 1740 

CAAAACATAA AATTAATCAA ATGGATGATT ATTATGCTCA TCAGTATGTT AGCCGGTATT 1800 

TCGGTAGCCG TAGCTGGATC AATCGTCTTT GTGGGTCTTA TCGTACCGAA TATTAGCAAA 1860 

CGATTATTAC CACCAAACTA TAAGTATTTA ATTCCTTTTA CTGCATTAGC TGGAGCAATC 1920 

CTAATGATCA TTTCAGACAT TGTTGCTCGT ATAATAATTA AGCCACTAGA GTTGCCTATC 1980 

GGTGTCGTTA CCGCTGTCAT TGGCGCTATT GTCTTAATCT ATATTATGAA GAAAGGACGT 2040

CAACGCTTAT GACCGAAAAG ATTAATAAAA AAGACAATTA CCATCTCATC TTCGCGTTAA 2100 

TCTTTTTAGC CATCGTTTCA GTGGTAAGTA TGATGATTGG TTCAAGCTTT ATACCATTAC 2160 

AACGCGTACT GATGTACTTT ATAAATCCAA ATGACAGTAT GGATCAATTC ACTTTAGAAG 2220 

TATTACGCTT ACCTCGCATT ACACTTGCGA TTTTAGCAGG TGCCGCACTA GGAATGAGTG 2280 

GTTTAATGTT GCAAAATGTA TTAAAAAATC CAATTGCCTC ACCTGATATT ATCGGTATCA 2340 

CAGGTGGTGC TAGCTTAAGT GCTGTTGTCT TTATTGCATT TTTCAGCCAT TTAACAATAC 2400 

ATTTACTTCC ACTATTTGCA GTATTAGGTG GCGCAGTTGC AATGATGATA CTATTAGTGT 2460

TTCAAACGAA AGGACAAATA CGCCCGACAA CACTCATAAT CATCGGTATT TCGATGCAAA 2520 

CGTTGTTTAT TGCGCTTGTC CAAGGATTAC TCATTACAAC GAAGCAATTA TCTGCTGCCA 2580 

AAGCTTATAC ATGGCTAGTC GGAAGTCTTT ACGGTGCTAC GTTTAAAGAT ACAATCATTT 2640

TGGGTATGGT TATTTTAGCT GTTGTGCCGT TGTTATTTCT TGTTATACCA AAAATGAAAA 2700 

TATCTATACT TGATGACCCT GTAGCGATTG GCTTAGGCTT ACATGTACAA CGTATGAAAC 2760 

TAATCCAATT AATCACTTCT ACTATACTCG TATCTATGGC AATCAGTTTA GTAGGTAACA 2820

TTGGGTTTGT CGGTTTAATC GCACCACATA TCGCGAAAAC AATCGTTCGC GGAAGTTATG 2880 

CTAAAAAGTT ACTAATGTCA GCAATGATTG GTGCCATATC AATTGTTATT GCAGACTTAA 2940 

TTGGGCGTAC CTTATTCTTG CCTAAAGAAG TGCCAGCAGG TGTATTTATT GCTGCTTTTG 3000

GTGCCCCATT CTTCATATAC TTATTATTAA CCGTGAAAAA GTTATAACGA TATTATTAAA 3060 

ACAAAATGAC CTCACAACGA AGTTAGCTAA ATGATTCAGT TAACTAACCG TTGCGAGGTT 3120 

TTTTTATACA TATAGTTGTT GTTATTGTTA ACAAGCGTCG ACTTTCTTAA TTACATATTA 3180 

ATACTTTATA TACAAATAAC ACCGACTCAT ATTCTATAAT ATCAATCAAT ATTCTTCGAT 3240 

TTTTCAAATA TCGATAACTA TTTCTTATTT AAATATAGTG TTTGATAATG TCATTTATTC 3300

AAAAACACAA ATTTTAATAA AAATATCATA TTATTTTTAA TTGTAAATTA TGGATTATTT 3360 



ATATAATAAC CATGACAAAA TTAGAGATTT TATAATCATT GAAGCATATA TGtnTCGTTT 3480

TAAGAAAAAA GTCAAGCCTG AAGTCGATAT GACTATAAAA GAATTTATAT TACTGACTTA 3540 

TTTATTTCAT CAGCAAGAAA ACACACTTCC ATTTAAGAAG ATTGTTTCAG ATTTATGTTA 3600

TAAACAATCG GATTTAGTAC AGCATATAAA AGTACTTGTG AAACATTCAT ATATTAGTAA 3660 

AGTTCGAAGT AAAATTGATG AGCGTAATAC TTACATTTCA ATATCTGAAG AACAACnAGA 3720

nAAAATTGCA GAACnTGTTA CATTGTTTGA TCAAATCATT AAACAATTTA ACCTT 3775 

(2) INFORMATION FOR SEQ ID NO: 239: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1361 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 239: 





AGAAAAATTA GCCTACCTAT GCAAGTTGTT ATTGCATTAG TGTTAGGTGT TGTCGTAGGA 60

CTTTTATTAT ATGGACAAGA AAACGTAGCA AATTACATTA AACCATTTGG TGACGTATTT 120

TTAAATTTAA TTAAAATGAT CGTTATACCA GTTGTATTTT GCTCACTAGC GCTTTCTATT 180

TCGAACGTTG GGGAATCGAA AACTGTAGGG CGTTATGGCT GGaAAACAAT TTTATACTTT 240

GAAATTATTA CAACAATCGC AATAGGTTTA GGGATTATCT TCGGTAACCT ATTTAAACCA 300

GGTGCTGGAT TAGACCCAAC AAAATTACCT AAAGGTGATA TTTCTAAATA TCAATCAACT 360

GCACATGCAG CAGAACAATC TACATATGGa AATCATTTTA TTGATACCAT TGTACATATT 420

ATTCCGACAA ACTTTTTTGA AGCTTTAAAT AAGGGTGAAT TATTACCTAT TATCTTCTTC 480

GCAGTArTCT TTGGATTAGG ATTAGCTGCT GTAGGTAAAA AAGCAGAACC AGTTAAAGAA 540

TTTTTAAGCG GATCGCTTGA AGCTGTGTTC TGGATGATTA ATAAAATTTT AAAATTAGCA 600

CCACTTGGAG TGTTTGCATT CATTTGTACT ACAATTATTA CATTTGGTGC ATCCGCATTA 660

TTACCACTAT TAAAATTAGT ATTAGTTGTT GTCTTTGCAA TGGTGTTCTT TGTATTCGCT 720

ATACTAGGAC TAGTTGCATG GATGTGTGGT ATTAATATCA TGAATATTAT TAGAATCTTG 780

AAAAGTGAAT TGCTTTTAGC ATTTTCTACA TCAAGTTCGG AAGCTGTACT TCCTGTAATG 840

ATGAAGAAAA TGGAAAACTT CGGTTCTCCA AAAGAAATTA CTTCTTTTGT TATACCAATT 900

GGTTATACGT TTAACTTAGA TGGATCAGCA CTTTATCAAT CTATTGCAGC ATTATTCGTT 960

GCACAGATGT ATGGAATGCA CTTAACATTA TCAGAGCAAA TTGTGTTGAT GTTAACATTA 1020

ACATTAGGTG CCATGGGCTT ACCGGCACAA GGTTTAGCAT TAATTATTGG TGTTGACCGT 1140

ATCTTAGATA TGGTACGTAC ATGTGTAAAC GTTATTGGTA ATGCATTATC AACAATCGTT 1200 

ATAGCTAAAT GGGAAAACGT ATATGACAAA GCAAAAGGTC AAGAATATTT AAAATCAATT 1260 

TAAAAAATAC TATCTGACAT TTAArGnCCC TTACAACCTT TGGTTgTnAG GGCTnTTTTA 1320 

TGTCATGCGT CTTAAAGCCA GGCCGTATAn CGGTAAGCGT A 1361 



(2) INFORMATION FOR SEQ ID NO: 240:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1489 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 240: 





TCAATATGTA AATCGGCGAT TGTCCCTATC tTCATACCCT TACCTTCTTT ACTCAAATTA 60

AAATGTTATT TATTATGCCA TaGCTTATCT AATATATATA GTTAACTGCT TCATTTAGAT 120

GATTATTTTA TATTTTTGCA TAAAAACTTA [illegible] AAAAAATCGA TAAGTTTTAG 180

TTATCATACC CTTACCTATC AAATGTTTTT TCTTATATTT AAAAAAATAA TTGCTTTATT 240

AAATGGATTT CTTTAGTAT? TATAATTAAG AAAACGCTTA CACACAACTT TTTTATTTGC 300

TTTATCCTGA GGAGGAAAAT TATGGCAAGA AAATTGCATA GAGAGTTGAA TAACAGACAC 360

ATCCAATTAA TAGCAATTGG GGGCGCAATT GGAACTGGGT TATTCCTAGG ATCAGGTCAA 420

ACAATATCTT TAACTGGTCC ATCACTGTTA TTCACATACA TGATTATTGG GGTTGTACTA 480

TTCGCTTTTA TGCGCGCATT AGGCGAATTG TTGTTGAGCA ATACAAGATT TAATTCATTT 540

GTTGATATTG CAAATGAATA TTTAGGCCCT TTTGGTGGCT TTGTCATTGG CTGGACTTAC 600

TGGTTATGTT GGATTGTATC AAGTATGTCA GACCTAACTG CGATGGGACA ATACTTTGCA 660

TTTTGGTATC CACAAGTCCC AAATTGGATT ACCGTGCTAT TTATTGTTTT AATCTTGATT 720

AGCTTCAACT TATTAGGTGC CAGATTATTT GGTGAACTGG AGTTTTGGTT CTCGATTATT 780

AAAGTTGTCA CAATTATTGC GATGGTTATC GTTGGTCTTG TATTAATCTT TTTCTCATTT 840

AAAACACATT ATGGACATGC ATCATTCACA AACTTAATCA GTCACGGTGG CATGTTCCCT 900

GGTGGAaCAT TTGGTTTCTT AATGTCATTC CAAATTGCTG TATATTCATT CATTGGTATT 960

GAACTTATAG GTGTAaCTGC TGGTGAAaCG AAAGATCCTG AAAAAACCTT ACCGAAAGCA 1020

ATTAATAATG TACCTATCCG TATTTTATTA TTCTATATCG GTGGTCTATT AGTAATTATG 1080

TTAATCGGCG TACCATTTGC AGCAGGTGTC GTTAACTTTG TCGTGCTAAC TGCCGCGGCC 1200

TCTGCTACAA ATAGTGGTAT CTATTCGAAT AGTCGTATCT TATTCGGACT GTCACAACAA 1260

GGGTTAGGTC CTAAAGTTTT AAATAAAACG AATAGTCATG GCGTGCCTTA TTTATCAATG 1320

TTAGTTTCAT CAATTGCATT ACTTATAGCA GCCTTGTTAA ACTACATTTT CCCTAATGCA 1380

ATTCAACTAT TCATATACGT TACAACGTTA tCAACTGTGT TGTTTTTAGT TGTtTGGGCA 1440

ATGATnATTG TCGCTTATCn AATGTATTTG GAAAAAGCAT CCTGAGGCA 1489




(2) INFORMATION FOR SEQ ID NO: 241: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5000 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 241: 








TTTTCCATCA TrATcACCAT TTAAGACAAT AGCTGTATGA CCATATCCTC CACCAAATCT 60

TCCACTAAAA ACCACTAAGT CCCCAGGTTC AGCTTTGAAA GTTGGTGTGT TGTGATAAAT 120

TTTAGCTTCA CTATTAAAAT TATTTGCATA TGGTATATCT TTAGCTCCAT ATCCTTTtAA 180

TCCAkGACCA TAAAGAkGAT TCCAATATAC ATTAACTAAA TCGAAACATT GCCATCCATA 240

ACTACCGTCG AAATCCCATC CTCTATTCTC TAATGTTTTC AAATAATTTA ATGTACTGCT 300

ATTACTATTA CTTTTATTAT TTGAAGACAC TGTTTTTGGT TTTGGTTCTA CTAATGGTGT 360

CATTGGCACT TTTAATTTTT GACCAATAAA TATTAAATTA GGATTTGCTA TATTATTTGT 420

ATTTTGAATA TTTGAAACTG TAGTTTTGTA TTTTAATGCT ATAGCACTAA GTGTGTCTCC 480

TTTTKTACA GTATAGATTT GTGTTTTTGG AGCTTCTCTA AAACTGTAGT AACCAAAGTA 540

ATTATTAGTA ACTTTATTCG TTTTCTTCTG ATTAGAATTT TGAGCTTCCA AGTTTGCAAT 600

TTTAATTTCT TTAGTAAGTT CATTGTTATT AATAACTAGA TTGTTACCTT GGCTTGAGTT 660

TTTCGGAGTA TTTGAAATCT TTATATCTTG ATTAATTTCA TTTCCGTTTG AAATTGCTGA 720

TTTGTTGTCT AACTTTAAAC TTGTGTCCGA TGTTTTAACA GCACCTTCAT TTTTTATTTT 780

GTCTTTTGTC GTATTTTTAT TAGCATTTAA CTCTGATTTC GCGAATACAT TTTGCTCATA 840

CCCTCTTGTA AAATCTTTAG ATTTATCAAT TTCATCTGCA TATGCTTTGT TCGACATACC 900

CAATGCCAAA AACATACCTA TTGAAATTGA CAAAATTCCA ATACTAACTT TTCTAATTGA 960

ATAGCGTACT TTACTTTGTT GTTTATTCAT GATGAAACAC TCCTTACAAT AATATACAAA 1020

GTAGGAGGTA AAATAATTAA CTTGTCTTTC CAAAATATGA AAAGTGTACT AAAaTTCATC 1140

GCACGACAAA TAGCCCATTT CCGATACTTT TATAAAGTAT GGAaTGGGCT ATAGCCATTT 1200 

ATATCATCTT TTAACTTTAT TTATTAACAG TTAATAATGA TTCATAAATA CCTGCTTCTT 1260

TAGCAGCTTC AATTAATGTT GAACCAATTT CTGAAGGTGT TGCCGCTGTT TTCACACCAC 1320 

AACTATTTAA TGTTTTAATT TTCTCTTCAG CAGTACCTTT ACCACCTGAA ATGATTGCAC 1380 

CAGCATGTCC CATACGTTTT CCAGGAGGTG CTGTTTGTCC ACCGATAAAG CCTACAACTG 1440 

GTTTTGTCAT ATTCGCTTTA ATCCATTCAG CTGCTTCTTC TTCAGCCGTA CCACCGATTT 1500 

CACCAATCAT AACAACTGCT TTCGTTTCGT CATCTTCATT GAATGCTTTT AAAACATCAA 1560 

TAAAGTTTGT TCCGTTGACT GGGTCTCCAC CAATACCAAC AGCTGTAGTT TGACCAATAC 1620 

CTTCTTCAGT CAATTGGTGC ACTGCTTCAT ATGTTAATGT ACCTGAACGA GATACTACAC 1680 

CAACATGACC TTTTTTGTGA ATATAGCCAG GCATAATACC AATTTTACAT TCATCTGCTG 1740 

TAATCACACC TGGACAGTTC GGACCAACTA AACGTGTTTT TCTACCTTGT AAGTAGCGTT 1800

TAACTTTAAC CATGTCTAAT ACAGGAATAT GTTCAGTGAT ACAAATAACC ATATCTAAGT 1860

CTGCATCAGC TGCTTCTAAA ATTGAGTCTG CAGCAAATGG TGCTGGAACG TAAATGACTG 1920

AAACCGTTGC CCCAGTTTCA TTTTTAGCTT CTTCAACAGT GTTGAAAACA GGAACGCCTT 1980 

CAACAACTTG ACCACCTTTA CCAGGCGTCA CACCTGCTAC TATTTTCGTA CCATAATCAA 2040 

GCATTTGTTT TGTATGGAAA AGGGCAGTAG ACCCTGTAAT ACCTTGTACC ATTACTTTAG 2100

TATTCTTATC TATAAATACA CTCATCTTAG TGCTCCCATC CTTTCCTTAT GCTTCTTTGA 2160 

CTAGTTTAAC AATTTTTTGT GCACCTTCAG CCATTGTTGC TGCTGGTTCA ATTGCTAATC 2220 

CTGAGTCTTT TAAGATTTTT TTACCTAACT CAACATTTGT ACCTTCTAGG CGTACAACTA 2280

GTGGTAAAGT TAAATCTACT TCTTTTACAG CTTCAACGAT ACCTTCTGCG ATAACATCAC 2340 

ATTTCATAAT GCCACCGAAA ATGTTTACAA AAATACCTTT AACATTTTCA TCACCTAAAA 2400 

TGATTTTAAA TGCTTCAGTT ACTTTTTCTC TAGTAGCGCT TCCGCCTGCA TCTAAGAAAT 2460 

TGGCTGGGTT TCCACCGAAA TGATTAATCG TATCCATTGT TGCCATGGCT AAACCTGCAC 2520 

CATTAACCAT ACATCCGATG TCACCATCTA ATGCAATGTA TGATAAATCA TGTTTAGACG 2580 

CTTCAATCTC TTTCGGATCT TCTTCTTCTA AATCACGTAA TTCTACAACA TCTTTATGTC 2640

TGAATAATGC ATTATCATCA AAATTAATTT TAGCATCTAA TGCCAATACA TCACCATCAG 2700

CTGTTGTAAC TAATGGGTTG ATTTCTACGA TTGAACAATC TTTTTCAATG AATACATTAT 2760

AAAGTGCTAA TAAGAATTTA GCAGCTTTGT TAACAGATTC TTTAGGAATA TTAATATTAA 2820

AGATCTTTTC AGGAGTCTTC GCAGCAACTT CTTCAATCTC AGTGCCCCCT TCTTCAGACG 2940

CCATCAATGT TACTTGGTCA GTCGCACGAT CAATAACGAA TCCAACGTAA TATTCTTTTT 3000 

GAATAGCACA ACCTTCTTCG ATATATAAAC GCTTAATTTC TTTACCTTCT GGACCAGTTT 3060

GATGTGTCAC CAAAGTTTTC CCTAATAATT CTTTTGCATA TGTTTCTACC TCAGATAAAG 3120 

ATTTAGCAAT TTTTACTCCG CCTGCTTTAC CTCTACCTCC AGCATGAATT TGTGCTTTTA 3180

CAACATAAAC ATCAGAATTT AATTCTTTTG CTTTCTCCAC CGCTTCTTCA GCAGTAAATG 3240 

CTACTCGTCC TTCTGGAACT GCAACGCCCA TTGAACGAAA TATTTCTTTA CCTTGATACT 3300 

CGTGGATATT CATCTTCCAT CCTCCTGTTA CTTAGGTTAA GTTCCCTTAC AATTATAAAA 3360 

AATGTAAGCG CTATTGTAAA CTTAAATGCT ACTTTTTTAT CATTTAATTG AATTTTACGA 3420

TTTACAGTAA CGATTTTATA GGTTCAAAGC TTTTTCTATG CTCTTTCATA ATGCCAATAT 3480

CATCGATTGC TAGTAAATGT TGTTTGGTAC CGTAACCCGC GTTTTTTTCA AAACCATATT 3540 

CAGGATAATC TTTAGATAAC TGTGTCATAT AATCATCACG AAAAACCTTT GCCATGATAC 3600 

TTGCAGCTGC AATGGACACA CTTCTTGCAT CACCCTTGAT TAAAGATACT TGAGGCAGTG 3660 

CATTATCAAG CGTCATCGCG TCTATCAATA AATGCGTTGG TTGTACTGAT AATCCATCAA 3720

TAGCTCGCTG CATGGCGATT TGAGTAGCTT TATAAATATT AAATTCATCT ATTTCTTnCA 3780

GtGTCGCGAT CCCATATGCA AAAGCAGTAA CTTCATTTTT TAGTGCTTCA TTTAATTCTA 3840

GACGTTTCGT AACAGGTACT TTTTTCGAGT CATCAAGGCC CAAATAATTG TGATTTGAAT 3900

TTAAAATTGT TGCGCATGCA ACGACTGGAC CTGCTAAAGG TCCTCTTCCA ACTTCATCAA 3960

TCCCACAAAT AATAGCATTA GGATGCTCTT TTAATATTTC ATTTTCAAAG TAAGTCATTT 4020

CAACATACTT TTCTTTTAAA GCTTGTTCTT TTTCTAACGC TTTTCTGCGC CTAGCTATGG 4080

CATTTTGAAC ACCTTTTCGC TCATCTAAAA AGCATTCATG ATTTTCTAAT TCTTCTATTG 4140 

TATTAACCGC ATTAATCAAC TGCGTAACTT CTTTAATTGT TAGCGTCATT TGCTAATTCC 4200 

TCAGTCATAT CTTTAAAAAT ATCAAAACAA TAATTTCCTA TTTTAGCATT TCGAATATCA 4260 

TAAATAATCA GTTGAATGAC TGCTTCGTAA TCAATTTCAT TACCACGTCG AATTAAGCCc 4320 

ACGTTTTTTC CCTATCGCAT CAAACCACGC TATGATTTCT GCAtCTTCAG GAACTTCAAT 4380 

ATTATAATGT GACTTTAATC GCGCTAAATC ATTTTGAATT AAAAAGTTTA ATCCATAGAT 4440

GGCAACTTCA TCTAAGTGCA CAATACTATC TTTTATCGCA CCAGTTAAAC TCAACTTCTT 4500

ACCGACTTCT TCATCTTCAA ATTTAGGCCA AAGTATCCCT GGTGTGTCTA ATAGTTGTAA 4560

TGCATTACCA ACTTTAATCC ATTGTTGTTG TTTGGTCACA CCTGGTTTAT TACCAGTCTG 4620

AACGATCATT GCTCTTATCG CTCTAGGTTT AAGTCCTTTC GCTTTTTCGC GTTCAAATTT 4740

TTCAGCAGTC GCCTTAATTG CTGCAGCTTC CACTTTCTTT AAATTTTTAC CGTGCTTAGC 4800

ATCCACTGAT ACAGGATAgT AtCCTTTATC AATAAAAAAT tGTTCCCATT TTGACATCTC 4860

ATTTAAATTA GACATATCTT TTTTATTTAA TATAACAACA CGTGGTTTTT GGTTAATAAC 4920

TTCATCTATC ATAGGGTTTC TTGAACTATA TGGAATTCTT GCATCTACTA GTTCAAACAC 4980

TACATCTACT TTTTTTAAIT 5000




(2) INFORMATION FOR SEQ ID NO: 242: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1700 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 242: 






AACCCGnAGA CGAAACTnCA TAGTTGCCTG TTATGTTGAT TAGTGCGTTT GTAGCTATTT 60

nAAATCAAAC ATTATTAAAT ACAGCGTTaC CTAGTATAAT GAGAGAATTA AATATCAATG 120

AAAGTACATC GCAATGGCTA GTTACTGGGT TTATGCTTGT TAATGGCGTC ATGATACCTC 180

TGACGGCATA TCTAATGGAT AGAATTAAAA CTAGACCTTT ATACTTAGCG GCGATGGGGA 240

CATTTTTATT AGGTTCTATT GTTGCAGCCT TAGCTCCGAA TTTTGGAGTT TTAATGTTAG 300

CTCGTGTAAT TCAAGCGATG GGTGCAGGCG TACTTATGCC CTTAATGCAA TTTACGTTAT 360

TTACATTGTT CAGTAAAGAA CATCGAGGTy TtGCAATGGG ACTAGCAGGT TTAGTAATTC 420

AATTTGCACC AGCAATAGGA CCTACAGTTA CAGGATTAAT TATTGATCAA GCGAGTTGGC 480

GAGTTCCATT TATTATAATT GTAGGAATTG CTATACTTGC CTTTGTTTTC GGTTTGGTTT 540

C^TCTCGAG TTACAATGAA GTGAAATATA CGAAATTAGA TAAGCGTTCA GTAATGTATT 600

CAACTATTGG GTTCGGGTTA ATGCTATACG CATTTAGTAG TGCAGGAGAT TTAGGATTTA 660

CAAGTCCAAT AGTAATAGGT GCGTTGATAT TAAGTATGGT TATTATCTAT TTATTTATAC 720

GTAGACAATT TAATATTACT AATGCACTTT TAAATTTAAG GGTTTTTAAA AATAGAACAT 780

TTGCATTATG TACGATTAGT TCAATGATTA TAATGATGTC AATGGTTGGA CCTGCGCTGC 840

TTATACCGCT ATATGTTCAA AACAGTTTAT CTTTATCTGC CTTGTTATCA GGACTTGTTA 900

TCATGCCTGG TGCAATAATA AATGGTATTA TGTCAGTTTT TACAGGTAAA TTTTATGATA 960

AGTATGGTCC TAGACCATTG ATTTATACTG GTTTTACAAT TTTAACAATT ACTACAATTA 1020

GAATGTTTTC AGTTTCTTTA CTCATGATGC CGATAAATAC TACAGGAATT AATTCTTTGA 1140

GAAATGAAGA AATCTCACAT GGCACGGCTA TTATGAACTT TGGTCGTGTA ATGGCTGGTT 1200 

CACTAGGCAC AGCTTTAATG GTTACATTAA TGAGTTTTGG TGCAAAAATA TTTTTATCTA 1260 

CATCGCCATC GCATTTAACT GCAACTGAAA TTAAACAGCA ATCCATTGCT ATAGGGGTGG 1320

ATATCTCATT TGCTTTTGTA GCTGTGCTTG TTATGGCAGC TTATGTGATA GCACTTTTTA 1380

TAAGAGAACC TAAAGAAATA GAAAGTAATA GAAGGAAATT TTAAAATAAT TATAGTAGTT 1440

GGTCTATTTA AAATAATAGG CTAACTGCTT TTTTTATTTA ATAAAAAGTT TTATACTTTT 1500

AGTGATAGAC TAAGCAAAAA TTGTTATTTG CTATGATGTA GATGTCTTAA AATGATTAAG 1560

GGGGATTTGC TTTGTTAACG GTAGATCAAG TGAAAGAATT GGTAGGAGAA ATTAAAGATC 1620

CTATTATAGA TGTGCCTTTA AAAGAAACAG AAGGTATTGT TGAnGTTTCT ATTAAGGGAG 1680

AAnAAGAACA TGTGAGTGTT 1700

(2) INFORMATION FOR SEQ ID NO: 243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10146 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 243:

TGCATCAACT TTCAAACAAT AAATCATCAC AATAACCACA CCTAATTCAA CACTTTTCAA 60 

ACATAAGTAT TGACACATTG AGCAAAATGA TTTTTAATTG TAACTAATAC AGTTACAATT 120 

ATGAGGTGAG AAACATTGAA TTTAGAATTT AACATTGCCG TGCATGTATT AGCTTTTTTA 180

ACTAAGCATC aTTCAGAAAA ATTCAATAGT AGTTCATTAG CAGAaTTAaC TTGTTTAAAT 240

CCTGTTCAAT TACGACGCGT GACGACTCAA CTTGTCGATT TAAAAATGAT TGACACAATA 300 

CGAGGTAAAG ATGGCGGTTA TTTAGCAAAT GATCAAAGTG CTGATGTCTC TCTAGCAACA 360 

TTATaTAAAC ATTTTGTCTT AGAGAAAGAA CACCACACAC GTCTATTTAC TGGCGACGAA 420 

GGCAGTCACT GTCAAATTGC TCGTAATATT GCAACTACCA TGTCACATTA TCAGCAAGAC 480

GAACAGAATA TCATTATTAA TTTTTATAAT GGAAAAACAA TCAAAGATGT CATTGAAGAC 540 

ATTCAAAAGG AGGATTTATG TCATGAAAAC ATATGATTTA ATTGTAATAG GATTTGGGAA 600 

AGCTGGTAAA ACTTTAGCGA AATACGCTGC ATCAACAGGT CAACACGTCG CAGTTATCGA 660

ACAATCTCCG AAAATGTATG GAGGCACTTG TATAAACATA GGATGTATAC CTTCGAAGAC 720

TGTTGTCAAT GCGCTAAACA ATAAAAATTA CCACTTATTA GCAGACGATA ACAACATTGA 840

TGTACTGGAT TTTAAAGCGC AGTTTAAATC TAATACTGAA GTGAATTTAT TAGATCAACA 900 

TGACGATATC GTCGATAGTA TTACTGCACC TCATATCATT ATTAATACCG GTGCTACCTC 960 

TGTCATTCCT AACATTAAAG GCCTTGATCA AGCTAAACAC GTCTTCGATT CGACAGGTTT 1020 

ATTAAACATT AGCTATCAAC CTAAGCACCT CGTTATTGTA GGTGGCGGTT ATATCGCCTT 1080 

AGAATTTGCT TCAATGTTTG CGAATTTAGG TAGTAAGGTA ACAGTATTAG AACGTGGCGA 1140 

ATCATTTATG CCACGCGAAG ATCAAGATGT CGTTGCATAT GGTATTACTG ACTTAGAAAA 1200 

TAAAGGCATT GCATTGCATA CAAATGTTGA AACGACTGAA TTGTCATCTG ACAATCATCA 1260 

TACAACAGTC CATACCAACG TTGGTAACTT TGAGGCTGAT GCAGTACTTT TGGCTATCGG 1320 

GCGCAAACCG AATACGGATT TAGCTTTAGA AAATACTGAT ATCGAATTAG GCGACAGAGG 1380 

CGAAATTAAA GTCAATGCTC ATCTTCAAAC AACTGTGCCG CATATTTATG CTGCAGGTGA 1440

TGTTAAAGGC GGACTTCAAT TTACGTATAT ATCTTTAGAT GATTATCGAA TTATCAAATC 1500

AGCGTTATAT GGTAATCAGT CACGTACGAC TGACAATAGA GGCAGCGTGC CTTATACAGT 1560

ATTTATAGAT CCACCATTAT CACGTGTTGG ATTAACTAGT AAAGAAGCTG CCGCTCAACA 1620

TTATGATTAC ACTGAACATC AACTTTTAGT AAGTGCTATA CCTCGTCATA AAATTAACAA 1680

TGATCCAAGA GGTTTATTTA AAGTAGTCAT TAATAATGAA AATAATATGA TTTTAGGTGC 1740

TACATTATAT GGTAAGCAAT CTGAAGAATT AATTAATATA ATTAAACTTG CGATTGATCA 1800

AAACATTCCA TATACCGTAT TACGAGATAA TATTTATACG CATCCTACGA TGGCCGAATC 1860 

ATTTAATGAT TTATTTAATT TCTAGACAAA ACATAAAAAC CTGGTGGCAC GCATTGAATG 1920 

ATGCTGCCAT CAGGCTTTAT TGTTGTGCTT TTCGCTTTTC TAATTTTTCT TTAAGCTTTC 1980 

TATCTTGTTC TTCTTTACGA CGTTTACGTT CTTCATGTCG TTTTCTTAAA CGCTCTTCTT 2040 

CTTCAGGATC ACGTGGTTTC TTTAATTGTT GAGAAACTTT TTCGATTAAT TCTTCTTCAG 2100 

TAAGCGCagc CAGTGGGCGG TTATTAACAA AAGTGAATGT TTTTCGGCGT CCAGGTCCAC 2160 

AATAAGATTG ACAACCTATC ACGATTTCAG CATCGGGATC TAATTTTTCC AACTTCTTTT 2220 

GTAACGTTCT TATATTGACT GCCTGACATT CATCACAAAT AAGGAATGTA TTTTTCATAT 2280 

TGCTACCCAC CTTTCTTTAT CATATCTATA TCGTCGATTT CATTAATTTT TTCGTTAACT 2340

CTATCTATTT TACTCTTTTT AATATTTTTT TCAAGATACG TAACACGGCT GaCAATAAAA 2400

AATGGAGCAT TTATCTTCTA ATTAAATTAG ATGaTTGCTC CCCTATCAAA TCATTTATTG 2460

CCCATGATAA ATATTAAATT TTAATGGTTT AATACCATGT TTTGTCCATT CATCATAAAT 2520

TGATGTTTTA GATGCGCCGT GATACTTTTC AGCAATATCA CACAAATATT TTAGCTTTTC 2640

AGTTTCTATA TCAACTGTAG CTTCTTTATC CATACGTTGA ATAATTGTAC GATTCTGACG 2700 

CACCATCTTT TGCACACCTT TAATGTTATT TGTTTTAAAA GCATGAATAA GTTTTTCAAC 2760

ACAACGATGT GAATCTTCTA AGAAGTCACC GTAAAATGAA GGATCTGATT TCAAACGTTT 2820 

CACTTCGCTA ACAAAGTGTG GTGATGACGC CGGTGAGCCA GTCCAACCGA TAAGTACTTC 2880 

CATATTTTCA GGTGCTTGTA ATGGTTCGAT GTGCAATCCA GGCCAGTTTT TGATTAAAAC 2940 

TTCTTCAACC GTAGTATCTT CAATTTGATG CTTAACCCAT TCATGATCAA AAGTACTATA 3000 

CGCTAGCCAT CCACTATATA CACTCACAGC AATATCTCOG CATGAACTTA AACTTTGTAA 3060 

CTTCATATTT GCAATCACTG CTAGTTTATA AATGTATAAA TTAGATAACT TCATATCATA 3120 

AAATTCATTT AATACTTTTA TAACTGACAC AAGTACTGCT GCACTTGAAC CTAATCCATA 3180 

TTTATGACCA TTTGAATCAT CTAAATTACT ATCAATAGTC AGATGAAAAT GCTTCATCGC 3240

TATATCGCAA CTTTTCGCGT ATTGTTCAAA TATTTCAATA GCTGTGaCCA CATAATTTAA 3300 

TTGTTTTGCT GcATGTGGAT CTGAAATGAC AATACTATCT TCATCTCTAC TAAATGTAAC 3360 

TGGGTTATGA TGTAATGCTT TTGAATGAAT GGTACCTTTA TATTGGTCTG CTTCTTCAAT 3420

AGTAGCAGTT ACAAAACGAT CTAACGCAAT AAGTACAGAT TTATATCCTG GTTCTGTTAC 3480

AGCATATTCT CCAGCAATAT AAAGTTTTCC GGGTGCTTTG ACCTGAATCA TTTTATCTCT 3540

TCCTTACTCA ATTATTTCAA TTCCTGTGGC AATAATGTCA CTATCAATAA TTTGGTTATT 3600

ATCAAACTGT GTTAATAATT TATCTATAAT CTGTTGCTTG TTTTTCTTTT CTACAAGTAT 3660 

TTTCACATTA GGTCCCGCAT CCATTGTAAA ATAACACGGA TACCCCGCTT CTCGGCATTC 3720 

GTGAACAAGC GCCATGACAT CATAACTTTC TTGCACAAGA TATGTGAACG GCGGTGTTGA 3780

TCCT^GATTC GTGGCATGCA TACGCAAACC ATTTTCTTCA ATTACTTCAC CAAGGCGTTT 3840 

AAAATCTTTG TCTTGAATCG CTGCTTTTGC TTCAGCTAAA TCTTCATCAA TATGATCTAA 3900 

CCAATATTGA TAAAACCTTG ATGTGTTTCG TGTCAATGAC ATACCATATC GACTAGGTAC 3960 

CTTTTTAGAA TGTTGATTAA TCACAACAAA TATCATGGCA AGGTCATCTT CAAAATGATT 4020

CGATTCAAGT GGAACGGCAT ATGACGTCTC ATCACTATAC CCTTTTTCCC ATTCTGCAAA 4080 

TCCACCATAA ATACTACGCG ACGCAGAACC CGAACCAATT CGCGCCAATC TCGATAAATC 4140 

CTTATCTGAC AGCTGCATGT CTAGCGCTTG ATTACAAGCT GCTGCTAAAG CTGCATATGC 4200 

GCTTGCCGAT GAAGCCAACC CTGCTGCTGT TGGTACAAAA TTGTCGCTTT CAATTTCTGC 4260

ATACCAATCG ATGCCAGCTC TATTTCTGAC AATATCCATA TATTTTGAAA TTTTCTCTAA 4320 
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AAAAGTGACT TTCGTTTCAG TGTAAAATTT TTCTAATGTA ACAGATATGC TATTATTCAT 4440 

TGGAATGATT AGTGCTTCAT CTTTTTTACC CCAATATTTT ATAAGTGCAA TATTCGTATG 4500 

TGCACGTGCT TTGCCACTTT TAATCAACGC ATTAACCTCC TAAATTCTCA ATCCAAGTAT 4560 

GTGCTGCACC AGCTTTTTCT ACAGCTTTTA CAATATTTTT CGCTGTTGGT AAATCTTTGG 4620 

CAAGCAATAA CATACTTCCA CCACGACCAG CGCCAGTAAG TTTTCCAGCA ATCGCACCAT 4680

TTTCTTTACC AATTTTCATT AATTGTTCTA TTTTATCATG ACTAACTGTC AACGCCTTTA 4740 

AATCCGCATG ACATTCATTA AAAATATCCG CTAAGGCTTC AAAGTTATGA TGTTCAATCA 4800 

CATCACTCGC ACGTAAAACT AACTTACCGA TATGTTTTAC ATGTGACATG TACTGAGGGT 4860

CCTCACAAAG TTTATGAACA TCTTCTACTG CTTGTCTTGT TGAACCTTTC ACACCAGTAT 4920 

CTATAACAAC CATATAGCCG TCTAAACTTA ACGTTTTCAA CGTTTCAGCA TGACCTTTTT 4980 

GGAACCAAAC TGGTTTGCCT GATACAATCG TTTGCGTATC AATACCACTT GGTTTACCAT 5040

GTGCAATTTG CTCTGCCCAA TTAGCCTTTT CAATGAGTTC TTCTTTCGTT AATGATTTCC 5100

CTAAAAAATC ATAACTTGCA CGAACAAAAG CAACCGCGAC AGCTGCACTC GATCCTAATC 5160 

CACGTGATGG TGGTAAATTC GTTTGGATCG TTACTGCTAG CGGCTCTGTA ATATTATTTA 5220

ATTCTACAAA ACGGTTCACC AAAGACTTAA GATGGTCAGG CGCATCATAT AACATACCAT 5280 

CGTAAACATC GCTTTTAATA GACGAATAGT TCCCGCTCTC TAAGGCTTCT ATTAAAACTT 5340

TGATTTTACC TGCGTTAAAC GGTACTGCAA TAGCAGGCTC TCCAAATGTA ACAGCATGTT 5400

CTCCTATTAA AATAATCTTA CCTGTCGATT CCCCATATCC TTTTCTTGTC ATGTCAATAT 5460 

CACCTTTTAT ATTTATCCTA TACTTGATTC ATTATTTTTA TTTATTAGTA AAAGACATCA 5520 

TATTCTAAGT TGCAtngCAT TCGCGTTAAA TTTCATTGCA GTCTTTATCT CACATTATTC 5580 

ATATTATGTA TAATCTTTAT TTTGAATTTA TATTTGACTT AACTTGATTA GTATAAAACT 5640 

AACTTTCGTT TACTTCAAAG TTTAAATCTT ATCGAGTGAT ATTTCAGATT CTTTATCTTT 5700 

TTATAAAATA GCCCTACAAT TTATAATTTT CCACCCTAAC TATAATACTA CAAATAATAA 5760 

TTGGAATATA TAGATTTACT ACTAAAGTAT TAGAACATTT CAATAGAAGG TCGTTTCTTT 5820 

CATAGTCATA CGCATTATAT ATACCCTATT CTCAATCTAT TTAATACGTA AAACATGAAA 5880

TTTTCTTATT AAATTTATTA TTTCCATCAT ATCATTACTT TTAATTTAAT GATGTTCAAT 5940 

TTAAATATTA GGTCAATAAC ATATTTATGC TTTTTATGGA TACTTTCAAA AATAACAGCC 6000 

CCAAACGATA ACTTGAAAGG GGCTGTTAAA TATTTAACTA TTGCATTTGA TCTATCATTT 6060

TCTTGTTTCT TTCAATCATT TTATCAAAAT ACGTATCGTA TCTTTGCCAT TCTTCTTGAG 6120 
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TTACATCTTG AACAGTAATC GTTTTGTTAA GCAATGTCTC TAATGAGGCC ATACAAGATG 624 0 

GTTCAATTTC AGGATATTTA AATTTAGTCA CTTCACCTTT TAAAGCATGT TCATAAAATG 6300 

TTTGCATCAT CAATGCACGT TCTGAACCAG AGCCTTCAAC ACAAAGATAA ATTTGTACAG 6360

CAATACCGCC TCTAACTCTT CGTTGCGATA TGCCTGCAAA TTTCTTACCA TCGATACTTA 6420 

AGTCAAATTT TCCTGGGCAA TAAGAATGTT CAATTTCCAT CGTATCAATA TCAACATTCT 6480 

CATTTTCGAA CATTTTGCTA ATTAAGAGGT ACATCACAGT AAACGCTTCA TCAATCGTTG 6540

TTTCTGTTTG TCCTTTGAAC ATCAGCGATA TATTTAATAC ACCTTGATCT AGAACGACAC 6600 

CTAAGCCACC AGAATTTCTA ACAATGGCAT TATAACCAAT CTCATTCGTT AAATAATCAA 6660 

TGCCATCTTT TAAAAACGGC AATCTTGAAT CATGAATACC AAGAATAACA GTATGTTGAT 6720 

GAATCCAAGT ACGCACAACA TTATCTGATA TATCTTTGCC CACACTTTCG CAAAATGTAT 6780 

CATCGAATGC GAAAGATTGC ATAGGTTCTA ATCCAGAAGA ATGATCGATA TATCGCCAGT 6840

TGACGCCATT AAAATATTTA CTCGCTAAAT CCATCGTTAT TGTAAGGCTT GCGCTGCTGT 6900

AATAATTGAA AGATTGTATA CATCTTCAAT TGAGCAGCCA CGTGATAAGT CATTTACTGG 6960 

AGAATTTAAA CCTTGTAATA CTGGACCAAC TGCATCATAT CCACCTAAAC GTTGTGCAAT 7020

TTTGTAACCA ATATTACCAG CTTCTAAACT TGGGAATACA AAGACATTTG CATCACCTTG 7080

TAATTTAGCA CCTGGCGCTT TTTTCTCAGC AACACCTGGT ACAATCGCAG CATCAAATTG 7140 

GAATTCGCCA TCAATGATTG CTTCTAATTT TtCTTCTTCA GCTTTTtGTT GTGCTAATTT 7200

GACAGCTTCT TGAACTTTTG TCACGTCGTC TGATTTAGCA GACCCTTTTG TTGAAAAGCT 7260 

TAACATTGCA ACTTTTGGAT CCATGCCAAA GCTTAATGCT GATTTTGCAC TTTCTACTGC 7320 

AATTTCTGCA AGTCCTTGTG AATCAAGTTC TGGATTGATT GCACAATCAC CAAAGATGTA 7380

TTGrrCATCA CCTTTAATCA TAAAGAAGAT ACCTGATGTT CTTGATACAC CTGGTTTCGT 7440 

TTTGATGATT TGTAAAGCTG GACGCACAGT GTCGCCTGTT GAATGTGCTG CACCACTAAC 7500 

TAAACCATCT GCTTTACCAG CATAAACAAG CATTGTACCG AAGTAGTTCA CATTGTTTAA 7560 

TAATTCTTGT GCTTGTTCTT CAGTCGCTTT ACCTTTACGT CGTTCAACAA ATGATTGAAC 7620 

TAATTCAGCT TTCAATTCAC TTGTCGCAGG ATTAATTAAT TCAATATTAG AAATATCAAG 7680 

ATCAAGTTTT TGCGCTAAAG ATTGAACCTT AGTCTCATCA CCTAACACGA TTGGTGTAAC 7740 

ATAATCTGTT GCTTGTAATT GTGTTGCAGC TGTTAGAACA CGTTCGTCCT CTCCTTCAGG 7800 

TAATACGATT TTAACGTTTT TACCAGAAAG TTTGTCTTTT AATACATTTA ATAAATCAGC 7860 

CATAATGTCC TCCTGTAATA TAAATCTTAT TAATCATTCA CGGTATAATT ATACGCCATT 7920 
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TATGATAAAA TTTATAAAGA ACTGATGATT TTTGAAAAGG AGCGATAAAC ATGAGTCAAG 8040 

CAGCCGAAAC ATTAGATGGT TGGTATAGTC TACATTTATT TTATGCAGTT GATTGGGCAT 8100 

CATTACGTAT AGTTCCAAAG GACGAACGCG ATGCACTTGT CACTGAATTT CAATCATTTT 8160

TAGAAAATAC AGCAACTGTA AGATCATCAA AATCTGGTGA TCAAGCTATT TATAATATAA 8220 

CTGGTCAAAA AGCAGATTTG TTATTATGGT TCTTACGTCC TGAAATGAAG TCTTTAAATC 8280

ATATTGAAAA TGAATTTAAC AAATTGCGCA TTGCTGACTT CCTAATCCCT ACATATTCAT 8340 
ATGTATCAGT CATTGAATTG AGCAATTATT TAGCTGGTAA ATCTGATGAA GATCCTTATG 8400

AGAACCCTCA TATCAAAGCA AGATTATACC CAGAATTACC ACATTCTGAT TATATTTGTT 8460 

TCTATCCAAT GAACAAACGT CGTAATGAAA CTTATAACTG GTACATGTTA ACTATGGAAG 8520 

AACGCCAAAA ATTAATGTAT GACCATGGTA TGATTGGTAG AAAATATGCT GGCAAAATCA 8580 

AACAATTTAT TACTGGTTCT GTAGGGTTTG ATGATTTCGA ATGGGGCGTA ACATTGTTCT 8640

CAGATGACGT ATTACAATTC AAAAAAATTG TATACGAAAT GCGCTTTGAT GAAACAACAG 8700 

CACGATACGG TGAATTCGGT AGTTTCTTTG TAGGACATAT TATTAACACA AACGAATTCG 8760 

ATCAATTCTT TGCGATTTCT TAATACATTG GTACGTTTAT AAATTAATAA AAAAATTCCA 8820

AGCTTATCGG TTTAAGCTTG GAATTTTTCG TTTATCTTCA GTATATTCCC GTATACATAA 8880 

GACGTGATTT GGTAAATAGT TGAAATCTGT ATGTTTAAAC TTATATATAT GTGCTAATGT 8940

ATTATCAATA ACAAAGtACA CTTTGCTCAT AGCAAgTsaC CCGAgTAGTC TTCCTTGGGA 9000

GAACTTTAAC TACTATCACT ACATATAAAC GTTAACCTCA ATAGAAATTA TACAGTCGCT 9060 

ACTCTATACA ATTTTTGTAA TGGTTAACTA ATATTATTTT AACCTATTTG AAATATTTGA 9120 

AACATATTTT TGTCGAATTT TTTTCAATAA TTTTTCCTTT TTATACTTCA AGAGAATTTT 9180 

AACTACTAAA AATTCCGATG ATTATTATTA CAATAGTATC AAATATTAGT TTTTTAAAAT 9240

CAATAACAAC TTATCAAAAA GCTCATGTGG TTATTTTATA GTGTATAAAC TATAATGAGT 9300 

ATTAAATTCT TATAAACAAT GGTGATGAAA TGGACATAAA TTCAGAAGAA TACAAACAAG 9360

AGGTACTTAT CAAAGACGTT GTCATGCTTG CTGCTCGCAT ACTATTAGAA TCTGGTGCAG 9420 

AAGGTACGCG TGTAGAAGAT ACCATGACAC GTATTGCAAA AAAACTTGGT TACAGTGAAA 9480

GTAACAGCTT TGTTACAAAC ACTGTCATCC AGTTTACGTT ACATTCGGAA TCGTTTCCTA 9540 

GAATATTTAG AATTACCTCT CGAGATACAA ACTTAATAAA AATTTCTCAA GCTAATAAAA 9600 

TTTCGCGTCA AATTACAAAC AATGAAATTT CTTTAGCCGA AGCAAAAACG CAACTTGAAA 9660

AAATATATGT TGCTAAGCGT GACAGCAGTC TTCCCTTTAA AGGTTTTGCT GCAGCAATGA 9720
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45 



TAGCAGGTAG TCTAGGATAC CTAGTCACTG AGATTTTAGA TCGTAAGTwA CACGCACAGT 9840

TTATCCCAGA ATTCaTTGGT TCaTTAGTTA tTGGGATTAT CGCCGTTATT GGACATACAC 9900 

TTATTCCAAC AGGTGACTTG GCAACTATTA TCATTGCGGC AGTCATGCCT ATTGTTCCTG 9960 

GTGTATTAAT AACAAACGCA ATACAAGATT TATTTGGTGG ACACATGTTG ATGTTCACAA 10020 

CGAAATCATT AGAAGCATTG GTTTnGCGTT TGGCATCGGT GCTGGCGTTG GTAGCGTATT 10080 

AATTTTAGTA TAGGAGTATC AGACTATGTT TTGGATCTTA AACTTTATCT TTAGCTTTTT 10140

AGCTTC 10146
(2) INFORMATION FOR SEQ ID NO: 244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 244: 

ATTTAATTGG TTGGTGGCGT ATTCTCnTTT CCAAGATTTG CCACGATTGG ATGTTCAGTC 60

GACAAATATG TAACCAGTCC CGATAAATTA CGTTCCTCAA TCATTGATAG TATGTCTGGT 120 

GTTTGTAACG ATTTATCGAG TAAGGCATCA AGCAATTGAT AATGTCCCAA TACAACAAAA 180 

TGCACGTTGT CTCTCAACTG CTGTTGAATA AACTGAATAA AGAGCTTTAA GCTCTGTTGC 240

ACATTCGCTA ATGATGGTGC ATAGTTTTCC AAACCAACTT GTACAGCCGC TTCATTATTC 300 

CGAATGATTA AACCTGTGTA TGCCACTTTT GTTGCTGCAG TTGGATACAT TGAGTAATAA 360 

CGCAATAATT GATCTGTAAA ATCATTTCGA AGTGCATAAA TTTGATGCTC ATGTTGCCAA 420 

AAATfTCGCT CACCCATCTG CTgCAAATCC TCATGGTTCA ATTGTTTCCA GTCCAACTTT 480 

TCAACCACAC TAAAATCAAC TAACTCATAA TCCGCTTTAT TAAAATATTT TAAAAATGCT 540

GTTTCCGATT CTTTTAACGC AATTAATTGT TCTGnATTAT TCACTCGACC ACCCTTTACT 600 

TTCAATACTG TATTTAAAAT CACTTGGTAT TTTCGTTGTT TGCTTTACTT CTCTACCACG 660 

CTAAAGTGTA ATATGATTAA TAACTTATCA TTTTTAGCAA TACATTACAA CCTTTTTCAG 720 

AAAATTCGGT GTATTGATTT TAAAATTTTT TAAAATAnAA AAGGCAAGAC ATTTGTGCCT 780 

ATAAAAATGC TTAACCAAGA TTTTTATATT GaAGTTGTAC TTCTTGCACA TATTGTCCTT 840 

GCCTTATTAT GTAAAGTTAT TTTCTTTCTA TCTTTTTATT AAATTTAACT ATTCTTCATA 900

ATCCCGATTC CCTTTAAAGT AACGTCTATC TTGTTTACTA TATACATTTT CAGGATTAAA 960 
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TTGGTAAACG TTCGTTGCTG ATATATCTGT AAAATTGTTT GGACCGACAC CTGCAATAAA 1080 

CTTAAACTCT GCTTCATCTA CCAAATAATC ATACGCTTGT GTATGTCTAT CCTGTGCGCC 1140

ATGTGGAAAT ACAAACATAT CTGTTTTACC TACAATTGGT TCAACTTCAT CTTTCCATCT 1200 

TTTAGTATCA CGTTTAATAC CTTCTAAAGA TGTTTTTTCA AAATTAATGT GACCATATGA 1260 

ATGACTCGCA AATGACCATC CATCCCGTTT CATTGCGCGA ACAACTTCCT CAGCTGCCTT 1320 

TTTATTCTTT GTATAATCTT TACTCGTTAA TTCATTCGTG CGATAACCTA ATACGCCCTC 1380 

ATAACCGGTT AAAGCAACAA CACCTTTTTC ACCATTTAAA GAAAAATCTG GATGCTCTTT 1440

TACAAATTTA TTTAAAATTG GCACGATATC ATTGTCATCA GAATAAGTAG CATGGCCTTT 1500 

TTTGTCTGTA GTTTCAGAAA CAACATGTTT ATTTTTATCG AGTACTAAAC GGTCAGCATA 1560

ACCATGGTGT CTCATGTAAC TATAGTAATT CATATCATCA ATTGAGATGA TTAGTGGCTT 1620 

TTTACCTTTC GGCAATTTTA TTTTTTTGGC TTTTACATGA TGAGATGATA AGTGGTATAC 1680

ATCATGTGGA TTAACGATGA TGTAATTATT TTTATATAAT TCGTTCAATG ATTTTTTAAA 1740 

TTCACTTACA GTAATCATCC AATCATTGTT GCCCTTAGCT TGGTGTGTAT CTCCTGTAAA 1800 

CGCAACTTTT GGGTCTGTAA TTAATGGGTG ATAAAACACA TGATAAACTT GGCCGTGATA 1860

TGTTTCCCAA TGTTCATCCA TTTTcGATTT aTGCTTTGCA TACTCATTTG GATTAACAGA 1920 

TTTATTkTGA GCTTTCTCAT TTTGCTTGGA ACAGCTATAT mACAATGCAA CTGATAATAA 1980 

CAGAAAAAAT AGCAATAAAT ATTTTTTATG CATTAAACAT TC 2022

(2) INFORMATION FOR SEQ ID NO: 245: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1340 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 245: 

ATAGAGTGAT AAAGGATGGT TGTCATATGA TAAATGCAGT AGTAATAGCA GTAATTTTAA 60 

TGATTGTGCT ATGTTTATGT CGATTAAACG TnTTATTAAG CTTATTTATC AGTGCGCTAG 120

TTGGTGGCTT AATTTCAGGC ATGAGCATTG AAAAAGTTAT AAATGTATTT GGGAAAAATA 180 

TAGTCGATGG TGCTGAGGTA GCATTAAGCT ATGCTTTATT AGGTGGATTT GCAGCATTAA 240 

TTTCATACAG TGGTATCACA GACTATTTAG TAGGAAAAAT TATAAATGCA ATTCACGCTG 300

AAAATAGTCG ATGGTCAAGA GTTAAAGTCA AAGTGACAAT AATCATTGCA TTATTAGCTA 360 
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CACCATTGTT AAGTCTGTTT AATGACTTAA AAATAGATAG ACGTTTAATC GGTTTGATTA 480 

TCGGTTTTGG TTTATGTTTC CCGTATGTGT TATTACCATA TGGATTCGGT CAAATTTTCC 540 

AGCAAATTAT TCAAAGTGGC TTTGCAAAGG CAAATCACCC AATTGAGTTT AATATGATTT 600

GGAAAGCAAT GCTTATTCCT TCAATGGGGT ATATTGTTGG CTTACTTATC GGTTTATATG 660 

TATATCGTAA ACCACGTGAA TATGAAACAC GTAAAATTTC AGATAGTGAC AATGTTACAG 720 

AGTTAAAACC ATATATCTTA ATAGTAACAA TTGTAGCAAT ACTAGCTACA TTTTTAGTAC 780

AAACATTTAC AGATTCAATG ATTTTTGGTG CACTGGCAGG GGTACTCGTA TTCTTTATTT 840

CACGTGCATA TAATTGGTAT GAATTAGATG CTAAGTTTGT TGAAGGTATT AAAATTATGG 900 

CTTATATTGG TGTAGTTATT TTAACAGCAA ATGGATTTGC TGGTGTAATG AATGCTACTG 960 

GTGATATAGA TGAATTAGTT AAAACTTTAA CAAGTATTAC TGGTGATAAT AAATTATTTA 1020 

GCATTATCAT GATGTATGTG ATAGGTTTAA TTGTCACTTT AGGTATTGGA TCATCATTTG 1080 

CAACAATTCC TATTATCGCA TCATTATTCA TTCCTTTTGG AGCGTCAATT GGACTAGATA 1140

CAATGGCATT AATCGCATTG ATTGGAACAG CGAGTGCATT AGGTGACTCA GGTTCGCCTG 1200 

CAAGTGATTC AACATTAGGA CCAACTGCGG GATTAAATGT TGATGGCCAm CATGATCATA 1260

TACGTGATAC ATGTGTACCA AACTTCTTGT TTTATAATAT TCCTTTAAAT GATTTTCGGT 1320 

ACTATTGCTG CTATGGTACT 1340

(2) INFORMATION FOR SEQ ID NO: 246:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3365 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 246: 

CAAAATCTAA GAGAATAAnA TTTGTTAATT TnAAATAGCA AGCAATTCAA AGTTATATGT 60

GTAATAGATA AAATAGATAT CCCTATAGTG ATGCGTTACT AGCTAAACAT AATAACACAT 120

TAGAAGATAA TGAAGTTAAG GAGTTACTGG ATTGTTTCGA CTATGTAATT AAGTATAAAA 180

ATATCCAACG ACAAAACGTA ATTATAAAAT GGTAAAAGCT ATGGTACAGT TTCAAATTGC 240

TAATGACATG CGTATCGGTG AGCTACTTGC AATAAAGAGA GTAAATATAA ACTATGAAGA 300

TAAAACGCTA GATATCGACG GTAAAGTTAA TTGGATAACT GAAAAAAGAC GGGAGCATTC 360

GGAGTAAAGG AGACAACTGA AAGAAGTAAT AGCTATAAGG CCACAGGGCT CACTACCCAA 420
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TATTCACAAA TACGTCTGGT AGCCCTATCG ACTCGAACAA AATTAGCCAC ATTATTAAAG 54 0 

GGGGGCGCTG ATATTAGTTC TATTAAGAAA CCTATAACGA CGCATACATT ACATCATTCG 600 

CATATATCTA CACTTGCTCA ATTAGGAATT AACTTAAAAG CAATGCAAGA GCATGTAGGT 660 

CATTCAGATT ATAAAArAAA TCTAGAGATA TACACACATG TTACTAATCA GATGGCGAAA 720 

GATATGATGA ATAAATTTGA ACGATTGGGG AGTTAAAATT GGAAAAAGAT GaTACACTAG 780 

CAGAAATTAA GCCTATGCTC AATTTTGATG AGCAAATAGC AAAATTAAAA CAGATGAATA 840

TATTTTTTAA TATTATTGAC ACCGAAAAAG CAAATGAAAT TCTTAGAAAA AATAATTACT 900 

TCTTCAAACT wGcTTATTTC CGaAAAAATT TCGrAAAAAA GaATGGCGGC TATTTCATAG 960 

AATTTGCTTA TTTATCAGAT TTAGCAACTA TAGATATGaA ATTAAGATAC ACAATGTTGC 1020 

ATTTAACTTT AGATATTGAA CATAGTTTAA AGTATCTAGT CTTAAAACTA ATAACAGAAA 1080

ATAACCAAGA AGATGGTTAT AAAATAATAG ATGAGTTCTT ATGTATTGaT AAATCATATA 1140

GCAATTCAAA TTTTGACACA AATTCAAGAA CACCAGAAGA AGTTATGGAA ACCAAAATCA 1200 

AAAATAAAAA CGAAATATTC AAGCATATGA ATAAACGAGG ACAACTACCC GAGAAGTTGa 1260 

ATAAATACTA TCmAAATCCA CCCGCnnGGk TTTGCaTTGr ATTCATGCAA CTAGGTCAAT 1320

TCGTTTCGTT TCTCAACTTC TATTACAAGA AGTACAATGA CGAAGAATTG AGAGTTGCTA 1380 

ATATTTTAAT GCCTTTAGTT AAAAATATAA GAAaCAAATC AGCTCATAAC CAACCCATCA 1440 

TAGCAAATCT AAATTATGAC AGTAGATTAC CTCAATATTT ATTTGAAAAA GGGAATAATA 1500

TAGGCATATC TAGAAACATG TTCGGAATAA AAAATTTCAT AGATACTkTC ksTACGCTAG 1560 

AATTACATAA TCAAGTTTGT AGTAATGCAA TTATCCAAGC AAGATATCAC GATTTGGACC 1620 

AACTTCAAAA GCGATATAAA AGrAACGrAA GCTATTATAA TAATGCATTA GCTATCAAAA 1680 

GATTTTTTAT AGCTTTAGAT AAAATTATTG ACTTCAACAG ACCAAAAGTA TAAACTATCT 1740

AGTGAGGAAA GAGACTTATA GGTCTCGCGA GTTATTTTAA TTCGTATGCA AGAAAAAGAA 1800 

GAGCTATGCA TTTTATTTAA AATGCGTAGT TCTTtTTTTA TGCATCTAAA TTCATATTAT 1860

TTTTGCAATA TAAACATATC TTTGTGCAAA TTCCGAACAC AAAACATTCA CATCATCCTT 1920 

TTTtGCCCTT TTTCTATACC CCAAAACACA AAAAGCCCGG TAAGCCTATG CcTACGGGgT 1980

TTGACAATAA ATTATATATT ATTGTTCTTC TTTAACATAT GGTAATAATG CCATATGACG 2040 

AGAACGTTTG ATAGCTGTAG TCAaCATACG TTGATATTTA GCTGAAGTAC CAGTTACACG 2100 

ACGTGGTAAA ATTTTACCGC GTTCTGAGAT AAAACGTTTT AATAATTCAG TGTCTTTGTA 2160

GTCGATATGT GTAATACCAT TTGCTGTGAA ATAGCATACT TTTTTACGAC GACGTCCGCC 2220 
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CGTTAATTTT TATTAGAATG GTAAGTCATC ATCACTTATA TCAATCGGTC CGTTTGCATT 2340 

TGCAAATGGA TTATCAGATT GTTTCGTGTT TGATGAATTA TTGTACGAAT TGTTTTGTCC 2400 

TGATTGTTGA CCACCGAATC CTTGACCGTA ATCTTGGAAT TCATTTTGTT GACGTTGGCC 2460 

ACCATTTTGT TGCGCATTTT TAGGTTCAAG GAATTGAACG CTATCACACA CAACTTCAGT 2520 

AACAAACACA CGACGACCTT CTTGATTTTC ATAATTACGG GATTGTAAGC GACCATCTAC 2580 

ACCAGCTAAA CTACCTTTAG ATAAATAGTT ATTTACATTA TCTGCTTGTC TTCTAAAAAC 2640 

AACACAGTTA ATAAAATctG CtTCGCGCTC CCCTTGAGCA TTCGTGAACG TACGATTTAC 2700 

TGCAAGAGTG AATGtCGCTA CACTCACACC TGAGGGAGTG GTTCTGTATT CCGGATCTTT 2760 

CGTTAAACGA CCTACTAATA CAACTCTATT TAGCATTTAA ACGCCCCCTC TAATTATTAC 2820

TTGTCTTCGT CTTCACGAAT AACCATGTAA CGAATGATAT CGTCACTGAT TTTAGCTAGA 2880 

CGTTGGAATT CGTCAGTAGC TTTGTTGTTA TCAGATTTAA CACGTACGAT GTTGTAGAAG 2940 

CCATCTTTGA AATCATTGAT TTCATAAGCT AGGCGACGTT TACCCCAGTC TTTTGCTTCT 3000 

AAAACTTCTG CACCTTCAGT AGCTAAGATA CCGTTGAAAC GTTCAACTAA CGCTTTTTTA 3060 

GCATCTTCCT CAATGTTTGG GCGTACGATG TACATAACTT CATATGTTCT CATTTTATAT 3120

TTGCACCTCC TTGTGGTCTA TACGGCTTAT CAATCTTAAA ACAGATAAGC AAGGaATAAT 3180 

TTTCATTACT CACAATAAAG AATTaTATCA TGCGCCATTA CTTTTTACAA TaATAATTcA 3240

AACTACTCTT CATATCATTT TTGATATtAA TTCATTTGaA ACTTTCnATG ATATTTTnAA 3300

AAATACACTT CACAAAAGCG AACATATGTn CTATAAnAGT TGTGAGGTGG TAAGGAATGA 3360 

ATTTA 3365 
(2) INFORMATION FOR SEQ ID NO: 247: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1032 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 247: 

GTTAAAAGTA ATTGGTGGTA TTGATGATGA TTTTACAGCC AATGTTATGC ATCCAAATCA 60 

ATATCGAATT CGATATTCGT CTCAGAAACA GGACCTTAAT GAAGATATGA CAGTTTTTGA 120 

TGCAGTATTA AGTTCTGATA CAACAACTTT ACGCATCATC AAGCAATATG AGCAGGCAGT 180 

ACAAGCTTAT GCGGATGACC AAAGTGATAA ATTGTTCAAG CGAATGATGG ATGCGCAAGA 240 



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ACTAGGTATA CATGATACT A CTAAATACAT TAAAGAATTA TCCGGCGGAC AACAAAAACG 360 

TGTTGTACTT GCTAAAACAT TAATAGAACA ACCAGATTTA TTGTTATTAG ATGAACCTAC 420 

GAACCATTTA GACTTCGAAT CAATCAGCTG GTTGATCAAT TATGTGAAGC AATATCCTCA 480

TACTGTTTTA TTCGTAACCC ATGATCGATA TTTTTTAAAT GAAGTTTCCA CTAGAATTAT 540 

TGAACTAAAC AGAGGTAAGT TAGCGTCATA TCCTGGTAAC TATGAATCTT ATATTGAAAT 600

GCGCGCTGAA AGAGAAGTAA CACTTCAAAA GCAACAACAA AAGCAACGAG CTTTATATAA 660 

GGAAGAACTT GCTTGGATGA GGGCTGGgAG CTAaGGCTCG TACTACAAAG cAACAAGCTA 720 

GAATTAATCG ATTTAATGAC CTAGAmAATG AAGTTaACCA GCAATATAAA GACGATAAAG 780 

GTGAATTGAA TCTTGCTTAT TCaAGATTAG GTAAGCAAGT GTTCGAATTA GAAGACTTAT 840

CAAAGGCTAT TAATGATAAA GTATTATTTG AACATCTGAC GGAAATTATT CAAAAmGGTG 900 

AGCGTATTGG TGTTGTTGGG CCAAATGGAG CTGGTAAAAC AACACTCTTA AATATTTTGA 960

GTGGAGAAGA CCAACAATTC GAAGGTAAAT TGAAGACTGG GCAGACGGTT AAAGTAGCTT 1020 

ATTTTAAGGA AA 1032 

(2) INFORMATION FOR SEQ ID NO: 248:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 248:



TGTGATTAAC GAAGCTTATT TTCGTACACC TTCAACAACT GATTACAACG GCGTTTATCA 60

AGGTTPATTAT ATTGATTTTG AAGCAAAGGA AACTAAAAAC AAGACGTCCT TTCCTTTAAA 120

TAATATTCAT GACCATCAAG TCGAACATAT GAAAAATGCA TATCAACAAA AAGGTATTGT 180

GTTTTTAATG ATTCGTTTTA AAACGCTAGA TGAAGTTTAT CTTTTACCCT ATTCAAAATT 240

CGAAGTATTT TGGAAGAGAT ATAAAGATAA TATTAAAAAG TCTATAACAG TTGATGAAAT 300

ACGAAAAAAT GGTTACCATA TTCCTTATCA GTATCAACCA AGATTAGACT ATCTAAAAGC 360

AGTTGATAAG TTGATATTAG ATGAAAGTGA GGACCGCGTA TGACGGAAAA CAAAGGATCT 420

TCTCAGCCTA AGAAAAACGG TAATAATGGT GGGAAATCCA ACTCAAAAAA GAATAGAAAT 480

GTGAAGAGAA CGATTATTAA GATTATTGGC TTCATGATTA TTGCATTTTT CGTTGTTCTT 540

TTACTAGGTA TCTTATTGTT TGCTTATTAT GCTTGGAAAG CACCTGCTTT TACCGAAGCT 600
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TTAGATAATG GCCAAAGACA TGAGCATGTA AATTTAAAAG ACGTGCCGAA ATCAATGAAA 720 
GACGCAGTAC TTGCAACTGA AGACAATCGT TTCTACGAAC ATGGCGCACT TGATTATAAA 780 

CGTTTATTCG GTGCAATTGG TAAGAACTTG ACTGGTGGAT TTGGkTCtGA AGGtGCCTCA 840 
ACATTAACAC AA 852 
(2) INFORMATION FOR SEQ ID NO: 249:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5804 base pairs

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 249: 



CACTTTTTTC ATTAAAAATC TCATATTTAT ACACTGAACG TAATCTCGAA TATTTTTCAA 60

CCCAAGTTTT AACTTTAACT TTTTCTGGAT AAAAAATAGA CTTTTTATAA TTGACATTGA 120

GGTCAGTCAC AGGTGAAATG ATTCCTTGTT TTTCCATATC AGCATAACTA AAACCTAACT 180

TCGATATATA ATCCAACCGC GCAACTTCAA ACCAAGTTGC ATAATTCCCG TGATAAATTA 240

LAttLAIUI 1 A 1 \JJ\\3 111 \JA VjvJA 1 HALAjUo Li 1L Lril i XL TGTAATACTA TATATCATTT 300

TAAGCCTTCT TTCAGTTTAA CTTTATATCT CATTCTAACA TAAAATACAA GAAGAGGCCG 360

GCCAAGAACA CAAAGGkTTT GAACCGACCT ATTATATCAT AAaGTTTATA GAAGTATTTT 420

TGAGCACTAT CAAAGTGCCT CAAATACCGA TTAAAATTTT ACTGTGATAT CTATTTTTTA 480

TTGCGCTAAT TTATTTCTTA AAACCATTTG TAAAATTCCA CCGTGACGAT AGTAATCCAT 540

TTCAACAAGT GAGTCAAAAC GAACCATAGC GTCAAATTCT ACCAAATCAC CATCTTGCTT 600

CTTAjSCAGTA ACTTTGACGT AGtCATGTGG TTGAACATTT TCATCAATAT TAACAGrAAT 660

TTCTTCTGTA CCATCTAGAC CAAGAGAATC AGCTGATTCA CCTTTTTTAA ACTCTAATGG 720

TAATACACCC ATCATAACTA AATTTGAACG ATGGATACGT TCATAACTTT GTGCAATAAC 780

TGTTTTAAGA CCTAATAAGT TTGTACCTTT TGCTGCCCAG TCACGAGATG AACCCATACC 840

ATAATCGTTA CCAGCTAATA CAACTAAACC TGTACCATCT TCTTTATATT TCATTGCAGC 900

ATCAAAGATA GGCATTACTT CATTTGTTGG CCAATAAGTT GTAAAACCAC CTTCAGTACC 960

TGGCGCTAAT TGGTTTTTAA TACGTATATT AGCAAACGTA CCTCGAACCA TTACTTCGTG 1020

ATTACCACGT CTTGAACCAT ATGAATTAAA TTCACGAATA GGCACTTGAT GATCTTGTAA 1080

ATATTTACCA GCTGGCGTAT CTTTACCAAT TGCACCTGCT GGAGAGATGT GGTCAGTTGT 1140
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10 



TTCTTTAGAT AATCCTTGGA AGAATGATGG ATTTTGAATG TATGTTGAAT TAGGATCAAA 1260 

GTCATATAGA GGTTGATCAG TTACATCAAT CTCATTCCAT AATTCGTTGT TATTGTATAC 1320 

GTTATTATAT TCTTCAATAA ATAATTCAGG TGTTACAACA CTATCAACGG TATCTGAAAC 1380

TTCTTTAATT GATGGCCAAA TATCTTTCAA ATATACATCT TCACCGTCAT TACCTTTACC 1440

AATAGGTTCA TTTTGTAAAT CAATATCAAC CGTTCCAGCT AATGCATAAG CAACAACTAA 1500 

CTGTGGTGAA GCTAGGTAAT TGGCTTTAAC AAGAGGATGG ATACGACCTT CAAAGTTACG 1560 
GTTACCAGAT AATACAGATG TCACTAATAG GTCCTCATCA GCAATCGCTT TTTCAATTTC 1620

TGGTAATAAA GGACCTGAAT TACCGATACA AGTTGTACAT CCATAACCAA CCAAGTTGAA 1680 

GCCTAAATCA TCTAAATAAG GTTGTAAGCC AGCATCTCTT AAATATCCGG TAACAACTTT 1740 

TGATCCTGGT GCTAGAGAAG TTTTAACGTA TTCAGGAACT TTCAAGCCTT TTTCAACTGC 1800 

TTTTTTAGCA ACTAAACCTG CACCTAACAT TACATAAGGG TTAGATGTAT TTGTACATGA 1860 

TGTAATTGCT GCTATTGCAA TATCACCTGT TTTCATTGTA GCTTTTGATC CATCTTTAAA 1920 

GTTAATTTCA GCTTTCTTAT CAAATTCACT TTTATCTAAA CCGTGTCCTT GGTTGCCTGC 1980 

TGGAGCTGTT ACAGAATTTT CAAATGATGA TTTCATATCA CTTAAGAAAA TTAAATCTTG 2040

AGGACGTTTT GGTCCTGAAA GCGATGCTTC AACTGTTGAT AAATCCAATT CGATAACATC 2100

TGTATAATTA GGATCTTCTT TCTCAACATC AAAGAACATA TGGTTTTGTT TCAAATATTC 2160 

TTTTACTAGC GCGATATGTT CGTCTGATCT ACCAGTTAAC TTCATATATT TAAGAGATTC 2220

ATCATCAACT GGGAAGAATC CGCAAGTTGC TCCATACTCT GGTGCCATGT TTGCAATTGT 2280 

AGCACGGTCT GCTAGTGGTA AATGTTGTAC ACCTGGACCA AAGAACTCCA CAAATTTACC 2340

AACAACACCT TTTTTACGTA GCTCTTGAGT TACTCTTAAC GCTAAATCAG TTGCTGTTGC 2400 

GCCTTGTGGT AATGAATTTA CTAGTCGTAC ACCAATAACC TCTGGAATTG GGAAATAAGA 2460 

AGGTTGTCCA AGCATTCCAG CTTCAGCTTC AATACCACCA ACACCCCATC CTAGTACGCC 2520 

AATACCATTT ATCATTGTTG TATGTGAATC AGTACCAACT AATGTATCTG GAAATGCAGT 2580 

TTTTTCACCA TCTACATCAC GAACATGTAC AACACTTGCT AAATATTCTA AGTTAACTTG 2640

GTGAACTATT CCAGTTGCAG GAGGAACTGC ATTGTAATTA TCAAATGCTT TCGTTGCCCA 2700 

ATTTAAAAAC TGATAACGTT CATAGTTACG TTCAAATTCT AATTTCATAT TACGTTCAAG 2760

AGCTTCTGGA TTTGCATAGC TATCCACTTG AACTGAGTGG TCAATAACTA AATCCACCGG 2820 

TACTTCTGGA TTAATTTTAG TAATATCTCC CCCAACGTCA TCCATTGCTT TACGTAAAGA 2880

AGCTAAATCA ACTACGGCTG GTACACCTGT GAAATCTTGT AAAATAACAC GAGAAGGTTT 2940



GTCTGTAATT ACAAAATCAT CTTCTTGACG AAGTAAAGAT TCTAACAAAA CACGAATTGA 3060 

ATAAGGTAAA TTGGAAACTT TAGTAATACC TTGCTCTTCT ACAGCTTTTA AATCATAGTA 3120 

AGTATAACTT TGGCCATTCA AGTCAAAATG TTTTTTTGAT TGCTCTTTAA AATTTGCAGC 3180 

CATTTAATGA TCCCCCTTGA TACATTTTTA TATTTATATG CCTTGATTAA ATTGTATTAT 3240 

TATATTTATT GATAAACAAC TCATCATGCT TAGAAAACGC TTAATTTAGG TTTTGACTTT 3300

TTAATCAGAG TATATAAGCA AAACTTATCA TACAGGTAAG GTGTAATAAG TATTTTTTAT 3360 

TAATTGAGAA TAATTATCAA TTTCGCGAAT GATTCAATTC AATTTTTAAA CGTATTATTT 3420 

CATTGAGCAG AAAGAAAATT ATGGCACCAA ACTTTAATAT TTTTTTCAAT GTCATTCTTT 3480 

TGATGGGAGT GGGACAGAAA TGATATTTTC GCAAAATTTA TTTCGTCGTC CCACCCCAAC 3540 

TTGCATTGTC TGTAGAAATT GGGAATCCAA TTTCTCTTTG TTGGGGCCCA TCCCCAACTT 3600 

GCACATTATT GTAAGCTGAC TTTTCGTCAG CTTCTGTGTT GGGGCCCTCA CCCCAACTCG 3660 

CATTGCCTGT AGAATTTCTT TTCGAAATTC TCTGTGTTGG GGCCCCTGAC TAGAATTGAA 3720

AAAAGCTTGT TACAAGCGCA TTTTCGTTCA GTCAACTACT GCCAATATAA CTTCGTAGAG 3780 

CATAGAATAT TGATTTATGT CCCAGCCTGA GTTAATTTTC TATAAAAGTA TATTTAATTT 3840

GCGTTTATAC CGTCAAACTT CACTTTAGCT TTGTCAAACC CCTTTCTATT AAGTTTTCAG 3900 

AAATAAACCT ATCTTAAAAT ATAAAAAAAT CGAGAATTCG TAGTTTAATA ACGAAATTCT 3960

CGTTCTTATC CTTTTGAATA TACTCAATTT TCCACAAAAA CAAACAAGTA GTATATCTGT 4020

TCTAGCTACT AGAATGACAT ACTACTTGTT ATTAAAATAC TTAACTAAAC TTTATTAGTT 4080

ATCTTTTTTC TCTATATTTC TACGTGACTG ACGCTTTTCA AGAATGTCAG ATTCATAATC 4140 

TTCTTGTTGA CTCTTGATAT ATTCTTGTAA GCGATGTTTA TTCGGAGTCA ATGTTAAACC 4200 

TAGGAATTTA CGTTCCTGGT TCGCATCCTT GTAGAAACTT ACCATCATGA GTATGACGAC 4260 

AAAGGAGAAT GGGAATGCAC TTATAATTGC AGCACTTTGA ATCGCATTTA AAGCTTCAGC 4320 

GCCGTTACCG CCACCAGCTA ATAAAAGTAC AAATGCTATT AAGGCCTGTG AAATTCCCCA 4380 

AACAACTTTT ACCATACTAG ATGGATTTAA TGAACCAAAT GTTGTTTGCA TTCCTAATAC 4440

AAATGTTGCT GAGTCAGCAG ATGTAATAAA GAATGATGCA ATTAATAATA ATGCAATCAA 4500 

CGATAAAACA ATGCCAAATG GCACATGATT AAACACTCCA AATAGCTGTG TTTCAGGAGT 4560

CATATCAAAA ATTTCTTTGT GTTTCTTACC TGTCTCGATG CCTAATACAC CAAAGACACT 4620

AAACCAAACA AAACTAACAA TTGCTGGAAC TAGCAAGACA CCAGAAATGA ACTCTCTAAT 4680

TGAACGTCCT TTTGAAACTC GTGCAATAAA CACTCCAACG AATGGACTCC AACTTAACCA 4740 
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TGCTGTATCA AAACTATTAA ACAAGAATGT GTTTAGTAAA CTACCCGTAG AGCTAGTTAA 
CATATTTAAA ATAAGAACAG TTGGTCCAAC AATTAAAGCA GCTACCATTA AAATAGTACC 
TAAACCAATG TTCAAGTTAC TTAAGTATTG AATACCTTTA CTTAATCCAG ACCATGCACT 
TGCTATAAAT AAGATAGTAA CAACAATGAT GATAATCGCT TGTACAAACG TATTGTTTGG 
AACATTGAAC AAGTAATGTA AACCACCATT AATTTGTAGA GCACCCATAC CTAACGAAAC 
GGCTACCCCA ACGATTGTCG CAAATACAGA TAAAACGTCA ATAAAAATCC CAATAGGACC 
TTCTACTTTA TCACCTAAAA GAGGACGTAA AGTTCTAGAT AATAAACCTG GTTCACCTTT 
ACGGAATTGC GAATATGCCA ACGTAACGCA ACAACACCAT AAACAGCCCA AGCATGGAAT 
CCCCAATGGA AAAATGTTGA ACGTAGAGCT TCAGTATAAG CTTCAGTAGT TTTGGGATCT 
GCTGTAGGTG GCGTAGCAAA GTGCGCCATC GGTTCAGCTG CACCATAAAA CACCAAACCT 
ATCCCCATAC CAGCACTAAA CAACATAGCA AACCATGAAA TTGTATTAAA CTCAGGTTTG 
TCATTTGGTT TACCTAGTTT AAGTTTTCCA ATAGGACTAA AAATAAGGAA TATACAGAAG 
AACACGATAA TCGTAGTAAG AATAAGATAA TACCAACCTA ACTTTTCTGT AATCCACATT 
TTAATATTAT TGGTAACATA GTTGAATTGT TCAGGTAAAA ATGCACCAAG TAATACGACT 
ATAGCAACAA CAATTGCACT ATAGATGAAG ACTGGTGAAT ACTTCTTTCC ATTTGGATTC 
TCTGGTGAAG AAGAATTCAT AATTAATTAC TCCCTTCAAT TCTATATTTA ATTTTATGTA 
GTAGAATAAA AATATTATCT AAACATTTTA TTCAATAACT CACG 
(2) INFORMATION FOR SEQ ID NO: 250: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 250:
ACCCGCGAAT ATGGTCCATC CTATCGATTT ATTTTTAACT GGTTTGACAA TATTTAATTT 
TTCATAATCA TTCTTAGTGA TTTTGACATA TGTTTTCGGT ATGAGCCAGT TAATAAATGG 
AAAGAAGAAG ACAATCCAAT TACTTGCCAA ATCaATCATT AAATATTCAC TATCGTATTT 
GATTATTCGA TATTTAGGGT TTTTATTAAT AACTTTAGAT TCGCAAAGCA ATGTCTCCAC
ATCCCTTTAA TTTTATGTGT AATACATTTT TCGATACTTC AAAAGACATT CAAATACTAT 
CAAGTTACTG TCATCAAAGG TTTTATTAAC TGATATTtTC ATATTTTTAa TCTGAATTTA 
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(2) INFORMATION FOR SEQ ID NO: 251: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 964 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 251:



CCAGGGTGCG GAAAGCTTTA AAATTTGGAC TAATAAAGAT GCTGATATTA ATTCTATGAA 60

AACAGCAGTT TTACAACAAT TAAAAGGAGA ATAACATATG CTTACTGGcA AACAAAAAAG 120

ATACTTAAGA AGTTTAGCAC ACAATATTGA TCCGATTTTT CAAATTGGAA AAGGCGGTAT 180

CAACGAAAAT ATGATTAAAC AAATAGATGA TACGTTAGAA AACAGAGAAT TGATTAAAGT 240

ACATGTACTA CAAAATAACT TTGATGATAA AAAAGAATTA GCTGAAACAT TAAGCGAAGC 300

TACTCATAGT GAATTAGTGC AAGTGATTGG ATCTATGATA GTGATTTATA GAGAATCTAA 360

AGATAATAAA GAAATTGAAT TGCCATAATA ATGAAAAAGA TAryACTTTA CGGCGGTCAG 420

TTTAACCCTA TCCATACTGC ACATATGATA GTAGCTAGCG AAGTATTTCA TGAATTACAG 480

CCAGATGAAT TTTATTTTTT ACCTAGTTTT ATGTCTCCAT TGAAAAAGCA CCATGATTTT 540

ATAGACGTTC AGCACAGATT AACAATGATA CAGATGATTA TCGACGAGCT TGGTTTTGGA 600

GATATTTGTG ACGATGAAAT TAAACGTGGT GGTCAAAGTT ATACCTATGA CACGATCAAG 660

GCATTCAAGG AGCAACACAA AGACAGTGAG TTGTACTTTG TTATTGGGAC GGATCAGTAT 720

AACCAACTAG AGAAATGGTA TCAAATTGAA TACTTAAAAG AAATGGTTAC TTTTGTAGTT 780

GTAAATCGAG ACAAAAATAG TCAAAATGTT GAAAATGCTA TGATTGCAAT TCAGATACCT 840

AGGGTAGATA TAAGTTCGAC AATGATTCGA CAAAGAGTTA GTGAAGGGAA ATCTATCCAA 900

GTTCTTGTTC CTAAATCCGT TGAAAACTAT ATTAAGGGGG AAGGATTATA TGAACATTGA 960

AAAA 964



(2) INFORMATION FOR SEQ ID NO: 252: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1193 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 252:



TTGTCATGAA ATAAATGGGA TGAATATCAC GACTAGAAGT AATGTTACGA ACAGGAGCGT 120

ATAAACTAGA GACGCTAAAT TCGACATAGT ATGTnGCTCA ATTATGGCTG ATGATGAATT 180

TAAAGTATGT GCGTTGGAAC TGTCGGATTT TTGTTCATAA TGTTTTGCAT ATTGCGCCAT 240

GATGAATAGT GTAAAAATAA ATAAAACAAT AAGAGATATA ATGCCCATAA TCAAAAGTAT 300

TTGTTTAGAG CCTTTCATTA TTTCACATCC TTTCTAAAAT ATATTTGTAA CTAAATTTAA 360

AATAGTTATT TTTGTAATTC TAAACCTTTT TCATCGCGAA AACAATTAAA TAGGTCGCGG 420

TATTAATTAT TATATTATTA CCGCTTAATA TGAAAAATAC ATGAaAATTA ATTTTCTAAT 480

ATACTTTTGA AAAATTATTA CAAATTAGCC CCTTCAAAAC GCGAAAACAT AAGGATTCTA 540

GTTTCAAAAG GGCTGATAAG CATAAAATGA AATGTAATAT TTCGATGTAT AAAATTTTTA 600

ATTAGCTAAA AATCATCGCA TTAATTTTTT GAGCTACATC ATCAAAATTC GGACATTTTA 660

ACGACACATA TAATTTAATT TTAGGTTCAG TACCAGAAGG ACGTAAAGCG ATAAATCCTT 720

CGTCAAATAA GACACGAATA ACATTTGATT TAGGAGAATT AATCTGCGAC GTTGTATCTT 780

TATCCAAATG ATAAACCTCG CTAGTTAAAT AATCTTCAAT TGCTTTCACT TTGAGTCCTT 840

GAATCTCTTG CGGTGGATTT GAACGGAATT TGGTCATTAT TGCATTAATT TTCTTTTTCC 900

CTTCAAATCC TTCTAGCGTA TGGGgAATAA TGTATCCTCA TGTCTACCAA CAGTTTGATA 960

AATCTGTTCT AATTCATCTT TCAATGTTTT GCCATATAAT TTTAACTCAG AAGCGTATTT 1020

TATAATGAGT GGCACAATTT GTACGGCATC TTTATCACGT ACAAAAGGCT CTGATAGAAA 1080

ACCGTAACTC TCTTCAAATG CGAAAATCAT ATTTGATGAT CATCCAGTTG TCTTATTTCC 1140

TGAGCAATAA ATTTAAGCCC GTCAGCACCT CTTTGGTATT CAACATTATT ATA 1193



(2) INFORMATION FOR SEQ ID NO: 253: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1098 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 253: 



TATCAGGATG ATTATGnCGG GGTTTTATTA AGTCTGAGCG TATATnCCTT TTATTCTCCT 60

AAATGGTTTG GAATTCAAGG AGAAGACGAT GAAATGGTAT CCAAAAACCA AnGAATATAT 120

nGCATTAAGT CTGAGGATGA TAGTGCGGTG GCAATTCGTT CATTAATTTT GCATAAAGAT 180

GAACCTATGT ATTTAAAAAA ACGTACATGT GTACCTACTT TGTTAATTAA TGGGGAACAT 240
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10 






AAAAAAATCT TCGAACATTC AGGACATGCA CCGCATATTG AAGAACCAGA AGCATTTATG 360 

AATTATTATT TAAAATTTTT AAAAAGCGTA TCATAATATG TGATATATAA ACCTAGGGCA 420 

TAAAGTCCTT AGGCAATGTG AAAAAGCTGA TTACTATTCA TTATTTGATA GAAATCAGCT 480

TTTTTTGAAA TGTATTTGAT ATATACTGCT CGTTATGCGG CTATCTTCCT TATATTAAGT 540

GCCATTAGTG CAAAACCTCT TAACAATTAG GTAAAAAGAG CATAAAAAAA GGAAGTTTAA 600 

TAGAATGTAT CATCTATCAA ACTTCACCAA ATTGCGCTAA ACAAAATTAT AGTTCAATTT 660 

CGTTGTTTGC TTCAGTGATT CGTTTATTTA CTCGACTCAA TAATGATTCG ATTTTTTTAC 720 

GTTGTTGTGC ATTAACAAGA ATTAATACAG TTCTTTCATC ATGCTCATTA CGTTTTTTAT 780 

CGAAGTAATC TTCTTGAGAT AAAATTTTAA CTGCTTTAAC AACTTGTGGT TGTTTGTAGT 840 

TTAAATGATT AATAATATCt TTAAGATAGT ATTCTTTCTC TTTGTTTTCG CTGATGTATG 900 

TCAATACAGC GAATTCTTCA AAGCTAATTG AnAATTCCTT TTTAATTAAA CTTTTTAATT 960 

TGTCAGCATA AGTGACCATT GATAACAACT CAAAGCAATC ATTGATTTTT GTAATTGCCA 1020 

TGTTTAAAAC CTCCCTATTT GATGCATCTT GCTCGATACA TTTGCCCCGA TAATATAnTG 1080 

TATCTAATCT TTATGnAT 1098

(2) INFORMATION FOR SEQ ID NO: 254: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2881 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 254: 

CCAGGTAAAA TTGTGCAATC ATTTGACGCA TTAATGGACG CCTTGGACAA TGAAGATTAT 60 

GAAGGAGAAA AAGTCATTCC ATTCTTAGAT AAACATTTTA AATATCAAGA TGGCCGATCA 120 

AGTGAGCGTT TAGTCAGAAA TTTATTTGGT AGCTAAGTTT ATATAGTAGT CAAAGTGGGA 180 

GAGGTATAAT GATGAAATTT TCAGTAATAG TTCCAACATr CAATTCAGAA AAGTATATAA 240 

CAGAATTACT TAATAGCCTT GCGAAACAAG ATTTTCCGAA AACTGAATTT GAAGTGGTTG 300

TAGTTGATGA CTGTTCAACA GATCAAACGT TACAAATAGT TGAAAAGTAT CGCAATAAAT 360

TGAACTTGAA AGTAAGTCAA CTCGAAACAA ATTCTGGTGG TCCAGGTAAA CCTAGAAATG 420

kGGCGTTAAA ACAAGCAGAA GGTGAATTTG TATTATTTGT GGACTCCGAT GACTATATAA 480

ACAAAGAGAC TTTAAAGGAT GCAGCAGCAT TTATTGATGA ACATCACTCA GATGTCTTAT 540






CACCTGAAGT TACTTTGTTA AATTCAAGAA TTATCTATAC TTTAAGCCCG ACTAAAATCT 660 

ATAGAACAGC ATTACTAAAA GATAATGACA TTTATTTTCC AGAAGAATTA AAGAGTGCAG 720 

AAGATCAATT ATTTACAATG AAAGCATATT TAAATGCAAA TCGAATCAGT GTGTTAAGTG 780

ATAAAGCGTA TTATTATGCT ACAAAGCGTG AAGGTGAACA TATGAGTAGT GCGTATGTTT 840

CACCTGAAGA CTTTTATGAA GTCATGAGAT TGATTGCTGT AGAAATATTA AATGCAGATT 900

TAGAAGAAGC CCATAAAAAT CAAATCTTAG CAGAATTTTT AAATCGTCAT TTTAGTTTTT 960

CTCGTACGAA TGGCTTCTCA CTTAAAGTTA AACTAGAAGA TCAACCACAA TGGATTAATG 1020 

CTCTAGGAGA CTTTATACAA GCAGTTCCAG AACGTGTAGA TGCATTGGTG ATGAGTAAAT 1080 


TACGACCATT GTTGCACTAC GCGAGAGCGA AAGATATAGA CAACTATAGA ACTGTGGAAG 1140 

AAAGTTACCG TCAAGGTCAA TACTACCGTT TTGATATTGT AGATGGTAAA TTAAACATTC 1200 

AATTCAATGA AGGCGAACCA TACTTTAAAG GCATTGATAT CGCTAAGCCA AAAGTGAAAA 1260 


TGACAGCATT TAAATTTGAT AATCATAAAA TTGTTACAGA GCTAACGTTA AATGAATTTA 1320 

TGATTGGCGA AGGACATTAT GATGTCAGAC TTAAATTACA TTCACGAAAC AAGAAGCACA 1380 

CAATGTATGT ACCTTTAAGT GTCAATGCGA ATAAACAATA TCGTTTTAAC ATTATGTTAG 1440

AAGATATTAA AGCGTATTTA CCTAAAGAAA AAATTTGGGA TGTTTTCTTA GAAGTCCAAA 1500 

TAGGTACGGA AGTATTTGAA GTGGGTGTTG GTAATCAACG TAATAAATAT GCATATACTG 1560 

CAGAAACAAG TGCATTAATT CATTTGAATA ATGATTTTTA TAGATTAACA CCGTATTTCA 1620

CAAAAGACTT TAATAACATT TCGTTATACT TTACAGCTAT TACATTAACG GATTCAATCT 1680

CATTGAAGTT AAAAGGTAAA AACAAAATCA TTTTAACTGG TCTGGATCGT GGTTATGTAT 1740

TTGAAGAAGG TATGGCTAGT GTCGTACTAA AAGACGACAT GGTGATGGGA ATGTTAAGCC 1800

AAACfiTCAGA AAACGAAGTG CnAAATCTTA CTTAGCAAAG ATATTAAAAA GCGAGACTTC 1860 

AAAAATATTG TTAAGTTAAA CACTGCACAT ATCACTTATC CACTAAATAA ATAATAAATG 1920 


CCCTCAAATC ATTGTGAGCC AACATGATTT GAGGGCTTTA TTTTGCTGTT TATGACATGA 1980 

TTATGACATT TCCCTGATTT TCATTTTCAT ATACATTAAA TTGTATACAC TGGAAATGAG 2040 

GAGGTTATCT ATAATGATAA ATAAAAATGA CATAGTAGCA GATGTAGTAA CTGATTATCC 2100 


GAAAGCAGCG GATATTTTTA GAAGTGTGGG AATAGATTTT TGTTGTGGCG GACAAGTAAG 2160 

TATAGAAGCA GCAGCCTTAG AAAAGAAAAA TGTAGATTTG AACGAATTAT TACAGCGTCT 2220 

CAACGACGTT GAACAAACGA ATACACCAGG TTCGTTAAAT CCTAAATTTT TAAATGTTTC 2280

ATCACTTATT CAATATATTC AATCAGCATA TCATGAACCT CTAAGAGAAG AATTTAAAAA 2340

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TGAGTTAAAA GAAACATACG ATACATTTAA AAATGGCATG TTAGAGCATA TGCAAAAAGA 2460 

AGACGATGTC GATTTTCCAA AACTCATTAA ATATGAGCAA GGTGAGGTAG TAGACGATAT 2520 

TAATACTGTG ATAGATGATT TAGTTTCAGA CCACATTGCA ACGGGAGAAT TGTTAGTAAA 2580

AATGAGCGAA TTAACATCTA GTTATGAACC TCCGATAGAA GCGTGTGGTA CTTGGCGACT 2640

TGTTTATCAG AGATTAAAAG CACTTGAAGT GTTAACACAT GAACACGTAC ATTTAGAGAA 2700 

TCACGTATTA TTTAAAAAAG TATCATAAAT AACGCGATTA GAAACTGTTG GCAAAAATAA 2760 

GTCCAGCAGT TTTTCGCTAT GTATAAAAGT CATAATAGTG ACATAAACAG CATTATTTGA 2820 

AAAGAAnAAT GGTCAACTTA GCATAAAAAT TGATATGAAn ATTTAATGGT ATAGATAATT 2880 


A 2881 

(2) INFORMATION FOR SEQ ID NO: 255:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1056 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 255:





ACCGTCGAAT ATCGCTTGTG ATTTACAATT TGTGTATTAA GATGCTCAAC TAATTTGGGT 60

ACATATTCCG AATTTAGATT TGCAAGTACA ACAATTCCAT AATTTTGTTT TGGATTTAGT 120

AAAATAAATG ATGAAAAGTT ATCTAGCGTT CCTGAATGAA ACACTAAATG TTCATCATTA 180

TTGGTAAACC AGCCGGAAGC ATATGCATTG GCATTAGGTT CACCAATTGT TGAAGATAAA 240

TTTTTATGTG ATTGTTGAAC TAATGATTTG TATTTATCAG GTGGATTAAG TTGGAATTTT 300

ATCGAATGTT CCAAATCTTC AGTTGATGTC ATCATATATG CTGATGGTGT ATCCCAAAGG 360

TTAAATTCAG GTTTAGAGAC GACAGGTGTC GAACCTTGTA ATTCATAGCC AATAGCATCA 420

TGTTTTGATT TGTAATTGGT TTGTTTGAAT GATGTATGTG TCATATGCAA AGGCTTGAGC 480

CATGAATTTG TAATATATTT TGTATAGGAT TGCTTCGTAA CGTTTTGGAT AATTAAACCT 540

AATAAATCAT AGTTCATATT TGAGTATTCA AATTCTTCTC CGGGCTTATG ATGTAATTCA 600

TCACCCATAA TTGCATGGGT TACATCATTT AAACGATTAT TTTTGCTTGT CACAGAATCT 660

tCGCTTGTAA TAtCACTAGG TATACCACTT GTTTGAGCCA AAAGTTGCTT AATCGTAATA 720

GTTTCATTTT GACCATTATA GTTCATTTTA AAATGAGGCA CATGTTTGGA TACGGCATCA 780

TTTAAGTTTA ATCGACCTTC TTGAGCTAAT TTTAAAATTG CAAGACCTGT GAAAGCTTTC 840
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TGATAACCAT AACCTTTATT TAAAAAAACT TTGCCATTTT TTACTAyTAA AATTGATGCT 960 

CCAGGAATGT GTCCCTTTTG TAAATCATGC TCGATAATTG TATCTATTTG TTGTTGCGAA 1020 

TCATTGGTTA ACCGTGTCTT CGTATTGCTA TTTAAT 1056

(2) INFORMATION FOR SEQ ID NO: 256:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1277 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 256: 

ATGCCaCACT TATTGGTGGC AGGATCGACG GGTAGTGGTA AATCTGTTTG TATAAATGGT 60 

ATTATTACAA GTATTTTATT AAATGCTAAG CCGCATGAAG TTAAACTTAT GTTAATCGAT 120 


CCGAAAATGG TTGAACTAAA TGTTTATAAC GGAATTCCAC ACTTATTAAT TCCGGTTGTT 180 

ACAAATCCTC ATAAAGCTGC TCAAGCTTTA GAAAAAATTG TAGCTGAGAT GGAAAGACGT 240 

TATGATTTAT TCCAACATTC ATCAACTAGA AACATTAAAG GTTATAACGA ATTAATCCGT 300

AAGCAAAATC AAGAATTAGA TGAGAAGCAA CCAGAATTAC CTTATATCGT TGTTATTGTA 360

GATGAGCTTG CAGATTTAAT GATGGTAGCT GGTAAAGAAG TTGAAAATGC GATTCAACGT 420

ATTACACAAA TGGCACGTGC AGCAGGTATA CATTTAATTG TAGCGACACA AAGACCTTCT 480

GTGGATGTAA TTACAGGTAT CATTAAAAAT AATATTCCAT CTAGAATAGC TTTTGCTGTG 540

AGTTCTCAAA CAGATTCAAG AACTATTATT GGTACTGGCG GCGCAGAAAA GTkACTTGGT 600

AAAGGTGACA TGTTATACGT TGGAAATGGT GACTCATCAC AAACACGTAT TCAAGGGGCG 660

TTTTTAAGTG ACCAAGAGGT GCAAGATGTT GTAAATTATG TAGTAGAACA ACAACAGGCA 720 

AATTATGTAA AAGAAATGGA ACCAGATGCA CCAGTGGATA AATCGGAAAT GAAAAGTGAA 780 


GATGCTTTAT ATGaTGAAGC GTATTTGTTT GTTGTTGaAC AACAAAAGGC aAGTACATCA 840 

TTGTTACAAC GCCAATTTaG AATTGGtTAT AATAGAGCAT CTAGGTTGAT GGATGATTTA 900

GAACGCAATC AGGTAATCGG TCCACAAAAA GGAAGCAAGC CTAGACAAGT TTTAATAGAT 960

CTTAATAATG ACGAGGTGTA AAAAAATGTC AGAAATGAAT GCGGTATATA ACGTTAAACA 1020 

ATaCATTTtA AATTTgATTA AGCAAAATAA ATTGGAATAT GGTGACCAAC TTCCAAGTAA 1080 

TTTATCAATT GCCAGAGAAT TAAATGTAAA AACCGACGAT GTTTATGAAG CAATTCAGcA 1140

TTGATTACTG AACAAGTCAT TAAAGATAtT TTGAAGAGGG CACAAGTGTT AAGTCACTGC 1200 
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GrtTTGAATG CGGAACT 1277 
(2) INFORMATION FOR SEQ ID NO: 257: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3557 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 257: 

TACCGTCGTT TTATGyGTCA AATTTTACAG TAAATTTTGC TTCATCAAAA GAAATAACCT 60 


TTAACAAGTA TAGTAATTTT ACATTTTACA ATGTTACAAA ATATAATTTT TTATAATTAG 120 

TTAAAATCAC TAAAACGCTT TTATACACTA TCAAATCAGC ATTTATAAAA ATATGAACCG 180 

ATATCCTAAA ATGTTAATAA TATTACAAGA TAATAACArA CCACACAAAG CTACTTATTT 240


TTGATAATAT GGAAATCGTA ATATAAAACn AAAACTTAAT TTACTATATA AATTGTCTTA 300 

ATAATTTTTA AAAGTAGTAA AACATAATTT TAAGGAGGAG TCCCTTTGAA AAAATTAGCA 360 

TTTGCAATAA CAGCAACATC TGGTGCAGCT GCATTTTTAA CGCATCATGA TGCACAAGCT 420

TCTACACAAC ATACAGTACA ATCTGGTGAA TCATTATGGA GTATTGCTCA AAAATACAAC 480

ACTTCAGTAG AGAGTATTAA ACAAAATAAC CAATTAGATA ACAACTTGGT ATTCCCTGGT 540

CAAGTTATCT CAGTAGGTGG AAGTGATGCA CAAAATACGT CAAACACTTC TCCACAAGCT 600

GGTTCAGCAT CATCTCATAC TGTACAAGCT GGTGAATCAT TAAATATCAT TGCTAGCAGA 660

TATGGTGTTT CAGTTGATCA ATTAATGGCA GCCAATAACT TACGTGGTTA TTTAATTATG 720

CCTAACCAAA CATTACAAAT TCCTAATGGT GGATCAGGTG GTACAACACC AACAGCTACA 780

ACAC25TAGCA ATGGCAATGC ATCATCTTTT AATCACCAAA ATTTATACAC TGCTGGTCAA 840 

TGTACATGGT ACGTATTTGA CCGTCGTGCT CAAGCTGGTA GTCCAATTAG CACATATTGG 900 


TCAGACGCTA AGTATTGGGC TGGTAACGCA GCTAATGATG GTTACCAAGT AAACAACACA 960 

CCATCAGTTG GTTCAATTAT GCAAAGCACA CCTGGTCCAT ATGGTCATGT TGCTTATGTT 1020 

GAACGTGTCA ATGGTGATGG TAGTATCTTG ATTTCTGAAA TGAATTACAC ATATGGTCCA 1080 


TACAATATGA ACTACCGTAC AATTCCAGCT TCAGAAGTTT CTAGCTATGC ATTCATCCAT 1140 

TAATTAAATA AATTGTACTG ATATATACTA GCAATTCACA TCATGTGAGA TTGCTAGTTT 1200 

TTTATTTTTG AAAAAAATTT TCATTTTGGT ACAAAAAATT ATCTCACCCT TCCCTATCAT 1260 


ACATATTTAT ATTTTGTATG AATGGTAGTT AGGTAAAAAT TAACAACCTA CCTATTTGAT 1320 

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ATTTAATTTG TTATACCAGT ATTTTACGCT 
TTTTCGTCTA CATATACAAA TTTATATTAA 1440

ATAAAGCCCA ATACAATTTA GGTTAATTAA ACAAGTTGAT AACTATTTAA TTATTCCTTC 1500

ATTGAAGAAT ATAAACTATT AAATCATTAT TTTGCTCTTA CATATATTTT AATGACCTAA 1560

CTGaTTATGT TCCATGGAAT ACATTTATAA TATAGCCTCC TAATTAArAT GCyTTGTCTT 1620

GGTCATTCTA CGTAAATTCT ATAAAATATG TTATCTACTT ACATAArArn CTGrACTTCA 1680

ATACCACCAT ATGTTTGTGA TACTGAAGTT CAGTTTAGTT TTATTTTCAA TTAGAAAAAT 1740

AAGTTAAGTA TATAGAATAG TAAACCTGCT AACAATGCTG AAATAGGTAA TGTAATCACC 1800

CATGTAATGA TCATTCGTTG CGCAgTGCTC CATTTTACAC CTTTAGCTCG GTTAGAAGCA 1860

CCAACACCTA AGATTGATGA TGACACAACG TGAGTTGTTG ATAATGGGAA ATGTAGCGAT 1920

GATGCAACAA AAATTGTTAA TGCAGATGAT AAATCGGCCG CAGCACCATT TGCTGGACGT 1980

ATTTTCATAA TATTACCACC TACAGTTTTG ATAATTTTCC AGCCACCAAT TGCAGTACCA 2040

AGCCCCATTG cTGTCGCACA GGCAAATTTT ACCCATAACT GTGGTTCAAC ACTGCCATCA 2100

TTCTGTACAT TAGCGACAAT CAATGCCAAC GTAATAATAC CCATTGATTT TTGCGCATCA 2160

TTCGTACCGT GAGAGAATGA TTGTAACGCT GCTGTGAAAA TTTGGAAAAA TCTAAAGTTA 2220

CGATTCGCTC TTGTTAAATT TGCATTTTTA AAGATAACTT TAAAAATTGA ATACATCAAG 2280

AAACCAACAC AAAATGCGAT AATCGGTGAA ACGATTAATA CAATAATAAT TTTTGTGAAA 2340

CCTTGGTAAT GTAACACTCC AAATGAGCCT TCAGATGCGA TTGCTGCACC CGCAATTGAA 2400

CCTATAAGTG CATGTGAAGA CGAACTTGGA ATTCCGTAAA ACCAAGTAGC TAAATTCCAA 2460

ATAATAGCCG CAAGTATTGC AGCTAACACA ACAACTAATC CATTTTCCAA TTTAAATGGA 2520

TCGACAATGT CTTTAGTAAT GGTGCCTGCA ACGCCCGTAA ATGTTAAAGC ACCTATAAAG 2580

TTCArTCACTG CTGCCATTAA AATTGCCGTT TTAGGgTTAA CGCTCTAGTA GATACAGCAG 2640

TAGCTACTGC ATTGGcTGTA TCATGGAaTC CcATTGATAA AGTCAAATAT CAGCGAGAAA 2700

ATAACTACAG CTATAGTGAC GATGATTATA TATGACATAA ATATATACTC CCCTTAGCTA 2760

TTTTTCATAA TAATAGTTTC AAAATTATTT GCTACGATTT GACATTTATC AGCGATTTCT 2820

TCCATGCTTT CATAAATATC TTTTATTTTA ATTAAAGTGA TTGGATCTGT TTCGCTATTG 2880

AAAATATGTT TAATTGACTG TCTTAAAATA CCATCACAGT TTGTTTCAAA TTCTTTAATA 2940

TTAATTGAAT GAATACGCAT ATGTGATAAT TTTTTATCGA CTAATAAGCC GACAGCAAGT 3000

TTCATTTCTG CAACTGCTTT TTGAATGTTA TCAACAAACT CAGCCATATA TTCATCTGTG 3060

TATTCGATTG AATACATTTC AAACATrGCT GCCGTTTCTT CAATTGCATC TAAAACATCA 3120
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TTTAAATCAG 


TAATTACTTG ATGTACTAAc tCGcACCATG TGACTCATAA GTTTTAATGT 3240

TGTCTGAGTA TGCTTTTAAA TCTAAATGTG TATTGAAATC CATTTTACCG AATTCAATAG 3300

CAGCACGATC CAGATTGAAA ACCATCTCTT CTAATTGAAC CATAAACTTA TCTTTTTTCT 3360

TACTAAACAT TTAAAATCCT CCATTTAAGC GATTGTCACC AATCACATTC AGTTATAATT 3420

TGTTTCAAAT TAAGACAAGT GAATTTACAA ACTAATGATA CAAATTTGTT ATTATCAATC 3480

GTCAGTATAA TTTTAGTGTA CTGATATTAA TTTCAAAAAT GCCTCACAGT AAACAATTTA 3540

CTGTATTTGC CCTTATA 3557


(2) INFORMATION FOR SEQ ID NO: 258: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1631 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 258:


AACTATACAT TTCGAAAAAT TCTTCTAGTG AACCTGCGCC ACCAGGAGCC ATGACAAATG 60

CATCTGCAAG TTCTGCCATT TTATTTTnAC GTTCATGCAT AGAATCAACT AAAATTAATT 120

CAGTTAAACG TTGGCTTGTG ATTTCATGTT CATCTAACAT TTTAGGCATG ACGCCAATAG 180

CTTTGCCGCC ATGATCTAAT ACACCATCTT GaATGGCACC CATAATGCCA ATTGACCCTG 240

CACCAAATAC TAATTCATAA CCTTGTTCAG CAAAATATTT ACCTAAATCG TATGCTTTTT 300

GTACATATGA AGGGTCATGA CCTTTGCTTG CACCACAATA AACTGCGATT CGTTTCATGT 360

TAATCCAGCT CCTTAATTCG ATGAATGACT TTTAATAGTG ATTGTTCAAA CACTTTTTGA 420

TCTT9CTTTG TAAAAGGTGG GGGACCTTTG TGGCGACCAC CTTGTTTTCT AATTTGTGCA 480

TTCATATATC GTTTATCTAA TAGTTGTTGA ATATTTTTGG AATTGTATAT CTTCCCATTA 540

TGATGCATGA CAATTAAGAC TTTGTCGACT AATAAACTTG CGAGTCCATA ATCTTGAGTG 600

ACTACGATAT CATCCTTCGT TGATAATTGA ACAATTTTGT AATCAACTGC ATCTGGTCCA 660

TCATCAACAT ATAATGTTGA TACATGTGGA GGATATAATT GGTTCGAAAA ATGGCTGAAG 720

CTCCGAATAA TTGTCACAAA AATGCCTGTC TCAGTTGTTA AATCTATAAT AGAATCAACA 780

ACAGGACAAG CATCTCCATC AATAATAATA TGTGTCACAA TTATGCCTCT GTATTGTTTT 840

CTTTATTTTG TTGAGAGGCG CTTTTGGCAA CATAATCTTT ATATTTTTTA AATGACTTGA 900

TGCGTGCTTT ATCAGCTTCT TGTTGGCGTT TTTGTTCTTC TTTGTGTCGT TTTTCAATAT 960
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CGC C TT TT T T CTCAGTTTTC TCATCTAATT TATTAGGTGT TAAGCCTGCT TTTTCTTCGT 1080 

ATTTTTGTGA TTTTTTCATA TCTTTAATAC GTTGTATTTC ATTCTTTTCG CGGGCTTTTT 1140 

GCTCTTCTTT ATGACGCTTT TCGATATTTT TTTGAAGTAT TTTATTCATT TTATCAGCGT 1200

CTTTACGATT TTGTTTAGCT AATTTTTCGC CTTTTTTCTC AATATAGGCA GGATCATGTT 1260

CTCTAGCAAA CTTTTTAAGT TCACGTTTAT TTTCAAAATC TTGTTTTTTA TCGCCGACAT 1320 

ATTCTTTAAC ATCACTCGCT GTGTTACTGA TTGCTGCAGA TGTTTTTGAA GCAACTTTAC 1380 

TTGTAGCATC TGTAACTTTT TGTACGTCCG GATGTTGTTT GATACGTTTA CGTTCAACAA 1440 

TTAACGGTAC CAATACAATT GGTAATACAT TAATCATAAA TTTGATGACT TTTTTCTTAT 1500 


CCATAGATCT TGcCTCCaTA ATTACTTTAT TAAtTTTACa TACCCtATGa TACATCAATA 1560 

TAAACGATGA TAGTAGTGAA TCACTATTAA GTATTTCAGA TGTTTtTTAA AAgaAGaCCC 1620 

AATTAnAAAA A 1631

(2) INFORMATION FOR SEQ ID NO: 259:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6645 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 259:



CGAAATCATG ATTTAATGCT TTTTCATATA AGCTTTTCCA ATTAATCTTT CGTCCATGAT 60

ATTCTTCAAC TGTTGCTAGA TATTGTGCAA TTTTAGTTAC TTTAAAGGAG TGTGCTGCAA 120

CaTTGTGkTC mAAATATTTA AATTTTCCaG GtAATCTTAT AAGTCTTTCC aTATCTGATA 180

ATCTJTTTAAA ATATTGATGT ACACCCATTT CAATTACCTC CTCCATTAAT TAATCATAAA 240

TTATACTTTC TTTTTACATA TCAATCAATT AAATATCATT TaAATATCTT CTTTaTATAA 300

cTCTGATTAA ATGATACCAA AAAATCCtCT CAACCTGTTA CTTAAACAGG CTAAGAGGAT 360

AGTCTTGTCT TGATATATTA CTTAGTGGAT GTAATTATAT TTTCCTGGAT TTAAAATTGT 420

TCTTGAAGAT TTAACATTAA ATCCAGCATA GTTCATTTCA GAAACAGTAA TTGTTCCATT 480

AGGGTTTACA GATTCAACAA CACCAACATG TCCATATGGA CCAGCAGCTG TTTGGAAAAT 540

AGCGCCAACT TCTGGTGTTT TATCTACTTT AAATCCTGCA ACTTTTGCTG CGTAATTCCA 600

GTTATTTGCA TTGCCCCATA AACTTCCTAT ACTTCTACCT AATTGTGCAC GACGATCGAA 660

AGCATAATAT GTGCAGTTTC CATAAGCATA TAAGTTTCCT CTGTTAGCAA CTGATTTATT 720



TACATTAACT GTCTTAGTTA CTGCTTGCTT AGGTGCTTGC TTAACTACTA CTTTTTTAGA 840

TGCTTGTTGT ACAGGTTGTT TTACTACCTT TTTAGCTTGG CTTGCTTTTC TTACTGGTGA 900

TTTAACCGCT TTAGTTTGTT TCACTTTATT TTGAGGCACA AGTGAAATCA CGTCACCAGG 960

AAAAATTAAA GGTGTTACAC CAGGATTGTA TGAATATAAT TGATTCAACG TTAAGTGATG 1020

CTCTAAAGCA ATCTTATATA ATGAATCGCC AGCAACTACT GTATAAGTTG TCGGTGATTG 1080

CGTTTGTGCT TGAACATTTG ATACATAATT ATGTTGAACA GGTGTTTTTA CTTGTGTGCC 1140

ATGTTGTTGT GCATGTGCTG CATTATTTAA AGCTAAAAAA GCTAACACTG ACGAAACCGT 1200 

CACTGTAAGA GATTTTTTCA TCTTGCTGTC ATTCCTTTGC TGTTAGTATT TTAAGTATGC 1260 


AAATACTATA GCACAATACA TTTTGTCAAA AGCTATTGTT ATAACGATGT AATCAAATGG 1320 

TTAACAATAT AAAAAGAATA CAACCTTTTA TCATAGTGTA AAATGTATTC ATACCATGTA 1380 

ATTGAGAACG TTTTCAATAA TTAATTCAAT ACCTTGAAAA TCGCCATAGG TAATATTACT 1440

AAATGCACAC TGCATATGTT GTTTTAACAA ACACAACTTT TAAAAAATAT ATTCTAACTC 1500

TATCTACCGA ATTGTACTTA AATATTCATA AACAAATCAT ATTCCAAAAT CTAATTTACA 1560

ATTTATTTAG CTACCTTTAA AAAAACCAAA AACCGACGCC CTTTTAGAGC CTCGGTTTTA 1620

AAATATATAT TAATCGTGCG ACATTGTCTG TCTTAAATAT GATTCGATAA ATGGTCCAAT 1680

GTCTCCATCC ATCACTGCAT CAACCTTACC TGTTTCTTCG TTCGTACGAT GATCTTTCAC 1740

CATTGAGTAT GGATGGAAAA CATATGATCT AATTTGGCTT CCCCAGCCGA TTTCTTTTTG 1800

TTCGCCACGA ATTTCAGCCA TTTCACGTGC CTGCTCTTCC AATTTTAATT GATATAATTT 1860 

AGACTTTAAC ATTTTCATAG CTGCTTCACG GTTTTTAATT TGAGAACGTT CATTTTGGTT 1920 

ATTAACAACT ATACCTGAGG GGTGGTGGGT AATTCGTATT GCCGATTCAG TTTTGTTAAT 1980

ATGCTGACCA CCTGCACCAG AAGCTCTGAA TGTATCAACT GTAATATCAT CCGGATTGAT 2040 

TTCAATCTCT ATTTCATCAT TATTAAAATC TGGAATAACG TCGCATGATG CAAATGATGT 2100 


ATGACGACGT CCTGATGAAT CAAATGGAGA AATTCGTACT AGTCGGTGTA CACCTTTTTC 2160 

AGCTTTTAAA TAACCATAAG CATTATGCCC TTTGATGAGC AATGTTACAC TTTTAATCCC 2220 

CGCTTCATCC CCAGGTAGAT AATCAACAGT TTCAACTTTA AAGCCTTTCT TCTCACAATA 2280 


ACGTTGATAC ATTCTAAATA GCATATTAGC CCAATCTTGA GACTCCGTGC CACCTGCACC 2340 

AGGATGTAAC TCTAGAATTG CGTTATTGGC ATCGTGAGGC CCATCTAATA ATAATTGCAA 2400 

TTCGTATTCA TCCACTTTAG CCTTAAAATT AATGACCTCT TGCTCTAAGT CTTCTTTCAT 2460

TTCTTCATCA AATTCTTCTT GTAATAAATC CCAAGTAGCA TCCATGTCAT CTACTTCTGC 2520 




TTGCGCTTTC GTTTGGTTAT CCCAAAAATT AGGTTCTGCC ATCATTTCTT CATATTCTTG 2640

AATATTAGTT TCTTTGTTCT CTAAGTCAAA GAGACCCCCT AATTTGTGTT AAATCTTGAT 2700

TATACTTATC TATATTTCGT TTGATTTCTG ATAATTCCAT AGcATTCGCT CCTATTTATA 2760

TTTCAATTCA AGTCATTGAT TTGCATCTTT TATAATGCTA AATTTTAACA TAATTTTGTT 2820 

AAATAACAAT GTTAAGAAAT ATAAGCACAC TGACAATTAG TTTATGCATT TATTGTTAAA 2880 

AATCAG1ACA TTTATCATCG ACATATGCCT AAACCGATTT TTTAAAACTA AGTACATAAC 2940

AACGTTTAAC AACTTCTTCA CATTTTTTAA AGTATTTAAC GCTTGTAAAA TAAAAAGACT 3000 

CCTCCCATAA CACAAACTAT AGGTGTTTAA TTGGAAGGAG TTATTTTATA TCATTTATTT 3060 

TCCATGGCAA TTTTTGAATT TTTTACCACT ACCACATGGA CAATCATCGT TACGACCAAC 3120 

TTGATCGCCT TTAACGATTG GTTTCGGTTT CACTTTTTCT TTACCATCTT CAGCTGAAAC 3180 

GTGcTTCGCT TCACCAAACT CTGTTGTTTT TTCACGTTCA ATATTATCTT CAACTTGTAC 3240 

TACAGATTTT AAAATGAATT TACAAGTATC TTCTTCAATA TTTTGCATCA TGATATCAAA 3300 

TAATTCATGA CCTTCATTTT GATAGTCACG TAATGGATTT TGTTGTGCAT AAGAACGTAA 3360 

GTGAATACCT TGACGTAATT GATCCATTGT GTCGATATGA TCAGTCCAAT GGCTATCAAT 3420

AGAACGAaGT AAAATCATAC GCTCAAACTC ATTCATTTGT TCTTCTAAGA TATCTTTTTG 3480

ACTTTGATAT GCTGCTTCAA TCTTAGCCCA AACGACTTCG AAAATATCTT CAGCATCTTT 3540

ACCTTTGATA TCATCCTCTG TAATGTCACC TTCTTGTAAG AAGATGTCAT TAATGTAGTC 3600

GATGAATGGT TGATATTCAG GCTCGTCATC TGCTGTATTA ATATAGTAAT TGATACTACG 3660

TTGTAACGTT GAACGTAGCA TTGCATCTAC AACTTGAGAG CTGTCTTCTT CATCAATAAT 3720

ACTATTTCTT TCGTTATAGA TAATTTCACG TTGTTTACGT AATACTTCAT CGTATTCTAA 3780

GATA£GTTTA CGCGCGTCGA AGTTATTACC TTCTACACGT TTTTGTGCTG ATTCTACAGC 3840

TCTTGATACC ATTTTTGATT CAATTGGTGT AGAGTCATCT AAACCTAGTC GGCTCATCAT 3900

TTTCTGTAAA CGTTCAGAAC CAAAACGAAT CATTAATTCA TCTTGTAATG ATAAATAGAA 3960

GCGACTATCC CCTTTA7CAC CTTGACGTCC AGAACGACCA CGTAACTGGT CATCAATACG 4020

ACGAGATTCA TGTCGCTCTG TACCTATTAC TGCTAAACCG CCTAATTCCT CTACGCCTTC 4080

ACCTAATTTG ATATCTGTAC CACGACCAGC CATGTTAGTG GCAATAGTAA CGGCACCTTT 4140 

TTGTCCAGCG CCTGCAACAA TTTCAGCTTC ACGTTCATGA TTTTTCGCAT TTAACACATC 4200 

ATGACGGATA CCACGTTTTT TAAGTAAATT TGAAATATAT TCAGAAGTCT CAACTGCAAC 4260 

AGTACCTAAT AGCACTGGTT GCCCTGCCTT GTGTTTTTCA ACAACATCTT CTACTACTGC 4320 



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5 






TTTATTTGTC GGAATTTGAG TTACTGTCAT GTTATAAATA TTTCTAAATT CTTCTTCTTC 4440

AGTTTTAGCT GTACCTGTCA TACCCGCAAG TTTATTGTAC ATTCTGAAAT AGTTTTGGAA 4500

TGTAATAGAC GCCATAGTTT TAGATTCATT TTGAATTTGA ACGCCTTCCT TCGCTTCAAT 4560

AGCTTGGTGT AAACCTTCCG AGAAACGACG GCCTGGCATT GTACGTCCTG TAAATTGATC 4620

GACAATTAAT ACTTCGCCAT CAACAACCAT ATAGTCTACG TCACGTTGTA ATGTAACGTG 4680

CGCACGTAAA GCTGTGTTGA TATGACTAAT AACATCAACA TTTTGTACAT CATATAAGTT 4740

TTCAACTTTG AACATACGTT CAGCTTTATC CGCACCTTGT TCTGTTAAAT GTACAGCTTT 4800

CGTTTTTTCA TCGTATTTAT AATCTTCGTC CTGTTTTAAC ATTTTCGCAA AAACATTTGC 4860

TTGTGTATAA AGTGACGTTG ACTTTTCAGC TTCACCAGAA ATAATTAATG GCGTACGTGC 4920

CTCGTCGATT AAAATTGAGT CAACCTCATC AATGATTGCA AAATGTAATG GACGCATTAC 4980

TCTATCTTCA GAATAATTCA CCATGTTATC TCGTAAGTAA TCAAAACCTA GCTCATTATT 5040

AGTACTGTAA GTAATGTCTT GTGCGTATGC TTCACGTTTT TCTTCTGTCG TCTTACTGTT 5100

TAAGTTTAAT CCGACAGTCA AACCTAAGAA GTTATATAAC TCAGCCATTT CTTCACTTTG 5160

AACACTTGAT AAGTATTCAT TGACTGTAAT AACGTGAACA CCTCTACCAG CTAATGCATT 5220

TAAGTATGTT GGCATTGTCG CTGTTAATGT TTTACCTTCA CCTGTTCTCA TCTCAGCGAT 5280

ATCACCTTTA TGAATTGCAA TACCACCCAT AATTTGAACT TTATATGGTG TCATATTGAA 5340

TACACGTTTA GAGCCTTCTC TAACAAGTGC ATATGCTTCT GGTAAAATTT TATCTAAATA 5400

ATCATTTTGC TTTTTGACAT TATCAATGTC AGCTAATTCT GTTTGGAATT GTTTCGTTTT 5460

ATTACGAATT TCTTCATCAG TTAAAATTGC CGTTTTTTCT TCTAAAGCGA TTACTTTATC 5520

AGCAAGTTTA CCTAACTGTT TAATTTCTTT ATTATTGCCA TCAAGAATTT TTGATAAAAA 5580

TCCG^TTTCG TTCGCTCCTT TAGCTAAAAA ACTGTTTGGC CTACAACAAT ATATCTTATC 5640

ATTTATAGTT AGAAAATTAT ACTTATTTAC TCATTTGTAG AATCAATATA AATATATTTA 5700

TGACATACTT CATTCACATT CTGTTGTCAA CAAGTTTATC ACTAATAAAT ATATTCTCAA 5760

TACGCAATTA TACTTCCTAA TAAATTATAT TATAAATATT TTACGATTTT CGACTCGGAC 5820

TATACAATAG ACTGACATAC TATTATTAAC TTAACATTCA AATATATACA TCCATTAACA 5880

TTAGCATAGT CACTATGTTT CATTCAACAA ATTACATTAT CGAACTATGA AATAGTCATA 5940

ATTTGCTTTT GGAGTATAAA AAAGCACTTG TGCAAAAACA CAAGTGCTTT AAACTTAATT 6000

TATTGTTCAC TAGTTTGAAT CAAGCCATAT TTACCGTCTT TACGGCGGTA AACGATACTT 6060

GTTCCATCAG TTTCTCTGTC TGTGAATACA AAGAAGTCAT GACCTAATAG ATTCATTTGT 6120
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ATCTCGTTAT CATCGTAAGC GTCATTATCA ACTTGTGTTT CTTGCATTTC TTGTAATTCG 
GCAACAAACA CTTCTTGATC TCCTCGATCA CGGCTCTTAC GATTAATACG TGTTTTATAT 6300
TTTCGAACTT GTCTTTCAAG TTTATTATTA ATTAAATcAA TACCTGCGTA TAAATCATCG 6360
TTTCGCTCTT CAGCTCTTAA CGTAACATTT TTCAATGGAA TTGTTACTTC AATTTTAGTA 6420
GCTGAATTTG AATAAGTTTT AACTTTAACA TGCGCCACTG CATTTGGTAC GTCATTAAAA 6480
TAACGTTCCA ACTTACCAAT TTTTTCCkCA ATATAGTTGC GAATAGCATG TGTGATAGTG 6540
AGGTTATCTC CATGAATTTC AAATCTAATC ATAGTAAATC TCTCCTTAAA CCTCTTTATn 6600
GGnAACTCnT TATTATATTT AACATTTTTA CGCCAATCGT GCAAA 6645
(2) INFORMATION FOR SEQ ID NO: 260: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7430 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 260: 
CAGTTCCAGC ACAtCTATTG GGGATCAACA AACTAGGGAA AATGCTAATT ATCAACGTGA 60
AAACGGTGTT GACGAACAGC AACATACTGA AAATTTAACT AAGAACTTGC ATAATGATAA 120
AACAATATCA GAAGAAAATC ATCGTAAAAC AGATGATTTG AATAAAGATC AACTAAAGGA 180
TGATAAAAAA TCATCGCTTA ATAATAAAAA TATTCAACGT GATACAACAA AAAATAACAA 240
TGCTAATCCT AGCGATGTAA ATCAAGGGTT AGAACAGGCT ATTAATGATG GTAAACAAAG 300
TAAAGTGGCG TCACAGCAAC AGTCAAAAGA GGCAGATAAT AGTCAAGATT CAAACGCTAA 360
TAAC&ATCTA CCTTCACAAA GTCGAATAAA GGAAGCACCA TCATTAAATA AGTTAGATCA 420
AACAAGTCAA CGAGAAATTG TTAATGAGAC AGAAATAGAG AAAGTACAAC CACAACAAAA 480
TAATCAAGCG AATGATAAAA TTACTAACTA CAATTTTAAC AATGAACAAG AAGTGAAaCC 540
TCAAAAAGAC GAAAAAACAC TATCAGTTTC AGATTTAAAA AACAATCAAA AATCACCAGT 600
AGAACCAACA AAGGACAATG ACAAGAAAAA TGGATTAAAT TTATTAAAAA GTAGTGCAGT 660
AGCAACGTTA CCAAACAAAG GGACAAAGGA ACTTACTGCA AAAGCGAAAG ATGATCAAAC 720
GAATAAAGTT GCCAAACAAG GGCAGTATAA AAATCAGGAT CCTATCGTTT TAGTGCATGG 780
TTTCAATGGG TTTACAGATG ATATTAATCC TTCAGTGTTA GCTCATTATT GGGGCGGTAA 840
TAAAATGAAC ATTCGCCAAG ATTTAGAAGA AAATGGTTAC AAAGCTTATG AAGCAAGTAT 900
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TCGTGTAGAT TATGGTGCAG CACATGCAGC AAAATATGGA CATGAACGTT ATGGAAAAAC 1020 


ATACGAaGGA ATTTACAAAG ACTGGAAACC AGGACAGAAG GTACACCTAG TTGGACATAG 1080 

TATGGGCGGT CAAACGATAC GTCAACTAGA AGAATTACTG CGTAATGGTA ATCGTGAAGA 1140

AATAGAGTAT CAAAAGAAAC ATGGTGGcGA AATTTCTCCA CTATTCAAAG GTAATCATGA 1200

CAATATGATT TCATCAATTA CTACTTTAGG AACACCACAT AATGGTACAC ACGCATCAGA 1260

TTTAGCTGGT AATGAAGCTT TAGTGAGACA AATCGTATTT GATATCGGTA AAATGTTTGG 1320

TAATAAAAAT TCAAGAGTAG ACTTCGGGTT GGCTCAATGG GGTCTAAAAC AGAAGCCAAA 1380

TGAATCATAT ATTGATTATG TCAAACGCGT TAAACAATCT AATTTATGGA AATCAAAAGA 1440 


TAATGGATTT TACGATCTGA CGCGTGAGGG TGCAnCAGAT TTAAATOGTA AAACGTCGTT 1500 

GAACCCTAAC ATTGTGTATA AAACATACAC TGGTGAAGCA ACGCACAAAG CATTAAATAG 1560 

CGATAGACAA AAAGCAGACT TAAATATGTT TTTCCCATTT GTGATTACTG GTAACTTAAT 1620 


CGGTAAAGCT ACTGAAAAAG AATGGCGAGA AAACGATGGT TTAGTATCCG TTATTTCTTC 1680 

TCAACATCCA TTTAATCAAG CTTATACAAA AGCGACAGAT AAAATTCAAA AAGGCATTTG 1740

GCAAGTGACG CCTACAAAAC ATGATTGGGA TCATGTTGAC TTTGTAGGAC AAGACAGTTC 1800

TGATACAGTG CGCACAAGAG AAGAATTACA AGATTTTTGG CATCATTTAG CAGACGATTT 1860

AGTGAAAACT GAAAAGCTGA CTGATACTAA GCAAGCATAA TTTATAAAGT AAAGGGAGGA 1920

ATTAATAATG ACTGCAGACT TCTTTCAATT AATCGGATCA TTATTTAGAA TTCTAAAAGA 1980

ATTATTCAAG TAAAACATTG GCGAGGCCCC AACATAAAGA ATTTCGAAAA GAAATTCTAC 2040 

AAACAATGCA AGTTGGCGGG GCCCCAACAA AGAAGCTGGC GGAAAGTCAG CTTACAATAA 2100 

TGTGCAAGTT GGCGGGGCCC CAACATAGAA GCTGGCGGAA AGTCAGGTTA CAATAATGTG 2160

CAAGJTGGGG TGGGACGACG AAATAAATTT TGCGAAAATA TCATTTCTGT CCCACTCCCA 2220 

TTGGCATTTA CGAAGTTTAA ATGTGCAATT AGAATATATG TATAACAATA TTAAACACGC 2280 


GGTAAAACGA AGTCAGTCAA TTCAAACTGA TTTCGCCCAC CGCGTGTTTT TAACATAGCT 2340 

TAATAATTAA TAAGCATTAA TGTTCAATTT ATATGGTTGT TTTCCAATAA TAAACCTAAA 2400 

GATATAGAAT TCACGCAATA TCATGCCGAC ACCTATACAT AATCCTAAAA TGAATAGTAG 2460

TGATATCGCT AGAAAGACCA TTGTATTATC CTCAAATATA TTTGTATATG CAAACAATGA 2520

GTCTAGAATG ATTGGATGTA ATAAATAAAT AAAGAATGAG AAAGCACTAA TCATTTGAAT 2580

CGTATTAAAT AACATTGTTT TAAAATGCGT GCAAATACCC AAGATAACAA TAAACATAAT 2640

ACTATTATAT GGTGTTAATG AATATGAAAA GCTGGTAACG TTCCAATAGT CTCCaTTTGT 2700

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TCTAAGAAAT TTAATACACG TTCGTAGTTA TAACCCATAT ATGCACCTAA GAAGAAATAA 2820 

AAAATCCATC CGAATATTAT AGTATTTTCA CTTAATGGAT AATAGTGTAG CACGGTATCG 2880 

TGAAACGCTG TGTTGTTCGT AAAGTAATAT AAAAATGATT GCTGTAAAAT AAAAGATAAC 2940

AATAATAATA TTTTACTGTT GAATAGGTTA TAGTTAATTT TAAAAATGAT ATAACTCAAA 3000

ATAAAGAATT GCATGATAAC AACGATAAAA TAGCCATACC ATTGACCTAA TAGGACATTT 3060 


TCAATGAATT GTTTATTGAA ACTTGAATCT GTTAATAATG ATTCACTATA ACTGTAAAAC 3120 

AATCCCATTA ATATGTAAGG AATAAGTATA TATTTTACGC GTGTAGTTAA GTATCTATAG 3180 
GTOACTTTTT GGTAATTCAA GGTTGTCAGT AACTGTGACA AGATAATAAA GCAAGGTGTA 3240

CCAAAAATCA CAATATTACG AATGTAAAAT TGTAACACTA AGGATCCACC CTCCATATTT 3300 

TCATGTTTTA AAGTAATTTG TGTAAGTAAA TGTGTGATAA TAATAATTGC ACATATAATA 3360 

GCACGTAAAT ATACGAGTTC AAGTCTAATC TTTTTCATGG AATCCGTCCC ATCTCTTAAT 3420 


TAAATGCTCA AAAGCATCAT CACTAATTAA TATTCTAGGG ATGTAATAAT CATTGGAGTT 3480 
CGGAGTGACT GCTTTTTCCT CTAATGAAAA ACCGTATTTT AACCCAGCTT TTTTGATTAC 3540

CGGTAATTTA TCGTCATTCA TCAAGCCATA AGGATAGGCT ATAGTTTTCT GCGACTTTTT 3600

AAAGTTTTTA GTTAGATATT TTTCACTTTT GTTTAAATCT TTTATGATTG TAGCTTCAGA 3660

AGCTTTCATT AATTTTGACT TATTATTTTT AGATAAGTTA TGCAAATCGT GGGTATGTGT 3720

TTCAAA7TCC CATAACCCAG TTTTATACAT TTCTTTTAGT TCTTTTTTAC TAATCATATC 3780

GAGGTTGTGA AAGTTTTCTT CCCCAACATG ACCTGTGATA ATAAACCCAG TTGCCGGTAT 3840 

TTTATATTTT TTTAAGATTG GATAAGCATT TTCATAAATA GTTTCATCCA TATCATCAAA 3900 

GTTAATCCAT ACACTTCGTT TTGGaAACTT ACCTTTTTTC TTGtAATATA AAAATTCTTT 3960

CAAC^aTTAAA AATTTAGCAT CATGTGATTT TAGCCATTTT ATTTGAGATT CAAATTGTGA 4020 

TTGACTAACA CTATAATTTT TAATTTCTTT ACTACTAGAA AAGAAGTAAA TAAAATTATT 4080 

CAGAAAATTC GCTTTTCTTA CACGGTGATA ATTTAATGCC AGAGCACTAT TTTCTTTATA 4140

TTTCAGTTTT TTAGGTGAAT CGTCATCTGC ATTTGCAATA TGATGACCAT CCAGTGTGCT 4200 

TACAGGCAAT ATGATCAAGA TACTCAACAC TAAAATTATA AATTTTCTAT ACTTCACGAT 4260 


TCTCTTCCTC TCTGCCATTT TTGAATCAAT ATGCTAATTG TAAAAAATAC AAAAATGATA 4320 

ATCGCGAAAA TGCCCATAGT TTCAAATATA TCTAAAATTT CAGTATTTTC AATGTTTAAA 4380 

GCAACACGTA TTGTATTGAT ACTTTCGTCA TGAATTTCAA ATATAGTACC AATATAAACG 4440

AGTAGAACAA CTAAACAATA TATCCAAAAG ACACAAGATA TAGCGATAAG TGCTGTTTCT 4500 
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TTGCGTAACC ACCTTTCTTA CGTTTTAATG CTTTTGGAAA TGCGACAAGA ACTACTGCTG 4620 

CGTTAATAAT CCAGTATACT GTCGGATACC AACTTACAAA TATGAGTCCA GCCATATTCT 4680

TTTTCTCGTA GCGACTATCA ATAAAGAGTG CGACTGTAAA TTGAATAACG TTTATAAAAG 4740

TCATAGTAAA TGATGATAGT AGAAATATTG AAAAACTATA TGTCATAAAT GTATAGTCTA 4800

AGAAGTTTGC TGTTATGAAC AAATAGCCTA AATATAGAAG CACTATATAT ACCCATAAAA 4860

TCGAGATGAT TTGCTCAAAC ATCAAAATAT ATAAAGGAAA CCTTTTCGTT TTCATTGTGC 4920

TAAAAAAGTC TCGTAGTAAT ACTTCGTGTC CCCCTTGAGC CCATCTCACG CGTTGCTTCC 4980

AAAGACCTCC CAATGTTTCT GGAACCAACA TCCAACACAT GGCAAGCGGT TCATACTTAA 5040


TACGATATCC ACGTAAATGC AATTTCCAAG AAACTGCAAT ATCTTCGGTA ATCATATCAG 5100 

TATCCCAGTA GCCAACGTCG ACAACTGCAC TTTTTTTAAA TAGAGTGAAG ACACCCGAAA 5160 

TAGTATTGAc TGCGCCAGCA AGTGTCTGAC TtCGCTTAAT ACAGCCAATT AAACTTGCAT 5220 

ATTCTATCGT TTGAATTTTA CCTAAAATAG AACTCTTATT TCGAATTCTA GGATTACCTG 5280 

TAACTGCACC AAGTTTTGGA TCATGTTTGA AATTCTCAAT CATATAATAT GGTGCATCTT 5340

GATCAACGAT AGTATCTGCA TCCAAGCACA TTACATAATC ATATGAAGCC TGTTTAATGC 5400

CTTGATTGAG TGCGTTGGCT TTACCTCTGT TTTCTTGTAA ATCGACGAAA ATAAAGTCAT 5460

TATTTTCTTT GATTTTATAG ATGAGTTCTG CTGTATTATC TGAACTTCCA TCATTAATGA 5520

TAATAATTTC TTTCTTCTCG TATTTGAGTG CAAGAACATT AGACAACGTA TCTTCAAyCG 5580

TTTCACTTTC GTTATAACAG GCAAGTAAAA ATGTAATGCC TTCTAATTCA TCCACATTTA 5640

TGTCAGGCTT CTTGTTCAAT GAATATCTAA TTTCTCTGGT AAAATAGAAA TAAATTGAAC 5700

CGACAATCCA GTAAATAGAC ATAAATACAG GATAAAAAAG CAAAAAGTTA AAAAATTGCA 5760

ATTfcTTTAC CTACCTTTCG TTAGTTAGGT TGTAAGCCAT ATGGTAATTG ATAGTATTTT 5820 

AATTTGCAAT AGATTGTTGT TATAATTAAA CGGAAATATT TGTAATTGCA ACTTAATTTT 5880 

CCTGTAACAT AGTGTGATTA ATTTTCAGTA GGGGGTTATA AAAATTGAAG GATAAGATTA 5940 

TTGATAACGC AATAACCTTA TTTTCAGAGA AGGGGTATGA CGGTACAACA CTTGATGATA 6000 

TAGCTAAAAG TGTAAATATA AAGAAAGCGA GTTTATATTA CCATTTTGAC TCGAAAAAAA 6060 

GTATTTACGA ACAAAGTGTT AAATGTTGTT TTGATTACCT TAATAATATT ATTATGATGA 6120 

ATCAAAATAA ATCGAACTAT TCAATTGATG CTTTATATCA ATTCTTATTT GAGTTTATTT 6180 

TCGACATCGA AGAAAGGTAT ATTAGAATGT ACGTTCAATT ATCTAATACG CCTGAGGAAT 6240 

TTTCTGGAAA TATTTACGGA CAAATACAAG ATTTAAATCA ATCATTAAGT AAAGAGATAG 6300 
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TGCTGTTTCT TGAAAGTTGG TATTTGAAAG CATCCTTTTC GCAAAAATTT GGAGCAGTGG 6420 

AAGAAAGTAA AAGTCAATTC AAAGATGAAG TGTATTCGCT ACTAAATATA TTTTTGAAGA 6480

AATAATTTTT GTTACTAGTT TGTAATAATT AACTTACTTT TGTAAAAAAA GACATGAGAT 6540 

TATTTTTTTA AATCTATATA AAGTTGACAA TACAAATCGA TATTGAGAAT ATTAAGATGT 6600 

ATATGAATTT TATAAATTAA ATGCAATACA TTAATATAAA TATCAATTGT TGCAAAATAC 6660 

GATTTGTTCA ATGATTTGAT AATATTATTC TTTATATTTG TGAATGGTTA AGTTTGTCTT 6720 

TGAACATATT ATAAAAGTGT AATGTTCCCC TGAAAAGAAT AAGTTGTCAT CTAATTACAG 6780

GAAATCCGCA TAAATTAGAT GAAATGGAAA GTAATAAGTA ATAATTTATT GATAAGCGCC 6840

TATGTGATGG TAAATCATGA CATAGGCGCT TTTTTTTATA AGTTAAAAAT GTAAATAAAA 6900 

ATTATATAAA TTACCCACAT CTTTTTAAAA GGTGTGGGCT TTATTATCAT TAACCCAACT 6960 

CACAGTGACG GGTTACGCAA GGTATTGAAT TACCGAGTAC GGGCACGCTC GGTGTTGTAA 7020

AGAGCAAATA ATCAAGTAAT GATGATGCTT CTACTCGATT ATAAGAAAGC CATGATAGAG 7080 

TACGATGGTA TCTAGTTTTA TTATTAATAG GTTTGGATAT TTAAAGTTGG ACAATATTAT 7140 

ATCTTGTGCA AAAATATAAA TAAGTTATAC ATAATGGTAG AGAATCATGA TATAATTTTA 7200 

AACGATAAAA TATTTATATA AATAATTAGA GaAAATGTAG TTGTGTATGT yTTGTGGtCG 7260 

TTAAACTAGA TATAATTGTC CGATTTATAA AACATACATA ATGAATACaA TGATTGATTA 7320 

TGTGGAGGAA ACCATGAmAG AmAAGTTTgA TTTAGTAAAA CTATTAAATA TTCTAAAGAA 7380

GAATATTAAA TTATTGCTTA TTTTACCGGC AATATGTCTT GTAGTAAGTG 7430 

(2) INFORMATION FOR SEQ ID NO: 261:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4082 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 261:

ATTGTTACTC ATTATTTTTT CATATTCACA CAAATGATCT TGTTTATATT TAGCTAATTG 60
ATTTTTATCT AGCATTTTAT CCTCCTGCTG AGTTTGTAAC CTTTAATAAT TTATTTTCTA 120 
TAAAAACTTA GTATTCCAGT TGCTTATTAT ATCATTGATG AAAGGCTGAA ATAAAACATA 180 

AACTGTTCCG ACCATTAACG CTGTAGCTAA AGATAAGTCT ACAAGTCCAC CTGTTTTAAA 240 
TTGAATCGGT GTCTTCACAT TAAACGGTAA gGAknAAAAT AATTTCACGC CTTTTGGTGT 300 
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AGGCGTCTGA ATAATTTGCA ATAAAAATGC TATGATTGCG ATAAATAATA TTGAATGCGT 420 

AAAGGTTCGA TGTCCAAAGA TCAATCTCAC AAAAAAACTA ATTACCTTAA ACCTTCTGCC 480 

AATCTTACTT TGAGTGTGAC ATATATCGGG TAATAAGCTA GCTAGAGTTG CTAGAATGAT 540 

AACCGTAACC GACGAAAAAA TATCCGTTTG AAAATATTGT GTTGTTAGCG CTCCAACGAG 600 

CATGCCGCAT GAAGCATGTG TTTTACCTGT CATATTTGTT CTCCTTTAAT ACTCACATTT 660 

TACCACATCC CTAACAAAAA CACGAACATA TTTTCGGGTT AAAATTCATT AGTATGACAC 720 

AATTTAAAAA AGTATCACAT AACTCTTGAA AACGATTACA AAATCGTTTA TGATGTATTT 780 

ACAAAATATT TAAAGGATGT GTTTGAATAA TGGCAATGAC AGTAAAAAAG GATAATAATG 840 

AAGTGCGTAT TCAATGGAGA GTTGCTGATA TCAAAATTCC TACAAGTGAA ATTAAAAATA 900 

TTACACAAGA CCAAGATATT CATGCAGTTC CTAAATTAGA CAGCAAAGAT GTATCTAGAA 960 

TCGGCTCAAC GTTTGGTAAA ACGAATCGCG TTATTATCGA TACTGAAGAC CACGAATACA 1020

TTATTTATAC TCAAAATGAT CAAAAGGTTT ACAATGAATT AACTAAATAA ATTGTATAAA 1080 

AaAATCATTC ATGGTGAGGG CTTCATGAAT GATTTTTTTA ATTGATTCAA CACCCAGCAT 1140 

AAACAAATAC AAAAGGACAA CTGTTCCCAT AATTTTAACA GTTGTCCTTT TTCACATATA 1200 

TTTATAACAA AAGATGTGCC ATCAAAGAAA TAATTGGTAG TGTAATGATT GTTCTAATCA 1260

AGAAAATCAT AAACAATTTG CCGATGCTTA CAGGAATCTT CGAACCAAGT ATGACGCCAC 1320 

CTACTTCAGA CAAGTATATT AACTGCGATA TACTAAGTGC CCCAATAACA AAACGAGTTA 1380 

TATCATTTTG TACACCTTCA ATTAATATAG AAGGTAAAAA CATATCGGCA AAACCGATAA 1440 

TAATCGTTTG AGAAGCCTGT GCCGCTTCAG GTATTTGCAT TAACTCTAAA AATGGAACAA 1500 

AAGGTTTACC CAATATGACA AAAAAGGGCG TGTAGTTCGC AATAATGGTA GCAATAGTAC 1560 

CAATACTCAT TACTACAGGC AAAATAACAA ACCACATATC AATGACTGTT TTTAATCCTG 1620 

ACTTAAAAAA GTCAATAACG CCCGGTGCTT TAATACCTAC TTCTGTTGCA GTATCAAAGC 1680

CATGTCTCAA TGCCGTCTTT CCTTCTGGCA ATGCCTCAGT ACGCGCACTT TCAGGTACCT 1740 

CCTTAGCATA CTCATCAGGA ATTTTATTTA AAGGCCAAAT TCTTGGCATA ATGACTGCTG 1800 

CAACGAGGCA GGATACTATC ACTGATAAAT AGAAAGCAAA AAATTGATTT TGCATGTGCA 1860 

CTGTTTCAGC AACTACAATT GCAAAGGTGA TAGAAACTAC ACTAAATGTC GTTGAAATAA 1920 

CTGTTGCCTC ACGACGAGAA TAATATCCTT CACCATATTG TCTACTTGTA ATTAAGACAC 1980

CAACAGTTCC GTCTCCAATA AATGATGCTA AATTATCTAC CGTCGAACGT CCTGGCAATG 2040 

TAAATAAAGG TCTCATAACC GGTCTAAAAA TAGGACCCAA CATCTCTAAC AAACCGTATT 2100
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AACTTGAGAA 


CAATAATCCA CCCGTTTCAT CTGAGTAAAT AACCTTTGAA CCAATTCGTA 2220

AAAATGTCAT CCATGCAAAA ACAACTGCTA ATATTCGTAA AATTAACCAA CCAATTCTAA 2280

CGTTAAAAGC ATTGTTCATT AGCCCGTCAG GTTTCAATTT ATCTTTTAAA ATAGTTGAAC 2340

AAATCAGAGT TATGATACCC GATAAAGTAA TTATCGTCAC AATTAAAAAT GGCATTACGC 2400

CACCTAATAC ATCTTTAAGC ACGCCTGCTA AAAATGCCAC GGGCAACGTT GTTTGCTTCT 2460

GTCCATCTTG TTCGACTGGA ATTGGTACTA AAAATAATAA GATACCAATT AAAGACATCG 2520

TAATAAACTT AAGTCTCCCA ATAACTATCT CTTTCCTTGA AAAGCTATCC ATAAAATCAA 2580

TCCATTTCTC TATGTATTCG TTTTAAGTAT ATACAGAATT CTATTCAGTT AACAAACATA 2640

TTCCTTATCA TTCTATCTTT CAAAATGTTT ATGTATGCAA AATAATGAAT AATTAGAGTT 2700

ATTAAATATA CGCTATTTCT TGTAATTTTT CAAGATGAAT TCAAAAAAGG TTAAGTACAA 2760

TTACTGATTT CGTACTTAAC CTTTTTTAAA CTCTAATCAT ATGTTAGTTA TTTCATTCTT 2820

CGTAATAATA TTAAGAAGTA TGGTGCACCG ATAATTGCAA TGATAACCCC AACAGGAATA 2880

TCCAGTGGCG GATGAATGCC ACGGGCTAAA CCATCTCCAA ATGTTAACAA TATAGCACCA 2940

ATTAACCCCG ACATGATAAT AACGTGTAAT GTTTTATTTC CTATTAATTG TCTCGCAATA 3000

TGAGGTGCAA TTAATCCTAA AAAGCTAATA CCACCGACAA CTGAAATTGC GGATCCTGCT 3060

AATATTACTG CTAAAATTAA CAATAGCATT TTAATAGTTT TAACTTTTAA ACCGAGTGCG 3120

GTTGCAACAG CATCACCTAG ATTCAATACA TCTAATTGAT AACTCAATAA AATGATGATT 3180

GGTATCGTTA TTAAAAACCA AGGTAATATA GTATAAATAT TCGACATATC ATGTCCATAT 3240

AGACTACCTG TCAACCAAAC AAGCGCTTTG TTTGCTTCCA GTGGATTTCT GATTAATAAG 3300

AACTGCACAA TCGCCGTACA TATTGCGCCT ATTGCTAAAC CAATTAAGGC AAGCTTTGAA 3360

CCTTlAACAT CATATTTTGA AATTAAAAAT GATAAAAATA AACTTACTGC AAAGGCACCT 3420

AAGAATGAAC CTATAGGTAA TACAAACAAT GGTGCTGTTG GAAAGGTCAT AATAATAATC 3480

ACAGCAGCTA AACTGGCACC TTTAGAAATA CCTATAACAT CAGGTGAGGC TAACGGGTTT 3540

CTTATTACAG CTTGTATAAT TGCACCTGAA ATAGCCAAGC TACTACCGAT AATAATACCA 3600

AGTAATGTTC TAGGTATACG ATACTCATTT AAAATAAAAT CATCTTGTGT AAAGATTCCC 3660

TTAATAGCAT CAATCGGATG AATCATGACA GACCCTACAC ATAAACTTAT GAATATACTC 3720

ACAATTAAAA GGATTGTGAT TAAACTATAA CGACGTATAA TTTTCGTTGT CATCATATTC 3780

TTTTCACCCC TTTAATCGTT ATAAATAAGA AGTAAAGTGC ACCTACGAAT GATGTAACAA 3840

TCCCTACTGG TGATTCATAA GGATATGTAA TTAAACGACT TAATACATCT GATAGTAGTA 3900
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AGCGTTTGAC TATATGCGGT ACGATTAAGC CAACAAATCC AATTGGTCCT GCCACTGACA 4020 

CCGACATACC TGTAAGAATA ATGACTAATA GTCCAATGAT AATTCTAACT TTATTTATAT 4080

TT 4082 
(2) INFORMATION FOR SEQ ID NO: 262: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 262: 

TATTAGAAGG TCGTTCGGAT GAACAATTAA AAAATTTAGT TAGCGAAGTA ACTGACGCCG 60 

TAGAAAAAAC AACGGGGGCA AATAGACAAG CAATTCACGT TGTTATAGAA GAAATGAAAC 120

CAAACCATTA TGGTGTGGCT GGCGTAAGAA AGTCAGATCA ATAATTCTTC ATAAGATGCA 180 

TGCCAATTAA TTCTTTGAAA ACGAACAAGG CGACTTCTAT CTGaGTATGA TAGAAATCGC 240 

CTTGTTTATT TTTAATCTTC ATCTAAAAAG TCTTTAATAG CTTGTTTATT TGTGTTTTTA 300 

TTAATCTGTA ATGCACTACC ATCAGTATTT GTATTGACAT CyTCGTATGA GTTCTTGATT 360 

GGCaCAGTCA ATGACTTAAC ATCTTTTTCA CCTCGGATAC CaAAACTCAA ACCTGTTTGG 420 

AAAATCCCTG AATCAGGAAT GTTTGTATTC ACATAGCCTC TTAAAATACC TGCAACTTTT 480 

GGTAATTTAA CAACTGTTCT AAAATTAACC ATTTCTTTTT TCAATGTTTG CATCACTTGT 540 

TGCTGACGTC GCACGCGTCC GAAGTCACCT TCAGGGTCGT GACGGAATCT TGCATAACCA 600

AGTAATTCTT TACCATTCAA CCTATGGTTA CCCTTTTTCA AAGATACACC AATATTTTTC 660 

GACATATCTT TTTCGACATT AATTGGTACA CCTTCAGGCA TTAATTCATC AATCATTTTC 720 

TCAAATCCAG TAAAATCAAC TACTGCATAA TATTCAGGAT TAATTCCTAA ATTTTTATCA 780

AGTGTTTTTC TAAGTAGCTC TGGACCACCT AAAGCGTATG CTGAATTAAT TTTGTGTTTT 840 

CCATATCCTG GAATATCTGC ATAAATATCA CGCATGACAG ACATCATTTT CATCTTTTTA 900 

TTGATAAAGT CATATTGAAC AACCATGATA GAATCTGTTC TTGATTGTCC ACCTTGTGCT 960 

TTATCTGCAC CGAGTACAAG AATAGAAATT nTACCATCAT TTTTTACTGG TCCATTAAAT 1020 

TGATGTACTT TAACATCTTT CGCATGTTTC TTGGCATATT CTACACCGCT ATTGTAACTA 1080 

TGTACAATAT ATACAACTAA TGCCGATAAG TAAAATTACA ACAATCAGAA GAATGATAGG 1140 

TAATT 1145
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7075 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 263: 

TATGGCTCAT CATTAATGCA CGTATCGGGT AGCGTTTACC ATTGATAAGT GCTTCATGTT 60 

TAGCACGAGT TCTTAAAATT CCATCGCCAT AACCGATATC AACTACAGCT AATTTTGTAT 120 

TGTTTTTAGT CACTTCAAAG GCAAAGCTAT AACCGCAATA ATCACCAGCT TGTACTTCGC 180 

GCACTTGAAT AACATGTGCT TTTAAAGTTA ATGACTGAAC TATATCATGT TGATTCAGTG 240

AACTATATGG TCTTGAACCG TATAACGCAA TACCTACACG CGCATGTGTA TGGTGGGGTA 300 

GTAATAyyyg TCCTTCCCGA TAAAAACTCG CACTATTTTG AGCATGGATT AGGTCGAACT 360

GATAACCTTC AGATAAAAGT GCTTCAACAA TTTCCATCCA TTGTGAACGT TCAACATTAT 420 

AAtCTGACAC ATCGAATTCA TCAGCATATC CAAAATGGGk CCATAAACCA CTAATAATCA 480 

TTTTTGCATT TTGATTATGA TGGTGATCTT TCAATACTTC TTTAATTTCG TTTAAATCTT 540

TAAATCCAGA CCGATGTAAT AAATTTTCAA ATTCTAAGTG AACATGAATA CCAGCTAAAT 600 

CATTTTTATG GTTATAGTAA TATGTCAACG ACGGCAAAGT CATGTGTATT TGATGTTCAC 660 

GGACTAAATC AAACTCGTAA ACTGCATTCA TTAAAAAGAT TGTTGCATCT GGAGCAAGTT 720 

GTCTAATTTG AATTGCTTCT CGTAGTGATG TTGTGCTAAA TGTATCTATA CCTGCATGGA 780 

TAAACTGAGT TACAGCAAAT TCTAGGTCAT AGTGATATGC ATTaTTTTTA ACAACTGCCA 840 

TTAATGGCTG ATTGTTTTTG ACTGTGATTG CATTTTGTAA AAATATTTTC TTATTTACAG 900 

ACC^TGTTGC TGTCAATGTA TTACACCTCT TTGTAATTAT TTAATAAATT TTCGTAAAAA 960 

TTAACCACGT TTATTAACAC TTTTTCATCA AAATTTAAAT GTGATGTGTG CAAACCAGTT 1020

ACAAAACCTT TATCTTCATT TCGTGTTCCT ATAAAAACAA AGTAAGCTGG AGCTAGTTGT 1080 

TGACCATAAA AACTAAAATC TTCCCCAAAT AAGAATGGCG TTGGTTTGTC ATAGACATTT 1140

AAATCAGCTT TTATTAAGGC GTCCTCTATT TGAGTACGTA ATTTCGGACT ATTGATTGTA 1200

GGGGGATAAC CTTCTGCAAA TTTAACTTCA CAATCTACAT TAAACAGAAG CTTGACACTT 1260 

TCTGCTATCT TGTGCATTTG ATTTTTAACG ATTGTTAAAT CATCAATATC ATATGTACGA 1320 

ATAGTACCTT CTAAATAGCC ATTACTTGGT ACAGTGTTAA TCGCTTCACC AGCTTTAAAA 1380

TGACCAATAT GAACAATATT TCGTTTCAAA CCGTTAAGGT GAAATTGTTG AATTTGTGAT 1440 
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ACATGACTTG ACAGGCCTGT TAAGAAAAAG CGATACTCTG TTGCGCTGGC CGTAATTTCT 1560 

TCATCTCTTA TCACTGCAAT GCCTTCATCA GCAAATGGGT TAACATGAAT ACCAAATACC 1620 

GCTTCAATTG GATACTTATC AAAGGCACCG GCTTTTATTA ATCGATTTGC ACCGCCACCA 1680 

GTTTCTTCTG CAGGTTGGAA AATGAAAACG ACATTTTGCG GTAATTGACC TGCATCTTGC 1740

ATGTCTTTGC AACGTTGTAC AAAAAGCATT AATGCAGTTG TATGACCATC ATGTCCACAA 1800 

GCATGCATCA CATGATCAGA TTGACTGCGA TAAGGCACAT CATTTTCCTC TAAAATAGGT 1860 

AACGCATCAA TATCAGCTCT ATACGCTATC GTATGTGAGC CATTACCTTC TAAGTATGCA 1920 

ATGACGCCAG TTTCCAATGG GCAATCGTAT TTAATATTTA AACTATCTAA AAACGCTTTA 1980 

ATATAAGCAG TTGTTTCAAA TTCATGTAAG CTTAATTCAG GATGTTGATG TAAATGACGG 2040

CGATGTTTCG TAACAAATTC TAATTCATTC ATAATTATCA ATCCTTTGTG TTAAATTACT 2100 

ATATAAATAG TGTAACGtAT TTCGAAATTT GTGATCATAA GTTTATTCAA TGCTAAACAA 2160 

TAAGGTTGAG ACATAATCGT ATCTCAACCT TGAAATTATT ATACGTTGAC GTCACTAGTC 2220

ATTCAGTTTT CTTAATGCTG CTACAATCTC TTTTTTAGTA TCTTGTACTT CAGAAGCTTG 2280 

CTTAATCACT TTTGCAGGTG TACCAGCAAC AACTGCACCA GCTGGTACAT CTTGTGTCAC 2340 

AATCGCGCCA GCTGcAACAA TAGCAGCTTT ACCAACACGT ACACCTTCTA AAATAACTGC 2400 

ATTTGCACCG ATTAATACAT CATCCTCGAT TATAACCGGT GAAGCACTAG GGGGTTCAAT 2460 

CACACCTGCT AATACTGCGC CAGCCCcTAC ATGTACATTT TTACCAGTTG TAGCACGACC 2520 

ACCGAGAGTA GCATTCATAT CAATCATTGT ACCTTCGCCA ACGACTGCGC CAATATTAAT 2580 

TGTTGCGCCC ATCATAACGA CAGCACCATC TTCAATAATG GCTTGTTCTC TAATAAACGC 2640 

ACCTGGTTCA ATTCGTGCAT TCGTATTTGT TAAGTCTTTT AATGGAATAG CAGAATTGCG 2700 

ACGATCCATT TCAATTTCTA TATCTTCGAA TTGACTACCA TATGCTTCGT AAAAAGGTTT 2760 

CCAATCATCC GCTTCACAAA AGATTACTTT AGATTGTTCT GAACCAAATA CTTTAAAACT 2820 

TTCTGGATAT GTGATGCCTT CAAAATTACC ATTTAAATAT ACTTTTATTG GTGTAGACTT 2880 

TTTAGCATCA CTTATATATT GAATAATTTC TTCAGCTGTT AAATGTTGTA CCATAAAATA 2940

ATCGATCTCC TTTAATATGT TTATAAGTTG TCAAACGTAT AAAAGCCGTT TGGTTTATTA 3000

ACTAAGCGTT CTGCTGCTTG TATTGCACCA TTCGCAAAAA TATCTTTTGA TTGTGCACGA 3060 

TGCGTGATTT GAATCGTTTC ATCAGTGCCA GCAAATAGAA CTTCATGTTC ACCGACAATC 3120 

GTACCTCCAC GAATAGAATG TATACCAATA TCTTGTGGCT GGCGTTTTTC ATTTAATTCA 3180 

TGTCTATCAT ACACAGGTGT TACATTTTCT TTCAAAGATA CGATCACATC ATACAATTTT 3240 
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TCGAAATCAT CAAGTAGGGG AACAGCAGCT GCTAAAATTT TAGTCAATGC ATGAACGCCA 33 SO 

TAACTCATGT TCGCGCTGAA AAACACAGGC ATATTTTGAC TCAATTCATC TAACTTATTA 3420 

AGTAGTTTTT CTTTCTCGnC CAGTgTTGCC ACAACTAATG GCAAATGAAA ATCTTCATCT 3480

AATAAAGGGA AAAGCAGATT TGGATTTGAA AAATCTATTG CAACATCGGC ACCTTTAACA 3540 

TCTGCAATAT GTTGATATTG TTGATATGGC GTTGTTGCTT TCGGTGTATT TTCAATGACC 3600 

CCAACGATTT CATGTCCTTT TTCTTCTGCT AATCTAGCAA CGCGTTGATT CATTGCGCCA 3660 

TAGCCAATTA GTAATATTTT CACTCATTTT CACCCGCTTT AAATGTGTCA TATGTTTCAC 3720 

GAAGCACTTT AGTATCTGTA TCTTCTAGGC TAACCAATGG TAGACGTAAT TCATAATTTC 3780

CAAATCCTAA ATAACTTGTT AGAGCTTTAA TAGGAATTGG GTTAATATCA ACTGATAAAG 3840

CTGATAACAG TGTGCCGATT GGTTTAAATT GATCTTGAAT ATCTAATCCA CTTTGTTGAG 3900 

CATCGTATAA CGCTTGAAAT TCTTTAGGAA TGACATTGGC AATAACAGAG ATAACCCCTT 3960

GACCGCCACG TTGATAGTAT TCGACGACGT TGTCATCATT GCCACTATAT AATGCAAATG 4020 

AATTTGTATC AATGCGCTTT TTCACTTCTT CTAAATACTC AAAATCATTC GTAGCATCTT 4080 

TTAAAGCAAC TATATAAGGA TGTTGACTTA ATATTTCTAC AGTTTCTGGT TCAATTGTCA 4140

TGTTCGTTCT TGAAGGAACA TTGTACAGCA CGACTGGTAA TTTCACAGCA TCTGCAATCG 4200

CTTCAAAGTG TTTGACTAAA CCACGTTGGT TCGTTTTGTT GTAGTAGGGC GTAATTAACA 4260 

TAATTGCATC AGCCCCTAAG GCTTTAGCTT GGATTGAAGC TTGGATTGAC TTTTCAGTAT 4320 

CATTAGTGCC AGTTCCTGCT ATGACAGGAA CACGTTTATC TACAAGATCA ATAACTGTTT 4380

TTAGAATGCG TTCTTTTTCA TCTGTTGTTA AAGTAGGGCT CTCAGCAGTA GTTCCATTAA 4440

CGATGATTGC TTGGGCATTA TTTTCTAGTA AAAAATTAAC GTGTGTTTTC AAAGCTTCAA 4500

TATTAACTTT GTTATTTGTA AAAGGGGTTG TAAGTGCaAC sCCAACACCC TCAAATAAAT 4560 

GTGTCATTTT AATTCGCTCC TTTTAAACGC ATAACTTGTT CCAATACTTG TACAGCATTT 4620

AATGCAGCAC CTTTTAATAA ATTGTCTGAT GTACACCATA CATGGAAAGT ATTTTCTAAT 4680 

GAATCATCTC TACGTATACG GCCAACAAAC ACTTCATCTT TATTAGTAGA ATTGATTGCC 4740 

ATTGGATATT CATTGTTCTC TGGATTGTCT ACTAAAACAA CGCGGTCATC TTGATCAAAT 4800

AACGCTTTAA TATCTTCTGC TGTTGTTTCT TTGTCAAGCG TTACATCAAT TTCAACACTA 4860 

TGACTATCTT GAACAGGCAC ACGTGCGCAT GTTGCTGTTA CTTTTAAGTC TGGCGCATTT 4920 

AAAATTTTTC TCGTCTCATC AATCATTTTT TGTTCTTCTT TTGTATATCC GTTTTCTAAA 4980

AACACATCAA TATGCGGTAA CACATTATTA TAAATTGGAT GTGGATATGC TTCTGGTGCT 5040 

TGATATGTTG TATATGCCAC TCGTTTTAAA CCATAAGCAT CTTGCAATAC TTTTAGAGGT 5160

ACAACAGATT GAATCGTAGA GCAGTTTGGA TTGGCAATGA TACCTCTTGT AAATGTAGGT 5220

TCATTGACTT CCGGAACGAT TAAATCAATA TCTTCTGCCA TACGCCATTG ACTTGAATTG 5280

TCTATAACGA TTGCACCAGC TTTTTCAAAA AGTGGGGCAA AGTGTTCGCT TGTACCGCCA 5340

CCAGCACTCA TTAATACATA ATCGAAATGT TCACTTGCAC GAGCATCAGT TAATTCTTGA 5400

ACTGTATATG TTTTTCCTTG AAATTCAACT TCTTGCCCTG CAGAACGTGC TGATGAAAAT 5460

AATACTAATT CATCGAAAGG AATATTTTTA CGATTTAATG TCTCCAACAT TTTTGTACCT 5520

ACTAATCCTG TTGCACCCAC AACTGCTAAC TTTGTCATAA CTTGTCACTC CATTTTATAA 5580

TAATTTCCaA TTTTTAGAAT ATTTTAACAA TCATTTTACC ATTAAATGTT AAATGCGTCA 5640

TATAGTTTTT CTACCGCTTG TTGCCCATTA AAATCATCAA TGACGTATGA AATACTTATT 5700

TCAGATGTTG TTGTTTGGTA GAAAGGTATA TTATTTTCAA TTAATGTCAA AAATGCTTTT 5760

GATGCCACAC CTGACATATC ACGCATGGCT GAGCCAATTA ATGAAATTTT GACATAATGC 5820

TCATTGATTT TATAAGCTAA TGCTTCATAT TGATTCTTTA ATGTTTCAAG AATCATAGAA 5880

ATTTGATGAA AATCACTATC TTTAATCGTG AAGGATAGTT GTAGCCCATC CAAGTTGACG 5940

ATTTGTGAAA TCATATCAAC ATTTACAGCA CCTTCTTCAA GTTCCGTAAA TAGTTGGGTA 6000

AGTAGCTGAT TGTCAGGTAG GGGATAACTA ATTGTTACAT GCATCATATG TTTATCCAAA 6060

GCCACACCAG TAACTGCTTT TTTCTCTAAT ATTTCTTCAT TTGACATAAT CCATGTTCCT 6120

TTCACGTTCG ATAAAGTTTT TCCTAAATAT AAAGGGATAT TATAGTTTTT AGCTAATTCA 6180

ACAC7TCTTG TTTCAAGTAC ACCAGCACCT AAAGCGCTCA TTTCCATCAT TTCTTCATAT 6240

GAGACGATGT CTAGTCGTTT AGCCTTTGGT AAAAGTCTTG GGTCAGTGGC ATACACACCA 6300

TCAflCGTCGG TATAAATTTC ACAAGGTATT TGATTACTAA CAGCAAGTGC CACAGCGGTC 6360

GTATCAGAAC CACCTCTGCC TAAAGTTGTT AATTCCTGAT GTTCATTGAT GCCTTGAAAT 6420

CCAGCAACTA CTAAAATATC GTTTTCTTGA AAGGCTTGTT CAAATGTTTG AGGATTAATT 6480

TGAGCAATTT TACTTTTTAA ATGATGGCCA ATGGTTTTAA TACCCGCTTG ATAGCCAGTC 6540

ATTGCTTTGG CATTCATACC GATATCATTT AATACCATTG ATAAATAAGA TACAGTTTGT 6600

TGCTCTCCGG TTGTCAATAA TAATGCCAGT TCTTGTTGTT TTGGTGCTTT AGTCAAGGTT 6660

GATACATTCG TCATTAATTG ATCTGTTGTG TTACCCATAG CACTTACAAC GACAATTAAa 6720

TTGTTCATCT TGATTGACTC GCTCCTTTAA CATTTCAGCG ATCCTTTTTA TTTTTGTAAA 6780

ATCACTGACG GATGATCCGC CAAATTTCAA CACACTTCTT GTTACCATAT AATCCTCCTA 6840
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TAATCTATAT ACAAGTGATG CACTCCATTA TTTTTAAATA ATGACAAACT CTCAGCTCTT 6 960 

AACCAAAAAG TCCAACAAAT TATAACTGCT ATTATAATTG CTTCGGCATC GCACCCTTTC 7020 

AAATTTAGCT GTTAGCAGAC AGTAATCTAa ACTTTACTCA TGATTGATGC GCCTC 7075 

(2) INFORMATION FOR SEQ ID NO: 264:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5171 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 264:
AGACGTACTT TGTGATTmCG AAGyrCGTAC TGmAGCACTT GTCGACGTTG ATGTACTTGT 60 
TGAACCTGAT TGACTAGTAC TTTGTGATAA TGACTTACTA TCAGAATCAG ATGTACTTTG 120

TGAATCACTT AATGATTCTG ATGTACTACC TGACTGAGAC GTGCTCATTG AACTACTTAC 180

GGACATTGAT TTACTGTCTG ATGCAGATAA TGACCCACTT GTACTGATAG AGTCACTTAC 240

TATCTCTGAA GTACTCATCG AGTCTGATGT ACTTGTTGAG ACACTTTGTG ATGCTGCTAT 300

GCTTAGTGAT CCAGAAACAG AACCACTTGT GCTCGTCGAA TCGCTCAATG ATTCTGATGT 360

ACTCATCGAT TTTGAATCAC TTGTACTTAA TGATATTGAT GTACTTTGTG AATCTGATTT 420

GCTTGTTGAC GCACTTTGAG AGTTGGCTAT GCTATTTGAA ATACTGATAG AGTCCGAGGT 480

GCTAGCTGAC TCGCTCAATG ATGTTGATGT ACTAATTGCA TTCGATGTAC TGTCACTTAA 540

TGATGCTGAT GTACTAGACG ACCCTGATAT ACTCGTTGAT AAGCTTTGTG ACTTAGACAA 600

GCTTCCTGAT GTACTCATAC TTAATGAGTC ACTGAGTGAT GTTGATGTAC GCAATGAATC 660

AGATSTACTT GTTGATAGAC TTTCGGATTT TTCAGTACTG CTAGAGTTTG AAATAGAATC 720

GCTTAATGAT GTTGATTTAC TAGCTGAATC CGACATGCTT GATGATACAC TTTGTGAATT 780

CACTAAACTT GTGCTTGTTG AGCTTGATAC ACTATTACTT TCAGATGTGC TTAATGACTT 840

AGATGCACTC ACAGAATCAG ATAGGCTTAC ACTTGTCGAT TTCGAGGTAC TAGCTGATGT 900

AGATACCACA ATCGATCCTG ATGTACTCGT TGATGCACTT TGTGAGTCAG CTTTACTTGT 960

TGACACACTT TGAGATTGTT GTGTACTTCC TGATGTTGAT ACGGAATCAC TCATGCTATT 1020

TCTTGTTACT TCATATTTAA AAGTTGTCGT CGTTTTGTTA CCGCTCGCAT CTGTAGAAAC 1080

GATTGATATA GTACTTGTAC CAATGTTTGT TGGTGTACCA CTAATAGTAT TATTTGTACT 1140

ATCAAATGTT AGTCCGGATG GCAATCCAGT CACTGTATTC GTCACCGCAT TTCCACTGTT 1200
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ATTGGTGTCA 
TTGTTATTTG 
GTCCCAATGA 
GTATTTGTCA 
TTACCCACTT 
TAATATGTCA 
GTTGACGCAT 
TGCTGaTTAT 
ATATCTTTAC 
TCGAATGTTC 
GATAATGAAA 
GTTTGACCTG 
TCTTGGaACG 
CTTGATGTAT 
CCTCCACCAG 
GTATTGTGAT 
GCGGCACCGT 
TCTCCACCAT 
AAACTCTTAT 
TCCTGTGTTA 
ATATATTGTT 
GTCgCTGctG 
GCGGTGCTAG 
TGTGAAGTGT 
CTTGAAGTTG 
GATTCTGATG 
ACCTTTTCAG 
TTTcTCGTTA 
TGATTACCTA 



CTTGGTCAGT 
TTGAATTCGT 
CAGTCCCAGT 
CTATGGTTTG 
CTGATTGTCC 
ATGAACTATC 
CGATTGTCAC 
CTGTTGTTAC 
CAAATTGTAC 
AGTTGGTCGT 
CATATTTGAC 
TGTTATTTGT 
ACGTTGTCGC 
CTACATTAGA 
ACGTATCCAA 
TTAACCCTGT 
TTCCATGCCC 
TAGAGTCAAT 
ACGTCACAAT 
TTAAGTTATC 
ACGCAAATGT 
TTGACGTTGT 
TATTTGATGC 
AAGCTACAGA 
TAGATGTCAA 
ACTTTTCACT 
CACTTGTGGA 
CTGTTTCACT 



TGTCGTATCC 
AGACACAACT 
TGCACTATCG 
ACCATTATCC 
ATTGCCTACA 
AGCATTCGTC 
GACGGACGTG 
GACTTGATCA 
ATCAACGTAT 
TTGTTGTAAA 
ACCACTTTTC 
AGTCATAACC 
AGGTTGaACA 
AACACCATAA 
TGGGTCAGCA 
TTTGAAGCCA 
TTCACCTAAT 
TTCATATTTG 
ACGTGTTCCT 
ACCGGTACTT 
TTTATTAACT 
TGACATAGCT 
GCTAGTTTTG 
AGTACTTTGA 
TTTAGTATCA 
TTTCTCTTGT 
TGAGACTGTG 
ATCGGCTGTT 
TTGTGTGTTT 



ACAACATTTA 
GTCACTGTTG 
TAACTTAATC 
GTTGTAGTCA 
GTTACAGTTG 
ATTTTTACAG 
TAGTTATATC 
ACATTTCCTG 
CTCACTTGTG 
TTTGTCGCGC 
GCAATCCAAT 
TTTGTATCAC 
TTTAACTTCG 
CTATCTGTTG 
TTCGCCTTTG 
AATGCGTTAC 
ACACCTGGTG 
TTACCTAAAT 
AATGTAATAG 
TGATCATAGG 
GTAATTGTAT 
AAGCGACTGA 
TTTAAGTTGA 
TTTGTTGATG 
GAACTTGATG 
TGATTGCTTG 
TCACTATTTG 
GATGTTGATG 
AATTCAGAAG 



TTGTAAAAGT 
ATTGACCAAT 
CGCTTGGTAA 
ATACAATAGG 
GTGCTTTTAC 
TTTTATTTGT 
CTTTAGCAGT 
AATATGTTTT 
TAnCAGCAGA 
CACCTGTTGA 
CTGAAATATT 
CATTATAGTT 
CAGCATTATC 
TTACAAATGC 
CAGCTGAATT 
TTAAGCCACC 
AAAAGGCAAA 
TTACTTTTCC 
CACCTTTTTG 
TAGCATTACC 
TAGCAGTTAC 
AAGTTCGAAG 
CCGAAGATGG 
TATTAATTGG 
TAGTATTCTT 
TACTATTAGT 
ATGTTTGTAC 
CTTCGATTGT 
TTAATGGTGC 



TGTCGTCGAT 

TTTTGTTGGT 

TCCTGTAACT 

ATTCATTGTT 

ATCAGTAAAA 

ATCATTATAA 

CAATGCAGAT 

TGGTGGAATA 

CTCTGTATAT 

GGCTGTCATT

ACGTGTCCAT 

AATATCAAAA 

AGCTGTTGAA 

ACCAAACGCA 

TGGTTTAGAT 

AATACCTACT 

ACCGATACCA 

AGAAAAATGA 

GCTGTATGCA 

TGACGTTGTC 

TGCGGTTGTC 

TTTTACTGGT 

CGTTGTGCTT 

TTGTTCTGTA 

TGAGGATGTT 

TGTCGAAGTG 

CGAACTACTA 

CGTTGAGTTT 

ATCAGAAGCC 



1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
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GTTTTCAGTC CGTATCCCGT CATTTTTTTA 
CTAATGCTTT GATTATCTTG ACTCACTAAA 3120

CTATGACTAA TAAATGGTAG CCCCATAATT TTGAACATTT CTATTTCTTT AATTCCGGAT 3180

TTTACCCAAT TTTTTCCAGA TTTATAAAGT CTTACTCTTG TTTTTTCGTT TGCTAAGCTG 3240

TCATGAAATG CTTTCTGTCT TTTACTCATG TAATAACTCC TTGTATTATC TTTACATTCA 3300

TTAGATTATA ATATATGCCA CTATTCAATT TAATACAACT CTTTTTTGAT ACAAAAATAC 3360

TCATTTTGTT AAAATTTGTA AAAATTCaTT TTTATTCGTC TAAATGTAAT CGTTTTCATA 3420

TTTTTAAAAT TACTTTTTCT CGTTTATGCG TATAATCTTT TTTTATATAA ATTTGGCTAA 3480

TTGGCTTTAT GTTTAATCAT TATAATTGTT TCGTTTTTAA AATAATTATT GTATTAATAT 3540

ATCTATACCA TCCACCTTTT ATTTATAAAT AGTTAATTTA CAACTAAACG ATAAATATTA 3600

TATGCAAAAT ACATCTTTAA TATTAAAGTA ATACCAATAT TTTTTCAATA AACCTAGTGT 3660

AATATATGTG TAATTCTAAA AGATTCTTCT TTAAAAATAT AAATACCACG ACATATTGCT 3720

TTAACATTTT CATTTATAAA GCGAAAAAAT GCATCGCTAC TAAGTTGAAT GTTTAGTAAG 3780

GATGCATTGA ATTCACTAAA ATGATTAAAT TACTTATATC TTTTCATCTG ATTGATTATC 3840

GAAATTTCTT CCTTCTAAAC CTGCTAACTC TTCTTTAGAA GCTGCAGGTG CTTTCATTTC 3900

AAATATCTCA TTCACTACTG TGTAATCGTA ATATCCTAAT CTGGCAATAG GTTTAATCGA 3960

CTTAATGTCC AATTTACCAT TATCAAGAAT AACCTTATCG TCAATATGAA CTTGGGCAAC 4020

TCTTCCTATA ACAATATCTA CGGTAGATAC TGGATCTCCA GTTGGAATAC GAATCGTTTG 4080

AACGTACTCA CATTCAAAAT GAACTGGCGA TTCTTTTACA CGATATCCTG GAGCTTCTAT 4140

ACATTTTTCC TTTGTTACAC CTGCAAAATT AAATTCATCC TCTTCTGGTG GCAATGCTTT 4200

CGATGATAAA TTAACTGCTT CTCTTAAATC ATACGTTGCC ATATTCCACA CAAACCAACC 4260

TGTCTCTTCA GCATTTTTCA CTGTATCTTT ACGTTCGTGA TCACCAAGAA CGGATTGATT 4320

TGCTGCGAAC ATAACCATAG GCGGATCCCA AGTTAAGTTT TGATACTGAC TATAAGGCGC 4380

TAAATTATCT TTCCCATCTT TCGATACAGT AGAGATCCAC CCTATTGGAC GTGGTACTGT 4440

ACTACTTTTA AATGGGTCGT GCGGTAAACC ATGACTTCTT ACACCTTGTT TTGGCGAATA 4500

ATTCATACTA TCTTCACCCC TTATAAGTAA TTACATTTAA GGTTACGCCC TCTTTACATA 4560

AGCGTCTAAT ATAAATAAAC AATTTATTTA TAAGTAGAAA CTATATATGA CGTGGTTGCT 4620

TATAATTTGC GTTCTTGATT CGAAAAATTC AGATAAGGAT TTATACAATT AATATTTATG 4680

ATATCTTTTG TAAATTTAAT TAATTATAGT TACTTCAATC ATGATTAGTT TATAATAATA 4740

AAGTGAAATT GAAAAAGACA GCTATTATGC GATGAGCGAA AAACTTCAAG TAAAACAAGA 4800
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TAAGTTCAAA AAAGAATTCA AACCTGTTAT GCACTTAAAA GGTGATGCAT TCAATCAACA 4 920 

GTTACAATCT TTGATTAACA AATATCCACA AATACAAAAA AATATGAAAT CAGAGTTCAT 4980 

TGCTTATTAT GATAAAGAAA AAAATAGAGA AACAGTAAAA AACTATGCTT GGAACCTTCA 5040 

AAAATCTATA AATGACATTA TGCAATCATA TCCTAGCACA AAATTTGTAC AGTTTTATAA 5100 

AAGATGATGT TTCCCCGTCA ATGGTAGATG GAAATGGCCG TTTAAAATCG GGATACTAAT 5160 

GTATTTCCAT C 5171 
(2) INFORMATION FOR SEQ ID NO: 265:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3589 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 265:



CTACACACTA AACCTATTTC AGTTATGGGT GGTACAGTGA TTCTCTTTTC ATTTTTAATA 60

GGTATTTGGA TTGGTCATCC TATTGAAACA GAAATCAAAC CACTTATTAT TGGTGCGATT 120

AOTATGTACG TACTTGGGCT TGTAGATGAT ATCTACGATT TGAAACCGTA TATAAAATTG 180

GCTGGTCAAA TTGCCGCTGC CTTAGTAGTT GCTTTTTATG GTGTGACTAT TGATTTTATT 240

TCGTTGCCAA TGGGTACAAC GATTCATTTT GGATTTCTTA GTATTCCAAT TACTGTGATT 300

TGGATTGTTG CTATTACAAA TGCAATTAAC TTAATTGATG GACTCGATGG TTTGGCGTCG 360

GGTGTTTCkG CAATCGGACT CATTACAATA GGGTTCATTG CAATTTTACA AGCTAATATT 420

TTCATAACGA TGATTTGTTG TGTTTTATTA GGCTCTTTAA TTGGGTTTTT ATTTTACAAT 480

TTCCATCCTG CCAAAATATT TTTAGGTGAT AGTGGGGCTT TAATGATTGG ATTTATCATC 540

GGATTCCTTT CTTTACTCGG ATTCAAAAAT ATTACAATTA TTGCATTGTT CTTCCCAATT 600

GTTATCTTAG CAGTTCCATT CATTGATACT TTGTTCGCAA TGATTCGACG TGTGAAAAAA 660

GGGCAGCATA TAATGCAAGC TGATAAATCG CATTTGCATC ATAAACTATT AGCTTTAGGC 720

TACACACATA GACAAACAGT ATTATTAATC TATTCAATCT CTATTTTATT TAGTCTTTCG 780

AGCATTATTT TGTATGTATC GCCACCATTA GGTGTTGTAT TAATGTTTGT ATTAATCATA 840

TTTAGTATTG AATTAATTGT TGAATTTACA GGATTAATAG ATAACAACTA CCGACCAATA 900

TTAAATTTAA TTAGTCGTAA GTCATCTCAT AAAGAGGAAT AGGGAATGAA AGCATAGCTG 960

TATGGGATAA TTTGTATTAT ATGGCTTTAC TCTTTACAAT TTTTTTGTAT TAAATTTCAA 1020
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^ ATTTACCGTC TTATGATAGT GCTTTTTATT 
TTTATTCAGT TGGTATATCG AAAGGTAACT 1140

GCTTTGGAGT TTCTTCAGTC AAATCGAAAT TTCCTGCAGT CATTTGATTT AAAAAGTTAA 1200

TAAACGCTTC ATAGTCACTT TTAACGACAT CGATATAGTA GCTTACCTTA TCAGTGTAAG 1260

TTTGGTTTCT TAACATAAAA TGAGTTGAAG CTAATTCATA TTCAAATTTA CCAGTTTGAT 1320

CATAATTCAG TGTTACTATA CATGGTACTG CTTCTCGTAG TTCGACACGC CCGATATCAT 1380

AAATGACGTC TCTAACAGCA CCGCTATAGG CGCGAATTAA ACCGCCACCA CCTAATTTAA 1440

TACCACCAAA ATATCTTGTT ACTACGACAC ACGCATTATG aACATCGrGC TTTTTTaATA 1500

TGTCTAACAT TGGGaCACCG GcAGTTCCtG TCGGTTCACC ATCATCATTC GCnTTTTGAA 1560

TATTCATTtC AGGTCCAATA GTATATGCAG AACAATTATG AGTGGCATCT TTATGTTCTT 1620

TTTTTATTGC AGCAATAAAT GCTTTaGCTT CATCTTCATT TTGAACAGGT TTGATATGAG 1680

CAATGAATCT TGATTTACTA ATCACATTTT CAATAATGTG TTCTTTTTTA ACAGTAATGA 1740

TATTTTGTGT CATAATAACT CCTTAATTCA TAAGCTTAAG ATTATTTAAT CTTCATTATA 1800

CACTGAAAAT GACATGACTA TAAATCGTTT GATTGCCATT TTCTTTTTAA CTGAAATATT 1860

GTATCATTGC TATGAGTATA TTTTAGGAGG ACGACTATGA AAATTGCTGT GATGACCGAT 1920

TCTACAAGTT ATCTGTCGCA GGACTTAATC GATAAATATA ATATTcAAAT AGCGCCATTA 1980

AGTGTGACTT TTGAAGATGG CAAGATTATA CCAGAAGAAA AAGTTCGTAC TAAAAAGCGT 2040

GCCATTCAAA CATTAGAAAA GAAAGTATTA GATATTGTAA AAGACTTTGA AGAAGTAACT 2100

TTATTTGTCA TAAATGGAGA TCATTTCGAA GATGGTCAAG CGTTATACAA AAAGTTACAA 2160

GATGATTGTC CTTCAGCTTA TCAAGTAGCA TACTCTGAGT TTGGTCCAGT TGTTGCAGCA 2220

CATTTAGGTT CTGGTGGATT AGGTTTAGGC TATGTTGGCA GAAAAATAAG ATTAACATAA 2280

TTATAAAATT TTAATAAAAG AGTCTATATT GTAATTGGAA ATTATCTCTC GTATACATGG 2340

CTTTAAATGT TCATCATTTG AAAGCCAAAA TGCTAAAGAT ATAAGAAAAT CATTATAATA 2400

TTAGGCTCTT TTTTACGTTG AAATGAGGTT TTAAGCATTA AACATTACGG GAAATTAATT 2460

CATCCTCATA CTTCACTTAC TAATGAAAAA ATTAAAAAAG AAGTAACAGG TGTCATCAAA 2520

CAAAATTCAA ACTATTATTG TGTTCAATGT GAAAGTACAA ATCCAAAGCA TTTTTATCAG 2580

TATGATTCCT CAGTACATTC CAAGAAAATT GTATATTGCA GAAATTGTAT ATCACTGGGT 2640

CGAATGGATA ATGTAACAAG ATATAAAATA ACAGAGAGTT CGCAAAGTTC ATCACAAGCA 2700

TATTATCATC TCTCATTTGA ATTGTCGGAA CAGCAGTCTT ATGCCTCAGA ACATATTGTT 2760

CGAGCCATTA GAAAGAGACA AACGATTTTG TTATATGCCG TAACAGGTGC AGGTAAGACA 2820
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70 



TCACCACGTG TAGATGTTGT TGTAGAAATT AGTAAACGTA TTAAAGACGC ATTTCTTAAT 2940

GAAGATATAG ACATACTACA CCAGCAATCA AGACAACAAT TTGAAGGGCA TTTTGTTGTA 3000

TGCACAGTGC ATCAACTTTA CCGATTCAAA CAGCACTTTG ATACTATTTT TATTGATGAA 3060

GTCGATGCCT TTCCTTTATC AATGGATAAA AATTTACAAC AAGCATTGAA GTCATCTTCT 3120

AAAGTTGAAC ATGCAACAAT TTATATGACA GCAACACCAC CGAAACAACT TCTGTCAGAG 3180

ATTCCCCACG AAAATATAAT TAAATTGCCA GCTCGCTTTC ATAAAAAATC ACTTCCAGTT 3240

CCTAAATATC GTTATTTCAA ACTTAATAAT AAGAAGATTC AGAAAATGTT ATACCGAATT 3300

TTACAAGATC AAATTAATAA TCAACGTTAT ACACTGGTGT TTTTTAACAA TATAGAAACA 3360

ATGATTAAAA CATTTTCGGT TTATAAGCAG AAAATTACTA AATTAACATA CGTCCATAGC 3420

GAGGATGTTT TTCGCTTTGA AAAAGTTGAA CAATTAAGGA ATGGACATTT CGATGTCATT 3480

TTTACTACGA CAATATTAGA ACGTGGATTT ACAATGGCAA ATTTGGATGT TGTTGTTATC 3540

GATGCACATC AATATACTCA AGAGGCTTTA ATACAAATTG CTGGACGTG 3589



(2) INFORMATION FOR SEQ ID NO: 266:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1017 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 266:


TTTCCAAGAA GGcTTgAAAA AtGTTwCaAG TGGTGCGAmC CCAGTTGGTT TACGACAAGG 60

TATCGACAAA GCAGTTAAAG TTGCTGTTGA AGCGTTACAT GAAAATTCTC AAAAAGTTGA 120

AAATAAAAAT GAAATTGCGC AAGTAGGTGC GATTTCAGCA GCAGATGAAG AAATTGGACG 180

TTATATTTCT GAAGCTATGG AAAAAGTAGG TAACGATGGT GTCATTACAA TTGAAGAATC 240

AAATGGACTA AACACTGAAC TAGAAGTGGT TGAAGGTATG CAATTTGATC GTGGTTATCA 300

ATCACCGTAT ATGGTTACTG ATTCAGATAA AATGGTTGCT GAATTAGAAC GCCCATACAT 360

TTTAGTAACA GATAAGAAAA TCTCGTCTTT CCAAGATATC TTACCTTTAT TAGAACAAGT 420

GGTTCAATCT AATCGTCCAA TCTTAATTGT AGCTGATGAA GTTGAAGGCG ATGCATTAAC 480

AAATATCGTG CTAAACCGTA TGCGTGGCAC ATTTACAGCT GTTGCAGTAA AAGCACCTGG 540

TTTTGGTGAT CGTAGAAAAG CGATGCTTGA AGATTTAGCT ATTTTAACTG GTGCGCAAGT 600

GATTACTGAT GATTTAGGCT TAGATTTAAA AGATGCATCA ATTGATATGT TAGGTACTGC 660
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CAGCATTGAT GCACGTGTTA GCCAATTGAA ATCTCAAATT GAAGAAACTG AATCTGACTT 780 

TGATCGTGAA AAATTACAAG AGCGCTTAGC TAAATTAGCA GGTGGTGTTG CAGTTATCAA 840 

AgTAgGTGCA GCAAGTGAAA CAGAGCTTAA AGAACGTAAA TTACGTATTG AAGATGCATT 900 

AAATTCTACA CGTGCAGCAG TTGAAGAAGG TATTGTTGCA GGTGGTGGTA CTGCATTAGT 960 

AAATGTTTAC CAAAAAGTAA GTGAAATTGA AGCTGAAGGT GACATTGAAA CAGGTGT 1017 



(2) INFORMATION FOR SEQ ID NO: 267:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1409 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 267:


TTAATCCAGC GTTAACTGTA TTTGCATTTA TTATGATTAT TTCGATTCTT TTAGCGTATG 60

TATTTAAATG GCTTGGATTA GTGGATGATG TGTTATTAAT GGTCATTATC ATTTCAACTA 120


TTTCCTTAGG 


CGTAG TTGTT 










180 


GGCAATTTAT CCTATTAGTA GCAGTACTTG CGGACTTAGT AACTATGATT TTATTAACGG 240

TCTATGGCGC AATCAATGGT CAAGGCGGCA GTACAATATG GTTAATAGGT ATATTAGTTG 300

TTTTCACAGC AATTTCATAT ATTTTAGGTG TTCAATTTAA AAGAATGTCA TTTTTACAAA 360

AATTGATGGA TGGTACGACG CAAATCGGTA TTCGTGCGGT ATTTGCATTA ATAATATTAT 420

TAGTAGCCCT AGCAGAGGGA GTTGGCGCAG AAAATATATT AGGTGCATTC TTAGCAGGTG 480

TCGTTGTTTC ATTATTAAAT CCAGATGAAG AAATGGTTGA AAAGTTAGAC TCATTTGGTT 540

ATGGGTTCTT TATTCCTATT TTCTTTATAA TGGnTGGTGT AGATTTAAAC ATACCTTCAT 600

TAATTAAAGA ACCGAAATTA CTAATTATCA TACCGATTTT AATCGTnGCA TTTATCATTT 660

CAAAATTAAT TCCAGTCATG TTTATTCGAC GTTGGTTTGA TATGAAAACA ACGATTGCAT 720

CAGCATTTTT ATTAACATCA ACATTATCGC TCGTGATAGC TGCAGCCAAA ATTTCAGAAA 780

GATTAAATGC TATTTCAGCT GAAACGTCAG GTATATTAAT TTTAAGCGCA GTCATTACAT 840

GTGTATTCGT TCCGATTATT TTCAAAAAAC TGTTTCCAGT TCCAGATGAG TTTAACCGTA 900

AAATTGAAGT TAGTTTAATT GGTAAAAATC AATTAACGAT TCCTATAGCG CAAAATTTAA 960

CATCTCAGTT ATATGACGTG ACATTATATT ATCGCAAAGA CTTGAGTGAT CGTCGTCAAT 1020

TGTCAGATGA TATCACGATG ATAGAAATTG CTGATTATGA ACAAGATGTT TTAGAACGAC 1080

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AAGTTGCTAA ATTAGCCAAA GCACATCAAG TTGAGCGTGT CATTTGCAGA CTTGAAAGCA 1200 

CAACGGACGA TACAGAGTTA GTTGATTCAG GTATTGAAAT TTTCAGTAGC TACTTAAGTA 1260 

ATAAAATCTT ATTAAAAGGT TTAATTGAAA CACCTAACAT GTTGAATTTA TTAAGTAATG 1320 

TTGAAACGTC ACTATATGAA ATTCAAATGT TAAATTATAA ATATGAAAAT ATTCAATTAC 1380 

GTAATTTCCC ATTCGGAGGA GACATCATC 1409 



(2) INFORMATION FOR SEQ ID NO: 268:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4702 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 268:

AAAGAGGGTT TTTTACTACT ATAATCCATC CTTTAATGGA ATTCCATTTG TCCCTTTGGC 60 

CCGTTCATAA CCATAGGAAA ATATATGCnG nAATCATTGA TAGTAAATAC ATCAATAATA 120 

CGAAXAATGT CGGTGAAATC ACGTCCGTAT TACCATTATT AGCTAAAACA TTTTCCAAGT 180

TTTCTTTTGA ACCGGATACT CATAAATGCT TTTAATGCnT GGTTTTGTCT GTCGCCATCT 240 

TTAGAATTTG TAATACTTTT TTAAATTCTT TAGCGAACAA CTCATTATCT TTATOGTTTT 300 


TAGCCATTTG ACGATTCAAA TCGTTAGCTC TTACGCCTAA TGCTCGACCT GTCGCATGTA 360 

GCTTATTCAC ATTATTGTCG ACTTGATTTA ATTGGCCACT AACAGAATCT GCAATTGATT 420 

TTGATTCTTG TGTATCTGAT AGCAATTGCG TACTCTTTTC AGAAATTCTA CTAATTTCTT 480 

TATCTAAATT TGAAGACATC GTATTAAATT CATCATTTTT GCCTTTATCA ATTTTTGGTT 540 

CTTOTGGCTC TTCAGCAAAA GTCTTTTTAA CGTTTTCTAA CTGATCAATC AGCTTGGAAA 600

TATCTTCTTk ATTTTTTGTT gTATTCTTTT TGTTATTTAA AATGTCATCA ATCAGTTTGT 660 


CTGAGTTTTC TTCCATTGAA TCAATTTGaT GTAACACAGC TACTTTATCG TCTTTGAAAC 720 

TTTCCATGTC ATTGATAACT TGGTCAACCA TCATATCAAT TAAACGTTTG TTGTCGAATG 780 

GTTTATCTTC tCTGCCTTTT GTATCTGTGT ACATTTTATA ATGCGCATCA AACCTAGATA 840

ATGCACTCAA TTGCTGGCTT AATGCATCTT TCGATAAGCG ACCATCaAGG TTATGATTCA 900 

ATGTTACATC CACAACACTC GTTGCTTTCT TATCATTTGG TTCATCTTGA CGATTTGCTT 960 

GTCCAAATAA CAATTGTAAA TGCATTGTTT TATCTTTTAA GAAATCTTTC TCAGCATCCT 1020

TTTTCAATTT AGCAACGCCA TTGACTTCAA CTTTATATTC CTTGTTAGAT GTATCGAGTT 1080 

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TAAAATGAGG 


AtCTGTTGCA ACAGTTAATT 


GATTAATATC 


ATTACTTTTA 


ATCGTTTCAG 


1200 




TACGTTGCAC TTTGACACCA TCATTAATCA AACTACTTGT GTCTTGCGCA ACTTGGTTAT 1260

CGTAATCTGT TAAATTAATG TGTTCTGCTA ACGGTTTTTT CAAATTATAT TCATTTTTAT 1320

AACGTTTTGC TTCTTTGACA ATTGCTTCGT ATTTATTAGC TTCATCCTCA TTTAAACCTG 1380

CAGCTATAAA GTCTTGTTTA GACATGTTAT AGATAAATGT TGTATCTGTA TCAGGTTCTT 1440

TGACAATATC ATCATGAAGT TGTTTCTCTA AGTTTTCAGC GAATTGAGCA TTGTTCATTT 1500

TAATGCTATT TAGCGCATCT TGTAAGTCTT TGTTATTTGC AAGCTCATCT TGCAGTGATT 1560

CTGTTAATTG CTTACGATAn TCTTCAATCA TACCTTTTGA AAATGGTGAC TCTTGTGATT 1620

GAATGATTTC TCTTAATTTA TCTAAGTTTT CTTTAACAGT TTGTTTATAT TCTTCTTTAC 1680

CTGTATCTTG CATACTTGAT TGTTGATCAA TTTGGCTGTC CATCTGTTTT AATGCATTGA 1740

TATAGTTATC AAGTTCCACG CTATC'lTfTT GCGATTTATA ATCTTGTAAC ATTTTATCCA 1800

TCGCTGTATT GTGCTCGTCA AATAATGAAT TTTGTTTTTC AATTAAAGTC GAAACATTAT 1860

AATCTGTGTT CACTCTGAAC GTATCTGAAT TCGCACTCAA TAATGATTTA TTGTATGTTT 1920

GGAACCATTT TGTAATGTCT TTGTTTGCAG AAATTGAATT TACAAGCGTA TCTGTAAATA 1980

ATTCCGGGAA GTCGTTAATT GGATTTAATA AGTAATTCGA GAATTTACTA TTCACACCAT 2040

GTTCACGCGT CATAATAGCG CCAACATTTT TTTGTGCATT ATGTAAATTA TCAATGATGC 2100

TTGTTAAATA AATTTCGACT AAGTTTTTGT TAAAGTCGTT AAGTACATTA CTTACAACTT 2160






TTTAGCTACT. 


TCTTCTTTTT 


GTCCTACAGC 


TGTT1TATAC 


TGTAGCGATA 


2220 


TTTTCGATGG TGTTTTAGCG TCTAATTGCA TTGCCAATTT TGAAAAGTTT TCTGGGATAA 2280

CAATCATGAC TTGGTATCCA CCATTTTTCA AACCAGACTC AGCAACGTTT CTTGTTACTG 2340

TTTCAAATTT ATAGTTTTTC TCATTTGCTA ACCTTTTAAT AAATGCTTGA CCCAGCTCAA 2400

CTTTTTTACC GTTATATGTC GTTGGTTGAT CCTCGTTAAC AATTGCGATA TGTATTTTAT 2460

TATTTTTATT ACTTACACTT TGGGATCCTT TTTCTGATTG ATCTCCATAT TTTGTTTGAA 2520

CAAAAAATAT CATACTAACT ATGGCAATTA TAATAATTAA AGTGACAATT AATGCATAAA 2580

TCCAATTTTT d u i CATG CTTATTTCCT TTCAGTTGTT TTCTTAAAAA AATGATAAGC 2640

AAAGCCACAT TAGAAAATGT GACTTTGCCA ATTTCAGAAT GCTTATTGCA AACCGAAATT 2700

ATTAGAAAGT TGTTGGTCTT GTTCTTGAAC GGCATCAGCA GTGCtATTcA ATTGTTGTTT 2760

AATTTCTTCT AATAATTGTG CAAATTTTTC TACTTTAGGA CTAAGTTGTT GGAATTGCTC 2820

TTCGAAACGG CTGAAAGCTT GACCTTCCCA GTTCGCTGCA ATTTCACCTt GTGCACGkGT 2880

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TCTGATTTC.C TCTGGACTCA TCTTAATCAT TGCCATAACT AGAAACCTCC TGAATATTTT 3000 

AAGTTTATCa AAACTTTTTA GGGACACTAT TTTTTGAAAA AGTGCTCCTT ACTCAAATAA 3060 

TATATAAATT ATTAGTATAT GTATATAGTt TTTTAAGTAT TTTTAGCTTT TTTAAAATAA 3120

ATATATTGAA TATAACCATA TATTTTTAAT TAACCATTCA TTTTTGTAAT ATAAATGTGT 3180 

ATACTAAAAT TAAATTAAAT ACATAAAGGA TTAAATGGTT ATTATGAAGA AAACAATTTT 3240 


ACTGACGATG ACAACTCTTA CTTTATTTAG TATGTCGCCT AACTCGGCTC AAGCATATAC 3300 

GAATGATAGC AAAACATTAG AAGAAGCAAA GAAAGCACAC CCAAACGCAC AGTTCAAAGT 3360 

GAATAAAGAC ACCGGCGCGT ATACTTATAC ATATGACAAA AACAACACGC CAAACAACAA 3420 

TCATCAAAAC CAGTCACGTA CAAACGACAA TCATCAACAC GCAAATCAAC GTGATCTTAA 3480 

CAACAATCAG TACCATTCTT CATTAAGTGG TCAGTATACG CACATTAATG ACGCAATTGA 3540

TTCACACACA CCGCCTCAAA CGTCACCAAG CAATCCTTTG ACACCAGCAA TACCGAATGT 3600

CGAAGACAAT GACGATGAAT TAAATAACGC TTTTTCAAAA GATAACAAAG GGCTTATTAC 3660 

AGGCATCGAT TTAGACGAAT TGTATGACGA ATTACAAATC GCCGAATTTA ATGACAAAGC 3720 

AAAGACCGCT GACGGTAAAC CTTTAGCATT AGGTAACGGT AAAATCATTG ATCAGCCTCT 3780

TATCACAAGT AAGAACAACT TATATACTGC TGGACAATGT ACATGGTATG TCTTTGATAA 3840

ACGTGCCAAA GATGGACACA CGATTAGTAC ATTTTGGGGA GATGCTAAAA ACTGGGCAGG 3900

CCAAGCTTCA AGCAATGGCT TCAAAGTAGA TAGACACCCA ACACGAGGAT CAATTTTACA 3960 

AACAGTAAAT GGTCCATTTG GTCATGTAGC CTACGTTGrA AAAGTTAATA TTGATGGAAG 4020 

TATTCTAATT TCAGAAATGA ACTGGATTGG TGAATATATC GTTTCATCAA GAACCATCTC 4080 


TGCTTCAGAA GTTTCATCAT ATAATTACAT CCATTAAATT AATCATGACA TCAATAAAAA 4140 

GCGACCAGTT CGCAGTTTAC AATTCGTAAC ACTGCAAAAT TGGTCGCTTT ATTTTGTATG 4200 

TTATTCGATT ATAAAATTAC AAAGAAATGT TCTCTACATT CCCCATTAAT CAAAATCGTT 4260 


TAOGAAAGTA TAATTGTAGC TATAATAATC CAAGTCGTAA CAACTAGTGG CACTATCGTC 4320 

TTGAATAAGA ATATACCGTA TTTTTTCTTG CGATATATAT CCAGTACTAG CCAAATTAAA 4380 

ATGATTATAA CACCAACAAA AATAAATACA GGATTCATCG ATATAGCATC TGCCTGTAAC 4440

TCAGGTTGCA TTCTTAATTT AGTGATAATT AACATCACTA CTGAAATAAT GAAAAAGTAG 4500

ATACCTCTTA TCTTTGATGT CTGTAAATCT AATTCCTGCT CTTCAATGAC CTCTTTAGAT 4560

TCACCCAATT CTTTTGCAAT CAAATAATTT ACTACCTTAG GTTTCACCCA TAAACACTTA 4620 

ATTGCAAAGT ACATAAAAAT ATATGATCCA GTATCCATAA ACATTAAAAA GTTGCTTAAA 4680 
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(2) INFORMATION FOR SEQ ID NO: 269; 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2004 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 269:

ACAAAAAATT CACCCTCATT AATATTGAAA CTAATGTTAT CGACAGCAAC ATGTTTGCCA 60 

TAACGCTTAG TTACATTTTT AAACTTAATC ACTTTGCCAC CTCTTTTTTT CTCATAGCAT 120

AAAACCGAGA TTATATGTAT GTATTCCCTA TTTAACCACG TTTATTACAA TTTTCAAATT 180 

TAAATGATTT ATCCTTGAAC TTTTTTAATA AAATAATGAA TAAwAGGwAA TCwCCAGTTA 240 

AGAAATAGTG TTATTTTACC TTGAATTCAA AAAAACACCC AGTAAAACAA GGAATGCTTA 300

CTAGGTGTCT TCACTATACT TTGGCTTTAT AATTTTGAAT CGTTtCTAAA AATGCTGGAC 360 

AATAATGTTT TAATTTGTAA CTACCTACGC CATCmATATT AATCATATCT TGTTTCGAAG 420 

CAGGCTTACG TTTAGCAAAT TCCTCCAACG TGTAATCAGA AAATATACTT ACAGGTGCTA 480

TCGTTAATTT GTCACTTAAC TTTTTACGAA CTTCTACCAA CTGACTGAAT AATACTCGGT 540 

CAACCCCTTC AACCGTATTT ATAAATACTT TTTCAGTCGC TTTTTGCTTA AATGGTGTTG 600 

TGAATACTTC TACTTCATTA CTGAGTAATT TTTTAATTGA AGTATCACAC ATTAATATTT 660 

CGTCATTTTC ATTTAAGAAC CCTTTGAATC TTAATTCATC TATTAAGTGA CTTAATTCTG 720 

ATGTTGTGTA ACCTTTCATT AAACCATGGG TTGAAATTTG GTCATAACCT TTATACTTAA 780 

TATAATCTGk TGACTCTCCT CTTAACACTT GAATGATAAC ACTATAACTC TCTTGTTGTT 840 

TCATACGAGC GATGCAACTA ATAATCATCT TAGCTTCTTG TGTCATATTA TATGATTTAT 900 

CTTGTTGAAC ACAATTACTA CATTGTTCAC ATTCTTCTAA TTTTTCATTC GGTTCAAAAT 960

AATGGACAAT TGTTGCTTCT AGACATTTTT TTGTTTTTGT ATATTGAATC ATTTTAGTTA 1020 

ACTTTTCGCC CATTTTATCT TTATAGTCAT CATCAGCTTG AGAGACTGTT ATAAAATACT 1080 

CGTGTAAATT GATAtCGCGT TCGCTAAATA ACAAAATACA TTCACTTTTC AACCCGTCAC 1140

GACCTGCACG ACCCGCTTCT TGATAATAAG ATTCTAAATC TCCAGGCATA TTATAATGAA 1200 

TAACAAAGCG TACATTGGAT TTATCAATAC CCATACCAAA AGCATTTGTA GCAACGACTA 1260 


CTTTAACACG ATCAAATAAG AAATCATTCT GCGCTTCTTC TCTTTCTTTA TTGCTCAAAC 1320 

CTGCATGATA TATAACACTT TCAATTTTCT GACTTTCTAA GGCTTCTTGA AGCTCTTCAA 1380




EP 0 786 519 A2 

CCAATATAAA TTTTTGACGT TGATAAGTAG GATTTACTTT AAAAATTAAG TTTCTACGCT 1500 

TAGTACTCGT TTTAATTTGA TCAGTTTGAG CGATATTTAA CTTTTCTCTA ATATCTTGCT 1560 

GTACTTCAAC CGTGGCAGTT GCTGTCAACG CTATTATTGT AAAATCTTGA GGTAACGTAA 1620 

ATACTTTTGA AATAACATTT TGGTAACTCG GCCTGAAATC ATGACCCCAT TTAGAAATAC 1680

AATGCGCTTC ATCAAACGCG ACTAAGTGAA TCTTTATACG CTGAAGCATA TTTAAAAAAT 1740 

ATCGGTTTTC AAATCGTTCT GGTGCAACAT ACAAAAATTG AATTTCTCCA TTTGATAATG 1800 

CTTTTTCAAT ACGTTGTTGC TCTTTTTGAG TCAAACTACT ATTTAAAAAA GCAGCTTGAA 1860 

TTCCCATCGC TTTTAATTGA TCCACTTGAT CTTTCATTAA TGATATTAGT GGACTTATTA 1920 

CAATTGTTGT ACCACCTAAC mATAAACCTG GTACTTGrTA GCmTATAGAC yTACCTCCAC 1980 

CAGTtGGkAA GrCACCAAGC ACAT 2004 
(2) INFORMATION FOR SEQ ID NO: 270:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2244 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 270:

AAAGATTGCT TGCCTTGAGG GTTTATATAT CTGACTCAAT TGCCACATTT TTATCAAGAG 60 

TAGTTGATAA TACTCATCAT AATTATAGCT AATATTATAT TTTTTTAAAA GATAGTGTAT 120 

GATTTTCTGG TGTTTGTTGT ATACGTCATT AAATTTCAAG TAGTCATTCT CCAAGTTATA 180 

CGTATAACAA ATATTTCCGG ATAAAGTTAG AATAAAATAT TTAGAAAAAT CATTCATTTG 240 

CGTAATCGCT AAATTAAGTG TTAAATATAA GACATAAGTA ATTAATTTAA TGTGATATGA 300 

TGTATTATTA CTTTGCTAAA TAGTAGATAG AACAAAATTT GTAATCGGGA GGTAACAATG 360 

GATTACGCAC ATTTAAATTT AGAACATTTT TTTGCACGAA ACGACGATTT AGATGTTATA 420 

AGAGATCGCG CTGATTTCGT GATGATAAAT AACTTCACTA ATGAAATGAT GTATCGTGAT 480

GGTCAAATTG AAGGCACGAT TGATTTAAAT CAGTACTATT ATAAAAATAG ATCAAATGCA 540

GCAAGTTTTA TTATGATGGA TTATAAAAAA GAAACTAAGT AAACGAACAA AAGAATTTTT 600 

TGTTTTTTAA TACGTGAATA ATAAGATTAT TGATATAAAG GTTTTCAAAG GTTATACAAA 660 

AAGATAAAAC ATTTATGATT CGTAGATCAA CGTAAAGTAA TGTTGATAAA TGGTTTAAAA 720

CGTTTCATTT ACATTACTGT TTATTTATGA ATATGTAACA ATGCATAGAT AAAATTGTTA 780 
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ACCTAAGAGG 


TGTGGATATG 


AATAAACACA 


AGAAAGGTTC TATTTTTQGA ATAATAGGAC 


900 




TTGTTGTCAT ATTTGCTGTT GTCyCaTTTT TATTTTTCTC AATGATATCC GATCAGATAT 960

TTTTCAAACA TGTTAAATCC GACATTAAGA TTGAAAAGTT AAATGTTACA TTAAACGATG 1020

CAGCAAAGAA ACAAATAAAT AATTATACGA GTCAACAGGT ATCAAATAAA AAGAATGATG 1080

CATGGAGAGA TGCATCTGCA ACTGAAATTA AAAGTGCAAT GGATAGCGGT ACTTTTATCG 1140

ATAATGAAAA GCAAAAATAT CAATTTTTAG ATTTATCAAA GTATCAAGGG ATTGATAAAA 1200

ATAGAATTAA ACGTATGTTA GTAGATAGAC CAACGTTATT GAAACATACG GATGATTTCT 1260

TAAAAGCTGC TAAAGATAAG CACGTTAACG AAGTTTATTT AATTTCACAT GCATTATTAG 1320

AAACTGGCGC AGTTAAAAGT GAATTAGCTA ATGGAGTCGA AATTGATGGC AAAAAGTACT 1380

ACAATTTCTA TGGAGTAGGA GCCCTTGATA AAGACCCAAT TAAAACAGGT GCAGAATATG 1440

CTAAAAAGCA TGGTTGGGAT ACACCTGAAA AAGCTATTTC AGGCGGTGCT GATTTCATTC 1500

ATAAGCACTT CTTATCAAGC ACAGATCAAA ATACATTGTA TAGTATGAGA TGGAATCCAA 1560

AAAATCCAGG AGAACATCAA TATGCTACAG ATATTAAGTG GGCAGAAAGT AATGCAACAA 1620

TTATCGCTGA CTTTTATAAG AACATGAAGA CTGAAGGAAA ATACTTCAAA TACTTTGTGT 1680

ATAAAGATGA CAGTAAACAT TTGAATAAGT AATTTGATAA GCTACGAGTT GTTTTTATGA 1740

CTCGGACATA CTAAAAAGAC GCTTTCTATC TTGTTTTGAT AGAAAGCGTC TTTTTGCATT 1800

AGAGAAAACA CATTGATkGA TAAtCCCaCC aATGCAAgTG GGGcAGGACa TCGATAAAGA 1860

ATTACTTTTT CTTTAGAAAT TAGTATTTCT TATGCATGAG TTTTACTCAT GTATTCCTAT 1920

TTTTAAGTAC ACATTAGTTA TAGCTAATGA TAAAGAACCA CTACATAATA AATCATTAGT 1980

GTTTTTTTAT CATTTCTGTC CCaCTCTCAT CGTGATTTGA AATTTTCAAT TGCGATTTTA 2040

ATTTCATCTC TTACACGTTG GAACTCTGAC CAAGGCTTGC CTGCAGGATC ATCAAATCCC 2100

CAATGTTCTT TCTTAACATT TGTTGGTAAA GAAGGGCAAT TTACGTCTGC ATCACTACAT 2160

AATGTAACAA CTAAATTTGA ATTTnTAATA ATATTATTAT CGGATTAAAA TCTGATGGAT 2220

GATTTGATAT ATCAATGCCT ACTT 2244


(2) INFORMATION FOR SEQ ID NO: 271:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1371 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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ATAAGCAATT TAATTTTGAG TCTACAATGG AGGAATTATC ATCTTTATCA GAGACTTGCC 60 

AACTTGAAGT GTTGGGTCAA ATTACTCAAA ACAGAGATCG TGTAGATCGC AAATATTATG 120 

TTGGTAAAGG TAAAATTGAA GAAATTCAAG CATTTATTGA GTTCAAAGAT ATTGATGTAG 180

TCATCACAAA TGATGAATTA ACGACTGCAC AATCCAAATC ACTAAATGAA GCTTTAGGTG 240

TAAAAATTAT TGATAGAACT CAGTTGATTC TTGAAATATT TGCATTAAGA GCAAGAAGTA 300

AAGAAGGTAA ATTGCAAGTA GAGCTAGCAC AACTTGATTA TTTATTACCT AGATTGCAAG 360 

GCCATGGTAA AAGCCTTTCT CGTTTAGGTG GCGGTATTGG AACTAGAGGC CCTGGTGAAA 420 

CGAAGTTAGA GATGGATCGC AGACATATTC GAACTCGTAT GAATGAAATT AAACATCAAT 480

TGCGGACGGT AGAAGAACAT CGCGAAAGAT ATCGAAATAA AAGAAATCAA AATCAGGTGT 540

TTCAAGTAGC TTTAGTTGGT TATACAAATG CTGGTAAATC ATCATGGTTT AATGTTTTAG 600

CAAATGAAGA GACGTATGAA AAAGATCAAT TATTTGCAAC GTTAGATCCT AAAACACGAC 660

AAATTCAAAT AAATGATGGA TTTAATTTAA TTATTTCAGA TACTGTTGGT TTTATACAGA 720 

AACTACCTAC GACGTTAATT GCAGCTTTTA AATCAACTTT AGAAGAGGCT AAAGGTGCAG 780 

ATTTATTAGT ACATGTCGTA GATAGTAGCC ATCCTGAATA CCGTACGCAG TATGACACAG 840

TTAATGATTT AATCAAACAA TTAGATATGA GTCATATTTC TCAAATAGTT ATTTTTAATA 900 

AAAAGGACTT ATGTGATCAT GCATCAAATC GTCCAGCAAG TGATTTGCCT AATGTTTTTG 960 


TTTCTTCTAA AAATGATGGT GATAAATTAC TTGTTAAGAC GTTATTTATT GATGAAATCA 1020 

AAAGGCAATT AACTTATTAT GATGAGACAA TTGCGACGAA TAATGCAGAT CGATTATATT 1080 

TTCTAAAACA ACATACATTA GTGACTGAAC TTAAATATGA TGAAATTGAA AATGTTTATC 1140 


GTATAAAAGG ATTTAAAAAA TAATAAAAGG ACGAAATTCA AATGAAAGAT ATAAGTAAGA 1200 

TAGTAGCTGA CGTCGAATCA ACGTTAGCAC CATATTTTAA AGAAATTGAA GAAACAGCAT 1260 

ATATTAATCA AGAAAAAGTA TTAAATGCAT TTCATCATGT CAAAGCAACC GAAAGTGATC 1320

TACAAGGATC AACAGGATAC GGGTATGATG ACTTTGGACG TGATCATTTA G 1371 
(2) INFORMATION FOR SEQ ID NO: 272: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6035 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 272: 
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CGTAAAcCTA TGCGTTTTAA TATTCTGAAG TTACTTAGTT CATCCTCAGT TTCATCCATT 120 

TGTTTAATAT AAATAATACA TCCAGCTGCT ACTAAAAATG CTAATCCTAA AAATGATGTA 180 

ACAAATATTA GAATACCGTT AGTAGCATcG ACCTCTTTTT TCATTTCATC ATACGTGATG 240 

ACTTTGTCTC CAAACTGTTT TGCAATTGCT TGAGCTTTTT CCTTTTGTGA TGTTTGTTTA 300 

ATATCATATC CATAAAAAGT ATGAACGTTA TTTTGTGTTT TCAACTGCTG ATACTTTTCA 360 

GGACTTACTT CGATGACAGG TGAGTTGAAG CTTAGATTTA AAGGATAAAC CTTACCTTTG 420 

TCTTCTTGTG TAcACGGAAA GTTTCATTCT TAGTACCTTT TACTACTAAA TCTTTGTTTA 480

AATGGATATT AATAATGTTA GGCAGCGATT TTGTATTTGT AATGATGGCA TTGTTGCCTG 540 

TtAACTTGTT ATTTGCACTT AAAATAGAAT TCGTGCGACC TGAATCACTA CCATTTTCCA 600 

AAGTAATAAC CTGATCATTA ACATTATCTA CAGTAATAGT TTCGTAAGCA TTTTTAGAAA 660 

ATGTAATTTG TTGTTGGCTt AGTTTAGTTT CAAATTGTTT AGCATCTTGA GTAGCGACCA 720 

CGTTAAATTC ATTTGGTGCC ATAGATGTAA GGGTTTGATC TGTATTTGAT TTAGATAATG 780

CCGCAAAACA CAATACAGTT ACTGTAACTG CAGAAATAAT TGCAATGATA GTTAAAGACA 840

TGGCATTTTT CTTCATTCTG TACATAATAG ACGATGTGAA TACAACATCG GTAATAGATA 900

CGCGTCCATT TTTTGATTTT TTCAATGTTT TAAAAATAAG TGACACGGAA CTTCTGAAGA 960 

ATAAATAGGC GCCTACAACC GTTAAAAATA AAATGATAAA CGGTGATGTC ATAGCCATAG 1020 

TTAGTGCTTT GAACGTACCA AACATTTCTG TCGCCATATA ATAGCCTAGT GCAATCATAG 1080

CAATACCTAA TACGCCTGAA ATAACCTCTG CAGTCGTTAC TTTAGCAGTT GTGGCATCAG 1140

TTTTAATTGA ATCTTTCATC ATTGATAAGA TACTACGTCT TTTTAGAAAT AAAGCACTTT 1200 

GAAATAAAAT CAGTACATAC GCAATAATTA GCATGAAAAT AGTTAAAACA AGGGCCATAG 1260 

GTTCGAAATG TATCGATAAG TTAATCGATA ACGACATCAA TTTAGATACT ATGGAAAGCA 1320 






ATAaTTGTGC ACCTGCAATG CCACATAATA CACCGACAAC ACCTGTGATT AAAAATACGA 1380 


TCATTtGTTC AAGTGCTAAC ATTTnCAAAA TGTTTTGTCG TGTTAAACCA ATCaACTGAA 1440 



ATAGCGCAAA TTCACGTGTA CGGCGTTTTA CGrmTAAATG ATTGGCATAC ATTAAAAAGA 1500 

TGACAATAAT GATAAATAAA AATATTGATC CGACTAAAGC ACCTTTCTTA ATGATGGCCA 1560

TCGAGTCGTC ATTATTTACA CCTTTAGTAA ACTGTAAGGT TGTAAAACTG AAATATAAGA 1620 

CGATGCTAAA AAATAATGAA AATAAATACA TTGCATAATG TTTTAAGTTT TGTCGTAAGT 1680 

TTTTGAAAAC GATATGGTTA AATGTCATTT GAGACACCAC CTAATACTGA TTGAAGATGT 1740

ACAATGTCTT CATAAAAGGC CTGTTTAGAA CGTCCTTCCT GATAAAGTTG TGTATGAATT 1800 
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ACCATGACAA 


TAGTTGTATC AAACGATTTA TTCATTTCTT 


CCAAACGTTG 


TAATAGGTCA 


1920 




TTTGCACTTT TCGAGTCGAG TGCGCCTGTT GGCTCATCTG CAAATATGAT TTGTGGTTTG 1980

TGAACAAATG CTCTCGCTGC TGCAGTTCTT TGTTGTTGAC CACCAGATAA TTCGCTAGGG 2040

TATTTATTTC CTAGGTCATA AATACCTAAT GCTGTCGTGA TCGCTTTATA ATTTTCTTCC 2100

ATTGTTGCCT TCGACATTTT TTGAACAGAT AAAGGTAACA TAATGTTTTC TTTAACGGTT 2160

AATGTCGGCA GAATACTGTA ATCTTGGAAG ATGAAACCTA ATGATTCTIT GCGGAATTTG 2220

GCAAGTGCTT TTTGATTAAG TTTATTAAGC TCTTGTCCGT TAGCAATCcg CTACCGCTAG 2280

AAATTTGGTC AATTGAACTT AGTACATTTA ATAAGGTTGT CTTACCTGAT CCAGAAGGCC 2340

CCATAATCGC AACGAATTCG CCTTTTTGTA TGTCAAAGTT AATATCTTTA AGTGCTTGAA 2400

ATGTGTGCTT TTTACCGTAT GTTTTTGAAA CATGTgCACT GATAATATCG TCATAGTCTC 2460

ACTCCTTtTG TATTTAATTT CATTTTAAAT AATGTTTGGA GTAGTAGCCT TTATCTAAAC 2520

TTACAATTCA ATGAATGAAC CTTACAGAGT TGAAArcTAT CGCTACTTAG TAGATTTTTG 2580

AGTGAGGATA CAGATTCATC GTACATATTA GACAAAAGCA ATGGTGCTTT CTAAGTGATG 2640

ATGTTTGTGT AAATTGAGAA AAGGGAATTT AATTATTGTA TAATAAATTT TTTGTAAAAA 2700

TTAAAAGAGG GTTTTATTTG AAAGGAATTG ATTGTTATGG AAAAAGGAAA TCAAGGTATT 2760

AAATGGTCTA GTTTAATAAT GGGTGTATTA TTATTAATGT TGGCAGTCGT TATTTTTACA 2820

TTTCCAATTG AAAATTTTTA TGCTATTACC TGGTTGATTG GACTGTTTGT ATTAATTAAC 2880

GGTGTGATTC AAATCGTTTA CCGTAGAAAA GCAAAAGCTT TAGTAGGTGG TAACCAAAAT 2940

TGGATTCTGT TTATGGGGAT TGTAGATATT CTATTTGGTC TATTAGTTAT TTTTAATGTT 3000

GGCGCAAGTT CAGCATTCTT TATTTATATG TTTGC1TTTT GGTTTATTTT TAGTTCTATC 3060

TCTGGATTAT TTACGTTTTC GGGTAGTGGT AGCTTAAAAC TAATTTCAGT GATTTTTAAT 3120

TTATTAGGTA TTGTTTTCGG TGTCATTTTA TTATTTAATC CATTAATGGG TATCGTCTTT 3180

ATTTCGACGA TGATTGCTAT TGCATTTGTA TTCGTAGGTG TCATTTATGT TGTAGATGCA 3240

CTTGCTTAAG TAAAATGAAG CGGTTCAAAA GAAGGGTGTG ACATGAAGTT TGTGTCATAT 3300

CCTTTTTGTT GTGTTTATGA AGCATAAAAA AGGGGCGCTA CCTACAATAA GTAAGATACG 3360

CCCATATTTT TATATTTTAC TATTATTGTT TTTCAATACG ATTAATAGTT ACATTTAGTC 3420

CAAAATATTT TTCTAAAAAA TGTTTATAGT TATCTTTAGT GACATCAAAT TTTTCTGAGC 3480

TACCATTCCT TGTTAAAGTT AAATGATTTT CAGACATTGT AGCACGGCCA AATGATTGTG 3540

GCATTGTAAT TAATAAATGC TGTACAAATA TTGAATCTGG ATGCXJTTTGA TTATATTCGA 3600

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AATGATCATT TTCGAATTTT TGAACATAGA AAATATCCTT GTCTTCGTTG TTAAAAATAG 3720 

CGCGGAATGT ACCACTGATA TCAGTAATTG GTTGTGTATG CTCAGATGAA GTAATAGGAA 3780

TGGCATGTAG AGGTAAGTCT CCAAAGCCAA CATCAGTTAC ATAGAATACA TCATTTATAG 3840

AAACAACAAG TGAAGCATGT GAACCGTTCA GACTACGACC GCCACCGGGw GTGTGAATAG 3900

TAGCTGACAT TAATTCAGGA TTAAATCCTT TTTGTTGTAA ATAGGCTTTG AAAAATGTAT 3960

TTAATTCATA ACAAAAACCA CCACGTTTAT CATGAACAAT TTTATTAAAA AGTGCATCGA 4020 

TATTTATAGA TATCGGCTTA CTATTTTGAA CATCAATATT TTCAAAAGGT ACAGTTAACA 4080 

TAAAACGTGT TGCATAATAA TTTAATGCTT CAATACTCGG TCGATTATAA CGAGATGAAT 4140 

CAATTTGTAA ATAATTCTCT AACTTCGCAA TATTCATAAG CATAGCGCCT CCTGTATTAA 4200 

AGATTATAAT TAAATTTTAA ACAGAAATAC TGAAATTTTA AATTCGAAAG CATTGAATTT 4260 

TGGATAAATA CATTTTAAAT AGAAAAATAC GCTCTCAAAA TGAAGTCATC TCTAAAAGAA 4320

ACGATTTAAA GATGACTACT GAGAGCGTAg CATAATGGAA GAAGTGTGCA GGGTGTCTAA 4380

AAATGCAACA ATACAAAGGT AGTTGCAAGA CAAGTTGCCT TATCTAGACC ATTTGTGTTC 4440

TATGCGACCA AACTTCCAAA TTAAACTTGA AATAAGCCAA GTAATTAAAA ATAATGCAAC 4500

TAAAATATAG CCTAAATAAT CAAATTCGAT CGAACCAATG AATGCCCAAA ACGCACCATG 4560 

TAAATCTAAC TTATCAGCAA GAATTTGTAG CAATTCAATC ATCCCAATCA CTAATGCTGC 4620 

CATGACTGAT ATCGCAGTAA TCGTTATATT GTAATAGATT TTGCGAATAG GATTGAAGAA 4680

TGCCCAATTA TAGGCATACT TCATTACAAC ACCATCTAAT GTATCCAATA AACTCATACC 4740 

TGATGCGAAT AAAATTGGTA AAGATAAGAT TCCGATAAAT GAAATGGCTT GTTGTGATGC 4800

GCCTGAAGAA AGAGCGAGTA ACGCAATTTC ACTAGCTGTA TCAAAACCAA GTCCAAATAA 4860

AAAGCCAAGT GGCAATACGT GCCAACTACG CGTGATTAAT TTGAAATAAG GTCCTACAAA 4920

TCGAGAAACC AATCCTCTAG ATTCAAGTAA TGCATCGACT TCAGCTTCTT CAATGTGTTC 4980

ACGACGTAAT TTAGCGAACA AGTTAATTAA AGAGATTAAA ATAATTAGAT TCAACACACC 5040 

GATAAGCACT AAAAAGAAAC CTGAAACTAG TGTACCAATC GTTCCACCAA TATCTTGGAA 5100 

ATGCGGTAAT TCATCTTTAG CCCATTTTAC AGATACCCCT AAAAAAACAG CCATTAAAAA 5160

TACGACAGAT GAATGTCCAA TTGAAAAATA GAAACCCACA CCAGATGGAT CTTTGCGTTG 5220

CTGTAATAAT TTGCGaACCG TATTATCTAT TGCAGCAATG TGATCTGCAT CAAATGCATG 5280

ACGCAAACCT AATGTATATG CAAGAATCCC CaTACCAAAT AAGATATGAT GGTCTTTTCC 5340

AGCAATCCAT AAAAAACTAA ACCCAATAAC GTGTAACAAA ATGACAATAG CTATGTATGG 5400 
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ATATTTAATC ATACTGTATG TTCAATGGGC . ACTCTAGTAA TAAGTGTTCA TATAACAAAA 5520 

ATGTTATGCC AAATTATTTG TTATATAAAA ATATACATGT AACCACAAAA GATTTTTTGC 5580 

GATATATATA ATTTGATAAA TTAACCAACA ACAATGTAAG ATGTCACTTT GCTTAACTTG 5640

GCATCCTTTT TATGATTTTC AAATTCAAAA AAATGAGCAA AATGAATCTC TTTAcCAGTT 5700 

TTTAATATTT CaATACCATG CATGGAACCT AAGCACCCAT GTGTGATGCT GGAATGGATA 5760 


TTGAGACTAG CAACCTGATT GTAATGATTA GATAGTTCTT GAATTAATAT TTGAGGTCCG 5820 

TATATGTCAA AGCGGCCAGG GACAGACCAA ATAAATTCTG TTGTAACGAG TGAACGTAAT 5880 

AATTCAATAT CTAATGCTGC TGTAACAACT ATAAAATCTA TCATTTGTTG ACGTTTAGGC 5940 


GCATGATTGC ATGACACATC TCCTGTTAAC TTAAAAGGTA ATGATGACTG AACTTCCGTT 6000 

TTAAAATGTA GTTGGTGCTG AAATAAAGCT TGTTC 6035 
(2) INFORMATION FOR SEQ ID NO: 273:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1039 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 273: 
TTTTGAACAG CCATATTTAT TCACCCTCAA CATCATTATA ATGGTATTAG TCGCATTACC 60 

TTCACTTGTT TTAGCTATAT ATGATTATAT GAGTTTTAGA ATTTCTTCTG CTATTTTACA 120 

ATTTCTAGGG GCTATCTCTT GGTTCTTTTT ATCATTGATA TTATCGCTCA CACAATTTAC 180 


ACCTTTTACA TTAGCGTCAT TTATAACTTC AATTATTTTG TTCACAAGCA CAATTATCAC 240 

ATTAGCCATT GGTGGTAAGT CTGTTGAAAA GAATGATTCC CCTTAAATTC CAAATGAAAA 300 

AAAGGTTCTG AAGGCCGCTA TAAAACACAG TTTTTCAGAA CCTCTATACT TCTATTCAAT 360 

GATATATGGT TTGCAATTTT CTACCTTTAA ATCCACAGCT TCTGCCCTTG AAACTTTGTT 420

AAAATAAACC ATCAAACAAC GAATGACAAC TTGATGTGCA ACAATGACAA TATCATCTTT 480

TTGTGTATCT TCATTGACAA CATGATTCAT AAAATGTTCT ACGCGTTGAT ATACATCTTC 540

ATAACTTTCT CCTTCAGGCG CTTTTTGTGA AAAACTATGA CGAAAGTCTT TAAAGTTTGG 600 

ATCATTGAAA TATTTTTCAT ATTTCGGATT CGCACTGATT TCATCTTTAT ATTCACCCTC 660 

AAATACGCCA AGTGAACGTT CTCTTAATAG AGGGGTAGTC GTTGATGCAA TGTCATATGG 720

AAAAATATGT TCAAACGTTT GCTGTGTTCT TAATAAGTCT GAAACATATA CATGTTTAAT 780
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CGGCACATCT AATTGTCCAC AAAAATAAGA TCGAAAATGT TTATTATCAT AATTCGATTT 900 

TGATTCGCCA TGTCTAACTA AATAAATCGT CATAATATTA CTCCTTACCT TATGTATTTC 960 

ATATCTACCA TAACACTTTG ACTACTAATT CGATATCAAT CTTAATATTC TATTCTAAAA 1020 

AAAGAATTAA TTCATATnT 1039 



(2) INFORMATION FOR SEQ ID NO: 274: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1496 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 274:


GAGAGAATnT GCAATTAGTT ATTCAATTAG TTGATTTAAG ACATGATCCA ACACAAGATG 60

ATATCTTAAT GTACAATTAT TTGAAACATT TTGATATTCC TACTTTAGTT ATATGCACTA 120

ArGAaGACAA AATTCCaAAA GGTAAGGTyC AAAAGCATAT TAAAAATATT AAGACACAAT 180

TAGATATGGA CCCAGACGAT ACAATTGTAA GTTATTCATC AATTCAAAAT AATAmaCAAC 240

AACAAATATG GAATTTAATT GAACCGTATA TTTCATAGTT TTTGTACGTC AAAACTTATA 300

CAAAAATTTT AAAAATAATG TAAGCACGAA ACTTTTAATT AGTACACAAT TGATAACATT 360

TTTCAACGTT CATCATTTTG TCAAAAACTC AAAAGTAAAT TAGAAAGATT ATAATTTATT 420

TAAGCATCGT ACTTAATTGG ATTTTAAATT ATGTTATAAT ATTTGTATTG TTAGTATATA 480

TGGGGGCTTT TCAAATGCAT TTTATTGCAA TTAGTATAAA TCATCGCACA GCTGATGTgC 540

ACTAAGAGAG CAAGTTACTT TTAGAGATGA TGCCTTACGA ATTGCCCATG AAGATTTATA 600

TGAAACTAAA TCTAyTTTAG AAAATGgTCA TATTaTCAAC ATGTAATCGA ACTGAAGTAT 660

ATGCTGTTGT TGATCAAATT CACACAGGTC GTTACTATAT TCAACGATTT CTAGCTCGTG 720

CATTTGGATT TGAAGTAGAT GATATTAAAG CAATGTCAGA AGTAAAAGTG GGGGACGAAG 780

CaGTAGAACA TTTATTGCGT GTCACTTCTG GTTTAGATTC AATCGTACTT GGAGAAACTC 840

AAATTTTAGG TCAAATAAGA GATGCATTTT TCTTAGCGCA AAGCACAGGT ACGACAGGrA 900

CAATTTTTAA TCATCTATTT AAACAGGCAA TTACTTTTGG AAAAAGAGCA CATAATGAAA 960

CAGATATAGC TGATAATGCT GTAAGTGTGT CTTATGCTGC GGTCGAGTTG GCGAAAAAAG 1020

TATTTGGCAA ATTGAAAAGT AAGCAAGCTA TCATTATTGG TGCAGGGGAA ATGAGTGAAT 1080

TATCACTATT AAATCTTCTT GGTTCTGGAA TTACTGaTAT TACAGTAGTA AATAGAACAA 1140

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TACCAAATTT 


ACTTGAAAGT 


GCAGATATTG TGATTAGTTC AACGAGTGCA 


CAATCTTATA 


1260 




TCATTACAAA TGAAATGATA GAAAGAATTG CAGAAAATAG AAAGCAAGAT TCACTAGTAT 1320

TGATTGATAT TGCAGTTCCT CGAGATATTG AACCTGGTAT TAGTGCCATC ACAAACATCT 1380

TTAATTATGA TGTTGATGAC TTAAAAGGTT TAGTTGATGC AAACTTACGT GAGCGACAAT 1440

TAGCGGCTGC AACAATTTCG GAACAAATTC CTACAGAAAT ACATGCACAC AATGAG 1496



(2) INFORMATION FOR SEQ ID NO: 275:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4826 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 275:








CTTGATTTTT TCCCtTTAGT ATTTTCCaTt TGanTGTCGC AGCTTCTAAA TCCTGCTTTG 60

GTTCTCTAGT GAACTTCATA ATTAAAGCAG CTACAACGAA TGATACAAGT GCAGCAAGGA 120

AGACACCGAG TAACATGTGC AAGAATTCAC CTCTAGGTGC ATTTAAACAG TAAACTATAA 180

ATGAACCTGG TGACGCGGGA CTTTTAAATC CAAATCCTGT TGCTTGATAA GTTGCAACAC 240

CAGTCATTCC ACCTAAAATA ACAGCGATAA ATAATAAAGG ACGCATTAAT ACATATGGGA 300

AATAAATTTC ATGAATACCA CCTAAGAAGT GGATAATTCC AGCACCATAT GACGTTGCTT 360

TTGCAGTGCC TTTTCCAAAA ATCATATAAG CAAGTAAGAT ACCTAAACCT GGTCCAGGGT 420

TAGATTCAAT TGTGTATAAA ATTGATTGAC CAGCTTTTGC AGCTTGATCT GCACCAAGCG 480

GTGTGAATAC ACCATGGTTA ATCGCATTGT TTAAAAATAC AATTTTTGCA GGCTCTACTA 540

AAATACTTAC AAGTGGAAGT AGGTGTGCAT GTACTAATGC TTCAACTGCC ACTGATAAAA 600

tatgcataat AAATTTCATA AGTGGTGCTA AAATTTTAAA TCCTGCAATC GTCATGATAA 660

ATCCTAAAAT ACCAGCAGAA AAGTTATTAA ATAACATTTC AAAACCTTGC GGCGTTCTAG 720

GTTGAATCAA TTGGTCGGTC TTCTTCATTA ACCAACCAAC AAGTGGACCC ATAATCATTG 780

CACCAAGTAA CATTGGTGTA TCAGGTAATG CAACGATGAC CCCCATAGTT GCTGTTGCTG 840

CGATGATACC ACCACGTAAA TCATAAATTA AACGACCACC ACTAAATGCG ATCAATAATG 900

GGATTAAATA AGTAATCATT GGTCCTGCTA AAGTAGCTAA ATCTTTGTTA GGTAACCATC 960

CATTATCTAT AAAAATGGCC GCGATAAAAC CCCAAGCGAT GAAAGCGCCA ATGTTTGGCA 1020

TGATCATACT ACTTAAGAAT GATCCAAATG CTTGAACACG ACGACCAATT CCTTTTTTCT 1080

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GAGAGGTTAC TTGTTACTCA ATATAAACAA AAATCAACTT TGTCAAAATA AATGTGACAA 1200 

AATTAAATAA AGTGTCATCA ATGTGACAGT ATAGATATTT TGAAAAAGTA AAACAAAAAA 1260 
ATTGTTTTAG GATTTTTAAA ATTTTATTGT GAAAATATTT GCAAAACAAA ACAACACCGT 1320

gTACAATAAT GATTAATGGA AAGGGGGAAA GTTCGGCAGT ACAGTTAAAG CGCCTGTGCA 1380

AATAAATATT TGTATTTGAA GATTAAAGGT TAATATATGA GTGGCCTTTA TAGAGTGCAA 1440

TATATGTATT TGTAGACGAG GAGGATAGTG ATCGAATAGA TCGGCGGATG CTATCCCGGA 1500 

TGTGGCTCAT TCGTTAGCTT ATTAAGTAAA ACATTAGGGT GACTTAATGG ACAAAGTTAA 1560 

TAAGATCGCC AGAAATTGAA TATAAAAAAT ATTAATATGG AAAGTACAGT GTGAGCAATT 1620 


TGTATAGTTG TAAAAATAAC TATGCTTAAT TTGTTATGGA TGAATGCGAT GATAGCATGT 1680 

TCCTATTTAT ATTATGAAAG CAGATTGTCA ATCTAAATTA TCGGCAATAA ATCATAATTT 1740 

20 ACGCGTACTA TTCCAATATG GAGGAAAATG TCGTTATGTG TGGAATTGTT GGTTATATTG 1800 

GCTATGaTAA TGCCAAAGAA TTATTATTAA AAGGTTTAGA AAAATTAGAA TACAGAGGTT 186 0 

ATGACTCTGC AGGTATCGCA GTAGTAAATG ATGATAATAC AACTGTATTT AAAGAAAAAG 1920 

25 GTCGTATTGC AGAATTACGT AAAGTTGCTG ATAGTAGCGA TTTTGATGGA CCTGTTGGAA 1980 

TCGGTCACAC ACGTTGGGCA ACACACGGTG TACCGAATCA TGAAAACTCT CATCCACATC 204 0 

AATCATCAAA TGGCCGTTTT ACTCTAGTTC ATAACGGTGT TATTGAAAAC TATGAAGAGT 2100 

TAAAAGGTGA ATACTTACAA GGTGTATCAT TCATTTCAGA AACAGATACA GAAGTTATCG 2160

TTCAATTAGT TGAATACTTT TCAAATCAAG GACTTTCAAC TGAAGAAGCA TTTACAAAAG 2220 

TTGTGTCATT ATTACATGGT TCATATGCAT TAGGTTTATT AGATGCTGAA GACAAAGACA 2280 

CAATCTATGT TGCTAAAAAT AAATCACCAT TATTATTAGG TGTTGGTGAA GGTTTCAATG 2340

TTATCGCATC AGACGCACTT GCAATGTTAC AAGTGACAAG CGAATATAAA GAAATCCATG 2400 

ACCATGAAAT CGTTATTGTT AAAAAAGATG AAGTTATTAT TAAAGATGCA GATGGAAACG 2460 

TTGTAGAACG TGATTCATAT ATTGCTGAAA TTGATGCATC AGATGCTGAA AAAGGTGTTT 2520 

ATGCACACTA CATGTTAAAA GAAATTCATG AACAACCAGC AGTAATGCGT CGTATTATTC 2580

AAGAATATCA AGATGCAGAA GGTAACTTGA AAATTGATCA AGACATCATC AATGATGTTA 2640

AAGAAGCAGA CCGCATTTAC GTTATTGCAG CAGGTACAAG CTACCATGCA GGTTTAGTAG 2700 

GTAAAGAATT TTTAGAAAAA TGGGCTGGCG TACCAACTGA AGTACACGTT GCATCAGAGT 2760 

TTGTCTACAA CATGCCATTA TTATCTGAAA AACCATTGTT CGTTTATATT TCTCAATCAG 2820

GTGAAACTGC AGATAGCCGC GCCGTATTAG TTGAAACTAA TAAATTAGGT CATAAATCAT 2880 
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TACACGCGGG TCCTGAAATC GCAGTTGCAT CTACAAAAGC ATATACTGCA CAAATTGCAG 3000 

TATTATCAAT CTTGTCTCAA ATCGTTGCAA AAGAGCaTGG TCGTGAAGCA GATATTGATT 3060 

TATTGAGAGA ATTAGCAAAA GTAACAACAG CAATAGAAGC AATTGTTGAC GATGCACCAA 3120

TTATGGAACA AATTGCTACA GATTTCTTAG AAACAACACG CAATGCATTC TTTATCGGAC 3180 

GTACTATTGA CTATAACGTA AGTTTAGAAG GTGCGTTAAA ACTTAAAGAA ATTTCTTACA 3240

TTCaAGCAGA AGGTTTTGCT GGTGGAGAAC TTAAACATGG TACAATTGCC TTAATCGAAG 3300 

AAGGTACACC AGTTGTAGGT TTAGCAACAC AAGAGAAAGT TAATTTATCA ATTCGTGGTA 3360 

ACGTTAAAGA GGTAGTAGCA CGTGGTGCAC ATCCATGTAT TATTTCTATG GAGGGTCTTG 3420 

AAAAAGAAGG CGACACTTAT GTCATTCCTC ATGTACATGA ATTGTTAACG CCATTAGTAT 3480

CAGTGGTTGC ATTACAATTA ATTTCATACT ATGCAGCATT ACACAGAGAT TTAGATGTTG 3540 

ATAAACCACG TAACCTTGCT AAATCAGTTA CTGTGGAATA ATTCACTTTT TTAGAATCAA 3600

TCATGTATTA AAATTAAAGT ATATGGCACC CTTTTAGATT AATCGACTAG AAGGGTGCTT 3660 

TTTTAGGTCG ACTTaGCTTT TACTTCATCT TAATTTGGCA GAAATGCGTa AAAATGAAGT 3720 

GTTTTATTTA TTTAAATAGT CTGACAATTA AGGGTGTTAT GTTAATATGA TTTTATGAGA 3780

AGTATGGAGT AGCAATAAAG GGGTGACCTC GCATGTTAAT TCAATTAGAT CAAATTGGGC 3840 

GAATGAAGCA AGGAAAAACA ATTTTAAAAA AGATTTCTTG GCAAATTGCT AAAGGTGATA 3900

AATGGATATT ATATGGGTTG AATGGTGCTG GCAAGACAAC ACTTCTAAAT ATTTTAAATG 3960

CGTATGAGCC TGCAACATCT GGAACTGTTA ACCTTTTCGG TAAAATGCCA GGCAAGGTAG 4020

GGTATTCTGC AGAGACTGTA CGACAACATA TAGGTTTTGT ATCTCATAGT TTACTGGAAA 4080 

AGTTTCAAGA GGGTGAAAGA GTAATCGATG TGGTGATAAG CGGTGCCTTT AAATCAATTG 4140 

GTGTCTATCA AGATATTGAT GATGAGATAC GTAATGAAGC ACATCAATTA CTTAAATTAG 4200 

TTGGAATGTC TGCTAAAGCG CAACAATATA TTGGTTATTT ATCTACCGGT GAAAAACAAC 4260 

GAGTGATGAT TGCACGAGCT TTAATGGGGC AACCCCAGGT TTTAATTTTA GATGAGCCAG 4320 

CAGCTGGTTT AGACTTTATT GCACGAGAAT CGTTGTTAAG TATACTTGAC TCATTGTCAG 4380 

ATTCATATCC AACGCTTGCG ATGATTTATG TGACGCACTT TATTGAAGAA ATAACTGCTA 4440

ACTTTTCCAA AATTTTACTG CTAAAAGATG GCCAAAGTAT TCAACAAGGC GCTGTAGAAG 4500 

ACATATTAAC TTCTGAAAAC ATGTCACGAT TTTTCCAGAA AAATGTAGCA GTTCAAAGAT 4560 

GGAATAATCG ATTTTCTATG GCAATGTTAG AGTAAATATT TTGCAAATAA TAAGTAATAA 4620

TGACAAAATT TAATTAAGAT AAAATGGACA GTGGAGGGCA ATATGGATAA CGTAAAAGCA 4680 
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AAAGATGTCA TTAATCAATT GAGAGAGAAA GGATATAAAG TATTTTTGGC AACAGGACGT 4 800 

TCGCATTCTG AAAATACATC AACTTG 4826

(2) INFORMATION FOR SEQ ID NO: 276:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4846 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 276: 

GAATAAAAAG TAAAATTACT CGCCTTTGTT ACCTTTTACT TTATCAATaA AATCAGTTGC 60 

TTTTTCTTTT GCATTTTCAA CGAATTCTTT CGCTTTACCA GAAGCTTTAT CTTCTTTACC 120 

TTOGTTTTCT AAATTTTTAT TATCAGTAAC ATTACCTACT GTTTCTTTAA CATTACCTTT 180

TGCTTGTTCA AATTTACTTT CGTCTGCCAT AATAAATGCC TCCTCGGAAT AATTAAATGT 240 

TATATATAAT ACTTACCCAC TGAAAAATTA TCTAAACATT TTACTTAAAT AATTTTTGAT 300

ATTGATTTGA CGTCATTTTA TAACTAGCGA AATAGATTCA TCATTAACTT GAGGGAGTGG 360

GACTGAAATA ATAAAGAATC ACTAATGATT TATGATGTAT TAGTCACTAG CCATGTGAAA 420 

TTAAAAATAA GAATAAATGA GTAGCACGCA TGCATATAGG ATTTTACTTT ATCCGTAATA 480

GCATCTCATT CCTAAATATC ATATAAATAC CTGTTTAAAT TAAAAAGCCC AGCAACATCA 540 

CGTTACTGAG CCATTAATAT GATTTATTTA GCAGGAATAA CTGCACCATT GTATTTTTCA 600 

TTAATGAAGT CTTGAATATC TTTAGATTGT AATACTTCAA TTAATGCTTT GATTTTCTTA 660 

TCATCTTGAT GTCCTTCTTT AACAGCAATT AAGTTTGCAT AAGGATTATC TTTCGCACTT 720 

TCTACAGCAA TAGAATCTTT TTTAGGATTT AGTTTTTGTT CGATTGCAAA GTTCGAATTA 780 

ATGATAACAG CATCAGCGTC TTCATTTTGA TAAATTTTAG GTAAGAATTC TGCTGATTGT 840 

TTATTATTAA ACTTAATATC TTTTTTATTC TCTGTAATAT CACTAAACTT AGCATCTTCA 900 

ATTTTTACGC CTTTTTTGAT TTTAATTAAA CCTGCATCAA CGAAGAATTT TAAGAAACGT 960

CCTTGTTCAG CTGGATTATT AGACACATAG ACTGTTGCAC CTTTTGGTAA TTCTTTTAAA 1020

CTTTTATACT TTTTAGAGTA TACAGCCATA GGTTCTAAGT GAACATCACC GGCACTTACG 1080 

ATTTTGTAAC CTTTATCCTT TTTCTCTGTG TTTAAATATG GTGTATGTTG GAAATAGTTT 1140

GCGTCAATTT CACCTTTGTC TAGTAATTTA TTAGGTGTAG TGTAATCGTT AATTGTTTTA 1200

ATATCTAGTT CATAACCTTT TTTCTCTAAT AATGGTTTTG CTTTTTCTAA AATTTCAGCA 1260 
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TTACCGCAAG CTGCTAATAC AACTGCAAAT GTTAATACTA AAATAAGACC AAATAATTTT 1380 

TTCATAAAAT GAAACCCCCA ATTTATCGTT TATCAAGTTT ATTTGTAAGC CAATCCCCAA 1440 

TGAATTGGAT TATAAATACA ATAATTAAAA TAAAAACTGT TGATACTAAA ATGACATCAT 1500

TTTGATTTCG AGTGAAACCT GTTAAGTATG CTAAATTTCC TAAACCACCG GCACCAATTA 1560 

CACCTGCAAC TGCTGTTGAA CCAACTAAAG CGATTGCTGT AACTGTAATG CCAGACACTA 1620 

GCGCTGGCAT AGCTTCAGGT AAAAGGACTT TACGAATTAC TGTCCAAGTA TTAGCGCCCA 1680

TTGACCAAGC CGCTTCGATG ACACCTTTAT CAATTTCTTT AAAAGCAATT TCTACGAGCC 1740

TTGCATAAAA CGGTGctGCG CCAATGATCA AGGCTGGTAA CGCACCTGTC GGACCACTTA 1800 

TCGTTCCAAG TATCAAACTT GTAAATGGAA TTAATAATAA AATTAAAATA ATAAATGGTA 1860 

TCGCTCTAAA TAAGTTAACA ATGAAAGAAA CGATAGAATA AAATAACCTT GCACCGATAG 1920 

ACTTACCTTT AGCAGACAAG AATAATAACA CACCTAAAAT AAGACCAAGT ATAAATGCAA 1980

ATATAGTTGA GACGACTGTC ATGTATAGTG TTTCGACTAT TGCAGTCCAA ACTTCTGGCC 2040 

ACTGAATATT AGGCATTGTA ATCATTTCAT TTATAATTTC ACTAAATGAT TTACCCATGT 2100 

CTTAACACCT CCATTTTAAC TTGTCGCTCA ATTAACTCTT TTTCGAATTT TCCGAAATCT 2160

ACACTTGAAA TATATGGAAT ATGCAGAACT AAAAAGCCGA CTGTTCCATT TTTTGTATTT 2220 

TTAATATTTG CTTCTAAAAT ATTAATTTTA ATATCATAGG CAGTTGATAG ACTCGATACA 2280

ATAGGCTCGG TTGTTGTTGA ACCAGCGAAA ACTAATCTAA GGATATATGC ATCTTTTTCT 2340

AATGGCTCTA ATTCTGTTAA AGATGTTTCG AAATCATCAT TTAAATCGTC TTTCACAAAT 2400

CGTTTTGTCA CAGTGTGTTG CGGATTTTCA AAAACCTGTG TCACCGGTCC TTGTTCTATC 2460 

ACTTTACCAC TTTCCATAAC TGCAACTTCA TCACAAATAC GACGAATGAC ATGCATTTCA 2520 

TGCGTAATTA GTACAATTGT TAAATTTTGT TGTTCTCTAA TTTTTAGTAG TAGATCTAAA 2580

ATTTfcATCTG TTGTTTGCGG ATCAAGTGCA CTTGTTGCCT CATCACAAAG CAAGACCGTT 2640 

GGATCATTaG TAACGCTCGT GCAATCCCAA CACGTTGCTT TTGTCCACCT GATAACTCTG 2700 

ATGGATAAGC CTTTTCTCTA CCTTTTAAAC CGACGAGTTC GACAAGTTCT AATGCTTTTT 2760 

GCTTAGCTCT CCTTCTAGGG ACACCTGCAA TTTCAAGCGG AAACATAATA TTTTTTAACA 2820 

CAGTCCTTGA CCATAACAAA TTAAAATGTT GGAAGATCAT ACTTACTTTT TGTCTTTTTG 2880

CTCTTAATCC ATTTTTGGAC AATTGACCTA TATGGTCTCC ATCTATAATA ACTTCACCTG 2940 

ATGTAGGCGC TTCTAAATGA TTAAACATTC GAATCAAAGT ACTTTTTCCT GCTCCAGAAA 3000

AACCAATGAC GCCATAAATC GATCCTGCTC GAATCGATAA ATTAACGTGA TCTACAGCAA 3060 
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TTCCTCCCTG TGTTGCTTAA TAAAATAAAA AATGCTTTCT CAATATCGAT AGAAAAATTG 3XB0 

AGAAAGCAAT AGTAGTATTG TTTCTCTCAT CTTCAAAAGT TAAAACTTTA TGTGAATTGG 3240 

CACCATTTCT ATATAAGACG GTTGCCGGGC TTCGTAGGGC ACATCCCTCC ACCACTCTCG 3300 

ATAAGAGTTT ACGCATCATT TAATTTGTAT TAATCCTAAC ACCTTAGTAA AATTTCGTCA 3360

ATAACTATTT TAAATTTTCT AACAAATCAG TCACCGATTT AAATGCATAA ATTCGTTTTA 3420 

CTTCTTTATC TTTATTCATC AACAATAAAA TCGGCGTAGA CATGATTTGC ATATCTTTAC 3480

AAAACTGAGG ATAAAAGTTT AAATCTATTT TCAATAATGG TAACTGCAAT ATTTCATTAG 3540

CAATGTCTAA CATTCTTTCT GaAACCTTAC AAGTACCACA CGTTGGTGTA TAACCAAAGA 3600 

TTAAATGTTT GTCTTCCTCA TAAAATGTAG TTACATCTTT GATGTCTAAT GAATTATTCA 3660 

TTTACTAAAA CTAACCTTTC ATTATTTATA TTCGGTAAAA GAGGTGTTTC TTTCTTACAA 3720 

GTAAAGCCAT GTTTTGAAAG TACATGCGCC AAATATTGTT TGGGGCAATT CGCAACTTGA 3780

CAGTAAGTTT TATCAATAAA TATATGTTCA CTTTCACTCA AATAACGTTT AAACCAATTT 3840 

CTAATTCGAT CTCCTTCGTC ATCAGAATCG GCTAATACAA AAACTTGTTT ATCATACAGT 3900 

GATTCTATCA TATCATCAAG CTTATCTATA CTCATTGTTC CATGAGTACA AATAATATTG 3960

ACTGGTTCTG CAATAACCTG TTGCACCCTT TTTTTATCAG ATTTTCCTTC AACAATTATC 4020

ACTTTATTTA CAATAGCCAT CATCATCACC CTTTAAAATC AATAAACATC TGTCACTGTA 4080

TCATTTCACA AAATTGGTAT GAATAAAACA TAAATCACAA AAAATTTAAA CTAGCTTAAT 4140

ATAATAATTA CAAACTCAAT GTTTGACTAG CTGGAACATT TAACATAAGC AGACAAAGGC 4200 

TAAGTCAAAA ATCAACATCC TAAAATCTAC AATGTTATAT TAACAATAGT TAACCAAAAG 4260

AAAATACACC TATAACAAAC TTTTCAATTA TAGCGGGGCC CCAACACAGA AGCTGATGGT 4320 

AAGTGAGCTT ACAATAATGT GCAAGTTGGC GGGGCCCCAA CATAAAGAAA TACTTTTTCT 4380 

TTAGAAATTA GTATTTCTTA TGCATGAGTT TTACTCATGT ATTCCTATTT TTAAATACAC 4440

ATTAGCTGTG GCTTATGAAA ACAGGCTGGG ACATAAATCA ATGTTCTATG CTCTACGAAg 4500 

TTATATTGGC AGTAGTTGAC TGAACGAAAA TGCGCTTGTA ACAAGCTTTT TTCAATTCTA 4560

GTCAGGGGCC CCAACACAGA GAATTTCGAA AAGAAATTCT ACAGGCAATG CAAGTTGGGG 4620

ATGGGCCCCA ACAAAGAGAA ATTGGATTCC CAATTTCTAC AGACAATGCA AGTTGGGGTG 4680 

GGACGACGAA ATAAATTTTG CGAAAATATC ATTTATGTCC CACTCCCTAG ATTGATCTAT 4740

AGATACTACA CTTATTAAAG TAATATATTT TTATGATTCT CTTAGCTGCA ATCCCATGAA 4800

TACATGTAAT CATCAAACTT CATAGCCTCA AGGTCAGTAG ATTTCA 4846
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1843 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 277: 

AACAAAGACA CAATCGAACA TGAACCATCA GTAAAAGCTG AAGATATATC AAAAAAGGAG 60 

GATACACCAA AAGAAGTAGC TGATGTTGCT GAAGTTCAGC CGAAATCGTC AGTCACTCAT 120 

AACGCAGAGA CACCTAAGGT TAGAAAAGCT CGTTCTGTTG ATGAAGGCTC TTTTGATATT 180

ACAAGAGATT CTAAAAATGT AGTTGAATCT ACCCCAATTA CAATTCAAGG TAAAGAACAT 240 

TTTGAAGGTT ACGGAAGTGT TGATATACAA AAAAAACCAA CAGATTTAGG GGTATCAGAG 300 

GTAACCAGGT TTAATGTTGG TAATGAAAGT AATGGTTTGA TAGGAGCTTT ACAATTAAAA 360 

AATAAAATAG ATTTTAGTAA GGATTTCAAT TTTAAAGTTA GAGTGGCAAA TAACCATCAA 420 

TCAAATACCA CAGGTGCTGA TGGTTGGGGG TTCTTATTTA GTAAAGGAAA TGCAGAAGAA 480 

TATTTAACTA ATGGTGGAAT CCTTGGGGAT AAAGGTCTGG TAAATTCAGG CGGATTTAAA 540

ATTGATACTG GATACATTTA TACAAGTTCC ATGGACAAAA CTGAAAAGCA AGCTGGACAA 600 

GGTTATAGAG GATACGGAGC TTTTGTGAAA AATGACAGTT CTGGTAATTC ACAAATGGTT 660 

GGAGAAAATA TTGATAAATC AAAAACTAAT TTTTTAAACT ATGCGGACAA TTCAACTAAT 720 

ACATCAGATG GAAAGTTTCA TGGGCAACGT TTAAATGATG TCATCTTAAC TTATGTTGCT 780 

TCAACTGGTA AAATGAGAGC AGAATATGCT GGTAAAACTT GGGAGACTTC AATAACAGAT 840

TTAGGTTTAT CTAAAAATCA GGCATATAAT TTCTTAATTA CATCTAGTCA AAGATGGGGC 900 

CTTAATCAAG GGATAAATGC AAATGGCTGG ATGAGAACTG ACTTGAAAGG TTCAGAGTTT 960 

ACTTTTACAC CAGAAGCGCC AAAAACAATA ACAGAATTAG AAAAAAAAGT TGAAGAGATT 1020

CCATTCAAGA AAGAACGTAA ATTTAATCCG GATTTAGCAC CAGGGACAGA AAAAGTAACA 1080 

AGAGAAGGAC AAAAAGGTGA GAAGACAATA ACGACrCCAA CACTAAAAAA TCCATTAACT 1140 

GGAGwAATTA TTAGTAAAGG TGAAyCgAAA GAAGAAATCA CAAAAGATCC GATTAATGAA 1200 

TTAACAGAAT ACGGACCAGA AACGATAACA CCAGGTCATC GAGACGAATT TGATCCGAAG 1260 

TTACCAACAG GAGAGAAAGA GGAAGTTCCA GGTAAACCAG GAATTAAGAA TCCAGAAACA 1320

GGAGAyGTAG TTAGACCACC GGTCGATAGC GTAACAAAAT ATGGACCTGT AAAAGGAGAC 1380

TCGATTGTAG AAAAAGAAGA rATTCCATTC rAGAAAGAAC GTAAATTTAA TCCTGATTTA 1440
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CCAACACTAA AAAATCCATT AACTGGAGAA ATT ATT AG T A AAGGTGAATC 


GAAAGAAGAA 1560

ATCACAAAAG ATCCGATTAA TGAATTAACA GAATACGGAC CAGAAACGAT AACACCAGGT 1620

CATCGAGACG AATTTGATCC GAAGTTACCA ACAGGAGAGA AAGAGGAAGT TCCAGGTAAA 1680

CCAGGAATTa AGAATCCAGA AACAGGAGAT GTAGTTAGAC CACCGGTCGA TaGCGTAACA 1740

AAATATGGgA CCTGTaAAAG GAGACTCgAT TgTaGGAAAA AGarGAATTc CaTTCaGGAA 1800

AGAcGTaATT TaTCCTGTTT AGCACCCGGG GCAGAAAAGT TAC 1843


(2) INFORMATION FOR SEQ ID NO: 278:








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 8536 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 278:






TAAACAGCGC GTGTACTTGT GATTCCCCCT TCTTCTATTT TACCCACCCG GGAAATAATA 60

CTTTTCGCGA TTCCTTACTT GaACAAGCAA TATTTTATCa GCTGTTTCTT CAACTAAACA 120

GACACATTTA ATCATCTTTG ACACCCCAAC TTTGTGAAAT CAATTTTTCA AATTATACTG 180

TACAATTATG TTATCATATA TGAGTAGTTA TAGCGCAAAA CGTTAGCAAT TCAGCGCACC 240

CAACTTTTCA TATAAACAGA AGATACTAGG GGGAATTATT ATTATGGCAA AACgTTCCAA 300

ATCACAACGT TTATCAAGTT TACTAAATGT CGCAGGTTTC ATAGTCGACG GCTACAATGG 360

CTATAAATAT CATGCTAAAA ATAAAAAATT AGTATATCTT TCATTAGGTT TAAGCACTGT 420

AGGAACCGTG TTAGACTTTT ACATTTCAAT TAAGTCACCA CGTAAGTTCA AAAAAGCAGT 480

GGCAGTTGTT ACTTTAATAA CAAACGGTGC TAGATTATTT ACAAGCATTC GCAAAGTAAA 540

ACATGAATAC TAATTCAGAA AAGGATTGGT CGAACATAGA ACATGAAGTT CATTCGACCA 600

ATCCTTTTTC ATATACAAAA ATTCCTATTA CAACAATCAC GACTTTGATA GCCCCGCCAT 660

AAAATTTAGG ATTCAATCCA ACTTTTCAGC TTGTGaAATG TAATAGGaAT TCATTATATA 720

TTTATATACG TAAGACTTTA GTGAATATAT CTATAATTAT TTACTTGGTA AGCTGGTACC 780

GTTCTGTAAG TTAAAATACC TGGTGCAGCT GAATAGTTCA TTTCTGAAAC TAAAATACTA 840

CCATCGTTAT TTACACGTTC TACAAACATA ACATGACCAT AGTAACCTAC ATCTGTTTGA 900

GCGATAGAAC CTACAGTAGG TCTATTGTCG ATAGTGTAAC CATCTGCTGC CGCTGCGTTA 960

TCCCAGTTAT TAGCATTCCA CCAATAAGTA CTAATACCTT TACCAATTTC AGCACGACGA 1020
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GTATTGTAAC CTCTATTTGT CGTTGTTGCA GATCCTGAGT TCGTAGATGC aTTACCAGTT 
ACTTTCAATT TTTGACCCGG ATATATAAAG AAATTATTTA AGCCATTTAA GCTCATAATT 
TTTTGATAAG TTGTACCATA TTTTGATGCA ATTAATGACA ATGAGTCACC TGCTTGTACT 
GTATAGTATG ATCCGCCACC TGAGTTCGTT GATGGACGGC TACTATTGCT CGCAGCGTTA 
CTTGAGCTAG CAGTACCTGA TACTTTTAAT TTTTGACCTG GATAAATAAA GAAATTATTT 
AAACCATTAA GTCGCATAAT GTTTTGGTAA GTTGTACCAT ATTTTGATGC GATTAATGAT 
AATGAGTCGC CTGCTTGTAC TGTGTAGTAT GATCCGCCAC CTGAGTTCGT TGATGGACGG 
CTACTATTAC TCGTAGAATT ACTTGAGCCA GATACTTTTA GTACTTGGTT TGGGAAAATT 
AGATTAGATG TTAAATTGTT TAATGACTTT AATTTAGCAA TCGAAATCCC ATACTTATTT 
GAAATTGCCC ACACTGATTC ACCCGGTTTT ACTGTGTGAG TTGTAGCCGC ATTTGCTTGA 
GTTGCCGCAA CAGCGCTAAT CGCGCTTGTC CCAATAATAG CTGCAATTAC TTTTTTTTGC 
ACTTTAAAAT CCTCCTCTTG CTTAACTTTC CTAACATTCT TTTATCCGAA TTTATGAATA 
CTACATCATT ATACGATTTT ATTATGTATA ATAGGTTGAT GTTTGATGAC ATTATGaTTA 
CAAAAAAATC ATATACTGTA TCATCAAATT TTATAATTAT CCCTTAAAAT TATTACAACT 
TATTAGATTT TACAATATCT AAATTATTAC AATTTCATAA TATTTCACTA TAAAATGATT 
ACAATGCCTT TCTCTATTGG AAATAATTTT ATTCTCCAAC AATAACGCCC TACAAACATA
AGCATGAACT TTGCTTGTAG GACGCAATAT AATTTATTTT GCTAATGACG TTTCTATTGC 
CTTAATCTCA TCTTTAGATA AATTAACAGG TTTCTCTCCA TCTTTGACAT CTTCCTGCAA 
CGCTTTTTGA GCTTCTTTTG AATGATACAA TTCTACGATT TTAGCATATG TTTTGTTATC
CAAGTCTTTG TCATTAACTG CAACAATATT AATATATGGC TTTACTGCAT CTGAATTTGA 
TTTTTCTAAA AATATCGGAT CATTTTTAGG ATCTTTACCC GCTTTAGTTG CTACACCGTT 
ATTAATAACT GCAATATCGA CATCAGATAA AGCACGTGCA GTTTGTTGTG CATCTACTGC 
AGTAATTTTT AAATGTTTTG GATTTGACGT TATATCTTTC ACCGTGCCTG CTAATCCGAA 
ATCTTTTTTC AGTTTTATTA AACCAGCTGC TTCTAATAGT TTAAGTGCAC GTGCTTGGTT 
TGACACATCA TTTGGAATGA CAACTTTAGC ACCATCTTTA ACCTTTTTGA CATCTTTAAT 
TTTATCTGAG TAAATGCCCA ACGGTGCTAA AACTGTTGTA CTTAATGCTG AAATCTTTGT 
TCCTTTATGC GCCTTTTTAT ATTGATCTAA AAATGCAAAA TGTTGGAATG CATTCATATC 
AATATCACCA TCATTTAATG CTTTATTCGG TAAATTGTAA TCTGAGAAGT GCTTAATCTC 
CACATCAATA TCATCTTTTT TAGCTAATTC TTTAACCTTC TCCCAAGCCT TAGTGTCATT 



AAGTGCTACG ATAACTAACC CAATCAATCT
CACTATATAT CATTAATGTC TACGTATGAA
CACTTGGACA ATAATGACTA ATACAATAAC
TTGATAACCA TACACTAAAG CTAAGTCTCC
CGCCGTACTT CCAATAAGTC CAATAATCGC
AGCTTCAGGA ATTAAAAAAT ATCTAATGAT
CGCTTCAATA ATCCCCTCGT CTACTTCCAA
AGGTGCCACA TATACTGTTA AAGGCACGAT
TACTAATTTT GTGAATGGCA CAATCGCAAT
AATATTGATT AAAGGATTTA AAACTTGATG
TCGAGTAATT ACCAATAAGA CACCTAATGG
AATAGATACC ATATATAGCG TTTCGTACAA
AGAACCAAAC ATATGTTAAT GcACCTCCTC
ATTTATTGCC GTGTCTTCAA ATTGTTGATC
GGTGTTACCT TGTATTTCTG ACATAGATGA
TTGAATCAAG TCATTTATAA TCGGTTGTGT
ATCTTTAAAA TCGCCAACTT GTTCGTCATT
AGTCTGTATA ACTGTAGACA CAAAATTTTG
CTCTTTAAcA GTTCCTGTTT CAACCACTTT
AATGTCTTTA ATAACGCGCA TTTCATGTGT
ATTGfiCATTC TTTAATAACG TCAATATCGA
TGCTTCATCG CATAGGAGTA TTTTCGGATT
TTGCTTCTGC CCACCAGATA ATTCATCAGG
AAATTCAAGC ATTTCCGTTA CTCGTTGCTT
GAGTGGCATT GCTACATTTT TAAATACGGT
TATCATACCG ATATCTTTCT TAATATCCCT
ATGTCCATCT ACAATCACTT GTCCATTCGA
CAACGTACTT TTTCCTGCAC CACTATATCC
CGTAAATGAT ACGTCCTTCA AAGCATCTAT
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TTTCATTCTA TCAATTCGTT TCAAAATCTT 2940 

TCTAGCTAGA ACATTCCCTA GCGTTTGAAT 3000 

GGTAATAATA ATGACCGTCG TATCAAATCT 3060

TATACCACCA CCGCCAACAG CTCCTGCCAT 3120 

AGTGGTAATT GCTAATACTA ACGAACCTAA 3180

TTGTAGTGGT GAAGCGCCCA TCGnTTTCGC 3240 

TAATGAGTTT TCAACAAGTC TTGCAATGTA 3300 

GGCAGCAGTC GTACCAATTG AAGTACCTAC 3360 

TAACAAAATA ATAAATGGTA GTGACCTTAA 3420 

TATCACTATA TTGGGCCATA TGCCTTGTTT 3480 

AATACCAATC ACTGCTCCTA AAAATAAAGC 3540 

TGCTTGTAAT AACTGTGCAC TGTCTAAATC 3600 

AAATTGAATA TTTTTCTCTT TGAAATATTG 3660 

CATATTAAAT CGAAGCCACA TATAACATAC 3720 

AAATAAAATT TTAACCTCTC TGCCACAAAT 3780 

CACCTGAGTT TCCTCGACGA AGATTTTATA 3840 

CAATCGACGA ATCAATGATG TACTTGGCTC 3900

AGCAATCGTC GTTTTAGGAT GACTAAACAC 3960

CCCCTTTTCC ATTACAGCAA CACGATTACA 4020 

AATCATCATA ATTGTAATGC CAAAGGTTTG 4080 

AGCAGTCGTT GCTGGATCCA ATGCGCTTGT 4140

AGTAACAAGC GCTCTTGCAA TAGCCACCCT 4200

AAATTGGTCT TTTTTATCAC TCAATCCTAC 4260 

AATTTCTGTT TTGCTTTTCT TACTTAAAAT 4320 

AGCTGAATTT AATAAATTGA AATGCTGAAA 4380 

CATCATTTTA TCGCTATAAT TCGTAATATC 4440 

GGCAGCTTCA AGATGATTCA CGAGTCTTAC 4500 

AATCACACCA AAAATATCAT TGCGATTGAC 4560 

TTTTTGCTTC TTTTTATTAA AGGTCTTACT 4620 
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TAAGAGCATT ATATGTAAAA TTGCATATAT CGTcAATACA 
ATTTGCCGAA TTTTCTAAAA 4740

AATTAAAAAA TAAGTAATTC ATGTGACAAT GACGAATTGT GAGACTACTA TGACATTTAT 4800

CAAATTAAAT CCATAAAAAT GTCCACCAAT CCTCCACAAC GCAATTACTA AATATTAACA 4860

TCGCACAAAA AAGCACTAGC ATATTCAAGA ACAACAAACG TTGAACTCAA AATATATGCC 4920

AGTGCTGCTA TTATTTATAA AGTATCTAGT GCTTGTTTTA AATCATCGAC TAAATCTTCA 4980

GTATCTTCAA TACCTACAGA AATTCTTACA AGTCCGTCTG TAATACCTTC TTTAGCTCGA 5040

ATATCTGCTG GAATGGATGC ATGTGTCATC AATGCAGGTA CTGAAATTAA ACTTTCCACT 5100

GCACCTAAAC TTTCAGCTAA TGTGTAATAC GATGTTGCTT TAATCAATTG TTTGGCACTT 5160

TCTGTATTTT TCACTTCAAA TGCAATCACA CCTGTATGGC CATCCGCTTG AGCCATATGG 5220

ACATCATGAT TTAAATGACT TTCAATACTT GGATGGAACA CTTGTTGCAC AGCTGGATGT 5280

GCTTGTAACA TTTTAATAAT TTCAATAACG CTGCGATTAA TTTGTTCCAT ACGTAAACCT 5340

AATGTTTTAA TACCCCTCAC AAGTAAATAG CTATCTTGAG GTCCTAAAAT GCCACCTGTT 5400

GAATTTGAAA TAAATGCTAA ACGTTCTGCA AGCTTGTCAT CCGATGTTGC AACTAAACCA 5460

GCAACGACAT CACTATGTCC ACCTAAATAT TTCGTTGCAG AATGTAAGAC AATATCGATA 5520

CCTAAATCTA ATGGATTCTG ATAATAAGGT GTCATAAATG TGTTATCAAC AACTGAAATC 5580

AAACCGTGTT CTTTGGCAAT TTCAGCAGAC TTTTTAATGT CAGTAACACG TAATAATGGA 5640

TTAGAAGGTG TTTCAATAAA CAACATCTTT GTTGTTGGGC GTATCGCTTG TACAATTGAA 5700

TCTGTATGCG TTGTATCTAC AAAATCCACT TCAATGCCAA ATCGTGTAAA TACTTTTGTC 5760

AATGCGCGAT AAGTACCGCC GTATACATCT GAATTTAAAA TAATATGATC TCCTTTGTCC 5820

AACAGCATAA CAACTGCACT GATTGCTGCA ACACCTGAAC TAAATGCAAA GCCATGTTTG 5880

CCATTTTCTA ATGTCGCAAT AACGCTTTCT ACAGAACTTC TTGTTGGATT CGCAGTACGA 5940

GAATATTCAT ATCCTTGACG TAAATCACCA ATATCATCTT GTAAATATGT ACTTGTTTGA 6000

TAAATTGGTG TTGTAACGGC ACCTGTATAA TCGTCTGTTG TGTGCCCACC ATGAATTAAT 6060

TTAGTTTTCT TGTTCATTAT TATTCTCCTC ATAATTAAAT ATTTGCTTAG ACATATATCG 6120

ATCACTACCA TCTGGAAATA CGACAACAAT CGTACCTTCA GATAATTGCG CTTTTAAATT 6180

CAATGCACCT TGTAATGCTG CACCTGaAGA ACTGCCTACT AACAACCGTT CATTTATAGC 6240

CAAACTTTTG ACATTTCGAA AGGCATCTTG ATCTTTAATC GTAAATATGC CATCTACAAG 6300

ACGTCTCTCT AAAAATATCG GCCATTTCTC AGAACCGATA CCTTCAGTGT CATGTGCATG 6360

AGCTGGCCCT CCATTTAACA CGGACCCTTC TGGCTCAACG GCATAACATT GCACGTGATG 6420
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ATAATCAATT TGCTGTAATG 


CTGAAGTCAA TTCGGGTCCC AATGTATGAA AATATGTATC 6540

CGGATTATGT TCGGATTCAA ATTGATTCAT ATAAACGGCA CCATATTTTT CAGCATAGGA 6600

ACGTGCAGCT AATTGTGCCC CATGCATACC TTCAGACTGA CTCGTCCTTG AAACTTCTGC 6660

ACCAAGCGCT ATCATAATAT TAATCTTTTC TTCTGAAAAA CCATACGGCG CAAAGATCTT 6720

ACATTTCAAA TGATGTCTAT TCGCTGCAAT AGCTAACCCT ATGCCTGTAT TACCAGCAGT 6780

CGCTTCAACA ATAGTTTGAC CTGCACGCAC ACGCCCTTCT TGAATTGCCT TCTCTACTAA 6840

ATATTTCCCG AGTCTGTCTT TAACACTGCC TCCAGGATTC CATTGTTCAA GCTTGGCATA 6900

AATTTTAACT TTATCATCAC TATAATGTTC TAACAGTACT AATGGTGTAT TGCCaATTAA 6960

ATCATAAGTA ATCATAGATG CACCCTCATC TGACATGCCG ATCAAATGAA TGAAACCTTT 7020

CTTCATGTCT CAATTTTAAT TCTTACTTTT CAGATAAGAA TTATAAACGA CATTTTGTTA 7080

TTTTGCAATT ATCTAAGTTT CGATTAATTC AGAACCAGTA CTAAATTTTC AATTCCAAAC 7140

AAAAAAACAC CTGAGCAACA CAAATACTTG TGTGTCAGAT GCTTCTATAT ATTAACTAAA 7200

TAATTGCACG ATAAAGACTA AAATAATAAC GACAGGCATC GCATACTTAA TTAAGTAATA 7260

CCAACCACTG AATAATCTAA ATCGATCTTT ACCAAAATAT TGTTGTAATA ATTTTTTATC 7320

TAATAATTGT CCTAOGACAA GCGTAGTACC TAATGCGCCT AATGGCATCA ATACATTCGA 7380

AACGATGAAA TCCATATTAT CAAAAATCGT TCCCGCACCG AATCTTACAT CTTTTAAGAT 7440

ACCAAAAGAT AAGGTTGCTG GAATACTAAT GATAAATACT AAAATACTAC CGATCACTGC 7500

GACTTTTTTA CGTTTTGTAT TGTCATTCTT CGTGAAGTTA GAAACATTTA ATTCTAATAA 7560

AGAAATAGAT GACGTTAAAG CCGCAAATAA GAACAGCACT AAGAATCCCA AATAGAATAA 7620

TGTGCCTAGA TGCATTTGAC TAAAGACCAT TGGCAGTACT TTAAATAATA ATCCAGGCCC 7680

TTCI^GTGGT TCATAGCCAA AACTATGTAA AGCCGGAAAT ATAGCTAGAC CTGCCAATAC 7740

AGAf ACAAAG ATATTCATAA CAACGATAGA AATAGCTGAT GACTTAATCG TCATGTCTTT 7800

AGAGGCATAA CTCGCATAAG TAATCATACC TGTAGTTCCT AATGATAACG TAAAGAATGA 7860

TTGACCTAGC GCAAACAAGA TGCCATCAGC AGTAATCTCT GATACTCTTG GTTGTAAAAT 7920

AAATTTCACA CCTTCTAAGA CGCCATCTAA TGTTAAAGAC TTAATCACAA TGACGATTAA 7980

AAAGACAAAC AGCAATGGCA TCATAACTTT CGATGCCTTT TCTAATCCTT TTTCAACACC 8040

TAACATGACA ATAATCATCG TAGCGAATAT GAATATACCT TGCCCTAGAA CGGTTAACCA 8100

AGGATTTGAT ATTACCGCTT CAAAATTCAT TTCTTGGAGA TGATTGATGC GTTGAAATAT 8160

AACTAATTGC CATAATACTT GTCCGATGTA AATGACAATC GAACCACCGA TAACACTATA 8220
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TTTACCAGTT AATTTACTAT ATATTTGTGT TGTATATGTC CGTCCCATTT TCCCAACAGT 8340 

GAATTCCATA ATGAGTAATG GCAACCCAAC AAAAATGGTG AATATTAAGA ACATAGCTAG 8400

AAAGGCACCG CCGCCATAAA TCCCTGCCAT ATATGGGAAT TTCCACATGG CACCAAGACC 8460 

GATTGCAGAA CcCGCACTAG CTAAAATAAA TCCAGTTGAT GACTTCCATT GTGATTGTTG 8520 

TCTTTTCATC ATTCAC 8536 



(2) INFORMATION FOR SEQ ID NO: 279: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4328 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 279:

GCTTTGGCCA TTTTATGTGG CGATTGAGAC AATCtGTkGT TGTCTTATTT GATGTTGTAT 60 

TTCAACTGGT AATTCTAGTT GCGATTGAAA TAATGGCAAC TTTTCCCAAT CATTAACAAA 120 

TAATTCAATA CCTGCTATGT CTAATACTTT AGCACGTGCA TCATCAACAA GACGGCGTTC 180 

CAATTGATTT GCTTCTTCTT TAATACCTGG TGACGTACTT TCTAATATCA AATTAGATAT 240 

AGGGATGTGA CCATTAATTG CATAATATAA TGCAACACGC CCACCCATTG AATATCCAAA 300 

CAATGTTATT GATTTATCTT TATATTTATC TAAAATTCGG TCTAACAACG TCGTAATATA 360

ATCAAAATTC CACGTTTCAT CCATTGAAGA CTGATCTTCG CCATGGCCTG GTAAGTCTAT 420 

AGTGATGACA TGATAGTTAT CAGTAAATTT TTCGATGTGA TTATGATAAG TACGGCTGTC 480

GCTAAGAAAT CCATGCAGAA ATACTAAAAC TTGATTGGTC TCAACGTTTG CTTCATAAAA 540

TTTOTAATGT GTCATGAATC ATTTCACTCA ATTTCTGGTA TAAAATTTGA TGCTGTTTAA 600 

AGTTATCTTC GCGATTCGTT ATCAATTCAT AAATCGTCGA AGTTTCAGAT AACAATGTGG 660

CATTTTTAAA TTCTGAAACA CTGTTAAAAC GTTTAAAATC GAATTGATAT AACTTAGCTG 720 

TATACTCGAA ATCCAATCCC GTCGGTGTGC CAAACAACCG TTCAAAATAG TCAGTTGCAC 780 

TTTCTTTTTG TGGTAAATAT GAAAAAATAC CGCCACCATC GTTGTTCAAT AATACAATAT 840 

TCATCTGAAT ATTATTTAAT TTTGACATTA ATAGTCCATT CATATCATGA TAAAATGATA 900 

AATCACCTAT CAATAATGTT ATTCGTTTAT GCACAGCCAT ACCCAGTGCA GTTGAAACGA 960 

TACCATCAAT ACCATTCGCA CCACGATTCG CATAGACATC TATATTTTTA TTCAATAACA 1020 

AGTTATCTAC ATCTCTGATA GGCATACTAT TACTAATAAA TAATGCATCT TTTTCAGATG 1080 
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TAATTTCTTT ACGCCCTTTT TTCTCTAAGC ATTGCCATTT TTCTAACCAA CTTACGCGAT 1200 

TAACTGTCGT GTCTTCCATT AATGACCTAA AGAAATCATT CGCAGAAATC TCATATGAAA 1260 

TATCTGGCGC TATCGGAAAG ACATCAATCT TATCATTGTT TTGCACTAAT ATTTGAAATG 1320 

CATCAGTTTT CTTTAACCAT TGATTTAACT TTTTAGAAAT CACTGGTTTC CCAACACGAA 1380 

TTACGAAATC CACATTTAAG TCTAAGCCGC TTCTAAACAG CAAATCATAT GTACAGATAA 1440 

CATTCGGATG ATCAAATTTT CTTAAATGAC TTAAAGGATC AGCTAAAATA GGCAAATCAT 1500 

ATATCGTTGA ATACGTTAGT ATTTGaTCAA CTTCTTGGTG CTGCATATCC CCTACAATAA 1560 

TTAAACCTTT TTTCTTATTT AAAATGTGTC TTAATGCCGA TGCATCTATA CTTTTTTGAT 1620

AGTGCGGTAA AATCTTCATC TCAGAAGTTA ACAATTCTGT TGCATTCAAA TCAGGTGTTA 1680 

ACGGATCTCT AAATGGCAAG TTAAAATGAA TTGGCCCTTT ATGTGGTCCA TATAAATATT 1740 

GACTAGCAAT TTGCATTTGA TAGTAAATTG CATCAATGGT CTCTTTACTA TCATCCGCAA 1800

TAGGCATATC GAACTCATAA CTTACATAAT TATTAAACAT ATTTACTTGA TTAATCGCTT 1860

GTGGTGCGCC TACACTTCTT AATTCATGCG GACGGTCACT TGTTAAAACG ATTAAAGGAA 1920 

TTCTACTAAT TTGGCTTTCA GCAATTGCAG GCGTATAATT CGCTGCTGCT GTACCTGACG 1980

TACATAATAT AGCGACAGGT CTTTCACTGC CTTTAATTAA CCCAACTGCA AAAAACGCTG 2040

CACTTCGCTC ATCGGGGTGT ATCCATGTTT TAATATTTGG ATGTGCTTCA AATGCAAGTG 2100 

CAAGTGGCGT TGAGCGTGAT CCCGGACTGA TAACTACTTC CCTTACGCCG TACGCATATA 2160 

ACTCAGATGC AAATGTAAAA ACTTGCTTCG TTAAAGCTGC TTTATGATTT CCCATTCATA 2220 

TCGACTCCTA ATGCATTCAT CATAGGTGTG AACTTAAGGT TCGTTTCTGC CAATTCACTA 2280

TCTGGATCAG AATCTTTAAC AATGCCACAC CCAGCAAATA AAGTTGCTTG TGCTTTCTTA 2340

ATAASCATCG AACGAATTGC AACAATAAAT TCACAATCAT CGTATATATC TATATAGCCA 2400

ACCGGTGCAC CATATAATCC TCGCGTACCA AATTCTTTCT GCTCAATAAA ATCCATTGCA 2460

AATTCTTTTG GATAGCCACC TAAAGCAGGT GTTGGATGTA AATTATCAAT TAAACTAATA 2520 

TACGAATCAT CCTTCAGTGG CGCCTTTATT TCAGTGTACA AGTGATATAA ATGATCATTT 2580 

TTTAGAATTT TAGGCGTCTT ATCATAATGT AATTCAGTGA TATAAGGTTT AATATCATGT 2640 

AAAATACTGT CAACAACAAA TCGATGTTCG ATTAAGTTTT TATTATCTTT TAAAAATGCT 2700 

TCAACATTTT TTGTATCTTC GTCCTCATCT TGTGAACGTT TAATTGTACC TGCTACAGCT 2760 

TTAGTCGATA GTATTTTATT ATTGACCTTT ATTAATTGTT CAGGTGTTTG TGAAAAGAAT 2820

ATAGAATCTT GTGATTCTAA CAAGAATATA TAACTGTTTT TTTCTTTAGA ATATGCTTGC 2880
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ACAATTTTTT CTTCATTATT AATAGATTCT ATAGCTTCTA CTACAAGTTG ACGCCAGTCA 3000 

TCTTTATAAA TATCTTCATT TCTAGTAATT TCCCCAATTT GCTCGTCCAC ATCTATGTCC 3060 

GATATATTGT TGAACAAATC CATTAAATCG TTCAATGCCT CAACAGTAAA ACTTTCCCTT 3120

TTAACTGTAT AAGTTAAAAA TGTCCCATTA TTATCAGTTG AAATTAAAAC TTCAGGTAAT 3180 

ACAAAATGAT TTAGTCCAAA CTCTCGCCAT TCATCATCTG ATTTATGACT TGAAAATTGG 3240

AACCCTCCAA CAACTCGAAG ATGATGTTTC TCAGATTGCG GATGTATAAA TGTGATGTTA 3300 

TGTTTTAATT TTTCCCAGTC TTTAAAAATA GATTGTTTAT TTTTAGAATT ATTTTTGAAT 3360

AATTGAATTG CTTTGTAGCC AAAATATGAC GTTCGATTAT CATTCAAACG CATATAAAAG 3420

CGATCTCCTG CCTCATTGTC AGTGAGATGA AATAATGTGC TCGGGTCTAG TGACTGTGAT 3480 

AATTTCACTT CAACTGAAAC CCATTCCTTT GAGCTGCCAT ATATCTCTTT GACAATATCG 3540

TCCTCTAATA CGCCCGTAGC CATCCATTTC ACTTCTTTCT TCGTCTTTTT TCACTCATTA 3600

TTATATTGTA TCATTTTTGG ATAATTGTGT TACAAGAATT GCTTAAACTT ATCTTGCAAT 3660

TTTTCACGTC AATTGACCTT TATGCTACTT TCTATTAAAA TATCTTTGTT ATAAAAAATA 3720

TGATTTAAAG AGGTTTTGTA TTCAATGAGT AATCAATATC AGCAATATTC TACAGTTAAG 3780

AAATATTGGC ATTTAATGCG TCCTCATACA TTAACTGCTT CCGTAGTACC CGTTTTAGTT 3840

GGTACAGCAG CATCTAAAAT ATATTTTCTT GGTAGCGAAG ATCATATTAA AATCAGCCTA 3900

TTCATTGCCA TGTTACTAGC ATGCTTACTT ATTCAAGCAG CAACTAATAT GTTTAATGAA 3960 

TACTATGATT ATAAAAAAGG CCTCGATGAT CATGAATCTG TAGGCATTGG TGGTGCCATT 4020

GTTCGCAACG GTATGAGCCC AGAGCTTGTG CTACGATTAG CCATTGCATT TTACATCTTA 4080 

GCAGCAATAT TAGGTTTGTT TTTAGCTGCT AACTCTTCAT TTTGGTTATT ACCAGTTGGA 4140 

TTAGf ATGTA TGGCTGTTGG TTACCTATAT ACAGGTGGCC CTTTCCCTAT TTCATGGACG 4200 

CCTTTCGGTG AATTATTCTC AGGCGTATTT ATGGGTATGT TTATTATCGT TATTGCATTC 4260

TTTATTCAAA CTGGCAATAT TCAAAGTTAT GTAATTTGGT TAAGTGTACC TATAGTAATC 4320 

ACTATCGG 4328 

(2) INFORMATION FOR SEQ ID NO: 280:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1450 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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10 





GTTCAATACA GAAAAAATAA ATTTAGATGT TGAAGCATCC TACAATTAAT ACAGATCCAT 60 

TTCAATATAT TTAAACTAAA ATCTCGGGAT TTCTAAATTT TGAAATTTCG AGGTTTTnAT 120 

ATTTTTATTT AAAATAGCAC ATTTATACTT TATAATAGTA AAGATGAACA TATAAGGAGG 180 

CCAAATCATG GCAAAACATC CATTCGAACA ATTTAATCTA GAATCTAGTT TAATTGACGC 240 

TGTGAAAGAC CTTAATTTTG AAAAACCAAC TGAAATTCAG AATCGAATTA TTCCAAGAAT 300 

ACTAAAGAGA ACAAATTTAA TTGGTCAATC TCAAACGGGT ACAGGGAAAT CTCATGCATT 360 

TTTATTACCA TTAATGCAGT TAATTGATAG TGAAATAAAA GAACCACAAG CAATCGTAGT 420 

TGCACCAACA AGAGAACTTG CACAACAACT ATACGATGCA GCGAACCATT TAAGCCAATT 480 

TAAAGCTGGT GTTTCAGTTA AAGTTTTTAT TGGTGGTACA GATATAGAGA AAGATAGACA 540

ACGTTGTAAT GCACAACCAC AATTGATTAT AGGCACCCCT ACTAGAATTA ATGACTTAGC 600 

TAAAACGGGA CATTTACATG TGCACTTAGC ATCATATTTA GTTATTGATG AAGCGGATCT 660

TATGATTGAC TTAGGATTAA TTGAAGATGT AGATTACATT GCTGCAAGAT TGGAAGATAA 720 

TGCAAATATT GCGGTGTTTA GTGCTACAAT CCCACAACAG TTACAACCAT TTTTAAATAA 780 

ATATTTAAGT CATCCAGAAT ATGTAGCTGT CGACAGTAAA AAACAAAATA AAAAGAACAT 840 

CGAATTCTAT TTAATACCTA CTAAAGGTGC AGCTAAAGTT GAAAAGACTT TAAATTTAAT 900 

TGATATACTA AATCCATACT TATGTATTAT TTTCTGTAAT AGTAGAGATA ATGCAAATGA 960 

TTTAGCACGT TCACTAAATG AAGCTGGTAT TAAAGTTGGT ATGATTCATG GTGGCTTAAC 1020 

GCCaCGTgAA CGTAAACAAC AAATGarACG TATACGTAAT TTaGAATTCC aATACGTTAT 1080 

TGCCaGCGAT TTAGCATCTC GTGGTATTGA TATTGAAGGT GTTAGTCrTG TCATCaATTT 1140

TGATGTGCCA AATGATATTG ACTTCTTTAC GCATAGAGTC GGACGAACTG GTCGTGGGAA 1200 

TTATTrAGGT GTAGCAATTA CGCTTTATAG TCCTGATGAA GAACACAATA TTTCATTAAT 1260 

AGAASATCGC GGTTTTGTAT TCAATACTGT TGATATTAAA GATGGTGAGT TAAAAGAAGT 1320

TAAAGCGCAC AATCAGCGTC AAGCAAGAAT GCGCAAAGAT GACCATTTAA CTAATCAAGT 1380

GAAGAACAAA GTTCGAAGTA AAATTAAAAA CAAAGTTAAA CCAGGTTATA AGAAGAAATT 1440

TAAACAAGAA 1450

(2) INFORMATION FOR SEQ ID NO: 281:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1139 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 281: 



AGTCAGGTAT ATCATGCCaT yCTGAATTGG TCGATATTAA TATCAGTGGT GTTAAAGAAC 60

GAATTGTATA CCAATAGACG CTTTATATTG TAAAATAGTA TTAAATGCaG AATAGAGAGG 120

AGATTTAATG CGATATGACA AATTATAAAG TTGTCGTTTT AGATATGGAT GACACATTGC 180

TAAATTCAGA TAATGTGATA TCAGAAGAAA CTGCAAATTA TTTAACAGCA ATTCAAGATG 240

AAGGTTATTA TGTTGTTCTA GCATCTGGTA GACCTACTGA AGGTATGATT CCAACTGCTA 300

GAGATTTAAA ATTACCTGAA CATCATAGCT ATATTATTAG TTATAACGGT AGTAAAACGA 360

TTAACATGAC TAATGAAGAA GTAGAAGTAA GTAAATCGAT TGGTAAGCAA GATTTCGATG 420

AAATTGTAGA TTATTGTCGA GATAGAGGCT TTTTCGTTCT TACATATCAT GATGGTCAAA 480

TTATTTaCGA CAGCGAACAT GAGTATATGA ATATTGAAGC AGAATTAACA GGTTTACCGA 540

TGAAACGTGT TGATGATATC AAAGCGTATA TTCAAGGCGA TGTACCCAAG GTCATGGGTG 600

TAGATTATGT AGCGAATATT ACAGAAGCTA GAATTGATTT GAATGGTGTG TTCAATGATA 660

ATGTAGATGC TACGACAAGT AAGCCATTCT TCTTAGAATT TATGGCCAAA GACGTTTCAA 720

AAGGTAATGC AATTAAAGCG TTATGTCACA AATTGGGATA TTCGGTGGAT CAAGTCATTG 780

CTTTTGGTGA TAGTATGAAT GATAAATCAA TGTTTGAAGT CGCAGGTCTA GCTATTGCTA 840

TGGGGaATGC ATCAGATGAA CTTAAGCAAT ATGCAAATGA AGTTACGTTG GATCATAATG 900

AAAATGGTAT TCCACATGCG CTCAAAAAAT TGTTATAAAT TTTAAAATAA GCCTTAACAC 960

ATGATATTTG AATAAGATAT CTTGTGGTTA AGGCTTTTTA TTTTTGTGAA AATGACTTCA 1020

GTTATACTAT GGAGGATTTG AAATACATAT TTTAGATTAG TAATGATATC AAACGAATAG 1080

AGTAAATGTA TATTTttTGA ATAAATCAAG TATTAACTAG TCACGGAAGG nAGATAAAT 1139



(2) INFORMATION FOR SEQ ID NO: 282: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2931 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 282: 
TCTAAAAATG CTGTGAAATT CTTTTATAAA TATCTAAAAG GAATTAATGT TGATAACATT 60 

GCTGTGATAG GAAGTAAGAC AGCGCAATAT TGTGAATCAC TTGGCATTCG AGTTGATTTT 120 
AtGCCAAACG ACTTTTCTCA AGAAGGATTT TTAAAATCAT TTAATCAAAC TAACCAAAAA 180 
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20 



AATGAAGTTG TTAAAATAGA TTTATATACT TCAGTGCCTA ACAAACAAAA TATACAAGAT 300

GTTAAAGAAA TGATAGAACA TCAACAAATC GATGCATTAA CATTTTCAAG TTCGTCGGCA 360 

GTACGTTATT ATTTTAATGA AGGATTTGTA CCAAAATTCA AGTCGTATTT TGCTATTGGA 420 

GAACAAACAG CACGGACCAT TAAATCATAT CAACAACCAG TAACAATTGC AGAAATTCAA 480 

ACACTCGAAT CACTAATTGA AAAGATTTTA GAAAGTAGGG GCTAAAAATG AAATTTGATA 540

GACATAGAAG ATTGAGATCA TCAGCGACAA TGAGAGATAT GGTTAGAGAG AATCATGTAA 600 

GAAAAGAAGA TTTAATATAT CCAATTTTTG TAGTTGAAAA AGACGATGTG AAAAAAGAAA 660 

TTAAGTCATT GCCAGGTGTA TACCAAATCA GTTTGAATTT ACTTGAAAGT GAATTAAAAG 720 

AAGCTTATGA CTTAGGCATA CGTGCCATTA TGTTTTTCGG TGTTCCAAAC TCAAAaGATG 780

ATATAGGTAC TGGTGCATAC ATTCACGATG GTGTTATTCA ACAGGCAACA CGTATTGCTA 840 

AAAAAATGTA TGATGACTTA TTAATTGTTG CAGACACTTG TTTATGTGAA TATACTGATC 900 

ATGGTCATTG TGGCGTGATT GATGACCATA CACATGACGT TGACAATGAT AAATCATTGC 960 

CACTACTTGT TAAAACAGCA ATTTCTCAAG TGGAAGCTGG TGCTGATATT ATTGCGCCAA 1020 

GTAATATGAT GGATGGTTTT GTTGCTGAAA TTCGTCGTGG ATTAGATGAA GCCGGCTATT 1080

ACAATATTCC TATAATGAGT TATGGTGTCA AGTATGCATC AAGTTTCTTT GGACCTTTTA 1140

GAGATGCAGC AGATTCAGCG CCATCATTTG GGGATAGAAA AACGTATCAG ATGGACCCTG 1200 

CTAACCGTTT GGAAGCACTT CGTGAATTAG AAAGTGATCT TAAAGAAGGG TGCGAGATGA 1260 

TGATTGTTAA ACCTGCTCTA AGTTATTTAG ATATAGTTCG AGATGTTAAA AATCATACGA 1320 

ATGTTCCAGT TGTTGCATAT AATGTGAGTG GAGAATATAG TATGACTAAA GCAGCGGCAC 1380 

AAAATGGTTG GATAGATGAA GAACGTGTCG TTATGGAACA AATGGTTTCA ATGAAACGTG 1440

CAGGTGCTGA TATGATTATT ACGTATTTTG CAAAGGACAT TTGTCGCTAT TTAGATAAAT 1500 

AAGGTTTTAT ATTTATGATT TTCCATAAAC TGTAGGAGGA ATTTACTTTA TGAGATATAC 1560

GAAATCAGAA GAAGCAATGa AGGTTGCTGA AACTTTAATG CCTGGTGGTG TAAATAGTCC 1620 

AGTACGCGCA TTTAAATCAG TAGATACACC AGCAATTTTT ATGGATCACG GTAAAGGTTC 1680 

AAAAATTTAT GATATCGATG GTAACGAGTA TATCGACTAT GTACTAAGTT GGGGACCACT 1740

TATTTTAGGA CATAGAGACC CTCAAGTTAT TAGTCATTTA CATGAAGCAA TTGATAAAGG 1800 

TACAAGTTTT GGTGCATCAA CATTACTTGA AAATAAATTG GCGCAgcTCG TTATTGACCG 1860

AGTACCTTCA ATAGAAAAAG TGCGTATGGT GTCATCTGGT ACAGAAGCTA CATTGGATAC 1920

TTTAAGATTA GCACGTGGTT ATACTGGCAG AAATAAAATT GTGAAATTTG AAGGTTGCTA 1980 
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GCCGGATTCT CCTGGTGTGC CTGAAGGTAT TGCTAAAAAT ACAATTACAG TTCCATACAA 2100 

TGATTTAGAT GCACTTAAAA TCGCTTTCGA AAAATTTGGa AAGGATATTG CTGGTGTAAT 2160 

CGTAGAACCT GTTGCTGGTA ATATGGGTGT CGTACCGCCG ATTGAAGGTT TTTTACAGGG 2220 

ATTAAGAGAT ATTACGACTG AATACGGCGC ATTGCTAATT TTCGATGAAG TAATGACTGG 2280 

TTTCAGAGTC GGTTATCATT GTGCACAAGG TTACTTTGGT GTGACACCAG ATTTAACTTG 2340

CTTAGGAAAA GTTATCGGTG GAGGACTACC TGTAGGTGCA TTTGGTGGTA AAAAAGAAAT 2400 

CATGGATCAT ATAGCACCAT TAGGAAATAT TTATCAAGCG GGTACGTTAT CAGGAAATCC 2460 

TCTTGCAATG ACAAGTGGTT ATGAAACGTT AAGCCAATTA ACGCCAGAGA CATATGAGTA 2520

TTTTAATATG TTAGGCGATA TACTTGAAGA CGGTTTAAAA CGTGTATTTG CTAAACACAA 2580

TGTACCAATA ACTGTAAATA GAGCAGGTTC AATGATTGGT TATTTCTTAA ATGAAGGACC 2640 

TGTAACTAAT TTTGAACAAG CGAATAAAAG TGATTTGAAA TTATTTGCAG AAATGTATCG 2700

AGAAATGGCA AAAGAAGGTG TGTTTTTACC ACCATCTCAA TTTGAAGGTA CATTCTTATC 2760 

TACGGCACAC ACGAAAGAAG ATATTGAAAA AACGATTCAA GCATTTGATA CGGCTTTAAG 2820 

TCGTATTGTA AAATAAATAT ACGGACAAAT TGAGAGCCTG AACTTTGTTC AGGCTCnTTT 2880 

TAAATGTATA TAAGGCATGG GCGGCGACTT GATAGTGAAA GTCCACTACT A 2931 
(2) INFORMATION FOR SEQ ID NO: 283: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1421 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 283:

AATTATGAAT GCATTACCAG TATTATTACA AAAGAACAAT TAAAAATGTT TGTTTATGAT 60 

TATGATACGC ATCTCATTAA AAATGTAaTG GTTGCAGCAG ACGTGTTAAA GGCAAATGAT 120 

ATTCAAGGAC ATGAACCATT AATCGTTAAC CTTCAAACGA TTGATGAAAC ATTACATCGT 180 

TTACCTATGC ATAATAGAAA AGACATGATG GTTAATGGCG GTGTACTTAT GGCACATTTA 240 

AATGCCAAAA GTGGTCCGTG GTTAAAAGAT GTGCTAAGAC AAATTGAGAT AGCGATTGTA 300 

ACAGGTAAAG TAAGCAACGA AGAAACTGAA ATTTTGAAAT GGGTGGATAA TCATGTCAAA 360

ATATAGTCAA GATGTACTTC AATTACTCTA TAAAAATAAA CCGAATTATA TATCTGGACA 420 

AAGCATTGCG GAATCACTTA ATATTTCACG CACTGCAGTA AAAAAAGTGA TTGACCAATT 480 
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CCCAGATATT TGGTATCAAG GTATAATAGA CCAATATACA AAAAGTTCTG CTTTGTTTGA 600 

TTTTAGTGAA GTATACGATT CAATAGATTC TACACAACTT GCTGCGAAAA AGTCACTTGT 660 

TGGAAATCAA TCTTCATTTT TTATCTTGAG TGATGAACAA ACGAAAGGTC GTGGGCGATT 720 

TAATAGACAT TGGAGTTCTT CAAAAGGGCA AGGACTTTGG ATGTCTGTCG TGTTAAGACC 780

TAACGTTGCA TTCTCAATGA TATCTAAATT TAATTTATTT ATTGCATTAG GGATAAGAGA 840 

TGCGATTCAA CATTTTAGTC AAGATGAAGT CAAAGTGAAA TGGCCGAATG ATATATTTAT 900 

TGATAATGGT AAAGTGTGTG GTTTCTTAAC TGAAATGGTT GCTAATAATG ATGGTATAGA 960 

AGCAATAATA TGTGGTATAG GTATTAATTT GACGCAACAA CTAGAAAACT TTGATGAAAG 1020

TATTAGACAT AGAGCAACAA GTATACAATT ACATGATAAA AATAAATTAG ATAGATATCA 1080 

ATTTTTAGAG ATATTACTTC AAGAAATTGA AAAAAGATAT AATCAATTTT TAACGTTACC 1140 

TTTTTCTGAA ATTCGTGAAG AATATACTGC AGCTTCTAAT ATTTGGAATA GAACGTTGCt 1200 

ATTTACAGAA AATGATAAAC AGTTTAAAgG ACAAGCAATT GATTTAGATT ACGATGGCTA 1260 

TCTAATTGTT AGAGATGAAG CGGGTGAATC ACACCGTTTA ATTAGTGCAG ATATAGATTT 1320 

TTAACACTAA AGCAAGGAGA GATAGCTATG GGTATGGCAA CCTATGCCGT TGTGGATTTG 1380 

GnAACAACAG GCAACCAATT AGATTTTGAC GATATCATTC A 1421 
(2) INFORMATION FOR SEQ ID NO: 284:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2202 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 284:

CCAAGTTGCC TAAAATGATT AAGCAAGGTT TATACCCTAT GCnAACGATT GAACAAGAAT 60

CTGGAGCCAT CCGACTGCCA ACGATTTCTA GAGTGAnGCG TTCATTACAA TGGGGTAATG 120

ATGCTTATAC AATGATTTTA GATCGTATGA ATATTGAAAC AAATGAATAA TAAATGAACG 180

ATAAACAATG GTTATCTATC TGCACTAATA AGGTAGATAA TCATTGTTTT TTCACGAAAA 240

AATTTACAGA GTAAAAGAAC TTAAATTTCA TATTAAGTCT TTAGAACTCG ACACTTAAAA 300

ATGCTATAAT CATATGTATG TTAAAAAAGG AGTTTCGGAA AATGTATGAC ATTAAAAAAT 360

GGCGCCATAT TTTTAAATTA GACCCAGCTA AACATATTTC AGATGATGAT TTAGATGCGA 420

TTTGTATGTC TCAAACAGAT GCAATTATGA TTGGTGGAaC TGATGACGTT ACTGAAGATA 480
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CAAACATCGA AAGTGTAATG CCTGGTTTTG ATTtTTATTT TGTACCTACA GTACTGAACA 
GTACAGATGT TGTATTTCAC AATGGTACAT TATTAGAAGC GCTTAAAACA TATGGACATA 
GTATAGATTT TGAGGAAGTA ATATTTGAAG GGTATGTCGT GTGCAATGCT GATAGCAAAG 
TGGCAAAACA TACCAAAGCA AATACAGATT TAACAACAGA AGATTTAGAA GCATATGCCC 
AAATGGTCAA TCATATGTAT CGATTACCGG TTATGTATAT AGAGTATAGT GGCATTTATG 
GCGACGTATC AAAGGTTCAA GCTGTCTCAG AACATCTAAC AGAAACGCAA CTTTTTTATG
GTGGCGGTAT TTCCTCAGAA CAACAAGCGA CAGAGATGGC AGCTATTGCA GATACAATTA 
TCGTCGGTGA TATTATTTAT AAAGATATTA AAAAAGCTTT AAAAACAGTA AAAATAAAGG 
AGTCTAGTAA ATGAATGCGT TATTAAATCA TATGAATACA GAGCAAAGTG AAGCTGTAAA 
GACAACAGAA GGACCATTGT TAATTATGGC AGGTGCTGGT TCAGGGAAGA CACGTGTTTT 
AACACATAGA ATTGCTTATT TATTAGACGA AAAAGATGTC TCACCATACA ATGTTTTGGC 
TATTACTTTT ACAAATAAAG CTGCAAGAGA AATGAAAGAA CGTGTTCAAA AATTAGTAGG
TGATCAAGCA GAAGTTATTT GGATGTCAAC ATTCCACTCA ATGTGTGTTC GTATTTTACG 
TCGTGATGCA GATCGAATTG GTATAGAACG CAATTTTACG ATAATTGATC CTACAGACCA 
AAAATCTGTT ATTAAAGACG TCTTAAAAAA TGAAAATATT GATAGTAAAA AGTTTGAACC 
TCGTATGTTT ATCGGTGCGA TCAGTAATTT GAAAAATGAA CTTAAAACAC CTGCAGATGC 
TCAAAAAGAA GCCACAGATT aTCACTCgcA AwTGGTaGCA ACgGTTTaTA GTgGATATCA 
ACGCCAATTG TCACGTAATG AAGCGTTAGA TTTTGATGAC CTTATTATGA CAACGATTAA 
CTTATTTGAG CGTGTACCAG AAGTTCTAGA ATATTATCAG AACAAATTCC AATATATTCA 
TGTAGATGAG TATCAAGATA CTAATAAAGC ACAATACACA TTAGTTAAAT TATTAGCAAG
TAACTTTAAA AACTTATGTG TTGTAGGTGA CTCAGATCAG TCAATTTATG GTTGGCGTGG 
TGCTGATATT CAAAATATCT TATCATTTGA AAAAGACTAT CCAGAAGCGA ATACAATCTT 
TTTAGAGCAA AATTATCGTT CGACGAAAAC GATTTTAAAT GCGGCTAACG AAGTGrTTAA 
AAATAATTCT GAACGTAAGC CAAAAGGACT GTGGACTGCA AATACGAATG GTGAGAAAAT 
TCATTACTAT GAAGCAATGA CGGAACtGAT GAAGCGGAAT TTGTAATACG AGAAATTATG 
AAGCATCAAC GTAATGGTAA GAAATATCAA GATATGGCAA TTTTATATAG AACGAATGCA 
CAATCACGTG TACTTGAGGA AACATTCATG AAATCTAATA TGCCATACAC AATGGTTGGT 
GGCCAAAAGT TCTATGACCG TAAAGnAATC CAAAGATTTA TT 
(2) INFORMATION FOR SEQ ID NO: 285: 
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(A) LENGTH: 785 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 285:

AGTGGTGCAA AGATAGGCAT TGATaATACC GCTAAGCCAG CAAGATGATG GCACGATAAA
ACCTAAACAG AAGAAmATAA ATAGTAATAC GATGATAAAT AATGGTCCAC TCATATGTTG
AACTAAAGAT GATGAAAAGT GTAAGATTGT ATCTGAAATC ATACCTTCAT TCAACACTAA
ATTAATACCT CGAGCTAAAC CAATAATTAA AGATACACCT ACTAAACTTG ATGCACCATT
GACAAATGCA TCTACAGTTC CTTTTTCTCC CAATCCAGAT TTACCTGTCC CAGCAATAAA
CATTATTATA ATTGTAAATA TTAAAAATGC TGAAGCCATA ACTGGGAACC ACCAACCTTG
CGTCATAACT CCCCATACCA TAATTGGAAA TGGTAGTACA AATAATGTAA GGATTATCTT
CTTACGCAAA GTAAAATGGG CACTATCGTC ATCTTTTAAT ACAGACCATT GCTGTTCAAA
AGCATCTTTG TCTTCATAAG AATATGACGC TTTAGGATCG TTTTTAATTT TTTTACAGTA
CCAATATAAA TAACTAATAA CAAAAATCGC ACCGACAATA CAAGCACCTA TTCTCCAATA
CAAGCCATCC GTAAAAGTTG TACCAGCGGC ATTAGAGGCA ATTACAACCG AGAACGGGTT
AATAGTTGAA AATGTACTAC CGACAGAGCT GGcAAGGAAT ATGGCACCAA CTGGAAACGA
TAGAATCGTA TCCTAACGCT AATAAATATA GGGACTAAAA TCGGATAAAA TGCTACAGCT
TCTTC

(2) INFORMATION FOR SEQ ID NO: 286:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 812 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 286: 
CTAACGnGAT AAGGTTGCAA nTTTATCTGA ACATCTGATG ACTGTAATTT TGTtAATGAT 
AAAATATTTG TCACTAATAG ATATAAATAC TGACTTTCTT GAAAACTATG TACAAGTAAT 
TGTTCCTTTT CTATGATAGA CATATCTTTA CTATGTGATA CTAAAATATC TAAATkTCCC 
ATAATTGTTG TTAACGGTGT ACGTATGTCA TGCGAAATTG ATCTTAAAAA ATTTGAATGT 
GTCAGTTGAC GTTCAGCCTG TAACATGGAT TCTCTCGTTT GTTTAAGTAA CGTCACATTT 
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ATCACTTGAG AACTTTGGTA ATCAATGGCT AGAATGCCTT TAATCGGAGA TGTGCCAATT 420 

GGTATCAACC ATTTATTAAT GCCTGGAAAT GTATCTGTTG TTGCACCAGC TTGTCTTTCA 480 

TTTTTAATTA CCCAGCTTAA TGCTTGTTCA TGCTGTTGAG TCGTATTATC GATATGGTTT 540 

TGCAATGGTA TTGTTTTAAT TACTTTCGAT TGATTGATAA CGTATATAGT AATTGATTGT 600 

TGCAATAATT GATTAATTTG GTATCCAGCA TTTATTAGTA AGTTTTCAAC TGTATAAGTT 660 

TGTTTAATCG AATCATTAAA TTGAAATAAT AAATCTGTAC GATAAAGTTG CTTTTTAGTA 720 

ATGGaGTAwT GGAATTTAAT TTGTnTTAAT AAAGCACTCG TTAAAATACT TGTnAAAATG 780 

CTAACGATAA ATGTAATAGG ATAGTCAAAG CG 812 
(2) INFORMATION FOR SEQ ID NO: 287: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1732 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 287:

ATnnATTATT ATTACTGCTA TTTTAAATTT TAAAAAATGC TTTTGATTAT ATTCAACAnT 60 

TTGTATAAAA TTAAATTTGC TTTTGATTAA AGCATGAAAA TTGTAATCAA ACCATAAATT 120 

GTCGTATGAT GTAGTTAGAA TTTTAAAATG CAGGAGGTCa AGTATATGAC TGAAATaACA 180

TTCAAAGGTG GACCAATCCA CTTAAAAGGT CAACAAATTA ATGAAGGTGA TTTTGCACCT 240 

GATTTTACAG TGTTAGATAA TGACTTAAAT CAAGTAACAT TAGCAGATTA TGCTGGTAAA 300 

AAGAAATTAA TTAgTGTGGT ACCATCAATT GATACAGGTG TTTGTGATCA GCAGACTCGC 360 

AAATTCAACT CTGATGCTTC TAAAGAAGAG GGGATTGTGC TTACAATTTC AGCAGACTTA 420 

CCATTCGCAC AAAAAAGATG GTGCGCTTCA GCAGGTTTAG ACAATGTCAT TACATTAAGT 480

GACCACCGTG ACTTATCATT TGGTGAAAAC TATGGCGTTG TTATGGAAGA ACTTCGCTTA 540 

TTAGCTCGTG CAGTATTTGT ATTAGATGCA GATAATAAAG TTGTTTATAA AGAAATCGTT 600 

AGTGAAGGTA CTGATTTCCC AGATTTTGAT GCTGCTTTAG CTGCATACAA AAATATTTAA 660 

TCATTAAAGA GATAAATCTT AAAATGTATA CATCGTGTCC ATCGTTGTCA ACAGCATTAA 720 

AATAGAATTG TTTTCTATGA TTGCTAAGAC CTATGGGCAC TTTTTATTGG AGAGGGACGA 780 

ATATGGCAGA ACAACAAACA ATTATGGAAC GCTTGTTTCA TACATTAGAT GAAAAAGCTA 840 

AAACATTAAA TAATGAAAAT GGCCaAAGTT TTATTGAAAA TCTTGGGCTA GCAATGGAAC 900 

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CATTCCAATT TGCATATTTA AGTTTAATGC aGGAAGAAAA GATAcAAGCA AATCATCAAA 1020

TTACACCAGA TTCAATTGGA TTGATACTAG GATTTTTAGT TGAGCGTTTT ATGAACAACC 1080

AAGAAGAATT ACATATTGTT GATATTGCAA GTGGTGCCGG TCATTTAAGT GCTACTGTAA 1140

AAGAAGTGTT ACCTGraAtT GcGGTTATGc ATcATTTaAT TGaAGTTGAt CCAGTTTTAT 1200

CACGTGTTAG TGTACATTTA GCAAACTTCT TAGAAATTCC TTTCGATGTG TATCCTCAAG 1260

ATGCCATCAT GCCACTACCA TTAGAAGAAG CAGATATCGT TATTGGTGAT TTTCCAGTAG 1320

GCTATTATCC AATTGATGAA AGAAGTAAGG AGTTTAAGCT AGGTTTTGAA GAAGGACATA 1380

GTTATTCACA TTATTTATTA ATAGAACAAG CAATAAATGC ATTAAAAGAT GCTGGATATG 1440

CCTTTCTAGT GGTACCAAGT AATATTTTTA CAGGTGAACA TGTAAAACAG CTTGAAAAAT 1500

ATATTGCAAC AGAGACAGAG ATGCAAGCAT TTTTAAATTT ACCACCAACT TTATTTAAAA 1560

ATGAAAAAGC GCGAAAATCT ATATTAATTT TACAAAAGAA AAAATCGGGT GaAACAAAGC 1620

CAGTTGAAGT ATTATTGGCA AATATTCCTg ATTTCcAAAA TTCCTTCACC AATTTCCAAG 1680

GATTTATGGA CAGAGTTAAA ATCCAGTGGG ATGGGACCAC CAAATCGTCC TA 1732

(2) INFORMATION FOR SEQ ID NO: 288:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2779 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 288:

AAAAGAACTA GCTAAACGCA AGCAAGAAGC TATTAGTAGA ATTAAAGACT TTTCAAATGA 60

AAAAATAAAT AGTATTCGAA ATAGTGAAAT TGGCACAGCT GATGAAAAAC AAGCAGCAAT 120

GAATCAAATT AACGAAATTG TGCTTGAAAC AATTAGAGAT ATTAATAATG CGCATACATT 180

ACAGCAAGTT GAGGCTGCAT TGAACAATGG TATTGCTCGA ATTTCAGCAG TACAAATTGT 240

AACATCTGAT CGTGCTAAAC AATCGTCAAG TACTGGAAAT GAATCTAATA GCCATTTAAC 300

AATTGGTTAT GGAACTGCAA ATCATCCATT TAACAGTTCG ACTATTGGAC ATAAAAAGAA 360

ACTTGATGAA GATGATGACA TTGATCCACT TCATATGCGT CACTTTAGTA ATAATTTCGG 420

TAATGTTATT AAAAACGCTA TTGGTGTGGT GGGTATCTCT GGTTTACTAG CTAGTTTCTG 480

GTTCTTCATT GCCAAACGTC GTCGTAAAGA AGATGAAGAG GAAGAATTAG AAATAAGAGA 540

TAATAATAAA GATTCAATAA AAGAGACTTT AGACGATACA AAACATTTAC CACTTTTATT 600
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AAATAATGGC GAGTCACTCG ATAAAGTTAA ACATACGCCG TTCTTCTTAC CAAAACGTCG 720 

TCGTAAAGAA GATGAAGAAG ATGTGGAAGT TACAAATGAA AACACAGATG AAAAAGTGTT 780 

GAAAGATAAC GAACATTCAC CACTCTTATT CGCAAAACGA CGCAAAGATA AAGAGGAAGA 840

TGTTGAAACA ACAACTAGTA TTGAATCTAA AGATGAGGAC GTTCCTTTAT TATTGGCTAA 900 

AAAGAAAAAT CAAAAAGATA ACCAATCCAA AGACAAAAAG TCAGCATCAA AAAATACTTC 960 

TAAAAAGGTA GCAGCTAAAA AGAAGAAAAA GAAAGCTAAG AAAAATAAAA AATAATTTGT 1020 

TTCTTTGATA AATAGaGGAG CACCGATTGA CATCACATCA GTCGGTGCTC CTTTTATTTA 1080 

TTCTTTTTAA TTAATTTATA CAATGCCTGT TGAGCGTGTT GATTCGCTTC TTTGTTTTGT 1140 

TCTCTCGGTA TCCATTTAAC AAATAATAAA TCAAAATCTT TTTCAAATAT TTCTATTTGA 1200 

TCAAAATAAG GTTTGAAATT TGCGTTTTTC ACATAACCAG CTTCAATGCT ATCTGCAATT 1260 

AGCTTTGAGT CTGTATATAA TAGTGCGTTT TGAACATTTA ATTCACGTGC ATGTTCTAGT 1320

GCATAAATAC ATGCAGCCCA TTCTGCAGTG TGGTTATCCA TTTCGCCTAA CTCATGTGTA 1380 

TATGTATAAT GCTGCTCATO TTCTTTGATT ACAATGGCAC ATGTACTTAT GCCTGGATTT 1440 

cCTTTCGTCG CAGCATCAAA ATTTATTTTC GCCATAATAA ACCTACTTTC TATTCAATAC 1500

TTAGTTAAAG TTACTATTAC TGTAATACAA AATATGTTGG GTAATCCATT AAAAAACACG 1560 

CATCACTTAA ATAAGTAACA CGTGTTTAAA ATACTCGCTG ATTCAAAGAT GATTTTCTAA 1620 

TACGTAtACT GTaATATACT TCCTAAAAAA ATCATCTTCA GGCTGGGACA TAAATCAATG 1680

TTCTATGCTC TACGATGTTA TATTGGCAGT AGTTGACTGA ACGAAAATGC GCTTGTAACA 1740 

AGCTTTTTTC AATTCTAGTC AGGGGCCCCA ACACAGAGAA TTTCGAAAAG AAATTCTACA 1800

GGCAATGCGA GTTGGGGTGT GGGTCCCAAC ACAGAAGATG ACGAAAAGTC AGCTTACAAT 1860 

AATGTGCaAG TTTGGGATGG GCCCCAACAA AGAGAAATTG GATTCCCAAT TTCTACAGAC 1920 

AATGCAAGTT GGGGTGGGAC GACGAAATAA ATTTTGCGAA AATATTATTT CTGTCCCACT 1980

CCCTTAAAAC TTATTCTTTT GTGTAGTAAG TGCGTTAATA GCCTTGATCT AACTTATCAA 2040 

TCTTACCTTT ACGATAAAAT GATTTAGGAA TATATCCAAA TGGTACATTG AAAACTGTTG 2100 

AAGCTAATTT TAATACGTAC GTTGTAATAA ATATTTCAAa TACAAmTGTa CCAGGTAAAC 2160 

TTCCGATAAA TGCGATAGCT ACAAATAAAG CTGTATCAAT TATTGrGCTT AAAAATGTAC 2220 

TACCATATGg CACGGATGAA AAACGTTTTa TCmGaACTAA ATACTTTTTT AATTAGTGAA 2280 

AAGATAAATA CATCAATATG TTGACCAATA ATATATGCGA CGATTGAGCC TAAAGCAATG 2340 

CGTGGCACAA CATCAAAGAT TGCGTGTAAT GCTTTTTGTG CCATATCTTC TGGTGCAGGA 2400 
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CAAACTGCTC TTTTTGCAAC TCTACGCCCA TAAATATCGT TTAATATATC TGTTGCTAAA 
TAAATAGAAG CAAACATGAC ATTACCTAAA GTTGCTGAAA TACCAAAGAT TTCTACAGTT 
TTAATCACTT GTATGTTGGC AATGATTGTG CCAATTGCAA CCCATGCAAT TAAACCTTGT 
TTACCAAAAA AGCGATACAT AAGTACCATA AGCACGAACG TTGCAATAAA CGTAACTAGT 
CCTAAAATTT CATTATACAT ATTAAAATGT CCTCCTAAAT TTTGATCATG CGGGTGTTTA 
GAAACCGCTC AATAAATAA 
(2) INFORMATION FOR SEQ ID NO: 289: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1999 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 289: 



ACTGATGTGC GTTCATCAAA AACAATATAA TCAAATTCAT TTTCATCAAA TTGCTTAAAA
TTATCATCTC TAGATAATGT TTGAATGGTT GCAAATAAAT ATTTGGCATC GACATCTCTA
TGTTTTCCTG TCAACAATCC AAAATCACTA TCATTTTTTA TAGGTAATAC TTTTTTAAAT
TCTTCCTTAG CTCTATTTAA AATCCCCTCA TTATGAACAA TAAATAAAAA TTTATTAGGG
TTTACTTCTC TAACATCTAA TGCACATAAA ATCGTTTTAC CTGTACCAGT TGCAGATATT
ATTAACGCCT TATCTTTGGC TTTATCCCTA ATAGCTTTTA ATGACCTTAA TGCTTCTGCT
TGCATTAAAT TGGGTACAAT TTCCACTGAT TTTTTCACCT TATCAGCTAG CAGCATTTGA
GTTTGTTCAA CCTCCGCTAA TTTTTCTAAG GAACGGTACT CAAATGATTC TTTATATGAA
TTAATCCATT GCTCAGTCAG TGGGGTACTC TTTTGCCATA ACAAGTCAAA TTCACTTTTT
ACACTATCAA CTAAATCGCC ATTTTTCATA GTAGACAGTA AAACATTATG CTCATAATTA
ACCTTTAACG CATTAGATGT TAAATTAGAG CTTCCTATTA CCATAGAACT ATAATCCTTA
TGCTCAAAAA TATATCCTTT GGCATGGAAT CCAGCAATAT CAGTTAATCT TACCTCTACA
TTTTTTAATT TAAGTAATTC TCCATACATT TTAGGACTAT TAAACCCTAA GTAATTAGAT
GTTAATATTT TCCCTTTAAC ACCCTTATTG CTTAAATCTA ATAGTTGAGC CTTTAAGCTG
GcTAAACCGC TTTCTGTTAT AAAAGCCACA GAAAAATAAA ACGTTTCACA TTTTTGAAGT
TCATCTATAA TTGTTGAAAG AACTTTTTCA TTTTTATTAT TTACTAAAAG CTTCGGTGTA
TAATTCCCTT TATGAGAAAT ATGTTTGTCT ATAAACCCTT TATGTAAAGA TTGATTGAAA
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CGCTGGGGCC CAATTTAATT TATCAAGTTC GTTTATTGAC AACCATTCAA TACTCTTATG 1140

TTCAGTTAGA GTTGGTAACT CTTTGTTTAA AGTACATTTG TATGTTGTTA ACCTAACAAT 1200 

TCCAAAATCA TATTCATGTT CTGTAGTTAT AACTTTGTCT CCAACAATTA AATCACATTT 1260 

CATTTCTTCT CTAATTTCTC TAATCAAAGC GTCTTTTTCA GTTTCATTCT TTTCAACCTT 1320 

ACnGCCAGGA AATTCCCACA TTAAAGGCAG ACTCATTTTT TCACTTCTCT GTGCACAAAG 1380 

AATTTTGTTA TCAGAAAAAA TAATAGCTCC TACTACATTG ATTACTTTTT TCATAAGACT 1440

CACCCTTCAA TTTAAAATCA TCTTAATTGT TATTCTATCA AAAATTACAA AACTATATAT 1500 

AAATCAATAT TAAAAATTAA TATTTTACAT TCACATGAAC GCTCTACTCC ATGCATTTTC 1560 

ATACACATCT ATTATATAAT ACTTGTGAAA AGTATTGTCT TGGGGCTGTG TTTTTTTACT 1620 

TTTGGGGCGT ATTTCTTTAT AATTCATTAC ATAAATGTAA GGGCTTTAGT TTTCATGTTT 1680 

TATTAAGTCT AACTGAGATT TTGAAAGGAT GTTTAGCAAC AATGGATAAA GAATTATGGA 1740

TAGAACGAGC TAATGATAGT TTAGTTAAAC ATTTTTATGa GCAGCAATCT GACATTGAAC 1800 

AGCGAGAAGG TTTTGAAAGT AAATTAACAT TTGGTACTGC GGGTATACGC GGAAAATTCG 1860 

GTCTTGGTGA AGGTCGACTT AATAAGTTTA CTATTGAAAA ATTGGCATTA GGTTTAGCGC 1920 

GTTATTTAAA TGCCCAAACA AACAGTCCAA CAATAGTCAT TCATTATGAT ATTAGACATC 1980 

TTTCcAACTG AATTCGCCC 1999 
(2) INFORMATION FOR SEQ ID NO: 290: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1933 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 290:

GATGACTTTT CCCCCTCATA ATCTTCATGG TCCAGGCGTC CATTAATGCG TCAAAGGATG 60 

GCACATTTTA CCTGGAACAA ATGATTCATA TGGTTCATAA AAATCACGCG TCGTAATATA 120 

ATCTTCTAAA TCAAATGCAT AGAAAATCAT TGGCTTTTTA AATACTGCAT ATTCATATAT 180 

TAAAGATGAA TAGTCACTAA TTAATAAATC TGTTATGAAC AGTATATCAT TAACTTCTCT 240 

AAAGTCAGAA ACGTCAACAA AATATTGTTT ATGTTTGTCT GCAATATTAA GTCTATTTTT 300 

CACAAATGGA TGCATTTTAA ATAATACAAC CGCGTTATTT TTTTCGCAAT ATCTTGCTAA 360 
ACGTTCAAAA TCAATTTTGA AAAATGGGTA ATGTGCTGTA CCATGACCAC TACCTCTAAA 420
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TTGTTTGATC TGTGTCGCAT AAGCTTCATC AAATAGTACA TCAGTACGTG GAACACCTGT 540 

AGGCACTACA TTTTTCTCTT TAATACCAAA TGCTTCAGCG TAGAATGGAA TATCGGTTTC 600

AGATGATACA TAAGCTTTTG TATAGCTACG ATGATTTAAT GAATCAATAA ATGGTCCACC 660 

CTTTTTACCA GTACGACTAA AGCCAACTGT TTTAAAGGCA CCAACGGCAT GCCATACTTG 720 

AATAACTTCT TGAGAACGTC TAAAACGCAC TGTATAAATC AATGGGTGAA AGTCATCAAC 780 

AAAGATGTAG TCTGCCTTCC CAAGTAAATA TGGCAATCTA AACTTGTCGA TGATGCCACG 840

TCTATCTGTA ATATTCGCTT TAAAAACAGT GTGAATATCA TACTTTTTAT CTAAATTTTG 900 

ACGTAACATT TCGTTATAGA TGTATTCAAA GTTTCCAGAC ATCGTTGGTC TAGAGTCTGA 960 

TGTGAACAAC ACCGTATTCC CTTTTTTCAA GTGGAAAAAT TTCGTCGTAT TAAATATCGC 1020

TTTAAAAATA AATTGTCTTG TATTAAATGA TTGTTTGCGG AAATACTTAC GTAATTCTTT 1080

ATATTTACGA ACGATATAAA TACTTTTAAC TTCCGGAGTC GTTACAACAA CATCAAGGAC 1140

AAATTCATTA ACATCGCTAG AAATTTCAGG TGTAACAGTA TAAACCGTTT TCTTCGAAAT 1200 

GCCGCCTTTT CTAAATTCTT TTAGGTAAGT CTGCAATAAG AAATTGATTT TACCATTTTG 1260 

TGTTTCTAAT TCGTTGTATT CTTCTTCTTG TTCTGGCTTT AGATTTTGAT ATGCATCATT 1320

AATCACATCT GGGTTTAACT GTGCAATATA ATCAAGTTCT TGCTCATTCA CTAATAAGTA 1380 

CTTATCTTCA GGTAAGTAAT AACCATTATC TAAGATAGCT ACATTGAAAC GACAAACGAA 1440 

TTGATTCCCA TCTATTTTGA CATCATTCGC CTTCATTGTA CGTGTCTCAG TTAAATTTCT 1500 

TAATACAAAA TTACTATCTT CTAAATCTAG GTTTTCACTA TGTCCTTCAA CGAATAACTG 1560 

AACACGTTCC CAATAGATTT TATCTATATA TATCTTACTT TTAACCAACG TTAATTCATC 1620

CTTTTCTATT TACATAATCC ATTTTAATAC TGTTTTACCC CAAGATGTAG ACAGGTCTGC 1680 

TTCAAAAGCT TCTGTAAGAT CATTAATTGT TGcAATTTCA AATTCTTGAC CTTTTAACAA 1740 

CGCTAATTTA nCTACAATAT CTGGGTATTG AATGTATAAG TCTACAACAT CTTGGAAATC 1800

TTTTGAACCA CTTCGACTAC TACCAATCAA CGTTAGTCCT TTTTCCAATA CTAGACGTGT 1860 

ATTAACTTCT ACTGGGAACT CACTTACACC TAACAGTnCA ATGCTTCCTT CTGGTGAAAT 1920 

GTAATCGATC ATT 1933

(2) INFORMATION FOR SEQ ID NO: 291:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 2049 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 291: 

nGTnCGGnCA GATATATTGG TGGTCTTTAG TAAGTGTATC AAATTCATCA GATGTCAAGG 60 

GCATGTTATC ACCTCCTTAG GTTGATAACA aCATTATACa CGaAAGGAGC ATAAaCAAaT 120

GAACACAaGA TCAGAAGGAT TGCGTATAGG CGTCCCACAA GTTTCTAGCA AAGCTGATGC 180 

TTCTTCATCC TATTTAACGG AAAAGGAACG TAACTTAGGA GCGGAAATAT TAGAACTTAT 240 

TAAAAAAAGT GATTACAGCT ACTTAGAAAT AAACAAAGTT TTCTATGCAT TAGATAGAGA 300 

ACTTCAATAC AGGGCGAATA ATAACAAACT TTAaCATTTA TCTAAAGGAG TGATAGAGAT 360 

GCCAAAAATC ATAATACCAC CAACACCAGA AAACACATAT CGAGGCGAAG AAAAATTTGT 420 

GAAAAAGTTA TACGCAACAC CTACACAAAT CCATCAATTG TTTGGAGTAT GTAGAAGTAC 480 

AGTATACAAC TGGTTGAAAT ATTACCGTGA AGATAATTTA GGTGTAGAAA ATTTATACAT 540

TGATTATTCA GCAACGGGAA CATTGATTAA TATTTCTAAA TTAGAAGAGT ATTTGATCAG 600

AAAGCATAAA AAATGGTATT AGGAGGATTA TCAAATGAGC GACACATATA AAAGCTACCT 660 

ATTAGCAGTG TTGTGCTTCA CGGTCTTAGC GATTGTACTC ATGCCATTGC TGTACTTCAC 720 

TACAGCATGG TCAATTGCAG GATTCGCAAG TATAGTGACA TTCATATTTT ATAAGGAATA 780 

CTTTTATGAA GAATAAAAAA ACTGCTACTT GTTGGAGCAA GTAACAGTGC AAGATGAGCA 840 

ATTGTCTTAA ATAATTATAT AAGGAGTTAT TAATATGACC TTACAACAAA AAATACTATC 900 

ACATTTTGCA ACATATGACA ATTTCAATTC TGATGATGTT GTTGAAACTT TTGGGATATC 960 

TAAAACACAT GCAAAATCCA CACTTTCAAA ACTTAAGAAA AAAGGAAAGA TTGCAATGGA 1020 

AAGTTGGGGT GTCTGGCGTG TTATTGAATC GCAATTGCAT TTAAGTGTAG TCGAACGTAA 1080 

AAAAGAAATT TTAGAAGAAC AATTTGAATT GTTAGCAAGA TTAAATGAAC AAAGTGATGA 1140 

CCCTAGAGAA ATAGAAGAAC GTATCAAGTT AATGATTCGT CTAGCTAACC AATTTTAAGG 1200 

AGGAGTTAAT CAATGGCAGT ATTAGAAGGT ATTTTTGAAG AATTAAAACT ATTAAATAAG 1260

AACTTACGTG TGTTAAACAC TGAACTATCA ACTGTAGATT CATCAATTGT ACAAGAGAAA 1320 

GTTAAAGAAG CACCAATGCC AAAAGAAGAA ACAGCTCAAC TGGAATCAAT TGAAGAAGTT 1380 

AAGGAAACTT CTGCTGATTT GACTAAAGAT TATGTTTTAT CAGTAGGAAA AGAGTTCCTT 1440 

AAAAAAGCAG ACACTTCTGA TAAGAAAGAA TTTAGAAATA AACTTAACGA ACTTGGTGCG 1500 

GATAAGCTAT CTACTATCAA AGAAGAGCAT TATGAAAAAA TTGTTGATTT TATGAATGCG 1560 

AGAATAAATG CATGAAGCTA GATCACTCAA ATAGAGCTCA TGCAAAGCTT AGTGCAAGTG 1620 

GAGCAAAACA ATGGCTAAAC TGTCCACCGA GTATTAAGGC AAGTGAAGGT ATTGCAGATA 1680 
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G7CTTAAATA TGAAGGCCTA ACACAGTTTG AGTTTAATAA AGCTTTTCAA AATTATAAGC 1800 

GAAATCAATA TTACAGTGAA GAGTTGCGCG AATATGTTGa AGAGTACGTA GCTAATGTAG 1860 

AAGAAAAGTA TAACGAGGCT TTGaGTAGAG ATGACGATGT AATAGCTTTA TTTGAAACAA 1920 

AATTGGATyT AGGTAAATAC GTCCCTGAAT CTTTTGGTAy TGGTGATGTC AtTATATTTT 1960

CAGGTGGTGT ACTTGAAATT ATTGACCTTA AATACGGTAA AGGCATTGAA GTTTCAGCTA 2040 

TAGATAATC 2049
(2) INFORMATION FOR SEQ ID NO: 292:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 292:
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ATGATGTTTC TATATTCGTA TTAGGAAAAC CTGTTGGTAT TACAACAAAC GCCCTAAAAT 60

TACAATACCG CTGCTCCTAT ACCAATTGCA ACAACAGTTT TAACTGAAAT ATCTTGTTTT 120

TTCATCTTCA TTACTCCTtt ACATAAAAAA TTCATTATAT TGATGGTGCT TTAGATAAAT 180

GAATCGTCCA ATCATTTCCA GTACCAATAT GATATAAATC TGAAAATGAG TCTTGATTGA 240

CTGCTACACC AATATTTACT AGCGAGTTAA CATACACAAG AGGTTCACCC ACATTAACAT 300

CTGCAAACGA TCGCGCAAAT TTAATAATAT TTTGATAGAC TTTCTTATCT TGATGATAAA 360

TTGTTACCAC CAAATTATTA CCATGAACAA TTTCCAAGGA TTTTAAGAAT GCCAATGGAA 420

TATTTGTCCA TAATGACCCA AATCTGATAT CTAAAATATC AATGCTTCCT GTAACAGAAT 480

CCTCATTTTT TGTCACTTCT CTTATTTCTA ATGCCTCAAT ACTATCAACA TTAATTGCCT 540

GACCGAGACG TTCAAACGCT ATCTTATTTG CAGCTAATCT CGCACCATTG TATGCATAAA 600

CATCTCTACC ATGAAAAGTA TGACTTTCTT CCGAATGAGG CAATCGGCTT TTCACTTCAT 660

CAATTTCGAT AACTTTTTTA ATACCTTCGT AATGTTTAAT ATGACTTAAA GAACCATTAT 720

CAGGTGTAAT AATGTAATGA CCTGAATATG TTAAGCAAGC AATGTCCGCC TATCACTACC 780

TACACCCGGG TCTACCACTG ATACAAAraC TGTGCCTTTA GGCCAGTATT TTACAGTTTG 840

ATATAAACGA TATGACGCTA CCCAAATGTC ATACGGTGGT ATATCATGCG TTAAGTTTTC 900

AACACGTATA TCATCATTAC AGTATATGCA ACTCCATACA TT 942



(2) INFORMATION FOR SEQ ID NO: 293: 
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(A) LENGTH: 1268 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 293:

TGTAATAAAA TTTTATGnAA CATGCTGnGA TGCTACCATG AACCTTCTTC ATTTCTTTAT 60

GTGAGATTGT GaAATTAATC AAATAATAAT ACGGTGGATA CTTTCCTAAT TTACGATATT 120 

CCATTTCCTG ACGATAAAAT GTTAAATAAT CATTTTTTTG AACATCCAAT ATTGAATAAT 180

GATCTGGATT ATACGTTTGA ATGATGACTT GACCTGCCTT TTCATGACGA CCAGCTCTAC 240

CAGCCACTTG CGTTAATAGT TGATAAGTAC GTTCGCTCGC CCGAAAATCA GGTAAATTTA 300 

ACATTGTATC TGCATTCAGC ACACCAACTA AAGTAATATT TGGATAATCT AATCCTTTCG 360 

CAATCATCTG AGTACCTAGT AAAATGTCAC CGTTACCTTT TTCGAATTCA GTCAATAACT 420

TTTCATGTGC ACCTTTCTTT GAGGTTGTAT CTACATCCAT CCTAATTATG CGCGCATCTT 480 

CAAATTCTTG TTGCAATAGT TCTTCAACTT TCTGAGTACC AGTACCTACT TGTCGAATGT 540

GTTCACTCTC ACAATTTGGA CATTGATTCG GTGGCGTCTC TTGGTAACCA CAATAGTGAC 600

ATTTTAATAA GTCTGTCGTT TTATGATACG TTAATGAAAT ATCACAGTTT GGACATTGCG 660

GTACATATCC ACAATCCCGA CATAACATAA ACGATGCATA ACCACGTCGA TTTAAAAATA 720

AAACAACTTG TTCCTGTCGA TCTAATCTTA ATTGTATGGC TTCACGTAAA TCTTTTGAAA 780

ACATTGACCG ATTACCTTCA CTCAATTCTT CACGCATGTC TACTATATCA ATTTCAGGTA 840

AAGCTTGTTG GTTCACTCTG TTTGGTAATG ATAGCAAATG ATAAACGCCT TTTTCAGCTC 900

GTGCATAACT TTCAAGACAT GGTGTTGCAC TTCCTAAAAT GACTGGACAG TGATGATATT 960

CAcTTCGCCA TTGGGCAATT TCTCTAGCGT GATATCTCGG ATAATCTTCT TGTTTATATG 1020

TAGATTCATG TTCTTCATCA ATGATGATTA ACCCTAAATT TTTGAAAGGT GCGAACACAC 1080

TTGACCTTGC ACCAACACTT ACTCTCGCAC GACCATCCCT AATTTTTTGC CATTCATCAT 1140

AACGTTCCCC ATTAGATAAG CCAGAATGTA ATACAGCAAC GTCATCACCA AATCGACGTT 1200

TGAAGCGTAA AACCATTTGC GGTGTTAGAG CGATTTCAGG AACTAACATC ATCGCCTGTT 1260

TTCCTTGG 1268
(2) INFORMATION FOR SEQ ID NO: 294:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 629 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 294: 
TACCACCAAA TAATATATTA GCTGGCATTT TAATAACATT TAnATTTGTC ATGATATCAT 
CAATAAAATG TTGAAACTTC GTAATTTCAC CTTCATAATC ATCAATTGCT GCTAATTGCG 
CACTCGATGC TTGCTGATCT AAATTTAAAA TATTCGACAT GCGTTGACTA TAATAAACTA 
AATGTTCTAT TAAGCCATCG TCACTCTTTT CCTTTGGTGC TGACATGACA GCGATACGTT 
TCAAAGGATA GTGTTGCGCC AATTTTAATG TCATTAATCC ACCTAAAGAC ACACCCGTTG 
CACTGATAGA TTCATAACCT TCATTGACTA AAAATTGGTA AGCTTTCTCA ACTTCTTCCC 
ACCAATCATC TACATTATAT GTCATGAAAT CTTTCAACAA TAAACCATGA CCTGGATAAT 
TCGGTGCATA ACAACTAAAT CCTTGGTCAT TTAACTCAGC TGCAAGATGC TTCACATCCC 
GATTTGTACC TGTAAATGAA TGTAATAATA ATATCGCATG TCCATTTGTG CCTTTTAAAT 
ACGtGGACTC GGTGTTTTAA TTCTCATTTT TCtaTATAcC TCCACTAtGT CTAAAGaxGT 
TkGCTAAACG CGTTGtCGTC GATGATTAA 
(2) INFORMATION FOR SEQ ID NO: 295:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2817 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 295: 
TATGAAAGTA ATGAATGGTA ATATTATTAA ATTTGATGGA AAAGTAGATA TTGATAATGC 
AGATAATATC GGTTTTTTAA TTGAGCATCC TAAATTATAT GATAATAAAT CAGGATTGTA
TAACTTGAAA TTATTTGCAC AAGTATTAGG TAAGGGTTTT GATAAAGCAT ACACAGACAA 
AATTATAGAT GCATTTGGTA TGAGACCTTA TATTAAAAAG AAAGTTAAGA AATATTCAAT 
GGGGATGAAG CAAAAGTTAG CAATTGCAGT ATCTTTArTG AATAAACCTA AATTTTTAAT 
CTTGGATGAG CCTACAAATG GkATGGATCC AGATGGCTCa ATTGATGTGC TGACTACAAT 
TAAGTCTTTA GkAAATGaAC TTGATATGAG AATTCtAATA TCAAGTCATA AGTTAGAAGA 
TATTGAATTA ATTTGTGATA GAGCTGTATT TTTAAGAGAC GGnCATTTTG TTCAAGATGT 
AAACATGGAG GAAGGTGTTG CATCTGACAC AACGATAGTT ACTGTTGATC ATAAAGACTT 
TGATAGAACT GAAAAATATC TTGCAGAGCA TTTCCAATTA CAAAATGTCG ACAAAGCAGA 
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10 



15 



ATTAGATATT TATCCGAAAT ATATTGAAAC ACGTAAAAGT TCATTGCGTG ATACGTACTT 720 

CAATATAAAT CAAAGAGGTG ATAAATAATG AGAATTTTAA ATTTAGTTAA GTATGATTTT 780 

TATAGTATAT TTAAaArTCC TTTAACATAT TTAGCGaTAC TAGTCGTATC TAGTTTGATT 840 

GCAACTCAAA GTATACTTAT GGCAAATTCG ATGGATAACC CGAAACATAT TATTGTCTAT 900 

GGATCTGTAT TTGCTGCAGC AAAATGGTTA TTGTTAATAA TTGGATTAAT GTTTGTTGTT 960 

AAGACAATTA CGCGTGATTT TTCACAAGGT ACAaTTCAAC TATATATGAG TAAAGTTAAA 1020 

ACACGCGTTG GAtACATTAT TTCGAAAACA ATTTCAATTa TTTTAATTTC AATATTATTT 1080 

GCATTAATTC ATTATGTGAT TTTGATTGTT GTGCAGGCAT CTAGTAATGG AAAAAATTTG 1140 

GCGTTTTCTA AATATGTAGA TAATTTATGG TTCTTCCTAA TCTTTTTACT ATTCTTTGGC 1200 

TTGTTTTTAT TCTTAATCAC ACTTGCATCA CAAAAAACAG CAATGATATT TTCATTAGGT 1260 

GTATTTTTAG TACTGATTGT ACCGTTTATT AAACCTTTTA TTACATTTAT CCCAAGATAC 1320

GGTGAAAAAG TTTTAGATGC TTTTGATTAT ATCCCTTTTG CTTACTTAAC TGATAAAATG 1380 

ATTAGCTCTA ACTTTGATTT TAGCAATTGG CAATGGGTAA TTTCATTAGG TTCTATAGTG 1440 

ATATTCTTCA TTTTGAATAT CTTATATGTC GCTAAAAAAG ACATyTAATA AAAATAATTT 1500 

TGAGGTTGGG AATTTTAAAT TTTCCCAACC TCAAAGTTTG TCTTATTGTA AATTTATTTA 1560 

TTTTCTAATT TATTTAGGAT GGAATTATAA ACTGCTTTCC AAAATGAAGC GTCAGTTTTA 1620 

TAGCGGTTTG ATATAACTAA GTGTGTTTCT TTTTCTAAAT CTGCATAGTC TGGATGATCT 1680 

TTGCTCGGTA ATTTATCAGC ACGAACATCA GTTACAAATT TTTGGACTTC ATTTGCTCTT 1740

GGTCCCCAAA CTGTTTCTTG TTCGAATTGA TCATTCAAGA ATACGAAGAT AGGAATTGCA 1800 

CGTGATTTAC CATTTGTTAA ATATTGATCG ATCAGTTTTG TATCATCATC TCTATGGAAC 1860 

ACGCGTACTT CTAAATTTAA TGCTTCACTG ATGTGTTTTA GAATTGGGAG ATTCATCATT 1920 

GCATCTCCAC ACCAGTCTTC AGTAATTACT AATACTTTAG AATAATTCAT CTCTTTTATT 1980

TTTTTGATGC GTGAATCATC TTCTGGTAAC TCAAATGATT GATAGATACT GAGAACGGTA 2040

TCTTGATTTG TCTTCATTCC ATCAATGTAT TCATTTAAGG GTTGGCTATT TTTGAAATAA 2100 

GTTTCTAAAT TTGTCATTGT AAAAACCTCC TTTAGCATTT ACAACATTAT ACCAATTTAT 2160 

AGTAATAAAA GGTAATGAAA TAAATTAATT GCAAATTCTT TGTTAATTTT TGTTAAGGAT 2220 

GAAACGGGAA GCACCTTATG CTATATTTAA ATAAGTACAA AGAAAGGGTG ACATCAGTGC 2280 

GTATTCAAAA TCGCTGGGTT GTGTTTATAT TATTTTTAAT CTGTTCTTTT GGTGTATTAA 2340 

TTGGTTTATA TCAATATCGT CATACGAAAA CTGTGGATTT GTCTAATCTT GAAATAAACG 2400
TTGATCGTTT TAAATTTTAT AACAGTAAAG CTCACCCTGA TCTTACCGTT AAAGTGAGAG 2520

AAAAGGATAA CATCGTTAAG GGGATAATAT TAGTAAGAGA TGAAAAGATA CATACTAATT 2580

TTGAtGGGGG AATTGGTTCG CCGATAAATA ACGCgATTGA AAATCTTGGa TTCgGATATA 2640

AAAGrACaAA AGTTGGcAAT GrtTkCtCAT CgGTAAAGTA TATTGATAGA GATAACCATT 2700

TAAAATTAAA CTTACTTTAT CAAGATTTAG AAATTAAACG TATTGAATTT TTTAGTAAAT 2760

AGCTTTAGGT CTTAAAGTTw TAAAAAACGA ATGAaTAATT TTATTGGGAT GAGTGAC 2817
(2) INFORMATION FOR SEQ ID NO: 296:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1607 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 296:
TCTGTTAAAA TGATTTTTCT TTTAnAAAGG CCGnAAATCA ATGTTCGATT nTTATTTGCA 60

TTATGGTCTC GATATTGGTA GAATATCAAA TGGTTAAATG AGAAAAACTT GGAGGTGCTC 120

ACATGTCATC AATCGTAGTA GTTGGGACAC AATGGGGAGA CGAAGGAAAA GGAAAAATAA 180

CGGATTTCTT GGcAGAACAG TCAGATGTTA TCGCGCGTTT TTCAGGTGGT AATAATGCAG 240

GCCATACCAT TCAATTTGGC GGAGAAACAT ATAAATTACA TTTAGTACCA TCTGGTATCT 300

TTTACAAAGA CAAATTAGCG GTAATCGGTA ACGGAGTCGT TGTTGATCCA GTTGCACTAT 360

TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 420

ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 480

gTGGtGACAA TAAGATTGGT ACAACTAAAA AAGGTATCGG TCCAGCATAT GTAGACAAAG 540

TTCAACGTAT CGGTATTCGT ATGGCAGATT TACTTGAAAA AGAAACATTC GAAAGATTAT 600

TAAAATCAAA CATTGAATAT AAACAAGCAT ATTTCAAAGG TATGTTTAAC GAAACATGTC 660

CATCATTTGA TGATATCTTT GAAGAATATT ATGCAGCAGG TCAACGTCTA AAAGAATTTG 720

TAACAGACAC ATCAAAAATC TTAGACGATG CATTTGTAGC AGATGAAAAG GTACTTTTCG 780

AAGGTGCGCA AGGTGTAATG TTAGATATCG ACCATGGTAC ATATCCATTC GTTACATCAA 840

GTAATCCAAT TGCAGGTAAC GTTACTGTTG GTACAGGTGT AGGTCCTACA TTCGTTTCAA 900

AGGTAATTGG TGTATGTAAA GCTTATACAT CACGTGTTGG TGATGGTCCA TTCCCTACTG 960

AATTATTCGA TGAAGATGGA CATCATATTA GAGAGGTTGG TCGTGAATAC GGTACAACAA 1020

TAAGTGGTAT TACAGATTTA TCTATTAACT CAATCGATGT TTTAACAGGC CTAGACACAG 1140

TGAAAATCTG TACAGCTTAT GAATTAGACG GTAAAGAAAT TACTGAGTAC CCAGCAAACT 1200 

TAGATCAATT AAAACGTTGT AAACCAATCT TTGAAGAGTT ACCAGGTTGG ACAGAAGACG 1260

TAACAAATGT GCGTACTTTA GAAGAATTAC CTGAAAATGC ACGTAAATAT TTAGAGCGTA 1320 

TTTCAGAATT ATGTAATGTA CAAATTTCTA TCTTCTCAGT TGGTCCAGAT AGAGAACAAA 1380 

CAAACCTATT AAAAGAATTG TGGTAGAACT TTATATAAGT CATACACAAT GATTATAAAT 1440 

ACATGAGCCT TCTATCTTTA TTGGTAGGAG GCTTTTGTTA TGCTTGCTTC TGTATCGATT 1500 

CGATTATTTA GATAAAAAAT ACTAACGTAA AGGCGATATT TGCTAGTCAT AATTTAGAAG 1560 

rTTAgATGAt AtTtAACGAA AAtTAAgATG anATACtTGA ATGGTAA 1607 
(2) INFORMATION FOR SEQ ID NO: 297: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3055 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 297:
TTAGAAGTAA GCACTTTAAT ATCTTTACCT ACCCATGTAC CAACACCTTC TTTAGGCTTT 60

GGATTTTCAG CATGGTTATT TGATTTATTC ACCTGTTCAG ATCTATTTAC TTTATTACCT 120

TTATTCGGGT TCTCTAATAC ATCAAATTTT AATCTCGGTG AATAAAAAAG ATATATTAAA 180

AATCCAACTA AAAATAATAC ACCGACGACT CTTATAATTA ACITITTCAT CAATCAACCA 240

CCTAAAAAGT ATTAATACTA TTGTAAAAAA CAACACATTA ATTAGCAAAT TTTCAACACT 300

GACATAACTG TGTCGTTTCG ATAAAACAAA ATCATCTTCA GGCATAATTT TAACGTCATA 360

AAGATTTTCC CTACACTCTA TATCATAACC TATCTTTATG TTTTCAGGTT GAATTTCATT 420

ATTAAGATTA AAATATGTAT AAAAAAATGG ACAGTTAAGG TATCAAATGA ATACCATCAA 480

CTGTCCGACT ATTCTTCATC AAAAAACCTG ATAAAACAAA TTGCCTTATC AGATTAGTAT 540

CATTTGTATA AGCATATTAA TGACCAAATG TTGCTTTAAT CAGTGATGTC GTTTCTCCAC 600

CTGGATATAA TACATATAAT AATAAGTAAA CTGCTACACC TGTAATTGCA GTACAGAACC 660

ATATAACTGA AGCGAATGGC CCGAATTTGC GGTGTACATT ATATTTATCT TTAAATGCAG 720

TAATAATTTG AACTAGGCCT AGAATACCAC CAATTGTTGC TAAATTAATG TGGAAAAATA 780

AGAAAATCGT ATAATATTTC TTAATTGATG CTGGTCCGCC AAATGCTGTA TTACCGATAA 840

TAACATTTTT GTGTTTATTT ATTTCCCTTT TCCAAATAAG TCTCCAACCA ATGGCAATTA 960

AAATTGCACT AATGACAATA CATGTCGTAC TAATCGTTGG TAAAATTGGA ACGCCCATAT 1020 

TTTTCATCCT AACTTAATTA ATCTAGATCA AAGTAAGTAA TGAAACAATC ACAGCTAACA 1080

CGAAAAAGAT CACTAAATAA TTTAGTGAAT ATATAAACAT TTGTGTTGCC CATTTTGTTT 1140

GATCTGAATT TTTCTTAAAT GTTGTTAAAC CTAATGCAAT CCATCCTAAA TTTAATAAGG 1200 

TrGCTAACAC TaCGAATACG ACACCTAAAT TTATTAGTAA TAAAGGTACT GGCAATAAAA 1260 

TAATCAACCA GATAAACATA CTGACACGTG TACGTTTAAA GCCCTTAACT GATGGTAACA 1320 

TTGGAATATT TGCAAGTGCA TATTCATCTT TACGTTTAAT AGCTAAGGcA TAAAAATGAA 1380 

TTGGtTGCCA ACmAAATACA ACTAAAAACA GCGCAATCGC TGTTAAACTA ATTTGTCCTT 1440 

CAATTGCAAC CCATCCAATT AGTGGTGGTA CTGCTCCAGG AAAACTCCCA ATCACTGTGT 1500 

TCCATGTTGT ATGTCTTTTA GACCATATTG AGTAATAAGA CACATAACCT ACAATCCCCA 1560

TAAGACCAAG TACGCCTGAT GGTATATTCA ATAAAAACAA ACAAATTTCT CCAACTAACA 1620 

TCATACCAAA ACTTAATAGT AATAAATTTT GATCTGTAAT TCTATTATTT ACAGTTGGTC 1680 

TATTTTGTTT ACTAGGCATA ATACGATCAA TATCTTGGTC GTAATAATTA TTTAACGCAC 1740

ATGCGCCACC CATAATTAAA GTAGATCCAA ATAGCATTAA TAAAATTTGA GGTATTGATG 1800 

ATAAGAAGGA ATGATTTGTC ATTACAACTG CTAGCCATGC GCcCGCAAAA GCCGGAATTA 1860 

AGTTACCTTG AACAAGTCCC ATTTTAATTA TCTGTTGCAA TTCTTTGAAG TTAACTCTGC 1920 

TAATATTTTG TGACmAAGTA TGCTCTTTGC TCATAATCCC CCTCCTTAAA TTTGTTCATA 1980 

TAAGATTATG ATATCTTAGA TTGCATAAAA AGACTAGGTT TAATAAAATT AAATTGTGAC 2040 

AAATTAACGA CAAGAGAAAA TGTCAATTTT GTGACACAAA TAACATTTAA TTTATTGCTA 2100 

TAATGTATAT GTTAGAAAAT TTTAATAAGT AGAATCATGC ATCTAAAAGA GATTAATATT 2160 

TAAGCTTCAA ATTTGAGTAA ACGTGGATTA CATAATTATC CCAATAAAAA AATCATTACG 2220 

ATTAAGTTCT TTTTATGTCG TCCACATACA ATACTTGTAA AATTAAATCA TATTTCCTGC 2280

GTTGGATCCC ATCTTTTCAT ATCCTACAAT CAGGTCTATT TATAGTATCA TCTCAAATCC 2340 

GGCTATTAAT TCTAATTCTC AGTGATGCGT TTTTTTATGA TGGGGTGTAT AAATTGTTTG 2400

GCAAAAAGAA TTTAAAATGG TTAGGTGTCG TAGCAACGTT AATGATGACA TTTGTACAAC 2460

TTGGTGGAGC CTTAGTTACC AAAACCGGAT CAGCTGATGG TTGTGGTTCT TCTTGGCCAC 2520 

TATGTCATGG TGCGTTGATT CCAGAATTCT TTCCTATTGA TACGATTATT GAGTTAAGTC 2580

ATAGAGCCGT TTCAGCTTTG TCTTTATTAA TGGTCTTATG GTTAGTTATC ACTGCATGGA 2640 
TATTGCAAGC ATTAATCGGA GCTGCTGCTG TTATTTGGCA ACAAAACGAT TACGTTTTAG 2760

CATTGCACTT TGGTATATCA TTAATCAGTT TCTCATCTGT ATTTTTAATA ACATTGATTA 2820

TTTTCTCTAT AGATCAAAAA TATGAAGCTG ACGAATTATA TATCAAAAAG CCATTAAGAC 2880

GTTTAACATG GTTAATGGCA ATCATCATTT ATTGTGGTGT TTATACTGGT GCACTAGTGa 2940

GACATGCGGA TGCAAGTTTA GCATATGGTG GTTGGCCATT GCCATTACCA CGATCTTGTA 3000

CCACATTCAG GAACAAGATT GGGTTCAACT CACGCATCGT ATCAnGGTCn nTTAA 3055
(2) INFORMATION FOR SEQ ID NO: 298:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 748 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 298:
TTCTTATTTA AAGAAGTCAT TTTTAGAAAT TGTTGAGACT TTAAAAAATG ATCCGTATAA 60

AATAACACAA TCTTTTGAAA AATTAGAGCC TAAATATTTA GAGCGATATT CAAGAAGAAT 120

TAACCATCAG CACAGGGTCG TCTATACCGT AGATGATCGA AATAAAGAAG TATTAATACT 180

ATCGGCATGG TCACATTATG ATTAATGAAT ATTCAATATC TGAATAACTT TAATGATAAG 240

TTAATTAAAG AAACTAGTAT TTAAGTGTAG GGAAAATAGC GACGTTAATG CGTTGTTATC 300

TCTACACTTT TTAATTTTAT AATAGCGCAA GACTAAACAG ATTGAAATTA GTAACAATAA 360

AAGAATAACG TATTATAATA AGGAATTTTA AATTGTGACT TTTTCGGAAT ATTAAATTTT 420

AGAAATATGA GGTTTTTAAG CGGATTCCTC ACAAAATTTT AAAAATATTT AAGCCTGAAA 480

ATGATAAAGC GGTAGGGAAC GTTTTTCTGA AaGTTAGTGA TACAATAGTT 540

ATACAGGAGG ATGAATAACA TGAATCAGTC AGTCAAATTA CTTAAACATT TAACAGATGT 600

AAACGGCATT GCTGGTTATG AAATGCAAGT TAAAGAAGCA ATGCGTaACT ATATAGAGCC 660

TGTCAGTGAT CaAATTATTG AAGATAACTT GGGTGGCATT TTTGGAAAGA AAAATGCTGA 720

GAATGGTCAA TACTCAATTA TGGAnTTC 748
(2) INFORMATION FOR SEQ ID NO: 299: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4718 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 299:
ACCTCCGAAT AATTGATTCC ATTAACTTTT TTTGTGaAAA ACTTAAACAG GCGAAGTCTT 60

CAATAAGTGA AGATGTCCAA ATTATAAAAA ATACATTCCA AAAAGAAAAG TTAGGTACAG 120

TAATTACtAC ycTGGCGCAA GTGGTGGTGT TACGTATAAA CCAATGATGA GTAAAGAAGA 180

GGCGACTGAA GTTGTTAATG AGGTCATTAC TCTATTAGAA GAGAAAGAAC GTTTGTTAcC 240

TGGCGGATAT TTATTTTTAT CAGATTTGGT AGGTAATCCA TCGCTACTAA ACAAAGTTGG 300

TAAGTTAATT GCCAGTATTT ACATGGAAGA AAAATTAGAT GCTGTTGTTA CCATTGCGAC 360

AAAAGGTATT TCATTGGCAA ATGCGGTTGC TAATATTTTA AATTTACCAG TAGTAGTGAT 420

TAGAAAAGAC AACAAGGTGA CTGAAGGTTC TACAGTTTCA ATTAATTACG TTTCAGGATC 480

TTCAAGAAAA ATAGAAACAA TGGTACTTTC GAAGAGAACT TTAGCAGAAA ATTCAAATGT 540

TTTAGTTGTC GATGATTTTA TGAGGGCTGG TGGCTCTATT AATGGTGTTA TGAATTTAAT 600

GAATGAGTTT AAAGCCCATG TAAAAGGGGT ATCAGTACTT GTAGAATCAA AAGAAGTTAA 660

ACAAAGATTG ATTGAAGATT ATACTTCCTT AGTGAAATTA TCTGATGTAG ATGAATATAA 720

TCaAGAGTTT AACGTAGAAC CTGGCAACAG TTTATCTAAG TTTTCATAAA AGGAGTTTTA 780

GTATTATGAA AATCATTAAC ACAACAAGAT TACCGGAAGC ACTTGGACCA TATTCGCATG 840

CAACAGTTGT GAATGGTATG GTTTATACTT CTGGTCAGAT TCCATTGAAT ATTGATGGAC 900

ATATCGTAAG CGCTGATGTT CAAGCACAGA CAAAACAAGT TTTAGAAAAT TTAAAGGTTG 960

TTTTGGAAGA AGCAGGATCT GATTTGAATT CTGTTGCGAA AGCGACCATT TTCATTAAAG 1020

ATATGAATGA TTTCCAAAAA ATAAATGAAG TGTATGGTCA ATATTTTAAT GAACACAAGC 1080

CAGCGCGTAG TTGTGTAGAG GTTGCGCGTT TGCCAAAAGA TGTGAAAGTA GAAATTGAAT 1140

TACKVAGTAA AATTAAGGAA TTATAATTTT CGATTAATAT GTTTAATCAA GCTTCTAAAT 1200

AAAACAGAGA GATATATACT ATAGGGGGGC TCACTACATG AAAGTGACAG ATGTAAGACT 1260

TAGAAAAATA CAAACAGATG GACGAATGAA AGCACTCGTT TCCATTACAT TAGATGAAGC 1320

TTTCGTAATT CATGATTTAC GTGTAATTGA AGGAAACTCT GGCTTGTTCG TTGCAATGCC 1380

AAGTAAACGT ACACCAGATG GTGAATTCCG CGACATCGcg CATCCTATTA ATTCAGATAT 1440

GAGACAAGAA ATTCAAGATG CAGTGATGAA AGTATATGAT GAAACAGATG AAGTAGTACC 1500

AGATAAAAAC GCTACATCAG AAGATTCAGA AGAAGCTTAA TCAATTTTAT ATTTAGCGAT 1560

GTAATACATT TGCAATAAGT TGATTTGATA CTGTCGATAA AGCATAAAGC TTTGTCGGCA 1620

GTTTTTTTAG TTTGTATTAA TGTTTTTTTA TTTTTAATGA AAGGCTAATA AATATATACG 1680

TGaTGCTCGT ATTTTTGAAG TAAGAAAAAA GTTGTTTTTA AAATTACAAC GAATTAAAAA 1800

CAATGCCTTT TATATGTTGA AAGAGTATTG CaGATTAAAT TaTAATAATG ACGAaGgTAA 1860 

AATTTAATGG GGGTTAATGT TCATGCGAAG ACACGCGATA ATTTTGGCAG CAGGTAAAGG 1920

CACAAGAATG AAATCTAAAA AGTATAAAGT GCTACACGAG GTTGCTGGGA AACCTATGGT 1980 

CGAACATGTA TTGGAAAGTG TGAAAGGCTC TGGTGTCGAT CAAGTTGTAA CCATCGTAGG 2040 

ACATGGTGCT GAAAGTGTAA AAGGACATTT AGGCGAGCGT TCTTTATACA GTTTTCAAGA 2100 

GGAACAACTC GGTACTGCGC ATGCatGCAA ATGGCGAAAT CACACTTAGA AGACAAGGAA 2160 

GGTACGACAA TCGTTGTATG TGGTGACACA CCGCTCATCA CAAAGGAAAC ATTAGTAACA 2220 

TTGATTGCGC ATCACGAGGA TGCTAATGCT CAAGCAACTG TATTATCTGC ATCGATTCAA 2280 

CAACCATATG GATACGGAAG AATCGTTCOA AATGCGTCAG GTCGTTTAGA ACGCATAGTT 2340 

GAAGAGAAAG ATGCAACGCA AGCTGAAAAG GATATTAATG AAATTAGTTC AGGTATTTTT 2400

GCGTTTAATA ATAAAACGTT GTTTGAAAAA TTAACACAAG TGAAAAATGA TAATGCGCAA 2460 

GGTGAATATT ACCTCCCTGa TGTATTGTCG TTAATTTTAA ATGATGGCGG CATCGTAGAA 2520 

GTCTATCGTA CCAATGATGT TGAAGAAATC ATGGGTGTAA ATGATCGTGT AATGCTTAGT 2580

CAGGCTGAGA AGGCGATGCA ACGTCGTACG AATCATTATC ACATGCTAAA TGGTGTGACA 2640 

ATCATCGATC CTGACAGCAC TTATATTGGT CCAGACGTTA CAATTGGTAG TGATACAGTC 2700 

ATTGAACCAG GCGTACGAAT TAATGGTCGT ACAGAAATTG GCGAAGATGT TGTTATTGGT 2760

CAGTACTCTG AAATTAACAA TAGTACGATT GAAAATGGTG CATGTATTCA ACAGTCTGTT 2820 

GTTAATGATG CTAGCGTAGG AGCGAATACT AAGGTCGGAC CGTTTGCGCA ATTGAGACCA 2880 

GGCGCGCAAT TAGGTGGAGA TGTTAAGGTT GGAAATTTTG TAGAAATTAA AAAAGCAGAT 2940 

CTTAAAGATG GTGCCAAGGT TTCACATTTA AGTTATATTG GCGATGCTGT AATTGGCGAA 3000 

CGTACTAATA TTGGTTGCGG AACGATTACA GTTAACTATG ATGGTGAAAA TAAATTTAAA 3060 

AcTATCGTCG GCAAAGATTC ATTTGTAGGT TGCAATGTTA ATTTAGTAGC ACCTGTAACA 3120 

ATTGGTGATG ATGTATTGGT GGCAGCTGGT TCCACAATCA CAGATGACGT ACCAAATGAC 3180 

AGTTTAGCTG TGGCAAGAGC AAGACAAACA ACAAAAGAAG GATATAGGAA ATAATCATTT 3240

ACGTATTTAA AATGGCTAGG ATAAAAGGAT AATCCTATGT AATATTAATG TAATCTTTAT 3300

GATTTAATGA TTCGCATAGT AATGGAGTTA CATyTTATAT ATAATAGTAA TTGCGTAAGT 3360 

AAATAATTGG AGGACTATAA ATGTTAAATA ATGAATATAA GAATTCGTCA TTAAAGATTT 3420

TTTCATTGAA AGGAAACGAA GCATTAGCGC AAGAAGTTGC TGACCAAGTA GGAATTGAAC 3480 
GTATTCGTGG TTGTGACGTA TTTATTATTC AACCAACATC ATATCCTGTG AATCTACATT 3600

TAATGGAATT ATTAATTATG ATTGATGCTT GTAAACGTGC TTCTGCAGGA ACAATCAATA 3660 

TTGTAGTGCC ATATTATGGA TATGCAAGAC AAGATAGAAA AGCCCGTAGC CGTGAGCCAA 3720

TCACTGCTAA ATTAGTTGCA AACTTAATCG AAAcAGCtGG CGCAACTCGT ATGATTGCGT 3780 

TAGACTTACA TGCACCACAA ATTCAAGGAT TCTTTGATAT TCCAATTGAC CACTTAATGG 3840 

GTGTGCCAAT TCTTGCTAAA CATTTCAAAG ATGATCCGAA TATTAACCCA GAAGAATGTG 3900 

TCGTTGTTcA CCAGACCATG GCGGsnTTAC ACGTGCACGT AAATTAGCTG ACATTTTAAA 3960 

AACTCCAATT GCAATTATAG ATAAACGTCG TCCTAGACCA AATGTTGCTG AAGTGATGAA 4020

CATTGTTGGT GAGATTGAAG GACGTACGGC AATTATTATT GACGATATTA TTGATACAGC 4080 

AGGTACAATC ACTTTAGCTG CACAAGCATT AAAAGATAAA GGTGCTAAAG AAGTATATGC 4140 

TTGTTGTACA CACCCTGTTT TATCAGGACC GGCTAAAGAA CGTATCGAAA ATTCTGCTAT 4200

AAAAGAATTA ATCGTAACAA ACTCAATTCA TTTAGATGAA GATCGCAAAC CATCTAACAC 4260 

TAAAGAATTA TCTGTTGCTG GTTTAATCGC ACAAGCTATC ATTCGTGTAT ACGAAAGAGA 4320 

ATCAGTTAGC GTATTATTTG ACTAATATTT AAAAGGCGTT TGACGAACAT ATTCCAAACG 4380

TGTATAATAG TTTCGTTCGT GATTATACGA ATAAATAAAC ACTTGCAAGC AACGATGATG 4440 

TTGATGGGTA AGTGAGGTGC TCGTTTTGAG CAAAAATGAA AGGTGGAAAT GAGAATGGCT 4500 

TCATTAAAGT CAATCATCCG TCAAGGTAAA CAAACACGTT CAGATCTTAA ACAATTAAGA 4560 

AAATCTGGTA AAGTACCAGC AGTAGTATAC GGTTACGGTA CTAAAAACGT GTCAGTTAAA 4620 

GTTGATGAAG TAGAATTCAT CAAAGTTATC CGTGAAGTAG GTCGTAACGG TGTTATCGAA 4680 

ttaggcgttg gttctaaaac tatcaaagtt atggttgc 4718 
(2) INFORMATION FOR SEQ ID NO: 300:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3181 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 300:

AAAATGCATA TAAATACATA TTAAGGAGGA TTTTATGAAT TTTCTTAAAC CTGCAAAGCA 60

TATTAAGCCT TTGCCAGAAA ATCAGATAGA TGATACCTAT AAACGATTAC GTCTCCAAGT 120

ATTTCTTGGT ATTTTCATCG GTTACGCTGG GTACTATTTA TTACGTAAAA ACTTTTCGTT 180 
TGCTGTTTCC ATCGCATATG GATTTAGTAA GTTCTTTATG GGTACTGTAA gTGATCGGAG 300

CAATGCTCGG ATATTCTTAG TTCTTGGATT AGCACTCACT GCTATCGTCA ATTTGTTAAT 360

GGGATTTGTA CCGTTCTTTA CATCAGGTAT CGGTATTATG TTTGTCCTAT TATTCTTAAA 420

TGGATGGTTT CAAGGTATGG GCTGGCCACC TTCAGGCCGT GTTCTCGTTC ACTGGTTTAG 480

TGTAAGTGAA CGCGGAAGTA AGACTGCCCT TTGGAACGTT GCGCATAATG TTGGTGGAGG 540

TATTATGGCA CCTATTGCTG CTTGGGGTAT TACAACAACT GCATTTATCA ACTTTGGTTA 600

TTTAAAAGGT TTCGAAGGTG TATTCATTTA CCCTGCACTC TTAGCACTTA TCATTGCCGC 660

AATTTCATAC GTATTGATTA GAGACACACC TCAATCTCAA GGTTTACCTC CAATCGAAAT 720

TTATAAAAAT GACTTTGCTA CAAGCGATAA GAAAACATTA GAAACAGAAT TAACTACAAA 780

AGAAATTTTA TTTAAATATG TACTGAACAA TAAATGGGTA TGGGCAATTG CCTTTGCAAA 840

TATATTTGTT TATTTCGTGC GTTATGGTGT ACTTGATTGG GCGCCAGTCT ACTTAAGTGA 900

AGAAAAACAT TTCGACTTAA AAGCATCAGG TTGGGCATAC TTCTTATACG AATGGGCTGG 960

AATTCCTGGT ACATTATTAT GTGGTTACAT TTCTGATAAA TTATTCAAAG GTCGTCGTGG 1020

ACCTGCAGGT TTCTTCTTTA TGTTAGGTGT CACAGTATTT GTATTAATTT ATTGGTTAAA 1080

TCCTCCAGGC AATGCTTGGT TAGACAATGT CTCATTAATT GCCATTGGTT TCTTAATATA 1140

TGGACCAGTT ATGTTAATTG GTTTACAAGC ATTAGATTAT GTACCTAAAA AAGCAGCTGG 1200

CACAgcAGCT GGATTAACAG GATTATTTGG TTATCTGTTT GGTGCTGTAA TGGCCAACAT 1260

CGTCTTAGGT GCTGTAGTTG ATAAATTCGG ATGGGATGTC GGTTTTATTT TATTAACAGC 1320

AATTAGTGTG TTTGCAATGT TGAGCTTTAT CCTCACTTGG AATAAAGTAG GACAAGAAAC 1380

CGTTCATCAT TAAATGATAA AAAATAAAGT CATATGGTTA TCTTATCGAA AGATGATATA 1440

TTCffTCTCTT ATAAGTTCAA ccatatgact ttttattagt attcaaaaaa atatttacat 1500

TGCCACTTTT GTGTTTGCCC TGCTGTTTTA TTCAATTGAT TACACCACTT AGGATAAACT 1560

CTAAAAGCCA ttttcccttg ATACTTAGAT GAAGCTAAGA TACCTTTTTC TACCAATAAT 1620

TCCCTAGGAA ATAGAAAGTA GCCATTTAAT TCATCATCGA TAACAGCAAC AATCAGGTAA 1680

TCAGCAAACT CTTCAATTTT GTATGGCCGA TTATAATTAT CtTCGTCTTT TGTCCAACAT 1740

GTCACGAAAT ATCCCGATTT AGTCGGTGTT TTCTTAGCTA ACCTACTTTG ATATGTTTCC 1800

TCTTTAAAGC TAAATGTTAA TGCyTCGTAA TCTTGATTAT ATTTTTCTTC AGTTAAATCT 1860

TTAACTTCTG ATTCTTCACT AAAAATATTT TTCAACAGTA TTTTAGATTT ACACATATGC 1920

CAATCTCACT TTATTTTTTC ATAATCGTAT CATATATTTA TTTTTTTCGA AAAATACACT 1980

TGTATCAACT GTCAAACGAT CACGTAATAA ATAGACGATT AACATTGcTG CTAAAGCACA 2100

AAGTGACTCA GCAATTAATA ATGACCAAAT GACACCTGTT AGTCCAAACA AAGCATTCAT 2160 

AATAAATAAT ACTGGAATGA TAATTGCACC TTGTAAAATG GCCATAATTG TAGCACCACG 2220 

ACCTTGCCCA GTCGCTTGAA GCATACCAGT AAACAAGAAA CCTATACCAT TTAATAATAA 2280 

TGATGCCATT GTTACTTTCA AAATAAATGt CGCCATCTCA ACAATGGCTT GATCAGTAGT 2340 

AAATAGTCCG ACCATATGAT GTCCAATTGT AAATACAGCA CTCATACATA CAACAAAGAT 2400 

AACGCCGATA GACATGATAA CTGCTTTGAT AACGTCTTTC ATACGGCCTT TATTTGCCAT 2460 

AAAGTTATAT GCAATTAGTG GTACAACACC TTCACATAAT CCCATGATAA TAAGTTCTGG 2520 

aAATTGCACA AGTCTAAATG AGATACCATA ACTTGCaATC GCGAAGTTTC CATAATGTGC 2580

TAAAAATAAA TTTAAAACTA ATCCTGTGAA TCCCATTAAG ATACTCATTA AAAATGCAGG 2640 

AATACCGATT TTAAAGATTT CAGAAAGCAT TTCTTTATTA GGTTTCGCAA GTTTAATATT 2700 

AACTGACACA ACGTCACTAT TTTTCATAAA ATAAATGATA AAGAACAGAG CAGCAGCAAC 2760 

ATTACTGATT GCAGTACCCA AAGCTGcACC AACAACGTTT aAATCAAAAC CAAAAATTAA 2820 

AATTGGATCT AAAATAATAT TTAAGCCTAC ACTAGCTAAC ATACCAATCA TAGAAACCAT 2880 

TGGTGCCCCA ATTGCACGTG CAAATTGTTC TAATATGAAG AACAAAATTA CAAAAGGTGC 2940 

ACTTAAAAAC ATTACTTTCA AATAATTACT TGTTAAAGCT AACGTTTCAC CTCTCGCCCC 3000 

TAAAATTGCT GCGATTTGAT CACTGAATGG TAAAGTAACT AAAATCACGA TAAGTCCTAG 3060 

TGCAATACCA CCATAAATAG AGAAACTACT TACAAATTTA CTCTTAmTAT AGTCTTTCGC 3120 

ACCTAATAAA CGTGAAATAT AAGTTCCTGC ACCAACGCCA AATAAATTAC CTAACCCCAT 3180 

T 3181 

(2) INFORMATION FOR SEQ ID NO: 301:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4029 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 301:

TCATCACCTT CAAATAAATA GGCATGCGAT AATTTATTTG AATGATATGC ATTCGTCAAT 60 

TGTTGCTGTT CATCCATTCA TTACAAACTC CCTTTTGCTT TTATATAAAA AGGTTGCCAA 120

AGAGCAACTA ATTACTGCAA CAATAGCGAC CATGATTTGT GATTATTTTT AGAACCACAA 180 
TTATTAATAC ACTATTTTAA GTATCTCTAA TTGATGATTG TATTATAGAA TTAAAATTGA 300

TGGAATGCAT CAACTGGCAT AACAAATACA GTAGCACCGC CAACTTCAAC TTCAACTGGA 360

TATGGAATGT ACGAATCCGC ACTGCCTCCC ATAGGTGTAA TAGGTGAAAC CAACTGTTCT 420

CTATTACCAC ACGTTTGATT AATCACAGAC AATATTTCAT CTACACGGTC ATCATTGACA 480

CCACATAAGA ATGTTGTATT ACCCGCTCTT AAAAACCCAC CTGTTGTTGC CAATTTTGTT 540

GCTCTAAAGT TATTTTTAAC AAGTTGATCT GCAAGTTCCT GACTATCTTG ATCTTGTACG 600

ATCGCTATAA TCATTTTCAT TTTATAACAC CTCTTCTAAC AATTATATCA TATCTTTTCT 660

AAATATTTGA TGATAGTTTG ATACGTGTCT TCAACAACAT TTTCAAGAGG TTGATCTGCA 720

TTAACGCTTT TGAACCGTTG TGATTCATTA TGAATGATTT CTTGGTAACC TTCAATTACT 780

TTTTCGTGAA cTTTAAATCT TCTTGATCTA ATCTATTTTG ATCTCTTGAA TTTTTAATAA 840

TACGTTCGCG ACCTACTTCA GCACTAACAT TTAAATAAAT CGTCAAGTCT GGaTATAATC 900

CATTTATTGC AAATTCGTTT AATGCTCTTA CTTCTTCAAC GCCAATCCCT CTAGCATAAC 960

CTTGATAAGC TAATGAACTA TCGATATAGC GATCACACAA CACAACCTTA CCTTCTTTTA 1020

AAGCTGGTAT GACCTTTAAT ACAAGATGTT CTCTTCTAGA TGCAGCAAAT AACATTGcTT 1080

CaGTtCTAAT GTCCATATCA TTGCCcTTCT AATACAATTT TACGTATTTC TTCACCAGTA 1140

GGAACACCAC CTGGTTCTCT AGTCATAATG ACATCATAAT CTTTTACTAA TCTATGGTAA 1200

ACTTCATTAA TTACAGTTGT TTTTCCAGAG CCTTCTGGGC CCTCAAAAGT TATAAAAGCT 1260

GACATTTTAT TCATCCTCAA CTAAAATTTT ATTATTTTTA ATTCCTTCAA CTATCATTCC 1320

AGTTTCCAGA TATTCATTTA CCAATTCTAT CATATTTTCA GTTATTGTTT CTCCTTTGAA 1380

AATAATAGGA ATcCCTGGcG GATACGGGAC AATATGTCGC GCCAGAACTT TACCTTTgCC 1440

TTTTCAAATC ACACCAAGTA ACATATTCAA AGCGTTTTGG TTTATAGTTA CCTTCAGTyG 1500

TTAAAAGTTG TGTTTGTTTA ACTTTAGAAA CTGATriTTT CGGTAAAATC ATATCTTCAA 1560

TTTTACGTAA AAGCGAATCA AATAAATACG TATCATCATG ATGCCATAAC GGCAATATCG 1620

CTAATGCTTG ATAGTCGTCC GCTAATTCTA AATAGATATG TGCATTCATA AACCAATTTT 1680

GAATATCATG ACCTGTAAAA CCTTCATATT TTATCAGCAA CTTCAACGGA TCATCAACTT 1740

GAAGCATTTC AAAACCCTTC TTCTCCAAAC ATTCGATTAA TTGCGCTCTC TTATCAAAAA 1800

ACACGGTACT ATCATATGTT TTATAGAACT CGGCAGCTGA CTCTAAACTA GCCATAATCA 1860

AATACGAAGG ACTAGATGTT TGGAAGTAGC TTAGATATTC TATAATAGTT TCTCTATAAG 1920

GTGCATTTTT ATGAATATAA AGTACCGAGC CCATCGTTAA AGCTGGTAAC GTTTTATGAA 1980

CAAAGTGCGC GCCGTGTGCT TCGTCAATGA GTACAGGAAT ATTTAATTGG TGCAAAGATT 2100

TGATAACCTC TTCTACATTA AATGTTTCAC CGTAATAGTT AGGATAAGTC AACACAGCAA 2160 

GTTTGTGACC GTCATTATTC AAACGGCTTA AATTAACTTT ATTATAATGA TTCGTTAACG 2220 

GACTTTGATG CGTTTCAATA AAATGCCCTT CTTGTTGGCT AATATCGAGC GCATGTAACA 2280 

CAGATTTATG TACATTTCTT GCCATTAAGA TATCGCCTTT TTTCTGTGAA AAAGACTGGA 2340 

TGACAGATAA TATTCCTGAA GTGGTGCCAT TCACTAAGAA ATAAGCATCA TAATCTGAAT 2400 

GTTTCTCCAC CTGCTTCATA CTTTCCAAAA TGACTTCTTC AGGATGATGT AAATCAnCTn 2460 

AATCCAGGTA TTTCAGTTTT ATCCATTGTC ATTGATAATT GAGATAAATG ACCGATAGTC 2520 

ATATTTTTAT GACCCGGAAC ATGCAAAGAA ATCGCTTCTT CTTGATTTAA ACTTTCTAAT 2580 

TTATTTAAAA TAGGTTGCTT CATGATATAC GCTTCCTTTA TTTACACTGT TTTGGAATTA 2640

GTTACTTTCA AAAGTATTAA TTATATAGTA ACACTTCTTT GACAAAAGTT AGTGTTACTT 2700

ATGCAATAGC TTGTCTATTG TATAATAATT AATTTCCTTT TTGTACTTCG ATTTAAAAGA 2760 

TATTAGACAT AAAATCTAAA AACAGCAGTA AGATGATTTA TGATTAAAAA CTATCTTACT 2820 

GCTGTTCACT TTTTATAATA CTTCTGAATG TCTTCACTTA TACTTCTAGT CACAGATTTA 2880

AATAATCAAA AGTGCACATT ATTAAAATAT CAATTTCACA CTCAATGCGG CTCATCGCAT 2940

TCATTTCTTG TCTAGCAACG TTCTACTCTA GCGGAACGTA AGTTAGCTAC CATCCTCGCT 3000

AAGAACCTTT CTTGACTTGT GACAATCGCT TGCTTCTTTC CTCTCCTTCG GCTCTCGCTT 3060 

ACTCATTTAG CTCTACTAAA CTCGTTGCGC TCTTTTCTCG TTTCGTCAGA TTCAAACGTT 3120 

TTCACTTCGC CAAGCCATTT TTCTTTGTGT TTACTTTTTA TTTTGACGTT TTAGACATAA 3180 

AAAAAGAGAC cTCACGGTCT CAACTTGCCT GGCAACGTTC TACTCTAGCG GAACGTAAGT 3240 

TGGCTACCAT CGTCGCTAAA GACCTTTCTT GACTTGTGAC AATCGCTTGC TTCTTTCCTC 3300 

TCC1TCGGCT CTCGCTTACT CATTTAGCTC TACTAAACTC GTTGCGCTCT TTTCTCGTTT 3360 

CGTCAGATTC AAACGTTTTC ACTTCGCCAA GCCATTTTTC TTTGTGTTTA CTTTTTATTT 3420 

TGACGTTTTA GrCATAAAAA AAAGAGACcT TGCGGTCTCA ATGCGGCTCA TCGCATCCAT 3480 

TTTTTGCCTG GCAACGTTCT ACTCTAGCGG AACGTAAGTT GGCTACCATC GwCGCTAAgA 3540

aCCTTTCTTG ACTTGTGACA ATCGCTTGCT TCTTTCCTCT yCTTCGGCTC TCGCTTACTC 3600 

ATTTAGCTCT ACTAAACTCG TTGCGCTCTT TTCTCGTTTC GTGAGATTCA AACGTTTTCA 3660 

CTTCGCCAAG CCATTTTTCT TTGTGTTTGC TTTTTATTTT GACGTTTTAG ACATAAAAAA 3720

AAGAGACCTT GGGGTCTCAA TGCGGCTCAT CGCATCCATT TTTTGCCTGG CAACGTTCTA 3780 
CATGGGAACA GGTGTGACCT CCTTGCTATA GTCACCAGAC ATATGAATGT AATTTATACA 3900

TTCAAAACTA GATAGTAAGT gAAAAGTGGA TTTTGCTTCG CAAAACATTT ATTTTGGATT 3960 

AAGTCTTCGA TCGATTAGTG ATTCGTGCAG CTCCACATGT GCACCATGCT TGCCACCTCG 4020

GAACCTATT 4029
(2) INFORMATION FOR SEQ ID NO: 302:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7159 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 302:
GATGCAATAG TTGAGTAGTT ATAAGTAGCT ACATACAACC GCTCAAATAT AGGTTCAAGA 60

ACATTTTTAA TGAAGAAAGC GATTTTCCCm CAGAGTGCAA ACGCTTGACT CAATAACGAA 120

ATGTCACCTA TGTATGGCGT GACTTATTAT ACAGGaGGTG CAAAGTATGT TTGCTATTAT 180

TGAAACAGGT GGAAAACAAA TCAAAGTAGA AGAAGGTCAA GAAATCTTCG TTGAAAAATT 240

AGACGTAAAC GAAGGAGATA CTTTTACATT TGATAAAGTA TTATTTGTAG GTGGAGATTC 300

AGTTAAAGTT GGAGCGCCAA CAGTTGAAGG TGCAACAGTT ACTGCTACTG TTAATAAACA 360

AGGTCGCGGT AAAAAAATCA CTGTATTCAC ATACAAACGT CGTAAAAATT CAAAACGTAA 420

AAAAGGCCAT CGTCAACCAT ACACTAAATT AACAATCGAT AAAATCAACG CGTAATTATT 480

ATGATTACTG TTGATATTAC AGTTAATGAT GAAGGCAAAG TAACAGACGT TATTATGGAT 540

GGCCATGCTG aCCATGGTGa ATATGGTCAT GATATCGTTT GTGCTGGAGC TTCAgCTGTA 600

TTCFrTGGTA GTGTTAATGC GATTATAGGA TTGACATCTG AGAGACCAGA TATCAATTAT 660

GACGACAATG GTGGTCATTT TCATATAAGA AGCGTTGATA CAAACAACGA TGAAGCGCAA 720

CTAATTCTTC AAACAATGCT TGTGTCTTTA CAAACTATTG AAGAAGAATA TAATGAGAAT 780

ATTAGATTAA ATTATAAGTG AGGTGCATTC CGATGTTAAA ATTAAACTTA CAATTCTTCG 840

CATCTAAAAA AGGGGTAAGT TCTACAAAAA ACGGACGTGA CTCTGAATCA AAACGCTTAG 900

GTGCTAAACG TGCTGACGGT CAATTCGTAA CAGGTGGTTC AATTTTATAT CGCCAACGTG 960

GTACTAAAAT TTACCCTGGT GAAAATGTAG GTCGTGGTGG CGATGATACA TTATTCGCTA 1020

AAATCGACGG CGTTGTTAAA TTCGAACGTA AAGGTCGCGA CAAAAAACAA GTTTCTGTAT 1080

ATGCAGTAGC TGAATAATTT TGTCTAGTTA ACACCAGAAG TGAATCTTCT GGTGTTtTTT 1140

AGACGTTATA CTAAATGTGC AeTGTATAAG AGCCCCTAAT CACTAAACTA TAAGGGGGAC 1260

AAAGGAATAC AGTTGCAGCG TTTAAAGAAT AAACTGTACC ACAATTGGTG CTGAGAAATA 1320

TAAGTATTTT AAAGGAAAGA TTTATAAAAG TAACTGCATA AGAGCCCCTA ATTATTTACA 1380

ATATATAAGG GGCTCTAATA TGCTATAATT ATTGGGAAAA TGAAAATTAT ATGTAAAAGA 1440

GGTGAGATAT ATGTTTG^CG ATCAAGTCAA AATATCTCTT AAAGCCGGTG ATGGTGGTAA 1500

TGGTATTArr G f AT AC AG AA GAGAAAAATA TGTACCATTT GGTGGACCAG CTGGCGGTGA 1560

r*nr2T i nn , T , A a & nnTnr^vrcLCz tpgtat*t*tga AGTGGATGAA GGTTTAAGAA CGTTATTAGA 1620

TTTTanaTaT r'L.amTPivTT ttaaagpaag CAAAGGTGAA AATGGCCAAA GTAGTAATAT 1680

r^r'tTfinT'aaa aaTnrrjiaar: z^tttagtatt AAAAGTTCCA CCTGGTACAA TTATTAAAAA 1740

TTTTnaaafa narvm zirrTYJT tagpagatgt TGTTGAAGAT GGTCAAAGAG CTGTAGTAGC 1800

nain'x/infTinT pnarvzTfsrzrT' gaggtaattg ACGTTTTGCA ACACCTAGAA ACCCTGCACC 1860

r m2vr ,f rTr , Af2T nAAA&AraryrYZ aaphaggtga GGAATTAGAT GTATCTTTAG AATTGAAATT 1920

aTTaprrraT fiTunfiivTTAfn TAfinTTTrrr TAGTGTGGGT AAATCGACTT TATTAT CTT AT 1980

ATATCATTTT ACAACGATTA AACCAAATCT 2040

anrsTftTTYtTT TPnnrnrrTrj atcaapgtag TTTTGTTATG GCAGATTTAC CAGGTTTAAT 2100

TfSL.&fiTPfiPtt Tr"maTnnm tty^gattagg ACATCAATTT TTAAGACATG TAGAGAGAAC 2160

A HHnTTaTT nTTGAGATGA TTGATATGAG CGGTTCTGAA GGTAGAGAAC CTATTGAAGA 2220

TTATAAAGTP ttTTAATGAAG AATTAGGTGG GTACGAGCAA CGTTTAGAAG ATAGACCTCA 2280

AATPGTAGTA GTTAACAAGA TGGATTTACC TGAATCACAA GATAATTTAA ACTTGTTTAA 2340

AGAAGAAATT GGGGAAGATG TGCCAGTTAT TCCAGTTTCA ACAATAACGC GTGATAATAT 2400

TGATCAATTA TTATATGCAA TAGCAGATAA ATTAGAAGAA TATAAAGATG TTGACTTCAC 2460

AGTTGAAGAA GAGGAGTCAG TTGGCATTAA CCGAGTATTA TATAAACATA CACCGTCACA 2520

AGATAAATTT ACAATTTCAA GAGATGATGA TGGTGCTTAT GTGGTAAGTG GTAATGCTAT 2580

TGAAAGAATG TTTAAAATGA CTGACTTTAA CAGTGATCCA GCAGTACGTC GATTTGCTCG 2640

TCAAATGCGT TCGATGGGTA TTGATGATGC GCTTAGAGAA CGTGGTTGTA AAAATGGTGA 2700

TATCGTTAGA ATTCTTGGCG GAGAATTTGA ATTCGTTGAA TAGGAGCGAA ACATGATGGA 2760

CAATAAAGAT TATAAAAAGT TTTATTTAAT TAGAGAAGAT GTCTTGCCTG AATCCGTGGT 2820

TAAAACATTG AAGATTAAAG ATGCCTTAAA AAGTGATCCG ACATTGTCCA TTTATGATGC 2880

CGTTAAACAG TTTGATCTAT CTAGAAGTGC TTTTTATAAA TATAGAGAAA CGATATTTCC 2940

TGTTGGTATG TTGGCACGTG TACTAGATGT TATATCAAAG TTAGAACTAT CTGTATTAAC 3060

GATTCATCAA AGTATTCCAA TGGAAGAAAA AGCAACAATA ACATTATCAC TGAATGCTAA 3120 

ATCTAAAGAA ACTTCAGTAG AAGATGTTAT TGGCGCTTTG AGAAATTTAG ATTATGTATC 3180

AAAAGTAGAA TTAATTAGTA TGAGTATGTA AGGATGTGCC TATAATGTAC GCGTATGTCA 3240 

AAGGTAAGTT AACACATTTA TATCCTACAC ACGTAGTTGT TGAAACTGCT GGTGTTGGTT 3300

ATGAAATTCA AACACCAAAT TCTTATCGTT TTCAAAAGCA TCTAGATCAT GAAGTTTTAA 3360

TTCATACATC TTTAATTGTT CGTGAAGATG CACAATTATT GTATGGATTT AGTAGTGAAG 3420 

AAGAGAAAGA TATGTTCTTG AGTTTAATTA AAGTTACTGG TATTGGTCCG AAATCAGCTT 3480 

TAGCTATTTT AGCGACAAGT ACGCCTAATG AAGTAAAACG TGCCATTGAA AATGAAAATG 3540

ATACGTATTT AACTAAATTC CCAGGAATTG GTAAGAAAAC GGCAAGACAG ATTGTCTTAG 3600 

ATTTAAAAGG TAAAGTGAAA ATTACTGAAG AAGATAGCGA TTCATTATTA CAAGTAGACG 3660

CTACTTCGAC GGTGCAAGAT CAATTCGTGC AAGAAGCAAT GTTAGCGTTA GAAGCATTAG 3720 

GTTATTCTAA ACGAGAGCTT GCAAAAGTTG AGAAAACGTT AAATAAAAAT AAATATGACT 3780 

CAGTTGATGA AGCTGTTAAG GGAGGTCTTC AATTAGTTGT ATCTTAATTT TAAATAGATT 3840

AATAGGGGAA GTGTTGTCAT GAATGAGCGT ATGGTTGATC AATCAATGCA TAGTGAAGAA 3900

ACTGATTTCG AATTGTCGCT TAGACCTACG AGATTACGAC AATATATTGG TCAAAATTCA 3960 

ATAAAAAGTA ATTTAGAAGT ATTTATTAAA GCGGCTAAAC TTCGTCATGA ACCATTAGAT 4020 

CATGTATTGC TTTTTGGCCC CCCTGGATTA GGTAAGACAA CATTATGTAA TATCATTGCC 4080 

AATGAAATGG AAGTTAATAT ACGTACAGTA TCAGGGCCTT CATTAGAAAG ACCTGGTGAT 4140 

TTGGCTGCAA TTTTATCAGG ACTTCAACCT GGAGATGTTT TGTTTATTGA TGAAATACAC 4200 

AGACTGAGTA GTGTTGTTGA AGAAGTGTTA TACCCTGCAA TGGAAGATTT CTTTTTAGAT 4260 

ATTATCATTG GTAAAGGCGA TGAGGCTAGA AGTATCCGTA TCGACTTACC TCCATTCACT 4320 

TTGGTAGGTG CAACAACGCG AGCTGGCAGC TTAACAGGTC CACTAAGGGA TCGATTTGGT 4380 

GTGCACTTAA GATTAGAATA TTATAACGAA TCAGATTTAA AAGAAATCAT TATTAGAACA 4440 

GCTGAGGTTT TAGGCACAGG TATTGATGAA GAAAGTGCCA TTGAACTTGC TAAACGTTCT 4500

AGAGGGACTC CAAGAGTAGC AAATCGACTA TTGAAGCGGG TAAGAGACTT CCAGCAAGTG 4560 

AATGAAGATG AACAAATATA CATTGAAACA ACGAAGCACG CATTAGGTTT ACTTCAAGTT 4620 

GATCAACACG GACTAGATTA CATTGATCAT AAAATGATGA ACTGTATTAT TAAGCAGTAT 4680

AATGGCGGAC CTGTTGGTTT AGATACGATT GCCGTAACAA TTGGTGAAGA ACGTATTACA 4740 

GGCAGAAAAG CAACACCATT AGCTTATGAA CATTTTGCAA AGTCGAATGA GGAGAGAGAA 4860

TAACGTGAAT ATTGAAGAAT TTGACTATGA CTTACCAGAA TCATTAATTG CTCAAACGCC 4920 

TTTAAAAGAT CGTGATCATA GTCGTTTATT AGTCATGGAT AGAGAAACTG GTGAAATGAA 4980

ACATTTACAT TTCAAAGATA TCATTGAGTA TTTTAGACCT GGTGATACAT TAGTGCTTAA 5040 

CGATACGCGA GTAATGGCAG CTAGACTTTT TGGTTTAAAA GAAGAAACTG GTGCAAAAGT 5100 

TGAAATGTTA ATGTtAACTC AAATTGAAGG TAATGATTGG GAAGTCTTAC TGAAACCAGC 5160 

TAAGCGTATT AAAGTTGGTA ATAAATTGAA TTTTGGTAAT GGCAAAATTA TAGCTGAATG 5220 

CATAAAAGAA ATGGATCAAG GTGGACGCAT CATGCGTTTA CATTATGAAG GTATTTTACA 5280

AGAAAGATTA GATGAATTAG GGGAAATGCC ACTGCCACCA TACATCAAAG AACGTTTAGA 5340 

TGATCCAGAT CGTTATCAAA CAGTTTACGC TAAAGAAAGT GGTTCAGCGG CAGCACCAAC 5400

AGCAGGATTA CATTTTACTG ATGAGTTATT AATTGAAATT AAAAATAAAG GTGTTAATAT 5460

CGCATTTGTT ACATTACATG TTGGGTTAGG TACGTTTAGA CCGGTGAGCG TAGACGATGT 5520 

GAATGACCAC GAAATGCATA GTGAATATTA TCAAATGACm CAAGAAACAG CTGATTTATT 5580 

AAATGATACT AAGcCAAAGG ACATCGCATT ATATCAGTTG GTACAACTTC AACACGTACA 5640

CTTGAAACAA TTCGACGCGA TCATGATAAA TTTGTTGAAA CGAGTGGCTG GACTAATATA 5700 

TTTATTTATC CAGGATTTGA TTTTAAAGCA ATTGATGGCC AGATTACTAA TTTTCATTTA 5760 

CCAAAATCAA CATTAGTTAT GCTAGTATCA GCGTTTAGTA GTCGTGAAAA TGTTCTGAAT 5820 

GCTTATAAAA CGGCAGTAAA TTTAGAATAT AGATTCTTTA GTTTTGGCGA TGCAATGTTA 5880 

ATTATATAAA AAGAATGTGA GGATTTTGAA TATGCCTGCA GTAACATACG AACACATTAA 5940 

AACTTGTAAA CAATCAGGTG CGCGTTTAGG TATCGTGCAC ACACCACACG GTTCATTTGA 6000 

AACACCTATG TTTATGCCAG TTGGTACTAA AGCAACCGTT AAAACAATGA GTCCAGAAGA 6060 

GTTAAGACAA ATTGAAGCAA AAATCATTTT GGGCAACACA TATCATTTGT GGTTACAACC 6120

CGGAAATGAT ATTATCAAAC ACGCTGGGGG ATTACATAAA TTCATGAATT GGGATGGTCC 6180 

GATTCTTACA GATTCAGGCG GTTTCCAAGT GTTTAGTTTA AGTAATTTAC GTAAAATTAC 6240

AGAAGAAGGC GTGGAATTTA GACATCATAC TAATGGGTCT AAATTATTTT TGAGTCCTGA 6300

GAAATCAATG CAAATTCAAA ATGATTTAGG ATCTGATATT ATGATGGCAT TTGATGAATG 6360 

TCCACCGATG CCTGCTGAAT ATGATTATGT AAAAAAATCT ATTGAACGTA CAACACGTTG 6420 

GGCGAAAAGA TGTCTAGATG CACACCAAAG ACCTGAAGAT CAAGCATTGT TCGGCATTAT 6480

ACAAGGTGGC GAATATGAAG ATTTAAGAGA ACAAAGTGCA AAGGATTTAG TAGAATTAGA 6540 
AATGGTTGAA CATACAGAGC AGTTTATGCC TAAAGATAAA CCAAGATATT TAATGGGTGT
AGGATCTCCa GATGCGTTAA TCGAATGTAG TATTCGCGGC ATGGATATGT TTGATTGTGT 
CTTACCGACA CGTATTGCCA GAAATGGTAC TTGTATGACA TCGCAAGGTC GTTTAGTTAT 
TAAAAATGCA AAATTTGCAG ATGATTTAAG ACCGTTAGAT GAGAATTGTG ACTGTTATAC 
ATGTCAAAAC TATTCAAGAG CGTATATACG TCATTTAATC AAGGCAGAGG AAACTTTTGG 
TATTCGTCTT ACTACTATTC ATAATTTACA TTTTCTGCTA AAATTAATGG AAGATATAAG 
ACAAGCCATT CGAGAAGATC GTCTTTTAGA TTTCAAAGAA GAATTCTTCG AGCAATATGG 
ATTAAATGTT GAGAACCCAA AAAACTTTTA AGCAAGAGGA GCGTATAAAA TGCAATTTTC 
ATTACTAATA TATATAGTCG TAATTTTTGC GGTTATGTAT TTCTTGATGA TCAGACCACA 
ACAAAAACTG TGCGAAACA 
(2) INFORMATION FOR SEQ ID NO: 303: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 303: 



TCCATTTATA GAAATTTCTA AAGCAGAnAA TAAGATAGAA GATATCGGCC AAGGTGCAGA

AATCATCAAA AGAACACAAG ACATTACTAG CAAACGATTA GCTATAACTC AAAACATTCA

ATTTGATTTT GTAAAAGATA AAAAATATAA CAAAGATGCC CTAGTTGTTA AGATGCAAGG

CTTCATTAGC TCTAGAaCAA CATATTCAGA CTTAAAAAAA TATCCATATA TTAAAAGAAT

GATATGGCCA TTTCAATATA ATATCAGTTT GArAACGAAA GACTCTAATG TTGATTTAAT

TAATTATCTT CCTAAAAATA rAATTGATTC AGCAGATGTT AGTCAGAAAT TAGGCTATAA

TATCGGCGGA AACTTCCAAT CAGCGCCATC AATCGGAGGC AGTGGCTCAT TCAACTACTC

TAAAACAATT AGTTATAATC AAAAAAACTA TGTTACTGAA GTAGAAAGTC AGAACTCTAA

AGGTGTTAAA TGGGGAGTGA AAGCAAATTC ATTCGTTACA CCGAATGGTC AAGTATCTGC

ATATGATCAA TACTTATTTG CACAAGACCC AACTGGTCCA GCAGCACGAG ACTATTTCGT

CCCAGATAAT CAACTACCTC CTTTAATTCA AAGTGGCTTT AATCCATCAT TTATTACAAC

ATTGTCACAC GAAAGAGGTA AAGGTGATAA AAGCGAGTTT GAAATCACTT ACGGCAGAAA

CATGGATGCT ACATATGCTT ACGTGACAAG ACATCGTTTA GCCGTTGATA GAAAACATGA
AAAAATTAAA AGCATCACAC CTAAGTAAAC AGTTCaATCA TCTTAAAAAA TCCTGGGACA 900

CTTCATACTT GTCTCAGGAT TTTTTmACAA ATTGAATCAG CCTCATAACA TTAAATTATT 960 

TTATCGTACA TTAAATTTAA TAATAACAAC TGATTTTTAT AAGAATAAAG TATCGAmCCA 1020

TAGTAGATAC ACAAATAATA CAAATGAAAC AATTTAACTT GAAAGCTTAa ATAAATATTA 1080 

TCAAGTTAAT AAACAATTAA TTTTTAGATG GATTCATCAA AAATCGTAAA AAAGCACAAT 1140

TTGTATTTTA CAAACATTAA TTAAAAAAGA AAGCAAGACA TTCGTGCAAT CGGTTACCTT 1200

AAATTGTTTA CAACTGTCAA CAATACCAAG GTTTTATTAA CTATATTTCT CACAAAATTA 1260

GCTTTTAGCA TTCCAAACAA AAAAGGTTAA ATCGAACGGA ATTATGGCAT TTTTAACTTA 1320

ATTGTAAAAA AAGTTGATAA TGGTCAATTG TTAATGAACA GTTAATTATA ATAACGCCCA 1380

AAATATATTA TTATTTAATT AAGTTAAATA AAATTATAGA AAGAAAGTGA AACTTATGCT 1440 

TAAAAATAAA ATATTAACTA CAACTTTATC TGTGAGCTTA CTTGCCCCTC TTGCCAATCC 1500

GTTATTAGAA AATGctAAAG CTGCTAACGA TACTGAAGAC ATCGGTAAAG GAAGCGATAT 1560 

AGAAATTATC AAAAGGACAG AAGATAAAAC AAGTAATAAA TGGGGCGTGA CTCAAAATAT 1620 

TCAATTTGAT TTTGTAAAGG ATAAAAAATA TAACAAAGAT GCTTTGATAT TAAAGATGCA 1680 

AGGATTCATT AGCTCTAGAA CAACATATTA CAACTATAAA AAAACTAATC ATGTTAAAGC 1740 

TATGCGATGG CCATTCCAAT aTaATATTGG TTTAAAAACA AATGATAAAT ATGTTTCTTT 1800

AATTAATTAT TTACCTAAAA ATAAAATTGA ATCTACAAAC GTGAGTCAGA CATTAGGATA 1860

CAATATCGGT GGTAATTTCC AATCAGCCCC ATCACTCGGT GGTAATGGAT CATTTAACTA 1920

TTCTAAATCG ATTAGCTATA CACAACAAAA TTATGTAAGT GAAGTAGAAC AACAAAACTC 1980

AAAAAGTGTT TTATGGGGCG TCAAAGCGAA TTCATTCGCC ACTGAATCAG GTCAAAAATC 2040

AGCCTTTGAT AGCGATTTAT TTGTAGGCTA CAAACCTCAT AGTAAAGATC CTAGAGATTA 2100 

TTTCGTTCCA GACAGTGAGT TACCACCTCT TGTACAAAGT GGATTTAACC CTTCATTTAT 2160

CGCCACAGTA TCTCATGAAA AAGGTTCAAG CGATACAAGC GAATTTGAAA TTACTTACGG 2220

AAGAAACATG GATGTCACTC ATGCCATTAA AAGATCAACG CATTATGGCA ACAGTTATTT 2280 

AGACGGACAT AGAGTCCATA ATGCATTTGT AAATAGAAAC TATACTGTGA AATACGAGGT 2340

CAATTGGAAG ACTCATGAAA TCAAGGTGAA AGGACAGAAT TGATATGAAA ATGAATAAAT 2400

TAGTCAAATC ATCCGTTGCT ACATCTATGG CATTATTATT ACTTTCTGGT ACTGCTAATG 2460 

CTGAAGGTAA AATAACACCA GTCAGCGTAA AAAAAGTCGA TGACAAAGTT ACTTTATACA 2520 

AAACAACAGC CACAGCAGAT TCTGATAAAT TTAAAATTTC ACAGATTTTA ACATTTAATT 2580 
ACTCAGGCTT TGTGnAACCT AATCCTAATG ACTATGACTT TTCAAAATTA TATTGGGGAG 2700

CTAAATACAA TGTATCTATA AGCTCACAAT CTAATGATTC AGTAAACGTC GTTGATTATG 2760 

CACCAAAAAA TCAAAATGAA GAGTTTCAAG TTCAAAATAC TTTAGGCTAT ACATTTGGTG 2820 

GTGACATTAG TATCTCTAAT GGTTTATCTG GTGGACTTAA TGGAAATACA GCTTTTTCTG 2880 

AAACAATTAA TTATAAACAA GAAAGTTACA GAACAACATT AAGTCGCAAC ACAAATTATA 2940 

AAAATGTTGG CTGGGGAGTT GAAGCACATA AAATTATGAA TAATGGTTGG GGACCTTATG 3000 

GAAGAGATAG CTTCCACCCA ACATATGGTA ATGAACTCTT CTTAGCTGGC AGACAAAGCA 3060 

GTGCATACGC TGGCCAAAAC TTCATAGCGC AACACCAAAT GCCATTATTA TCTAGAAGTA 3120

ACTTCAATCC AGAATTTTTA AGCGTACTAT CACACAGAC 3159
(2) INFORMATION FOR SEQ ID NO: 304: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 304: 

GCAAAAACTT TTCTCCAACT ATGTCAGACT ATAATGGCAT GAACAAAATT GATATGATGA 60

ATCAAATTAA AGTTGATACG ATGTTGCATG GTTATCACGC TGGATTTTTA TTTGCATTAT 120

TGATTACAGT TGTTAGTTTC TTTTGTTCAT TTATGTTACA AGGTAAGAAA AAAGAAGTTG 180

ATTCTCGTCA ATAAATATAA GTTGCTAGCT ATATAAAGCT TTTTAGCAAA AGTTCAACAT 240

ATTGACTTAT CCGGCATTTC AGATTAAAAT ATTTTTATTC CGATTAGAAT AATAAGAATA 300

AGGAGATATA TTCTATGAAA AGACTTTTAT TTGTGATGAT AGCTTTCGTT TTCATATTGG 360

CTGCATGCGG AAACAATTCG TCGAAAGACA AGGAAGCTAG TAAAGATAGC AAGACAATTA 420

ATGTTGGGAC TGAGGGGACT TATGCACCAT TTAGTTTCCA CGATAAAGAT GGTAAATTAA 480

CTGGTTACGA TATTGATGTT ATTAAAGCAG TGGCTAAAGA AGAAGGTTTA AAACTTAAAT 540

TTAATGAAAC TTCTTGGGAT TCTATGTTTG CAGGTTTAGA CGCAGGGCGT TTTGATGTAA 600

TCGCGAACCA AGTAGGTATT AATCCTGATA GAGAAAAGAA ATATAAATTT TCTAAGCCTT 660

ACACATTCTC AAGTGCTGTT TTAGTTATTC GTGAAAATGA AAAAGATATT AAAGATTTTG 720

ATGATGTTAA AGGTAAGAAG TTAGCACAAA CATTCACATC TAATTATGGT AAATTAGCTA 780

AGGATAAAGG TGCTGATATT ACAAAAGTTG ATGGCTTTAA CCAATCAATG GATTTATTAT 840
AAAAACCTAA TGcTAAGATC AAAGCAATCA AAGGTAATGC TGAACAAAGT AGATCTGCAT 960

TTGCATTTTC TAAAAAAGCA GATGATGAAA CAGTTCAAAA ATTCAATGAT GGCTTGAAAA 1020

AAATCGAGGA AAACGGTGAA TTAGCTAAAA TAGGTAAGAA ATGGTTTGGT CAAGATGTTT 1080

CTAAATCTAA ATAGCGAACA ACAACATGCG CTAGATGCTG CAAAACAAGC TTTCGGACCT 1140

ATGCTAGAAG GTTTGGTCAA ATATTCAATT CCTATTACAT TAGTTACATT TGTTTTAGGA 1200

TTGATTATTG CATTATTTAC AGCATTAATG CGAATTTCAA CGAGTAAAAT TTTAAGAAGT 1260

ATTTCACGTG TCTATGTATC TATTATTCGA GGAACACCAA TGATAGTACA ACTATTTATC 1320

ATATTTTATG GTATTCCAGA ATTAGGTAGA TTATTAACAA ATGACGCTGA CAACCAATGG 1380

ACATTGGCAC CTGTAGTGGC TGCTATTATT GGTTTATCAT TAAATGTAGG TGCGTATGCT 1440

TCGGAAATTA TTCGTGGCGG TATTATTTCT ATACCGAAAG GACAAACAGA AGCTGCaTAT 1500

TcCaTCGGTA TGACGTATGG TCAAACGATA CAACGTATCA TTTTACCGCA GGCAATTCGA 1560

GTGTCGATTC CTGCACTAGG TAATACATTT TTAAGTTTAA TCAAAGATAC ATCATTATTA 1620

GGATTTATTT TAGTGGCTGA AATGTTTAGA AAAGCTCAAG AAGTTGCGTC TACAACATAT 1680

GAATATTTAA CAATTTATGT GTTAGTTGCG CTAATGTACT GGGTGGTATG CTTTATTATT 1740

TCAATTATCC AAGGTATCTA TGAATCTTAT ATTGAAAGAG GGTATCGCTC ATGATTCAAT 1800

TGAACAATAT CCATAAATCA TTTAATGATG TTGAAGTCAT CAAAGGTATT GATTTATCTG 1860

TTGAACAAGG TGAGGTTGTA ACCTTAATCG GTCGATCTGG TTCAGGTAAA ACAACATTGT 1920

TACGTATGAT TAATGCATTA GAAATTCCAA CTGAAGGTAC AGTTTATGTT AACGGCAAAA 1980

CATATACATC TAAAGATAAA AAATCACAAA TAGAAGTTCG TAAACAGTCT GGTATGGTAT 2040

TTCAAAGTTA TAACCTTTTT CCGCATAAGA CGGCATTAGA AAATGTAATG GAAGGTCTTA 2100

TCACAGTTAA AAAGTTGAAA AAGGATGAGG CACGTGGGAA ATCACTTGAG TTACTTGAGA 2160

AAGTTGGTTT AACACATGTC AAAGATCAAC GTCCACATGC ATTATCAGGT GGTCAACAAC 2220

AACGTGTTGC TATTGTCAAG AGCACTAGCA ATGAACCCTA AAGTGATGTT GTTTGATGAA 2280

CCAACATCTG CACTTGATCC TGAACTTGTG AATGATGTTT TAAAGGTTAT TAAAGATTTG 2340

GCTAATGAAG GCATGACAAT GGTCATTGTG ACACATGAAA TGCGTTTTGC TAAAGAAGTA 2400

TCTAATAACA TTGTATTTAT TCmTGAAGGC ATGATCGGAG AACAAGGGGC TCCAGAAGAG 2460

ATGTTCAATC GTCCGAAAAC AGAAGAATTA AGACGTTTCT TAAATGTTAT AAATGAAGAA 2520

TAATCAAATA GAACCACGTA TCATGTTTTA GTATGGCGAT GAAGCCATAT ACATGATGCG 2580

TGGTTCTTTG TTATGTTGTC ATAATCTTGG AGCGATATTT TAACGACGTT TATGATTTAA 2640

TTCTACATGT GCGTTAAAAC CTTTTTTGAA TTGTTGGACG CCATAGTCTT CTGATGACTC 2760 

TGAAAAGTCA CCGGTAATAC CATAAAAATT ATAGCGATCA ATATGATGCG CTTTAGCAAA 2820

CTTAATCATT TCCCaCTGCA AATGGTAGGC ACCCATATAA GCATTATATT TAGGGTTTGA 2880

ACCACTAGAT AAGTAATAAA CTTCATGCTC ATTGTAGATA AATAAAGCAG AAGCTAAGTT 2940 

TAAGACTGCA CCATCTTGTT CAATTTGTTC TATTGTATTG TCGATTTTAC GCTTATTGCT 3000 

ATTTAGCTGT TGTTCTAGCT GTGTGCGTTT CGTTTTATTT TTCTTTGAAT TAGGACTTTC 3060 

TTCCAATGCT TCTTCAACAC CTGAGAGTTC AGCTGTTAAT TGTTGTTGCT TTAATTGTAA 3120 

CGTTTTTAAA TACTCGTTTA AATCAATATA CGCCAACTTT AACATGGCGT GGTCATCGTA 3180

TAACTTTTGC ATTTCTTCAA AGTATGGTAA CTCACGGAAT TTGAAACCGT GCTTTTCCTC 3240 

AGCCATATGG AATAAGTCGA AAAAAGTTTG CGTTTCATCA ATCGTTAAcG TTTTaGTTTT 3300 

GACACCAAtA TCaTATGTtT tTtTAATATT ACGTCTCGtT TGATAATCCA TTTCTTTTAA 3360

AAGTTGGTCT TCAGTCTTAT CTTTTAAATC TAACACTGAC AGCCAACGGA TTTGGCTCAT 3420 

TGAATCATAA CCTACAGGGA AACCTTGGTG TTTATAACCT AATTTATCCA TTGTTCTaAC 3480

AAATGCTCGG TTATCATAAG ATTTAACAAT TTCACCGTCT GCATTGCGTA AATTTTCAAT 3540 

TAAATATGGA TCTACAAGGA CATATAAACA ATTGTGTTTC TTTAAATATG ACGTTAATGC 3600 

TTTAAAGAAA AATGCTACTA ATGATTGATT TGTATAATCC ATCACTGGCC CGCGATGTGT 3660 

ATAAAAATAT TTGAAAAATT TAAGTGTGCG TGCTTCTGTC AATAAGCATC CTGCAATCAC 3720 

TTGACCATTG TCATCTTTAA CCCCTACrAG ATGCACATCG CCTTTTAAAT CAACTCTATG 3780 

ATTGTAATGA ATAGCTGATT GTGTGTAATG TGAAAAATGC T 3821
(2) INFORMATION FOR SEQ ID NO: 305: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1422 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 305: 

GCGACACCAT TGTTTGCAGC AACTGTTAAT TTACCAGTTG ATGGTTTCGA CGGTGTTGTT 60 

GGTTTTGATG GTGTAGTAGG GGTAGTTGTA GATTGCGTAC CCCATGGCGC CACTTTACCC 120

ATTTTTATTA AATATTTTTC ATTAATTAAG TCATATAATT GATCATAACT ATAATTATGA 180 

CTTCTTAAAT ATCCATGTGG ATcGGCATGG TCAGTACCAC CTAAATATTT ACTTACAGCG 240 
TATTGTAATT GTGTAGCTGC ATAGTCAGCA TAGTTATTCA TTGAACGTGC AAATGAAGCA 360

TAGTCGTGTG TGTGTACGAT TTCAACATTG ATGAATCTAG GGTTACCGAC TGCACCGACA 420 

CCCCAAGATA AGTAATCCGT TGGTGCTGTT TCGATTATAC GATCCCCATC AACAAATGCA 480 

TGTACGAATG CGTTTTGATA GTTATTTTTC ATATAACTAA TTTCACCATT TATCGTCGAA 540

CGATCATTAG CTGTATCATG AACTACGATA CCTTCAGGAC GACCTACGCC GTTACGGTAT 600 

GCGTATTTAG GGAAGTAAGA TGTATAATCT TCTTCAATTT TAGGTGCTTT TAAGTTATTT 660 

TTACAAATGT AATCGTTAAT TGAAGAGTTT ACTTGTGGTT TATATTTTGG CAAACTCGTT 720 

TTTGGTGTTG CAGCAACTGA TCTTGGTTGT GCTGAAGCGC TAAAAGTAGT TACTTTAGGT 780 

GTCGCTTCAG TTTTAGCTTT AGGTGCTGAT GTAGTTGCAG CTTTAGGTGC TGCGGTTTTA 840

TATTGCGTyT CAAGAGCTGC AGGTTTAGCA GCTGATTTAA TTAATTCTGG ATTAATTTGA 900 

TTTTCTGAAT TATCATCTTC ATCATCAACT AAACTATAAC CAGCATTTGT AACATTAGTG 960

TTAGTTTTAG GTGCTGTAGT GCTTGTTGAC TTTGCAACAG GCTGCGTATT ATTTGTAGTC 1020

GCTGATTGAT TAGCACGAGT GTCACCATTT ACTTGTGCAG TATCAACTTT TTGACTTACT 1080

TGAGCATTGC CTGTTTTGTT ATTTGCTGTT TTTGGTTGGA CAATAGCAGG GTCTTGATAT 1140

ACTTGAGTGC CAGAAATGTT TTGCGTTGGA TTTTTTACCT CAGCTTTTGC TTGTTCAGTA 1200 

GTTGCTTTAA CTTTATTACT ATCTAAAACG TTTTTATTAG TAGTTTGATC TTGTGTCGTC 1260 

TCAGCTGCTT GAACTTGATG TGCAGTGACT GCTGAACCTA CAAGCGTTAA TGCAACCATT 1320 

GAgGTAGTTT GTAATTGAAT TTTTTCGCCA TTCTATTTAT TACTCCyAAC ATTTATTAAT 1380 

TATtACTAAC ATTATAGTAC CTGTnTTATA TACCTGTGCG TA 1422 
(2) INFORMATION FOR SEQ ID NO: 306:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6076 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 306: 

ACGCGTTGGT TATTTCAATC ATAATATTAC TCTGCAAATA CACAGCCCTG TAACCGAGTA 60 

AATGGATAGA GATTTGAACA AATGAAAACA ATCAACTAAT GGAAAGGATA AAATATTATG 120 

CACAACAATA ATGAATTCAA CAAAAAACTG AAAGATTTTA TAGGTAGCGA TAAACGGATG 180 

GCTTTAGTAA AGGGTTATGT TAACGAGTAT AAATTAACAA CTGTTTTAAG AGCTTTAAAT 240 
GTAGATAATC GAATTATTCC TAAAAATATA ACTCAAAACA CAATCTTCAA ATTGAGTAAT 360
TTAACCTTAG AGGTTAACCT CTATGAAAGA AACACAATTT ATCACGGAGA CTTTGCTATT 420

TACTATCCTG TAGAATCTGC TCTAATAATA GAAACAGATA CTCCTAGGCT AATTAATCAT 480 

ATAAATAACA ACCCAGTCCA AAAAATATTT ATAATAACAA CAAATGACTG GAGCTTTAAT 540 

ACACTTGaAT TAGAAAAAAT AGTAGACGAA ACTATAATTT ATGATTTAAA ACAAGAAGAT 600 

CCTAAACAAT ATGAAATTTT ATATAAAAAT AAACACGGAA AATTGCCTTA TTGATGAATT 660 

AAAACGCAAA CTTAGAACAC CTTACCCCTT TTCCGTCCTT ATTGTAATTT TCCAACTACT 720 

CAAATTAAAA ACCACCCGTG CAAACAGGTG GTTTTATTAT ATATAATTAT TCCCACTCAA 780

TTGTGCTTGG TGGTTTTGAT GTAATGTCAT AGACTACGCG GTTGACGTGA TCTACTTCGT 840

TTACGATACG ACTAGAAATC TTTTGTAAGA CTTCCCAATC GATGCGTGCG AAGTCACTTG 900 

TCATACCGTC GATAGATGTT ACTGCACGAA TACCTACTGT GTGATCATAC GTACGGTAGT 960

CTCCCATAAC ACCTACTGAT TGAATGTTTG GTAACACTGT GAAGTATTGC CAAATTTCTC 1020

TTTCAAGACC TTCTTCTCTA ATCACTTGGC GTAAAATCGC GTCTGATTCT CTAACGATTT 1080

CTAGTTTATC TTCAGTAATT TCTCCAAGTA CACGAATACC AAGACCAGGT CCTGGGAATG 1140

GTTGTCTCCA TACTAAATGT TCTGGAATAC CTAACTCAAT ACCTAATTTA CGTACTTCAT 1200

CTTTAAACAA TGTATTGATT GGTTCGATTA ATTCGAATTC CATGTCTTCT GGTAATCCAC 1260 

CAACATTGTG GTGTGATTTG ATTGTTTGTG CTGTTTTAGT ACCTGATTCG ATGACGTCTG 1320 

TATATAGTGT TCCTTGCGCA AGGAAGTCTA CACCTTTCAG TTTTGATGCT TCATCATCAA 1380 

ATACGTATAC AAATTCATTA CCAATGATTT TACGTTTTTG TTCAGGATCT GAAACACCTT 1440

TTAATTTATT CATAAAGCGA TCTTTCGCAT TAACACGAAT AATATTCATG TTGAAACCTT 1500 

CACCGAATTG CTCCATAACC ATGTCGCCTT CACCTTTACG AAGtAAGCCA TGGTCTACAA 1560 

AGATACATGT TAGTTGATCA CCTATTGCTT TATGCAATAG TACAGCTACA ACAGATGAAT 1620

CTACGCCGCC ACTCATCGCA CATAATACAC GACGGTCTCC TACGCGTTGA CGAATCTTTT 1680 

CAATTTCGAT TTCGATAAAG TTTTCCATTG TCCATTGACC TCTACAATCA CAAACACGAC 1740 

GGACAAAATT ATTTAATAAA TCATTACCAT ATTCTGTATG ACGTACTTCT GGATGGAATT 1800

GAACACCATA AATGCGACGT TTCTTATCTT CGATTGCTGC ATAGTCTGTG CTTGGGCTAT 1860 

CAGCGATAAC TTCAAAGCCT TCTGGAATTT CAATAACTTT ATCAGAATGA CTCATCCAAA 1920 

CAGTTTGTTC TGCTGGTAAG CCAGCGAATA ACTCATCTGA CTTCGCATTA ATGATTGCTT 1980 

TACCGTATTC ACGTTCATTG GCACGTTCAA CTTTACCACC TAATAATTTA GTAGTTAATT 2040 


TAAATGAACC TTCTTCATAA ACTGAATTTG GACCACCTGA TAAGATAATA CCTTTTGGAT 2160

TCATTTTCTT AATTTCTTCA ATTGAAATTT CATGATCGTG TAATTCACTA TAAACGCCCA 2220

TTTCACGAAT TCGGCGTGTA ATTAATTGGT TGTATTGGCT ACCAAAGTCT AAGACAAGGA 2280

TTAACTCTTG TTCTTTTQCC ATTTCCATAT TTGTCGTTCT CCTTTATCTT AATTAGAATG 2340

AGTAGTTCGG TGATTCTTTC GTAATTTGAA TATTATGTGG ATGGCTTTCT GCTAAACCAG 2400

CAGGACCCAT ACGTGTAAAT TGTGCTTCTT CGCGTAATTC TCTTAAATCG TGTGAACCAG 2460

TATAACCCAT ACCAGCACGC ACACCGCCCA TTAATTGGTA AATTGTATCT TGTAACGCAC 2520

CTTTATAAGC CGTACGTCCT TCGATACCTT CAGGAACAAA TTTCTTAGGC GCTTTGTCCT 2580

CTTGGAAGTA ACGGTCGTTT GAACCTTTTT CCATCGCACC TAAAGAGCCC ATACCACGGT 2640

ATACTTTATA TTGTCTACCT TGGAAAATTT CTGTTGCGCC TGGGCTTTCT TCAGTACCTG 2700

CTAATAAGCT ACCTAACATA ACCGCATGTC CACCAGCAGC TAATGCTTTA ATGATATCTC 2760

CTGAGAATTT AATACCACCA TCAGCAATGA TAGCTTTACC ATGTTTGCGT GCTTCAGTtG 2820

CACAATCATA AATTGCTGTA ATTTGTGGTA CACCAACACC TGCTACAACA CGCGTCGTAC 2880

AAATTGAACC TGGGCCAATA CCAACTTTAA CAATATCTGC ACCCGCTTCA AATAAATCTT 2940

TTGTTGCTTC TGCAGTTGCT ACGTTACCTG CTACTAATGT GATTTCTGGG TAAGTCTTCT 3000

TAATATGTTT GACTTGATCG ATAACACCTT TAGAGTGACC ATGTGCTGTA TCGATAACTA 3060

AGACATCCAC ACCTGCTTCG ACTAATTTTT GAGCACGAAT ATCAGTATCT TTTGAAATAC 3120

CAATTGCTGC GGCTACAAGT AGACGACCAT GTTCATCTTT TGCTGCATTA GGGAATTCGA 3180

TAACTTTTTC AATATCTTTA ATAGTAATAA GACCTTCTAG ACGTCCGTCT TTAACTAATG 3240

GTAACTTTTC AATCTTATGT TTTTGGAGAA TTTTTTCTGC TTCTTCAAGT GTTGTATTCA 3300

CTGGAGCTGT AATTAAATTt TCTTGCGTCA TTACATCTAC AATTTTAATC GAGAAGTCTT 3360

CAATAAAACG TAAGTCACGG TTTGTTAAAA TACCTACTAA GTTGCGATCT TCTTTATTAT 3420

CAACAATTGG TACACCTGAA ATACGGTATT TACCCATTAA TGCTTCTGCT TCATAAACGC 3480

TTTCTTCTGG CGTTAAGAAA AATGGGTTTG AAATGACACC ATTTTCTGAG CGTTTTACTT 3540

TTTGAACTTC GTCCGCTTGT TCTTCAACGC CCATATTTTT ATGAATAACA CCTAAACCAC 3600

CTTGACGAGC CATAGCAATC GCCATTTTAG ATTCAGTTAC AGTATCCATA CCAGCAGAAA 3660

TAACTGGAAT ATTTAATTTA ACTTTGTCTG ATAATTGTAC GCTTAAATCA ACGTCTTTCG 3720

GTAAAATATC AGATTGTGCT GGAATTAATA ACACATCATC AAACGTTAAT GATTCTTTTG 3780

CAAATTTACT TTCCCACATT AAAAACAGCC TCCATTTTTG AAATTAATTA GTTATATTAT 3840
GCAGAGATTG CGCCTAAAAC AATTCCGTTT TGAGTCAACC ATGCAAATTG TTCACCTAAA 3960

CCTTTAAATG CTTGTGGTAC AGCGCTTATA CCAGTACCTA AtCCTACTGA TACAGCGATA 4020 

ATTAATAAAT TGTTTTGATT TTTAAAATCG ATATGTCCTA ATATACTAAC ACCATATGCC 4080 

ATTACCATGC CAAACATAGC TATCATCGCA CCGCCTAACA CAGGTAGCGG TATGATATTT 4140

GCTAATGCGC CAAGCTTAGG TATACAACCA CATATAAGTA ATAACACGAC CATGCCGTAT 4200 

ATAACATTGT TTTTCTTAGC GCCGGATAAA GAAACAAGTC CTACATTTTG CGAATAGGCT 4260 

GTATACGGAA ATGAATTGAA TATAGAACCT AAcACTATCG CTAGACCTTC CGCAGTATAA 4320 

CCTTTACGAA AATCTTTTCT TTCTAACTTC TTACCGGTAA TTTCACTTAA CGCATGATAG 4380

ACACCTGTCG ACTCAATTAA ACTAACGATA GCTACAATAA AGAACACTAA CGTCGATGTC 4440

ACATCAAAGC TAAATCCAGA GAATCTAAAC GGCACTGGGA TGCCTAACCA ACCGGCATGA 4500

TTGACTTGAT TAATATCGAC CATCCCAAGT AAGCCAGCAC CTATCGTTCC TAAAACGAGT 4560

CCAATTAATA TGGCAATACT CTTAATAAAT CCAGTTGTGA ATCTTTGTAA AAGAAGAATA 4620

ATGATTAATG TCATTAAACC TAACAAAATG TTCTTAACAT CTCCATAGTC CTTTGCACCT 4680 

TGACCTCCAG CTAAGTAATT CATTGCTACT GGCATTAAAT TGATACCAAT GATAGTAACA 4740

ACACTACCCG TTACTACTGG TGGGAAGAAT TTTACAAGAT GTGAAAAGAA AGGCGCGATG 4800

ATAATAACTA ATATCCCTGA TAAAAATAGC GAACCATAAA GTACATCTAT TCCTTTCGTT 4860 

TGACCAATTA AAATCATGGG CGCAACAGCC GTGAATGTAC ATCCAAGAAC GATTGGTAAT 4920 

CCTGTTCCTG TTACTTTATT GGCTTGTAAA AATGTGGCAA CCCCACACAT AAATATATCT 4980 

ACTGTAACTA AGTAAGCGAT TTGTTCAGGT GTAAACTTCA AACTTGTACC AACAATGATT 5040

GGAACTAAGA TAGCACCTGC GTACATAGCT AAAAGATGTT GAACACTTAG GATTAAATTT 5100 

TTCATTATTC TTCTCCCACC AATGTCACTT TGTTTCCTTC TAGTGAAGCA ACCTTGCAGA 5160 

GAGAAGAAAC TGTTAAACCT GCTTCTTCTA AACGTTGATG CCCATTTTGG AAACTCTTTT 5220

CAACAACAAT ACCAATACCA GCTGTCTTAG CATTCGCTTG CTGTGCGATA TCGTATAATC 5280 

CTAATGAAGC ATCACCATTT GCTAAAAAGT CATCGATGAT AAGTACAGTA TCTTCTTCTG 5340 

ATAAAAACTC TTTTGAAACA ATGACCGTAC TTGTTTTATT TTTAGTAAAT GAATGAATAG 5400

ATGTTTCATA ATAACCATCC GTCAAAGTGC TAGGTTTTGC TTTTTTCGCA AATAAACATG 5460 

GCACATCAAA ATGCAGTGCA GCCATGATTG CAGGTGCGAT ACCGGAAGCT TCAATGGTTA 5520 

AGATTTTAGT AATCCCTTTA TCTTTAAATT GCTCGTAAAA AGTGCGACCA ACTTCATTCA 5580 

TTAACTTTGC ATCAATTTGA TGATTTAAAA ATCCATCGAC TTTTAAAATC TTCTCATCAA 5640 

ATTTGTGTGA AACATTTTGC TCTTAAATTG GTGCTAGATA CAAAAAAATC CCCAAACTAA 5760

ATAATAGTTT CAGGGTTTAT GAGTGAACGa ACATGCATAA CGAATTTGTC ATGCAATCAA 5820

TGTAAAAGAA GTTTCATCAA ATAAACTGTG ACCATCATAT AAAATGATAT AAATCACCCA 5880 

CCATGGTTAC AATTTAATGG CTGAAGCTAC TCCTAGTATT GTGTTGTTAC TCATAGTCAT 5940 

GTCGTTCAAG GCAACATGGT AGAAACTTCT AAAGCCATAT TCTTTAGATT ATATGAGTTT 6000 

ATGTAAATTA TTTAACGATA ATAGCAAATT TTCGGCATTT TTTCAATAAC TGCTTAGGTA 6060 

ATCTTTTAAT AGTTTT 6076 
(2) INFORMATION FOR SEQ ID NO: 307:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6136 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 307:




ACCGCCGATT GATAGCTTTA CTGCTGCGAT AAAACTGCCT TCAACAATTG GTGCATCAAC 60

TTTTAACACA CGATGATTAC CATCATACAT TTCAATTGCC ATATCTACAT TCATTTCTGA 120

AGATCCAATA TCGTAAAAAC ACAATGCATC ATCCTCTAAT TTAGTCAAAA CTTCTTGGAT 180

GATATCAAAT GAAGTTCCAA TTGAACCATC TGGTAATCCC CCGATTGGTA TAATATCAAC 240

GTCACCTGCC ATTTGCTTTA ACAAAGATTT TGTACCACTT GcAATTTCTT TACTGTGGCT 300

AACAAGTATA ATTTTAGGCA TTATCTTCAT CTCCAATCAA AGCGTTTAAA ATATAAACCA 360

TACTTTGAGC ACCTGGATCA ATATAACCTT TTGATTCTTC TCCAAAATAT GCAGCTCTAC 420

CTTTCGTTGC TACCATATCT TTTGTATTAT CTGCTAATTG CTGTAAATCA TTGAATGTTA 480

AAGTTTCACC ATTTTTAAGC TTCTCTGCTG CTCGCGCTAC TACATCATAC ATTGTCTTTT 540

CATTTAAAGT AACTTTACCA CGTGATTCAA CCGCTTCGGC AAATGCCTGA ATTAGTGTAA 600

TGAAATCTTG ATTATCCATA TCATCTTTGG TGACTGCAGA CATTTTAACA AAGCTAAAGC 660

CATACAGTGG TCCTGATGCA CCCCCAACAT TTGACATCAA TGCCATACCA GTTGATTTGA 720

ACAATGATTG CATTGAGCTA TCATCAAGTT TGTCTTTAAG ACTACTAAAC CCACGAACCA 780

TGTTAACCCC GTGGTCACCA TCACCAATTG CTCGATCTAA TTCAGTTAAT TCAGATTCAT 840

GTTTTTTAAA CGTTTCTTCT AAATTTAATA AACGTGCTTT CATATCATTC ACTTTCATTT 900

GTGCCACCTC ATAAATGTAT ATTTATTCAT ATTCACTTCT TATTTAAAGT ATTGACTTGT 960
ACCTTGCATG TCTAAAGATG TCATATAATC ACCAACAAAC CATTTAGCAA CATTAACCGT 1080

TCTTGCAGCT AAATTTTGTT GAATATATTT AGTTACGATA TTTAATTCAG ATAACGGCGT 1140

ACCACCCATA CCATTTACCA TTAATATGAC ATCATTGGCA GTAACTTCTT TATACAATTC 1200

GTCTAACAAT GTTCCAACAA TATGATCAAT ATCCTTTACT TCTTCCCTAT GAATACCTTT 1260

TTCACCATGT ATACCAATAC CGATTTCCAT TTTGTCGTCT TCAATATCAA AGCCATATTT 1320

TCCAGTAGTT GGAACAAGCG GTGGCTCAAT TGCCATACCA ATACTTTTAA TTTCAGGTAA 1380

CAACGCTTCT ACACGCGATT TTATCTCTGT TAATGAATAA CCTTTTTCAG CAAGATAACC 1440

GGCAAGCTTA TGAACAAACA CTGTTCCTGC AACACCACGA CGTTGTACTT CGTTTGTCAC 1500

AGCAATGTCG TCACGAACAA TAACAGTTTG AACATTTATA CCTTCCATTT CTGCAAGCTC 1560

TTGTGCCATT TCGAAATTCA TCACGTCACC TGCATAGTTT TTTACAACTA GTAATACACC 1620

ATCACCAGTA TCTACTGCTT TAATAGCTTC TAATATTTTA TCAGGTGTAG GTGATGTAAA 1680

TACTTCGCCA CAAACCGCTG CATCTAGCAT ACCTTCTGCA ACAAAACCGG CATGCGCAGG 1740

TTCATGTCCG CTTCCACCTC CAGAGACTAT TGCTACACCA TGTTCTTTCT TAGCTTTTTT 1800

TACAATAACT GTATTAGCAA TCAGATCTAA CTCTGGGTGC GCAATTAACA ATCCTTCAAG 1860

CATATCAGTT AAAAATGTTT CTTTTTTATT GATTAACTTT TTCATCATGT TGTACCTCCT 1920

TGGTATTATC AGTCATAGTA TAAAACAGAA TAAATGAATG CGCTATCATA AAGAATTAAC 1980

CTGATACCGT TATCAAATAT ATTTTTGATA AGATCCTCTT GATAACTTAA TGGTTCATTA 2040

TTGAAAAAAT AAATAGTCTT GAGTGGCTCA TTAAATAAAA AATGCTATGA AAAGCCTTAT 2100

AATAAAGTGC CTTTCATAGC aATAAGTTGT GTCCATTGAC ACTATACATT TTTCGTTTTG 2160

TACATTAAAT ATAAGAAATA CGGTGCACCA ATAATTGCTA CGACAATACC TGCTGGAACC 2220

CCACCTGGTT GTAATACAAT TTTGCCAATT GTATCAGCTA TAACAAGTAA ACATGCCCCT 2280

ACTAAAATGG CAATTGGTAA AAACAACTGG TGACGTGGTC CAACGATACG TTTGGCAATA 2340

TGCGGACCCA TTAATCCGAT AAACGAAATT GAACCTGCTA cTGCTACAGC AGCAGATGAT 2400

AACATCACTG CGATAAAGAA TAATATTAAA CGTTCTCTGC TTAACCTTAC ACCTAGACCT 2460

CGTGCAATAT TATCACCCGT ATGAATAATA TTTAGTGTAT TCGATTTAAA TAGTAAGTAA 2520

GGAATAATAA TCAACACCCA CGGTAAAAAT GCAATGACAA ATGGCCATTC GTCACCCCAA 2580

ATATTACCTG CAAACCAAGC AGCGATGAAA TCAGATTGCT TATCATCAAA TTTTGACATA 2640

ATTGTAATTG AGCCACCATA TAATGCTGTT TGTAAACCTA CACCTATTAA TACCATACTC 2700

GCAGGTGTAA CACCTTCATT TTTATTAAAA CTGaAAATAA AAATAATCaA TGCAGTGgTG 2760
CCAATTGCAA TAAATAATGC GATTGCAAAT CCGCCACtGC GTTAATACCT AATATACCTG 2880

GTTCAGCTAT TGGATTTTTT GTGACACTTT GCACTATTGC ACCACTAATA CTAAgCtGCG 2940

CCAGCCAAAA TAGTAATCAT CATCCGAGGT AACCTGAAAT CTAATAAGAT TAACTCATCA 3000

ATGGCATCAC CTTGTCCAAT TAAAGTTTTG AAAAATCTTT CAACAGGTAT GTTGTATTCA 3060

CCTGAGGTAA TACTCCAAGT ACAACCTAGA AATAGTAGAA TGCTAAAAAC AGCCAGTGCT 3120

ATCAATTGTC TGCGTTTATT ATTTGAACTA ATCATATTGA GCGTCCTCCT TTTTTAACTA 3180

AATATAAAAA GTAAGGAACA CCGATAAATG AAATGATTGC ACCAACAGGC GCTTCTCCTA 3240

AATATCGTGC TATCACATCG GCAACAAGCA CGAGTATCCC ACCTAACAAG GCTGTTAATG 3300

GTAGAATTTT AGCATAATCA GTTCCAATTA AAAATCTTGC TATATGAGGT ACCATCAAAC 3360

CTACAAATGC AACTTGTCCA GCGATAGCAA CTGCAATACC TGCTAGAATC ATAGCAATAA 3420

TTAAACATAT GCCTCTGATC ATTGTTACAT TTTGACCTAA ACCTTTAGCT AATGATTCAC 3480

CAAGATTTAA AATGGTAAGT TGTTTACTAA TTGTTAATAT AATGAATAAC GCAATACCAA 3540

TTAATGGAAT TGCCCACTTA AGGTGTGACC ATGTTGTGCC TGAAACGCCT CCAGCAGTCC 3600

AAAATGTTAC TGTTTGATTT AGTCTAAAAG CTAATGCAAT ACCTTGACTT AGCGCTGTTA 3660

ACATAGCACT TACTGCTGCA CCCGCTAAAA TAATACGCAT CGGATTAAAT CCATCACGTC 3720

TAGATCGGCC TATCATTAAT ACAATAGCAC CTCCTAGAAT AGCACCTAAA AATCCAGCAA 3780

ACATCAATAT TAAAAATGAA GTGTTTGGTA AAACTGCATA TGTTAATGCT AAAGCAAATG 3840

AAGCACCTGA ATTTAAACCT ATGAGCGCCG GATCAGCAAG ACCATTACGA GTAACACCTT 3900

GTATAATCGC ACCAGAAACT GCAAGCGCCA TACCTACAAT TACTGCTGCT ATATTTCTGG 3960

GAATCCTAAT CTCATTGATG ATGTTTTGCT GTTGATTGCT AGGATTATAA TTAAAAATAG 4020

CCTCTATAAT TGTAGAGGCT TGAATTTTGG CGTCACCTAT TAATGTAGAA ATAAATAGTG 4080

TGATTAGTAG TATCATACTT AAACCTATAA TATAGGATAA AAACTTCAAT GGCGTTGGGT 4140

TCTCTCTATT TGTCATGTTA ATTGTCCTTT TTATCATATT AACTTACTTA ATTAAGAATA 4200

AGCTCTGCGA CATAAGTCAT AAGTTACCAG TAAAGGTTTT CCAGTTTTAG GATCTTTACT 4260

TAAAACAACA TCAATATTAA AAACTTTTTG TAATATTTCC TGTGTTAATA CGTCTTCTGT 4320

TGAACCTGTA GCGATGATAT CCCCTTCTTT CATCGCAATA AGATGATCTG AGAAACGAAT 4380

CGCTTGGTTG ATATCATGAA GAACCATGAC AATTGTACAA CCTTGTTCCT GATTTAGCTT 4440

CTGAACTAAT TCTAGTATTT CTAATTGATG ACAGATATCT AAATATGTTG TTGGTTCGTC 4500

TAAAAAGATA ATATCAGTTC TTTGTGCTAA TGCCATTGCA ATCCAAACAC GTTGTCTTTG 4560

TGCCCAATCA ATtTCTTTCT TAtCCTCAGC AgTTAATCTA CCAAATCCTT TTTGATGTGG 4680

AAAACGACCA TATGAAACTA ATTCCCCAAC AGTTAAGCCA TCTGCTACTT CaGGTGaTTG 4740

aGGTAAAaTG GCtATTTTTt TGcAATCyCy TTCGTAGATt GTGtATGAAT ATTTTCACcA 4800

TCTAAAAATA CTTCGCCTTC TTTAACTGCC AATAAACGTG ACAATGCCTT TAGCAAAGTA 4860

GATTTCCCGC AGCCGTTAGG ACCAATGATT GACGTCACTT TGCCATCTGG TATTTCAACA 4920

TCTAATTTAT TTATAATCGT GTTATCCCCG TAACCAATTT TAACTTGTTG TCCATGCAAA 4980

CGATTCATAA TTTCCCTACT TTCAATAAAA TTCTTTCTGT TTATAAAAAA TAATTTCTAT 5040

TTTTAAATTA TCAATTTTCA AAGACATCCC AATTGATAAT GATTATCATG AACATCATTA 5100

TAACATTTTT CAATCTTATT GACTAACATT ACTTTTTAAA TTGGATAGCT CGATTTGTCA 5160

TGTCTTGTAT ATTACTTTTA TAAAATAAAA AACGCCCACA GATAAGTCTT CATAGTTCAA 5220

AAACTTGTCC GTGGACTTCT ATTTAAGTAT GTGTGCTCAT ACCATTTATT TATTCATCTG 5280

CAAGAAAGCC ATTACCATAG ACATCTCTTA CATCATGAAT TACGAGGAAT GCATCTTTAT 5340

CGATTTGTTT AATTAATCGC TTTGCTTTTG AAACTTGTGT TTTAGAAATA ACAACGTATA 5400

AGACATCTTT TTCTTCACGC GTATAATAGC CATGTCCGTT TAAAATGGTT AAACCTCTTC 5460

CAATTTGCTC GTCTATTGCT TTGGCAAGTT TGTCGGGATT AGTTGAAATA ATCGTCATAG 5520

CTTTTTTAGT GTTTAAACCT TCTATGACAT ATTCCATCAC TTTTGTTCCT ATATAAAGTG 5580

ATATTACTGT TACTAATACT TTATCAAGTG GAATAACTGT AAGTGAAATT GCAACAACGA 5640

TCATATCGAA GAAAAGCAAA GCATATGGCG TGCTTACATC GAGGTATTTC GTTGCAATTC 5700

TCGCCAAAAT TGTTGTACCT GCTGTTGTAC CGCCTGCAAG GATAATTACT CCGATTCCTA 5760

GTCCAACGCT TACACCACCA AAAATGGCAT TCACAATGCT GTTTCCAGTT TCTACTTGCC 5820

ATGATTCTGT TAAACTCAAA AATATTGAAA TAAGAATTGT TACAAGAATA GTTAAGTACA 5880

TACTTCTCTT ACTCAAAAAt TTATAAcCTA TGGCAATCAA TACTGCGTTG ACCAAGAAGT 5940

TAGTGATGGC TGGTGAAATA TGAAACGCAT AATATAAAAT AATTGCTAAA CCTGTAACCC 6000

CGCCTTCACC TAAGTTACCA GAAAT^ATAA ATGCATTTAC ACCTGCAGCA AAGATAAATG 6060

AACCTAAGAC AACTAGTATT AAATCTTTAA CCGTTTTATT CACGAAACCA TCCCCTTTAT 6120

ATATTTATTA GACTAT 6136



(2) INFORMATION FOR SEQ ID NO: 308: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2576 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

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(xi) SEQUENCE DESCRIPTION: i 

SEQ ID NO: 308:

GATATCGTAs CTACTGAAAA GTCATCACCA CCATGGAATG ATTTCTTTAA ACGTTTTAGC 60
TTCTATGCAA TTGCAATTCA ATACTTTGTT GTACAATTTA TCATTACATT ATTCTTAATT 120
TGGTTACCGA CGTATTTAAC AGAAGTATTC CACGTTAACT TTAAAGAAAT GAGCATTAGT 180
TCATTACCTT GGTTATTAAT GTTCTTCTTA ATCTTATCAG CAGGTGCAAT TTCTGACCGT 240
GTATTAGGAT TAGGTCGTTC AAAATTCGTA GCTAGAGGTG TAATTGCAAT TGCAGGATTT 300
ATTGTGTTTG CAGTTTCAAT TATCTTTGCT GTACGCACAG GAAATTTATA TGTAAGTATT 360
TTCTGGTTAT CACTAGGTCT TGGTGGTATC GGTATTTCAA TGGGTATGAG TTGGGCTGCA 420
GCAACTGACT TAGGACGTAA CTTCTCTGGT ACAGTATCAG GGTGGATGAA CTTATGGGGT 480
AATATAGGTG CATTAATCAG TCCGCTATTA GCAGGTCTAT TCGTAGAACA TTTGGGTTGG 540
ACAATGACAT TCCAATTGTT AATCGTTCCA GCAGTAATCG CTGTGATTAT GTGGTTCTAT 600
GTGAAACCAG ATCAACCTTT AATTGTTAGT GATGATAAAG CAATAGAAAA ATAATTTAAA 660
CAAGCAGTAA GCTTTCACAT AGTTGGGGCT TATTGCTTTT TTTGCGTTGA AATTGAAACT 720
TTTTAAAACA GATATGGTTT AAGATGAAAA TGAAGTTATT GAAATGATAT ATGTAAAGAA 780
ATAAGGTTTT AAAACATTAG TCAGGTAACG CTTGTaAAAG TACATATAAA TTTTAACTAG 840
CGCAAAGGTG GGCGACCAAA GtTcAACGAT GTTAAATAAC aTTAGrAATT AATTTTAATT 900
GGACTTTAAA AGTTTTTAAA TTTAGATAAT TGAGCATAAG GTGTTATAAT GACATATGTT 960
GCGTAATTAA AATTTATAGC AACAAATTCA TTTTAACTAT GCTAATAAAA AGATTATGGA 1020
AATATTTTGA CAAGGAAAGG AGAAGTCGAA ATGACATCTT TTTGACATCA CTCATAAAAA 1080
TCAACCGACT TAACTTAGAC TTTTATAAAG GTGTAAGACA GGGACTGTTA ATGATTATTC 1140
CTGCAATAAT CGGTTACTTA TGTGGTAATT TCCAATTTGG ATTATTAGTT GCAACCGGAA 1200
CACTAGCCCA TATTTATGTT TTTAAAGGTC CGTCGCGATC TAAGCTGCGA ACTGTAATAA 1260
TTTGTAATTT AGCGTTTGCA ATATGTATGA TGCTTGGTAC GCTAACAGCC AAAACGCCAC 1320
TCGTTTTTGG AATGACATTA TTAATTGTTA CGGTTATACC ATTTTATATA TTTACTGCCT 1380
TAAAAATAGC TGGACCGTCA TCGACATTCT TCATTGTGAC ATTCAGTCTA CCCATTAACT 1440
TACCTATAGC TCCCGAAGAA GCATTATATA GAGGCTTTGC GATTTTAGTA GGCGGTATAC 1500
TTGCCACTAT GATGGTGTTA ATCACGATCG TATTTTCTAA AAACAAAGCT GAAGAACAAG 1560
CAATTCAAAA TGATTTTAAA CTCATATCTA AGTTGTTACA CACTTATAAT GATAAATCTG 1620
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TCACTTCTAC TTCAAGTAAC GATAAATTAA GTAGACGTTT CCAAAAATTA TTATTATTAC 1740 

ACACATCTGC CCAAGGGATT TATTCTGAAC TGTTAGAGTT GAACGCTAAA CAAATTCGAC 1800 

CATTGCCAGA TGAGTTAATT GAAATGATGG ATCATATCAT TGCACAACTA GATAATAGTG 1860

AGGAAAATGT AAGATATTGG CGAAAAGAAG TGACAGTAAC AGAGGAATTT CAAAATTTAT 1920 

TCAACCATAT ATTGAAAATT GATGAAATGG TGCATGGAAA TGAAGCGCGT ATTGCGTATG 1980 


AAGCAGACAT GCGAAAACCT TTATATAGTA AACGCATTTA TCaAAATTTA ACaTTAGACT 2040 

CtATkGTTTT TAGAAATACA TTGAGATATA CAGCGATTAT GATGATAGCG ATATTTATTG 2100 

CGTTAATGTT TGATTTTGAA AAAGCATACT GGATACCGTT ATCTGCACAT ACAATATTAC 2160 

TAGGAACATC AACTATACAT GCAATCGAGA GAGGTATGGC ACGAGGTTTA GGTACTATTT 2220 

TAGGTGTGTT AGTACTTTCA GTCATATTGT TGTTTTCAAT ACCAACACCT GTTGCAGTAA 2280 

TTTTAATGGG CATTGCAGCA TTGTTTACTG AAGCATTGGT GGGAGCAAAT TATGCGATTG 2340

CAGTAGTTTT TATTACAATA CAAGTTATTT TAATGAACGG ATTAGCATCA CAGAATTTAA 2400 

CAATTAACAT TGCGTTTCCA AGAGTTATTG ACGTTGCAAT GGGTATTGTG ATTGCAATCA 2460 

TAGGTTTATT TGTCCTTGGA CAACGTACCG CATCCGCATT GCTTCCTAAT GTAATGGCTG 2520

AAGTTGTTCG TAAAGAAGCA ACGCTCTTTC ATTATTTATT TTCTGAAAAT CAATAT 2576 
(2) INFORMATION FOR SEQ ID NO: 309: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 309:

CAAGCACATT ACGATGGTCT AATACTTTGC TAATAATTTC TTCTTGTCCT GGTCGAAACG 60
TTTCATACCC AAAGTAATGC GATAATGTTT GTTGCATCAT AAATTGACCC CTTATTGTTG 120
TTCTTTAATT TCTTCTAACT CACTCCATCT TGTGATGTCT AAATCATATT GAATTTCAAG 180
TTGTTCTTTT TCTTCGTTTA ATTCTTTAAT TTTCCCATAA TCTGCACTTG CCTCAATCAT 240
GAGCACATCA ATTTCTTCCA TTCTTACTTC CGCTTGTTCT ATGCGTTTCA TCAATTGTTC 300
ATATTCTAAT TTTTCTTTAT ATGATAAACC ATTTTTCTTA CGTACAGTTG TAGAAGATTT 360
AGATTGTTGC TTCAATGTGG ATTTATTTTT ATCTAATGAT TTTTTATAAC TTTCATAATC 420
TTCAAAAGTT CCGATAATCT TTTCCATCTG ACCATCATGA ATAAACCAAT ATGACTGTGC 480
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AATATAATCT TCAAGTATTG TTAAAGTCTC AGTATCTAAA TCATTTGTCG GTTCATCTAA 600 
CAACAGAACA TTTGGCTGGT GTACGAGTAG ACGTAATAAA TACAAACGCT TTTGCTCTCC 660 


ACCAGATA 668 

(2) INFORMATION FOR SEQ ID NO: 310: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 12173 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 310:
CATCAGTTTA TTTTGAAAGG CAATGCGATC ATTTCATGTA TTTATGTTGT TTGAAACATC 60
GTTAGATAAC AATAGTGATA TTGCACATTT TAAGCTGAAG ATGGAAAAAT ATTCGATAAT 120
CAAATAAACA ATGAATTTTA GAAGGTACAA TGACGTTCAC TAATTTAAAT ATAGCTGAAT 180
GTGTTGGTGA GTGATGTTCA CTATAGATTT ATATTAATAT ACAAAGACAA AGGTTGTTAA 240
TTTTTATTAA GCGTTAGGTT GAATGTATGA GAATTTTAGA TTTATAATAG AAGATAGAAA 300
CGAAAATTTT TCTTAAAAGC AGTAATGTTG ACTCAAAATA AGCTATAATA ATGACACTTA 360
TTTAATTGAT TAACATTTGC TAATAAATAT CAATATAGAA TATAACTTTC CAATAATGAC 420
TGAGAAAATC GAAATGTCAG TCTCGAATCA TATAATTAGA AAATTGATTA TTTTCTGTCA 480
ATTTAGGGTT GAACTATACA TATGATATTG TTAGAATATT TTTTAACATT ATATTTTATT 540
GCTTTAAAGT GGAATATACT TGAAATAATT AGTAGAGGTG AGTAAGGATG AGTAATAAAT 600
TAGAATCATA CAGAAGTGAG ATTGTATCAC TGAATCATCA AATTTTAGAC TTATTATCTA 660
AAC<JTGGTGA ACTAGCACAA AAAATTGGGG AAGAAAAATT AAAACAAGGT ACACGTATTT 720 
ATGATCCACA ACGTGAAAAA GAAATGCTTA ACGACTTAAT CGATAGTAAC AAAGGACCAT 780
TCAACGATAA TACTATTAAG CAATTATTTA AAGAAATTTT CAAAGCCTCT ACAGATTTAC 840
AAAAATCTGA AAATGAAAAA CATTTATATG TATCACGTAA GTTGAAACCT GAAGATACGA 900
TTGTAACATT TGATAATGGG GGCATTATAG GAGACGGCAA TAAATCATTT GTATTTGGGC 960
CATGTTCAGT TGAATCATTT GAACAAGTTG AAGCTGTTGC TAAAAACTTA CATGCTAAAG 1020
GTGAAAAATT TATTCGTGGC GGTGCATTTA AACCACGTAC ATCACCATAT GATTTCCAAG 1080
GCCTAGGTGT TGAAGGACTT AAAATACTTA AACAGATTAA AGATAAATAT GATTTAAATG 1140
TTGTCAGCGA AATCGTAAAT CCAAATGATT TTGAAGTGGC TGATGAGTAT TTAGACGTAT 1200
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AAAAGCCTAT TCTATTAAAA CGTGGTTTAT 
CTGCTACAAT CGAAGAGTTT GTTTATGCAG 1320
CTGAATACAT TGCTTCACAA GGTAATCAAA ACATTATTTT ATGTGAACGT GGAATCCGAA 1380
CTTATGAAAA GGCGACACGT AACACTTTAG ATATTTCAGC AGTACCAATT TTAAAACAAG 1440
GTACACACTT ACCAGTCATG GTAGATGTTA CGCATAGTAC AGGTCGTAAA GATATCATGT 1500
TACCAACTGC GAArsAgCAT TAGCAGTTGG TGCTGATGGA GTTATGGCTG AGGTGCATCC 1560
AGATCCATCT GTTGCACTTA GTGATGCGGG TCAACAAATG GATTTAGATG AATTCCAAGC 1620
ATTTTATGAT GAATTAAAGC CTTTAGCTGA TTTATATAAC GCTAAAAAGT TAAAATAATA 1680
TTCCAAGGAA ACTATAGACT ACTTAACTAA TATGTCATGT TGAAGTAGAA TATTATCTTT 1740
GAATCGACAA TTTTAAACTT ACAGCCATTC TAAGAGTATA TTACTTTTAG AGTGGCTATT 1800
ATTTTTTGTA TAGAAATAAA GGTATACTGC ACTTAACGAT TGTTATAATA CTTCGACACT 1860
TGTTCAATTT CACAATTATT AAAGATTATG ACTGATAGCA GTAATTAAAA TTATAACTAT 1920
GAATTATCTG TAAAATATAA TAGATTCACA CATTTGTTGC TGAAATGTGA ACATTTTTCA 1980
ACAAATGCAA TTGATATTTG AAAAGGCTTT CTCAAAACAT TACAATTAAA AATGAAAAAA 2040
GTTTATATAA AATTAAAATA TATCGTTCGT TATCATTTAG CGTTTGTTTT TATTTCAAGC 2100
TTTTCGCTAA ATTTTTCCAA ACAAAAATAT GTTACTGTAA ATTAAAATAT GGTAAACTAT 2160
GAAAATGAAA TGAAAACATG TTATTATAAT GAATAAAACG TTTACAAGGA GGAAATTATG 2220
ACAGTTACTA TATATGATGT AGCAAGAGAA GCGCGTGTCT CTATGGCCAC AGTGTCGCGT 2280
GTTGTTAATG GGAACCAAAA TGTTAAAGCA GAAACTAAAA ATAAAGTTAA CGAAGTCATT 2340
AAGCGTTTGA ATTATCGTCC AAATGCTGTT GCTAGAGGTT TAGCTAGTAA AAAGACAACA 2400
ACAGTAGGTG TGATCATTCC AGATATATCT AATATCTATT ATTCACAACT TGCTCGTGGA 2460
CTTGAAGATA TTGCAACAAT GTATAAATAT CACTCAATTA TTTCAAATTC AGATAACGAT 2520
CCTGAAAAGG AAAAAGAAAT TTTTAATAAC TTATTAAGTA AACAGGTTGA TGGTATTATT 2580
TTCCTTGGTG GTACAATTAC TGAAGAAATG AAAGAATTGA TAAATCAATC ATCTGTACCT 2640
GTAGTAGTAT CAGGAACAAA TGGTAAGGAT GCACATATAG CATCAGTTAA TATTGATTTT 2700
ACTGAAGCTG CGAAAGAAAT TACGGGAGAA TTAATTGAAA AAGGCGCTAA ATCATTTGCT 2760
TTAGTAGGTG GAGAACATTC TAAAAAAGCT CAAGAAGATG TTTTAGAAGG TTTAACTGAA 2820
GTGTTAAATA AAAATGGCCT TCAATTAGGT GATACATTGA ATTGTTCTGG TGCTGAAAGT 2880
TATAAAGAAG GCGTAAAAGC TTTTGCCAAA ATGAAAGGCA ATTTGCCAGA TGCCATTTTA 2940
TGTATCAGCG ACGAAGAAGC AATTGGTATT ATGCATAGTG CAATGGATGC TGGTATTAAA 3000
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10 






CCACAACTTT CTAGTGTTAT TCAACCATTA TATGATATCG GTGCAGTAGG GATGCGCTTA 3120 

TTAACAAAAT ATATGAACGA TGAAAAGATA GAAGAACCAA ATGTAGTTTT ACCTCACAGA 3180 

ATTGAATACC GAGGAACTAC AAAATAAATT CACAAAATTA GGCATTCATC TAACGACCCA 3240 

AATTATATGG GTGTTGGAAG AATGCCTTTT ATTTATCTTT TAAAATCGTT GCAGATTAGG 3300 

TTACTTATTG ACGAGTAGAT TCGTACCAAC TCGCTATATG TAAAGCTAAT TTTTTATTTT 3360 

TTTCACTAAT TTCTTTTGTG CGGGGGACAT AGGTATAATC ATTTAAACGA TCTTCCCATC 3420 

TTTTAGGTAA TAATTCAGAT GAATAATGTT TCCATTTATT AATCCATTCT AACGGTAAAT 3480 

AACCACTTTG AATTGGTTGA TCAATTAAAC TTAAGAATAC ATGACTCCAT GCACGTGGTA 3540 

CGACTCTCCA AATATTGTAG CCTCCGCCAC CAAACATAAT TACCTTTCCA TTCGTATAAG 3600 

AATCAGCTAA ATATTTTACA AAATATGGAA TTTCATATAA TGAATGTAAC GTACAATTTA 3660 

GATGAGTTAG TGGATCACGA TAATGTATAT CGACACCATT TACGCTTAGA ATAATATCAG 3720

GTTTAAAACT CTTTACGACA GGCTCAACTG TTAATTTAAA ACACTCCAAA AATGATGCAT 3780

CTTCTGTATA CGGTTCAAGT GGGACATTTA CAGTGTGTCC ATAGCCGATA TCTTCACCGC 3840

GCTCAGTATA GTGACCAGAG CCTGGGAAAA GAAATTTTCC GGTTTCATGG ATAGAATAAG 3900

TAGTAACATG GTTATCGGCA TAGAAACTCC ATTGTGTACC ATCTCCATGA TGTGCATCGG 3960

TATCTATGAT TAAAACGCGT TGATTGTATT CTTTAGCTAA GTATTGTGCG GTAATTGCAA 4020

TATCATTGTA TATACAAAAA CCACTTGCTC GACCAGGTTG AGCGTGATGC AAACCACCAC 4080 

CTAAGTGACA ACCATTTAAT ACTTTGCCTG ACATAATAAG ATCTGCTAAA GTTAAAGCGC 4140 

CTCCAACAAT TGTGGCACTA TGGCGGTGCA TATGCTTAAA TTGACCATTC TCTTCATCAT 4200 


TTAATCCATA TTTCTTAGCC TCATCTTCAC TGATAATGCC ATGTGAAGCA TGCTTAATAG 4260 

CTTCGACGTA ATCATATTTA TGAATTAACA TTAATTCGTC ATCTGTTGCA ATTCTAGGTT 4320 

GTACTATTTG TTCTGGAGAC AATAAATTTG CATTCAAAAG TAGCTCTGTT GTTAATTTTA 4380

AACGCATTTG ATTGAAGGGA TGTTGGTCAT GAAATCGATA TTGTAATAAC TTATCTGAAT 4440 

AAACATATGC AGTTTTTGAT GAATGTTGTT GCATATAATC CCTCCGATAT TCCAAAAATT 4500 

AAAAGAAAAA CCGATTCATA TAACGAATAT CATCAAACGC TTGTTGCTGT TCTAATGTAA 4560

TGTTTTTGCC AATTCTTGCC ATTAAACAAT TAGCTGGATG ACTTGTTATT TCTGGATCAT 4620 

CTGTAGCGAA TATTTCAAGT CCACCAGTTG CCATTAACCG CTGCATTAAT TTTTTATAGT 4680 


CAAATACATC TAACTTTGAA TTTTTTAAAT CCCAATGCCA GTAATATTCT GTAGTTATAA 4740 

CGATATAATT CTCGAATTCT GGTGTAGAAA GGCTAAGTTG TATCAGCTTT TCTGCAAGTT 4800
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TACCTGTAGA CCAGCGCTCA ATTTCATCAG GATAGTGGAA AGTGACATAA CCCACAATGA 4920 

GTTGATCTTG GCGAGCAACA TAAATTCTAC CTTCTGGTAA TGTTGTAATT TCTAACAAAG 4980 


CTTTATACTG ATCTTCAGCA TCTCTAAATG CGGTTAAATG CGCATCGAAA GTAAGCGCTT 5040 

TCAAATCTtC GTGTGTTAAA GGACCTTCAA TAACAAATTG CTTGTCATGA ATGTAATAAT 5100 

CTTCGGATTG ATACGTCTTT AAATGATTCA TATTTTCAAC TCCTCAATCG ACGTCGTGAT 5160 


TGTATTAAAT TCATTATATA GAAaATTTAC AATAATTAAT ACTAGAAAAA GGATAAAAGT 5220 

AAAAATTTTG AATAATTAGA AATGTTATGT ATAATATTGA GAAAGAAAGC GTTTTCACAT 5280 

AACAAAGGGG GAGTTTCAAA TGAAAGTCGA AGTTTATAAA GGAGCGCAAG GTAAACATAA 5340

CCTTAAAGAT TATGAAGAAA CATATAATAC TTTTGATTGG AAAGACGTAG AACAAGCATT 5400 

TTCTTGGAGT GAAACTGGAA AAATGAACAT GGCATATGAA TGCATAGATC GCCATGTAGA 5460 

TCAAGGATTA GGGGATAAAA TAGCGTTAAA TTACAAAGAT GAGCACAGAA AAGAATCGTA 5520

TACTTATAAA GATATGCAAC GGTTATCTAA TAAAGCAGCG AATGTTTTGT CTGAACATGC 5580 

AGAAGTTGAC AAAGGTGACA GAGTATTTAT ATTTATGTCG CGTACACCTG AACTATATTT 5640 

TGCGTTGTTA GGTGTTTTAA AAATTGGTGC AATTGTTGGG CCGTTATTTG AAGCATTTAT 5700 

GGAAAAGGCA GTTGCGGATA GATTAGAGAA CAGTGAAGCT AAAGTGTTAA TTACTAATAA 5760 

GGCATTGTTA CCTCGAGTAC CTGTAGATAA ATTACCAAAC TTGAAAAAAA TTGTTGTCGT 5820 


AGATGAGGAT GTAGAAGACA ATTACATAGA CTTCATTAGT TTGATGGAAA CTGCTAGCGA 5880 

TGAATTTGAC ATTGAATGGT TAAAGTCGGA TGATGGTTTG ATTTTACATT ATACATCAGG 5940 

TTCTACTGGG CAACCTAAAG GtGTATTGCA TGTTCAACAA GCAATGTTAG TGCACTATAT 6000

TTCTGGAAAA TATGTATTAG ATTTACAAGA AGATGATGTT TATTGGTGTA CAGCAGATCC 6060 

AGGTTGGGTT aCAGGAACAT CTTATGGTAT TTTTGCACCA TGGTTAAATG GCGCTACAAA 6120 

TTGTATAGCT GGTGGTCGCT TTTCGCCAGA ACAGTGGTAT AGTATGATTG AAGATTTTAA 6180

AGTGACGATT TGGTATACGG CACCAACAGC TTTAAGAATG TTAATGAGTG CTGGTGACGA 6240 

TATTGTTGAG AAATATGACT TGTCATCGTT ACGTTCGATT CTATCAGTAG GTGAGCCTTT 6300 

AAATCCTGAA GTTATAAAAT GGGCGAAAAA AGTATACGGT TTAACGGTGT TAGATACTTG 6360

GTGGATGACA GAAACAGGTG GACATATGAT TGTTAACTAT CCAACGATGG ACGTcAAGCT 6420 

TGGCTCAATG GGCAAACCAT TACCTGGTAT TCAAGCTGCA ATTATCGATG ATGCAGGGAA 6480 


TGAATTACCA CCAAATCGAA TGGGCAACCT TGCTATAAAA AAAGGCTGGC CATCAATGAT 6540 

GTATCGTATC TGGAAGAATC CAGAAAAATA TAAATCATAT TTTATTGGAG ACTGGTATGT 6600 

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TGATGTAATT ATGACAGCTG GTGAACGAGT TGGACCATTT GAGGTTGAGT CTAAATTGGT 6720 

TGAACACGAA GCAGTTGCCG AAGCAGGAAT TATTGGTAAA CCTGATCCGG TTCGCGGTGA 6780 


AATAATTAAG GCGTTTGTTG CACTGAGAAA AGGATATGAA CCAACAGACG AATTAAAAGA 6840

AGAAATTCGT ATATTTGTTA AAGAAGGTTT GTCGGCACAT GCAGCACCAC GTGAAATCGA 6900 

ATTTAAAGAT AAATTACCTA AAACACGGTC AGGTAAAATT ATGAGACGTG TATTAAAAGC 6960 


TTGGGAATTA AATTTAGATG CTGGGGATTT AAGTACAATG GAATAATGAC ATGAATGTTA 7020 

TTGAAGATTT TTTTCGAAGA ATAAAGGGTG ACAACATATT TCATGTCAAT GTTTAAATAA 7080 

TCGTTTACTT TACGATAAGC AATATAAAGA ACTGTTAACT TGTGTCATAT CATTTCGTAG 7140

AAAGCATTTG AAAATGATGA CATAACAATA ATGGCATATC TTTATATTGC TTTTTATTTT 7200 

TAATATGATC TTTGGAAGAT GATTATTTTA AATAATAGAA AAATATAGTT ATCAATAGTA 7260 

TCAAGCGCTA AAAGTTGTAT AATACAAAAC TTTAATAAGT GAATTTATTG CAAAAATGAA 7320

AGCGCTAACC CGATTTAGTC GACAAGTTTT TAACAGTTCG TTATTATATG AATGTAAGTA 7380 

AAAATTTCTT AGCTACAACT TACATATTAT AAATGCATAA ATTAAACAAA AAGGGGCGAA 7440 

AAAAGTTGAC TCATTTATCA GATTTAGATA TTGCGAATCA ATCAACACTA CAACCAATTA 7500 

AGGATATTGC TGCATCAGTA GGTATTTCAG AGGATGCATT AGAACCTTAT GGTCATTACA 7560 

AAGCTAAAAT CGACATTAAT AAAATTACGC CAAGAGAAAA CAAAGGGAAA GTTGTTTTAG 7620 


TAACTGCGAT GAGCCCAACA CCAGCTGGTG AAGGTAAATC AACGGTTACA GTTGGTTTAG 7680 

CTGATGCATT CCATGAGTTA AATAAAAACG TTATGGTTGC ATTAAGAGAG CCTGCTTTAG 7740

GACCAACATT TGGTATCAAA GGTGGTGCGA CTGGTGGTGG TTATGCGCAA GTCTTACCTA 7800

TGGAAGATAT CAACTTACAT TTCAACGGAG ATTTCCATGC GATTACAACT GCAAATAATG 7860

CATT&TCTGC GTTTATCGAT AATCATATTC ACCAAGGTAA CGAATTAGGA ATCGATCAAA 7920

GACGTATTGA GTGGAAACGT GTATTAGATA TGAATGATCG TGCACTTAGA CATGTAAACG 7980

TTGGGTTAGG TGGACCTACA AATGGTGTAC CACGTGAAGA TGGCTTTAAT ATTACAGTAG 8040 

CGTCTGAAAT TATGGCGATT TTATGTTTAA GTAGAAGTAT TAAAGACTTA AAAGATAAAA 8100 

TTAGTCGTAT TACTATTGGT TACACTAGAG ATCGCAAGCC AGTTACAGTT GCAGATTTAA 8160 

AAGTGGAAGG TGCACTTGCA ATGATTTTAA AAGATGCAAT AAAACCAAAC TTAGTACAAT 8220 

CAATTGAAGG GACACCTGCA TTAGTTCATG GTGGACCATT TGCGAATATC GCACACGGTT 8280 

GTAACTCAAT TTTAGCAACT GAAACAGCAC GTGATTTAGC TGATATCGTT GTAACGGAAG 8340 

CTGGATTTGG TTCAGACTTA GGCGCTGAAA AATTCATGGA CATTAAAGCG CGTGAAGCAG 8400 

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10 






GTGTAGCGAA AGATAATTTA AAAGAAGAAA ATGTAGAAGC AGTAAAAGCA GGAATTGTTA 8520

ATTTAGAGCG TCATGTTAAT AATATTAAAA AATTCGGTGT AGAACCGGTT GTTGCAATTA 8580 

ATGCATTTAT ACATGATACC GATGCAGAAG TAGAATATGT AAAATCTTGG GCTAAAGAAA 8640

ATAACGTACG AATTGCCTTA ACTGAAGTTT GGGAAAAAGG TGGTAAAGGT GGCGTTGACT 8700 

TAGCAAATGA AGTATTAGAA GTCATTGATC AACCTAATTC ATTTAAACCT TTATATGAAT 8760 

TAGAATTACC ATTAGAGCAA AAGATTGAAA AGATTGTGAC TGAAATCTAT GGCGGTTCAA 8820 

AAGTAACGTT TAGCAGTAAA GCGCAAAAAC AATTAAAACA ATTTAAAGAA AATGGTTGGG 8880 

ATAATTACCC AGTATGTATG GCGAAAACAC AATATTCATT CTCAGATGAT CAAACG7TGT 8940 

TAGGTGCACC ATCAGGATTT GAAATTACAA TTCGTGAATT AGAAGCGAAA ACAGGTGCAG 9000 

GATTTATCGT AGCGTTGACA GGTGCAATCA TGACTATGCC TGGTTTACCT AAAAAACCAG 9060 

CAGCATTAAA CATGGATGTT ACTGATGATG GTCATGCAAT TGGGTTATTC TAATAAATCA 9120 

TGTCAATTGT TTAATAAAGA TAAGTAAATA GTTTAATAGA CCGGACTGTT GGAGATGCAT 9180 

TATTTCAGCA GTTCGGTTTT TTGCTGTGCT AAAAATAGAT TCAATTTGGC GAATCTAACG 9240 

ACAATGTTTG AAGGTGGTTA ATTAATGTAT ATGAAGATAA AAAGTGGGCT TGAAGAATAG 9300

GAAAGCGATG CAATGAATAT TCCATATTAA AAAAAATTAA TAAAATAGGT TGCAATATTT 9360 

AATTGGGATG CGCTACAATT AACACTAATA ATTGATATTG ATAATTATTA TCAATTAAAT 9420 

ATAATCTTAT AGGAGTTGTT AACAACATGA ACAAACATCA CCCAAAATTA AGGTCTTTCT 9480

ATTCTATTAG AAAATCAACT CTAGGCGTTG CATCGGTCAT TGTCAGTACA CTATTTTTAA 9540

TTACTTCTCA ACATCAAGCA CAAGCAGCAG AAAATACAAA TACTTCAGAT AAAATCTCGG 9600 

AAAATCAAAA TAATAATGCA ACTACAACTC AGCCACCTAA GGATACAAAT CAAACACAAC 9660

CTGctACGCA ACCAGCAAAC ACTGCGAAAA ACTATCCTGC AGCGGATGAA TCACTTAAAG 9720 

ATGCAATTAA AGATCCTGCA TTAGAAAATA AAGAACATGA TATAGGTCCA AGAGAACAAG 9780 

TCAATTTCCA GTTATTAGAT AAAAACAATG AAACGCAGTA CTATCACTTT TTCAGCATCA 9840 

AAGATCCAGC AGATGTGTAT TACACTAAAA AGAAAGCAGA AGTTGAATTA GACATCAATA 9900 

CTGCTTCAAC ATGGAAGAAG TTTGAAGTCT ATGAAAACAA TCAAAAATTG CCAGTGAGAC 9960 

TTGTATCATA TAGTCCTGTA CCAGAAGACC ATGCCTATAT TCGATTCCCA GTTTCAGATG 10020 

GCACACAAGA ATTGAAAATT GTTTCTTCGA CTCAAATTGA TGATGGAGAA GAAACAAATT 10080 

ATGATTATAC TAAATTAGTA TTTGCTAAAC CTATTTATAA CGATCCTTCA CTTGTAAAAT 10140

CAGATACAAA TGATGCAGTA GTAACGAATG ATCAATCAAG TTCAGTCGCA AGTAATCAAA 10200 
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10 






AGGCAACGAC CAATATGAGT CAACCTGCAC AACCAAAATC GTCAACGAAT GCAGATCAAG 10320 

CGTCAAGCCA ACCAGCTCAT GAAACAAATT CTAATGGTAA TACTAACGAT AAAACGAATG 10380 

AGTCAAGTAA TCAGTCGGAT GTTAATCAAC AGTATCCACC AGCAGATGAA TCACTACAAG 10440

ATGCAATTAA AAACCCGGCT ATCATCGATA AaGAACATAC AGCTGATAAT TGGCGACCAA 10500 

TTGATTTTCA AATGAAAAAT GATAAAGGTG AAAGACAGTT CTATCATTAT GCTAGTACTG 10560

TTGAACCAGC AACTGTCATT TTTACAAAAA CAGGACCAAT AATTGAATTA GGTTTAAAGA 10620 

CAGCTTCAAC ATGGAAGAAA TTTGAAGTTT ATGAAGGTGA CAAAAAGTTA CCAGTCGAAT 10680 

TAGTATCATA TGATTCTGAT AAAGATTATG CCTATATTCG TTTCCCAGTA TCTAATGGTA 10740 

CGAGAGAAGT TAAAATTGTG TCATCTATTG AATATGGTGA GAACATCCAT GAAGACTATG 10800 

ATTATACGCT AATGGTCTTT GCACAGCCTA TTACTAATAA CCCAGACGAC TATGTGGATG 10860 

AAGAAACATA CAATTTACAA AAATTATTAG CTCCGTATCA CAAAGCTAAA ACGTTAGAAA 10920 

GACAAGTTTA TGAATTAGAA AAATTACAAG AGAAATTGCC AGAAAAATAT AAGGCGGAAT 10980 

ATAAAAAGAA ATTAGATCAA ACTAGAGTAG AGTTAGCTGA TCAAGTTAAA TCAGCAGTGA 11040 

CGGAATTTGA AAATGTtACA CCTACAAATG ATCAATTAAC AGATTTACAA GAAGCGCATT 11100 

TTGTTGTTTT TGAAAGTGAA GAAAATAGTG AGTCAGTTAT GGACGGCTTT GTTGAACATC 11160 

CATTCTATAC AGCAACTTTA AATGGTCAAA AATATGTAGT GATGAAAACA AAGGATGACA 11220 

GTTACTGGAA AGATTTAATT GTAGAAGGTA AACGTGTCAC TACTGTTTCT AAAGATCCTA 11280 

AAAATAATTC TAGAACGCTG ATTTTCCCAT ATATACCTGA CAAAGCAGTT TACAATGCGA 11340

TTGTTAAAGT CGTTGTGGCA AACATTGGTT ATGAAGGTCA ATATCATGTC AGAATTATAA 11400 

ATCAGGATAT CAATACAAAA GATGATGATA CATCACAAAA TAACACGAGT GAACCGCTAA 11460 

ATGTACAAAC AGGACAAGAA GGTAAGGTTG CTGATACAGA TGTAGCTGAA AATAGCAGCA 11520 

CTGCAACAAA TCCTAAAGAT GCGTCTGATA AAGCAGATGT GATAGAACCA GAGTCTGACG 11580 

TGGTTAAAGA TGCTGATAAT AATATTGATA AAGATGTGCA ACATGATGTT GATCATTTAT 11640 

CCGATATGTC GGATAATAAT CACTTCGATA AATATGATTT AAAAGAAATG GATACTCAAA 11700 

TTGCCAAAGA TACTGATAGA AATGTGGATA AAGATGCCGA TAATAGCGTT GGTATGTCAT 11760 

CTAATGTCGA TACTGATAAA GACTCTAATA AAAATAAAGA CAAAGTCATA CAGCTGAATC 11820 

ATATTGCCGA TAAAAATAAT CATACTGGAA AAGCAGCAAA GCTTGACGTA GTGAAACAAA 11880 

ATTATAATAA TACAGACAAA GTTACTGACA AAAAAACAAC TGAACATCTG CCGAGTGATA 11940 

TTCATAAAAC TGTAGATAAA ACAGTGAAAA CAAAAGAAAA AGCCGGCACA CCATCGAAAG 12000 
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CATGGTGGGG CTTATATGCG TTATTAGGTA TGTTAGCTTT ATTCATTCCT AAATTCAGAA 12120 

AAGAATCTAA ATAATTAnCT AAATATAGCA TATGTATGAT TAACTTTGTA GAC 12173

(2) INFORMATION FOR SEQ ID NO: 311:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1316 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 311:





CAACATTAAT ATTGATATTA AATCTTCCTG GATAACGTGC TTGTTCGAGT GATAAGTATG 60
CACGCACTTG ACTTAACTCT TTATCTAAAG TAATCGTATG TTGCTTAGAG CCTTGTAAAT 120
TCGCTCTGAA AAAATAACTC AATTCTAATA GTAACTCTCG TGCCTTTTCG CTATTTATTC 180
TAACTAAAGC TGAGATCGTG TTAATTGAAT TGAAGAAAAA ATGTGGACTC ACTTGTGCCT 240
GTAATGACTT AATCTCAGCA TCTTTCAATA ACTTACTTTG            TCACCAAGTT 300
CAATTTGGCT ACTAAAAATA TTTGCCAATC CTTCTGCAAG TTGACGTTCC ACAAAAGTTA 360
AATCATTAGG GTTTGTAAAA TACATCTTCA ATGTACCGAC GATAGAACCA TGCATCTCAA 420
GTGGTATCAC GATAGCTGCT CTAAGCGGGC AATTCGGATG ACTACAACCA ATCTCTTCTT 480
TAGTATGAAC TTCTTTCAAC TTTCCTGATT TCAATACATC TTTAGACAGA CTTGTTAATA 540
TTTCATTTGT TGGTATGTGA TGATCACTAC CTGCACCTAC ATGCGATAAG ATTTCATTTT 600
TGCTTGTAAT TGCTACGGCA GATACTTTCA TTAAATTTTT AATAATCATC GCAATTTGCT 660
GTGCCGATTC TCTATTCAAT CCTTCTTTAA AATACGGCAA TGTCTGGTTC ATCAATTGCA 720
GTACATCATG TGTTTGAACA GCCTTCATTT GCTCCTCTTG CTTTAATGTT GAAATGATAA 780
TAGACATAAA AATCGCCGTA CCAACGCTAT TAACAATAAT CATTGGTAGT GCAATTAATG 840
ATATGAGGTC AACCGCATAT GCTTTGTCGT GGGAAAATGT TAAAATGCTC AACATTTGAA 900
TCATTTCCAT AACAATTCCA ATCATGGCAC TTTTCGCAAT ACTCGGGTAA CGCTTfcCGTC 960
TTTGAGCTTG TAAGCCAAAA TAACCAGCAA TTATACCAAT AAATATAGAT GAGATAAGAT 1020
AAACTTGTGC ATCCGCCCCA CCCATATACA CTCTGAAAAT ACCTGAAATA ACGCCAACAA 1080
ATAGACCTAC AAAAGGGCCA CCAACTAATC CTGCGACACC TATCGTTAAT ACACGTGTGT 1140
TAGCTAAAGA TACATCATCA TCTAAACGGA AGTACACACT TCCTGACAAA CTATGTTGAT 1200
GATCGATGAC GATACCAGTT AAATTAGACA TTAAGGCAAA CAAACTGAAA ATAATACATA 1260
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(2) INFORMATION FOR SEQ ID NO: 312: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7972 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 312:

TATAAATATT ATTTTATTAT CGTTTATCGC TATTTGTGTT ACATAATCAA AACCATAAAT 60 

TCTTACTCAT TCAGATTTAC CCAATATTTT TACTTTTATA ATGTAATGCG TTTTATCCAA 120

GTTATTTTTT AAAAATAAAT ATTGAATTnG GGGCTGnTTT CATGTCATTA AGAGATGAAG 180

CATTGGAAAT GCACAAACGT AATCAAGGTA AATTAGAAGT TAAACCAAAT GTAAAAGTTA 240

CTAATAAAGA GGAATTAAGT TTAGCATACT CACCTGGCGT TGCTGAACCG TGTAAAGATA 300

TTTATGAAGA TAAAAGAAAA GTATATGATT ACACAATTAA AGGAAATACA GTTGCAGTTA 360

TTACTGATGG AACAGCGGTA TTAGGTTTAG GTAACATTGG ACCTGAAGCA AGTATTCCTG 420

TAATGGAAGG TAAAGCAGTA TTATTCAAAA GCTTCGCTGG TATCAATGGG GTGCCTATTG 480



CGTTAAATAC AACTGATACC GAAGAAaTCA TTAAAACAGT TAAGTTGTTA GAACCTAATT 540 

ATGGTGGTAT TAATTTAGAG GATATTTCGG CACCACGTtG TTTTGAAATT GAAGAACGAT 600 


TGAAAAAAGA AACTAATATT CCGGTATTCC ATGACGATCA ACATGGTACA GCAATTGTAA 660 

CATTGGCAGG TTTGGTAAAT GCATTGAGAG TTGTTAACAA AGATATTGCT AAAATAAAAG 720 

TTGTACTAAA TGGTGCTGGT GCAGCAGGAA TAGCCATTGT TAAATTACTA TACGCGTATG 780

GTGTAAGAAA TATGGTTATG TGTGACTCAA GAGGCGCAAT TTTTGAAGGA CGTTCATATG 840 

GTATGAATCC TACGAAAGAT GTTGTAGCAA AATGGACAAA TAAAGATAAG ATTGAAGGGT 900 

CTTTAGAAGA AGTCGTAAAA GACGCAGATG TATTTATCGG GGTTTCTGTA GCTAATGCGC 960

TGTCACAAGA TATGGTTAAG AGTATGGCAG ATAATCCAAT TATATTTGCA ATGGCTAATC 1020 

CAAATCCTGA AATAATACCT GATGATGCCA AAGCGGCAGG TGCACGAGTT GTTGGTACAG 1080 


GACGTTCAGA CTATCCTAAC CAAATTAATA ATGTATTAGC TTTCCCTGGT ATTTTTAGAG 1140 

GTGCATTAGA GGTTGAAGCT ACACATATAA ATGAAGAAAT GAAAAAGGCA GCTGTAGAAG 1200 

CGATTGCTGA TTTAATCGAT AGTTCTGAAT TAAATGAAGA CTACTGTATC CCAGGACCGT 1260 

TTGATAAACG TGTAGCGCCA TCAGTTGCTC GTAATGTTGC TAAAGCGGCA ATGGAATCTG 1320

GAGTAGCTAG GATTGAAGTT GATCCGCAAG ATGTGTATGA TAAAACAATG AAACTTACAG 1380
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ATTAAAATGA TGAAAATGAA AGTTTATGAT AAACATTCAA CAGTCAAACG AATATAAATC 1500 

AAATAAATTT AAACCCGTTT TTAACTGGTC AAG7TCAGTT TAAGGCTCTA AATGGTTAGA 1560 

ACAGAGGTTA TTTGGAGGTT TTCCTATGTT TAAAGATTTT TTTAATCGAA CAAAGAAAAA 1620 

GAAATATCTT ACAGTACAAG ACTCTAAAAA TAATGATGTG CCTGCAGGTA TTATGACTAA 1680

GTGTCCAAAG TGTAAGAAAA TTATGTACAC AAAAGAATTA GCTGAAAATT TAAATGTGTG 1740

CTTTAATTGT GATCATCATA TTGCTTTAAC TGCGTATAAA CGTATAGAAG CAATTTCTGA 1800 

TGAAGGATCA TTTACAGAAT TCGATAAGGG AATGACCTCT GCGAATCCAT TAGATTTTCC 1860 

aAGTTATTTA GAAAAAATTG AAAAGGACCA ACAAAAGACA GGTCTTAAAG AAGCAGTTGT 1920

GACTGGTACA GCACAACTAG ATGGTATGAA ATTTGGCGTT GCTGTCATGG ATTCACGTTT 1980 

TAGAATGGGA AGTATGGGAT CGGTTATCGG TGAAAAGATA TGTCGCATCA TTGATTACTG 2040 

CACTGAGAAC CGTTTACCAT TTATTCTTTT CTCTGCAAGT GGTGGTGCAC GTATGCAAGA 2100

AGGTATTATT TCCTTGATGC AAATGGGTAA AACCAGTGTA TCTTTAAAAC GTCATTCTGA 2160 

CGCTGGACTA TTATATATAT CATATTTAAC ACATCCAACT ACTGGTGGTG TATCTGCAAG 2220 

TTTTGCATCA GTTGGTGATA TAAATTTAAG TGAGCCAAAA GCGTTGATAG GTTTTGCAGG 2280

TCGTCGAGTT ATTGAACAGA CAATAAACGA AAAATTGCCA GATGATTTCC AAACTGCAGA 2340 

ATTTTTATTA GAGCATGGAC AATTGGATAA AGTTGTACAT CGTAATGATA TGCGTCAAAC 2400

ATTGTCTGAA ATTCTAAAAA TCCATCAAGA GGTGACTAAA TAATGTTAGA TTTTGAAAAA 2460 

CCACTTTTTG AAATTCGAAA TAAAATTGAA TCTTTAAAAG AATCTCAAGA TAAAAATGAT 2520 

GTGGATTTAC AAGAAGAAAT TGACATGCTT GAAGCGTcAT TGGAACGAGA AACTAAAAAA 2580 

ATATATACAA ATCTAAAACC ATGGGATCGT GTGCAAATTG CGCGTTTGCA AGAAAGACCT 2640 

ACGACCCTAG ATTaTATTCC ATATATCTTT GATTCGTTTA TGGAACTACA TGGTGATCGT 2700 

AATTTTAGAG ATGATCCAGC AATGATTGGT GGTATTGGCT TTTTAAATGG TCGTGCTGTT 2760

ACAGTTaTTG GACAACAACG TGGAAAAGAT ACAAAAGATA ATATTTATCG AAATTTTGGT 2820 

ATGGCGCATC CAGAAGGTTA TCGAAAAGCA TTACGTTTAA TGAAACAAGC TGAAAAATTC 2880 


AATCGTCCTA TCTTTACATT TATAGATACA AAAGGTGCAT ATCCTGGTAA AGCTGCTGAA 2940 

GAACGTGGAC AAAGTGAATC TATCGCAACA AATTTGATTG AGATGGCTTC ATTAAAAGTA 3000 

CCAGTTATTG CGATTGTCAT TGGTGAAGGT GGCAGTGGAG GTGCTCTAGG TATTGGTATT 3060 


GCCAATAAAG TATTGATGTT AGAGAATAGT ACTTACTCTG TTATATCTCC TGAAGGTGCA 3120 

GCGGCATTAT TATGGAAAGA CAGTAATTTG GCTAAAATTG CAGCTGAAAC AATGAAAATT 3180 
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GGTGCACATA AAGATATTGA ACAGCAAGCT 
TTAGCTATTA AATCAGCGTT TGTTGCACAG 3300
TTAGATTCAC TTGAGTCATT ATCACGTGAT GAAATTGCTA ATGATCGCTT TGAAAAATTC 3360
AGAAATATCG GTTCTTATAT AGAATAATCA ACTTGAGCAT TTTTATGTTA AATCGATACT 3420
GGGTTTTACC ATAAATTGAA GTACATTAAA ACAATAATTT AATATTTAGA TACTGAATTT 3480
TAACTAAGAT TAGTAGTCAA AATTGTGGCT ACTAATCTTT TTTTAATTAA GTTAAAATAA 3540
AATTCAATAT TTAAAACGTT TACATCAATT CAATACATTA GTTTTGATGG AATGACATAT 3600
CAATTTGTGG TAATTTAGAG TTAAAGATAA ATCAGTTATA GAAAGGTATG TCGTCATGAA 3660
GAAAATTGCA GTTTTAACTA GTGGTGGAGA TTCACCTGGA ATGAATGCTG CCGTAAGAGC 3720
AGTTGTTCGT ACAGCAATTT ACAATGAAAT TGAAGTTTAT GGTGTGTATC ATGGTTACCA 3780
AGGATTGTTA AATGATGATA TTCATAAACT TGAATTAGGA TCAGTTGGGG ATACGATTCA 3840
GCGTGGAGGT ACATTCTTGT ATTCAGCAAG ATGTCCAGAG TTTAAGGAGC AAGAAGTACG 3900
TAAAGTTGCA ATCGAAAACT TACGTAAAAG AGGGATTGAG GGCCTTGTAG TTATTGGTGG 3960
TGACGGTAGT TATCGCGGTG CACAACGCAT CAGTGAGGAA TGTAAAGAAA TTCAAACTAT 4020
CGGTATTCCT GGTACGATTG ACAATGATAT CAATGGTACT GATTTTACAA TTGGATTTGA 4080
CACAGCATTA AATACGATTA TTGGCTTAGT CGACAAAATT AG^ATACTG* CGTCAAGTCA 4140
CGCACGAACA TTTATCATTG AAGCAATGGG CCGTGATTGT GGAGATCTAG CATTATGGGC 4200
TGGATTATCA GTTGGTGCTG AGACAATTGT AGTTCCAGAA GTGAAAACAG ATATTAAAGA 4260
AATAGCTGAT AAAATTGAAC AAGGTATTAA ACGTGGTAAG AAACACTCAA TCGTTCTTGT 4320
AGCAGAAGGT TGTATGACTG CGCAAGATTG TCAAAAAGAA TTATCACAAT ACATCAATGT 4380
TGATAATAGA GTGTCTGTGT TAGGTCACGT TCAACGTGGT GGTAGCCCAA CAGGTGCGGA 4440
TAGAGTTTTA GCATCACGTT TAGGTGGATA TGCGGTAGAC TTATTAATGC aAGGTGAAAC 4500
AGCTAAGGGT GTTGGAATTA AGAACAATAA AATTGTAGCA ACATCTTTTG ATGAAATTTT 4560
TGATGGTAAA GATCATAAAT TTGATTATAG TCTATATGAA CTTGCTAACA AGTTATCTAT 4620
ATAAGATTTC AGGAGGAATT ATAAAATGAG AAAAACTAAA ATTGTATGTA CAATTGGACC 4680
AGCTTCAGAA TCAGAAGAAA TGATTGAGAA ATTAATCAAT GCTGGTATGA ACGTTGCACG 4740
ATTAAACTTT TCACATGGTA GTCATGAAGA GCATAAAGGT AGAATTGATA CAATTCGTAA 4800
AGTAGCTAAA AGATTAGACA AAATTGtAGC AATTTTATTA GATACAAAAG GTCCAGAAAT 4860
TCGTACGCAT AATATGAAAG ACGGTATCAT TGAACTTGAA CGTGGCAACG AAGTTATTGT 4920
TAGCATGAAT GAAGTTGAAG GAACACCTGA AAAGTTCTCA GTAACATATG aAAACTTAAT 4980
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TAAAGATATT GACCATGCTA AAAAAGAAGT TAAATGTGAT ATTTTAAACT CTGGTGAGCT 5100 

TAAAAACAAA AAAGGTGTTA ACTTACCTGG CGTAAGAGTA AGTTTACCTG GTATTACAGA 5160 


AAAAGATGCT GAAGATATCC GTTTCGGTAT TAAAGAAAAT GTTGACTTCA TTGCAGCAAG 5220 

TTTCGTACGT CGTCCTAGTG ATGTTTTAGA AATTCGTGAA ATTTTAGAAG AACAAAAAGC 5280 

TAACATTTCA GTATTCCCTA AAATTGAAAA CCAAGAAGGT ATTGATAATA TTGCGGAAAT 5340

TCTTGAAGTG TCTGATGGTT TAATGGTTGC ACGTGGTGAC ATGGGTGTTG AAATTCCACC 5400 

TGAAAAAGTA CCAATGGTTC AAAAAGATTT AATCAGACAA TGTAACAAAT TAGGTAAACC 5460

AGTTATTACA GCTACACAAA TGTTAGATTC TATGCAACGT AACCCACGTG CTACACGTGC 5520

AGAAGCTAGT GACGTTGCCA ACGCAATCTA TGATGGTACA GATGCAGTAA TGTTATCTGG 5580 

TGAAACTGCT GCTGGTTTAT ATCCTGAAGA AGCTGTTAAA ACAATGAGAA ATATTGCTGT 5640

ATCAGCTGAA GCAGCCCAAG ATTACAAAAA GTTATTGTCA GATCGTACTA AATTAGTTGA 5700

AACTTCATTA GTGAATGCTA TCGGTATTTC GGTTGCACAT ACAGCTTTAA ACTTAAATGT 5760 

TAAAGCAATT GTAGCTGCTA CTGAAAGTGG TTCAACGGCA CGTACTATCT CCAAATATCG 5820 

TCCACATTCA GACATTATTG CGGTGACTCC AAGTGAAGAA ACTGCACGTC AATGTTCAAT 5880

TGTTTGGGGA GTTCAACCTG TAGTTAAAAA AGGACGTAAG AGTACAGATG CATTGTTAAA 5940 

CAATGCAGTT GCAACAGCTG TTGAAACTGG TAGAGTATCT AATGGTGATT TAATCATTAT 6000 


TACTGCTGGT GTACCAACTG GTGAAACTGG AACTACTAAT ATGATGAAAA TCCACCTAGT 6060 

TGGTGACGAA ATTGCTAATG GTCAAGGTAT TGGACGTGGA TCAGTTGTTG GTACTACGTT 6120 

AGTTGCTGAA ACTGTTAAAG ATTTAGAAGG TAAAGATTTA TCTGACAAAG TTATCGTTAC 6180 


TAACTCAATC GATGAAACGT TTGTACCTTA TGTAGAAAAA GCTTTAGGCT TAATTACAGA 6240 

AGAAAATGGT ATTACATCAC CAAGTGCAAT TGTTGGTTTA GAAAAAGGTA TTCCAACAGT 6300

TGTAGGTGTA GAAAAAGCTG TTAAAAACAT AAGCAATAAC ATGTTAGTTA CGATTGATGC 6360

TGCTCAAGGT AAAATCTTTG AAGGATATOC AAACGTACTA TAATTTATAA AAAAACGTCT 6420 

TTCCATTTAT CAACAATGGA AAGGCGTTTT TTGGTTcATC TGGTATTTTA TGACGTAATT 6480 

AATAGGTTAT TTGATAATGA TAGTGTATGA ATGGCAATCT ATATAAATGT TTATATCTTT 6540

TATACATGTA CATTATCACC TTCAAACTTT CACTCATATT ACTTTGGAAA TTTATTATAA 6600

AATAGAAGTA TGGATGTATT TCTGAAATGA TACATTATTA AATAGATGAG AAAGTAAAAG 6660 


TTTTGAGCCA AGTACGCAAT TTAATATTAT AAGTTGCATA TAAAACAGGA TGGGACATAA 6720 

ATCCCTAAAA AAACAGCAGT AAGATAATTT TCAATTAGAA AATATCTTAC TGCTGTTCTC 6780 

tCTTCGACTG GCACTGCTCC CTCAGGAGTC TCGCCATTAA TACTACGTAT TAACATGTAA 6900
TTTTACTTTT ACATACTTTA AAAAAATAAA ACACTTTGCC CAACTTGCAC ATAAATGTAA 6960
AATTCAATAA AATGAATTTT CTGTGTTGGG TCCCTTCGTA TAATTTAATA AATACCACTA 7020
AACTAAATTA ACGAGGTGCC TTATGTATAA AATTTATAAC ATGACCCAAC TTACACTACC 7080
AATAGAAACC TCTGTTAGAA TTCCTCAAAA TGATATTTCG CGATATGTTA ATGAAATTGT 7140
TGAAACGATA CCTGATAGCG AATTCGATGA ATTCAGACAT CATCGTGGCG CAACATCCTA 7200
TCATCCAAAA ATGATGTTAA AAATCATCTT ATATGCATAT ACTCAATCTG TATTTTCTGG 7260
TCGTAGAATA GArAAATTAC TTCATGACAG TATTCGAATG ATGTGGTTAG CTCAAGATCA 7320
AACACCTTCT TATAAAACTA TTAATCGTTT TAGAGTGAAT CCTAATACTG ATGCGTTAAT 7380
TGAATCTTTA TTTATTCAGT TCCATAGTCA ATGTTTAAAG CAAAATCTTA TTGATAATAA 7440
TTCAATTTTT ATTGATGGTA CAAAAGTAGA AGCTAATGCC AATAGATATA CATTTGTGTG 7500
GAAGAAAAGT ATTCAAAATC ACGAATCGAA ATTGAACGAA AATTCAAAAA CATTATATCG 7560
TGACTTAGTT GAAGAAAAAA TAATACCAGA GATAAAAGAA GATGGAGATA GCGATTTAAC 7620
AATAGAAGAA ATAGATTTAA TTGGTAGTCA TTTAGATAAA GAAATCGAAG ATTTAAATCA 7680
TTCTATTGAG AACGAAGATT GTGCTCAAAT TAGAAAACAG ACCCGTAAAA AAATAACTGA 7740
GATTAAGAAG TTCAAAAAGA AATTTGATGA TTATTCCGAA AGAAAAAATA AATATGAAGA 7800
ACAAAAATCG ATTCTTAAAG ATAGAAATAG TTTTTCTAAA ACTGATCTGA TCATGATGCA 7860
ACTTTTATGA GAATGAAGGA AGACCATATG AAAAATGGCC AACTTAAGCC AGGATACAAT 7920
TTACAAATAG CGACAAATTC TCAAAAATGT TTTATCCTAT GACCTATTTC AA 7972



(2) INFORMATION FOR SEQ ID NO: 313: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3175 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 313: 
ATTTTTTAGT TAATTGTCTT TCTTAAAATA ATTTTAGCTT TCATTAAATT AAACAATTTT 60 
ACAAGCTTGG AACACCAATC AAAATCCTAA GTTCTAAAAT GCAATATTAG TAGTCGTTGA 120 


CTGAATGAAC ATATGCTTAT AATATTTTTT TGCAATGCTA GTCAAGTTGA TTTATGCTCA 180 
CAAGGATATG CGATTTATAT TTTCTTACAA CAATGAAAAT GCCTGATACA ATGCGATCCT 240 




AATTAAATTA TTGTTGTTTT AACCATTGTA 
CTTTTGGACC ATTATATGAA GCTTTATTAG 
TGTGAGGTGT TTGACCACTA AAATGTGTTA 
GTCCATTCGT ATGGTTAACT AAATCAGCGA 
TTGCAGTAAT CGGTAATTCA AATACCATTT 
CGTTTAATAT CACTTTACTA TTCGATTCAA 
CAATTGCTGC ACCCGTTAAT GTAGCAAAGT 
CATAAAACAC AGCATCTGCA AGGACTAATC 
CAGTTTCACC ACTTAATGCT GTAAATACAT 
TATTTTCAGC ACACGCAAGC ACTCCGACAA 
CTTCAATGAT ACCAACGACA TTCGCAGCGC 
CATTCTTCGT TTTAATACTA TAACCACCAG 
CAATTGGTGC TTCATCTTTG TCTTTGCCAT 
GCTTACTACC TTTACCTACT GCTTGTAAAA 
AATCTTTAAC ATCTACTTTG ACCTTTGTAT 
ATGTTTGTGG TGtTAATACA TTCGGTGGCA 
aTTGGcCAAT ACTGATACCC TCATGAATAA 
GGkTTAAATT CGTCTTAAAC GGtGCCTTCT 
kTGcACGCTC ACTTTGGAAT ACCGCATGCC 
TTTGgAnATA AATGaATCCa TAAGTAAATA 
TTGGAAAAGA TGTCCCCaTA TTTTCAACAT 
AAACCAACTG TAATTAAACG ATATGTTTGA 
TTCCCAACTT TACTACCAAT AATATGTTGA 
TCGATATGAT TAAAACTAAT GCGCTCTAAC 
AATGTATTTA TTTCGTTGCT TAGTGTGTTA 
AGATTTGATA TGTTACTCAC TTCAAATTGT 
GACTTTATAA ATGAATAATT TAATTTTATG 
AATTTACCTT TTTTGAATGC TAAACCGATA 
ATCACTTTTT TCATGAATGC TGCTTTTTTA 



CTATCGTCGG AATCATAAAT CCTGTTGGCC 360 
TCGTTGCTGG ACCTGCAATA TCGAAATGAA 420 
CAAAACTTGC CGCAAATAGC GCTTTACCTT 480 
TATCACTGTG TTTAATACTT GCACGTTCGG 540 
CATCGACTTC AGAACTTATT TGTAATATAT 600 
ATGCAGCAGC TTTATCATCG CCTAGTGCAa 660 
CCATAATCAC ACTAGGCTGA TATTGATTTG 720 
TACCTTCAGC GTCTGTATTC ATTACTTCTA 780 
CATCTGGcTT CATTGATGCT TCATTTATCA 840 
TATTTACAGG CAGTTGTAAA CGAcTAGCCG 900 
CACACATGTC AAACTTCATT GTAGCCATGC 960 
AATCATACGT TATACCTTTA CCAACTAAGG 1020 
TATATGTGAT GGTTACTAAt CTCGGTTTAT 1080 
GTCCGAATCC TTCAGAAACT AAAGTGTCAT 1140 
TTTTAAAATG ATTAACAATA TCTTCTGcAA 1200 
TATTACTAAA GTCTCyTGcC AAATTAATGG 1260 
AATCTAATTC AATTAATGAT TCACTAATAA 1320 
kACTTgrTTT ATAATGATCG aATyCATATG 1380 
ATTAATACAT CTGgATAACT GgATCcATAT 1440 
CGTATCTTCa ATGTGTTCTG ACTTTATGTA 1500 
ATCTTGATAA tACGTGTCTT TAAGTTTCCT 1560 
TCTTGTACAT CAAATGCAGT TGTATAAATC 1620 
TGTTTTAGTC TTTCAAGTGA TTCTGTAATA 1680 
TGATTTAAAT GTTCTGGTAT ACCAATAATC 1740 
TTTAATTTAA AATTCATAAT GTACCTCCTT 1800 
ACAACAATAA AGCCCTcAGT GACACTGAAG 1860 
TGTTTAGCTG AAACTTTATT TAAAGCCTAG 1920 
CCACCGATTT TGAATACCGC ACGTGTATCA 1980 
CCAGCGATAG GTTTACCAAA TACCATACCT 2040 
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GTTGATTCAC CGTTTAAGAT GCGTTTAATG TTTTTAGCAA CACTTTCACC TTGTTGCATT 2160 

GCAATTTGTG CTGTAGTTGG TAATGGACGT TCTTCTCCAG CTGGGATAAA CGCTGAACAG 2220 

TCACCAATAA CAAAAATGTT GTCGTAACCA TTGATTGTTA AATCTTGCTT TGTAACGATA 2280 

CGTCCACGTT TAACGCCTTC AAATGATTCT TCCATTAATT TACTACCACG TACACCAGCT 2340 

GCCCATACTG AAGTACCTGC ATTTAATTGT TGTTTTTCAC CATCTACTTC AACTACAAAA 2400 

CCTTTTTCGT TACAAGCAAC GATTGGTGTA GCAATTTTAA ATTCAACACC GCGGTCTTCT 2460 

AAGTAGCTAA CTGCGTGGTT AACTAATTCT TCTGAGAACA TTGGTAACAT TTTAGGTGCT 2520 

GCTTCAACAC AAGTGATTTT AACTTTATTT TGATCCACAC CATATTTGCT ACATAATTCA 2580 

GGAATTCTGT CTGTTAATTC ACCTAAGAAT TCAACACCAG TGAATCCAGC ACCACCAACT 2640 

AAGATAGATA AATCGTTATC ATCTTTTTCT TTTGATGCTG CATAGTTAGC AAATTTGTCT 2700 

TCGATATGAC GTGATAATTC ACGTGCTGTG ATAACATTTT CAATTTGGAA AGCATGATCT 2760 

TTCATACCTT CGATGCCGAA TGTTTCACTA ACGAAACCTA ATGCTACTAC TAAAATATCA 2820 

AAGTCATAAA TACCTTGATT TGTTTCTACC TTTTTAGCAT CACGGTCAAT TTTTGTTACT 2880 

TCTGCtTGAA CAAAGTTCAC TTTGTCTTTC TTCAAGACAC TTTCCACAGG ATATAATACA 2940 

TCTTCATAGT TTAGTGTACC TGCTGATGCT TCATGTAACC ATGTTGCTTC ATAGTGATAT 3000 

TCATTTTTAT TAATAAGCGT AATTTCTGCT TCTTCTGTTG ATATCGCTTT TTGCAATTTA 3060 


GTTACAGTTT GTAAACCTGC ATAACCAGCA CCAAGTACAA GTACTTTTTT ACGATCTTGA 3120 

GCCATTTAAT TnCACCTAAG CnTTCATATT TTTTTAACCA AATGCTGATA ATTAC 3175 
(2) INFORMATION FOR SEQ ID NO: 314: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 702 base pairs 
(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 314: 

CATCATTATT AAAGATTTTC AATCAATACA gAATCACAAT ACGTACGCAT TGTGCACGAT 60 

AAAAATACAG ATGTGTATAT TAACTATGAA CTACAAGAGC AACTAACGAA CAAAGCTTAC 120 

ATTGGTGATC ATATTTATGT TGAAGGGATA TGGCTCGAAG TACAAGCTGA TGGTTTAAAT 180 

GTATTGAGTC AGAATACAGT GGCATCGTCA TTAATTCGCT TAACACAAGA GATGCCACAT 240 

GCACAGGCAG ATGATTACAA TACGTACCAT CGTTCGCCAA GGATTATTCA CCGTGAACCG 300 
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TGGCGTTCCA TTATACCGCC ATTAGTAATG ATTGCTTTAA CTGTTGTCAT CTTTTTAGTG 420 

AGACCAATTG GTATTTATAT TTTAATGATG ATTGGTATGA GTACAGTAAC GATAGTATTT 480 

GGTATTACAA CGTATTTCTC TGAAAAGAAA AAGTATAACA AAGATGTTGA AAAACGAGAG 540 

AAAGATTACA AAGCTTATTT GGATAATAAA TCTAAAGAAA TTAATAAAGC GATTAAAGCA 600 

CAACGTTTTA GTTTGAATTA CCATTATCCA ACGGtTGCTG AAATTAAAGA TATCGTTGAA 660 


ACGAAAGCAC CAAGAATATA TGAnAAAACC ATCGGCATCA TC 702 

(2) INFORMATION FOR SEQ ID NO: 315: 

(i) SEQUENCE CHARACTERISTICS : 
(A) LENGTH: 4121 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 315: 
TATGTTCCGA CAACGAAACA AAGTGTAATT ACAAGAGCAA AGATAACTTT GAATGTTTGT 60 
AAACGTCCAT CTTTACCTTC AGTTAAATGC ATGAACATTA ATAATTGArG TCCTGCTTGG 120 
ACGAATGCAA AGCCAAAGAT AATTGTCAAC TTCGCGTGGA ATGTTAATGA CGTGTATAGT 180 
GTTACGTAAA CTGCTAAAAG CGTTAATACG ATAGATGCGA TAAATCCTAC AGTATGTTTC 240 

ATTATTGTAC TCATCCGCTA TACACCATCC CTATCATATA TACGGCAGTA AAGATGAAAA 300 
CCCAAACAAC ATCTAAGAAG TGCCAGTATA AACTTACTAT AAATAATTTT GGCGCATTAT 360 
ATTTGTCTAA TCCGCGTCGT TGGATTTGGA TTAATAAACA AATGGCCCAA ACGATACCTA 420 


GCGATACGTG ACAACCATGC GTTCCTAATA GGATAAAGAA ACTAGACCAG TAAGAACCAA 480 
TTGTTGGGTT AACGCCTTCT GATGCATAGT GTGCGAATTC ATAAATTTCG AATCCAACAA 540 

AGACTAAACC TAAAAGTAAC GTAATGATCA TCCAAAACAT CATTAACTTT TGTTTTTCTT 600 

GGCGCATGTA GTAAATAGCA ATACCACATG TGTAAGAACT GAATAATAAT GCAAACGTCA 660 
TTATTAAAAC AAGAGGCAAT TCAAATAACT CAGTAGTCAT TTTACCTGCA TAATCGCCAC 720 

CATGTTGCAA AGTTAATAGT GTCGCAAATA GGGTACCGAA TAACGCAAAT TCGGCTGTAA 780 
TGAAAATCCA AAAGCCAAGC TTATTTAATT CGCCTTCATG TGTGCGTGAA TCAATAGTGT 840 
TTGTATCATG ACTCATGACT TACAGCCTCC CTTTCTTTAA TACGCGCTTC TCTTAATCTA 900 


GCTTCAGTTT CTGCAACTTC AGCAGCAGGG ATATGATATC CGTGATCGAT TTGGAAACTG 960 
CGATAAATCA TAGTACCAAA AATACCGAAT AAACAAATTA ATGCTGGAAT TACAGTTTCG 1020 
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TTGTTTGGCA TATGAATGTC TTTGTAATTA TGGTTGTCTA AGTAATGACG ACCATGTTCT 1140 

TTCATATCAA CAAATGTGTC GTAGTCATTC CAATCTGGTG TAATGGCAAA GTTGTATTTA 1200 


GGTGGAATTG CTGATGCAGT AGTCCACTCT AGAGTACGAC CAAGGCCATC CCAGTTATCT 1260 

CCAGTTGCTT CACGTGGAGA TTTGAAGTGA CTGTATACGA TACTAACAAC AAGGAATAAG 1320 

AATCCGATTG CCATTAATAA TGCACCGATA GTTGAAATTA AGTTTAATAA GAACCAACCA 1380 

TCTGATGGCA TATAAGTGTA TAAACGACGT GGCATACCAT CTAATCCAAG AATGAATTGT 1440 

GGTAAGAAAC AAACGTTAAA TCCGATCATG AAGAACCAGA AGCACCATTT GTTTAATGTT 1500 

TCGTTTAATT TGTAACCCAT CATCTTTGGA TACCAGAAGA TTAAACCAGC TAAGCAGGCA 1560 

AATACAACAC CAGTAACCAA TGTATAGTGG AAGTGAGCTA CTAAGAAGTA CGTATTGTGA 1620 

TATTGATAGT CAGCTGATGC CATTGCTAAC ATTACACCCG TAACACCACC TAAAAGGAAG 1680 

TTAGGGATAA ATGCTAATGA GAATAGCATT GGTGACTCAA ATGTAATACG TCCTTTATAT 1740 

AATGTTAATA ACCAGTTAAA CAATTTCACA CCAGTTGGAA TACCAATCAG CATTGTTGAA 1800 

ATTGAGAAGA ATGAGTTGAT TAACGCACCA TTACCCATTG TGAAGAAATG GTGAACCCAA 1860 

ACTAAGAAAC TAAGGAACGC GATACCGGCA GTTGCCCATA CCATACTTTG ATGTCCGAAT 1920 

AAACGCTTAC GAGCGAATGT CGGGATAATT TCTGAGTAAA TACCAAATGC TGGAAGGATA 1980 

ACGATATAAA CTTCAGGGTG CCCCCATACC CAGAAGAAGT TAGCCCAAAG CATTGGCATA 2040 


CCGCCATGTG CAACTGTGAA GAATGCTGTG TCAAATATTC TATCAGTTGT CATTAATGCT 2100 

AACGCTACTG TTAAAGGAGG GAAAGCAAGA ATAACAATTA ATGTAGTAAT AAATGTTGTT 2160 

ACTGTAAACA TTGGCATTTG CATAAACTTC ATAGTTGGTG TTTTACATCT TAAAATTGTT 2220 

ACAAAGAAGT TGATACCTGT AGCTAAGGTA CCAAGCCCTG AAATTTGTAT AGCTATTAAG 2280 

TAATAGTTAA CACCCGGACC AGGACTGAAT TCACCTGCTA GTGGCGCATA GTTTGTCCAA 2340 

CCAGCTGCTG GTGAACCACC AATAATAAAT GACAGGTTGA ATAAAATCAT ACCTGCAAAG 2400 

AATAGCCAGA AACTTACGTT GTTTAATACT GGGAATGCAA CATCACGTGC TCCAATTTGT 2460 

AATGGAACAA CGATATTCCA TAAACCAAAG ATAAATGGCA TTGCCATGAA GATAATCATG 2520 

ATTACACCAT GTGTACTAAA AATTTCGTTA TAGTGGTTAG ATTCTAAAAA TTTGTTATCA 2580 

GGTACTGTTA ATTGCGCACG AATAAGTAAC GCATCAATAC CACCACGGAC GAACATTAAT 2640 

ACGGCACAGA TTAAATACAT AATACCGATT TTCTTATGGT CTACAGATGT GAACCATTCT 2700 


TTGTAAAGAT ATTTCCATAA TTTAAAGTAA GTAATTACTG CGATTAAACC AATAACTAAG 2760 

AATGGGGCAC CAATTTGTGC CATTGTAATC ATcCAGTTAC CTTTAACTAG TAATTGATCC 2820 
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75 



TTGAAATTTT CTTCATTTCT TTCGCATTTT TCGATTCATC TTTCTTGAAC TCATTGTTAT 2940 

ATGGTTCGTC ATTTCCAAGA ATCATCAACT TCATACCATG TCGTTTATAG TTCGCATTTG 3000 

TAATTTGAGC TTTACGAGCA GGTATTAATG GTTTGTCTGA TACATCTTTA AACATATTTT 3060 

CTTCACTAGT GAAGTTTGGA TCTTTCAATT CGAAATTGAA ACGTTTATAT GCATAGAAGA 3120 

TGTATTCTGG ATCGGCTGCT GGATCAACAA ACGCCATATG TGTACCATTA AATTCTAAAG 3180 

CTTTATTAGG TGTGCTTGGT AATAATTGTT TATCAAATGT ATCTTGATCT AACGTTTTCT 3240 

TACCTTTAAC TTCTTTCACC CATTTGTCGT AGTCTTTTTG ACTAACGGCA TTTACTTTAA 3300 

ATGTTTGACG TGAGAATCCT TCACCATTGA AGTTAGAGTT ACGACCTCTG AACGTACCAG 3360 

TTTGAGATGC TTCTAACGTC CAATTCATTG TCATGCCAGT CATGGCATAT TTTTGACCAC 3420 

CTAATTGTGG AATCCAGAAA CTTGTCATTG TATCCATAGC TTGAAGCTTA AATACAACAG 3480 

GACGATCTTT AGGGATTGTT AATGTATTAA CAGTCTCTAT ATGTTCATCT GGATAAGCAA 3540 

AGAACCATTT GTATCCTGCA CTTACTGCAT ATACAACCAT TGGATCTTTC TCACTCTTCG 3600 

GTGGTTTTTC GTAATCGTAT AAAGTTTTAA CTGTAGGAAT AGctAAAGCA GCAACGATTA 3660 

TGATAGGTAT TACAAACCAT ATTGTTTCAA TGATGGCATT ATGGTGCATC TTACCAGATT 3720 

CGGCATTCTT ATTATAACTA TACTTGTAAA TAAAAATGGC GAACATGCCA AGTACAACGA 3780 

AACAAATAAC AAGCATGAAG ACGATTGAAT AAAGAATCAA GAACTTCTGA CTACTTGCTA 3840 

CTGGCCCTTT TGCGTTGAAA ATTTCTATAT TTGAACAACC ACTAAGTAAA ATTAGTGTGC 3900 

CAAATAATAG AAGCAAAGAC TTAAATTTTG ACACTl-ITTT GACCTCCTAA TACTACAAAT 3960 

GTAGGGCTTA ACATTAATTT TAAGTTATTA CACAATATTT ACAAGGGCTT ATGGGAAAAA 4020 

AATTAATAAA ATTGTATCAA AAATGTTGAT AAATCAAGGT GTGACGTGGG TTCACACATT 4080 

TGTTAAAATT ATGTGTACAT TTTGTGACTA ATAGCGTTTT T 4121 



(2) INFORMATION FOR SEQ ID NO: 316: 


(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 9310 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 316: 
CGAGTGAGTA CAmACATATT TTTATTTGCA AGGGGTAAAT GGCATATAAC TATCTTTTTT 60 

ATGTAAGCTG GTATAAAATT TATACTAATA GGAGGGATAG TATGAATATA GTAGGGCATC 120 
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TTGGATTACG ATTAGTTGAA AAGTCGGTTA ATCAAGACAA TCCTTCAATG TATCATTTGT 240 

TTTATGGGGA CGAAgTAGGT ACAGCCGGAA CAATTTTAAG CTTTTTTGAA ATTCCCAATG 300 

CGGGTCATAA GCAGCCAGGT ACTGAAACGA TTTATCGATT TTCATTATTA GTACCAAATC 360 

AAGCGGCACT TCATTATTTT GAAAAACGTC TTGAGAATAA TGGTATTAAG TCTGAACGTT 420 

TGTACTATCT TGGACAAGAA GGTGTTGTCT TTAAAGATGA AGACGACTTA GAAATCATAT 480 

TGCTTGTTAA TGATAGTTTT GAAGTACCAC ATCAATGGCA ACATAACGCT TATAGTGAAA 540 

TACCTCAAGC ATATCAAATT TTAGGAATAG GGCCAGTCGA ATTAAGAGTT AGAAATGCAG 600 

CGCGTACGGT AGAATTTTTG GAAAATGTCT TAGGTTATCG CAAAAGAGAT AATAAATCAT 660 


TCGATGTGCT GACATTAGCA CCACAAGGTT TATATTCGGA TTTTGTAGTT ATTGAGCAAC 720 

AGGGACAACG TGAAAGACCT GGACGAGGTT ATATCCATCA TATTGCAGTT AATACACCAC 780 

AAATGAGTGA CTTAGATGCA ATTTACAAGA AATTACAACA ACAACCACAA AGTAATTCAG 840 


GTATAATTGA TCGCTATTTC TTTAAATCAT TATACTATCG CCATAATTCA ATTATGTATG 900 

AATTTGCGAC TGAAGCGCCT GGATTTACTA TTGATACACC TGTTGAACAA TTAGGAAGTC 960 

AATTGAACTT GCCTGACTTT TTAGAAGCAG AACGTGAACA AATTGAAAGT AAGTTACACG 1020 

AAATATAAAG GAGAATGTTT AATGGCCAAA TTAGAAATGA ATAAAAATAC GCCTC7TGAG 1080 

TTTGGTTTGT ATTCCTTAGG TGATCATTTA TTGAATCCAT TGAAAGGTGA AAAAGTTAGT 1140 

TATGAGCAAC GTATTAATGA AATTATTGAA GCAAGTAAAT TAGCAGATGA AGCAGGTATT 1200 

GATGTTTTTG CAGTTGGTGA AAGTCATCAG GAGCATTTTA CAACACAGGC ACATACGGTT 1260 

GTGTTAGGTG CAATTGCCCA AGCGACAAAG CATATTAAAG TTTCAAGTTC TTCAACGATT 1320 

ATTAGTGCAC AGATCCTGTA AGAGTATTTG AAGACTTCGC GACATTAGAT TTGATTTCTC 1380 

ATGGTAGAGC CGAAATTGTA GCTGGCAGAG CATCAAGAAC AGGTATTTTT GACTTGTTTG 1440 

GCTATGATTT AAAAGACTAT GATGAATTGT TTGAAGAAAA ATTAGGTTTA CTTTTAGAGT 1500 


TAAATAAAAC TGAGCGTATT ACTTGGTCTG GAAAATATCG TCCAGAACTT AGAAATATGA 1560 

AAATATTCCC AAGACCAATC GATAATATAT TGCCAATATG GCGTGCTGTT GGTGGTCCAC 1620 

CTGCAAGTGC TATTAAAGCG GGAAAACAAG GTGTGCCAAT GATGATTACA ACCCTTGGTG 1680 


GCCCAGCAAT GAACTTTAAA GGTTCTATAG ATGCTTATCG TCAAGCGGCA ACTGAAGCAG 1740 

GTTTCGATgc TTCGCCTAAG TCTTTACCAG TAAGTACAGC GAGTCTGTTT TATACAGCTG 1800 

AAACAACTCA GGATGCTATG AGAGAATTTT ATCCACATTT GAATACAGGG ATGTCATTTA 1860 

TTCGTGGTGT TGGTTATCCG AAACAGCAAT TTGCTAATTC GTCAGATTAT CGAGAAGCGC 1920 
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5 






GTCATCAACG 


TTTTATGGCA 


CAGCTTGATT 




GPP & TTT^ A 21 
v»U\-Ai 1 IvjAA 


a a TPTT ATGA 


4. (J *» U 


AGAATATTGA 


GTTAATTGGC 


AACGACATTA 


1 A>»_\_VjVjV— vjrA 1 


TaaaaaPPaT 


TTATPAAAAT 
X irtlLnnnni 




AGGAGGGGCG 


TCATCATGAA 


TATTGTATTA 


T"T V i" 1 T *P aP-P'TT 
1 i.Ul ^-MOU X X 


ppapaGTAGG 


TTPTAAAACG 


2160 


AGAATTGCTA TGGATGATTT AAAAAATGAA 


prapaaGTpa 


TPaaTPaGPG 
X v— Mrl X VjrtVjvnj 


APATP.AAATA 




GAGTTGATGG ATTTACGAGA ACTTGAATTA 


P.AATTTAP.PP, 


TTGGAAAGAA 
X X Utfnnnunn 


T "pa tpt ag a t 




ACTACAGGAG 


ATGTATATAA 


ATTAACGACG 




APGPTPATPT 


GATTTTTATT 
vjr\x x x x ir\ l ± 




GGTTTTCCAA 


TTTTTCAAGC 


TTCCATCCCT 


pp 'pp ptttp a 


21 21 Zi Zi TGTGTT 


TPaTPTaPTT 
luniLlnLl X 




CCAGTCAATG 


CGTTTCGTGA 


CAAGGTAATA 


no a PTTPT & p 


PP. A P Zv PP H PP. 


TTPTaPTaa a 

11C XAolAAA 




CATTATTTAA 


TTCCTGAAAT 


GCATTTAAAA 




fTTl P IV TP Ti a 

Lil XAuAltjAA 


appapaTapp 

AbLALAiALb 




ATGCAAACGT 


ATGTATTTAT 


TGAAGAGAAA 


GATTTTTCAA 


Zi TPH a P Zi. Zl Zi T 
Al LAALAAA1 


TPTPaaTpaT 

lol LAAloAi 


*5 con 


GATGTTGTAT 


TTCGGTTAAA AGCGTTGGCA 


LAAl LLALAA 


fp pp A A PTP P 
IuUjAAL IuL 


paaapTapaa 

UinAblftLAA 


or* ft 


CAACAAGTGT 


TTGAAGAAGA 


AAACAACCAA 


IALuAL 1X11 


AAAVjIAIAAA 


a aTaapa ppp 


4. /UU 


TCGGCACACT 


AAATTTGTAA 


GTGTTTGAGC 


GTCTTTTCAT 


ATTAACTATA 


t apppa htp a 


27b0 


ACGACGATAA AGGCAAGTGA 


TGACAAGCAT 


All (jAviVj 1 AA 


1AA1 <aA X 1 Vj 1 


t-A 1 AAQjULjO 1 


T D t n 


TTAAGTGCGC 


GATTTTTAAG ATCTTTAAAT 


VjUAALnl 1 in 


apppTaaapp 

ALLv. X AAAoL. 


a zL"ppaTpppp 


O D D A 
Z DO U 


ATTAATAAGC 


AAATTGTTGA 


TACAGTATTT 


& & 2k aTa TTTfi 
AnnAlni X In 


ppa aTGPTGa 

uLnn X X un 


pppa ATapTT 

UVjV»AAxAol 1 


^ q a n 


ACATATGTAT 


TCACTAAGGC 


CATAATGACA 


AATP f AA TTA 


AAAAPTATGG 
nnnnu X x uu 


a aTPPTTaTT 

nniuLi ini x 


^ nnn 


CGACCCTTGC 


TAGATGATTC 


TGATGAACGG 


21 AAPGPATA A 


TT21 a A ATA AG 
1 innnni/viu 


TAPGATGGTT 
X nWwA X UU X X 


^n^n 


AATGGAATCA 


GTAAGAATAC 


TCTACCAAGT 


TTAPPAAPAa 


p' | ir ■ P Zi Zi ■ k J i " i ~ i" 
ulutnnl ill 


AAGTGPATPA 


-J X X u 


CTACCACCAA 


AGCCACCAGC 


TAAGACAAGG 


TfiTfi Pa A r i _ i~r 


P ATG AA G a PT 


aa PAP PAG AC 


5ian 

JXOU 


CAAGCGCCAT 


AAACATTTGT 


CGTCATTGAA 


AAGATAGC*GT 


AGATAGPTGT 


ATATATAAGT 




GAAAATATCG 


TACCAATCAA 


TGCGATGATA 


pppaTAPTA a 

U^uninL inn 


Tap ptgt atp 

X MurL, X \J X Ai V. 


r " f t " t " i ' P 7\ PPT 




GATTTGAATA 


TTGGAGCGAC 


TGCGGCAATA 


GC!AGr!AGr!A P 


papaaapgpp 


TP.TP.ppp. a p a 


■j-jca 


CCTAGTAATA 


ATGCGATGTT 


TTTGTCACCA 


TG P & A P A P. TT 


tgttg a p a a a 

x o x X ynUnrtn 


G AGP A TP a TT 


•j A 5 0 


ACAATACTGA 


AAATAACGAC 


ACCTACATCG 


21 r PCZCZC 1 *T A A T Zi 


g ttt a PT a P P 

V7X X InV-inCV. 


TTGAPPPaTa 
i IuALUjAIA 


lAnn 


ATATCGAATA TATTGAGTTT AAGTCCATAT AGGATGATTG CAAATCTTAA TAAATATTTA 3540 

GATGAAAACG TAATACCTGA GCTATATTGT TCAGGATATC CTCTAAAGTG ACGATATAGA 3600 

ATAGCGATTA ATATCGCGAT AGTTAATGCG CCAACCTTAT CTAGGATTGG CAATTTAGCT 3660 

GCTAAAAAGC TAAATAATGC GACTATAAAT GTTAGTGATA GCCCAATCAT AAAATGCTTA 3720 
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ATTTTTAAAT ATAAATTTGG AATGAATAAT AAAGTAGTGA TTAAATTAAG TTGTGTGATA 3840 

GGAAACTTGG ACATCAATCA AAGTAATAGG CACTACAACG CTTATTGGCG GGGCCCCAAC 3900 

AAAGAAGCTG ACGAAAAGTC agCTTGcAAT AATGTGCAAG TTGGGGATGG GCCCCAACAT 3960 

AGAGAAATTG GGTCCGTAAT TTCTACAGAC AATGCAAGTT GGCGGGGCCC CAACATAGAG 4020 

AATTTCGAAA AGAAATTCTA CAAGCAATGC AAGTTGGGGA AGGACAACAA ATTTAAGATA 4080 


CAATGCGTAA CATTAATATG TTATTATAAT GATAATTTAC AGAATTATAT GAAAAATGAA 4140 

TGAGGATGTG ATGGTATGTT TGGAATGAAA GTGAATGAAC AAATAACATT AAAAATTTTA 4200 

GAAGCTCATG ACACAGAAGC GCTTTTCAAT TTAGTCAATC GTTCAAGAAA TTCACTTAGG 4260 


GAATGGTTAC CTTGGGTAGA TGCAACTGAG CAACCATCAG ATACGCGTGC ATTTATTAAA 4320 

AGAGGACTTT TGCAATTTGC TGATGGTAAT GGATTTCAGT GTGGCATTTG GTATGAAGGA 4380 

ACGCTAGTTG GTGTCATCGG TTTACATGAA ATTAATCACA TGCACAGAAA AACTTCATTA 4440 

GGGTACTATT TAGATAAAGA ATTTGAGGGT CATGGGATTA TGACACAAGC AGTTGAGGCA 4500 

TTGATAAAGT ATTGTTTCGA AGAGCTTGAC TTAAACCGAA TTGAGATTAG TGCCGCAGTT 4560 

AATAATGAAA AAAGCCGGGC TATTCCTGAA AGGCTGGGAT TTACTAGAGA AGGTATGTTA 4620 

CGTGACAATG AATTACTAAA TGGTATTTAT TCATCGAGTT ACATCTATAG TTTATTAAAA 4680 

TCAGAATACG ACCAAAAATG ACAAATTAGA CTTACAAAAG AGTGATGACA TTTAAAATGG 4740 

CAGCGCTCTT TTATTTAATT TTTGAAAATA AAAGGTTGTT GACAGTATTA TTTTATAACA 4800 

ATATAATGAT TTTGATAATT ATTATCAACT AGATGATGTT TATGGGAGGA TGCTTTAAAA 4860 

CAGCCGTTTT AAGTGTAATG TATTATTTTA GCGTGTAGGG AATGCGAAAA TAATATTTAT 4920 

AAGAACACAT CTATGGGGAT AATAGAATTT CTATAATGAG GTGTCAAAAT GAAAAAGTTA 4980 

ACAACGCTAT TATTAGCATC AACGTTATTA ATTGCTGCAT GTGGGAACGA CGATAGTAAG 5040 

AAGGATGATT CAAAGACATC GAAAAAAGAT GATGGTGTTA AAGCAGAATT AAAACAAGCA 5100 

ACAAAAGCAT ATGATAAATA TACTGATGAA CAGTTAAATG AATTTTTAAA AGGTACAGAA 5160 

AAATTTGTTA AAGCGATTGA AAATAATGAT ATGGCCCAAG CAAAAGCGTT ATATCCAAAA 5220 

GTTCGTATGT ATTATGAACG CTCTGAACCA GTTGCAGAAG CATTTGGAGA TTTAGATCCT 5280 

AAAATTGATG CACGTCTTGC AGATATGAAA GAAGAGAAAA AGGAAAAAGA ATGGTCAGGA 5340 

TATCATAAGA TTGAAAAAGC ATTATACGAA GATAAGAAAA TTGATGATGT GACTAAAAAA 5400 

GATGCACAAC AATTATTGAA AGATGCAAAA GAATTGCATG CCAAAGCTGA TACATTAGAT 5460 

ATCACACCAA AATTAATGTT ACAAGGTTCT GTTGACCTAT TAAATGAAGT TGCAACTTCT 5520 






GTTGAAGGCG CACAAAAAAT TTATGACTTA TTTAAACCTA TTTTAGAGAA AAAAGATAAA 5640 

AAATTAAGTG ATGATATCCA AATGAACTTC GATAAAGTGA ATCAATTATT GGATAAATAT 5700 

AAAGATAACA ACGGCGGTTA TGAGTCATTT GAAAAAGTAT CGAAGAAAGA CCGTAAAGCA 5760 

TTTGCGGATG CTGTTAATGC ATTAGGAGAG CCACTAAGTA AAATGGCTGT GATTACTGAA 5820 

TGACAAATTA TGAACAAGTT AACGATAGTA CGCAATTTTC AAGACGTACA TTTTTGAAAA 5880 

TGTTAGGTAT TGGCGGTGCC GGTGTTGCAA TTGGCGCAAG TGGTGTTGGT AGCATGTGGT 5940 

CTTTCAAATC AATGTTCAAT ACACCAGAAG ATCCGGAAAA AGATGCGTAT GAATTTTATG 6000 

GTAAAGTGCA ACCAGGCATT ACCACACCCA CGCAAAAAAC ATGCAATTTC GTTGCGTTAG 6060 

ATTTGAAGTC AAAAGATAGA GATGCAATTA AGGCAATGTT TAAAAAGTGG ACGGTTATGG 6120 

CTGATCGTAT GATGGATGGT GATACAGTTG GCAAGCCGAG TAACAATCCT TTAATGCCAC 6180 

CAGTAGATAC CGGTGAATCG ATAGGATTAG GTGCAAGCAA GTTAACGATT ACCTTTGGGA 6240 

TTAGTAAGTC TTTGATGAAG AAAATTGGGT TATCTAGTAA AATTCCCGAT GCCTTTAAAG 6300 

ATTTACCGCA TTTTCCGAAT GATCAGTTAA TAGACGATTA CAGCGATGGT GATATTATGA 6360 

TTCAAGCATG CTCAAATGAT TCGCAAGTAT CCTTTCATGC GGTTCATAAT TTAGTTCGTC 6420 

CATTTCGAGA TATTGTTAAG GTACGTTGGG CGCAATCTGG TTTTATCTCT GCTAAAGGTA 6480 

AGGAAACACC TAGAAATTTA ATGGCATTTA AAGATGGAAC AATTAATCCT AGGAAGAATA 6540 

ATCAACTTAA AGATTATGTG TTTATTGATG ACGGATGGGC GAAACATGGA ACTTATTGCG 6600 

TTGTCAGACG TATTCAAATA CACATTGAAA CGTGGGATCG TACTGCGCTG GAAGAACAAG 6660 

AGGCTACATT TGGTCGGAAA CGACATAGTG GTGCACCGTT AACAGGTGGG AAAGAGTTTG 6720 

ATGAAATTGA CTTAAAAGCG AAAGATAGTC ATGGCGAGTA TATTATTGAT AAAGATGCCC 6780 

ATACGAGGCT AGCGAAAGAA GCAAATACGT CAATTTTACG TAGAGCCTTT AATTATGTGG 6840 

ATGGTACGGA TGACCGCACA GGTAACTTCG AAACAGGCTT ACTTTTTATT GCTTTTCAAA 6900 

AAGCGACAAA ACAATTTATC GATATACAAA ATAATTTAGG TAGTAATGAT AAATTAAATG 6960 

AATATATTAC ACATAGAGGT TCTGCTTCAT TTTTAGTATT ACCAGGTGTT AGTAAGGGAG 7020 

GATACCTTGG TGAAACATTA TTTGACTAAA TTTGTAGCAA TGCTAATAAC TGCTGCTATG 7080 

GTGTGTAGCT TTGGGTTACT GAAAAGTCAG GCAGCAGAAC AACAAAGTAT TAGTGATGTA 7140 

TATAGTGTGA TAACGGATGC GAAATCTGCA CTTTCTAATA ATTCGATATC GAATGACAAT 7200 

AAGCAGAAAG CAATTGAGCA AGTGGTAAGT GCAGTTAAGA AATTATCGCT TGAAGATAAT 7260 

AGTGAAAGTA ATGCTGTCAA ATCAGATGTG AGAAAGCTTG AAGATGCAAA AGCGAATGAT 7320 
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GCTAGTAAAG ATGCGGGTTC TAAAATTAAA CTATTGCAAC AGCAAGTCGA TGCTAAAGAT 7440 

GCTGCGATGA CAAAAGCGAT TAAAGATAAA AATAAAGCGG AATTAGAATC TTTGAACAAT 7500 

AGTTTGAATC AGATTTGGAC AAGTAATGAA ACAGTGATTC GCAATTATGA CGCAAATCAA 7560 

TATGGACAAA TTGAAGTCGC ATTATTACAA CTTAGAATTG CAATTCATAA GTCACCATTA 7620 

GATACGGCAA AAGTGTCACA TGCTTGGACA ACTTTTAAAT CAAATATTGA TCATGTCGAT 7680 

AAAAAAAGTA ATACGTCTGC AAATGATCAA TACCATGTAT CACAATTAAA TGATGCGTTA 7740 

GAGAAGGCGA TTAAAGCTAT CGACGACAAT CAATTGTCGG ATGCTGcaTg TGCGCTTACA 7800 
CATTTTATAG AAACTTGGCC GTATGTTGAA GGTCAAATTC AAACTAAAGA CGGTGCTTTG 7860 

TATACGAAAA TTGAAGATAA AATACCATAT TATCAAAGTG TATTAGACGA ACATAATAAA 7920 

GCACATGTGA AAGATGGTTT AGTAGATTTA AATAACCAAA TTAAAGAGGT TGTTGGCCAT 7980 

AGTTATAGCT TCGTCGATGT GATGATTATC TTTTTACGTG AAGGGCTAGA AGTGTTGTTA 8040 

ATTGTAATGA CATTGACTAC CATGACGCGT AATGTAAAAG ATAAGAAAGG GACTGCAAGT 8100 

GTGATTGGTG GTGCAATTGC CGGACTTGTA CTGAGTATTA TCTTAGCAAT TACGTTTGTA 8160 

GAAACTTTAG GGAATAGTGG CATTCTTCGT GAAAGTATGG AAGCGGGATT AGGTATCGTT 8220 

GCGGTCATAT TAATGTTTAT CGTTGGTGTT TGGATGCACA AACGTTCAAA TGCAAAACGT 8280 

TGGAATGACA TGATTAAAAA TATGTATGCT AATGCGATTA GTAATGGTAA TTTGGTATTG 8340 

TTAGCGACGA TTGGTTTAAT ATCTGTGTTG CGTGAAGGTG TCGAGGTTAT CATTTTCTAT 8400 

ATGGGGATGA TAGGTGAGCT AGCGACCAAA GATTTTATTA TTGGTATTGC TTTAGCTATC 8460 

GTTATTTTAA TCATCTTTGC ATTATTATTT AGATTTATAG TTAAATTAAT ACCTATTTTC 8520 

TATATATTTA GAGTGTTGTC GATCTTTATT TTTATTATGG GATTCAAAAT GCTTGGCGTA 8580 

AGTATTCAAA AGTTACAATT ATTAGGTGCG ATGCCAAGAC ATGTTATTGA AGGATTCCCA 8640 

ACGATTAACT GGTTGGGCTT TTATCCAAGT TATGAACCAT TGATAGCACA AGGTGCTTAT 8700 


ATTATGGTAG TTGCTATCTT AATCTTTAAA TTTAAAAAAT AAAAAACAGG CCGAGTGCCT 8760 

GTTTTTTTTG TTGCTATATT GGAAATATTC GGTATTGCAG TATAACGATA ATCACAGCAT 8820 

TGATTCGTAT AAGGTTAATG TGTTGGCGGT TTGCCTCGGC ATGTGAACTT AACGATGAAC 8880 

ATACTGAACT CAAAGAGCAA TATGAGTGGC AATGTGAGTA ATATATTTAA TGTTAAATCG 8940 

GGTGGTGCAA TGATACTTGC TAATACAAAG CAAGCGAAAT AAATATATTT ACGrTAATGT 9000 

TTCAATGATG TGGTATCTAT AAGACCGAAT TTTGCAAGAC CCATAAATAA TATTGGTAAT 9060 

TGAAATAGAA GACCAAATGT GAATAACCAA CGTATGAGTT CAATCAAATA TGCTTTAAAG 9120 
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GGAAAGCCAA CATAAAATGC AAAAGCGACG CCAGCACAGA ATAATAACAC GCTGAAAAAA 9240 

CTATATTTAT AAATAAATTG ACGTTCATTA TTATGTAATC CAGGTGCAAT GAATGCCCAC 9300 

AATTGATAAA 9310 

(2) INFORMATION FOR SEQ ID NO: 317: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3458 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 317: 



ATTAGTATTA CACACTAATG TAAATAGATT GGTTGGAGAA GAGATATTTG CTAATAAGTG 60 

CCTTGCTAAT AATGATGTTC AAATTATGAA TTCAATAAAA AAATTAATTG AAGCTGAGTT 120 

ATTAACAACG ACTAATGATT TTGAAGTTAG TATATATAAA AAGACAAGAC CTGAATTACA 180 

AAGTATTTTA AAAAGTTTTG GTATAAAAAC AACAGGTAAT AAACCTGACT TAATTAAACG 240 

TATTGACGAC AATTTTCATA TTATTAATAA CTTAGATTTA CCATATGTAT ATATACCAAT 300 

TAAGAAAGGA GAAGAAATTT TAAAGAAAAC AGAGTACTTA ACCTCTTTTA TACAAAGTTA 360 

TGGTGAAATT TCTCTTGAGC GTGCTTATTA TTTGGTTGAA AACTATATAG ATGAAAATTG 420 

CGATGATAAA GTTGCAGAAA TATACAAGTT TGAATTTCAA AGAAAATATG ACAATGGCGA 480 

GTTTGATTTT AATCATGGAT ATAATTTCGA ATTGAATATG TTGATAGATC ACTATAAAAG 540 

AGATGTAAAA GACTACGATA ATGCCAGAAA GTATTCAAAT ATTTATCTTT ACTTTGGTTT 600 

GAGAGATTTC TTAAAAAAAT TAATGAGTAA TTATTCATAT TACGATAgTA AAGGGAATAT 660 

AGATTTGAAC GAAATACAAA ACGATCTGAA TAGATTTATC AACTCTAGCG CTTCTGGTAT 720 

GTACGAGCGA TTAATATATA ATGAAAATTT ATCCAATAAT ATTATGTTTG AATTATTTAA 780 

AAAGGACACA CAAGATTATA GTGATTTGGA AGAACAATTG ATTGAAAAGT TCATAAACTA 840 

TGTAGTGTCT AATGTAAAAA AAGAAAGTAG GAGTAATACT CTTATAGAGT TATCAAAAAT 900 

TTTAGAGAAC GGATATACAA TTGATAAAGA AGAATTTAAA AAAGAAGATG ATTATCTTTC 960 

TAAGTACATA TTTACTGACA TAGATTATTT GAAAAAGTTA GAATCAAAAA TAAACGTTGC 1020 

TATTGATATT CGAAGTGGAG AAATTCATTT GGTATTAGAT GATGATAGCC TTGATATATT 1080 

AATACAAAAT CAAAAATACG GCAATGAGTT TTGAGTCATA ACTAAATATA ATATGTTGAA 1140 

GAGAGGTTTT ATTTAAATGG CTAAAATTGG TTATGATCGT GTATCAACGA AAGATTAATA 1200 
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AACGTACAGA 


GCTTGTTAAG TGTTTAGATT ATTTACGAGA GAGCGACACA TTAGTTGTCT 1320 

ATCAACTTGA TCGGTTAGGT AGAACGACAA AACTATTaAT TGAATTaTCA CAATGATTCG 1380 

ATGATAACGG AATTGACTTA CAAATTAGTA ACATGAACAT TTCAACGAAA GACACAATGG 1440 

GCAAAATGTT TTTTACGATG ATGAGTGCAT TTTTCaGGAT TAGAAGTTAA TTTACTATGT 1500 

GAGTGTATAA AATAGACTTA GCAGCAACAA GAGCGAGAGG CCGAAAAAGC GGGCGCCCCT 1560 

CTTTACCAGA GAATAAAAAA TGAGAAATTA AATTTTTATA TGATGAACAA ACGATAACAG 1620 

GGGAAGAAAT AGCTAGTTAG ACAAGGGTAT GTCACTCAAC TGTTTATCGA GTTGATTAAG 1680 

AAATGAAAAA ACTTATACTA TGAATTACTG TTTAAAAGTG TGCATGTTAT AATATTTATT 1740 

GAGCAAGTTG GATAGATGGT GGCTAATCTC TTAATAAAGG GGTGATGCCT ATGGTTATAG 1800 

TTGTTACTCC TAGnAAAGGA CTAGCATGTC TGATTTTGAA ATGCTTATGG TTGTATTAAC 1860 

AATCATTGGT TTAGTATTGA TTAGTACTCA AGACCATAAA AAATAaCCTT CTATTCGCTT 1920 

TGACCGGCAT TTTTGAAGGC TATTTTTAAA TAAAATATAA GGTCACCGTC TTTTTAACGG 1980 

GCTCATTAGG GTAACATGTT TCCGAGTGTT GCCCTTTTTG TGTTTCAAGA GTTAATGATT 2040 

TTCATCTTTT GCTTCTACTT GCTACAAATA TATTTTAACA CATTTTTCTT ATGAATTGTA 2100 

GTTCTGAACA TAATCAGAAT TAATAAAACC AACTTTCCAT ACAGCAGAAA ATACAATTAA 2160 

AAGTATAGAA TGTAATCACG AATTATATAA AAAGTATTGG AGACCTCTTC ATATATAGAT 2220 

AATTCACTTA GTTATTTTAG AAAGAAGCCC CTAaCAACTA AAGTTGAAAA ATAGAGGAAC 2280 

ACAGTTgGAT TACGCATCAA CTGCATaAGg CCCCTAAmAA CTAAAGTTGT AAGGGGCyCT 2340 

AAAATTTATT TTGGTTGATT GTCTTCTGGT TTATCTGAAG TCATTGTTTT TGTTGTATTA 2400 

TCATTTAATG AATCTTGTGC TTTTTCTGTT TTTGTTTCAA CAGATGTTGT CACTTTATCA 2460 

TTTTCTACTT TTGTATTTTT TGGTTCAGTG ACAACTTTTT TATTCTTGTC AGCTGTTTTT 2520 

ACTTTATCTT TAGTAATTTT TTGACCACGT TTTAAGTAAT ATTGAACGAT ACCCATTAAA 2580 

ATGATTGCAT GAATAACAAC GAATAAAATA ATTGTCATTA CTGTATAAAC TCTTATGATA 2640 

TTTTCTGCAA CACTTTGAGA GAGTGATTGC GTATTAAATG AAATCAAGTA ACCAACAGGT 2700 

GTTTTAAATA AAATAATAAT TAAGTTTAAT AATAGTATTC CAATGAAAAA TTTGAAAAAT 2760 

GTTTTTTGAC CATTTTTCAT TGCTTTAAAT CCATTAGCTA AGTGTGTTTT TACTTTATCG 2820 

TTAGTTGATT CaACGAAACT AGTCaTAAAG TTGAAAATAG GTATTAACAA TAACCAAGTA 2880 

ATAATTGCTT TAATCAATAA AACAACAATG ATTATGATGC TTTGAGTTGT AATAGCAATA 2940 

CCAATCAGGT GTGTACTATC TGCATATGAT GATTGTACTG AATTCATAAT CATTTCTGAT 3000 
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ATAGATACAA GACCGATCAG AACGCTCTTT AAATAATTAC CTTTTTTAAA TGCAATAAAT 3120 
AAATCGGTAA ATTTTACTTT TTCATGACTC ATTGCTCGTT TCATAACATT TGTAATTCCG 3180 
ATAAAAATTT GAACTAAAAC AAACAGTGAA ACTACAGCTG CTATAAGTAA TACAGCGATA 3240 

ACTTTTAAAT ATGCATCTAC TGGTGGTTTT TGCCCAAATT GTGAATAAAT TGCAATAGTT 3300 
TGTGCATACT TAGCAAGCTG GAAATTAGCA AGTAAATATA CAACTGCAAT TACAGCAAAT 3360 


GCTATAAGTG CATATATCAA TGTTATTAAT AATTGTGGTT TAGCATTTTT AAATGCTGAT 3420 
TTAAAGTAAG TAAACAAAGT GGTGCCTCCT TTTCTTCA 3458 
(2) INFORMATION FOR SEQ ID NO: 318: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 695 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 318: 



GCTTGCAAGA TGCTTTCATT AAAGACAAAG TCATCGATAT TATGAATATG TTCCAAAATC 60 

ACACTGATAT CACTTATACG TTGAATAAGT CTCATGCACA TATTATATGT ACGCCAGAaA 120 

TATTTGCTAA ATTGTTACAT ACGATTGCAA CTAGAAATAT CGACATTCTA TCTGCCAATT 180 

ATAGATCGTC ATCTATGAGC AAAGCTCGTA TATCATAATA AAAGACACAT CTATACGATG 240 

ATCATTTACA ATGATTGCGT ATAGTGATGT GTCTTTTTTT GTATTATTTA TCTTCGGAcG 300 

ATTGATCCTC AACCATTTGT TCAAATGCTT CACGAACTTG TGGTACTGTC ATACCTACTA 360 

CAACTTGGAT ACTCTTACCA CTCTTTACTA ATCCGTGTGC CATTTGTTGA TGCGTGAAAT 420 

ATTCCGTATC TGCAACTTTA CTTTCATCAT AAACAGTTAA GCGTAATCTT GTTGTACAGT 480 

TAGTAACATC TTTGATATTT TCTTTGCCGC CTAAACCGTC GAGGTAATAT GCAGCTTTGT 540 

CTTCGTATTC ATTACCAGAT GAAGCACGTT TAGAATCTAC GCTATCACCT TTTTTATTTT 600 

TGTAATCTTG CTTAGAAAAT AATTTAACTT TTyCTTACGT CCAGGTAATG 660 

GAATATCAAA TTyCAAAATT AAGAATCTGA ATAAG 695 



(2) INFORMATION FOR SEQ ID NO: 319: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 875 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 319:

AATATTGaTa TTATATATAG TCGAATCaAT GATGACCCaC ATCGACCtAA TGCAAATAAT 60
AAGACAATCA AGCAGTTAAA TGACTTGTAT TGCTCGCGGA ATTTAAGATA TAATGAAATC 120
GCATTCAAGA AATTTGATAG TCATTTGCTA TCAATTTCAG AAATATATTA TGAATTGCTA 180
AATTTAATAA AAGCGAGTGA TCAGTATTAG AGAGAATAGA GCGTTAAGAC TCTATCGCCG 240
AAGtGCAAGT AATTTATTAC GAAACTCTCA GGcAAAAGGr TAATACTGTA ACGCGTTCCT 300
GAATTGGTGA TTTATAAACA GGGTAGCGAT TGCTATCCTG TTTTTATAAT TTTAAGGGGG 360
TATTTCAATG TCAAGTGATT TAAAACAAAC ACCTTTATAT CAAAATTATG TTGATAGAGG 420
TGCAAAAATT GTGGAATTCG GAGGATGGGC GATGCCTGTT CAATTTTCAA GTATTAAAGA 480
GGAGCATAAT GCTGTTCGAT ACGAAATTGG CCTGTTTGAT GTTAGTCATA TGGGTGAAAT 540
X bMu XAAuA GGTAAAGATG CTAGTCAGTT TGTGCAATAT TTATTATCAA ATGATACTGA 600
TAATTTAACT ACTTCAAAAG CATTATATAC TGCTTTATGT AATGAAGAAG GCGGTATTAT 660
TGATGATTTA GTAATATATA AATTAGCTGA CGACAATTAT TTATTAGTTG TTAATGCTGC 720
TAATACTGAA AAAGATTTTA ATTGGATTTT AAAACACAAA GAGAAATTTG ATGTTGAAGT 780
ACAAAATGTA TCAAACCAAT ATGGTCAATT AGCAATACAA GGACCAAAAG CtAGAGATTT 840
AATTAATCAA TTAGTTGATG AAGAkGTAAC TGAAA 875



(2) INFORMATION FOR SEQ ID NO: 320: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5897 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 320: 

TCTTTTTACG AAGGCTACCG TTTAGGATCT GATAACTTAA CTGTAGGAGA AATTGTATTT 60 

AATACAGCGA TGACAGGTTA TCAAGAAACT ATTTCAGATC CATCATATAC AGGTCAGATC 120 

ATTACTTTTA CGTATCCATT AATCGGTAAT TATGGTATCA ATAGAGACGA TTTTGAATCA 180

TTAGTACCTA CATTAAACGG TATTGTAGTG AAAGAAGCGA GTGCGCATCC AAGTAATTTT 240 

AGACAGCAAA AGACACTTCA TGACGTTTTA GAATTGCATC AAATTCCAGG GATTGCAGGT 300 

GTTGATACAA GAAGTATTAC GCGTAAAATT CGACAACACG GTGTGTTAAA AGCTGGTTTT 360

ACTGATCGAA AAGAAGATAT TGATCAACTT GTCAAACATT TACAACAAGT AGAATTACCT 420

AGTGTCGTAC TTGTAGACTT TGGTAAGAAG CAAAATATTG TTCGAGAATT AAACGTCAGA 540
GGTTGTAACG TCACAGTTGT ACCATATACA ACTACTGCCG AAGAAATTTT AGCAATGGCT 600

CCAGATGGCG TTATGCTATC AAACGGACCA GGTAATCCTG AAGTTGTAGA ATGTGCGATT 660 

CCAATGATTC AAGGAATTTT AGGGAAAATT CCGTTCTTTG GTATCTGTCT AGGACATCAA 720 

CTTTTTGCAT TATCTCAAGG AGCAAGCTCA TTTAAAATGA AGTTTGGTCA TCGTGGTGCG 780 

AACCATCCAG TTAAAAATTT AGAGACTGGA AAAGTTGATA TTACGAGTCA AAACCATGGA 840 

TATGCAATAG ATATAGATTC GTTAAAAAGT ACTGATTTAG AAGTTACTCA TCTTGCATTA 900 

AATGATGGTA CTGTAGAAGG TTTAAAACAT AAAACATTAC CAGCATTTTC TGTTCAATAC 960 

CATCCTGAAG CAAATCCAGG ACCGTCAGAT TCAAACTATC TATTTGATGA TTTTGTAGCA 1020 

ATGATGACTA ATTTTAAGGA AAAGGAGCGT CATATCAATG CCTAAACGTA ATGATATCAA 1080 

AACAATTTTA GTAATAGGGT CTGGGCCAAT TATCATAGGT CAAGCAGCTG AATTTGATTA 1140

TGCTGGAACA CAAGCATGTC TAGCTTTAAA AGAAGAGGGA TATCGAGTTA TTCTTGTAAA 1200 

TTCAAATCCA GCGACAATCA TGACTGATAA GGAAATTGCG GATAAAGTAT ATATCGAACC 1260 

GTTAACTCAT GATTTTATAG CGCGAATTAT ACGTAAAGAG CAACCTGACG CTTTACTTCC 1320

AACTTTAGGT GGTCAAACAG GTTTAAACAT GGCGATTCAA CTACACGAAA GTGGTGTGCT 1380

TCAAGATAAT AACGTCCAAT TATTAGGAAC TGAGCTAACA TCAATTCAAC AAGCAGAAGA 1440 

CCGTGAAATG TTTAGAACAT TAATGAATGA TTTAAACGTT CCTGTACCAG AGAGTGACAT 1500 

TGTAAATACA GTAGAGCAAG CCTTTAAATT CAAAGAGCAA GTGGGATACC CGCTAATTGT 1560 

TAGACCGGCA TTTACGATGG GTGGTACCGG AGgCGGTATT TGTCATAATG ATGAAGAATT 1620 

ACATGAAATC GTCTCAAATG GTCTTCATTA TAGTCCAGCA ACGCAATGTT TATTAGAAAA 1680 

ATCTATCGCA GGTTTTAAAG AAATCGAATA CGAgTAATgC GTGaTAAAAA CGATAATGCC 1740 

ATCGTTGTAT GTAACATGGA AAATATTGAT CCAGTTGGTA TTCATACAGG CGATTCAATT 1800 

GTTGTGGCTC CTAGTCAAAC ATTATCAGAT GTTGAGTATC AAATGTTACG TGATGTTTCA 1860 

TTAAAAGTTA TTCGAGCTTT AGGTATCGAA GGTGGTTGTA ATGTTCAATT AGCATTAGAT 1920 

CCCCATTCAT TCGATTATTA TATTATAGAA GTAAATCCGC GTGTATCACG TTCATCAGOG 1980

TTAgCTTCAA AAGCAACAGG ATATCCTATT GCAAAATTAG CTGCTAAAAT CGCGGTTGGT 2040 

CTAACATTAG ATGAAATGTT AAATCCAATT ACAGGAACAT CTTATGCAGC GTTTGAACCA 2100 

ACTTTAGACT ATGTGATTTC AAAAATACCA AGATTTCCTT TTGATAAATT TGAAAAAGGA 2160

GAACGAGAGC TTGGCACACA AATGAAAGCA ACAGGTGAAG TTATGGCCAT TGGTCGAACT 2220

TTACCAAATG GTGAAAGCTT CGATCTTGAT TATATTAAAG AACGTATTTC ACACCAAGAT 2340

GATGAACGAT TATTTTTCAT CGGCGAACAA TTAGAAGAGG CACAACATTA GAAGAAATTC 2400

ATAATATGAC TCAGATTGAT TACTTCTTCT TACACAAGTT CCAAAACATT ATTGATATTG 2460

AGCATCAATT AAAAGAGCAT CAAGGTGATT TAGAATATCT TAAATATGCA AAAGATTATG 2520 

GATTTAGTGA TAAAACAATA GCGCATCGCT TTAATATGAC GGAAGAAGAA GTATATCAAT 2580 

TGCGTATGGA AAATGATATT AAACCTGTTT ACAAGATGGT TGATACTTGC GCAgCTGAAT 2640 

TTGAATCTTC AACACCATAT TATTATGGTA CATACGAAAC TGAAAATGAA TCCATAGTTA 2700 

CTGACAAAGA AAAAATCTTA GTATTAGGCT CTGGACCAAT TCGAATCGGC CAAGGTGTAG 2760 

AATTTGACTA TGCGACAGTT CACGCCGTTT GGGCAATTCA AAAAGCAGGG TACGAAGCGA 2820 

TAATTGTGAA TAACAATCCA GAAACAGTTT CAACAGACTT CTCAATTTCT GACAAATTAT 2880 

ACTTTGAACC TTTAACTGAA GAAGATGTGA TGAATATCAT TAATTTAGAA AAAGCTAAAG 2940

GTGTCGTTGT ACAATTTGGA GGACAAACAG CGATTAATTT AGCAGACAAA TTGGCTAAAC 3000 

ATGGTGTTAA AATACTTGGT ACTTCACTAG AAAATCTAAA TCGTGCTGAA GATAGAAAAG 3060 

AATTTGAAGC ACTATTAAGA AAAATTAACG TGCCACAGCC ACAAGGGAAA ACAGCTACAT 3120

CACCTGAGGA AGCATTAGCG AATGCTGCAG AAATCGGATA TCCGGTTGTA GTAAGACCTT 3180 

CTTATGTATT AGGTGGTCGC GCAATGGAAA TTGTAGACAA TGACAAAGAG TTAGAAAACT 3240 

ATATGACCCA GGCTGTAAAA GCGAGTCCGG AACATCCGGT ACTAGTCGAT AGATATTTAA 3300 

CTGGTAAAGA AATTGAAGTT GATGCGATTT GTGATGGAGA AACGGTCATT ATTCCAGGAA 3360 

TCATGGAACA TATTGAACGT GCTGGTGTGC ATAGTGGTGA CTCAATCGCT GTATATCCAC 3420 

CACAAACTTT GACAGAAGAC GAGTTAGCAA CACTTGAGGA CTATACTATA AAATTAGCTA 3480 

AAGGTTTAAA CATCATTGGC TTAATCAACA TTCAATTCGT TATAGCTCAC GATGGTGTGT 3540 

ATGTTTTAGA AGTAAATCCA CGTTCTAGTA GAACGGTACC ATTCTTAAGT AAAATTACTG 3600 

ATATTCCAAT GGCACAATTA GCTATGCGAG CAATCATTGG GGAAAAACTA ACAGATATGG 3660 

GTTATCAAGA AGGGGTTCAA CCATATGCTG AGGGTGTCTT TGTGAAAGCA CCAGTATTTA 3720 

GTTTTAATAA ATTGAAAAAT GTTGATATTA CTTTAGGACC TGAAATGAAG TCAACAGGTG 3780

AAGTGATGGG GAAAGATACT ACATTAGAAA AGGCGTTATT CAAAGGGTTA ACAGGTAGTG 3840 

GCGTTGAAGT TAAAGATCAC GGTACAGTAT TAATGACCGT CAGTGACAAA GATAAAGAGG 3900

AAGTTGTTAA ATTGGCACAA CGCTTAAATG AAGTTGGCTA TAAAATTTTA GCAACGTCTG 3960

GAAcAGCTAA TAAATTAGCT GAGTATGACA TACCTGCAGA AGTAGTAGGC AAAATTGGTG 4020

TGACTAAAGG TAAAGAAGTA GAAAGGGATG GCTTCCAAAT TAGACGTACT ACAGTTGAAA 4140

ATGGTATTCC ATGTTTGACA TCTTTAGATA CAGCTAATGC CTTAACGAAT GTAATTGAAA 4200 

GTATGACATT TACAATGCGT CAAATGTAAA TCAATCAAAC TGTATCGGTG GGGCTGTAAT 4260 

TAACCATTTA CTTAAAGAAG TTTATATTAC AGCCTCATTA TTTTAATGAA TTTCTTAATA 4320 

TAAAGGGAGA CaTATATGAT GAAAGATTTA CCAATTATTG CATTAGATTT TGAATCAAAA 4380 

GAAAAAGTAA ATCAATTTTT AGATTTATTT GATGAATCAT TATTCGTAAA AGTAGGTATG 4440 

GAACTTTTTT ATCAAGAAGG TCCTCAATTA ATTAATGAGA TAAAAGAAAG AGGCCATGAT 4500 

GTATTTTTAG ATTTAAAACT GCATGATATT CCTAATACAG TTGGTAAGGC GATGGAAGGA 4560 

CTAGCTAAAT TGAATGTTGA TCTGGTAAAT GTTCATGCTG CTGGTGGCGT AAAAATGATG 4620 

TCTGAGGCCA TTAAAGGATT AAGAAAACAT AATCAAGATA CAAAAATTAT TGCAGTAACA 4680 

CAGCTTACGT CAACAACAGA AGACATGTTA CGACACGAAC AAAATATACA AACATCGATT 4740

GAAGAGGCCG TTTTAAATTA TGCCAAGTTA GCAAATGCAG CTGGTTTAGA TGGCGTTGTT 4800

TGTTCACCTC TTGAAAGTCG TATGTTGACT GAAAAGTTAG GTACATCATT TTTAAAAGTA 4860 

ACACCAGGTA TTAGACCTAA AGGTGCATCT CAAAATGACC AACACCGTAT TACGACACCG 4920

GAAGAAGCAA GACAGCTTGG TTCGACGCAT ATTGTAGTCG GTAGACCGAT TACACAAAGT 4980 

GACAATCCAG TCGAAAGTTA TCATAAAATT AAAGAAAGTT GGTTAGTATA ATGGCTAAAG 5040 

AAATTGCAAA ATCATTATTA GATATTGAAG CTGTAACATT ATCACCAAAT GATTTATATA 5100 

CATGGAGTTC AGGTATTAAA TCACCGATTT ACTGTGATAA CCGTGTTACG TTAGGTTATC 5160 

CTTTAGTTCG AGGCGCAATC CGCGATGGTT TAATTAACTT AATTAAAGAA CACTTTCCTG 5220 

AAGTAGAAGT TATTTCTGGT ACTGCAACAG CTGnTATTCC ACATGCAGCT TTTATTGCTG 5280 

AAAAATTAAA ATTACCAATG AATTATGTTC GTTCATCAAA TAAGAGTCAT GGTAAGCAAA 5340

ATCAAATCGA AGGTGCTAAA AGTGAAGGTA AAAAAGTAGT TGTGATAGAA GATTTAATTT 5400

CGACAGGGGG ATCTTCAGTC ACAGCAGTTG AAGCCTTAAA ACTAGCAGGT GCAGAAGTAT 5460

TAGGTGTTGT AGCTATCTTT ACTTACGGTT TGAAAAAAGC AGATGATACA TTTAGCAATA 5520 

TTCAACTACC TTTTTACACT TTAAGTGATT ACAATGAATT AATTGAAGTA GCTGAmAmTG 5580

AAGGTAAAAT TTCTAGTGAA GATATCCAAA CATTAGTTGA ATGGAGAGAC AACTTAGCAT 5640 

AATATAGACA CTAGAAGGAG GAATTCAACA AATGAATGAC AAAACATCTA ATGATTTATA 5700 

TGGGAAGATA AAACATTGTA ACGAATTTAT CAATCATTCA AATGATTCCA ATCTATCTAG 5760

TAGTCACGAT GTCGACGAAA GTTCAACGAA GCAAAAACAT ATAAAAAATA AAACAACTAT 5820

TTAAAAACAA AAAAGCT 5897
(2) INFORMATION FOR SEQ ID NO: 321: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 7965 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 321:
TCATTCTAAA TCAACTTATT TTCCATTGCA TAAATTGCTG CTTGTGTACG ATCGCTAACT 60 

TGTAATTTAC TAAATATATG ACTGACATGT GTTTTAATTG TTTTTTCAGA TACAAATAAA 120
GTTTCTGCAA TCTCTTTATT TGTTTTACCT TTAACCATTT CACGTAACAC TTCAATTTCT 180 

CTCTTTGACA ACTTATTCGT GTAGTGTGGT TTTTGGCTAA CTGTTTCGAA TACATCTTGT 240
GCCTTAGGAT GTATCATTTT TTCACCGTTC ATAACTCGTC TAATAGTTTC AATTAATTGC 300 
TGAGGCTCAA CGTCTTTCAT TTCATAACCA TCAGCACCTT TATTGATTGC TGAAATTACA 360 

TGTTCATCAT CAACATAACT TGTTAATACC AAAACTTTAA TATCCGGATA ATGTGCCTTA 420
ATATATTCCG TAATTTCAAT ACCATTCATG CCAGGCATCA CTAAATCTAA TAGCACAATA 480

TCAGGGTGCT CATGCTCTTT TAAATATTCT AAAAATGTTT CTCCATCTGC AAAGTCTTGT 540

AAAACTTCTA TGTTTTCAaT CGTGGaTAAT AAAAATCGCA ATCCTTGtCG CACAATATAA 600 
kGGwCATCTA CTAATATnAC TTTGTTCATG GGTTATCTCC TTAAAtCAAG CTATTTTATA 660 
GGAATTGTGA ATTGTATTTG TGTACCCTTT GTTGGCTGAG AATGAAAGGT CACTTTACCT 720 

CTTAATAATT TAACTCTTTG TTTTATGTTA TTAATACCGT GTGATGAAGC TATCTGAACA 780 
TTATCGATCT CAAATCCTTG ACCATAATCA ATCACGTCAA TATATAGTAT ATCGTTCATT 840 
TGTTTTAATG TAAGATCCAT TTTATTCGTA TCAGCATGTT TCTTAACATT ATTAATACAC 900 

TCTTGTAATG CTCTGTATAT GTTTTCTTCG ATTTCATTAG ATAAATCGAT TAAACCTTCT 960 
ACATTTACAT TTAATTGTAT ATGCATTAAT TTACTATATG CTGTCAAAGC ATGAATTAAA 1020 

CCTTGCTCAA GTCCAACTGG CTTAAGTnGC CAAATCAATG CACGCATTTC ATTAACGGCA 1080

TTTTGACTCG TTtCCTCAAT CGTCTTGAAT GCTTGTTTAG CGATGGATTC GTTTGACATG 1140
CCATACGCAG CATGTGCtGT TAGTTTTACA GAAAATAACA TTTGATTTAC TGAATCATGT 1200

AAATCTCTAG CTAAACGATT ACGTTCATTA ATTTTTGCCG CTTCTTTTTC ACGGTCTGTT 1260
AAATAAATAC GTTTGATGGC TGACCCTAAT TGAAATGCGA CAGACTCTAG CAACTCTAAA 1320
TGACCCGATT TAAGTGGTAC CGTTGCATGA TGTGTAATAT TGTCATTTTG GCTASGAAAT 1440

GCTTTAGAGG CTAAGTTAAT ACGAGAACAA TTGACGATAT TCGACGCTTT CATTAGCCTA 1500 

CGTTGATTAA ATGCTTTCAC ACACCAACAA GACCCATCTT TAATATAGTG ACAGTGGTCT 1560 

GCTGTCAAAG ATTGTGGTAG AGCCACATGT GATACAAGTT CATGTTCACC TACGCTATTG 1620 

ATGAAAAATA TCCAGCCTGT CGTGAAATTA CTGCCCTCAA TTAAATATTT TAACGCACCT 1680 

TGGGTCATGC TATACATTTC rGTTTCTTCG TTTAAAAATT CGGCAATCTC TTTTAATAAA 1740 

GCTAGTCGCG TCCTTTGTTC CATCAAATCG CTCCAATTCA TTTTTACGTG TATTAACTAT 1800 

TATACATTGA GTTATTATAT TTTTAAATCT TAGACGTAAA CATGATAAAA TGGCCTTGAT 1860 

TACTCAATAG TTATATTTCG GAGAACTGAT TTGTGATATG ATATTAAAGA CTATAGGAGG 1920 

ATTTTATGAA ATTTAAAATA CCAGAAAACT TTAATGACTT AAGTTTACGA GATATTTTCC 1980 

AACAACTTAA GGTACCTAAA AAAGATTTAC ATCATTTAAA TATGTCTAAA GATATTACTA 2040

TTAATGATAA ACCTGCGCGA TTAATGGATA AAGTGCATAC TGGCGACGAT GTATTTGTTC 2100 

CAACCATCGA TGAAAAAAGT AATTATGTTC CAAGTTATCG TTATGCACAA ATTAAATACG 2160 

AAGACGATGA TATGGCAATC GTAATGAAAC CTAAAGGTGT TAAGACTCAC CCTAATGATT 2220

TAAAAGAAAG CAATACTTTA ATGAATCATG TGATTTACAC TATTGATAGT GACTATGTCG 2280 

AACCAATTCA TCGACTGGAC CAGGAAACAG TAGGATTATT AATTGTTGCT AAAAATCCTT 2340 

TAATGAAAAA AATTCTTGAT CGCATGTTAG AAGACAATGA TATTACGCGG ATATACAAAG 2400 

CAAATGTTAA GGCACTTTTA CCTTTAAAAC CACAAACGAT TGATATGCCA ATTGGTAAAG 2460 

ATAAATTCCA TTCGAATAAA CGACGTGTGT CTCCTACTGG ACAGCGTGCA ATTACACACA 2520 

TTTTAACTTC AAAAATGATA AAAGAAGCTG TGTGCCmACT TGAAATCAAG TTGGATACTG 2580 

GACGTACTCA TCAAATwCGT GTGCATTTAG CTGAAATTGG TCACCCTGTT ATTGGTGATC 2640 

CTTTATATGG TGATTCAACG TTAAGACAAT TAGAACTTGA AAGTTACAAA ATAGAGTTTG 2700 

TGcATCCCTT GACTAAGGAA GTCATTTCCG TTTCTTTGGA TGACTAATTT GATTAGTTTT 2760 

GCATGATATT tAAACATGCA ATACCGCATT GTAACTAAAT CAAGTATCAA CtTAAAcGGA 2820 

TAGATGGAAA ATTATTAATT TTTTCAGATG TTCGGTTTTT TTGTTTTTTA CGATGCTTAG 2880

GATTTTATAT TTTGATATTT TAGTAATTAT TCATTTTATA ACATCCTTGG ATAATGACTT 2940 

GTAGTCTTTT TCAACTGCGT TACGTGTATC TATGGACAAT ACATGACATC ATAAGATTTT 3000 

TATCACAGGT TGTTTGGCCA ATACATGTAC AACAATTCAT CATATAAAAA ATAGGTTCTA 3060
TAATAAAACG GACTCCATGA AAAGTTTTTC CTTTTCATGG CTCTATATCA AATCAGACTT 3120
CCAAACTTTA TTTTATATTA ATATTTAATT AATGAGGATC TACCATATCT TCTGGTTTAA 3240

TCCATGCTTC AAATTGTTCT TCTGTAACAT ATCCAGTTTG AATTGCAGAT TCTTTTAAAG 3300 

TTAAACCTTC TTTATGGGCT TTCTTAGCAA TTTGAGCTGC TTTTTCATAA CCAATATGTG 3360 

GATTTAATGC AGTAACTAAC ATTAATGATT GATTTAAATA ATTATCAATA TTCTCTTCGA 3420 

TTGGTTCAAT GCCCACTGCA CAATTGTTAT TAAATGTTTC CATACCATCA GCTAAAAGAT 3480 

AAATTGATTG TAGTGTATTA TGCATAATAA CTGGTTTATA AACATTCAAT TCAAAGTTAC 3540 

CTTGTGAACT TGCGAACCAA CAACTGTATC ATTACCCATT ACTTGGACTG CnACCATTGT 3600 

TAACATTTCA CATTGTGTAG GATTAACTTT ACCAGGCATA ATTGATGAAC CTGGTTCATT 3660

TTCAGGGATA GAAATTTCTG CCAAACCAGC TCGTGGCCCT GAAGCCAACC ATCTCACATC 3720 

ATTAGCAATT TTCATTAAGT CTCCTGCTAA TGCCTTCAAT GTTCCATGCA ATTGAACAAC 3780

TTCATCATGC GCTGTAAGTG CGTGGAATTT ATTTTCAGAA GATACAAATG GATAACCCGT 3840

ATTTTCTGAA ATATAATGTG CCACTTTATC ACCAAATTCA GGATGCGCAT TAATACCAGT 3900 

ACCAACAGCC GTACCACCGA TGGCAAGATT TAAAATGTGC TTCTTAGATT CAGATAACAT 3960 

TGTTTCACAA CGGTCAAGCA TATAACGCCA GCCACTAATC TCTTGTCCTA GTTTGATCGG 4020

CGTTGCATCT TGTAAATGTG TACGACCAAT TTTAATAATT GAATCAAATT TATCTTCTkT 4080

TTCTTTCAAA GTATTTCTTA AAAGTTTTAA TGCAGGTTCT AATTTTGTTT CAACCTCTTG 4140 

ATATAATGCA ACGTGCATAG CAGTTGGGaA TGTATCaTTC GaACTTTGAG ATTyATTTAC 4200 

ATCATCATTk GGGTGGATAC TTTCATCACT TTGATGATCT TTTAAATACA TATTAGCAAC 4260 

ATAACTTACT ACTTCGTTCA CATTCATATT ACTTTGTGTA CCGCTTCCTG TTTGCCATAC 4320 

AACTAGTGGG AAGTGTTCAT CTAATTCACC TGATAAAATT TGATCACATG CGTATACAAT 4380 

GGCATCTTTC TTTGCCTCGC TTAATTTTCC TAAATCAAAA TTAGCTATTG CTGCTGCACG 4440 

CTTTAGTTGT GCAAAACCAT AAACTACTTC GATTGGCATA CGCTCTTTAC CAACTGGGAA 4500

ATTACGTTTA CTTCTTTCTG TTTGAGCACC CCAATATTTA TCTGCAGGTA CTTCTATTTC 4560 

TCCAAAAGTA TCATGTTCAA TTCTTACTGA CATTCAATTT CTCCCCTTAT CACTGTTTAT 4620 

TTAACTGTAG TATATCATTA AATAATTTAA TTGAGCAATT TATGATTAAA ACGTTTTCAT 4680

AATTTGAAAT AAAAATACAC TAATCGCACG TGTTCACCCT TTATTACAGT GATACGGTCA 4740 

TACGATTAGT GTGTTATCTA TCATTATTTA GTTATTATTG AACTAAGTTT AATTACGATA 4800

CTTTGTTTTA GTAGCTTCAA CCGtAGCAAT AGCTGTAAGT ATATATAATA CAGCACTAAC 4860

AATTGTCGTA TATGGATTTA GAGCAACAAG CGTaCCTAAA ACTCCTGTTA AACTCGCATA 4920

TACAATACCT GATTGATTAC TTTTAATGAA TGTTTGCGCA TTAACATCAT CAATTAATCC 5040

TTTTGATAAA TTGAGTTGTA ATTTTATTAC TTTGAAAATA ACAGGTAAAT ATAATGCCCC 5100 

AATTGCCAAT GGAAAAGCTT TAATTGATAT TAAACTTATA ATAACTGTTG CTATCAATAA 5160 

TTGAATCCAG TATTTTCCTA ACATAAATAT ATAAATCTCC TCTAATTTCA TTCTTCAATA 5220 

GCATATCATA ATCTTGGCAT ATTAAGAAAC GCGGTTTAAT GATTTCATTA AAAATATTAC 5280 

TGATAGATGA CTTCTTTCAA TTATGTCTGG AGTAATTAAT TATCAATTCC GTTTAAATGG 5340 

TGTTTTAATA TTTAAAATTG AACTTTTGAT ATATTACTAT GTCTGGTACA CAAATCAATG 5400 

TTTTATGCTT TACAAAGTTA TATTGGCAGT AGTTGACTGC AGTCCACAAC ATAGAGGCTT 5460

CGGAATGTCA GCTTCTATTT CATGCAAGTT GGTGGAGCTC CAACATAGTG GAATTGGATT 5520 

CCCAATTTCT ACAGACATTG CAAATTGGGG AAACGGGCCA CAAACTCAGA AACTGGTGGA 5580 

AAGTCAGCTT AAAATAACAT GCAAGTTGGC GAGGCCCCAA AATAGTGAGA TCGGATTTCT 5640

AATTTCTACA GACATTGCAA ATTAGGGAAA CGGGCCACAA ACTCAGAAGT TGGTGGAAAG 5700 

TCAGCTTAAA ATAACATGCA AGTTGGCGGG GCCACAACAT AGAAAAATTG GATCCTCAAT 5760 

TTCTACAAAC AATGTAAGTT GGGGAAACAG CCCCAACACT GAAACTAGCA GAAAGTCAGC 5820

TTCTATGAAT ATAATAAAAA AGCTAGGTAA CAAAATGCTA CCTAACTTCA TATTCAAGAT 5880

AATCAATCCT ATTTGATATA TGTTCTATAC TATACATTAT TTACATGATA AATAACTGAA 5940 

TATTACACAA TTATAATACT TTACTGACTG TCTTCTTCAG AATTCTTTTC TTGATCATTT 6000 

TGATCAGAGA TTTGTTCCaT TTCTTTACCT AATTCTTTTA AATCTTCAAA ATCCGTTACC 6060 

ATACTGTTTT CTTCTTCATG ATAATTTAAT TTTGGATCTT TGTCTTTAGA CATAATCAAT 6120 

ACCTCACAGA TTTTTAATTA AGCAAAGCGT GACGTAAAGT AAGCTTTAAC ATCTTCAGGT 6180 

AAACCTGCAG CCGCTTCTTT ATCAAGAATA ACATTTACCA TTCTATGTGC TTTTAAATCG 6240 

GCTGGTTCGA AGCTTGTTTT ACCATTTTCT TGATATAATT TTTCAACTAC ATCTCGTTTA 6300 

TTAGCACCTG TCACTACTAA GAAAATTTCT CTTGCTTCCA TTAGTCCTTG ACGAATACTA 6360 

ACATTTAACT TACCTTGCTC ATCGATAGAA ACAACTTGTA ATGTTAATTT CCCTTTATTT 6420 

TCTTTAGTTT TAATCTTATC AGCGATTAAT TCGATTGCAT CTTTTTCATA AGCAATTGGA 6480

TAAACTTGAC CTGCTGGTAC ACCTAACGCT TCGAAATATG ATTTTTTATC GTCATAATCT 6540

AAAATATTTA TTTGGCTAAA ATCAACAGCA TGTTTTTCAA CATTTTTCTT TAATTCATCT 6600 

AGAACTGGCG CTTGATCTGT ATCTAAATGA AAACCTGCAA TTGTAGTAGG ATTATTGTTA 6660

AATTGCTTTC TAATAATATC AGCAGCATAT TCTGCTACAA GTTGACTATT GTCAAAGACT 6720

GATATTAATA ATTATTATAC CCTAACTTTC AATATATCAA ACCATTTAAC TTTAACATGC 6840
TTATACTCTA AATATAGCAC TTAAGCATCA TTTTTATAAT GAAAATGAGT AAATTTTAAT 6900
TCAATCCCGG rAAATCTTGT TGACGTAACG CTTCATAAAT TAACAACGCA GCAGTATTTG 6960
ATAAATTTAA TGAACGAATA TGTTCACTCA TAGGAATTCT TAACGCTGTG TCTTGATATT 7020
TCTCTTTCAC CCAGTCTGGT AATCCTGTCG AAAAATGAAG TAAAAATCTT 7080
TGTCATGATT TGAAAAATCA AAATCACTAT AAGTCTTTTT ACCAAATTTT GTTAATAAGT 7140
AATACTCGCC ATTTGTGACT TCAAAAAATG CTTCAATACT ATCATGATAC GTAATATTCA 7200
CAAATTCCCA ATAATCTAAA CCGGCTCTTT TTAACATTTT ATCATCAGTT CTAAATCCAA 7260
GAGGTTTAAT TAAATGTAAA TGTGTGTTTG TACCTGCACA CGTACGgCAA TGTTACCAGT 7320
ATTAGCTGGG ATTTCTGGTT GATATAAAAC GATATGATTT GTCATATTAC TATTCTCTCC 7380
TTGTGTCTAA TCCTTTTATC ATTTCATTCT GAACTTCTGC ATCCTCTTGA TCATAATTAG 7440
CATTGATAAA ATCTCTTGCT TCTTCCCCAA GAATTTGACC AATGGCCCAA TAAGCAGTTG 7500
CTCGAATCAA CGGTCTTTCA TCTGTTGTTG CAACTTTTTT CAATTCTGGA ATTGCATCCA 7560
CTTCATTAAA ATGCGCCAAT GCTAAAATAG CATTTCGTTG TATCGGCTTT TTACCACGCC 7620
AAGCACCTGC AAGgTGACCA TATGTTTGTT TGAATTCTTT ATTAGACATA CGTAGtAAAG 7680
GTACTAATCT TGGCTTTAAA ATTTCTGGTT CCAAAATGAT GTCATCTTGT TCGGTATTAA 7740
TACCTCTATT TTTCGGACAA ACTTGTTGAC ACGTATCGCA ACCATATAAT CTATTCCCAA 7800
TTTTATAACG ATATTGGTCA GGCATATAGC CTTTTGTTTG CGTTAAAAAA CTAATGCATT 7860
TCTGACTATT TAATTGGCCA TTTCCAACTA ATGCACTTGT TGGACAACGA TCAACACAAA 7920
TTGTACAATG CACCACAGCT ATCTAATAAT GGATCATCAG GTTCC 7965



(2) INFORMATION FOR SEQ ID NO: 322: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1302 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 322:

GCCCTGTTGG AGAAATCACC TTTATACGAT GGTTGAAAAA GCATCATTAG GTACAATCGT 60
GGCAATTATA GTTATTTTGC TGTTTTTAAG AAACATTCGT ACGACGGCAA TTTCTATTAT 120
ATCGATTCCG TTATCACTTC TTATGGCGCT TATTGCTCTG AAATTGAGTG ATGTTTCATT 180
TGTAGTTGTT GAAAATATTT ATCGAGGCTT AACAGATTCA GAAGAACAAC TAAAAGGTGA 300
AAATTTAATT ATCAGTGCGA CAACTGAAGT ATTTAAACCA ATAATGTCAT CGACACTAGT 360
TACTATTATC GTCTTCTTAC CACTTGTGTT TGTATCAGGT TCAGTAGGCG AAATGTTTAG 420
ACCTTTTGCA TTGGCTATTG CATTTAGTTT ATTAGCATCG TTATTAGTGT CAATTACACT 480
CGTTCCAGCG TTGGCAGCTA CACTATTTAA AAAAGGCGTT AAACGTCGTA ATAAACAACA 540
TCAAGAAGGA TTAGGTGTTG TTAGTACAAC TTATAAAAAA GTATTACATT GGTCATTAAA 600
TCATAAGTGG ATTGTAATTA TATTAAGTAC ATTAATTTTG GTTGCAACTA TTGTATTTGG 660
AGGACCGAGA CTAGGCACTA GCTTTATTTC AGCAGGTGAC GATAAATTTT TAGCTATTAC 720
TTATACACCG AAGCCTGGTG AAACGGAGCA AGCAGTGTTG AATCATGCGA AAGATGTTGA 780
AAAATATTTA AAACAGAAAA AGCATGTAAA AACAATTCAA TACTCAGTTG GCGGTAGTAG 840
TCCAGTAGAT CCAACGGGTA GTACAAATAG TATGGCAATC ATGGTTGAAT ATGATAATGA 900
CACGCCTAAT TTTGATGTAG AAGCGGATAA GGTTATTAAA CATGCAGATG GCTTTAAACA 960
TCCTGGAGAG TGGAAAAATC AAGATTTAGG AACAGGTGCA GGTAATAAAT CTGTAGAGGT 1020
TACTGTAAAA GGTCCATCAA TGGATGCCAT AAAATCAACT GTAAAAGATA TTGAACAGAA 1080
AATGAAACAG GTTAAAGGAC TAGCCAATGT CAAATCTGAT TTATCGCAAA CATATGATCA 1140
GTATGAAATT AAAGTCGATC AAAATAAAGC GGCAGAAAAT GGTATTTCTG CAAGTCAACT 1200
TGCAATGCAC TTGAATGAAA ACTTACCAGA AAAAACAGTT ACGACTGTTA AAGAAAATGG 1260
TAAAACTGTT GATGTTAAAG TCAAACAAAA TAAGCAAACA GC 1302



(2) INFORMATION FOR SEQ ID NO: 323:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 323:

ATATATATTA ATTTAAACGT GTTTCACATG TACCAGTGTT AATGACAGAT AATGCTGCGT 60
TTAAACCACC TTCAACAAGG TTTTGTACTG CTTCATCAGA GAAGAAAGCA ATATGTGGTG 120
TTACTAAAAT TCTTTCATGT TCGATTAACT CTAATAATGT TTTATCGTCA ATGTCTTTAT 180
TAGTCCAGTC ATTTGTGAAG TATGCTGCTT CATTTTCATA AGTATCAATC GCAGCACCTA 240
ACAAAGTACC ATCGTTCACT GCAGCGATTA AATCAGGTGT ATTGATGACT GCACCACGTG 300
GATAGCTTTC TTTGTTCGCT GGAACATGTA AAGAAATAAT ATCGGCATCT TTAATAGCTT 420
CTTTAACACT ATCTTTATAA GTTAAAAAGT CTAAATCTTT ATTAGGATAG GCGTCATAAG 480
CTGTAATTGT AGCACCAAAT CCTGCATATA TtTTAGCTGT AGCAGCACCG ATACGACCCG 540
TACCAATAAT TGCAACAGTC ATATTTTTAA CTGGTTTAGA CATGATTTCT GCTTGCCAAG 600
TAAAATCATG TGCTTGTACA CGGCGTTCAA TATCTGGGAA GCGACGCACT AATTGTAGGG 660
CGATAGArAC AGAATACTCT GCAATtGTTT CAGGTGAATA ACTAGGAACG TTAGATATCA 720
CAATATTGTG TTTTTTAGCT AAATCTAAAT CATACATATC AAATCCAGCA GTACGTTGTG 780
CAATTTGTTT AATACCGTAA GATTCTAATT TAGGATAAAC GTCATTTTCT AACTTACCAA 840
ATTGCATTGT AGTTACGCCA TCGTAATCTT TTAATTGAtC GACTGTAGCA CTTGATAATA 900
GCTCTTTAGA AGTAGTTACT TCGACATTAT TCTtTtTCCC CCAATTTAAT GCCATCTCTT 960
TCTCATAATC ACGCGTaCCA AAGAACATAA TTCTCGTCAT TAT 1003

(2) INFORMATION FOR SEQ ID NO: 324:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5030 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 324:

GGCTTTTATA GTGTTTACGA TTATTATGAT TACTTATATT CACGCTGTTA AAAATTATCC 60
AAATAACCGT ACTGTTCATT ATGGTTATAC AGCTGCATTT ATACTTGTTA TATTACAAGT 120
TATCACAGGT GCATTATCTA TTATGACAAA TGTTAACCTG ATAATCGCAC TTTTCCATGC 180
ATTATTTATC ACTTATTTAT TTGGTATGAC AACATACTTT ATCATGCTAA TGTTACGATC 240
AGTAAGAAGT GACArGCAAT AACaAAAAAG cmAACCGTAA TTTTAATGGC ACGCCCATTA 300
AAATTACGGT nTTTTATATC AATATTTAAA AtTAAACCtA AGCCATGTAA AAACGAGATT 360
ACACGTCAAT TGTTGTGTAA TCTCGTTTTA TnTTAATCAT TTTAGTCAGT TGCTTTTTCA 420
ATTTCGATTA ATAAATCGCC TGTCGCTATT GTGTCACCAT TATTTACAGT TACTTGTTTA 480
ATCACACCGT CAAATGGTGC TTGAATTGTT GTTTCCATTT TCATAGCTTC AGTAATTAGC 540
AACGGCTGAT TAGCTTTCAC AGTTTCACCT ACACTAACCT TGACTTCAGT TACTGAACCT 600
GGCATTTGAG CACCGATATG ACTTGGATTA CTCTTATCTG CTTTTGGCTT AACGTTCGCA 660
TTTGTATGCA CATTTTCATC TTTAATGTAA ATACGTCTCG CTTGACCATT CATCGCATAG 720
CGTTTACCTT TATCGATTTC GATTTCTACT GTTTCACCAT TACGCATTCC AAAGAAGAAT 840

GTAGGCGTAT CAAGTAACGA TAAGTTTCCG TATTGATTTC TAGTTTGAAT ATATTGTTCA 900 

TATACTTTTG GATATAGTAC ATAACTAATA ATATCTTGCT CCGTAACAGG ACCTTGTTGC 960

TCTTCTTCAA GCAACTCACG GACTTTTTCA AAATCAACTG GCTCTAGATA TTCACCTGGA 1020 

CGAGCTGTTA GTGCTTCTTG GCCTTTTAAA ATAACCGCTT GTAAATCTTT ATTAAAACCA 1080 

TTTACAGGTT GTCCTATTTC ACCTTTGAAG AACGACACTA CTGATTCTGG GAAATCTAAT 1140 

TTATAGCCAT CTGTAATCAC GGATTGTTCA TCAAGATCAT TTTGTACCAT ATAAAGTGCC 1200 

ATATCACCAA CTACTTTAGA CGATGGTGTT ACTTTTACGA TATCACCAAA TAAGAAATTC 1260

ACTCTGCGAT ACATATCTTT GACTTCATCA AATCTTTCGC CTAAACCTAA ACTTTTAGCT 1320

TGTTGACTTA AATTCGAATA CTGTCCACCA GGCATTTCAT GTTGATAAAT TTCAGTATTC 1380 

GGTGATTTGA TATCACTTTc AAAGTCTGAA TAATAAGTAC GTACAGTTGA CCAATAATGA 1440

CTAAGTGACT CCATACCTTC AATATCAGTT CTAAGGTGGC GTGGGAAGCC ATTTAATGCA 1500 

TAATATAACG AATTGGCGCT TGGCTGACTT GTTAAACCAC TCATTGAAGC AACAGCAGTA 1560 

TCAATGATAT CGACACCAGC ATCTATTGCT TGTTTGTATG TTAATAAACC ATTACCACTT 1620 

GTATCATGAG TGTGAAGATG AATTGGTAAA TCTACAGCTG ATTTTAACTC ACCAATCAAT 1680 

TCGTAAGCGG CTTTAGGTTT TAATAAGCCT GCCATATCTT TAATCGCTAA AATATGGAAA 1740 

CCTTCACGTT CTAACTCTTT AGCTAGTTTG ACATAATACT CTAAAGTATA AATGTTTGAT 1800 

CGCTCAGGAT TTAAAATGTC ACCTGTATAA CAAATAGTAC CTTCTGAGAT TTTGCCCGCT 1860

TCTTGTACTG CTTCATTGGC AACTTTCATT TGATCTACCC AGTTTAATGA ATCGAAAATT 1920

CTAAAGACAT CTATGCCTGC TTTAGCACTT TCTTGTACGA ATTTATGAAT AACATTATCA 1980 

GGATAGTTTT TATAACCAAC TGCGTTTGAA GCACGTAACA ACATTTGGAA TAATACATTT 2040 

GGAATAGCTT TACGTAGACG TTCAAGTCGT TCCCATGGGT TTTCCTTCAA GAAATTATAT 2100 

GCCACATCAA ATGTAGCACC GCCCCACATT TCTAGTGAGA AACCATCTTT AAATACGTCC 2160 

GCTGTTTTGG ATGCGATATT AATCATATCC TTAGTTCTAA CTCGTGTAGC TAATAATGAT 2220 

TGGTGTGCAT CTCTAAAGGT TGTATCTGTT AGTAAGACAT CATCCTGCTT TTTAACCCAT 2280 

TCAGCTACAC CTTTTGGACC TACTTCATCA AGCAATTGTT TCGTACCACT AAATGAAGCG 2340 

ATTTTACTTG AAGATACAGT TGGAATTGAT GCTAATTCAT AGTCTGGTTT CGGACGTTTC 2400

TCAACATTTG GGAAACCATT AATTGTTACA TTACCTATAT ATTCTAATGT TTTAGTACCT 2460 

CTATCTAGAG ACGGCTGAAT GTCGAAAAGT TCTGGTGTTT CTTCAATAAA TTTAGTTGTG 2520 

TTAACACCAC GAATACGCAT TTCTCGTAAT GAGCGTACCA TTTTTTCTTC TGCTTGTTTA 2640

AATGATATCG CGTGTGTAGA TAATTTTACG AGTAATGAAT CATAATAAGG TGATATCTCA 2700 

GCACCTTGGA AACCATCTCC AGCATCAAGA CGTACACCAA AGCCCCCGCT TGAACGATAA 2760 

GCAATGATTG TTCCAGTATC CGGCATGAAA TCATTTAACG GATCTTCTGT TGTAATACGA 2820 

CATTGGATGG CATAGCCTAA TGTTGTAATA TCTTTTTGTT GCGGCATATT AATCTCTTCA 2880 

CCAAATAAAT CGGCACCTGC TGCAACTAAA ATTTGTGTCT TAACAATATC AATTCCTGTT 2940 

ACCATCTCTG TAATTGTATG CTCTACTTGT ACACGAGGGT TAACTTCTAT AAAGAAGAAT 3000 

TCGTCACCAG ATACTAGAAA TTCAACAGTA CCTGCATTGA CATATTTAAT ATTTTCCATC 3060 

AATTGAATTG CAGCATCACA AATACGTTGA CGTAATGTTG ATGATAATCC AACTGATGGT 3120 

GCAACTTCTA CAACTTTTTG ATGACGACGT TGTACTGAAC AATCACGTTC AAATAAGTGT 3180 

ACGATATTTC CATGTTCGTC ACCTATGACT TGTACTTCAA TATGCTTTGG ATTATCAATG 3240

TATCTTTCTA TGTAAACTTC ACTATTACCA AATGATTTTT CAGCTTCTGA TTTTGCTCTA 3300

TGGAAAGCAT CTTCTAATTC ACTTTCTTCA CGAACGATTC TCATACCTTT ACCGCCGCCA 3360 

CCACTTGTGG CTTTAATCAT TAGCGGGAAA CCAGCTTCTT CTGCAAATTC TTTTGCTAAT 3420

TCATATGATT TAATTGGACC GTCTGTACCA GGAATAACTG GTAAATCTGC CTTGATAGCC 3480

GTTGTACGAG CTTTAACTTT ATCTCCAAAC ATATCTAAAT GTTCTAAATG AGGACCAATA 3540

AATTTAATTC CTTCTTCTGC ACAACGACGC GCAAATTGTT CATTTTCACT TAAAAATCCA 3600 

TAGCCAGGAT GAATCGCATC CACATTCGCT TGTTTTGCTA CATCAATGAT ACGCTCAATA 3660 

TTTAAATAAC TTTCAGCAGG ACCTAAATCA CTTCCAACTA AATAGGATTC ATCTGCTTTA 3720

TATCTATGTA ATGAACTTTT GTCTTCATTC GAATAAATTG CAACTGTGCT GATGTCTAAT 3780 

TCTGCCGCCG CTCTGAATAT ACGAATTGCA ATTTCTCCAC GGTTAGCAAC AAGTAACTTT 3840 

TTTATTTGTT TCAATAGCGA TACACTCCTC AAACTATTAG AATTTTCTAA CTAATTAGAT 3900 

AATAAAATTT TATCTTAAAG CGCTCTGTTT TGCTATAGTt mTGTTTCmAA TTTTCAAAaT 3960 

TTaACATyCT tGAGACAATT AAAaCCyCCG CTTCmGaAAT AATAATTTCA AAAATGACTA 4020 

TGCAACAACA GGTAGTTCCA CGTTTTTGTT GTGAAACATT TTCGATTTCT ACAACTCTAA 4080

AAAATTAAAA ATAAAATTGC AAAACATCAA CATTTATTAT CAATAGCGAT AACTTTATCT 4140 

TATCATCATG ATTCTAATTT CGCCACCACA TTTAGTAATT TTTAGTCATA AAATTTAGTT 4200

ATAATTATAC GTTGTTTTGT TTATAAAATT TGATAATCak GAGTAATCtC GTAATATCAA 4260 

AACaAAAAGG AAGTTAAGCG TTGTTTGGTT GcCTAACTTC CGTTATTGAA CTCATCcAGT 4320 

TCTCGTACTA AATATTGGCT AGTATTTTTT TAATTAAATT GTCTTCTTAT ATCAACTTTT 4440
TGTTGTTGTT TCTTTCGTTG CTGGTCTACT TTGATTTGTT TACCTACAAT CAGAAGTAAA 4500
CCCATAGCAA TACTTAAACT AATCATTGAT GATCCACCAA AGCtGATAAA TGGCAATGGC 4560
ATGGAATTGT TGCCGAAATA CCGCCAATGT TTACAAACGT TTGACTTCCA 4620
AAGTATGTGG CAATCCCAAC ACACACAAGT TTATAAAAAT ATGATGATGT TTTATTTGCA 4680
AACTGGAAGG CACGATATAC AATAAAGAAC TCTAAAGTAA TAACTAGCAA TCCTCCGATT 4740
AAACCTAATT CTTCGCAAAT AATTGCAAAA ATAAAATCTG TATGTGGTTC TGGTAAATAG 4800
CCCAATTTCA TTGCACTATT TCCTAATCCT TTTCCAAATA CGCCACCGTT ACCTATCGCA 4860
AGCAATGAAT TGGAAATATG GTATCCAGTT CCTGATTCGA ATTGGAATGG ATCTGTTAGC 4920
GTACTAAATC TGGCAGTTAA ATAACTTGGT AACCAACCAG CCATTAATGC AATGACAAAT 4980
ACTACTAAGA ATCCTAGCAC TGCTGGTATA CCAAATCTTA GGACTTTGTT 5030




(2) INFORMATION FOR SEQ ID NO: 325:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1389 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 325:








CTTGTTAATC CGAAAATTAT TAGTCAATCA AATGAAACAA TAACAGACTT nGAAGGTTCA 60
ATTACATTGC CAGATGTTTA CGGCGAAGTG ACAAGAAGTA AAATGATAGT TGTCGAAAGT 120
TATGACGTCA ATGGGAACAA AGTTGaACTA ACTGCACATG aAGATGTAGC AAGAATGATT 180
TTGCATATTA TAGATCAAAT GAACGGTaTC CCTTTTACAG AACGTGCGGA CCGTATTTTA 240
ACAGATAAAG AAGTGGAGGC ATATTTTATA AATGACTAAA ATAATATTTA TGGGTACACC 300
AGACTTTTCA ACAACTGTTT TAGAAATGCT TATTGCAGAA CATGATGTCA TTGCAGTCGT 360
AACGCAACCA GATCGACCTG TTGGACGTAA ACGTGTTATG ACACCACCAC CAGTTAAAAA 420
AGTTGCAATG AAATATGATT TACCTGTATA TCAACCTGAA AAATTAAGTG GATCAGAAGA 480
ATTAGAACAA TTGCTTCAAT TAGATGTAGA TTTAATTGTA ACTGCTGCTT TTGGACAATT 540
ATTACCTGAA TCATTGTTGG CATTACCAAA TCTTGGGGCA ATTAATGTAC ATGCATCATT 600
GTTACCGAAG TATAGAGGTG GTGCACCAAT TCATCAGGCA ATTATCGATG GTGAACAAGA 660
AACCGGCATA ACAATTATGT ATATGGTTAA AAAATTAGAT GCGGGTAATA TTATTTCGCA 720
ATTAGGGGCA GATTTATTAA AAGAAACTTT ACCATCTATT ATAGAGGGCA CAAATGAAAG 840

CGTACCTCAA GATGATACGC AAGCAACATT TGCTTCCAAT ATTCGACGCG AAGATGAGCG 900 

AATTAGCTGG AATAAACCAG GAAGACAAGT GTTTAATCAA ATTCGTGGAT TATCACCATG 960 

GCCAGTTGCT TATACAACTA TGGATGACAC TAACTTGAAA ATATACGATG CTGAACTCGT 1020 

TGAGACTAAT AAGATAAACG AGCCTGGAAC CATTATAGAA ACGACTAAAA AAGCCATTAT 1080

TGTTGCTACA AATGATAATG AAGCTGTTGC AATTAAAGAT ATGCAATTAG CTGGGAAAAA 1140 

GAGAATGTTA GCTGCCAATT ATTTAAGTGG TGCGCAAAAC ACACTAGTAG GGAAGAAACT 1200 

TATATGATAG AAAACGTGAG AAGTCTTGCT TTTGACACGA TTCAAGATAT ATTAAATGAA 1260 

GGTGCGTATA GTAACTTGCG TATCAATGAA GTGTTGTCAG AAAATGAATT AAATGCAATG 1320 

GATAAGGCTT TATTTACAGA AATTGTCTAC GGAACCGTTA AAAGAAAATA TACGTTAGAT 1380 

TTTTATTTA 1389
(2) INFORMATION FOR SEQ ID NO: 326: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2746 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 326:

TTTGCTAATA ACAATAAAGC CAAAGCCGAT TCACACTCTA AACAGCTAGA AATTAATGTT 60 

AAGAGTGACA AAGTACCTCA AAAAGTAAAA GATCTAGCAC AACAACAATT TGCTGGTTAT 120

GCTAAAGCAT TAGATAAACA AAGTAATGCA AAAACTGGTA AATATGAACT TGGCGAaCTT 180 

TTaAAATTTA TAAATTTAAT GGTGAAGAAG ATAATAGTTA CTATTATCCA GTTATAAAAG 240

ACGGTAAAAT TGTTTATACT TTAACACTTA GTCCTAAAAA TAAAGATGAT TTAAACAAAT 300 

CAAAAGAAGA CATGAATTAC AGTGTTAAAA TTTCAAACTT CATCGCTAAA GATTTAGACC 360 

AAATTAAAGA TAAAnATTCA AATATCACTG TTCTTACTGA TGrAAAAGGG kTTTATTTTG 420 

aAGAAGATGG cmAAGTTAGA TTAGTAAAAG CTACGCCTCT ACCTGGTAAT GTAAAAGAnA 480 

AAGAAAGTGC TAAAACAGTT TCAGCAAAAT TGAaACAAGA GTTaAAAAAT ACAGTAACAC 540 

CTACTAAAGT TGAAGAAAAC GAaGCGATrC AAGAAGATCA AGTTCAATAT GAAAATACAT 600 

TAAAAAACTT CAAAATTwGA GaACAACAAT TCGATAACTC ATGGTGTGCA GGATTCAGTA 660 

TGGCAGCATT ATTAAATGCa ACTAAAAATa CAGACACTTA TAATGCACAT GATATTATGC 720 

EP 0 786 519 A2 

AAATGATTGA ATACGGTAAA TCACAAGGCA GAGATATTCA TTATCAAGAA GGCGTACCAT 840

CATATGAACA AGTTGATCAA CTTACAAAAG ATAATGTAGG AATTATGATC CTTGCACmAA 900 

GTGTATCTCA AAACCCTAAT GACCCACATT TAGGACATGC GCTAGCAGTT GTTGGTAATG 960

CTAAAATTAA TGACCAAGAA AAACTTATTT ACTGGAATCC TTGGGATACA GAaTTATCAA 1020 

TCCAAGATGC AGATTCAAGC CTATTACATT TATCATTCAA TCGTGATTAT AACTGGTATG 1080 

GTTCAATGAT AGGTTACkAA AAAGTAATAT AGATATTGAT TAAAGGCAGG TAAAACTATG 1140 

TATCAACTAC AATTTATAAA TTTAGTTTAC GACACAACCA AACTCACACA TCTAGAACAA 1200 

ACCAATATCA ATTTATTCAT TGGTAATTGG AGTAATCATC AATTACAAAA ATCAATTTGT 1260

ATACGTCATG GCGATGATAC AAGTCACAAT CAATATCATA TTCTTTTTAT AGATACGGCA 1320 

CATCAACGCA TTAAATTTTC ATCTATTGAT AATGAAGAAA TCATTTATAT TCTTGATTAT 1380 

GATGATACAC AGCATATCCT CATGCAAACG TCATCCAAAC AAGGTATTGG CACTTCGCGA 1440

CCAATCGTTT ATGAGCGCTT AGTATAACTA ATTTAAATGA TTTCACTTCA TAAAGCGGGT 1500 

TGGCGAGAAT TCAATTTCTC ACCAGCTCGT TTTTTCATTG TAATAATAAT CTTTAACATT 1560 

TATTCTTTCT CTATTAATTT TTCTCAAACT ATCTTATCTT TATGATAATT AATTAAAATG 1620 

CCCTTTTAAA TTCTTATAAA ATAAAAAAGC CACCTATCGT CGCTAATAAA CGACGCAAGT 1680 

GACTTAATAT CATATTCAAA ATAACTTATG GGAATTTAGG GAATTGATCG AAGTCAGGAT 1740

CACGTTTTTC TTTAAACGCA TCACGGCCTT CTTTCGCTTC ATCAGTTGTG TAATAAAGCA 1800 

ATGTTGCATC CCCAGCCATT TGTTGTAAAC CAGCTAAACC ATCTGTGTCA GCATTCATAG 1860 

CTGCTTTAAG GAATCGTAAC GCTGTTGGTG AGTGTTTCAT AATCTCTTTA CACCATTGCA 1920

CAGTTTCATC TTCAACTTTC TCTAAAGGTA CCACTGTATT TACTAGACCC ATATCTAAAG 1980 

CTTCTTGTGC ATTGTATTGA CGACATAAGT ACCAAATTTC ACGTGCTTTC TTATGTCCAA 2040 

CGATACGTGC TAAATATCCT GAACCATAAC CCGCATCAAA TGAACCTACT TTAGGACCAG 2100

TTTGTCCAAA AATAGCATTA TCAGCAGCAA TCGTTAAGTC ACAAACAACA TTTAGTACAT 2160 

TACCGCCACC TACAGCATAA CCTTTTACCA TCGCGATAAC CGGTTTTGGA ATAATACGAA 2220 

TTAAACGCTG TAAATCTAAT ACATTTAAGC GAGGGATTTG GTCTTCACCT ACATAACCAC 2280 

CATGTCCACG TTTCTTCTGG TCACCACCAG AACAGAATGC TAAATCACCT TCACCAGTTA 2340 

ATACGATAAC TGAAACGTTT TGATCATCAC GTGCACGTGA AAATGCGTCA ATCATTTCAG 2400

CAACTGTTTT AGGTGTAAAC GCATTGCGTA CTTCAGGGCG ATTTATTGTT ACCTTAGCAA 2460 

TCCCTTCGTA AAATTCATAT TTGATTTCAT CATATTCTCT AAGTGTTTCC CATTGTCTGT 2520 

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TCCACATGAA TTGTATGACC TGTAGCAGAA ATTAATTTAC ATTTACTATT AGGAATTAAA 2640 

TTTGCCATTT TTTTCGCAAT CTGTACAAAT TTTTCATCAT ATTCTCCAGC TAATATTAAT 2700 

GTTGGTACTT TAATTTCTnT CAGCGCGGCC ATAAGTTTGG CATTTG 2746

(2) INFORMATION FOR SEQ ID NO: 327: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 900 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 327:





TAATGTTTAG TTTATTAACA GTAAGTTCGT ATATCAATGT TTAGTGCTCC CCAAAATTGA 60

AGTTTGAATT TTAAAAGCAT CTTGTAGAAT TTAGTTGTAT TTTTTTCAAA GAAATTCATT 120

TTGATTATTT TTGATAATGA GCATTTTAAT AGTAATACAT GTTTATAGTG TGTAGTATAT 180

GTCTATACTA GTAGTAACTA TATAGAGAAA GTAGGAATAA ACTATGTCAC AAGATGTAAA 240

TGAATTAAGT AAGCAACCAA CGCCAGATAA AGCAGAAGAT AACGCATTTT TCCCATCACC 300

ATATTCCCTT AGTCAATATA CAGCACCTAA AACAGATTTT            AACACAAAGG 360

TGCCTATAAA GATGGTAAAT GGAAAGTATT GATGATTGCT GCTGAAGAGC GATATGTATT 420

ATTGGAAAAT GGAAAAATGT TCTCTACGGG TAATCATCCT GTTGAAATGT TATTACCTTT 480

ACATCATTTA ATGGAAGCAG GTTTTGACGT TGATGTTGCG ACATTATCTG GTTATCCAGT 540

TAAATTAGAA TTATGGGCTA TGCCAACTGA AGACGAGGCA GTTATAAGTA CTTATAATAA 600

ATTGAAAGAA AAATTAAAAC AGCCAAAAAA ATTAGCAGAT GTGATTAAAA ATGAATTAGG 660

ACCTGATTCA GACTATTTAT CTGTCTTTAT CCCAGGCGGA CATGCTGCAG TTGTTGGTAT 720

TTCTGAAAGT GAGGACGTTC AACAAACATT AGATTGGGCA TTAGACAATG ACCGCTTTAT 780

AGTTACATTA TGTCATGGAC CAGCAGCACT ACTTTCAGCA GGGCTTAACA GAGAAAAATC 840

TCCATTAGAA GGATACTCTG TTTGTGTCTt CCCTGaCTCA TTAGATGAAG GTGCAAATAT 900



(2) INFORMATION FOR SEQ ID NO: 328: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ATCTGAnTnG AGATAGTGAT AATGTGTCAC 
CaTCaGCTAA CaCtTCaTTa TCtTTAGtCG 
5 AAAATGaTGa CtTTTCAACa CCGAACaCTA 
CAATTTCCCC TGTTTGCTTC AAAGTAGCTT 
TGTTAAAAAC AGATTGTTGC ACAGTTTGAT 

10 

CTTGCCACTT ATCAATGCCT TCTTTATTCA 
ATAAAAATAG CAAGTAACTA ATCATCGTTA 
1S TGTTGCGCCG TATTTCATTC CAAGCTAAAA 
TAAACATTGT TTATATTCTT AGATGCACGT 
TACACTTATC CTTCATTATA CCCGAACTTT 

20 AATTAATTGT CTATTATTTT. TGTACGGTAC 

t 

CTACATATAC CTTTAAGTAC ATGTAGACGT 
TTTAGGATCG AATTGCTTAA TAGCATTTTT 

25 

TTTATAGTTA TTGTTGTAAA TCGTAGCTTC 
GATATATTTC TTACCGAAAT CGTCTTTATG 
TGTAGCTTCT TTTGGATCTG TAAAGTCTAA 

30 

ATGATCCTTT TGAACCATTT GTGTGCGTGA 
TAAAATATGA CTCTTCTTAG CTTGAGGGAT 
35 CTTTAAATCT TTTTCTTTAT CTCTATCAGA 

AGCATATTTC AAGAATTCTT TTGCGCCATA 
TTCATGCCAA CCTTCTGGGA AAGGTTTATT 

40 

ATTATCTAAA ACTGTTTCAT CTAAATCCAA 
CTTTTTAATC TCTTTATCTA ACTGTGTCTT 
TTTTGCTTCA GCTGAATTTT GATACCAAGC 

45 

CGCCTTTTGT GATGCTGGTA TAGAAGCTTG 
GGCAAACGCT GTAGAATTTG TCGTTTGTGG 
50 TGCTATTGCA ATATACTTTG AAATTTTATT 

ATACAGTTAA AATTATAAAT GCATTTATTT 
TATTATTTCT TGgAAAAGCT AACAAATATG 

55 
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CcAATTTTaA AcCtTTATTT TTaAGcGtTT 60 

cTTTATGcCC tTCtATTAAA CtCGGaACtA 120 

AAACaTTGtC tTTTTGATGG CCaTTAGACA 180 

GCTTCTTGTA TTTATTTTCA ATATCTTTCT 24 0 

TGGCATCTTT ATTTAGAACA ATGGCATCTG 300 

TATTGATAAG ACCATTCGCC AATCCAGATA 360 

ACACACCAAT AATTAGTCCA AACTTCAATT 420 

ACATGCATTT CTCTCCCTAC TACTATGATT 4 80 

ACGTCGTGTT GCGCTCTGTA ATGTTATACA 540 

TTATATTAAA AGCAAATTTA TGGAAAATGC 600 

ATTTAAAATT AAGGATCAAT TTAAAAACGC 660 

CCAATTCATA TATTATTTAA CTTCGCCTGT 720 

ACGTAATTTA TCTTTTGCTT TrTCACTTGC 780 

CCAACTACCA TACATTGGGT TAGGGAAAAT 840 

TTTTTCAATT AATGCTTCAC GAGATTCAGC 900 

TAAATTATCT CCAAATAGCA TGACAAGTTT 960 

TTCTTTACTC TTATCATCTT TACCTTTTAG 1020 

ACCTTGTTGT TTTAAGTTCT TTTGTGTTGC 1080 

AATATAGTAG ATATCGACAC CTTTTTTGTC 1140 

GACAGGTTTA GCTTTAGCAG cTTGTACCCA 1200 

ATGTATTGAT GCATAGCCTT GATATGGAGA 1260 

AGCAATAGCT AACTTATGTT TACCTTTATT 1320 

TGCACTGTTA TAACCTTGTA AATATAATGC 13 80 

CACTGCCATA ATATTTTGAT TACCAAGATT 1440 

TTGCGTTTGT TGAACTTCAG CAGAACTTTT 1500 

TGCTGAAACT GTAACCGCTA CCGATAATGA 1560 

CATCTTATCA CCTCATGATT AATATTTAAA 1620 

AATATTGCTA TACTATGAAA AGATATTTAA 1680 

TGAACATTTC ATATAAGCAT GATTTAATGG 174 0 
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GTGACTAGGC CTTCCTATCA GACATATTCA CTCATCCACG TATCATTATG TGTACAGTGT I860 

GCTATCTCTT ATTTACCTAT TGGAACAACC ATAAACTCAT CCATAGTTTA CCTTTTATAA 1920 

ATAGCAGTCC TCACTCATAC AATTTCTCAT AAAAATCACA ACGCTCCAAC GTATTTCCAA 1980 

CTTACTTTCA CCTATTTTAA TTCATAAAAA CGACACTTTA ATTGTCATTA TCCAATAATA 2040

GCAAGACGTT ATTATTGCAA TCTTTTTTAT AAAATAATAG AATCATAGTA TTGTCATTTA 2100 

AAGATAAAGT AAGAACGTTT TTATTTTTCA GATTTTTTAA ATTATTATGA ATATCTAGTT 2160 

TTAGGAAGGA AATTACATTG AAAAAACAAG TTATTATTTC GGGCCTCATG TTATTTTCAC 2220 

TATTTTTTGG AGCCGGAAAT TTAATATTCC CGCCCATGCT TGGCCATACA GCGGGTCAAA 2280

ATATGTGGAT TGGTATGCTA GGCTTTGCCC TTACAGGCAT ATnACTCCCC TTTATTACTG 2340 

TTATTGTTGT TGCATTTTAT GATGAAGGTG TTGAAAGTGT AGGCAATCGT ATACATCCAT 2400 

GGTTCGGGTT TATTTTTGCT GTCGTGATTT ACATGTCTAT CGGAGCATTT TACGGTATTC 2460

CACGTGCTGC AAATGTCGCG TACGAAATTG GTACAAGACA CATTTTACCT GTGCATAACC 2520 

AATGGACTTT AATTATATTC GCAGCAATCT TTTTTGCCAT CGTTTACTGG ATTAGTTTAA 2580

ATCCATCGAA AATCGTTGAT AATTTAGGTA AATTATTAAC ACCGTTATTA CTATTAATGG 2640 

TCGCTCTATT AAGTATTGCT GTCATTTTCA ACCCTGAATC TGCACTAAGT GCACCTAAGG 2700 

ATAAATATAT AACACATCCT TTCATTTCAG GAAGTTTGGA AGGCTATTTT ACAATGGATC 2760 

TTGTTGCTGC GTTAGCTTTT TCCGTAGTCA TTGTCAATGG CTATAAGTTT AAAGGCCTCA 2820 

CAGATCGCAT GAAAATTTTA AAATATGTCT GCTTTTCAGG TCTTATTGCA GCCATATTAC 2880 

TTGGAATGAT TTACTTTGCA CTTGCATACG TTGGGGCATC AACAGCTCCA GGAAACTTTA 2940

AAGATGGTAC AGATATATTG ACGTACAACT CATTACGATT ATTTGGTTCG TTCGGTAACC 3000 

TCGTATTTGG AATGACGGTT ATCCTTGCAT GCCTAACAAC ATGTATAGGA CTCGTcAATG 3060 

tTGCGCCACA TTTACTAAGA AACACGTACC TAAGTTTTCT TATAAAATAT TCGCACTTAT 3120 

TTTCtCTATC ATAGGGTTCT TATTTACAAC ACTTGGTTTA GAAATGATTT TAAAAATTGC 3180 

TGTCCCATTA TTGACTTTAA TATATCCCGT GTCGATTGCA CTTGTACTCA TATCATTTGC 3240

TAACATGTTT AGCACATTCA GATTCAGTTG GGCCTATCGA CTCGCAACTG TTATTACATT 3300 

GATTATTTCA ATTTTACAAA TACTAAATAG TTTCAACTTA TTACACGGTG TTATTTTGAA 3360 
ATCGTTTATG ATGTTACCTT TAGCAGATAT CGATTTAGCT TGGCTTGTAC CATTCATGCT 3420

CTTTGCTATT ATCGGTT7CA TAATCGATGT ATTTATACGC CGTCCGAAAC AAGCGACAAC 3480

TTAATAAATG CTCACTGCCT AGTAATGATT GACCCATCGT TACTAgGCTT TTttATATGA 3540
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TAAAAAATCC TAGCTGTTAT TCAAAAATAT TAGTTTTTAA AA 3642 
(2) INFORMATION FOR SEQ ID NO: 329: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 329:

TTATTGATAT TGAAAATTCA AAAACTGCAA CACATCTTAT CACTAAAAAC CAACACTATC 60

ATCAAACTGA CACTCATTTT GAACAGTATA AGAAATTTAT TTTAGATTCA GGTATATCAT 120 

CAACACAATT TGTATATAAT AACCTGTCTG TAAGCGGATT TAAATATACT AATGATGGTA 180 

AGAATCCAAT TCAATTATCT GACATAGTGT ATCACTTAAT CGCATTATTA CGATATGGCG 240

GTGGCATTAG CTATCAATTA TTAGATGACC ATTCAAATTA TATTTCCTTG TACAACAAAT 300

ATGGTAGCCC CCTACCATTA ATGCATTTAT ATAAAATGTT TAGACCTTTT GTTAATGAAG 360 

ATATTGAAAT TACAAATAAT TATGTATTGA GTCGTAAAGA TAATAACTAC CATTTCTTAT 420 

TATTCAATAA AATTAATGAT CGATATATGT CAGACGTAAA ACAAGATTTC ATTTTCCATA 480

ATGAATTACC TCAAGACTCT TTGATGATTA TTAAAACATT GAATCATGAA CATGGTTCAA 540

TTCAACATTT GCTTCCAATA AGCGATCAAC TTGTTTATAT AGAAAAAGAA ATTTTAGATG 600 

AATTAGACAA AACGAATTAC CCTAAAACGG AGCTTGCAGT TCAAGAAGAA ACTGGTCGTA 660 

CATTTGAACT CAAGTTAAAT CACGACGAGG TTAAATATAT TTGCTTTAAA CCAAGCTAAA 720

TACTAACAGT CCTCTTGTGT TTAGTTTCTT ACGTTAAAGG CTATTTATAT CATAAGGAGA 780 
TGATTTGTAA TGAGTAACTC ACAAGCAATT CAAGCAATTG AAAACGTGTT AGTAACGTCA 840

AAAGTTGGTG TATTATCAAC TGCATATAAT AATAAnCCTA ATAGTAGATA TATGGTCTTT 900 

TATAATGATG GTCTTACTTT ATATACTAAA ACGAATATCC ATTCTGCTAA GGTCAAAGAA 960 

ATTAAAGATA ATCCAGCAGC ATATGTTTTG TTAGGCTATA ACGACACAAC TAATCGTAGT 1020 

TTCGTTGAGA TGGAAGCGAC AATCGAAATC GTTACAGAAC AAGAAGTGAT TGATTGGCTA 1080 

TGGGAAACAC AAGACAAAAG CTTTTTCAGT TCAAAAGAAG ACCCAGAGTT ATGTGTTTTA 1140 

AGAGTAGTTC CGCAATCCAT TAAGCTAATG AATGATAAAT CATTAGATAC ACCTATCAAA 1200

ATCGATTTAT AACACAAAGT GTATATAGGA AATAACTTTT ATGAATTCTA GATATAACAA 1260 

TGTTAAATAC TTAAAATAAC TCGCTATAAT TAAAGTGTTT AATATGTTTA CAATTCAATT 1320

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TATAAATACT GCATTTGCAA ACTGTTGCAC TTTTAGGTAT 1440 

AGGAGATTGA TGAACCATGA AAAAGAAAAA AGGTTTTGGT 1500 

CATCATGTTA ATTGTATGTA TTGTATTAGT AATCATGATG 1560 

TACATACTAT GGAATTATGA AAGATAATAC TACTATTGAA 1620 

TGAAAGTATT GAAAAAAATG TTAAATTACC TTCAGATTCA 1680

TGATTTTGTA ATTGTTTATA AATTAGCAGA TTCAGATAAA 1740 

TGACCATGAC GATGTACCAC ATGGTTTAAT GATGAAAATT 1800

CATGAAACAC TAATTGTAAT TTAAATTACA AATTTTAGTT 1860

TAAAAGCAGC GGTAAGTTGA TTTCCAATTT GGAATCATTT 1920 

GAAATACTTT CATATTGAAT AGCTCCACTT GCCGTTOGCC 1980 

AAGCCCCTAA CAACCTGAGG TCACTGCGCT CCGGTTCGCC 2040

AAAGCCCCTA ACAACCTGAG GTCACTACGC TTCGGTTCGC 2100 

AAAAGTCCCT AACAACCTGA GGTCACTACG CTTCCGTTCG 2160

TAAAAGC 2187 

(2) INFORMATION FOR SEQ ID NO: 330: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1788 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 330:

CCnCCTTTTT AAACCTGGnG AAATGtmCAG tTTTGATGGt ATTGGGtTCT TTAGTATTAT 60

GCTTAAGATA GAGTGTAATA CGCTCCTGTT GTTCTTTATA GTATATTGCT TTTTGTTTTT 120 

CTTTTTCGTC ATATTTCACT TTTAAATAAA TGACTGATGC AACTATACAT ATACATAAAA 180 

TGACACCAAT AATTATAAAA ACATGTTTTT TGTTCATTAA AGTAACTCCT AAAATGTGGT 240 

GGAAAATGAA AAAATTTTTA TAATCTATAA TTATGAACAT TACAGATTAT AAACCATAAC 300 

ACTAACATCG TCGCCTTCAT TAACTTGATT GTTAAAATCA GCAATTACTG AGAATTGTAC 360 

AAGGAATAAT TGCCTATTAT GCCCTCATGT AATTATTGCC TTACTAACAT TAACAAAATC 420

GATAGCTATT ACATTAAATG CCTATACCCC AGACCTCAAA CACCTTTTTA TACAGGACGC 480

ACTGTAATTT CATyyACGTT AACGTGCTTT GGTTGTGTTA ATGCATATAA TACTGCTTCT 540



TTTAAATTAA TTTTATGTAA 
AACAGAATTA ACTACATTTA 
5 CTTGGTATTA GTTTAATCGC 

ATGACTGGCG GAAAGAAAGA 
AAAATGATTA GTGAAAAAGA 

10 

GATGTTAAAG TTAAAAAAGG 
ATTGTTAAAG TTAAAAAAGT 

15 CATGACATGG GCAAAATGCA 

GCCATCAAGG TATATACGAG 
TACTGCTGCT TTTTATATTT 

20 TgcgCTTTGC GCATGCATAA 
TGcgCTTTAG CGCATGCATA 
CTGCGCTTTA GCGCATGCAT 

25 

CCTG.CGCTTT AnCATGGCCA 
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GTATCTACCA TTCCTGGAGA AATGCTTGTT ACTTTAACGC CTGTCTTTGC CAACTCTTTT 660 

TCTAATCCTT GAGTAATAGT GTGAACTGCT GCTTTCGTCG CACTATAAAT CGTACTACTT 720 

TTCGTTACTT CAAAGCCAGA AATAGATGcA ATGTTAATAA GATGGCCACT TGATTGTTCT 780

AACATAGTTG GTAATGCAGC CTGTGCCGTA TATAAAGTGC CTTTGATATT CACATCAATC 840 

ATACTATCCC ACTCATCTAC TTGATAATCA GTAATCTTAG ACGACAACAT TTGCCCCGCA 900 

CTATTGATAA CAATATCCAA ACCACCGAAT GTTTGTTGTG CAATTTTTAT CAATTCATCG 960 

ACTTCTTCTT TATTCGTTAC ATCTGTTGGC ACTACCTTCA CACTATCTTG TGACAATTGA 1020 

TTCGCTACGT TTTGTAATTT TTCTTTATTT CTACCTGCTA AGACAACTTT TGCCCCTTCT 1080

TCATGTAGTA ATGTTGCAAT TGCTTCTCCA ATACCACTAC CTGCACCTGT AACTACTGCT 1140

ACTTTATCTG TTAATACTGT CATAATGATC GACTCCTTTG ATTCTTTTTA TTTTTTCAGG 1200 

GTAAATCATA AATACATATT ACTTTTAAAA AGCGTATCAC AATTCATATA ACGGTCATAA 1260

TAACTCGCTT CATTTTCATA GATAAATTAC ATTACAAGCC ATTCGAAACA TACAATTAAT 1320 

CGTTGCTTAT ATTTTTTATT TTTAAAAATG TTGAAAAATC GTCACTTCTT TATTGTAAAA 1380 

ACATTATATT AGTAATAAAG TTAATACTGT GnATTTaTCA TTCGATTGAA TGATTAGAGG 1440

GAGGAATAAA ACgTGACATA TCATGAGCGT GTTTTAGCAT TAAGAGCAGA AAGTAAAAGA 1500 

ACCGCATTTG ATTTTCGATT CGAAGATTTA TTTAGCAAAG AAGAATGGcT AAGTATGTCT 1560

CTTGCAGAAA GACAAAAAGC TGAAAAAGCA TTTCGACACG AgTTAAAAAT ATGGACGATG 1620 

TAAGAATGCC CTTCTCAAGT GTCCATGACG CCCAAGTAAA ATTATATAAT GTTGTATATT 1680 

CTTATAACGG CaTTAAACGT AATTTTAAAC AAGTTGAAAA TGGAAGGATT CTAATATCAT 1740

TTCGTTTATA TATnGCAGAC CATGGATAGA ATTTTnTATG GTnAATCC 1788 
(2) INFORMATION FOR SEQ ID NO: 331: 
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<i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 331: 
TAAGCCAAAT CCATTGTTAC CAGAAGAAGT TCGCTTCATC TTAAAACAAA TGGGTCTTAA 60

AGAAAAGACT ATCGATGTTG AACTCGAAGT TGGCGAGCAA GTTCGTATTA AATCAGGTCC 120 
ATTTGCGAAT CAAGTTGGTG AAGTTCAAGA AATTGAAACA GaTAAGTTTA AGCTAACAGT 180 

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GCTTTAATTA ACAATTAAAG TTATTAAACT AACCAAAAGA TAAAAAAGAG TATTGATTTT 300 

TTAATTAGAA AAGTGTTAAA ATTATGTGGT cGcGCTTTTA GAGCGCCCAT TTCGTcACGA 360 

AATGTTAAGA GTGGGAGGGC AAAACTGAGC CCTGTGACCA CATCACGATA TCAAGGAGGT 420 

GCACATCGTG GCTAAAAAAG TAGATAAAGT TGTTAAATTA CAAATTCCTG CAGGTAAAGC 480 

GAATCCAGCA CCACCAGTTG GTCCAGCATT AGGTCAAGCA GGTGTGAACA TCATGGGATT 540 

CTGTAAAGAG TTCAATGCAC GTACTCAAGA TCAAGCAGGT TTAATTATTC CGGTAGAAAT 600 

CAGTGTTTAT GAAGATCGTT CATTTACATT TATTACaAAA ACTCCACCGG CTCCaGTATT 660 

ACTTAAAAAA GCAGCTGGTA TTGAAAAAGG TTCAGGCGAA CCAAACAAAA CTAAAGTTGC 720

TACAGTAACT AAAGATCAAG TACGCGAAAT TGCTAACAGC AAAATGCAAG ACTTAAACGC 780 

TGCTGACGAA GAAGCAGCTA TGCGTATTAT CGAAGGTACT GCACGTAGTA TGGGTATCGT 840 

TGTAGAATAA TTTTACGAAT ATTAAATTTG ATTACATGAT TTAAACGATG AAGCAGATAA 900 

CAGAGATAAT AATGATGAAT TATAAATATA ATCTGAATGA CTAGATTAAT GATTGATTTA 960 

TTCATAAGAT TAATTCTTCT GTTGTCTGCy CTTAACTTGC ATATAGCAAG TAATGTGGGA 1020 

GGAAATTCCG CTAAAACCAC TAAAGGAGGA ACTATAAATG GCTAAAAAAG GTAAAAAGTA 1080 

TCAAGAAGCA GCTAGTAAAG TTGACCGTAC TCAGCACTAC AGTGTTGAAG AAGCAATTAA 1140 

ATTAGCTAAA GAAACAAGCA TTGCTAACTT TGACGCTTCT GTTGAAGTTG CATTCCGTTT 1200 

AGGAATTGAT ACACGTAAAA ATGACCAACA AATCCGTGGT GCAGTTGTAT TACCAAACGG 1260 

AACTGGTAAA TCACAAAGTG TATTAGTATT CGCTAAAGGT GACAAAATTG CTGAAGCTGA 1320 

AGCAGCAGGT GCTGACTATG T 1341

(2) INFORMATION FOR SEQ ID NO: 332: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5136 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 332:

CTCTAAATCT TCAATAGGTA ACTTCATTTT AATAATTCCC ATTTGAACAA TTGCTTCTTC 60 

ATCATTGACA ATTTCTAAAA CTTCACCCTT TTGACCGTAA GAkAATACTT TkACTTCATC 120

ACCAGCAACA ATTTTATCGT ATTTTTGCTT TTGTACATTT TGCTTTATAG ATTTCGCTTC 180

ATAATGATCA TCTAATCGTT TCTTCTTATC AATCAATTCA TGTTCTTTAA CATCTGCACC 240 

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CTTAATCTTC TGATTTGCTT TkTCTTTCGC TTCCTCTATT AGAGACTTTT CATAATTTTG 360 

GAATTGTTGA TACTGCTTAG ATAAATCATC ATGCACTTGt CCGCkTskTT TACAAGACGG 420 

TCCAGkTCTA ACCTCTGTGT CTCTACACGT TTGTAATTAC GCTCTAATGA TTCAATCATT 480

TCATTTATTT CTTTTTCATC AGTACCAATC ATCGTCTTAG CCTTATTAAT AATATTCAAA 540 

CTAAGACCTA ACTTTTTAGA AATGTCAAAA GCATTTGAAC GACCCGGCAC ACCCATTAAT 600 

AACTTATACG TTGGACTCAA AGTATCTACA TCAAATTCTA CACTCGCATT CATAACGCCT 660 

TCTCGATTAT AACTATATGC TTTAAGTTCA GGATAGTGCG TCGTTGCCAT TACTAGAGAA 720 

CCAATTTTTC TAACATGATC TAAAATGCTC ATTGCTAATG CAGCACCTTC ACTCGGATCT 780

GTACCTGCAC CTAATTCATC AAATAAAACT AAACTATGTT TGTCTGCATG CTTTAAAATT 840 

TCAACTATAT TCGTCATATG AGATGAAAAA GTTGATAATG ATTGTTCTAT TGATTGTTCA 900

TCTCCGATAT CGCAATATAC ATTTTTAAAT ACACTTAACT GACTACCATC AAGTGTGGGA 960

ATCAACAATC CTGATTGAGC CATAACAATA ATTAAACCTA ATGTTTTTAA TGTTACAGTT 1020 

TTACCACCTG TATTCGGTCC TGTAATAATT ACCGTTTCAA TATCTTCCAT AAATTCGATG 1080 

GTATTAGCTA CAACAGTCTC ACGATTTAAT AATGGATGGT ATGCTTTAGG TAAATATACA 1140

GTACGGTCCT CTTTAAATAT CGGCTTTGTT CCTTTAATAC TTCTACTATA TCTCGCTTTT 1200

GCGATTAAAA AATCTAACTG ACCCATGACT TGTTCTGCCA CAAGTAGTGC ATCTTTGTCC 1260 

GCAGCCACAT AACCAGTTAG TTGCGTTAAA ATGCGTTCTT TTTCAATTGC TTCGTCATGA 1320 

CGTAATCGAC TAATTTGATT ATTCATTTCA ACAACTGATG ATGGCTCAAT ATACAATGTT 1380

TGTCCTGAAG CAGATTGATC ATGTACAATC CCATTAAAAT CTTGTCGATA TTCAGCTTTG 1440

ACAGGTATAA CGTTTCTTTC ATTCCTAACT GTTACAATAG CATCTGATAA TTTTTTCTGA 1500 

TTTGCTTGGC TTTTAACAAT ACGGTCCAAA TTTTGTCTAA TACGTTGATT CGTGCTAGAA 1560 

ATTTTACTTC TAATCCCTTG CAATTCATAA CTCGCATTAT CATATAAATC ATACGTATCG 1620 

CATGTTTCAT TTATTTGTTG AAAAAGATCA GTTAACACAG GTAATTGATT CATCTTGTCA 1680 

TCTAATATTG GGTATTTAAC ACCTTCATCT TCTTCAACCA ATTGATTATA AAATGTCTTG 1740

AATTGATTTT GTACTTGAAT TAATCTTTTT ATCAAGTTAA GCTCTGATAC ATTTAAAACG 1800

CCGCCAATAT CAGCGCGATG AATGAATGCT GATACTTTAG ATAAGCCACT CAAGCTTGGT 1860

AAACGATGCT TATTATAGAT TTGAGCAATC TCATCCGTTT CTTCCATTTG AAAAACAACC 1920

GTTTCAAAAT TAGTAGCTGG CATCATTTGA TTGACCTTTT CCAAGCCTAA GTCACTAATA 1980

GTTTCATTGG CAACGAGTGA TTTTATTTTT TCAAATTCTA AGACGTCTAA TGTTTTTTGT 2040
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TCACGCGATA ATGCGTTAAT CACTCTATCT TTTGTtACAA ATC CTTTTT G CGCAGTTGtA 2160 

CGCCATAATT CATAAAATCT AAATGATTTG TATGATGCGC ATCAGTGTTA ATAGTTAATT 2220 

TCACATTTGG ATATTTACGA ACGATATCAG CGCTCAGATC CAGTCGATGT GGATTGGCAT 2280 

TAATTTCTAA TACTGTATTC GTTTCTTCAG CTAATGCCAT TAATTGTTCA ATATTCGGTT 2340

TATAACCATC TCTTCTACCT ATAATACGCC CTGTTGGATG CGCTATATGT CGCACGTATG 2400

GATTGCGACA TGCATTAGCT AATCGTTCCA TAATTTGTTC TTCTGATTGG TTAAAGCTTT 2460

GATGAATAGC TCCAATTACA TAATCAAGTT GTGCTAAAAT TTCATCATCA TAATCCAGCG 2520

AGCCATCAGG TAATATATCC ATTTCTGTAC CTGAATAAAT ATCAATTTCA CTATATTCTT 2580

TATCTAAAGC CTTAATTTCT TCGTTTTGTC TTAAAAGTCT TTCTACTTGT AAGCCATTAG 2640

CAACACGTAA ACTTTGTGAA TGATCAGTAA TTACCATGAA TTTATAACCT TTTGCGATAT 2700 

TTGCTTCTAC CATGTCTCGA ATAGAAAACG CACCATCACT ATACGTTGTA TGCATATGAA 2760 

TATCACCATT AATATCATCT ATTGTAATGA TATTACTTAG ATCTTTATCA AATTCGCTAC 2820 

CATCTTCTCG CATAGCAGGT GGTATAAAAT TCACATTAAA ATGTTCATAT ATCTTGGCTT 2880 

CACTATCATA TTGAATTAAT GTACCATCAG CTTGTTCAAT TCCATATTCA CTTACTTTTT 2940 

CATCACGTGC TTTAGCAAGT TGTCGAATTC TTATATTATG TTCTTTTGAC CCAGTAAAAT 3000 

GCTGCAATGT ATGATAAAAA GCACTTGGTT CAATTAATCG AAAATCGACA CCAATCGTTT 3060

CATCATCAfA CGCTAATTCT AATGAAACTT TTGTGTTCCC CACTGCAACT TCTTTTACTT 3120 

TATTGGGAAT ATTTAATAAT TGCTGCTGCA CTGCTTTTGG GTTATCGGTA CTTATTATGA 3180 

AATCTAAATC TTTGCTCATT TCTTTAAAAC GACGGAAGCT TCCTGCAGAT GAATATTGAT 3240

CGATATAATT TAATGTATCT ATATAATCAA TGATTTCTTG ATTAAGTCTT CTCATTTGAT 3300 

CAATTGGATA TCTATCTTTC TTAGCACCAA GTTGTTTCAC AGCTTCTAAT ATGTTTTGTT 3360 

CCGTTTTCTT AGCAAATCCG CTTAATTCAC TAACTTTTCC ATTTTCACAA GCAACTTGAA 3420 

GTGACGCTTT ATCAACAATA TTCAACTCTT TATATAGCTT AGCAATTTTC TTGCTTCCAA 3480 

GTCCTTGAAT TTTCAAAAGT GGAATAAGAC CTTCCGGAAC TTCTTCCTGT AATTGCTGTA 3540 

AATACTGAGA TTCACCGGTC TCACGGTAAT CATTGATTAC TTCTGCAACA CCTTTACCAA 3600 

TGCCTTTtAA CTCCGTtACA TCAGATATTT CATCTAATGG TCGTTCATCT AATTCAAGAC 3660

TTTGAGCTGC TTTTCGaTAC GCTGaTATTT TAAAAGTATT TTCCCCTTTT AATTCCATAT 3720

AAGTAGCAAT TTGTTCTAAT AGTTTGATAA CATCTTTTTT TGTCATAATA ACACTCCATA 3780

AAAAGAAGAC CAGGACGTAT CATTAATATA TACCTTTGTC CTGACCTCTT ATGATAATTT 3840
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TAGATATTTT AAGCTGATGT TGTAATGCTT CGTTAGGATA TAATGCCAAT AGATATAACG 3 960 

TAAAGTGTAA GACAATTATC GTCATAAACA CACCAACTAT CATTCCCATT GCACGACTGA 4020

AAATATGAAT GTTTTGATAC GCTATTATTT TATCAAAAGT TACGATAATT AGATATAAAA 4080

TGAACTTACA AAACAATGTA ATCATTAAAA AAGCTACAAT CGCTTCAAAT CGATTTTGTA 4140

GATGATTAAA ATGAAACGCA AAAGTTGTAT TAAATGCTGT TGTTTTAGGA TATGGAATAA 4200 

ATACAATTAA TCTTTCTACA ATAGATTTGT AAAATTGACT GGCAATCCAC AATGATACAA 4260 

nCGTTGCACT CAAATGTATC ATAGATAACC AAAAACCTCG TCTGAATCCA ACGATGACAA 4320 

AATACACAAA GAAAATGATT ATGATAAAAT CAATGACCAT TTATTGCTCA CGCTGCTGCA 4380

ATTTGTGAAT TTGTTGTTTC AAACGTCGAT TTTTTTTCTT CTAGTAGTAC TTTTTCATGC 4440

ATAATATTCA CAGCAGTTAG TATTGCTTTT CTTGAAGTAT CTAAACCTGC TGCTTTATAC 4500

CCTAATTCTT TTATTTTATC ATCAACTAAA TGTGCTACAT ATCGTATGTG CTCTGGGTTA 4560

TCTTCCCCAA CAATTGTAAA AAGCTGATCA TTAATTGATA CATTTACCTT GTTTTTAAAC 4620 

TGTGTCATTT ATAATTTCTC CTGATCCTTT TTTTAAAATC TAAATTCACG TTATAAAATA 4680 

TGACTGGATA GTTTGTCTGA ATTTGATACT AATATTGTTA TATTGCAATT ATGATAAAAC 4740 

AACAACACAA TCTCTATAGA TGACTTAATG TTCTTTTTAT AATGAAATAA TGTAAAGAAT 4800

TTTCTATTCA ATACTTTATC ATGTTTAAAT TGTGTCACTA TAACATTTTC ATAAACATTA 4860

TACATGACCA CTATGTATTT TGTAAGTATC CGCAATTAAT TCTTTACAAC ATACATAAAT 4920

GTTTCTGACG TTATTATCAT TTATGATATG ATTATTTTTG CTAAAGACAA TGAAATTTTA 4980 

TGAAAGGATT TACACAAATG GCGAATATCG TTTTTAAATT GTCGGATAAA GACATAACGA 5040

CATTAATGTC ACGCATTTCT TTTGATACTG AGAATTTACC TCAAGGnATG AnAGCACGTG 5100 

CAAAGTATCC AAAATACAAC TGTAAATATT TACCAT 5136 

(2) INFORMATION FOR SEQ ID NO: 333: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4239 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 333:

GGCCAAAATT GCACCTCCAC TTTCCTTTGA ACAAATCTAT TTTAAACGCA CATTCCCATT 60 

ATGACAAATT AAATGTGAGT nACATTTGTT TGTATTTTAA CATGACTACT AACGCAAACT 120 

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GGTATCTTTC AAAGATAAAA TCTTAATAAT 
TTCTTAGTAA ACTCTTTTCT CTAGATTTAT 240

CACAATATTA TATAGACCTA TTTTATTTTG ACGTAAGTTG CTAGTATCTT CAAACAAAAA 300

ACCTTTATAA AAATTCATAC CTTTATGCTA TCGCTGTAGG CTCATTAACT TGTTAGATAT 360

AATTCTTAAC TATCCTTTGA TGATTGTTTT ATTAGATTGT TTCGTTGATG GATACTTTCA 420

CGAATTTCTA TAGTTCAATG CTACTAAAAA AACAGCCCAA AACTTTAATT TGTTTTGGAC 480

TGTTTTATAA TTATGCTTGC GATGGTGTTT TAGTTTCTGA AGTTTGTTCA GCAATGTCAT 540

ATTTAAACTC TTTACCATCA TGATCTACTG TAACTTTCTT ACCTTCAATT TGATTACCAT 600

CTAATATTAA TTCACTTAAA TTATCTTCGA TAGTTTTTTG TATCGCTCTA ATTAATGGTC 660

TTGCACCATA TTCTGGATCA TATCCTTCTT CTGCGATTTT GTCTTTCGCT TTATCAGTTA 720

CAATAATATT TATGTTTTGT TCAGATAATC GATTTGTTAA TTTATTAACC ATCATTGTTA 780

CAATTTCTTT TAATTCTTCT TTTGTTAGTT TATGGAATAC AATGATATCA TCTACACGGT 840

TTAAAAATTC TGGACGGAAT GAATTTTTTA ATTCTTTTAA CATCGTTTTT CGAATTGTTT 900

CATAATCTTG TCCATCACTT GAACCACCGA ATCCAGCAAa tCGTTGATCT TGTAATTCTT 960

GTGCCCCAAC GTTTGATGTC ATTATGATAA TTGTATTTCT GAAATCAACT GTACGTCCTT 1020

TTGTATCTGT CAAATGTCCA TCATCTAAAA CTTGTAATAG AATATTAAAT ACATCTGGAT 1080

GAGCTTTTTC AATTTCATCA AATAAAATTA CAGAATATGG TTTACGTCTA ACTTTTTCAG 1140

TTAATTGTCC ACCATCATCA TGACCAACAT ATCCTGGAGG AGCACCAACT AATCGGCTCA 1200

CTGCGTGTTT TTCCATAAAT TCACTCATGT CTACACGGAT CATCGCATCA TCATCGCCAA 1260

ACATTGATTC AGCTAAAGCT CTAGCTAATT CAGTTTTACC AACACCAGTT GGTCCAAGGA 1320

AGATAAAGCT ACCAATTGGT CGTTTAGGAT CTTTTAACCC TGCACGGGCA CGTCTAACCG 1380

CTTTACTGAT TGAATTAACA GCATCTTTTT GCCCAATAAC TCTCTCATGT AATGTATCTT 1440

CTAGACTAAG AAGTTTTTCA GATTCTGTTT CATTGATTTT AGTTAATGGG ATACCTGTCC 1500

ATCCTGCAAT AACTTCAGCA ATATCTTCTT CTGACAATGA AGTTGaCATG CCATTTTGTG 1560

CATTCTTCCA TTCATTTTTA GCTTCTTCAT ATTGCTTTTC AAGTTTTGTT TGTTTATCAC 1620

GCAGgTTAGC AGCATTTTCA AACTCTTGAG CATGTACTGC GGCATCTTTT TCATTTTTAA 1680

CTTTTTCAAT TTCTTGTTCA ATTTCTTTTA AATTATTAGG TGTCGTATGA CTCTTAAGTC 1740

TTACTTTAGA ACTTGCTTCA TCAATTAAAT CAATTGCTTT ATCTGGTAAG AAACGATCTG 1800

AAACGTATCT GTTACTTAAT TTAACAGCTG CTTCAATAGC TTCGTCTGAA ATATTAATAC 1860

GATGGTGTGC TTCGTAACGA TCTCTTAATC CTTTTAAAAT AGCAACTGTA TCTACTACTG 1920
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TTTTGCGATA TTCATCTAAT GTAGTAGCAC 


CAATACATTG TAATTCACCA CGTGCTAATG 2040

CCGGCTTCAA AATATTCGAA GCATCGATAG CACCTTCAGC ACCACCAGCA CCAACTAAAG 2100

TATGCAACTC ATCAATAAAT AGGATGACAT TACCTGCTTG TTGGATTTCT TCCATAACCT 2160

TTTTCAGACG CTCTTCAAAT TCACCACGAT ATTTAGTACC TGCAACTACT GTTCCCATAT 2220

CTAAAGACAT AACACGCTTA TCTTTTAATG TCTCTGGTAC CTCATTATTC ACTAtGGCtT 2280

GCGCTAAACC TTCAGCAATA GCAGTTTTAC CAACACCTGG CTCTCCAATA AGCACAGGAT 2340

TGTTTTTCGT ACGTCTACTT AATACTTCAA TTACACGTGT AATTTCTTTA TCACGTCCTA 2400

TAACAGGATC TAATGTACCG TCTTTGGCAA TGACTGTTAA GTCACGAGCT AAACTATCTA 2460

AAGTTGGAGT ATTATTTGAC TTACTAGCTT GTGCATTTTT ATTACTCATT TCAGGGTTTC 2520

CTAAAGCTTT CACAACTTGT GCACGTGCTT TAGTAATATT TAAATCTAGA TTTGCAAAAA 2580

CTCTTGCTGC AACACCTTCA TTTTCACGAA TCAAGCCTAA TAAAATATGT TCCGTTCCAA 2640

CAAAATTGTG ATGTAATTTT CTAGCTTCAT CCATCGATAA TTCAATGACT TTTTTAGCTC 2700

TAGGTGTATA ATGCAATGTA CCAACATGAT CTTGACCATG TCCGATTAAT TTTTCAACTT 2760

CTTGAATTAC TTTATCTTCA GTGATATTAA AACTTTCTAA TACTTTTGCA GCAATTCCTT 2820

CAGGTTCTTT CATTAACCCC AATAATAGGT GTTCTGTTCC TATATTTGAA TGATTTAAAC 2880

GAATTGCTTC TTCTTGGGCA TGTGCTAATA CGCGCTGTGC ACGCTCAGTT AATCTACCAA 2940

ATAACATAAA TAATGACCTC CTACTTTATA TGTTCTCTTA GTATATCTGC TCGTTTTTCT 3000

TTTACAGATT TGTCATCTTC TTCATCTAAT AAAAATGGTG ACTGTATAGC TACCATCAAT 3060

TCATTAAATT TAAAGTTTTG TAATTCAATG TAATTTAAAT CTATACCAAG TTTAACTcGC 3120

TTAATCTATA AGAAGCCTCT TCCATAGTTA TCATTCTACA GTTTTGTAAA ATACCTAGCG 3180

AGCGAAAAAC ACGGTCTTGT GTTTCTAATT GATTATAAGT GTCTAACTTT TGTCGTATTT 3240

GTTTTTCTTC ATGAATGATT TGATTAACAA CTTCTGTTAA TGTTTCTATG ATTTCTAACT 3300

CAGATTTACC AAGTGTAAGT TGGTTGGATA CTTGATAAGT ATGTCCATAA ACTTGCGAAC 3360

CTTCACCGTA AATACCTCTG ATTGTATATC CAAAACGATT AATGGTTTGA GCAATCCGTG 3420

TCATTCTTTT CATAATAGAT AGACCTGGCA AATGTAGCAT CACGCTTGCT CTCATACCAG 3480

TACCTATATT GGTAGGACAT GTAGTTAAAT AACCAAGTTG TTCATCATAA CTTATATCAA 3540

GGCTTCGATC TAATTCATCA TCAATTGATG AAGCTTGATT ATATAAAGCC TGTAATGTCG 3600

TGTCAGTTCC CATAGCTTGA ATACGAATAT GGTCCTCTTC ATTTATCATG ACACTTAAAG 3660

ATTCATCGTC ATTCACTAAT ACTGCGGCTG CTGGTTGTTT TATTAGTTCA GGACTAATCA 3720
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CAAAGTTTGG CAAGGCATCT TGTACCTCAT TTATAACTCT AAATCCcATC ATTTTCAGTA 3 84 0 

GCATACATTA GTGGATGCAC ATGATTTTCT AAAtTACGCG CTAACCGAAT TCTAGAAGAC 3900 

ATAACAATTG GTGTTTCTTC ATTACTTTTC ATCCATTGGC TGATATTATC ATGAATATTA 3960

TGCGTCATCA TGTTGCACCT CACTCTCAGC TTTTAGTGCT TTAATTTCAT CTCTAACAAT 4020 

GGCTGCTTCC TCAAAATCTT GGATTTCAAT AAGTTTTTTC AAATATTCAT TCTTTTCTTC 4080 

GATTTTTCGC TTTAAAGCTA TCTTTTTATG TGAAGAATGT GGTGTCTTTC CAACGTGCTC 4140

AAATTGTCCA CCTTGAACTC TGCGGACGAT ATCAATGATG TCATCTTTAA ATGTTGcATA 4200 

ACAATTAGCA CACCCAAATT TACCAACATG TGCAATATC 4239

(2) INFORMATION FOR SEQ ID NO: 334: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1245 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 334:

CAAAACTTTG ATTATGTCGT GTCAGTTATT AATGCATACA GTAATGCATC GTTGTTATTT 60 

TTTATTTATT TACTATTTAT TATGTACCAA ATCAGCCTTt CAGTAAAATA GGTCAAATTA 120

CTGATTTTCT AAATATAAAA TGCCTCCTAA TAACATACTA TTAGTACATC ATTAAGAGGC 180

TCTTGTGTTA TTTGCATACT AAGCGCTCAA ATTTAAATTT AAGATGAAGA TTCTTGCAAT 240

AATATTTCTA TATTTGTCGA TATATCCAAT GGATCTTCCA TTGGCAAGAA TCGATTTACA 300

ACATTTCCTT GTCGATCGAT TATAAATTTT GTGAAATTCC ATTTTATTGG GGaCCCAAAG 360 

ATTCCTGGTT GTTCaTTCTT TAAATGCGtA AATAACGGAT GTTCATCGTT CCCGTTCACA 420 

GATATTTTAG CTAACACTGG AAATGTTACA CCAAATTTCT CACGACTAAT TTTCAAGATT 480

TCTTCATTAG AACCTGGTTG TCGATTGTCA AAATTATTAT TCGGAAAACT CAACACTACA 540 

AACCCACGAT CCTTATATTT TTGAAATAAA GTCTCTAGTT TTTTCAATTG TTCGCTATAT 600 

ATACATTCTG TTGCAGTATT AACAACTAAA ATCACTTTAC CTCTAAATGC TTCTAATTTA 660 

TAAGTTAAGC CTTTATAATC ACTTACTTCG ATATCATACA CATTTCTATT ATTCATAAGA 720 

CACCCCTACA CAGCCTTTTT TATATTGAAT ATGTTCTTTT TAGAATGTTC TGATAAAATA 780 

AGTGCGCGTT TACACCGTGA ACACACATTA TATAGCGTGA TACATTTTTC GAGCACACGA 840

TAAATAATGT TCGAGTTTAT GTTGTTGCTC AACCTATCCG ATTTACCGTC TTTTTTCACC 900 

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CCGTTATAAC 


CCCTCATTAT AATCATCCTT ATTTTCTATT TTTAAAAAGA CAATTAGACC 1020

GCTCTTTAAA CTATAGaTTA ATACTTAAGk TAAACTCATA CATACTGATA CCATACGTTA 1080

GATCTAACAA TTTAAAATTC GTTATAACTA TGGATTAAAG AGCTGCCCAA CTCATATAAT 1140

CCTTAAAAAC TTCACATGTG ATTGTJcTATT AAGCCCTCCT TTATCnTATT AAATATCCTT 1200

ATAACCCTTT TAAAATTAAA CTGACACACT CATACATTGT TACAC 1245


(2) INFORMATION FOR SEQ ID NO: 335:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 335:








CTTGTAAAGG TAGTTGTTCA TTTAAATTAA AACAGTATGC TAAGTTnGTG CTTATATTTT 60

GCAACTTCAA TTCGACAGGC TCATCTTGAT ACATGAATGC CTCAATTTCA TCATGTGATA 120

ACTTTTTCAA AATATCAATT AAATGrATAC TAAAAACGAT AAATAACATA TCCCAATTTG 180

AATTCAGCCC TAGCGATTTT AATTTGTTTA AAATCTTATC TTTTTGAAAA ATTCGATTCC 240

TAATGTCTTG TATATCGTTA TCAGTCAAAG TTTCCCAATC TATATGTGAA TGAAGACCTA 300

AATAACACTT ATCCATTAAT AATTCATATA CCGTTAATGC AGAGACATCG AAACAACGTT 360

CTTCACTTAr AAAAACGCCA TCAACATCAA ATAAAATTTT CTTCACAATC CCCACTCCAT 420

TTCTGaAAAT TCAGaTATAA ATCATTCTAC TATTTGACTA AAAAAAGCGC AAACCCTATT 480

GAAGTaGATT TGCGCTTTAG CTGTTnAAAT TTTATAAATG TnTTTCAATT TCATCAGCAA 540

CCTGCTGTAC GTGTGTACCG ACAATAACTT GAGTTGAATG TTTGCCATTA ACAGTAACAC 600

CAACTGCACC GGCGTTTTTA ATCTTCTGTT TATCAATAAT AGATGTGTCT TTTAACTCTA 660

GACGCAACCT TGTTGCACAA TTGGTTAAAT TAACAATATT CTCTTGACCG CCTAAACCTT 720

CTAATATTTG TATAGCATGT TGATGATATT TACTTTGTTT AATATCATTT TCACCAGGAG 780

CAATATTATC TTTTACAACT GTTGGGTCAA CTAATTCATT TTCACCTCTA CCAATCGTAT 840

TCAAGTTAAA TACTTGGATT ACTACACGGA AAATCACATA GTATAAGATG AAAAATACAA 900

CACCTTGAAC AAGCAACATC AATGGATGAT TTGATACTGG ATTAATTAGT GATAACACAT 960

AATCTATCAA ACCTGCACTA AATGAAAATC CAGCTGTCCA ATGGAATGTA GCTGCGATAA 1020

ATAAAGATAA TCCTGTTAAT AACGCATGAA CAACATATAA GATTGGCACA ACAAACATAA 1080
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ACCAACCGTA AACTTGTTTT TTCTGAGTAG TTTTAGCTGT ATGATACATT GcTAACGCAG 1200 

CCGCTGGAAT ACCGAACATC ATGATTGGGA AGAATCCCGC TTGATAGCGT CCTGTAATAC 1260 

CTTTTATAGC ATCTTTGCCA CTTTGGAATT TACCAATATC ATTAATACCA ATCGTATCAA 1320 

ACCAGAACAC ACTATTCAGT GCATGATGTA ATCCTGTAGG AATTAATAAT CTATTGGCAA 1380 

CACCATATAT GAAAGCTCC 1399 



(2) INFORMATION FOR SEQ ID NO: 336: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 336:

TATAGTTnTA TTATTTAGCG AAGCATTAAT ACTACCACCG GTTATAACAA ATGTATTTTG 60 

CGTATTAAAT TGAATGGTAG GACCAATCAA AGTATATTCA ATCGCTGGAC CATCATTGTT 120 

AATTAATGAC TGCGCAACCT TAAAACTAAA TTGATCCATG GCACCTGCGC CTGAAAATCC 180

AATATGTTCA TAACCTATTC TTCCTAGATC TTGTACCGTT GAAAAGAGAC CTGGTTGTAA 240

AATCTTAATT GACATTTTCA ATCACCACCC AGTCATCAAC ATTAAAGTTG CCATCTGATA 300

TATCTCTTTC GATTTGTATA AATTTCTGTT CATCTATTGC ATAAAATTGT ATCCATTCTC 360

CTGCTTCGTA CATTGACATT GGTTCACGCT CGCTGCTAAA TACTTTTAAc GGTGTGCGTC 420

CAATAATTTG CCATCCGCCA GGAGAATCTG ATGGATATAG TCCTGTTTGA TTATTCGCAA 480

TACCTACAGA ACCTGCATGA ATTTTTAACC TTGGCTGATT ACGTCTAGGT GTATGTAGTT 540 

GTTCATCAAG TcCGCCTAAG TATGGAAATC CTGGCATAAA TCCTAGCATA TATATTAAAT 600 

AAGGTTTACT TGTATGTTTT TCAATAACTT GCTCAACAGT TATTCGATTA TGCTTTGCTA 660

CTTCTTCAAT ATCTGGTCCA TATGTACCAC CATATTGAAC AGGTATTTTA ATAATACGAT 720 

TGGTTTGATT CACAGCATGA ACATTTTTTT CATTAAATTT GTTAAGTTCT AAATTTTCAA 780 

TTAATTTAGA AGATGTTATA GCTTGTTCAT CAAAATATAT TAGAACTGCT CGATACGAAG 840

GGACAATATC TTGAATTTCT AATATTTCTT TTTCTCGTAT CCACCGTACC ATTGCTGTGA 900 

CATTACGATA TGTCTCTTCG GATATTTTAT TTTCAAAATA AATCATAATT GTCTGCTCGT 960 

TAATAAATCT TACATCCACT TTAAATCCCC CTTTGTATTG CAATAAACCA GTATTGAATA 1020 

CCTTTTCATT GTATCATTGA GAAGCACAAG TTGTTTAATA AGTAATTCAA ATCGCATATA 1080 

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TTAATATTGT AACTCTTACA CTAATTTAGG TTCTGCTATC ATTCGGTCTG ATGGAAAATT 1200 

TTTACTTTTC ATCTGTCCGA TTTTTTGATT TTGAATATAA AAAAGCACGA CCGAAGTATC 1260

ATTAACACAC TTCAATCGCG CAATTAAATA ATCTATTTGA TCATTTATTG GATATTAACA 1320

ATTTTTACG 1329 



(2) INFORMATION FOR SEQ ID NO: 337: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 337: 

GAATAGTGCC ATTTGGAAAG ACATAAAGAA TCCAGAAGCA CCTTTAGCGA ATATGCCGTG 60

TTCATATAGA TTTGTGAAAC TGGCATGCCC GAATTGAGTT TTAAATGCAA ATAGAATCAT 120 
GACGAAACCA ACTACTATTA AACCAATAAT TGTCGCTATT TTAATGATAG AGAACCAAAA 180 

TTCTAATTCT CCGAAAAGTC TTGCGCTAAG TAGGTTGAAT GACATTAATA ACAGTACACA 240

AAATAGTGCA CTTATCCAGT TTGGAATTTC TGGGAACCAA AAGCTAACAT ATTTTGCCAC 300 
AGCCGTTACT TCAGCCATAC CTGTAATAAT CCAACAGAAC CAGTATGTCC ATCCGGTAAC 360 

AAATCCTGCA AAAGGCCCAA TATATGTATT GGTTACATCT GCGAAAGATT TAAATTCAGT 420 
ATTCTGTATA ATGATTTCTC CTAAACCTCG CATAAACATA AATAACATAA ATCCTATAAT 480 
GATGTATGTT AATAGAATTG AAGGGCCGGT TAATGCAATC GTTTGACCAG CACCTAAGAA 540 

TAAGCCTGTA CCAATTGCAC CGCCAATTGC AATTAATTGT ATGTGGCGAT TGCTCAGTTC 600 
CCTTTGTAAT TTTTCAGCCA TAATACATCT CCCTTAAATA TAGATATGTT TATTATGCAC 660 

TTATATTGAG ATATATACAA TTATTTTCGG TAAAAATGTG TAAAATTCCA TGTTAATATA 720

CTTTGGTTTT TATAATCATA TATAATAACC AATTGAAAAT TTAATTCTAT TGTAAAATTC 780

ATGGATTATT CACATCTTGA AAAAGCTTTA ATGGTGCTAT TTGTGGCTAT TCTGTGACAT 840

TTACATAGAT TTACAAAAAA ATTGTTGCAC ATATAATGCC AGTtTTTATA TTTCACAAAC 900
GAAATGCGTT TAcTATAATA TTAGTTGAAA GCCATTTCAT AAAGAAACAG TAAAGGGGAA 960
ATTLATCaTA GCmGaATTAC AAAGAGGTTT AGAAGGGGTT ATCGCmGCGG AGACTAAAAT 1020

AAGTTCAATT ATTGAAAGTC AATTGACTTA TGCCGGCTAT GATATTGATG ATCTAGCTGA 1080
AAATGCGCAA TTTGAAGAAG TTATTTTCCT ATTATGGAAC TATAGATTGC CAAACGAAGA 1140
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TACACATTTT GAGGAGTATG TTACAGATCA CGTGCATCCA ATGACAGCAT TACGTACGTC 1260 

ATTATCATAT ATTGCACATT TCGATCCTGA TGCTGAAAAT GAATCAGATG AAAATCGTTA 1320 

TGAAAGAGCA ATGCGTATAC AGGCTAAAGT AGCATCATTA GTTACAGCGT TTGCTCGAGT 1380

AAGACAAGAT AAAGAACCAC TTAAGCCTAA TCCTGACTTA AGTTATGCGG CAAACTTCCT 1440 

ATATATGTTA CGTGGGGAAT TACCAACAGA TATAGAAGTA GAAGCCTTCA ATAAAGCACT 1500 

TATTTTACAC GCTGATCATG AGTTGAACGC ATCTGCATTT ACGGCACGTT GTGCGGTATC 1560 

ATCATTGTCA GATATGTACT CAGGTATTGT AGCAGCCGTA GtTCTCTGAA AGGGCCATTA 1620 

CATGGTGGTG CAAACGAACA AGTTATGACG ATGTTATCTG aGATTGGGTC AaTTGAAAAT 1680 

GTTGATGCTT ACTTAGATGA AAAATTTGCT AATAAAGrTA AAGTAATGGG CTTCGGTCAT 1740 

CGTGTATATA AAGATGGTGm tCCTAGaGCG AAaTATTTaA GaGAAaTGAG CCGTCAAaTT 1800 

mCGAAAGACG CTGGTCGTGA AGAATTATTT GAAaTGTCAG TGAAAaTGGA AAAmCGTATG 1860

GCAGAAGAAA AAGGATTAAT TCCTAATGTT GATTTTTATA GTGCGAGTGT TTATCACTGT 1920 

ATGGAAATAC CTCATGACTT ATTCACGCCA ATCTTTGCTG TAAGTCGTTC TGCAGGATGG 1980 

ATTGCTCATA TTTTAGAACA ATATAAAGAT AATAGAATTA TGCGTCCTAG AGCGAAATAT 2040

ATTGGCGAAA CGAATCGTAA GTATATCCCG CTTGrAGaAA GAAAmTAATC AATAGAAATT 2100 

AAAAATGAAG ATGTAAAATT TGGAGGTAAA ATAACTATGA CTGCAGAAAA AATTACTCAA 2160 

GGAACTGAAG GATTAAACGT ACCTAATGAA CCAATTATCC CATTTATTAT CGGTGATGGA 2220 

ATTGGACCGG ATATTTGGAA GGCAGCAAGC CGAGTTATAG ATGCTGCtGT TGAGAAAsCC 2280 

TATAATGGCG AAAAACGCaT TGAATGGAAA GAAGTGCTAG CTGGCCAAAA AGCATTTGAT 2340 

ACAACTGGTG AATGGTTACC TCAAGAAACA CTTGATACAA TTAAAGAATA TTTAATTGCT 2400

GTTAAAGGAC CTTTAACAAC ACCAATTGGT GGTGGTATTA GATCATTAAA TGTGGCTTTA 2460 

CGCCAAGAAT TAGATTTATT TACTTGCTTA AGACCGGTAC GTTGGTTTAA AGGAGTACCA 2520

TCACCTGTTA AACGTCCACA AGATGTTGAT ATGGTTATTT TCCGTGAAAA TACTGAAGAC 2580 

ATTTATGCTG GTATTGAATT TAAAGAAGGT ACAACAGAAG TTAAAAAGGT AATTGACTTC 2640 

TTACAAAACG AAATGGGTGC GACAAACATT CGATTCCCAG AAACTTCAGG TATTGGTATT 2700

AAACCAGTTT CTAAAGAAGG AACTGAGCGA TTAGTTAGAG CAGCTATACA ATATGCTATC 2760 

GATAATAACC GTAAATCAGT TACTTTAGTT CATAAAGGTA ATATTATGAA ATTTACAGAA 2820 

GGCTCATTTA AGCAGTGGGG TTACGATTTA GCATTATCTG AATTTGGTGA TCAAGTATTC 2880 

ACTTGGCAAC AATATGACGA AATTGTTGAA AATGAAGGCA GAGATGCTGC TAATGCTGCT 2940 
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TTACAACAAA TTTTAACTCG TCCAGCTGAG CATGATGTTG TAGCAACTAT GAACTTGAAT 3060 

GGTGACTATA TTTCAGATGC TTTAGCTGCA CAAGTTGGTG GTATTGGTAT TGCGCCAGGT 3120 

GCAAACATTA ATTATGAAAC AGGTCATGCT ATTTTTGAAG CAACACATGG TACAGCTCCA 3180

AAATATGCAG GTTTAAATAA AGTGAATCCA TCTTCAGTAA TTTTAAGTTC TGTATTAATG 3240 

TTAGAACATT TAGGATGGCA AGAAGCGGCA GATAAGATTA CAGATTCAAT TGAAGATACA 3300

ATTGCTTCAA AAGTTGTTAC TTATGACTTT GCCCGTTTAA TGGaTGGtGC TGAAGAAGTT 3360 

TCTACATCAG CATTTGCAGA TGAATTGATT GnAAATTTAA AATAAGCAGA ATAGAATTAG 3420 

G 3421 

(2) INFORMATION FOR SEQ ID NO: 338: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3173 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 338:

CCCTnGATAC CCAAACTGCC CTAATACCAC TGGCTAAACC TAATGGATAG TACCATTTAT 60 

TTTCCAATAA ATAAnCCAAC TGCAATnGCT ATaACTCTAA ATATAATAGA GATAATCmcA 120 

TTAATTGGAT TAAtACGCCA AATACTAGTA ATAATAGGCT AGATAATAAT CCACCTAAAA 180 

AGTACTTTTT AATTCCAAAG AAAGCTAATA TCAATAATGC TGCCGGTGCA GATAATTGAA 240 

AATCTAATCC TGGTATAATG GACGGTATTT TCAAAACTGC CAAAATGGTT AAAATCGCAG 300 

CAATGACACT AATTTGAGTA ATATCTTTTG ATGTCATACT AAAACCCCTA TACCGTTTCA 360 

TAAACAACTT GCTTCGGTGT GCTTTCTAAA AATGATATGT AATGATTTAA ATCAATACAA 420 

TCGTCCACAA ATATTATTCT GCCTCCATAT CTCGTATTAA CTGGTTTAAT ATCAAATAAT 480

CGATGGTAAC CAATTTTAGC AGCGGCAAAA TAACCTGTCG TATACGTTAA GTCATCGGAC 540

ACGCAAAGTT CTCCTTTGAC ATACGGATGC GCATTGATAC AACTAGCAAT TGCTAAGGCA 600 

TCAGTCACTC TTTCATTAAG ATCACCTTTA TTATTTATAT CTTCAAACGA AAAATGTGTT 660 
GCCCTAATCC CCCTTTGTCC AAATGAATCT AAACGTTTAC CAGATATAGC AGATAGAATA 720

ATAGCTCCTG TATAAACCGT TTCATTTTTA ATATATGTCA TCCCTTGATT TAGCGCTTGT 780

TCAGTGACAC CACATTCTTG TGTTAAATGT TGGAGATTTG CTTTATCATC CTCAATAATT 840

TGTAATGCTT TTATTTGTTG AATCGGTTCC ATGATTTTTT GTATTTTTAT ATTGAGAAAA 900 
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ATTTTGTCAA ACTCACAAAT CGTTTCAGCA CCACTAATAT GAACATCTTG ATTGCTAGAA 1020 

CGCATTTTTA TACTATACAT GACGATCACC TcAATCTTCT TGaTGCAAAA TTTCAAACAA 1080 

CCTATCTATA TCTTGTTCAG TATGAAAATA CGACAATGAT ATTCTTAACA TTGGCTTAGT 1140 

CACAGTtGGA TACCTTAAAT AACTTGTAAA CACATGATGC TTTAATAATG TTTGATGAAT 1200 

GTTCTCAGCC GCTTCTATGT CATCAAACTC AATAAACTTA ATCGGCGAGT TTGaACTATT 1260 

ATAATkAACA TTGAGTGCTT TTAACTTTTG GTTAAAATAT TTACTCAAAC TATTTAATTT 1320 

AGTGCGTCTA TCATCAGCAT TTATTAACTT TTCAATGTTT CTTTTTATAA AATACAAATT 1380 

ATAAATTGGC AAACTACTTG AGTAGATGAG TGGTCTACCG TGATTAATTA aCATATCCtT 1440 

CaCATCaTTT GaACTkaAAA TCACACCCCC GTATGCACCA CATGCTTTAG ATAAACTAGA 1500 

AGTGAGTATA TCTACACCTT GATAATTCGA GTAAtTCTCT ATtCCAAAAC TATGTGAAAC 1560 

ATCGAGTATC AGTGTTGCGT TAnATTTATG CTTTAATGAG ACTAATTGAC CAATATCCAC 1620

AACGTCGCCA rTCGTTGAAA ATACACTATC AGATATGATT ATTTTTGGTA TATTTTGATT 1680

AGGGTATTTC TCTAACCTTT TTTCTAAATC AGCAATATCT AAATGCTTAT ATATCACTTT 1740

TTCTAAACCA CTTAACTTAA TACCGTCAAT AATACTCGCA TGATTTTCTT GATCTGAAAA 1800

CACGACACAA TTTGTATTTT TGAAAATATT AAATAACGCC AAATTAGCAT CATAACCACT 1860 

ATTTAAGATA GTACATGcAC TATATCCGAG CCAACCTGCT AACATTGTTT CAATTTCTTC 1920 

ATAAGCTGTC GAACTTCCAC TAATTAATCT TGAACTTGAT AAGTGATAAC TATACTTCCG 1980 

CATAAATCTT TCGAAATCAT CCTTATCAAA CGCTATTTGA CCTAATCCTA AATAATCATT 2040

AGATGTATAG TTCGTACATC TCTTATTTTC TACTTCAATA TACTGTCTAT CTATATACCC 2100 

TACCGATTTA AGCGACCGAT ATAACCCTTT CTGTTGTAAT AAATCAATTT GCTCTTGAAA 2160 

CTTCATTCTT GTTTTCCTTA TTTTCACAAG TGTCATAATC AATTTCAAAG CCTAAATCAT 2220

TAATCATATC GTAGTCTAAT TGGTTCGGTT GCCCACCAGT AATTAGATAA TnCACCGACA 2280

AATATTGAAT TCGCCGCTTT TAATGCTAAT GGCTGTAACG AACGTAAGTT GACCTCTCTT 2340 

CCTCCAGCAA TACGAATTTC TTTCGTAGGA TTGATTAATC GGAATAATGC TACGATTCTT 2400 

AAACATTTCA TTGGTGTTAA ATCATCCATG CTTCCAAACT TTGTGCCTTT GATTGGATGC 2460
AAAAAATTAA TCGGAATACT GTCGGCATCC ATTTCTTTTA AAGCAAATGC CATATCAACA 2520

ATATCTTGAT TAGATTCTCC CATACCACAA ATCACGCCAG AACATGGTGA TATATTATTC 2580

GC7TTCATTA GTTCTATCGT ATCTGTTCTA TCTTTATAAC TATGCGTTGT CACGACGTTA 2640

TCATGGTAAT TTTCACTTGT ATTAATATTG TGGTTATATC TGTCTACACC AGCTGACTTA 2700 
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5 



TGTTGAGATT TAATCGTTCT TACAGTATTA CTAATATGAT CAACTTCTTT ATCGCTCGGT 2820

CCTCTACCAC TCATAACAAT ACAATATGTT CCAATATGAT TATCATGTGC CACCTTTGCT 2880

CCATCGATAA TTTGTTCCTC TGGAATTAAA GCATATCGCT GTTTTTGTTT AATATCTCGT 2940

GATTGTCCAC AGTACCCACA ATTTTCAGGA CATATACCAC TTTTAGCATT TAAAATCATG 3000

TTTAATTTTA CTTTTTTACC AAAATAATGT TTTCTTAAAA TGTACGCCTC ATTTAATAAA 3060

TCTAAGGTAT CAATATTAGT ATCCTCATAA ATtTTCAATA CAGTCTCTTT TGTtAATTGT 3120

tCCCCTTGTA ATATGCGTTT AGCCAAATTC ATATTAACAC TTCCTATCTA AAA 3173


(2) INFORMATION FOR SEQ ID NO: 339: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1694 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 339:






CGATTATCCA TTAATACAAC CCTAAGTAAA TGTATAAAAA TTATCTTCCA CAAACTTCAA 60

CAAAAGCCTA AATAAATTAC AGCAATTTAT CAAATATTGC TTACTTTGAT TTTATGAAAT 120

nACTTAATTC TAACACATAC TAAATCATCA TATACTAATT CGAAATCAAA TGCATTTAGA 180

GATAATCaAA ATGCGGAAAC ATCTCCaATA ATCAATAATC TATTCCCAAT AAATATGAAT 240

GTTCTCAACA ATACATTATT TATATCTCTT TACACTGTCA TCGACAAAAA CTAAATCTTT 300

CACTTTCAAT TTCGAACGTG GTTCTACGAC ATTTGCTGCT ATATCATTTA ATGGGATTAA 360

AACAAATGCA CGTTCATTCA TTCTCGGATG TGGCACCGAC AGTTTTGGTA AATCTATCAT 420

TTCTTCTCCA TACAACAAAA TATCCACATC TAAAGTTCTA GGACCCCATC GTTCCTTTCT 480

AATACGGTGT AAACATTCTT CTGTCkTCAA ACAACATTCC AACAGTTGTA ATACTGTGAG 540

TGTTGTTTsA ATTTCAACAC ACAAATTTAA AAAGTTAGGT TGCTCAGTAT ACCCAACTGG 600

TGCTGTTTCA TAAATCGGAG AAATAtTAGA TACGTTAATA CCATCATATT CATTCaAAAT 660

CTTyATAGCA TCGTTTAACt GGCTTTCTCT ATCACCAaTa TTACTACCTA AyCCTAAGTA 720

TGCTTGAATC ATyTATTCTC CCTCACTATT TCGATACCTA CTCCATCATA ATGACCCGGA 780

ATCGGTGGGT TTTCTTTAGT GATTCTCACT TTCGTTTCCA TTACACGATT ATATTGTGAA 840

TTTATACGAT TTGCAATACG TTCAGCTAGA TGCTCAAGTA AATTAACGGC CTTACCTTCC 900

ATAATTGATT TAACCTCTTc GAACACTTCA CCATAATGAA CTGTATCAAT AACATTATCA 960
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5 



ATTTCATTTT CAGCTGATAA AGCACCATGA TATCCATAAA AGCGCATACC TTTAAGAAAG 1080

ATTGTGTCTT GCATTTTCAT TCTCCTTTAA AAAATCTATA CCTTTAGCTA ATTTAGCATT 1140

CAACTCGACA TTATGAACGC GTACTGCTCT AACGCCTTTC ATAATACCAT ATGCAGTCGT 1200

AGCTGCAGTT ACTTCATCTC TTTCAACCGG TGTTGTATCA TAACCCATCA TCTCTTTAGT 1260

GAAACGTTTC CGGCTTGTCG CTAATAAAAC TGGATATTCT GTTGCAACAA GTTCATCCAG 1320

TCTTGCCATA ACTTCGGCTT CTTCATTTCT AGTTTTAGCG AAACCTATAC CTGGATCTAG 1380

CCAAATTTTA TTTGAAGGTA TACCAGCTAT TTTAGCTTGA TGTGCTTGTG CTAACAAAGA 1440

TGTTAACATT TCTTCGACAA CCGGTTCATC ACGATTACCA TTTCCATTAT GCATTAAAAT 1500

AATTTCCGCG TCATAtTTAG CTACAATTTG GAACATACGA TGATCATACA GACCgCCcAT 1560

tGATCATTAA TCATATCAAC GGCTAATTTT AAACATGCTT CAGCAACCTC ACTTCGAAAT 1620

GTATCGACTG AAATTTTTTA CATCAllAACC GACAATAGCT TCAnCAACAG TAATACTCTG 1680

TTCCATCTCT TCTG 1694



(2) INFORMATION FOR SEQ ID NO: 340: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 340: 
AGCATTTCTT TTCTATAAAC ATTTAATTGA ACATTATTAA GTACACTATT ACTATAGTCA 60 

CTATATTGAA CaCATACCTC ATTTAATTCT AATAGCGGTT CAGATTTGTA CTTATTATCA 120 
TTATTTGCAG ATGTTTCATC TATCCATTTT TTCACTTTAA ATTTAACATG TTCACTCATA 180 

CAAACGTCAC GTAAATTCGC TAAGTTATCA ATGGATTCGA CATCTACTTC TGCATATTTA 240

AGCGCTGTAC AGTATAATGG TTCACGTATG CCTGCTTCTT TAAGCTTAGA TGATTTTAGC 300 
AAATCACTAG GCGTTGTATT AGCGATGATT TTTCCATCTT TAAAAAGAAG AACTCTATCA 360 

AACGTATCAT CTAATGATTC TTCTAATCGA TGTTCGACAA TAATCATCGT TGACTTTGTT 420

TCTTCATGAA TATTGTTTAA CAATCTCAGC GTTTCATGTC CTGTCGCAGG ATCTAAATTG 480 
GCCAGTGGCT CATCCAATAT TAAAATAGGC GTACGATGGA TTAATATACC ACCTAATGAA 540 

ACGCGTTGTT TTTGACCTCC AGATAAATCT TGCGGTCGGT GATTTAAATG TTCTATCATG 600 
CCAACTTTTT CAGCCCAATA ACTTACATTT TTCTTCATAT CATCTTGTTC AACACAATTA 660 
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TCTTGTAAAA CTGTACCAAC AACA7TAGAT CTATCATGTA AACAACTAAC GGTTGCATCT 780 

TGATTATTTA TATATAGTTC CCCAGTTATG TTACCTTTAG TTTTAAATGG AATTAATCCG 840 

TTTATGCAAT TTGCAAAAGT CGATTTACCA CTACCCGAAG CACCAACTAC TAATACTTTT 900 

TCTCCTGGAT AAATATCAAC ATTTATATTC TGTAATGTAG GTGTTGCTTG ACTATGATAT 960 

TGAAAACTAA AGTCTTTGAA CGAGATAATT GGTTCAGTCA TGATATATCA TTACCTTTCT 1020 

ATATTCATTT ACATATCTGA TTCAACAAAA TAACTATTCC TTACGTAAAC TACCTTTTTT 1080 

AATTTGAGAT GAaGCATATG CTTTTAATAA TATTGTCCCA ATAATGCCAA CTGAAATAAT 1140 

ATTTAATACT GCAGAGATAA CACCTTGTGT ATAAACCTTG TTAGCTGGTT CGTTATAAAT 1200 

CAAAATATCT AATGTTGGTG CAATAAGTGC CCAGCAAATA ATATTCGCAA TAATTTGACC 1260 

GATATTAAAA TAAACCATCG ATTTCCTAGA AAATCGGCCT GAAGAAAGAT TTAATTTTAG 1320 

TCCAATCCAG CCATATAAAC AGCCTATAAT TCCCGAGC 1358
(2) INFORMATION FOR SEQ ID NO: 341: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4557 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 341: 

TAGAAGAATT GGAGAAAATG CTAATTCAAT TGTCAACATT CCATAGTTAT CATGATTTAG 60 

AGTTTCTATT TGTGACACGT GAAGATGAAG TTGAAACATT GAAATGGGCA CGTTGGTTGC 120 

CACATATGAC ATTGAGAnGG CAAAACATTA GAGGATTTGT TTACAATCAA CGAACGCGTG 180 

ACCAAATTTT AACGTCAATT TATAGCATGA TTAAAGAACG TATCCAAGCT GTGCtGaACG 240 

CAGCAGAAGT AATGAGCAAA TTATTTTCAC ACCGCAATTA GTGTTTGTCA TTACAGATAT 300

GTCATTAATT ATTGATCATG TCATTTTAGA ATATGTAAAC CAAGATTTAT CAGAATATGG 360 

TATTTCATTA ATCTTTGTTG AAGATGtGAT TGaAAGTTTG CCAGAGCATG TAGATACCAT 420 

TATTGATATC AAGTCTCGTA CTGAAGGCGA ACTGATTACG AAAGAAAAAG AATTAGTTCA 480

ATTGAAATTT ACACCTGAAA ATATTGrTAA CGTCGATAAA GAATATATCG CGCGACGTTT 540

GGCGAATTTG ATACACGTCG AACATTTGAA AAATGCAATT CCTGATAGTA TTACATTTTT 600

AGAGATGTAT AACGTGAAAG AAGTAGATCA GCTTGATGTG GTTAATCGAT GGAGACAAAA 660 

CGAAACATAC AAAACGATGG CAGTACCTTT AGGTGTAAGA GGTAAAGATG ATATTTTATC 720 
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AGGGAAATCT GAGATTATCC AATCATACAT TTTATCTTTA GCTATTAATT TTCACCCTCA 840 

TGAAGTTGCA TTCCTATTGA TTGACTATAA AGGTGGGGGT ATGGCGAACT TATTTAAAGA 900 

TTTAGTCCAT TTAGTTGGTA CGATTACAAA CTTAGATGGC GATGAAGCGA TGCGTGCCTT 960

AACATCAATC AAAGCCGAAT TGAGAAAACG TCAACGTTTA TTCGGAGAGC ATGATGTTAA 1020 

CCATATTAAT CAATACCATA AGTTATTTAA AGAAGGTATT GCGACAGAAC CAATGCCACA 1080 

TTTATTCATT ATTTCCGATG AGTTTGCCGA ATTAAAATCA GAACAACCTG ATTTTATGAA 1140 

AGAACTTGTA TCAACGGCAC GTATTGGACG TTCGTTAGGT ATTCATTTAA TACTTGCGAC 1200 

ACAAAAACCA TCGGGTGTTG TTGaTGACCA AATTTGGTCT AACTCTAAAT TTAAGTTGGC 1260

ATTAAAAGTA CAAGATAGAC AAGACAGTAA TGAAATTTTA AAAACACCAG ATGCAGCAGA 1320

CATTACmTTA CCaGgTCGTG CGTATTTACA AGTTGGTAaT AATGAmATTT ATGAATTATt 1380

CCAATCTGCA TGGAGTGGTG CAACATATGA CATCGAAGGC GATAAATTAG AAGTTGAAGA 1440

TAAGACGATT TACATGATTA ATGACTATGG TCAACTTCAA GCAATCAACA AAGACTTGAG 1500 

TGGACTTGAA GATGAAGAAA CGAAAGAAAA TCAAACTGAG TTAGAAGCGG TCATAGATCA 1560 

TATCGAATCT ATTACAACAC GATTAGAAAT CGAAGAAGTT AAGCGTCCAT GGCTACCACC 1620

ATTGCCAGAA AATGTATATC ArGAAGATTT AGTAGAAACa GATTTCAGAA AATTATGGTC 1680

AGATGATGCA AAAGAAGTGG AATTAACATT AGGACTTAAA GACGTACCAG AAGAACAATA 1740

TCAAGGACCG ATGGTATTGC AATTGAAAAA AGCTGGGCAC ATCGCGTTAA TCGGAAGTCC 1800

AGGATATGGT AGAACAACGT TCTTACACAA CATTATTTTC GATGTTGCAA GACACCATCG 1860 

TCCTGATCAA GCACACATGT ACTTGTTCGA TTTCGGTACC AATGGTTTGA TGCCAGTTAC 1920 

AGACATACCA CATGTCGCTG ATTACTTTAC AGTAGATCAA GAAGACAAGA TTGCTAAGGC 1980 

GATACGTATA TTTAATGATG AAATTGATCG TCGTAAGAAG ATTTTAAGTC AGTATCGTGT 2040 

CACTAGTATT TCTGAATATC GAAAATTAAC TGGTGAAACA ATTCCGCATG TCTTTATTCT 2100 

TATTGATAAC TTTGACGCAG TAAAAGATTC ACCTTTCCAA GAAGTTTTTG AAAATATGAT 2160 

GATTAAAATG ACGCGTGAAG GGCTAGCATT AGACATGCAA GTAACCTTAA CTGCTTCAAG 2220 

AGCTAACGCT ATGAAAACAC CAATGTACAT TAATATGAAA ACGCGTATCG CAATGTTTTT 2280

ATATGATAAA TCAGAGGTGT CGAACGTAGT AGGACAGCAA AAATTTGCGG TTAAAGATGT 2340

TGTGGGTCGA GCATTGTTAA GTAGTGATGA CAACGTATCA TTCCATATTG GCCAACCATT 2400

TAAACATGAT GAGACCAAAT CATATAATGA TCAAATTAAT GATGAAGTAT CGGCGATGAC 2460 

AGAATTTTAT AAAGGTGAAc ACCAAATGAT ATtCCTATGA TGCCAGATGA AATTAAATAT 2520 
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GGATTAGATT 


ATGAAGGTGT TACACTACAA AAAATTAAAT TAACTGAACC AGCAATGATT 2640

TCATCAGAAA ATCCGAGAGA AATTGCGCAT ATTGCTGAAA TTATGATGAA AGAAATTGAC 2700

ATATTAAATG AAAAATATGC GATTTGTATC GCAGACTCAA GTGGAGAGTT TAAAGCTTAT 2760

AGGCATCAAG TGGCTAACTT TGCCGAAGAA AGAGAAGACA TTAAAGCGAT TCATCAACTA 2820

ATGATTGAAG ACTTAAAGCA AAGAGAAATG GACGGCCCAT TTGAAAAAGA TTCACTTTAT 2880

ATTATCAATG ATTTTAAAAC ATTTATTGAT TGCACGTATA TTCCGGAAGA TGATGTTAAA 2940

AAGCTTATTA CAAAAGGACC AGAACTTGGC TTGAACATTT TATTTGTCGG CATTCATAAA 3000

GAATTAAtAG ATGCTTATGA TAAACAGATT GATGTTGCAC GTAAAATGAT TAACCAATTT 3060

AGTATAGGTA TTCGTATTTC AGACCAACAA TTCTTTAAAT TTAGATTTAT TCAACGAGAA 3120

CCTGTTATTA AAGAAAATGA AGCATATATG GTCGCAAACC AAGCTTATCA AAAGATTAGA 3180

TGGTTTAAAT AGCAATGAAT TAAATAGGAG GGAGGTATGT TATGAATTTT AATGATATTG 3240

AAACAATGGT TAAGTCGAAA TTTAAAGATA TTAAAAAGCA TGCTGAAGAG ATTGCGCATG 3300

AAATTGAAGT TCGTTCTGGA TATTTAAGAA AAGCTGAACA ATATAAGCGA TTAGAATTTA 3360

ATTTGAGTTT TGCACTAGAT GATATTGAAA GCACAGCAAA GGACGTACAA ACTGCAAAAT 3420

CTAGTGCTAA TAAGGACAGT GTAACTGTTA AGGGAAAGGC GCCCAATACG TTATATATTG 3480

AAAAAAGAAA TTTGATGAAA CAAAAGCTTG AAATGTTGGG TGAAGATATC GATAAAAATA 3540

AAGAATCCCT CCAAAAAGCT AAGGAAATTG CTGGCGAAAA GGCAAGTGAA TA7TTTAATA 3600

AAGCAATGAA TTAATATTGA GGTGAAGATA TGGGTGGATA TAAAGGTATT AAAGCAGATG 3660

GTGGCAAGGT TGATCAAGCG AAACAATTAG CGGCAAAAAC AGCTAAAGAT ATTGAAGCAT 3720

GTCAAAAGCA AACGCAACAG CTCGCTGAGT ATATCGAAGG TAGTGATTGG GAAGGACAGT 3780

TCGCCAATAA GGTGAAAGAT GTGTTACTCA TTATGGCAAA GTTTCAAGAA GAATTAGTAC 3840

AACCGATGGC TGACCATCAA AAAGCAATTG ATAACTTAAG TCAAAATCTA GCGAAATACG 3900

ATACATTATC AATTAAGCAA GGGCTTGATA GGGTGAACCC ATGATGAAAG ATGTTAAGCG 3960

AATAGATTAT TTTTCTTACG AAGAATTAAC AATTTTAGGT GGTAGTAAAT TGCCTCTCGT 4020

AAATTTTGAA TTGTTTGATC CATCAAATTT TGAAGAAGCT AAAGCTGCTT TAATTGAAAA 4080

GGAATTAGTA ACAGAGAATG ACAAGTTAAC TGATGCAGGT TTTAAAGTGG CTACATTAGT 4140

CAGAGAGTAT ATTAGCGCCA TTGTAAATAT TCGAATTAAT GATATGTATT TTGCACCATT 4200

TAGCTATGAA AAAGATGAAT ATATTTTGTT AAGCCGGTTT AAAAATAATG GGTTTCAAAT 4260

ACGAATTATC AATAAAGACA TTGCATGGTG GTCGATTGTA CAATCATATC CTTTATTGAT 4320
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CTTAAATAAT GAAAGTATCG ATACGATTGG GCGTGTTTTA GAAATTGAAA TATACAATCA 4440 

TCAAGGTGAC CCTCAACAAA GTTTATATAA CATTTATGAA CAAAATGATT TGTTATTCAT 4500 

TCGATACCCA TTAAAAGATA AAGTGCTGAA TGTTCATATT GGTGTCATTA ATACATT 4557 

(2) INFORMATION FOR SEQ ID NO: 342: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3931 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 342: 

TTGAGTGACT TTATTGAAGC GCGTGTAGAA GAAATATTCT TCGAAGTATT TGATGTTTTA 60 

CAAGATTTAG GATTAACAAA AGTAAATGGT GGGTTTATTG TAACTGGTGG ATCTGCAAAC 120

TTACTTGGCG TAAAAGAATT ATTATCAGAT ATGGTAAGTG AAAAAGTTAG AATTCACACG 180

CCATCACAAA TGGGAATTAG AAAACCTGAA TTTTCTTCAG CAATTTCTAC AATTTCTAGT 240

AGTATCGCTT TTGATGAGTT ATTAGATTAT GTTACAATTA ATTATCATGa TAATGAAGAA 300

ACTGAAGAAG ATGTTATTGA TGTGAAAGAC AAAGATAACG AATCTAAATT AGGCGGaTTT 360 

GaTTGGTTTA AACGTAAAAC AAACAAAAAA GATACTCATG aAAATGAAGT AGAGTCAACA 420 

GATGAAGAAA TTTATCAATC AGAAGATAAT CATCAGGAAC ATAAACAGAA TCATGaACAT 480 

GTTCAAGACA AAGATAAAGA TAAAGAAGAA AGTAAATTCA AAAAACTAAT GAAATCTCTA 540 

TTTGAATGAT TATTGGCCAA TAAAACTAGG AGGAAATTTA AATGTTAGAA TTTGAACAAG 600 

GATTTAATCA TTTAGCGACT TTAAAGGTCA TTGGTGTAGG TGGTGGCGGT AACAACGCCG 660

TAAACCGAAT GATTGACCAC GGAATGAATA ATGTTGAATT TATCGCTATC AACACAGACG 720

GTCAAGCTTT AAACTTATCT AAAGCTGAAT CTAAAATCCA AATCGGTGAA AAATTAACAC 780

GTGGTTTAGG AGCAGGAGCT AATCCTGAAA TCGGTAAAAA AGCTGCAGAG GAATCTCGTG 840 

AACAAATTGA AGATGCAATC CAAGGTGCAG ACATGGTATT TGTTACTTCT GGTATGGGTG 900 

GCGGAACTGG TACTGGTGCA GCACCAGTCG TTGCTAAAAT TGCAAAAGAA ATGGGCGCAT 960

TAACTGTTGG TGTTGTAACT CGTCCATTTA GTTTTGAAGG ACGTAAACGT CAAACTCAAG 1020 

CTGCTGCTGG AGTAGAAGCT ATGAAAGCTG CAGTAGATAC ATTAATCGTT ATACCAAATG 1080 

ACCGTTTATT AGATATCGTT GACAAATCTA CGCCAATGAT GGAAGCATTT AAAGAAGCTG 1140

ACAACGTGTT ACGCCAAGGT GTACAAGGTA TCTCAGACTT AATCGCTGTT TCTGGTGAAG 1200 
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w 



GTATTGGTGT TTCTTCTGGT GAAAATAGAG CGGTAGAAGC TGCTAAAAAA GCAATCTCTT 1320

CTCCATTACT TGAAACATCT ATCGTTGGTG CACAAGGTGT GCTTATGAAT ATTACTGGTG 1380

GCGAGTCATT GTCATTATTT GAAGCACAAG AGGCTGCTGA TATTGTCCAA GATGCTGCAG 1440

ATGAAGACGT TAATATGATT TTCGGTACAG TTATTAATCC TGAATTACAA GATGAGATTG 1500

TTGTAACAGT TATTGCAACT GGTTTTGATG ACAAACCAAC ATCACATGGT CGTAAATCTG 1560

GTAGCACTGG ATTCGGAACA AGCGTAAATA CTTCTAGCAA TGCAACTTCT AAAGATGAAT 1620

CATTCACTTC AAATTCATCA AATGCACAAG CAACTGATAG TGTAAGTGAA AGAACACATA 1680

CAACTAAAGA AGATGATATT CCTAGCTTCA TTAGAAATAG AGAAGAAAGA CGTTCAAGAA 1740

GAACAAGACG TTAATCGGTT AATATATATA CACAAATAAT TCAACACAAA TCATCAGATA 1800

ACATATCTGA TGATTTTTTT ACTAATTTTT AGaACATGTA GAAGGACATT TAAGTTTTTC 1860

aAAGTTATTA AAAGTGTTTA AGTATCGTGT GAAAATTAAG TCaAAAATTA TTTGCGCAAC 1920

ATTTTAACTT TAAACATAAA TGTTATATTA TATAATTATT AACTTTGTAC AGTTAGACGA 1980

AGATAATTTA AATGAAATGA TGGTGACGAT CGAGTGAATG ATAATTTTAA AAAGCAACCG 2040

CATCATTTAA TATATGAAGA GTTATTACAA CAAGGTATTA CTCTAGGTAT TACAACTAGA 2100

GGAGATGGTT TAAGTGACTA TCCTAAAAAT GCTTTTAATA TGGCGAGATA TATTGATGAT 2160

CGCCCATATA ATATTACTCA ACATCAATTG CAATTAGCTG AAGAAATTGC GTTTGATAGA 2220

AAAAATTGGG TGTTTCCCAT TCAAACACAT GAAAATAAAG TCGCTTGTAT TACAAAGGAT 2280

GATATAGGCA CAAATATAGA CACTTTAACT GATGCGCTTC ATGGTATTGA TGCGATGTAC 2340

ACATATGATA GTAATGTCTT ATTAACGATG TGTTATGCAG ACTGTGTACC AGTATATTTT 2400

TATAGTACAA AACATCATTT TATTGCATTG GCGCATGCAG GTTGGCGTGG TACCTATACT 2460

GAAATTGTAA AAGAAGTGCT AAAACATGTG AACTTTGATT TGAAAGACTT ACATGTCGTT 2520

ATTGGACCAT CTACATCATC AAGTTATGAA ATTAATGATG ATATTAAAAA TAAATTTGAA 2580

ACATTGCCAA TTGATAGTGC CAACTATATT GAAACTAGAG GACGAGATCG TCATGGTATT 2640

GATTTGAAAA AAGCCAATGC TGCATTATTA ATTTATTATG GTGTTCCTAA AGAAAATATT 2700

TATACGACAG CGTATGCTAC ATCTGAACAT TTAGAATTAT TTTTCTCTTA TCGATTAGAA 2760

AAAGGTCAAA CAGGACGCAT GTTAGCATTC ATTGGTCAAC AGTAAACAAG GAGGAGATAT 2820

GTTTGCGTGT GAAAGATAAT TTACAACAAA TCTCAACACA AATTAATGAC AAAAGTGAAA 2880

AAAATAATTT TTCAACAAAA CCAAACGTGA TTGCAGTTAC AAAATATGTT ACAATAGAGC 2940

GAGCTAAAGA AGCGTATGAG GCTGGAATAA GACATTTTGG TGAGAATAGA TTGGAAGGCT 3000
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AATCTCGAAA AGTTAAGGAC GTTATAAACG ACGTAGATTA TTTCCATGCT TTAGATCGAT 3120 

TGAGCTTAGC CAAAGAAATT AACAAACGTG CAGAACATAA AATTAAATGT TTCTTGCAAG 3180 

TGAACGTTTC GGGAGAAGCT TCTAAACATG GTATTGCTTT AGAAGATGTT GATCAGTTTA 3240 

TAGATGATCT TAAAAAATAT GACAAAATCG AAATTGTAGG TTTAATGACG ATGGCACCAT 3300 

TGACAGATGA TGAAGCATAT ATTAGATCGT TATTTAAACA GTTACGTTTG AAAAAAGAAG 3360 

AAATACAACG ACTCAATTTA GAATATGCGC CTTGTGATGA ATTATCAATG GGAATGAGTA 3420 

ATGACTATCT TATTGCAGTT GAAGAAGGTG CGACGTTTGT TAGAATTGGG ACTAAACTTG 3480 

TAGGAGAAGA GGAGTGAGCC ACTTGGCTTT AAAAGATTTA TTTAGTGGAT TTTTTGTAAT 3540 

AGATGATGAA GAGGAAGTAG AAGTACCTGA CAAACAACAA CAGGTAAATG AAGCGCCAGC 3600 

AAAAGAGCAG TCACAACAAA CAACAAAACA AAACGCAATC AAATCAGTCC CTCAAAAATC 3660 

TGCATCAAGA TATACAACAA CGTCAGAAGA AAGGAATAAC CGTATGTCTA ATTATTCAAA 3720

AAATAATTCA CGTAATGTTG TAACTATGAA CAATGCTACA CCAAACAATG CATCACAAGA 3780

AAGTTCAAAA ATGTGTTTAT TCGAACCACG TGTTTTTTCA GATACACAAG ATATTGCTGA 3840

TGAGCTTAAA AACCGCCGTG CGACACTTGT CAATTTACAA CGTaTTGATA AAGTATCAGC 3900

GAAAAGAATT ATTGATTTTT TAAGCGGTAC T 3931
(2) INFORMATION FOR SEQ ID NO: 343: 
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30 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 343:

AATTGTCGGG GGACTCTTAG GTTTTGTCAT GCAAAGAACA AGATTTTGTT TAACAGGTGG 60 

CTTTCGAGAT ATGTATGTGC AAAAGAATAA TAAGATGTTC TATGCATTAT TAATCGCTAT 120 

TACTATTCAA AGTATAGGAT TATTGATTTT GACGGCAACA GATATTTTAC AAATTCCTGC 180 

ACATAGTTTT CCAATATTGG GAACAATTAT AGGTTCTTTT ATTTTTGGAA TTGGAATAGT 240

ATTGGCTGGA GGATGTGCAA CAGGtACTTG GTATCGCGCT GGTGAAGGGC TAATTGGTAG 300 

TTGGATTGCA TTAGTATTAT ATGCTGTTAC TGCAGCAATC ACTAAAACAG GGATTTTAAA 360 

GCCAGTAATG GATAAAATTA ATCAACCAAC GAATGTAAAT AGTGATATGT CTCAAACAAC 420

TGGCATTCCG TTTTGGGGAT TAGTCGTTAT ATTAACTATA ATCACCATTT TTCTAGTTGT 480
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AGGTATTAGA TATTACCTTT TCGAAAAACG ATACCATCCA T7TATTGCAG CAATTGTAAT 600 

TGGACTTATC GCACTCTTAG CTTGGCCAAT GAGTGCATCA ACTGGAAGAA ATGACGGTTT 660 

AGGTATAACA ACGCCTTCAG CAAATTTAGT ACACTTTTTG ATTACAGGTG AAACTAAATT 720 

TATTGATTGG GGTGTCTTTT TAGTTCTAGG AATTTTCATT GGTTCATATA TTGCAGCTAG 780 

AGGATCAAGA GAATTTAAAT GGCGATTGCC AGACAAGATT ACAATACGAA ACAGTGCCAT 840 

TGGTGGCATA TGTATGGGAT TTGGTGCGTC AGTTGCTGGT GGTTGTTCTA TCGGTAACGG 900 

TTTGGTTGAA ACGGCAACGA TGACTTGGCA AGGATGGATT GCGCTAgCAT gCGATGATAG 960 

TTGGTGTATG GACAATGAGT CATTTTATCT TTGTTCGTCC AATGAAAAAA GTACACCAAC 1020 

AATCTGCAAA GGTTAAACAG CAAACGCAAA TAGTATAGAA GATTATTATG CAAATGATGT 1080 

TGATCAAATA AAAGTGATTG GAAAAGGAGA AATAATTATG ATACACGAAT TAGGTACAGT 1140 

AGGAATGGTA TGTCCATTTC CGTTAATTGA AGCGCAAAAG AAAATGGCAA CATTGCAATC 1200

TGGAGATGAA TTAAAAATTG ATTTTGATTG CACGCAAGCG ACGGAAGCCA TTCCAAATTG 1260 

GGCTGCAGAA AATGGTTATC CTGTAACAAA CTATGAACAA ATTGATAATG CTTCATGGAC 1320 

AATTACAATT CAAAAAGTTT AACGTTATCA TTTT^ACAAT AAAATAGATA TTAGATTCTA 1380

TGGCTACTTC CGCTAATTTA AAAGTGAGTA AGTAGTCTTT TTTTTTTTAG TTCATGAAAT 1440 

CATTTTTATA TAGTGTGGCA CATTTTATTC CAAAAGATGT AATAAAACTT AACGCATTTT 1500 

TGCTTTTTAT AAATTGTCAG ATTATTATGA AAAAAAGGGA GTGGTAAGTA TGAATCTTAA 1560 

CGATACGATA TTTATGTTTT TGTGTACATT ATTAGTTTGG TTAATGACAC CAGGATTAAG 1620 

TTTATTTTAT GGTGGGTTAG TTCAATCTAA AAATGCGCTT AATACTGTCA TGCAAAGTAT 1680 

GGCAGCAATT GTGCTTGTTA CATTTGTATG GATAACAGTT GGTTTTACAA TTAGTTTTGG 1740 

GAATGGGAAT TTATGGTTCG GAAATTGGGA ATATACTTTT CTTAATCATG TAGGTTTTGC 1800 

GACTCAAGAA GATATTAGCC CACATATTCC TTTCGCTTTG TTTATGTTAT TTCAAATGAT 1860

GTTTTGTACG ATTGCAATTT CTATTTTATC TGGTTCAATC GCTGAGAAAA TGAAGTTTAT 1920 

TCCTTATTTA TTATTCGTAG TAATATGGAC TGCTCTTGTA TACAGTCCAG TAGCACATTG 1980 

GGTTTGGGGC GGCGGTTGGA TTAACAAACT CGGTGTATTA GATTTCGCTG GAGGTACGGT 2040

TGTTCATATT ACATCAGGTG TTTCTGGTTT AGTATTAGCT ATTATGATTG GAAAAGGAAA 2100 

CAAACATTCT GAATCAACAC CACATAATCT TATCArrACG TTGATTGGCG GTATATTCGT 2160 

GTGGATTGGT TGGTATGGAT TTAATGTAGG TAGTGCTTTT ACATTTGATA ATATTGCGAT 2220 

GCTTGCATTT ACAAATACTG TCATTTCAGC CAGTGCAGGT GCTATAGGTT GGTTAATTTT 2280 
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70 



ATTAGTTGTC ATTACTCCTG CAGCAGGATA TGTAACATAT CTTAGTGCAA CAATAATGGC 2400 
TTTAATAGGA GGTATCTGTT GTTATATTGT CATTAATTAC ATCAAGGTAA AACTAAAATA 2460

TCATGATGCA TTAGATGCAT TTGGTATTCA TGGTGTTGGT GGTATTATTG GTGCTGTTTT 2520 

AACAGCAGTT TTCCAAAGTA AAAAAGCCAA TCCTGACATT GAGAATGGCT TTATTTATAC 2580 

TGGTGACATA CATATTATAC TTGTACAAAT ATTATGTGTA ACAGCAGTTG TAATTTTTAG 2640 

TATAGTCATG ACGTTTATTA TTGCGAAAGT AATTAAATTa ATTACACCAT TATCTGTTAC 2700 

GGAACAAGAA ACGAATATAG GATTAGACAA GATTGTTCaC GGTGAACATG CTTACTTTGA 2760 

AGGTGAGCTA AATAGATTCA ATAAACATAT TCGATATTAG AATATATTTA CATAGAATAT 2820

TCATTGTCCT GACATTTAAC TAAAGGTTGA TGTTGGGACA TTTTGTTATA CAAAAGTTTT 2880 

ATTTTGAAAT CTTTTTATGA AAGAAGCAGA AATATTATTT AAAGCGGTTA CACATATGCT 2940

AAAATAAGGC TAAGTGTCAC AAATAATGAT AGGTGAATAA GTATGAAAAA TATATCTGAT 3000

ATTGCCAAAT TGGCAGGCGT TTCAAAAAGT ACAGTATCTA GATTTTTAAA TAATGGATCT 3060 

GTCAGTAAAA AAACAAGTGA AAAATTAACA AGAATTATAG CAGAACATGA CTATCAACCG 3120 

AATCAATTTG CTCAAAGTTT AAGAGCGAGA 3150

(2) INFORMATION FOR SEQ ID NO: 344:
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ii) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3719 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 344: 

GTTATAGTGA AATTGACTCA TCACATTTCA CAGACCGTGA CAAACGCGTT ATTAGACGTG 60 

ATCATGTTAA AGAAGCACAA AGCTTAGTAG AGAACTATAA AGATACACAA AGTGCTGATG 120

CTAGGATGAA AGCCAAACAA AAAGTTAACA CATTAAGCAA ACCGCATCAA AACTATTTCA 180 

ATAAACAAAT TGATAAGGTT TATAATGGAT TACAACGCTA ATCCAAAGTA AATTATAAGT 240 

TATACATCTC GTTTTTAAAT GACAATTTAT CCCCGTAAAT ATTATAAATA ATCTTTTCAA 300 

ATTCCACATA GATATAGAGA CACTAATAAA CCTCTTTGTC TCGATATGAT AGTCTGCAAC 360 

GATTCATGTT GTAGGCTTTT TAATTTTACA AATAAGGCTA AATATATAAG TTCTGGCACC 420 

TAAAATATAG AAAATACATA AAAGTAAGTA TAGTTATTTT ATTATAATTA TTAAATTTTT 480

ATTAATTAAT TGTAAAAATG TATAATTATA ATTAATTAAC GTTTAATATT AAAATTAACT 540
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ATCGTTTCAA TATTACTTAT AGGGATGGCT ATCAGTAATG TTTCGAAAGG GCAATACGCA 660 

AAGAGGTTTT TCTATTTCGC TACTAGTTGT TTAGTGTTAa CTTTAGTTGT AGTTTCAAGT 720 

CTAAGTAGCT CAGCAAATGC ATCACAAACA GATAATGGCG TAAATAGAAG TGGTTCTGAA 780

GATCCAACAG TATATAGTGC AcTTnCAACT AAAAAATTAC ATAAAGAACC TGCGACATTA 840 

ATTAAAGCGA TTGATGGTGA TACGGTTAAA TTAATGTACA AAGGTCAACC AATGACATTC 900 

AGACTATTAT TGGTTGATAC ACCTGAAACA AAGCATCCTA AAAAAGGTGT AGAGAAATAT 960 

GGTCCTGAAG CAAGTGCATT TACGAAAAAA ATGGTAGAAA ATGCAAAGAA AATTGAAGTC 1020 

GAGTTTGACA AAGGTCAAAG AACTGATAAA TATGGACGTG GCTTAGCGTA TATTTATGCT 1080
GATGGAAAAA TGGTAAACGA AGCTTTAGTT CGTCAAGGCT TGGCTAAAGT TGCTTATGTT 1140

TATAAACCTA ACAATACACA TGAACAACTT TTAAGAAAAA GTGAAGCACA AGCAAAAAAA 1200 

GAGAAATTAA ATATTTGGAG CGAAGACAAC GCTGATTCAG GTCAATAATG CTCATTGTAA 1260

AAGTGTCACT GCTGCTAGTG GCACTTTTAT AATTTTTAGA TCACGATATG ATTTATTATC 1320 

AATTCAGAAT TAAAAAAGTA AATAGTATCA AAAGTAAGTG TATTTAATAT TAGAAAATAA 1380 
AAATTTTAAA TTTAGTATTA AAATGGAATG TTACTATATA GTTCAATGTG TATTATCACA 1440

GAAAATAAAA TAATGCTTTA CTTCTATATT TAAAAGTGTA TAATGAAAGT TAAGTAATAA 1500 

AGAGCGTGAA GAAAAATGTG AGTTATTTAT ATAGAATATT CTCCTTTTCA TTTATGAATT 1560 

TGTTACAAAA TATTTAGTGC AAAAGCACGA CGGAGGTATT CAATATGaAT AACGGTACAG 1620

TTAAATGGTT TAATGCAGAA AAAGGTTTTG GTTTCATCGa AAGAGAAGAT GGTAGCGACG 1680

TATTCgTACA CTTCtCAGCA ATCGCTGAAG ATGGATACAA ATCATTAGAA GAAGGCCAAA 1740

AAGTTGAATT CGACATCGTT GAAGGCGACC GTGGCGAGCA AGCTGCAAAC GTAGTTAAAA 1800 

TGTAATTTTA ACTTATTCAA ACAGTCCTTA CTATAGGGCT GTTTTTTTAT GCTTTAAATC 1860 

GATAACAGTT GGTGTGGTAA AAGCACTAGC CGTTATTTTT TTGTCCAATA AATTTAGTTG 1920

GAGATTTAAC AATATATAAT GGTTCTAAAA TAAATCGAAC TGATGGAAAA GTTTTTTACT 1980 

TTTCATCTGT CCGACTTTTG ATTTTGAATA TAAAAAAGCG CCAATACAGA ACTTTAATAA 2040

TGACGAGAAT TAAAGTCTGT ATATGGCGAT AACAAGAAGT AATGTTAAAC ACTCAAAATG 2100 

TTTAACAATA ATAGGATACC ACATCGCATA ATATCTTACT ACTTAATTAA TAATTTAACT 2160 

AATCAACTTT TTGTTAATTT TTTATTAAGA CTGATTAATT ATTGAGAATA TTTATTGTTT 2220 

TTAAAATCTC ATAATAATTC AGTAATCTTG TTTTCATTTA AAAGGCGAAA CATTAAAATA 2280 

ATTAAATAAA AATATTGCGT TTAATTTACA GCGTCAAATA TACTTATTTC TAATGCTTTG 2340
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ATTTCTGTGA GTATTTGGAA GCTACCATTA GGCAACGGTT TAACAATAGA CAATTGCTTT 2460

TCCGCTTGTT GTATTAAAAA AGGTTTTGTA GATTGATTAT TAATATGCCA TTCACTCATG 2520

TATGTTTTTC ACTCCTGCTT TAAAATAGGG TTAGAAAGTT TATAGTTGAG ACATTCATGT 2580

TCAACCAAAA TTTTGTTCGA ATTCAATAAA TGTCTTGTTT AAAATAGAAA TATTGTAAAT 2640

GTTATCGTCC AAAACTTCAC CAGTTAAGTA TTTGTTTTGA ATTAAAATTT GGCAGTTAGT 2700

TAAGAAGTCT TGATAATCAC GATCGCAAAA ATAGTTTTCA CGTGCATCTT TAGCATCGCC 2760

AAAAAAGTTA GCGACTGTTT CTGTTTCTCC TTTATTCGAA CGTTCAATAT ATAATTTGTA 2820

AAATTTAGCT ATTGTATACT TTTGTTCTTT AGTTAGTTCA TTCAAAATAT TGGGCCTCCT 2880

GAAATATCAT TTGTAATCTA TACCCAATTT ATTGCAAAAC AAAAACTAAT TTAACTATTT 2940

GATGAAACTG TGTTAATAAg CTTTAACAAG CCTTAGTTTG TATGGATCTA TAAAATTATC 3000

TTTAATTGCA TAGGGTGAAA TAATATGTAG TCCATAACTT TTAACTGATT TTTCACTTAC 3060

ACCAAATTTA TAAGCTTGGT AGATAATTTT AGTACAATAC GTAAATTTTT TGCTGTTCAA 3120

ATTTAATGTA ACTAGATAAC GATGATTTGT ATTCTCATAG TTTTTCTTAA CCCATTCAGC 3180

CGCTTTTTTA CCTGCACCAG GATAGCTGCA ACGATAAACT TTCATCCAAT CATTTTTGCC 3240

ACTTGCATAA TTATATTTAA AAGATTCGAA GGATTGTGTA GTTGGTTTGT CGCCAGGCCC 3300

CTCAATTTGT AAAATCGTTT TATCATCAAT CGCGATACTA CAATGACCAA AAAATCscCA 3360

CATGACAGGG CCTTTTGTAA CAATAATATC ACCAGGTTGT AATTGGAATT TGTCATCTTG 3420

AATTTCTGAA TACTTATTAT CTGCAATTGT TTTTGGTGAG TTTATTGGGG ATACGACAAC 3480

GAATAATATA AGTAAAATTA TCGTTCGTTT AATATAGTTC ACTTAAAAGC TCCTTGTTGA 3540

AGAAATATAT GTAAATAGTC TTAAATTAGA ATTGTAATCT TTAATAAGCT TGtAAGACTA 3600

AAACATATCT TAAATATTAA AGTATGAGAG TGTGAAATGT CTATTAAGAA TnAAAAACAG 3660

TCTGAAACAT CATTGAGACG TTCCAGACTG GATATAAAAT GAATTTCATT TATAGCACA 3719



(2) INFORMATION FOR SEQ ID NO: 345: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1676 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 345:
TTGCGTTGCC GCACCAAGAT ATTGAATGCC TAGCGATTCC GAGTATGCAA ACTGAACGCA 60 
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CTCTTTAATA CGCGTTATCG CTTTTTGTAA ATCTGCATCA TGATAACAAA TCATAACGCC 
ATAGCCACCT GCTGTCGGAA GATCATCTCC CGGCTTAATT ACTAACGGGA ATTCCCAATT 
CTTAATCTCG TTTTCGAATT GCTCAATTTT TACAACTTTT CTTTTTGGTA AAAACTTCCC 
ATTTGTCCAT TCAGGTATTC TTGCTTTATT ATTTAAAGCA ACAAATAACG TTTTATCTAA 
TGCATAATAT TGCTGATTCA AGATTGTTTC ATCATGAATA TATTGAAAAT AAATCTTTTT 
ATTTTCCTTA TGTGCCAATT GTTTGATCAA GTTTTCGTAA GATTGCTGAT TGTTAAATGT 
ATAAATTGAG TTCGGtACTT CCTTACCAAT AACTTGAAAT AGCTGATGCA ATTTGTCTGT 
CGCACTAGCT TCGTGAACAA TAACAGGTAA TTGATTTGCT ATTAATAACT CCCTACCAGT 
TAAAAAATTA GATTGATGTT CGTCCGGTTT CAACCATGGA TTCGATATAT ACGAAGGTCT 
TGACGTATAG ACAACATCTT TGTCATATAA ATCACTTAAC GTTAAGTTCG GCTCATTACC 
ATTATTTGTC ATTACTTCCC ATTCCCTTTC AAATGCGCAT GCTCTTCAAT AATGTCTTGA 
TAAACGTCTT GATTTGTAAT TAACTCTAAC CCCATCAACG CCATTATTTT AGCGCCTTTA 
ATTAATGCTT CATCACCATG TACACTCGCA GCCGCTTCTC TAAATCTATG CGTATGTCCT 
ACTAAATTAC GTGATCCTAT TTTAATATGA GGATGTATTG TTGGCACAaC ATgaCTTACG 
TTCCCTGTAT CCGTAGAGCC ATAACCAAAA TCATCATCAA TAACTGCTTC ACCAACTTCT 
TCAGCATATT TAGCAAATAA ATCATCTAAT TTCGGCGTTT TAATGAATTC ATTCACACCG 
TTTTGAATTC GACCAAATTC ATAATCACAA CCAGTCTGTA TCGCAGCTCC ACGTGCGATT 
TGATTTACTT TTTCTGTTAA TATATCCAAT TCTTTACGCG TCATTGCTCT AGTATAAAAA 
CGAGCATGTG TATAGTCTGG AATAATATTA GCTGCTTTCC CGCCATCTAA AATCACACCA 
TGCACACGTT GATCTTTTTT AATATGTTGT CGTAGTTGTG CTACACCATT AAAATAACTA 
ATCATAGCGT CTAATGCATT TAACGCTTCA TCTGCATTTT CAGAGGCATG AGCACTTTTT 
CCGTAAAATT TAACATCTAA AACATCCACT GCCAAAGTAT CAATCGTTTT ATAAGTTTCA 
TTTCCCGGAT GAATCATTAA GGCAATGTCT ATTTGATCAA TCACACCAGC CTTGACATAA 
GAAGCTTTAG CGCTACCATT TTCCCCACCT TCTTCAGCTG GACATCCAAG AACGACTACT 
TTACCACCAA TTTGGTCAAT CACTTGCTTC AAACCAATTG CACCAAGAAC ACTTGCAGTT 
CCAATGATAT TATGACCACA AGCATGACCC AATCCTGGCA AAGCATCGTA TTCTGC 
(2) INFORMATION FOR SEQ ID NO: 346: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1294 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(Xi) SEQUENCE DESCRIPTION : SEQ ID NO: 346: 

TACAGTAGGA ATCATAAAAC CTAATACAAC AAATACAAAA CCATTTAAGG CATAACTAAA 60 

TGTGTTCCAA ATTTGATGGT AATTCATTTG TAGTTCCGTT TGTGCTCTAA TTAAACGGTC 120 

GCGTTCTAAA CCATGGATTA GACCTGCGAT TACAACTGCA ATGATACCTG AAGCATGAAC 180 

TTCTTCTGCT AAAAAGTATA CGACAAAAGG AGTTAATAAT TGAATAAAAG TTAAGGTATT 240 

GTTATCTTTT AAACCTTTAT TAGCGGTTAA GTCTATACGT ATTCTAACGA CAACGAATCC 300 

AATAATTGCA CCAATAAGTA CACCTAGTAT TGTTGAAATG ATAAATTGTT CAACAGCTTG 360

GAATAATGAA AAGGTACCAG TTACTAATGC AGTAACAGCA ATTTTAAATG AAATGATACC 420 

TGCTGCATCA TTGAGTAAAG ATTCACCTTC TAAAATCGTC ATAGAACCTT TAGGTAATAA 480 

TTTTCCGCGT GTAATAGCAG ATACTGCTAC TGCATCAGTA GGACATAAAA TTGCTGCTAT 540 

TGCAAAAGCG GCTGGCATTG GTAAGGCAGG CCAAATCCAA TGTATAAAAT AGCCAACACC 600 

GACTACAGTT GCAAACACTA GTGCCATTGA CATTAATAGT ATAGGTTTAC GATATTCTAA 660 

TAATTTTGTT CGAGAGACGT GGGTACCTTC CACAAAAAGT AGTGGCGCGA TAACGGCAAA 720 

CATAAATACT TCAGAATTGA ATTGGAAATC AACTTGTATT GGAATAATGA AAATAACGAC 780 

ACCTAATGCA ATTTGAATAA AGGCAGTAGG AATTTGTGGG AATCGATTAT TGATAACCGA 840

ACTAATAATC ACAGCAAAAA TAAAAATTAA AAATGCTTCT AATAGTGCCA TACAATACTC 900 

CTCAAAATTT TAATAGTTAA TATTTTATCA CTTTTAAGGC ATAATGACAT AGATATATTG 960 

ATAAAATGAA GTTATTTTCA AAAAAACTCT AGTATCGGTT GAACTGATAC TAGAGCGAGA 1020

TGTTTAAATT ATTGATTGTc ATATCTGAAA TGACCGCTGT CATTTTGTCG TTGTTCATAC 1080 

GCGAGCTTTT CAGCATTCGT TTTGTATTTT TtATAAAAGa AAAATAaAAA TATnAACCaG 1140 

AATGGCGAAA TATAAATAGC TGCTCTTGtT TCGTCACTAA AGAATAATAA AATGAATACA 1200 

AAGAAGAAGA ACGCTAGAAT AATGTAAGCA ATAGGCTTAC CACCAATCAA CTTAAATTTA 1260

CTGTTTTTAT GTGCCTCAGG ATGCTTTTTC AAAT 1294

(2) INFORMATION FOR SEQ ID NO: 347: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1935 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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ACATGATAAT GATGACGCTA TTAAAACACG TTTTTTATTT TTCATTGTTA TAACCTTCTT 60

TCGTATGATT GATATTTGTT GATATGTATC GACATGTGAA TAATATCACA AAAACAGAGA 120

ATATATATTT AACTATTTAT TAAATGATTT TGTTAATATT ATTAAATACT TTATCCTCTT 180 

TAAAAATAAT GTGTGTACAA AGTCATTAAT TTAGCAAATA TTTTTATTTA GTAGTTAATA 240 

ACCATCGATT TGAAATTTAT ATATAATTAT TAGCTAAATA ATATCCTGCA TCTTTCTCAT 300 

ACAATTTACT ATAAAtTAgC ATATCCGATA TCAGCGTTAA TAAGATCGTT GATACTAGmC 360 

AGTTAATTTC ATAGAACGAA ATCAAATAAC ACACTACTTT CTGCATTTTA AATTATGTTT 420 
AAGAATCAnA ATTATGTTTA nATAAATATA TATACTACTT TGAAAGGTGT GAGCTTAATG 480

ACAACTTTTA GTGAAAAAGA AAAAATTCAA TTACTAGCAG ATATTGTTGA ACTACAAACT 540 

GAAAATAATA ATGAAATAGA CGTTTGTAAT TATTTAACAG ATTTATTCGA CAAGTACGAT 600 

ATTAAATCTG AAATTTTGAA AGTTAATGAA CACCGCGCCA ATATCGTTGC AGAAATCGGT 660

AACGGCTCAC CTATACTCGC ATTGAGTGGT CATATGGATG TTGTTGATGC AGGAAATCAA 720 

GATAATTGGT CATATCCCCC TTTTCAACTG ACAGAAAAAG ATGGCAAATT ATATGGCCGA 780

GGCACTACAG ATATGAAAGG CGGTTTAATG GCTTTGGTCG TATCTCTAAT CGAATTAAAA 840

GAACAAAATG AATTGCCTCA TGGAACGATT AGATTACTGG CTACTGCTGG CGAAGAGAAA 900 

GAACAAGAAG GTGCCAAATT ATTAGCTGAT AAAGGCTATT TAGACGATGT CGATGGCTTA 960 

ATTATTGCTG AACCAACTGG ATCTGGAATT TATTATGCAC ATAAGGGGTC TATGTCATGT 1020 

AAAGTAACTG CAACTGGTAA AGCTGTCCAT AGCTCAGTTC CATTTATTGG TGACAATGCA 1080

ATTGATACAC TGCTTGAATT TTATAATCTA TTTAAAGAAA AATATTCAGA GCTTAAACAA 1140

CAAGATACTA AACATGAATT AGATGTTGCG CCTATGTTCA AATCATTGAT TGGAAAAGAA 1200 

ATTTCTGAAG AGGATGCAAA TTATGCATCT GGTCTTACAG CTGTATGTTC GATTATAAAT 1260 

GGCGGcAAAC AATTTAACTC TGTACCAGAT GAAGCTTCAC TTGAATTTAA CGTAAGACCA 1320 

GTTCCTGAGT ATGATAACGA CTTTATAGAA TCGTTTTTCC AAAATATCAT TAATGATGTG 1380 

GATAGCAATA AGCTTTCACT CGATATTCCA AGCAATCACC GACCTGTAAC AAGCGATAAA 1440 

AATAGCAAAT TAATTACTAC GATTAAAGAT GTAGCTTCTA GTTATGTAGA ACAAGACGAA 1500 

ATATTTGTTT CAGCGCTTGT AGGCGCAACA GATGCCTCTA GTTTCTTAGG AGATAATAAG 1560 

GACAATGTTG ATTTAGCCAT TTTTGGACCA GGTAATCCAT TAATGGCACA TCAAATCGAT 1620

GAATATATTG AAAAAGATAT GTATCTGAAA TATATTGATA TTTTTAAAGA GGCTTCCATT 1680 

CAATATTTAA AAGAAAAATA AGAACGATGC TGTCAGCTGC CCTATTCGCG TGCTGGCAGT 1740 
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TCAAATATCA ACAAGCACAT TTTCATTGAT TAAGTGATGT AAAACTGAAA TTATTGTGCT 1860

GATTTGTCAT ACATATATTG ACTAATGGGC ATATAAAAAG ATAGCCTCTA ATAGTnACAT 1920 

AAACTCGTAA AAnCC 1935

(2) INFORMATION FOR SEQ ID NO: 348: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1351 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 348: 

CCTTTnCCTA AACAATTTTT AGATTTAGAC AACAAACCGA TTTTAATCCA TACATTAGAA 60 

AnATTTATTT TAATTAATGA TTTTGAAAAA ATTATTATCG CGACGCCACn ACAATGGATG 120

ACGCATACGA AAGATACACT TAGAAAATTC AAAATTTCTG ATGAAAGAAT TGAAGTCATT 180 

CAAGGTGGTA GCGATCGTAA CGATACAATT ATGAATATCG TTAAACATAT TGAATCAACA 240 

AATGGTATTA ACGATGACGA TGTCATTGTG ACACATGATG CAGTTAGACC ATTTTTAACG 300

CATCGTATTA TTAAAGAAAA TATTCAAGCT GCTTTAGAGT ACGGTGCAGT AGATACAGTG 360 

ATTGATGCTA TAGATACGAT TGTTACATCT AAAGATAATC AAACGATTGA TGCAATTCCA 420 

GtGCGTAATG AAATGTACCA AGGTCAAACA CCTCAATCGT TTAATATTaA TTTATTAAAA 480

GAaAGCTATG CACAGTTGAG TGATGAGCAA AAGAGTATTT TATCTGATGC TTGTAAGATT 540 

ATTGTAGAAA CAAACAAACC GGTTCGACTT GTAAAAGGTG AGTTATATAA CATTAAAGTA 600

ACAACACCTT ACGATTTAAA AGTAGCGAAT GCTATTATTC GAGGTGGTAT TGCCGATGAT 660 

TAATCAAGTA TATCAATTAG TTGCACCTAG ACAATTTGAA GTTACGTATA ACAACGTAGA 720 

TATTTACAGT GACTATGTCA TTGTACGTCC TTTATATATG TCAATTTGTG CTGCCGATCA 780 

AAGATATTAT ACTGGTAGCC GTGATGAGAA TGTCTTATCT CAGAAATTGC CAATGTCTTT 840

AATTCATGAA GGTGTTGGTG AGGTCGTATT TGACAGTAAA GGTGTGTTTA ATAAAGGTAC 900 

AAAAGTAGTT ATGGTACCGA ATACGCCGAC AGAAAAAGAC GATGTCATTG CTGAAAACTA 960 

TTTAAAATCG AGCTACTTCA GATCAAGTGG ACATGATGGG TTTATGCAAG ATTTTGTGTT 1020 

GCTAAATCAT GATAGAGCTG TACCACTACC TGATGATATT GATTTAAGTA TTATTTCATA 1080

TACAGAGCTT GTAACAGTAA GTTTGCATGC TATTCGTCGT TTTGAAAAGA AATCTATTTC 1140

AAATAaAAAT ACATTTGGTA TTTGGGGTGA TGGTAACTTA ggTTACATTA CAGCCATTTT 1200 
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GAGTCACTTC TCATTTGTTG ATGATGTCTT CTTTATTAAT AAAATACCTG AAGGCTTAAC 1320

ATTTGATCAT GCATTTGAGT GTGTGGGTGG T 1351


(2) INFORMATION FOR SEQ ID NO: 349:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 411 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 349:

TCATCAAGTC TACGATAAAT TAAGTCCATA TCTAAAGGCT CGGGGTCGAC AGTTTGTAAA 60

GTATAACCAA CTGCACAGTG GCTACAACGC ATATTACAAA GATTTGTAGT TGTAAATTCG 120

ATGTTACTTA AAGTTAATTG GCCATGTTCT TTAACATCGT TATATGCTTC CCATGGGTCG 180

TTTTGAATAC TTATTTTAGG CTTGTTATTA CGCATTTTAT AAACTCCTTA ATTGTTATTT 240

GATACCAATT TGATACCGTT TAATCAAATA TGCTCATAGC TTGATGTTTT TTATCAGTAT 300

ATAAATGAGA GTACGTTTGA ATTGTTTCTG TAATGTTAGA ATGCCTCATT AATTCCATTA 360

ATAAATACAT ATCTACACCA TTATTAATTA AaTAGCTAGC GTACGAGTGT G 411


(2) INFORMATION FOR SEQ ID NO: 350:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1639 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 350:

TCATTTTCAT AGGTTATTAC GCAGATCAGC ATAATAATCC ATTCCATATG AGTcCTTATT 60

TTGGTTATGC AGCACGTCTA TTGGCAACAA GTGGCATTGA CTATACGTAT GTAAGAATGG 120

CAATGTACAT GGATCCACTT AAACCATATT TACCAGAATT GATGnATATG CATAAACTGA 180

TTTATCCnGC TGGCGATGGT CGTATTAATT ATATTACTAG AAATGATATT GCTAGAGGTG 240

TCATTGCTAT TATTAAAAAT CCAGATACTT GGGGCAAACG CTACTTATTA TCAGGCTACA 300

GTTATGATAT GAAAGAACTT GCTGCAATTT TATCTGAGGC ATCaGGCACA GAAATTAAAT 360

ATGAGCCCGT TTCATTAGAG ACATTTGCAG AAATGTATGA TGAACCTAAA GGCTTTGGTG 420

CATTATTGGC ATCAATGTAC GACGCAGGAG CAAGAGGACT ATTAGACCAA GAaTCCAATk 480
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TTAATAATAA 


AGGAGCGTTA 


TAGTGAATAT 


CATCTCAACA 


ATTtTAATCA 


TATTTGTGGC 


ATTAGAGTTT 


TTCTATATTA 


TGTACCTTGA AACGATTGCT 


ACAACTTCCA AAAAGACTAG 


CGAGACATTT 


AATATAAGCG 


TCGATAAATT 


GAAAGACAAA 


AATATTAACC 


TACTTTTGAA 


GAACCAAGGC 


GTATATAACG 


GTTTAATCGG 


AGTTTTGCTA 


ATATACGGTT 


TGTTTATCAG 


CAGTAATCCA 


AAAGAAATAT 


GCGCAGCTAT 


TTTAGTGTAT 


ATCATTGGCG 


TTGCTATTTA 


TGGTGGCCTT 


TCAAGCAATA 


TTAGTATCTT 


TTTCAAACAA 


GGCACATTGC 


CAGTATTGGC 


ACTCATATCA 


ATGCTTTGGT 


AAGTATTGGT 


GTTTGGGGGG 


GTGGAGATGT AGTCGGAGGT 


TTGGAGGATT 


TGAGCGAATT 


GTGTGTGGAC 


TTTAGACTCA 


GAGTATTTCA 


TCCTAATTAT 


TTCAAGCAGA 


GGTGACAGTA 


GCGTTGCCTC 


TGTTTCCTTA 


TAAAAAAATT 


ATTTAATGAA 


GAAAACCCAT 


ATCTGATTTA 


ATTTTCAGCT 


GATAAATACT 


CCATATATTA 


GAATGGCTAC 


TTTATCTATT 


GCATCAATCC 


TTTAAAACAA 


AAAACCCATG 


ATTTCGAAAT 


TCCCGTATGA 


TGGGGTTCCT 


ACTCTCATGG 


ATCAGTTAAA 


TAAATATTAT 


CACTATCAGT 


TTATTATTTC 


AATATTATTA 


ACAATATATG 


TAGTCGTAAA 


AGGAAAGAGG 


ACATGAGAAC 


TTCGGTGTTG 


ATTGGCATTA 


CATAACGCTT 


CCAAACATAT 


TATTTGGTAA 


CAATAAGAAA 


CTATTTACAC 


AATATATTTT 


GTATAGTAAA 


ATTATTTTAT 


AATATTTAAA 


TCCAATTGCA 


CAAGGAGTGA 


TTATCaTGGT 


ACCAGAAGAA 


AAAGGTTCTA 


TTACTTTGTC AAAAGAAGCA GCTATCATAT 


TTGCAATCGC 


AAAATTCAAA 


CCATTTAAGA 


ACAGAATTAA AAATAACCCA 


CAAAAAACAA 


ATCCATTTCT 


TAAATTACAT 


GAAAACaAAA 


AATCTTAATC 


ACTTTtATTT 


ATAGcATTTC 


TAATCTCAGA 


AATGCTATA 











(2) INFORMATION FOR SEQ ID NO: 351:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1816 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 351: 
AAAATCGCAT ATAGTAATAT GAATAACCAG ATTGTATCTA CAAAAAAGTA TATTGAAAAA 
CCAAGCGCAC CCATTAATAA TGCGAGAATA ATAATAATTT TTCTATTAAA GTGATGCGTA 
TCCGAAAATC TAGCAATAAT TGAATTTACT GTAAACTGGC TAATCGCTGC AGATGCTAGA 
AGTAATCCAT ACTGATTTGT TGTCATACCT AAATCTTTAG TTGCAAAAAG AACAAGATAT 
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TTCTyTATTT GTAaTAACGC TGCAAACATA TCCATAACCC CGCTTCTTAG AGCCCCTTTT 360

AATTnATnAA TTAGGGGCTC TTATGCAGTT GGTGCATTAG CAACCAACTG TATTCCTTTG 420

TCCCCTTTTA ATTTATTaAT TAGGGGCTCT TTTGCTGTTG GTGCATTAGC AACCAACTAC 480

GTTCAATTTA ACCGAATAGT TTAAAATTAA ATACAAACCT TAAATTAGTC TAAAACTACG 540

CCTTTGGTTG TTCAACAAAG CTCGCCATGA GATTTACAAA AGAATCAACT TGTGGCAATT 600

GCAACATGCT CGGATCATAA CTCATAAATG TCGAACGAAT CAGCGGTTCA TTATCAATTT 660

CTACTTTTTC AAACTCAAAT TGTTCTTTGC TGATATTTTT CATCATAATT TCTGGCAAGA 720

TTGTAACACC TACACCACTA ATCAACATTT CTTTGCAAGT TGCTACTTGA TCCACTGTAA 780

TAGTTGCATG GTAATCTTGT TCTAAATTAT CGTTATACCA TTCTTTTATT TGATTTATAT 840

AAATCGGATC AGCTTGAAAC TCTATAAATG GTAACTTTGT AACATCATCT CGTCTATTTT 900

TTGGAAAAAT AAAATAATGA TCATCATTAA ATAAATGTGT GTTAGCTAAA TTCATTACCT 960

TATTTCCACG AGTTATCATA ACATGATAAT CTCTATGATT TGCTTTAATT TGTTCAGTTG 1020

AACCAACTTG CACTTGTATT TCAACATTAG GAAATTGGGC ATTATATAGG CTCAAAACTT 1080

CAGGAAGTAA GGTTTGTCCA ATCAAAGAAG AACACCCGAT TGATATTGTT CCATTCACTT 1140

CACCAATATG TGCCTGCATT TTGTCAAAAA ATAATCGCTC TCTTTTCAAC ATGTCACGAG 1200

CATGCTCAAT AATCATTGTT CCTTCAGTTG TTGTAATCAA TTGTTTTTTT GTTCTGATAA 1260

AAATATCTAC TCCAAAAGCA TTTTCAATAG CTTTTAGTCT TTGTGTAACA GCAGGTTGAG 1320

ATATATATAA AATTTCAGCC GCTTTACGTA ACGTTTTCGT TTCGTCTAAT GTTATTAGTA 1380

AACGATAGTC TTCAATCTTC ATAATTTCCC CCCATAAATT ATTCAATTAT TGAACTTTCA 1440

TGGCTACAAG CATTCATGAG TTCATTACTA ACGAATAATT TCACCAATTT TATTGGTATG 1500

GCTGCAGCTT GAATTACTTA GTTTTTCTTT TGTTGTTGGT GATTTTTAGT TTGATTATAT 1560

TGCTTAGGCT TTATTTGTTT GCTTTTTTCA ATATTAGTTT TATTTTGTGG CTTTTGATGA 1620

TTTTTTTGAG CCTTTGCATT AATTTTATTA AAGCAGTACA TGATTTTCTT TTGGAATCCT 1680

TTAAAATCAT TTTCTAACTC TGCCATAATT TGATGTGCAA TCATATATGC TTCATGAAAT 1740

TGCTTTTTTG TAATTTGCTC ACTTTCTAAT GCAAACATTA AATCATCTTC ATCTACCAAC 1800

TCATaTcACC ACTTGG 1816


(2) INFORMATION FOR SEQ ID NO: 352: 









(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9956 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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15 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 352: 
GCGTTGTCGT CGATGATTAA TAAAGTATGG GTATACCATT AAGAATAACG CTACCCAAAT 60

gAkTGCTAGT GACGTGCCGC CAATGACATC TGAAAAGTAA TGTGCATGAA AATAAAGGCG 120 

ACAAAATAAT ATGCTAAGCC ATAATATTCC CATAACCAAC GCACTCAACA CTTTTGTTAT 180 

TGTCTTAGCA GCAAGTGAAA TAATAATGAT CATTAAGGCG AAATATAATA ATGTGCTGGC 240

GTTGGAATGT CCACTCGGAA ATGAAAAGCC TGTATCAACG GCTAAATGAT TATATGGTCT 300 

TGGACGTAAT ACAGTATCTT TAATTAATTT GTTCATGATG ACACCTGAAA CCAAATATGT 360 

CACAAACCAA ACCGCTAAAT GCCTCTGTTT AATAAACAGT ATGATTGTGA CGATAATGGA 420

AATCAACACG ACACCTTTGA CATCTCCAAT TTCCGCACTA AACGTCATAT AGTAATTAAA 480

CAAATTGTTA ACATACTGAC GTTGTGGCTC ACCGAAATAA TCTGTAAACC ATGTTAATGA 540

TCCCATATCT ATATTTTTAA GCCATTCTTG ATTTGTCACT ACACTGTAAA ACATACCTAT 600 

AAATACAATC AGCGCGATTA AAAATAAAGG CACTGTcATT TTCGGTGATG TTAATTTTTT 660 

ATCTATCATC TTACAATCTC CTCGTATCAT CATTTTCATT TTACAAATGT TATCCATAAT 720

ATCAATGTGC CACAAATTTC ACTTTACCGA CAATATCAAA ATTATAAAGT TCATATTGTT 780 
ATGTATATTG CAAATAAAAC ATTGTATAAT TGAAATAACA ATATTTTGCT ATTTTCAATT 840 
TAGTACGATT TATATTTATT ATACAGAGGG GGTAAGGCGT ATCAATAGAG TTATTTTTGT 900
CTATATAGCG TTAATCATTA CATTAGTTAG ATTCTTTACC CCTATTCATC CATCATTTTC 960 

AAATTTGATT TACTGGATAT TTGTATTATA TTTTATTCCT ATTATACTAT GCGTTATCGG 1020 

TTTCAAGGCC GAAAAACTTA TTGCAACAAT GGTCATTATA CCTAATTTTT TAGGAATACT 1080 

TTATCGATTA TATGCCTACG TCACACATAT ACTCTTTATG TAAAAGGATA GTGGGCATGT 1140 

CTCGCAACAA ATATGCTTGC GTCGACTTGT CACCATTCGC AAATTTTATG ACATACGCCT 1200

TTCACGGGCT ATATTCAGAC CCACGCATTC ATCCACGTAA TAAACACATC ATGTAATAGA 1260 

AAAACAGCAC ACCCAAATAT ATGGCGTTGC GCTGTTTAAC CAAGCATACT TCTATAGCTT 1320 

TAATAAGCCA GCAGAAGCAT ACCTAACCTT CTTAAATATG CTTTTCCAAA TTATCCTCAA 1380

GTTTGAATAC GATAATACGT TCACCTGTAA CTGTACTTAA ATCACTATGG AAGCTCATCA 1440 

CTTTGATACC TGTAATTTTA AAAATGATAT CATTCAAATC TTGCTCACCG GATTCAACTA 1500 

ATTCAGAACG TGTTCGTTTA ATATTTAATA ATCCTTCATT CGTACTACAT ACACGATATT 1560 

CAGCTGGCGT TAAGATACCT TGTAAACTAA TAATCACCAT ATCTCTTAAA ATGTCTGATT 1620 
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TTTCAGCTTC GATTTCACCT TTCGTTCTTT TCATATCACT AACTCCAATA ATATTAAAAT 1740

TGATTACTTC ATCTTTGTAT CGTTATCACG ACATCAACTA TTACATTAAG TTTATCATTT 1800

TTAGTATATT TTAAGAAGCT AGAACATTGT AGATATGATG ATATATTAGT TACTTAGCAT 1860

CGCAACATAT CATCGTTAAA TCCAACTTTT AAAACGCCCT TCCTCATTAA CGCTCATTAA 1920

ACGCAGCCAA TGATTAGACA CCTTCCTAGC GAAATGCTCA TTATTCGCGA GTAGTCTTGC 1980

TACAACATAG TCGGGTGCCT GAATAACGAC AAGTAAACGA ATTGGCGAAT GATACATCGT 2040

CCGATCAGCA GCCATAACAG ATTGCCATGA TAAGCCATAC ATCAGATCAC TCGCATTACC 2100

TTGCATGACA CCAACACCTG ACGTGACGGT TTGTGTCGCT TTATTCCCAC TTCCGTAAAA 2160

ATGCGGCGCA ACTGTCGACG CATAATATTG TAAATTAATC CATTGTGCCA CAAGTGCCGG 2220

ACCAGAAATG ATGGTATTTA ATAATGTGCC ATCTTTATCT TTACGCCAAT CATAATTGTG 2280

TAAAAATGTC CGCCCTTCTA AATCAATGCC TTTTGTTAAT TGGCGTCGTC CAATTATAAA 2340

TGATGCATTT TTAGCCAATC CCCATTCTGG ACGTACCTCA CTCCAATCAC TCGCAAACCG 2400

CTGCGCTTCT TCCACTGGAT GATTCACACG ACCAATCGTT GGCAGTTTGT CCAAACGTTC 2460

GCGATTCGcg TGtTCAGAAA TCATCGGCAT CGCGTCATTC AATGATTCAT ATGCATCTAA 2520

AGCAATAGAA GATAATGTGT CTGGCACATA TACCCATGCC AACGTATCAG TAGACGTATG 2580

ATGTTCTGCT ACCGCAAAAA CAGTTGTCTC TGGAATATAC ACACCTGATT GTTTTAATCC 2640

TTGTCTGACA TTTGGACGAT TACATATCAT CGCTAATAAC TTAGCATTAA AACCGCTTGA 2700

TGCGCCACCA CAAGCCCCAC ATTCAAGTGA TGCATGATGT GGATTATTGT GAGAATGACT 2760

AGCATGACCT GCTAACACAA CGAACGGCGC AAATGCTTCG GTTAAATCCA TCAATTTCAA 2820

CGCTTGTAAC GCGAAATCAA TTTGCTCTTG CTCAGTAAAT CCAACAGGTA AGTCTGATGT 2880

TCGGTCAAAC TCACGATCAA TCGTCAACTT TGTTTCAGGC TTTTTCAACC ACTTTTGTTT 2940

TATTTTTTGT AAAGACGCGC GACTTTTTCT AGGCATAATC GAATTGACAA TGGTACTTAA 3000

GCTTAAAAAT GGCCCACTTA ATTCAGGCAA TAACAGACTA GGCATGACAT TATTTTTCAT 3060

CAATTTAAAT GTGTAAAACA TCGATGACAT TGTCTGTTGC TGTTGTCGAT AAACATTCAT 3120

ATCGTAGCGG TCTGCAAATT CTTTAATGCG ATATGCCGGC GGTACCATGA CAGGTAATGA 3180

ATCATGTTTG AATTGTTCGT CTACGGCATC TTTTTGAATA GGTAATCCAA AGAAGCCTGC 3240

AATACCAATC GTTTCAAAGG GCCCTGCTGC TTCGATATGT CTACGAAATG GTTCTGAACG 3300

AACATCTATA CAAAATGCAA TTTGCGCTTT CGTTGATGTG CCCATCTGAT TTAGCTCGCT 3360

ATTATTTTCA TCAACTGCTT GTGTGTCATT TAACAATACT GAATGTGGCT GATTAGCGTT 3420
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TGCTTTAATT TTTTGTTTTA ACTGAGATTC GTATGTCATT TCCCAGGCAA TTAGCCATAA 3540

ATTTTTAAAT ACATTTTTAT TCATAGTTGC TGCAAAATGA ATAAACGTTT GAATTTCATT 3600

GACGTCATGT TGTAGTAATA CATCGCTAGG CATATCACTG TAGTAACACC ATGATGCAAC 3660

AGTTTGCTTA AACCAATTTT CCGATCTACT TTCACAATCT TTAGCGACTG ACTTAAACTC 3720

ATCACCAACT AGCAATTGTT CGACAACTAA CCGAATTGCC AAATAATCCG TTAACAAATG 3780

TTGTTCAAAG TGATGCTGTT GTGAACGGTA ATACAACATA CCTGCCCAAC CCGGTAACGC 3840

CAAAAGATGT CCTTCAACAT AAGCTTGGTA GTCTTCCTGA TCTATTGAAA AATGAGTTAA 3900

TACTGACTCT ATCGTCATTT CAGGATCATT GGGTAAGCCT TTAATCACTT GGCGCTGTGC 3960

TTTAGTAAAA CTATGGTCAT GTTGCGCTAA ATGCAACCAT GCATGGTAAA AACTTTGcTC 4020

ACGCTTCGGC ATTGTCCAAC TCGATAGAAA TTGATCGATA TAAAGTTTCG TCCATTTAAT 4080

CATTTGACGA TTCACTTGTT CGCTAAGTGG CTCACCTTGT TCATCTATTA TTGCATCACT 4140

CATCGGACGT ACATCATAGT GATGATATGA TTCAGCCATA TCACGTTTTG ATTTTTCTAA 4200

TAGTAGATCA GCAACAACAT CAACATTTGA ATGATTCATA TATGATGCAG GTACGTCTTT 4260

TAATGTTTTA ATGTTATCAA TATAAAGATT GATGTAGTGT TGCGGGATAT TGTAGTGATG 4320

TTCAAGTAAC ATATCAGTAA CAAGTTGATT AAAGACACTT TCATCTAATT CACCACGTGC 4380

CACAGCGCTT TCTATTAATG CTTTATTTGG GAAAATATCC ACATCTCGAA CATCACGTAA 4440

CCATTTTGCG ACATCTTCAA ACGTATCCGC TTCTAATCCT TCCCATGGAT TTCGTGCTGC 4500

AAAAATCGAA ATTGGTGATA ATGGTGTAAT AACACGTTTC GCATTTTCAA TGACTGAATT 4560

GATATTTAAC TGTGTTGTCA TACCTTTCAC CTCCTATAAA TACTTCTTCA AATAATTCGG 4620

ATGACTTTCT ATCGCTTTCG AGCGTGCTTC ACCTAGATTA ACTAACCACA CGTACAATAC 4680

CGCAAAAGCC TTAGAGTATC GATGCCGCGC CACCCAAATA CTTAATAAAC TGCCAAAGAT 4740

TAAAATAACA ACACTAATGA TGACACTCAC TGTAGGCGGC GTTGTCGCAT GTGTTGTTAT 4800

ATTTTGTAAT ACAGCGTAAA AATAATTATG TGTGATGACG TAGATAAATG TCACGATTGC 4860

AATCAAAATC ATACCAACAA GACGTGCCAT GCGTCCTTTA CTAAAGGCTA CCATTTGATT 4920

CCAAGATACA AGTAATGACC ATCCTAGAAT GAGTGCACTT AACACTTCAT ATGCACTTCT 4980

GTCACTACTC ATCCAAAATA GAAATGCCAC GATAATAGCT AATACACGTC CCATGACAAT 5040

CCAGCCATAA GCGTCTTTAG CAGATGCTTG TTTTGGAATA TTGAATCGCT TCACGATAGA 5100

ACCTGATTGT AAAAATAATG TTGCTTTAAA AATACCGTGC AATATTAAAT GAATAATCGC 5160

TGCTGAATAT ACACCCAATG CACATTGAAC TAACATAAAG CCCATTTGAC TCATCGTAGA 5220



AGAAATACTA GAAAGGATAA GTAATAATGA
AAAACGAGTT AGAATAACAC CACCTGCATT
TGGCGTTGGT GCCGTTACAG ATTCAATCAA
TATCATGACA GCTAATACAA GTAGTACATT
ATGTTGTGGT ACCCGCCACT CGCCAGTCGC
TGCAAGCCAA CCACATAAAA ATGTCATGCT
CCAAAAACGA TTAACGTTCA TCAGCAATGT
TGCCATCAGT CTTAAGTCTT CAGACATCCA
GAACAATGGA AAGTAATGTC TATAATGATG
TTGAATAATA AAGCCAAGCG CCATTACAAA
TTTAAATGGA CCTAAGACAT GTTGACCATG
TACTGGCATG ACTAGTATGT ATAAATGTAA
TAAAAACAAC AAGCCACTTA TCAATGCAAT
CAATTGAAAA CTTAACACTG CATAACCTCC
GTATAAAAAT AAAAACCTAC AATAGTAGAT
GCAGGTTTCA GTTTAACTAG ACACTGCATC
TCTCTTTAAG CGTGCTCCCA TGCACATATG
TCATCTTCAT AAATATGCTT TGCCTAGACG
TAACATAAAA TATAATTAAA TTTCTGCTTC
AACCAATGAT TGTAAAGATT TTGTAAATGC
TTTTAAAATA AAAAATTCGC AGTTATGATC
TACGCACAAG ATAACTATGT ACAATGAAGT
GGGTATCATC GCTGGCATCA TTAAAGTTAT
ATAAGATTTC ATAACAAACA AAGGAGGTCT
AATTCATTAA AGGATTAATT GAGAAATTCA
TATGAACATC TTATTTGAAG GGGGCCATTC
TCTTTAAAGA TTTACTTGGT AAATTTTTAG
ACAACTCATC ACTATGTTAA ATCAACATAC
CTATCATTAA AATCATCAAA GCAATTATCG
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TAACGCAAAT CCATTATCAA ATATCGGCGC 5340

CACAATTCCT GCATGCATAA TTGCCGATAC 5400

CCATCGATGA AAAGGAAATT GTGCTGCCGG 5460

CGTCAACAAT GACCATGTCG GATGAACTAT 5520

AATATAAATA GTTACAATTG CTCCAACGAA 5580

TGATAATTTC GCAGACTCAC GTGGCACTTT 5640

TAAACATAAT AATGTAATAC CCCAGCAGAG 5700

TGCTAAAGAT GCAAACGACG TAATCGCAGT 5760

ATCACCTAGT AAATATCGCA TTGAAAACTT 5820

GCCAGCTAAT AACCAAGATA AACGATCTAT 5880

AATACCGAAA AAGCCAATGA CTGCAAATAA 5940

TTTAATATAT CTCATTGGCA TAACTGGTGC 6000

GATAAGCGCA ATAACAAACA GTGAAAATAG 6060

TTATTTCTAA TCTCTCGCAT AATTGCTTAT 6120

TCTGTACATA ATGGCAGAAA ATTTACTATT 6180

ACGGTACGTT GATATACCTT GTTGCAGTGT 6240

TATATAAAAT GTTACTTCTG TCTGTTCAAT 6300

AGACCTAACG TGTTATTCGT TTTAAACTTA 6360

ATGTCAAATT CATGAGCTTA ACCTCTATTA 6420

ACCTGTACAG TTAGGCAGTA TTTCCCGTCC 6480

ATAACAATTC AAGTTAGGAA AAAAATCAAT 6540

TAACTCATAA GCAAAGGAGG TAATCTTAAT 6600

CAAAAGCTTA ATCGAACAAT TCACTGGTAA 6660

TTCACATGGG TATCATTGCA GGAATCATTA 6720

CTGGTAAGTA AGTTATAAAA ATCTCATAGA 6780

ACATGGAATT CGTAGCAAAA TTATTCAAAT 6840

GTAACAACTA ATCTCAAACA TTAACGATCA 6900

AGGAGGACAA AACGATGGCT ATTGTAGGTA 6960

ACATTTTCGC AAAATAATTT AAGCGAATTG 7020
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70 





TTATTGATGT GAGGTGAGTC TTGTTAGTTT GTTGCAAATA AATGGTCTTG GTGTTTTTTG 7140

TATAGGACGT TCTTAGTGGG ACATACGGAA TATTCGTGAT CTTTGTAGTC TGACGCGTTA 7200 

TATTTTTGTG GCCTGTTTTA TGTTTGATAC TCGAGTTCTG AGACATTCAT GATTTGGCAT 7260 

GCGAAATCTT AATGATTTTC ATGATCTAGC GCAAGATATA TTGGCCACGT GCGGAATTGC 7320 

GTTGCACGTT TAGACTGAAA CACTCGTGTG ACCGTAAGTG TTAATAGTAC ATTGATAGCT 7380

GCATTTACTT CACTCATTTT TATGACTGTT AAACAATGAT TGTACCTTCA ATTAACAGTT 7440

GGTACGATGG TTTTGCCATT TTTCATCAAC GTAAATATAA AAAGGACTAA GACACATACA 7500

TGTCCTAGCC CTATGGATAA AATGCAAATT TCTGCTTTAT CAAAACTATC ACACTTTAGA 7560 

TAGATTGAAA ACAAAAAGAT CCTAAGAACA CCTTAACTTT TTATTAATTG TCATAAATTG 7620 

CAAACAATTA AGGCACAATT CAAAAATGAT TATACTTCAT TCAACTTATC GTGCTGGTCT 7680 

AATTTGCCAT TGATATGGAT CTTCAAATTG TTGCCAATCT GCATCAATTT CTTGCGCATT 7740

GACTAAGCAT GCGTCGAGTT CTTTTGTTAA TTTTTCTTCA TCTAATTCTG TACCAATAAT 7800 

GACAAATTGT GTATGACGAT CGCCATATTC TGGATCCCAT TCAGCTGCGA CATCTTGACG 7860

TTCTGCTAAT ATTTGTGTTT GTTGCGCTTC AGACATACTA GCCACCCAAT ATGTAACTGG 7920

ATGAATATTG CAAGATGACC CTGCTTGAGA TAATAAACAT GCTACGTGAT TGTATTGTGC 7980

TAGCCATACG ATACCTTTTG ATCGAACGAC ATTATTTGGC ATGCTTTCTA ACCAATCATT 8040

GAACCTTTTA GCATGGAAAG GTAGACGACG TTTATATACA AACGATGATA TACCATATTC 8100 

TTCTGTTTCA GGTGTATGCG ATGCATGCCC ACCAGACTCA AGTTCTTTGA TCCATCCTGC 8160 

TGACTCGCTC GCTTTTTCAA AATCAAAACG CTGCGTATTC AAGACTTCTT TTAAATCTAC 8220

TTCAGAATTT GTTGTCTTAA TAATTTTAGC AGTCGGTTGC AATGCGCTTA ACATTTTTTC 8280 

TAACTTCGCT AGTTCTTCTT CACTAATTAA ATCAATTTTA TTAATAATCA ATACATCACA 8340 

AAATTCAACT TGGTCAATTA ATAAATCAGC AATCGAACGC TCATCTGTTT CGTCAACGCT 8400 

TTGATCACGA TCCATCAATA AATCTTCTGA GTTGATGTCA TGTACGAAGC GGTTAGCATC 8460 

CACAACTGTA ACCATTGTAT CTAAACGGCA AATCGCTGTA AGATCAATGC CAAGTTCATC 8520 

ATCAATATAT GAGAAAGTTT GTGCAACAGG TACTGGCTCT GAAATCCCTG TTGACTCAAT 8580

AACAATTTGA TCGATGCCAC CTTTTTTCAC TAAACGCTCA ACTTCTTTTA ATAAATCGTC 8640

TCTAAGTGTA CAACAGATAC AACCATTAGA AAGTTCGACT AATTTTTCAT CTGTACGCGA 8700

TAGTCCCCCA CCATCTGCGA CAAGATCTTT ATCGATATTT ACTTCACTCA TATCATTTAC 8760

AATTACCGCG ATACGTCGAC CTTCTCGATT TTGTAAAATA TGATTTAACA ACGTTGTCTT 8820
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ACTTCAATTT ATTTGTAAAT AGGAATAATT CTGTTTTACA TTATATAGGA GCGTTTCCTC 8940

TTTCGCAATC TTCGATAATA AAAAAATAGT ATACTTAATT AAATTATTGA GCGCTTTACT 9000 

TTATAATGGA GACAAAGATA TATCTCACGA AAGAGAATCG AGGTGTATAA ACATGTTATT 9060 

TGTCATTTTA GTTTTATATG TTACTGGTAT TGCATTTATT CTACTCAGTG TTTTTGGTTC 9120 

AAAGACTGAA GGATTATCTA CGAAACATAC TTTATATACC ATTGGCAGTG CTATTATAAC 9180 

GATTGCTATT TTCATTTCAA TTGGCTATGC CATTCAATAC TTAACTGCAG CGCTTTATGG 9240

TTTGTAAGGT GAAGGTGATG AGTAACGGGT AGTTCGGGAG AGGTTAACTT GCGTTGATTT 9300 

TGATAAAGTG ATCATAGCTT TTAGTACTTG AGGATTTTTA TTGTTGCTGT TACGAATGTG 9360

GTCATGTTTA ATGCGGGACA GTAATTTAAG TTGTTTTTTT ACAATTGAGA GTGTGATATT 9420

TCGATTCGGT TCGAATTACT TTACATGGGA ATAATATAAA TTAAAAAGAA GCGGCCTAGT 9480

GTCAGTTGTG AATATACTGA ACATTGGTCG CTTTATTTAG TAGTATGATA TGTAGTTTAG 9540

CTATTAATTT TTTTCAGGTC ATCCTTAATG CTGTCTATCT CAGACATGGC ACTTTTAACC 9600 

CAATCTCCTT GAGCTGCACC TTTAAAATTA GCTTTAAAAg ctTCGCAATG TTGCGCCATT 9660 

TGTTCAATTA ATACTTTTTC TTCACCTTTT AATCCGTTTT CAATATCTTT GTATTTATGC 9720 

TTATGTTCAG GTGCAATAAC TGTGCGAATA TTTTCTTTTT GCGCTTCCAT TTTAGATATG 9780

AGATTAAGTG TTTCTACTGT AGTACTTATA TCTGGCATTC TTAAGGTCAT ATCTGGTTCT 9840

ATTAGAGTCA TTTAATCTCC TCCAAATTAT CAGTCACTTA GCTTATCTAA CTGCTTTTCA 9900 

TAAGACTTTT TTAAGTCTTC TTTATATTCT TCTAATTTCC CATTCTTGCT TTCTGA 9956 
(2) INFORMATION FOR SEQ ID NO: 353:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2411 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 353: 


TTTCTTTTAA CAGaTTTATC cCATTAATTG TTCTACAAGC CCACGaTGAG CAATATCATT 60 

TTTAGCAACC ATTAATAAAC CAGAAGTATC CATATCTATA CGGTGAACAA TACCTGGACG 120 

AATTTCTCCA TTAATACCTG ACAAATTTTT AATTTGATAC ATTAAACCAT TAACTAATGT 180

ATTGGTATAA TGCCCTGGTG ATGGATGAAC TACCATGCCT TTCGGTTTAT ATACAACTGC 240 

AACATCGTCA TCTTCATAAT AAATATCTAA ATTTAAATTT TCAGGTAGAA TATCAGCTTC 300 

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AACAACTTTA TCGTTTGCAA CGACTAAACC TGCTTTAATC CAATCTTGTA TCTGGTTACG 420

AGACCAATCA TTATTTAATT CAGGCAGCAA CTTATCTACA CGCATACCTG TTTGTTCTTT 480

ATCTGTAATG TTAAATTCAT AAGTCTCCAT TACTTAACCT CCTTCTCCTT TTTATTGGAA 540

GTATCCTTTA ATAAGGCAAT AATAATTAAT ATTACACCAA TTGTTAAACT TGAATCTGCG 600

ATATTAAATA TTGGAAAATC ATAACCAAAA ATATTTGTAT CAATAAAGTC AACAACTTCT 660

CCTGTTAAAA TTCTATCAAT AAAGTTTCCA AGTGCACCTG CAAAAAGTAA ACTAATAGCA 720

ACTTGCATAA ACAAATTATA TTGAGCATCT TTAATAAAGA AATATACTAA GGCTATTAAT 780

ATAATAATGG TAATAATAAA GAAAAATGTC ATTTTTCCAC TCAATATTCC CCATGcAGCA 840

CCATTATTTC GATGTGaTGT TATGTTTAAA AAGTGCGGTA TCACTTCAAA TGAATCTCCA 900

ATTTTGATTG TAGTAGCTAT AATATATTTA GTAACTTGGT CAAATATAAC GACAAATACT 960

GCTATTAAAA TGGAAGTGCC AATAAAATAT TTTTTGTGCA TTTTCGTTCC TCCAATCAAT 1020

CGTCCATGAG ACAACTCTTT ATATTATAGC TTACACCTGC TAATAAAAAA AGTAAGCATA 1080

TTACATTAAA TCTAATGTTA CTAACTCAAT ACTTGATAAA CTACTATGTT TTGACATTAA 1140

ATATGAACTT AATTATTCAT TTATCATATT TAAGATGACA TTAAAAATTA GGAAAGCAGG 1200

CTGGAACATA AATCCCTAAA AAGACAGTAG TAAGATATTT TCTAATTAAA AATTATCTTA 1260

CTGCTGTTCT CTATTTATAC AATACTTCGT ATTGAATGGC TTCGCTATGC CCATCTGGCA 1320

CATTACTGTA AAATTCTATA AATAGAATTT TTGATGATGG GTCCCTTCCT AGGGTGCCGT 1380

CTCAGCCTCG GtCTTCGACT GGCACTGCTC CCTCAGGAGT CTCGCCATTA ATACTACGTA 1440

TTAACATGTA ATTTTACTTT TAAATACTTT AAAAAAATAA GACATGAATC GTCTACACTT 1500

AATTGGACAA ATTCTATGAG AATAGATATT GTTAATTTAA GAAAGTAGGC TATTTTGAGT 1560

TtCACTCGAA TGTCAGTTCG AGGAATAAAT AAAGTTAAAC GAGAGCTAGG TTTTGTATTA 1620

ATGGCAATTA ATATAAGGAA AATAGCAGCT CAACGAGCTG TACATTATAA AATACATATC 1680

AAAAAAGCTG ATTTCTATCA AATAATTAAT AGAAATCAGC TTTTTTACAT TGCCTAAGAA 1740

CTTAATGTCC CAAGCCCTAA AACTTGTTGT TATTTATTTG ATTTAGCAGC GATACGTTTA 1800

TATCTTAAGT ACATAAATGC TAAAAGTATA AACCAAATCG GAATAAAATA AATTGCACGT 1860

CTTGTATCAA CATTAATAAA TAATAACCCG AACACAAAAA TGAAGAATAC AAATATTAAG 1920

TAGCCCATAT ATTTGCCACC TAATAGTTTG TACGTAGCAT TTTTATGTAG ATCTGGGTTT 1980

TTACGACTAT AATTGATATA TGCAATGATA ATCAGACCCC ATACAACTAA AAATAACACT 2040

GTAGAGATGG TAGTCACATA CGTAAATACT TTTGTCGCAT CTGGGAAAAT ATAGTTTAGT 2100
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TTATTCGTCT TAGAAAAGTT CGGAGGTGCT TGTyGTTGAC TTGATAAACC GaAAAGCATA 2220 

CGGCTATTTG AGAATATACC ACTGTTACAT GATGAAGCAG CAGCGGTTAA TACTACAAAA 2280 

TTAATCAAGC CCGCAGCAAA CGGAATTCCG ATCAATGCGA ATrATTTTnC GAATGGACTG 2340 

TTATCAGGAT CAACTTGCTG CCAAGGGGTA ATAGACATGA TAACCGCTAA CGCCCCAACG 2400 

TTrmATATTA A 2411 



(2) INFORMATION FOR SEQ ID NO: 354: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 354:

GnGAATTATT TTTAATAATG AAAGGATTAC TTnCATGGGT TTTTACTAGG AnTACCCAGA 60

AGGTCAAAAT ATTTTTGATG CGCTAAGTCA ATATGAAGTT AAGCGACGCG GCGATATGGA 120

AGAGGATCCA TCATATAAAC AACTCATTTC TTATTGTTTA CTTGAAAATG AGCATGGCGA 180

GATATTAGTG TATGAACGAT TATCTGGCGG TGGAGAAGCT CGATTGCATG GACAATCTTC 240

AATAGGTGTA GGCGGTCATA TGAATGATGT TCCAGGAGCA GAATCTATTA ACGAAGTATT 300

GAGAGTTAAT GCACAGAGAG AATTAGAAGA AGAAGTAGGT TTAAGTGAGC AAGATTCACA 360

AAATATGGAA TATATCGGTT TTATTAATGA CGATAATAAT GAAGTGGGCA AGGTACATAT 420

TGGTGTTGTA TTTAAAATCA CTGTAAGTAC GAATGATGTA GAAGCTAAAG AAACAGATAC 480

TTTACGAATA AAATGGGTTG AAAAAGGCAA CATAGAGTCA TATGATGATT TCGAAACGTG 540

GAGTGCATTA ATCCTTCAAG ATTTATAATC AAACGAGGTG ACATATATGT CAGATATTAT 600

TCCAG 605



(2) INFORMATION FOR SEQ ID NO: 355: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 668 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 355: 
TTTATTAGCA CATCCAAACT ATTCATATGT TGGACAATTT TTAAACGAAC TAGGATTTAA 60 






EP 0 786 519 A2 





ACAATTAGAC ACTGAACATT TAGCTGATTT AAATCCAGAG CGTATGATCA TTATGACAGA 180

TCATGCTAAA AAAGATTCTG CTGAATTCAA GAAGTTACAA GAAGATGCAA CATGGAAAAA 240

GTTGAATGCA GTTAAAAATA ATCGCGTGGA TATTGTTGAC CGTGATGTTT GGGCAAGATC 300

TCGTGGCTTA ATTTCTTCTG AAGAAATGGC TAAAGAACTT GTTGAATTAT CAAAAAAAGA 360

ACAAAAGTAA GGTGGAAGTA AATGGCTATA AAAGAAATAA GTAGCCAATC TGCCATAGAT 420

CATAAAAGAA AAAGACGCAC AACACTCACG TATATAGTGA GTTTGTGCTT TCTTTTTATT 480

TGTATATATT TAAATATGGC GATTGGTTCT TCGAAAATTA ATTTTAGCGA TATCATTCAC 540

TATGTTACTG GTCATACAGA TACGAAAGCA ACGTTTTTAT TGCATAATGT ACGTATGCCA 600

AGGATGATTG CAGGGTTATT TATTGGCGGT GCATTAGCGG TATCTGGTTT GTTAATGCAA 660

GCAATGAC 668


(2) INFORMATION FOR SEQ ID NO: 356: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 787 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 356: 






ATACAAAAAA ACATATCGAA AATAAAGCTA AAAGAAACTA TCAAGTTCCA TATTCAATTA 60

ATTTAAATGG TACATCTACA AACATTTTAT CGAATCTTTC ATTTTCAAAT AAACCTTGGA 120

CAAATTACAA AAATTTAACT AGTCAAATAA AATCAGTACT GAAGCATGAT AGAGGTATTA 180

GTGAACAAGA TTTAAAATAT GCTAAGAAAG CTTATTATAC TGTTTATTTT AAAAATGGTG 240

GTAAAAGAAT CTTACAGTTG AATTCAAAAA ATTACACAGC AAACTTAGTT CATGCGAAAG 300

ATGTTAAGAG AATTGAAATT ACTGTTAAAA CAGGAACTAA AGCGAAAGCA GACAGATATG 360

TACCATACAC AATTGCAGTA AATGGCACAT CAACACCAAT TTTATCAAAA CTTAAAATTT 420

CGAATAAACA ATTAATTAGT TACAAATATT TAAATGACAA AGTGAAATCT GTATTAAAAA 480

GTGAAAGAGG CATCAGTGAT CTTGACTTAA AATTTGCGAA ACAAGCAAAA TATACAGTAT 540

ATTTCAAAAA TGGAAAGAAA CAAGTAGTGA ATTTAAAATC AGACATCTTT ACACCTAATT 600

TATTTAGTGC CAAAGATATT AAAAAGATTG ATATTGATGT AAAACAATAC ACTAAATCAA 660

AAAAAAATAA ATAAATCTAA TAATGTGAAA TTCCCAGTAA CAATAAATAA ATTTGAAAAC 720

ATAGTTTCAA ATGAATTTGT GTTCTATAAT GCAAGCAAAA TTACmATTAA TGaTTTAAGT 780
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(2) INFORMATION FOR SEQ ID NO: 357: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 534 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 357:
AAAGTAAAAA TAAATCTCCC TTTTTAACTT TCGTTTCTGC CATAGCCATT GCTTCTTCTG 60 
TGATAGTTGC TACAATATCT TTTCTTTCAC GGTTAAAATG TTCAACTTGT TCTGCTAAAA 120 

ATGCAGCTTC TTCTTCGACG TCAGTCATCA ACAATTCGCa AGCTAATGAT GCGTCATCTA 180
AACGACCTAC AGCATTAAGT CTAGGTCCAA TAATAAAACC AATTGTTTCT TCATCAATAT 240

TGTCATTGTA TCCCGCTTCT TTTAGCAATG CTTTAACAGA GGTCGGACAT TGATCATTTA 300

AGACTTTTAA TCCTTGTtTC ACTAATGATC GATTTTCATC AGTTAAGGAT ACTAAATCCG 360 
CAATGGTACC TATCGCAACT AATGCTTTAA AATAATCAGG TACATTTtCA ATCAATGCTT 420 

GTGCTAATTT GTATGCAACA CCTGCACCAC ACAATTGTTG GAACGGATAA TTAAACGATG 480

GATGCATTGG ATGTACGATT GCATATGCTT CTGGTAATGT ACTACCAATT TCAT 534 
(2) INFORMATION FOR SEQ ID NO: 358:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3621 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 358: 



GGTGAGTCAA ATTAAATGAA TCTAATAAGT CATAACTATC TATTTGTAAT GTGCAACGCT 60

TAACGCATAT ACAAAATGAA TGTGCTGATA ATGATTTACT CAAATTAAAA GGTGATTTTT 120

ATTCAATGAT GAATGAAAGT TGCCTTTTTA TTTTTGGTAA AAGTTAATGC GTCAGTGAAT 180

TGTGTAAGTT TTTCAAAAAG TAAAAAGAAA TAATAAAGGT GAATTATTAG AATTCCArAA 240

ATAATTCATT ACATTCATAA AGCATTTTAC AAATGGTAAG AAAATGAGTG TTACAAATCT 300

AAATATTGCA AAAGAAGCTG ATTTAGTCAC AAAAAATGTC CTATGTAATA ATTCGAGAAA 360

GATGCACTAT ATACGGTCTT CTTACTATTC AAATGTAAAA GTTGCTTATT TGCGTGGCTT 420

TTTGTTTTAT AAAAGTATAA AATTTTACTA TAATATATCT TGTAGAGAAC AATGAAATGA 480
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ATAGCAAACT GTATTACTTT GATACAAAAA TGGTTGTAAT AAATATTTAT CGATATGACG 600 

ACTTGAATAT GATAAAGTGA CATATTTATG TATATGACTA TTTCGCAAAA TGTAATCGAG 660 


GTAGAATTTC TTGACAATTC TGTCAGTTTA TAAGATGTTA TAAATATGTA GTGTATAAGG 720 

AGGCAAACAA GATGACTGAA GAATTCAATG AATCAATGAT TAACGATATT AAAGAAGGTG 780 

ACAAAGTCAC TGGCGAGGTA CAACAAGTTG AAGACAAGCA AGTTGTTGTT CATATCAACG 840

GTGGTAAATT TAATGGGATT ATTCCTATTA GTCAACTATC TACGCATCAT ATTGATAGCC 900 

CAAGTGAAGT TGTAAAAGAG GGCGACGAAG TTGAAGCATA TGTCACTAAA GTTGAGTTTG 960 

ATGAAGAAAA TGAAACTGGA GCTTACATCT TATCTAGAAG ACAACTTGAA ACTGAGAAGT 1020

CTTATAGTTA TTTACAAGAA AAATTAGATA ATAATGAAAT CATCGAAGCG AAAGTAACAG 1080

AAGTAGTTAA AGGTGGTTTG GTTGTTGATG TAGGACAAAG AGGTTTTGTT CCGGCTTCAC 1140

TAATTTCAAC AGACTTCATT GAGGATTTCT CTGTGTTTGA TGGACAAACA ATTCGTATTA 1200

AAGTTGAAGA ATTGGATCCT GAAAATAATA GAGTCATTTT AAGCCGTAAA GCAGTTGAAC 1260 

AAGAAGAAAA CGATGCTAAA AAAGATCAAT TATTACAATC TTTAAATGAA GGCGATGTTA 1320 

TTGATGGTAA AGTAGCGCGT TTAACTCAAT TTGGTGCATT TATAGACATT GGCGGTGTTG 1380

ATGGTTTAGT GCATGTATCT GAACTTTCTC ACGAACATGT TCAAACACCA GAAGAAGTAG 1440

TTTCAATTGG TCAAGATGTT AAAGTTAAAA TTAAATCTAT TGATAGAGAT ACAGAACGTA 1500

TTTCATTATC AATCAAAGAT ACGTTACCAA CACCTTTCGA AAATATTAAA GGTCAATTCC 1560 

ACGAAAATGA TGTCATTGAA GGTGTCGTAG TAAGATTGGC AAACTTTGGT GCATTTGTTG 1620 

AAATTGCACC AGGTGTACAA GGACTTGTAC ATATTTCTGA AATTGCACAC AAACACATTG 1680

GTACGCCAGG TGAAGTGTTA GAACCTGGTC AACAAGTAAA TGTTAAAATA TTAGGTATTG 1740 

ATGAAGAGAA TGAAAGAGTA TCACTATCTA TTAAAGCAAC ATTACCAAAC GAAGATGTTG 1800 

TTGAAAGTGA TCCTTCTACG ACTAAGGCGT ACTTAGAAAA CGAAGAAGAA GATAATCCAA 1860

CAATTGGCGA TATGATTGGT GATAAACTTA AAAATCTTAA ACTATAATTT AATATTTAAT 1920

AGTCAACTCC ACATGTTTAT GATTGcATGT GGAGTATTTT TATGTAACAA AATATACTCG 1980

GAATGATAAC GTGGgACAAA TTTAACTAAG TGTTTAAAAA GATArAGTTT TAAGTGCtGa 2040 

tTTTTATCAT TACAGTAATA AACTCATTTT GAATACACAG TCTCATGTGA TATTATTAAA 2100 

AAGATATaAG AAAGAGAGGA AGTTAGCTTA TGACTAAACC TATAGTAGCT ATTGTAGGTA 2160 


GGCCTAATGT AGGTAAATCT ACAATTTTTA ATAGAATAGT TGGAGAACGT GTTTCGATTG 2220 

TGGAAGACAC GCCAGGTGTA ACACGAGATC GTATTTATTC TTCAGGTGAA TGGTTAACAC 2280 
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AAATTAGAGC GCAGGCAGAA ATCGCCATAG ATGAAGCGGA TGTTATTATT TTTATGGTTA 2400
ACGTGCGTGA AGGATTGACA CAAAGCGATG AAATGGTCGC TCAAATTTTA TACAAATCTA 2460
AAAAACCGGT CGTATTAGCG GTTAACAAAG TAGATAATAT GGAAATGCGT ACAGACGTGT 2520
ATGATTTCTA TTCATTAGGA TTTGGTGAAC CGTATCCGAT ATCAGGGTCA CATGGTTTAG 2580
GTCTTCGTGA CTTGTTAGAT GCAGTTGTTT CTCATTTTGG TGAAGAGGAA GAAGATCCTT 2640
ATGATGAAGA TACAATTCGA CTATCCATTA TTGGACGACC AAACGTAGGT AAATCAAGTT 2700
TAGTAAATGC TATTTTAGGT GAAGATCGCG TTATCGTTTC TAATGTTGCA GGGACAACGA 2760
GAGACGCTAT TGATACAGAG TATAGTTATG ATGGACAAGA TTATGTTTTA ATCGATACTG 2820
CTGGTATGCG TAAAAAAGGA AAAGTATATG AATCAACTGA GAAATATTCA GTATTAAGAG 2880
CTTTAAAAGC GATTGAACGT TCAAATGTTG TTTTAGTGGT CATAGATGCA GAACAAGGCA 2940
TCATTGAACA AGATAAACGT GTTGCAGGAT ATGCACATGA ACAAGGTAAA GCAGTCGTGA 3000
TTGTCGTAAA TAAATGGGAT ACTGTGGAAA AAGATAGTAA AACGATGAAG AAATTTGAAG 3060
ATGAAGTACG TAAAGAATTC CaATTTTTAG ATTATGCACA AATTGCTTTT GTGTCTGCTA 3120
AAGAACGCAC AAGATTACGT ACATTATTCC CTTACATCAA TGAAGCAAGT GAAAACCATA 3180
AAAAACGTGT TCAAAGTTCA ACTTTAAATG AAGTTGTTAC TGATGCAATT TCCATGAACC 3240
CTACACCAAC AGACAAAGGT AGACGTTTGA ATGTCTTTTA TGCAACACAA GTTGCTATAG 3300
AACCACCGAC ATTTGTTGTA TTTGTTAATG ATGTAGAATT AATGcATTTT TCTTATAAAC 3360
GCTATTTAGA GAATCAAATC CGTGCCGCTT TTGGTTTTGA AGkTACACCA ATTCATATTA 3420
TAGCTCGAAA GAGAAATTAA CGATTGGGGG ATAACAATGA CTAAAATTAC CGTTTTTGGT 3480
ATGGGAAGTT TTGGGACAGC CCTTGCCAAT GTTCTTGCAG AAAATGGACA TGATGTTTTG 3540
ATGTGGGGTA AAAATCAAGA TGCTGTTGAT GAATTAAATA CATGTCATAC AAATAAAAAG 3600
TATTTAAAAT ACGCGAAATT A 3621

(2) INFORMATION FOR SEQ ID NO: 359:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 643 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 359: 
CTTTCGGAAA TTAGGATCnG nGCTATCTTG GCCCCAATTA CCAAGGGAAC TAnTGGCACC 60
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AATGCtTCTT TCATCTCCAT GCCCTGTTGC TCATTATTAA TAACACGGTC TATTAACACA 180

ATGGCATTTG TTACTACGAT TCCAATTAAC ATTAGCATAC CAATTAAACT TGGTACTGAT 240

ATTGTTTCTC CTGTGATTAA TAGTGCAATA ATTACACCGA TAACTGTAAA TGGTAAAGAG 300

AATAAAATTG TAAATGGTGC TAGGCCACCT TTAAATGTAA TAACTAGGAT TAAATATACG 360 

ATAATGATTG CAGCTAACAT TGCAAAGGCT AATTGTGTCA TTGCATTGTT AATATCATCT 420 


GATGCACCAC CGATATTAAC CTTTACATTA TTCGGTTTAT CCAAATTATT TATTTTAGAC 480 

ATCACTTGTC GTGTTGTGCC ACCCACATCT TTATTTGTTA CTTTAGCAGA TACCGTCGTT 540 

GCATAATCTC CTTGTTCTTG CGTCAATTTA CTTGGTGTCG TTGTTTTAAC TAACGTAGCG 600

ATATCTCCCA ATTTAATCGT ACCACCAGTC GGCTTTTTCA AAG 643 
(2) INFORMATION FOR SEQ ID NO: 360: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2524 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 360: 



TTTCAGGCyG TGCAAGGGCT TTTTCTTTTG CTTTTAAGAT TATGATTTAT CGTGCAAAGT 60

TAAGTGGTCG TATATAGTTT TAGTTTTAAA AAGGTAATTA AATAAAATAG TTTGCCGAGG 120

GAGATGTCAA AATGATTAAA ATACCTAGAG GGACGCAGGA TATTTTACCT GAAGATTCAA 180

AGAAATGGCG TTACATTGAA AATCAATTAG ATGAATTAAT GACATTTTAT AATTATAAAG 240

AAATAAGAAC ACCAATTTTT GAAAGTACAG ATCTTTTTGC AAGAGGTGTT GGTGATTCAA 300

CCGATGTCGT ACAAAAAGAA ATGTATACAT TTAAAGATAA AGGCGATAGA AGTATTACAT 360

TAAGACCTGA GGGAACAGCT GCAGTTGTGC GTTCATATAT TGAACATAAA ATGCAAGGTA 420

ATCCAAACCA ACCAATTAAA CTTTATTACA ATGGACCGAT GTTTAGATAT GAACGTAAGC 480

AAAAAGGACG CTATCGTCAA TTTAATCAAT TTGGTGTAGA AGCTATTGGT GCTGAAAATC 540

CTAGCGTAGA TGCAGAAGTA TTAGCTATGG TTATGCATAT TTATCAATCA TTTGGATTAA 600

AACATTTAAA GCTTGTTATT AATAGTGTAG GGGATATGGC GTCTCGAAAA GAATATAACG 660

AAGCGTTAGT GAaACACTTT GAACCAGTAA TTCATGAATT TTGTTCAGAT TGTCAATCAC 720

GTTTGCATAC AAATCCGATG CGAATTTTGG ATTGTAAAGT AGACCGTGAT AAAGAAGCGA 780

TTAAGACTGC ACCTAGAATC ACTGATTTCT TAAATGAGGA ATCTAAGGCA TATTATGAAC 840
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GTGGATTGGA TTATTATACA CATACAGCAT TTGAATTAAT GATGGATAAC CCTAACTATG 960 

ATGGTGcCAT TACAACGCTT TGTGGTGGTG GCCGTTATAA TGGTTTATTA GAATTGCTAG 1020 

ATGGTCCAAG TGAAACAGGT ATTGGTTTTG CGCTAAGTAT AGAACGATTA TTGCTTGCAC 1080 

TTGAAGAAGA AGGTATCGAA TTAGATATTG AAGAAAACTT AGATTTATTC ATTGTTACAA 1140 

TGGGTGATCA AGCAGATCGA TATGCTGTGA AGCTATTAAA TCATTTGAGA CATAATGGTA 1200 

TTAAAGCAGA TAAAGACTAT TTACAGCGTA AAATTAAAGG ACAAATGAAA CAAGCAGACC 1260 

GTTTAGGTGC CAAGTTTACA ATCGTTATTG GTGATCAAGA ATTAGAAAAT AATAAAATCG 1320 

ATGTTAAAAA TATGACAACT GGTGAATCTG AAACAATTGA ATTAGACGCA TTAGTCGAAT 1380 

ATTTTAAGAA GTAGAGAGGG CGTTAAAATA TGAGTAAGAG AACAACTTAT TGTGGATTAG 1440 

TTACTGAGGC ATTTTTAGGA CAAGAAATTA CATTAAAAGG ATGGGTTAAC AATCGTCGTG 1500 

ACCTTGGTGG ATtGATTTtC GTTGATTtAA GAGATAGAGA AGGAATTGTA CmAGTCGTGT 1560

TTAATCCTGC ATTTTCAGAA GAGGCaTTGA AAATTGCTGA AACAGTACGT TCTGAATATG 1620 

TTGTAGAAGT TCAAGGTACA GTTACGAAGC GTGACCcTGA AACAGTTAAT CCTAAAATTA 1680 

AAACTGGCCA AGTTGAAGTA CAAGTTACAA ATATTAAAGT GATTAATAAA TCTGAGACAC 1740

CACCATTTTC TATAAATGAA GAAAATGTTA ACGTTGATGA AAATATTCGA TTAAAATACC 1800 

GTTATTTAGA TTTACGTCGT CAAGAGTTAG CGCAAACATT TAAAATGAGA CATCAAATTA 1860 

CACGTTCTAT TCGTCAATAT TTGGATGATG AAGGGTTCTT TGACATCGAA ACACCAGTAC 1920 

TAACGAAGTC AACACCTGAG GGTGCACGTG ACTATTTAGT ACCATCTCGT GTTCATGATG 1980 

GTGAATTTTA TGCATTACCA CAATCACCAC AATTATTTAA GCAATTATTG ATGATTAGTG 2040

GATTTGACAA ATACTACCAA ATCGTAAAAT GCTTCCGTGA CGAAGATTTA CGTGCAGATC 2100 

GTCAACCTGA ATTTACACAA GTCGATATTG AAATGAGTTT TGTAGACCAA GAAGATGTGA 2160 

TGCAAATGGG TGAAGAAATG CTTAAAAAAG TTGTTAAAGA AGTTAAAGGC GTTGAAATTA 2220

ATGGCGCTTT CCCACGCATG ACATATAAAG AAGCGATGCG TCGCTATGGT TCTGATAAAC 2280

CAGATACACG TTTTGAAATG GAATTAATTG ACGTTTCTCA ATTAGGACGT GATATGGACT 2340

TTAAAGTATT TAAAGATACT GTTGAAAATG ATGGTGAAAT TAAAGCAATT GTCGCTAAAG 2400

GTGCAGCTGa ACAATATACT CGTAAAGaTA tGGGaTGCTT TAACAGAATT TGTaAACaTC 2460 

ymTGGtGCTA AgGtTAGCGT GGGGTTAAAG TTGTGGGAAG GTGGTTTTGA CAAGGTCCCA 2520 


ATGG 2524 
(2) INFORMATION FOR SEQ ID NO: 361: 
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(A) LENGTH: 1507 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 361:

TCGTTGAGTA AAAGTCCAGA AAATTGGATG AGTAAACTTG ATGATGGAAA ACATTTAACT 60

GAGATTAATA TACCGGGTTC ACATGATAGT GGCTCATTCA CTTTAAAGGA TCCAGTAAAA 120

TCAGTTTGGG CAAAGACTCA AGATAAAGAT TACCTTACCC AAATGAAGTC GGGAGTCAGG 180

TTTTTTGATA TTAGAGGTAG AGCAAGTGCT GATAATATGA TTTCAGTTCA TCACGGCATG 240

GTTTATTTGC ATCATGAATT AGGAAAATTT CTCGATGATG CTAAATATTA CTTGAGTGCT 300

TATCCAAACG AAACAATTGT GATGTCTATG AAAAAGGACT ACGATAGCGA TTCTAAAGTT 360

ACGAAGACAT TTGAAGAAaT TTTTAGAGAA TATTATTATA ATAACCCGCA ATATCAGAAT 420

CTTTTTtACA CAGGAAGTAA TGCGAATCCT ACTTTAAAAG AAACGAAAGG TAAAATTGTC 480

CTATTCAATA GAATGGGGGG TACGTACATA AAAAGTGGTT ATGGTGCTGA CACGTCAGGT 540

ATTCAATGGG CAGACAATGC GACATTTGAA ACGAAAATTA ATAATGGTAG CTTAAATTTA 600

AAAGTACAAG ATGAGTATAA AGATTACTAT GATAAAAAAG TTGAAGCTGT TAAAAATTTA 660

TTGGCTAAAG CTAAAACGGA TAGTAACAAA GACAATGTAT ATGTGAATTT CTTGAGTGTA 720

GCGTCTGGAG GCAGCGCATT TAATAGTACT TATAACTATG CATCACATAT AAATCCTGAA 780

ATTGCAAAAA CGATTAAAGC AAATGGGAAA GCTAGAACGG GTTGGCTGAT TGTTGACTAT 840

GCAGGATATA CGTGGCCTGG ATATGATGaT ATCGTAAGTG AAATTATAGA TAGTAATAAA 900

TAAGGATTCA ATAATGATAT TAAGACGAGT ATGAAAATAG TTAGATTCTA ATTATTTTCA 960

CTACTCGTTT TTATTTTGAA AATAAGTAAT AATTCAACAA TATTATAAAT TGAACAGATT 1020

GTTTGTGAAA TTTTTGATAA TATTAAAGTG AAAAAGTGTT ATAAATTGAT AAATATATGT 1080

AATTAACAAA AACAAATCAT TTTAAAAAGA AGAGAGTTGT AAGATGATGa AACGATTAAA 1140

CAAATTAGTG TTAGGCATTA TTTTTCTGTT TTTAGTCATT AGTATCACTG CTGGTTGTGG 1200

CATAGGTAAA GAAGCGGAAG TTAAGAAAAG CTTTGAAAAA ACATTGAGTA TGTACCCTAT 1260

TAAAAATCTA GAGGATTTAT ACGATAAGGA AGGCTATCGT GATGATCAGT TTGATAAAAA 1320

TGATAAAGGT ACATGGATTA TAAATTCTGA AATGGTTATT CAACCTAATA ATGAAGATAT 1380

GGTAGCTAAA GGCATGGTTC TATATATGAA TAGAAATACC AAAACAACAA ATGGTTACTA 1440

mTATGTCGAT GTGACTAAGG ACGAGGATGA AGGAAAACCG CACGACAATG AAAAAAGATA 1500
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(2) INFORMATION FOR SEQ ID NO: 362: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1216 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 362: 

TACATGTTTC GGATGCTACT TTATTTAGTT TGAAGGGTGC ATTATGGACG TTAGCGCAAG 60

AAGTTTATCA AGAATGGTAT TTAGGATCGA AGTTGTATGA AGATGTTGAA AAGAAAATAG 120

CACGAACTAC TTTTAAGACA GGTTATATTT ATCAAGAAAT TATTTTGAGA CCAGTAGATG 180 

AAGTTAAGGT ACTTCTGAAT GATTTAAAAG GTGCTGGTTT CGAATTAGGT ATTGCAACAG 240 

GTCGTCCTTA TACTGAGACT GTTGTGCCAT TTGAAAATTT AGGATTGTTA CCATATTTTG 300

AAGCTGATTT TATTGCAACA GCAAGTGATG TTTTAGAAGC AGAGAATATG TATCCGCAAG 360 

CACGACCATT AGGAAAGCCG AATCCTTTTA GTTATATCGC AGCTTTATAT GGTAATAATC 420 

GCGATAAATA TGAATCTTAT ATCAATAAGC AAGATAACAT TGTAAATAAA GATGACGTAT 480

TTATAGTAGG CGATTCGTTA GCTGACTTAT TAAGTGCTCA AAAAATAGGT GCAACGTTTA 540

TTGGAACATT AACAGGTTTA AAAGGTAAGG ATGCTGCAGG TGAGTTAGAA GCGCATCATG 600 

CCGACTATGT TATTAATCAT TTAGGTGAAC TTAGAGGTGT ACTAGATAAT TTGTAATTTG 660 

ATTGTTGTTT GACAGCATAA CTTGTAGTGA ATGATTGAAC CAAAGGTTTC ATATTGAGTT 720 

ACAATGAAAT TAATAATGAA AAAATGCCAA GAAGCAATGG AAGTAATCCA ATGTCTTCTT 780 

GGCATTTTGA ATTTACATAA ATTGTTTATG ACTGTACCGT CAATTCAGTT GTGAAAATTT 840 

GATTGTATTC ACCAACTTGT TTAAGTTCAT CAATTATATT GTTTGAAACA GGTTGATCAA 900 

CGGATAAAAT CATTAGCGCA TCTCCGCCCG CTTCAGTTCT ACCTAAAGTC ATAGATGCAA 960

TGTTGATATT GTATTTACCT AACAATGCGC CAGTTTTTCC TACCATACCT GGAGTATCAT 1020

TATGATATGA CACAATTTGA TATTGATTTG GCTTTAAGTC TACAGAAAAA TTATTAATTC 1080

TAACAATTCT TGGACCGAAA CCTGTAAAGA CAGAAGCGCC AACTTTAACG GAATCGCnAT 1140

CGnTTGATAG TTCTACCTCT AAGTAGTTAC TAAAACCTGT CTCTGCTTTA TTATTTTCAA 1200

TATTTAATGT CACTTG 1216

(2) INFORMATION FOR SEQ ID NO: 363: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 867 base pairs 
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(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 363: 

ATCAAAATAT TTTAAATAAT ATTGATGGTC ACATTGTAAA TrmAATAGAA AATAAATTTG 60

ATCAAATATT ACAAGAACCA TTAAATCCAT TAAATTATGA TACTGTCAGT GGATTAGCTG 120

GGATAGGGAG ATATTTGCTA AATAGAGTAG ATGAGAATGA ATTTAATGTT AAAGCATTAA 180

AAAGCATATT AGTATACTTT AAAGATATTC AATATTCTAA AAATAGCTGG GTAGTCCCAC 240

AAGArAGTCA ATTTTTAGAG TCTGATAAAA ATTATTTTAC TGAAGGTAAT ATCAATCTTG 300

GCCTTGCACA TGGaGTGCTA GGACCGATGT CTTTATTTGC ATTATGCGTG ATTAAAGGAA 360

TTACGATTGA AAATCATCAG CACATATTAA AAGACATGTA CAAATTTATC ATGGACGAAA 420

AATTTTGTAA CCACGAAAGA TGGTTGCAGC GTTACGATTT AATTTCTGAA CGTAATCATT 480

TCAATTTTAT TCGGAATGGT TGGTGTTATG GCAATACGGG TGTAATGACG ACGTTGTTTT 540

TAATCGGCCA AGCATTACAA GATGATGAAA TAATTAAAAT GTCTAAAAAA GTGATGCTAC 600

AAGTAGTAAA TGATAAAGAT GAAAATTTAA TAAGTCCAAC TATTTGTCAT GGATTGTCAT 660

CACAAATATT AATGTTAACA ATTATGAATT TGAATTTTGA ATTAAATGAA GTGTCTGATT 720

ATATCACTGT ATTAATAAAT AAACTGATTT CTCATTATAA GGAAGATTAT CTGGTGAATT 780

TTATAGACAT TAATGAAAAT AAGCAAGATG TATTTAAAAG TAGGAAAGTT GGCCTTTTAG 840

AAGGTGAATT AGGGGTCATC TAACATT 867

(2) INFORMATION FOR SEQ ID NO: 364:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 10813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 364:

TACCTTTTCT TtTAAATCAT TTTATATTTT CCCaCTAATA TCCGcTGtTA ATCaATCCtG 60

ACATCCtTGT ATCaCTATGA CAATTAATTG TTAAATACAT GAATTTCTAC ATTTTATGAA 120

AAAATCCATT TTTATTACAA TTCAACACTT TATATGACAA CTTCATTACA GTTACTTTTA 180

TTGTTGATTG CTTACATTGT TTTCTAAAAA AAATTTGTTA TCATAATTAA CGTTGAATAA 240

AGAAAAAAAT TAACTTGGGA GATAAAAATG GAATATAAAA AGATACTAAT TCGTTTATTA 300

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CAATCGCATG CCGCAGTTAA TTATTATAGT AAAAACCAAT GTACATGGTG GGCATTTAAA 420 

CGTCGCGCAC AAGTCGGTAA ACCTGTTTCT AATAGATGGG GCAATGCTAA AAATTGGTAT 480

TACAATGCAC GTAAATCAAA ATATGCGACT GGTCGTACAC CAAGAAAATT TGCTGTCATG 540 

CAATCAACTG CAGGATATTA TGGACATGTC GCAGTTGTTG AACAAGTATA TAAAAACGGT 600 

AGTATTAAAG TTTCAGAATA CAACTTTTAT CGCCCATTAA AATACAATAC ACGTGTACTA 660 

AGCAAAAAGG CAGCACGTAA CTTTAACTAT ATTTACTAAT CAAAAAACTT CTATCACGAA 720 

CGCTTCAATT TCCTGTATGC GTGTGATAGA AGTTTTTATT TTATGAAATT ATATTATTAC 780 

TTCTACAAAT TTCAAATTGC CGTAATTGAA CGTATATTTC TTCTTCAACT ATTATTTCAT 840 

CTTTAGCATA ATCTATATAT AAAATTTTAT GCTATTATTT AAATAATTCG CTATAACTTA 900 

ACATACGTTT TCGATATAAA CCTTGTTCTA AATCTCAATA ATTTTTTGCT GTTTTCATCG 960

TCATTAGTTA AAAAAATAAT TTAACTGAGT TTTTGAGACC TGTTTAAATT GATGACTCAA 1020

AACCTTTATC CCCTTTTCAC TCGGTTTAAT TGCTTTAATA TTTAACACAG TCTCATTGTA 1080

ATTTTGCTAC TAAGTTTGAA ATATTTCGAT TCGAATGTGA AcATCATTTA TTATTACTTT 1140

CGACAGCTAG AAAAATATTG TTAAACCAAA ACTATAATTA CCACTTTTAT ATAGAACTAT 1200

ATATAAATTT TAACTTTGAG ACAGAACTGC TAGTCAGGTT TATGAATATA TTTCTTTAGT 1260

TTACTTGaTA TACTTATTGG TAAATCATTA TTTATTTGAA GAGCATTTAT AACAAAAAGT 1320 

ATAATCCCAG TTATAGCGAT TCCATGACCA TTTTTATCAG TCAAAGATTC ATCAACAAAA 1380 

ACTTTTTGAT AAATTTATAT ATTTGTATAA TTTATTATGG TAGATATCTA CACCCGTATC 1440 

TATAACAGCT ACAGTTATCA TTGCGTCTTA TCCAAATAGT TTTTAAGAAA TAAATATAAT 1500 

TCATTAATAT CATGGCTATT TGTAACGTCA GtGCTTaAGA CCTTGTCAAT TATTGTTTTA 1560 

TCTGTTTCTA AGAGCCCCTG ATTTTCTGCA TACATATTTT GTATATCGCT GTTAATCTCA 1620 

TTTAATTTTT TAAATTTTTT TCTAACCTTT TTCAAAATAA AATGCTCTTT AATTCCATCT 1680 

CTTAAATACA TATAAATCTC TATTATAATC TCCATAAATG TCATATCTCC TCTCTCAAAA 1740 

AACATTATAC TCAATATATT AAAAATAAGA AAGGTTTTTT CAGAATTTTT AGTATTTTCA 1800 

GATATTTATT CTTAAGTATC TTATTATTAC TTCTTTTTAA CGCTAGTGGa ATAATTAATA 1860

ATAATTGTAT TTAAATGCAA TGCTGTTGTT CTTTCATACT TACAAGCAAG TCATAAGAAA 1920 

TGAGAAATTA AATTCTTATA TGATGAATAA ATGATGACTG CAGAAGAAAT TGCTAAAAAA 1980 

GGAGTGGAGT GAGGCATTCT ACTGTTCATA TTGTTAACAA ATATCAAAAT GATAGGAACT 2040

TGAAGAATCT AAGACTGTTA AAATCGAGAC AAGAAGAAAC TCGTTCCAAA TTTGAATCCT 2100
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AAAACTTCTA TCACGAACGC TTCAATTTCC TGTATGCGTG TGATAGAAGT TTTTATTTTT 2220 

GTTCATATTA ATTTATCTAA GCGCTACGAT GGaCTGACTT AAACTTTTCT TTAAAATCGC 2280 

TATTCGCCAT TTACTATTGT TGTCTAATTT CTTGTAAAAT ATGTTCCGCT GCTTGTGTAT 2340 

TTGCACGGGG TTCTTTTTTC AAAGCTTCAG CTACTTTAGC AATTTCATCA CCTTTTGCCC 2400 

CTACAACGAT AGCTAATGAT TTATATTGTA AACTCATATG ACCTTGTTGA ATACCTTCTG 2460 

ACACAAGCGC GCGACATGCT GCAAAGTTTT GCGCTAAACC AACGGCAGCA ACTACATGAC 2520

CTAATTCTTG TGCTGACTCT ACATTTAGTA GCTCTAATGA AGCTTTAGCA ATTGGTAATA 2580

CTTTTGTACC ACCGCCAACG ATTGCCAATG TCATAGGCAC TTCAATTGTA CCAATCAATC 2640

GTTGACGATC TTGATCGTAA CGCCATGTAG CAATACCACG ATACTGTCCG TCACGACTCG 2700 

CGTATGCATG CGCACTTGCT TCTGCACCAC GCGTATCATT TCCTGTTGCT AAAACAACAG 2760 

CATGTATGCC ATTCATAACA CCTTTATTAT GTGTTGCTGC ACGATGAATA TCTACTTGGG 2820

CCAATACAGA AGCACGTTCC ATTCGTTTGG CAACCTCTTC TCCAGTTCTC TCGCCCCTTG 2880 

CTAAATCTTT AACATCAATT TCGCCTTGAA CTTTAACAAC GGACGCTGTT GCATGATTGG 2940

ATAAAATACT CATTAAAATG TCGCTTTGCG GAAATTCATT TTTTAAAAAT GCAGTTATGG 3000 

CCTCTAAAAT CGTATTAAGC ATATTAGCGC CCATAGCATC TTTCGTATCA ACAAATACTT 3060 

TTAAAGATAG TAACTGTTGC TCAGGAAATG TATCAATCGC TATACGTTGG TAACCACCAC 3120 

CACGCGCTTT AATAGAAGGA TATGCCTCAT CCGCAATTTT ATGAATTTGC TTTTCTAAAG 3180 

CTTTAATGTC TGCTGATAAT TTTTCAGTAT CGTCAACGCC ATCAAAGACG ATTTGACCTA 3240

TCATAATACG TTCAGAAGAT ACCGTTTTAA ATCCGCCAGT CTGATTCACT AGCTTTGCAC 3300 

CATAACTAGC TGCAGCGACA ACTGAAGGCT CTTCCACCAT CATAGGTACA ACATATGCCT 3360 

TATCGTCCAC AATGATATTC GGTAATAATC CAACGGGTAA TGCACCTTGC GCGATGACAT 3420 

TTTCAATTAA ACTATTGGCT ACTTCTTCAT CGATTAATGG ATGATTCAGT AAAATGTCGA 3480

ATTGTTCTTC TGATAACCAT TGCTTATCAA CCAATTGTTG TAACTTTTCT TTACGAGATA 3540

AATGTCGAAA ATTCTTATCT AAATTTTGCA TGGACGTACT CCTTTTACTT CACATAATTT 3600

TTAACATTTC AATCACTACT ATTTTTACCA CAAAATAACG TCATTCGTCT TAAAATTCAA 3660

TTGAATAATT GTCGTTTTGA CTTTAAAATA AAACAAGGTA AATTAAAACG CTTACAAGAA 3720

ACGACAAATc ATTTTTAAAT TTAGTATATT TCTTTGTATA AAATTAGCAT ATTCTGATAT 3780

GATACAAGTG TTGCTTTTAT AAATTTGAAA GGATGTAAAA CCTTATGACA ATAGGTATCG 3840 

ATAAAATAAA CTTTTACGTT CCAAAGTACT ATGTAGACAT GGCTAAATTA GCAGAAGCAC 3900 
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CTGTAAACCA AGACATCGTT TCAATGGGCG CTAACGCTGC TAAGGACATT ATAACAGACG 4020 

AAGACAAAAA GAAAATTGGT ATGGTAATTG TGGCAACTGA ATCAGCAGTT GATGCTGCTA 4080 

AAGCAGCCGC TGTTCAAATT CACAACTTAT TAGGTATTCA ACCTTTTGCA CGCTGCTTTG 4140

AAATGAAAGA AGCTTGTTAT GCTGCAACAC CAGCAATTCA ATTAGCTAAA GATTATTTAG 4200

CAACTAGACC GAATGAAAAA GTATTAGTTA TTGCTACAGA TACAGCACGT TATGGATTGA 4260 

ATTCAGGCGG CGAGCCAACA CAAGGTGcTG GCGCAGTTGC GATGGTTATT GCACATAATC 4320

CAAGCATTTT GGCATTAAAT GAAGATGCTG TTGCTTACAC TGAAGACGTT TATGATTTCT 4380

GGCGTCCAAC TGGACATAAA TATCCATTAG TTGATGGTGC ATTATCTAAA GATGCTTATA 4440

TCCGCTCATT CCAACAAAGC TGGAATGAAT ACGCAAAACG TCAAGGTAAG TCGCTAGCTG 4500

ACTTCGCATC TCTATGCTTC CATGTTCCAT TTACAAAAAT GGGTAAAAAG GCATTAGAGT 4560

CAATCATTGA TAACGCTGAT GAAACAACTC AAGAGCGTTT ACGTTCAGGA TATGAAGATG 4620

CTGTAGATTA TAACCGTTAT GTCGGTAATA TTTATACTGG ATCATTATAT TTAAGCCTAA 4680

TATCATTACT TGAAAATCGA GATTTACAAG CTGGTGAAAC AATCGGTTTA TTCAGTTATG 4740

GCTCAGGTTC AGTTGGTGAA TTTTATAGTG CGACATTAGT TGAAGGCTAC AAAGATCATT 4800

TAGATCAAGC TGCACATAAA GCATTATTAA ATAACCGTAC TGAAGTATCT GTTGATGCAT 4860

ATGAAACATT CTTCAAACGT TTTGATGACG TTGAATTTGA CGAAGAACAA GATGCTGTTC 4920

ATGAAGATCG TCATATTTTC TACTTATCAA ATATTGAAAA TAACGTTCGT GAATATCACA 4980 

GACCAGAGTA GTCGGTGTAT TTAAAACACA TATAATAAAA CCTAAAAGCA GCAGTAAGAC 5040 

CACTTCTAAT TGAAATCGTC TTACTGCTGT TCTCTATTTA TAACACTTCG TATTGAATGA 5100 


ATTCATTATG CCTATTTGAC ACATTATTGA AGTTTTCCTA ATGCCTGGAT CCTTTATACG 5160 

TTACGGCTTC GTGCTATGTT TTGGTACATA AAGCTTTGAC ATATCGATAT TCTCCAACTC 5220 

TAACAGCTTA ATTTTATTAT TAATCGTTCC ACCGAACCCT GTTAAGCTAC CCGTTTTACC 5280

GACAACACGA TGACATGGCA CGATAATAGA TAATGGATTA CTTCCGACTG CACCTCCAAC 5340

CGCTTGGGCT GACATTTTTG GCTTGTTAAG CAGCTTGCCT ACTTTTTTGG CAATAGCACC 5400

ATACGTTGTT AGAGTCCCAT AAGGAACCTG TCTTAATTCA TTCCAAACAC ACTGTTGAAA 5460

ATGACTACCT GTTGGCTTTA AAGGTATTGT GATTTCAGGA TTGTCACCTT TAAAATACGC 5520

GTCTAACCAC TGTGTCGCCT CTCTAAATAT CGCTAAAGAC GTATTTTCTT CCCTAGTACC 5580

ATCACCTTGT TGATTTTCAA ACAAAACAGC GGTCAGACTT ACCCCATCAC TCAAAAGTTC 5640

CAATCGTCCT ACAGGCGAAT CATAGTAACT CTTATACTCC ATAAAAATTC CCCCTTTTTC 5700
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ATAAGTCGTC AATTACGTAT ATAAACACGT AATACCAGCT ATCACTTTGC TGCAATATAC 5820 

AGTTACATAT CTTACTACAC GTGCTAACCT CTTACTTTGT AAACCAAATC TTAAATTAAA 5880

ATATTGAAAA TGCAATGAAT CCTTAATATT TTATTAAACC TATAATTACT TATTAAAAAT 5940 

AACACACAAT ATTCATAAAG TTTTAAAAAT ATTCTGTTTT ATCACCTACT ATTAGTGGAA 6000 

AAGTACAATT GCAATTGTAT ATAGTTTGCA TAACGCTTCA AAAGTAATTT CTTTTTTGTT 6060 

TAGTTCAAAA AAATTTAGAG GTGATGTTAT ATGAATAACG GTTTTTTCAA TAGCGACTTT 6120

GATTCAATTT TTCGAAGAAT GATGAAAGAT ATGCAAGGTT CAAATCAAGT CGGAAACAAA 6180

AAGTACTATA TTAATGGTAA AGAAGTTTCA CCTGAAGAAC TAGCGCAACT CACACAACAA 6240

GGTGGCAATC ACTCTGCTGA ACAAAGTGCG CAAgcTTTTC AACAAGCAGC ACAAAGACAA 6300

CAAGGGCAAC AAGGTGGCAA CGGCAATTAT TTAGAACAAA TTGGTCGTAA CCTTACGCAA 6360

GAAGCACGTG ACGGTTTATT AGATCCAGTC ATTGGTCGTG ATAAAGAAAT TCAAGAAACT 6420

GCTGAAGTTT TAAGTAGACG AACTAAAAAC AATCCTATAT TAGTTGGAGA AGCTGGTGTT 6480

GGTAAAACTG CGATTGTTGA AGGTTTAGCA CAGGCAATCG TTGAAGGAAA TGTACCAGCA 6540

GCAATCAAAG ACAAAGAAAT TATTTCTGTA GACATTTCAT CATTAGAAGC TGGAACGCAA 6600

TATCGTGGTG CTTTTGAAGA AAATATTCAA AAATTAATCG AAGGTGTTAA ATCTTCACAA 6660 

AATGCCGTAC TATTCTTTGA TGAAATCCAT CAAATTATCG GTTCAGGTGC CACAGGAAGT 6720 

30 

GATTCAGGTA GCAAAGGGTT ATCTGATATT TTGAAACCTG CATTAAGTCG TGGTGAGATT 6780 

TCTATTATTG GTGCAACAAC ACAAGATGAA TATCGAAACA ATATTCTTAA AGATGCTGCA 6840 

TTAACGCGCA GATTTAATGA AGTGCTTGTT AATGAACCAA GCGCTAAAGA TACTGTTGAA 6 900 

35 

ATTTTAAAAG GTATTCGCGA AAAATTCGAA GAACACCATC AAGTAAAATT ACCAGATGAC 6960 

GTATTAAAAG CATGTGTTGA CTTATCAATT CAATATATTC CACAACGATT ATTACCAGAT 7020 

40 AAAGCAATCG ATGTGTTAGA TATTACAGCA GCACATTTAT CTGCGCAAAG TCCAGCTGTC 7080 

GATAAAGTTG AAACTGAAAA ACGAATTTCT GAATTAGAAA ATGATAAACG TAAAGCAGTA 7140 

AGTGCTGAAG AATATAAAAA AGCTGACGAC ATTCAAAATG AAATCAAATC ATTACAAGAT 7200 

45 AAATTAGAAA ATAGTAATGG TGAACATACT GCTGTTGCTA CAGTTCATGA TATTTCAGAT 7260 

ACTATTCAAC GATTAACTGG TATTCCAGTT TCTCAAATGG ATGATAACGA TATTGAACGT 7320 

TTAAAAAATA TTTCTAATCG TTTAAGAAGT AAAATCATAG GTCAAGATCA AGCTGTAGAA 7380 

50 ATGGTTTCAC GTGCAATTCG CCGTAATCGT GCTGGGTTTG ATGACGGCAA CCGTCCAATT 7440 

GGCAGTTTCC TATTTGTTGG CCCTACTGGT GTTGGTAAAA CAGAGCTTGC TAAACAATTA 7500 
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GACACAACAG CTGTTTCAAA AATGATTGGT ACAACTGCTG GTTATGTTGG TTATGATGAC 7620 

AATTCAAATA CGTTAACTGA AAAAGTACGC CGTAATCCAT ACTCAGTCAT TCTATTTGAT 7680 

5 

GAAATCGAAA AAGCAAATCC ACAAATTTTA ACATTGTTAT TACAAGTAAT GGATGATGGT 774 0 

AATTTGACTG ATGGTCAAGG TAATGTCATC AACTTTAAAA ATACAATTAT TATTTGTACA 7800

TCAAATGCTG GCTTTGGCAA TGGCAATGAC GCTGAAGAAA AAGATATTAT GCACGAAATG 7860 

10 

AAAAAATTCT TCCGCCCTGA ATTCCTTAAC CGCTTCAACG GCATCGTTGA ATTCTTACAT • 7920 

TTAGATAAAG ATGCATTGCA AGATATCGTC AACTTATTAT TAGACGATGT ACAAGTTACA 7980 

15 TTAGACAAAA AAGGTATTAC GATGGACGTT TCTCAAGATG CGAAAGATTG. GTTAATTGAA 8040 

GAAGGCTATG ATGAAGAATT AGGTGCACGT CCATTAAGAC GTATTGTTGA ACAGCAAGTA 8100 

CGTGACAAAA TTACAGATTA CTATTTAGAT CATACAGACG TTAAACATGT GGATATAGAT 8160 

20 GTTGAGGATA ACGAATTAGT CGTAAAAGGT AAATAACGAC ACTTTAACAT ATCGCGCATC 8220 

AAAAATGAGC ATCAGGTCGC CCTTGCCTGT GCTCATTTCT TTAATTATTT CCCTGGAAAA 8280 

TGATTCGCTG TGTGCTGTTC TGTTCCACAA CAATCACGAT TAATGTCACA TGTACCACAT 8340 

25 TTTCCTTGTT TTGAACGCTT GAAAAATTTT ACTAGTGTAT ATAAGGCATA TCCGAAAATT 8400 

GCTAAAAAAA TTAAAATGTT -AATAATGACT GACACTTTAA CCACTCCTTA AACAAATAAA 8460 
TGTCCGACTT GATAAAAAAT GAATGTTAAG ACATATGCAG TGACTAGAGG ATAGGCAACT . 8520 

30 

GCAAGTGCCG TCCATTTCCA TGAATAAGTC TCTTTACGGA TTGCTGCTAC TGTAGAAACA 8580

CAAGGAATAT ACAATAGTAT AAATATCATA AATGCATACG CAGATAGCGG TGTGAATTGA 864 0 

TTTTGAATCA CATTAACAAG GCCTGCATCA CCTGATGAAT AGATAATCGC CATCGAACTT 8700 

35 

ACGATAACTT CTTTTGCTAA AAATCCTGGC AcTAACGTAG CACCTGCTTG CCATGTTCCA 8760 

AATCCGAGCG GTTGCACTAA CATACCAAAG AAACTACCAA CCATATGTAA AAAACTTTGA 8820 

TTGATATTCA CATTGATACC ATGTGGTCCT ACATAACTTA ATAGCCAAAT GACTACTGAG 8880 * 

40 

CCGCCAAAAA TAAATGTACC TGCTTTACGA ACAAAGCCCT TAGCCTTTTC CCAAGTACTA 894 0 

CGCCACAACG TTTTAATGGA AGGCACACGG TATGTTGGCA ATTCCACAAT AAAGATTGCA 9000 

45 TTATCATTTT TTAAAATCGT CTTAGTAAGT ACTGTACTGA CTAAAAATGC CATAATAATA 9060 

CCTAAAACAT ACAGGCTTAA TACTACTAAA GATTGATTCT CTTTGAAAAA GATACCTACG 9120 
AACAACGCAT ACACTGGCAG TCTAGCAGAG CATGACATGA ATGGTGCAAT TAATATCGTT ' 9180 

50 GTTAAACGCT CTTTTTCATT TTCAATACTG CGCGCAGCCA TAATACTCGG TACATTACAA 924 0 

CCAAATCCGA TAATCATTGG TATAAAAGAC TTCCCGCTTA AACCGAACGA TTCCATAATA 93 00 

AAAAAGAGCA CAACAATTTG TGGTACAAAG ACTAATACTG ATCCTACACC AGCAATAATG 9420 

CCATCTGTAA TTAAATCTTG TAAAAATGGT ATAACACCAA GATAATTCAT AATCGTCTTC 9480 

ACACTATCTG TAAATGTACC ACCTATAAAT GCATCGAGTT GATCCGACAA AGGTGTGCCA 9540 
ATCCATGTAA ATGTAGTTTG AAAGATCAAC CACATAATTG CTAGAAAGAT AGGCATCCCT . 9600 

ATATATTTAT GTGTTAATAT CTTGTCTATT CTAGAGCTGA AATATTGCTT ATCTTCATCT 9660 

GGATACGTTA CCACGTCTTG CAATAACGTC TCAATATAAT GATTGCGTAT ACGCTCCATC 9720 

TCTCGACGAA CAGATACAGC CCCTACTTGT TCAGCAACTT GATCACGTAA ACTCGACAAT 9780 

TTATTTACAA CCTCTGAATT AAGTTCGTTT GCAATTTGCA TGTTATTTAA TAAGAATTGA 9840 

ATCGCAATAA ACCTAGCTTG ATACTTATCA TGAGATGTCT CTGTCATTAT TATTTGACAC 9900 

ATATTTTTAA TTGTCTCTTC AATCTTCTCA CCATAATTGA TTTTAAAATG CGGTTGATAC 9960 

" CCTTCCCCTA GATGCTTTAT TTCGCCAAGT AAATATTTTG TTCCTTTGCC TGTACGTGCC 10020 

ACAACTGGAA AAATAGGTGT TTTTAACTTT TTCATCAATT TATGATAATC GATTTTTATC 10080 

CCGCGCTTTG TAGCTACATC AATCATATTT AATCCGATGT ATATTGGTTG ATTAAGTTCT 10140 

AACAATTGTA CTGTTAATTG CATATTTCTT TTTAGTTGAC TCGCATCAAC AATGTTAATG 10200 

ATTCCTGAAA ATGAATCGTT TAATAAATAG TCTGTCACTA CAGTTTCATC TTTAGAAATC 10260 

GGCGATAAAT CATATGTACC TGGTAAATCA ATTAATTGTC CTACATTTTC TTTAAGTTTC 10320 

CCTACTTTTT TCTCTACCGT TACGCCACTC CAGTTGCCTA TATATTCATA CGAACCAGTT 10380 

AAAGCGTTAA ACAAAGATGT TTTACCAACA TTAGGATTTC CTAAAATACA ATAATTTTCC 10440 

ATTCGTCCGG CTCCTATTCT TCTAATGCAA TAGAACAAGC ATCGCAATGT CTAATACTTA 10500 

ACTGTTGTCC GTTTACTTCA ATAATACATG GCCCTTTAAA TAAACATTTT TGTTTAATCG 10560 

TTATGATAGC GTCATCTGTT AACCCAAAGG CACTTAGACG ATACAACATA TTCTCATTAG 10620 

CAATATCCAT TCGCTTTATT TTATAAGCCT TATTCATTTC ACCATTTTTA ATGTTTAACA 10680 

TACTATTTCG CTCTCCTATT AGAAATAATA ATCATTATCA CTTAAAAATC ATAACCCTTA 10740 

AAATTGTAGC TCGCAATACT TTATTTAAAT AATTTTCATT TTTCATGTAA AATTTGTGAC 10800 

ATTGCAAAAA TGT 10813
(2) INFORMATION FOR SEQ ID NO: 365: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6804 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
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(Xi> SEQUENCE DESCRIPTION: SEQ ID NO: 365: 





TGATGGATTA GCAGACATTT TACGAGCGAA TGGTTTCAAA GTGTTTGGTC CAAATAAGCA 60

AGCAGCTCAA ATCGAAGGCT CAAAATTATT TGCTAAAAAG ATAATGGAAA AATATAATAT 120

TCCAACTGCT GATTATAAAG AAGTTGAGCG AAAAAAGGAT GCTTTAACAT ATATTGAAAA 180

CTGTGAATTG CCCGTTGTTG TCAAGAAAGA TGGGTTAGCT GCTGGGAAAG GCGTTATTAT 240

TGCAGATACT ATTGAAGCAG CCAGAAGTGC TATTGAGATT ATGTATGGTG ATGAAGAAGA 300

AGGTACTGTT GTATTTGAAA CGTTTTTAGA AGGTGAAGAG TTCTCGCTAA TGACATTTGT 360

TAATGGTGAT TTAGCAGTAC CTTTCGACTG TATTGCACAA GATCATAAAC GCGCATTTGA 420

TCATGATGAA GGACCAAATA CTGGTGGTAT GGGGGCTTAT TGTCCmgTAC CACATATTAG 480

TGACGATGTT TTAAmACTTA CAAATGAAAC AATTGCACAw CCCATTGCAA AGGCAATGCT 540

TAATGAAGGT TATCAATTCT TCGGTGTATT ATACATTGGT GCTATTTTAA CTAAAGATGG 600

TCCAAAAGTA ATAGAATTTA ATGCCCGTTT TGGTGATCCT GAAGCTCAAG TATTATTAAG 660

TCGCATGGAA AGTGATTTAA TGCAGCATAT TATTGATTTA GATGAAGGAA AACGTACTGA 720

ATTCAAATGG AAAAATGAAT CTATTGTAGG GGTCATGTTG GCATCAAAAG GATATCCTGA 780

TGCATATGAA AAAGGGCATA AAGTAAGTGG CTTTGATTTA AATGAAAACT ATTTTGTTAG 840

TGGATTAAAG AAGCAAGGTG ATACCTTTGT TACTTcAGGT GGTAGAGTTA TACTTGCCAT 900

CGGAAAAGGT GACAATGTAC AAGATGCACA GCGAGACGCA TACAAAAAAG TATCACAAAT 960

ACAAAGTGAC CATTTATTCT ATCGTCATGA CATTGCGAAT AAAGCACTAC AACTTAAATA 1020

AGTAAATTTA AAATACTAAG aTTAGCTATG AACGAATCTA TAACGATAGA TTTTTTCATA 1080

GCTTTTTTAG TTGTAGAGTC TAGGACATTG ATTTCTGTAC CAAATTTGTG ATTATGCATA 1140

TGTAATACAA AAGAGGCGCC ACAACATGTT TGGATGAACA AAATAACATG TTTGTGGCAC 1200

CTCTTTTGTT TAGTATGGAA TAAATGGTTT TCTTTTTCTA TACAATGAAT TTCTAATTTA 1260

GTATCTATAC AATTATGGAT AAAATTTAAC CTACACGACC AAGACGAACA TCATCTATGC 1320

CCGTGATGGG TAAGGTGATT GAACAATAAT ATGCCATAGT AATAATGGCA ATTAAAACTA 1380

TAATAAAGAT TATATCTTTA TATGAGAAAG GTACGTTGTA ATAGTAAGTA CGAGGACCAT 1440

CTCTAAATCC TTTCGACTCC ATCGCAACTG ATAATTGATG TGCCTTTCTA ATATTTTGGC 1500

TTAATAGAGG TATAATTAAA TGCTTAAATC GCTTTAACCC TCTATAATTT GCCGCGTCTA 1560

TCATCTGATA GCGCATTTTT AAAGATCTGC GAAgcTGTAt TAAAGAACTA ATCATTAAAG 1620

GTATCATACG AATGGCAGCC ATGAATGCAT AAGCAACTTT TGATTTAACC TTTAAATGTT 1680
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ATGAAATAGC 


AATGGTTCTT AATGATACAT 


GTAAACCACG 


AACTAAACTT 


TCTGTTGTAA 


1800




TATGGATAAA TCCGAATTTC AAAATTGTAT GGCTACCATT CCCGTATAAA ATCATGAACA 1860

GGGAAGAGAG TAATGCAAAG CCAATACTTA TAGTTATAAA AATTGCTGTA ATTTTAAACT 1920

GAGTACCATT AAACATCAAT AAGAAAACTA ACATTAAGAT AGTGATATAA AGCATAAAAT 1980

CGAAATTATG CACAAATATA ATAAAGAAAA ATAGTATAAT TCCAAGAAAT AGTTTCGTTA 2040

TAATGTTGAC ATCATCAACA AATGATTGCC GAACTTTCCA TTGCTCATAC ATTCGTATCA 2100

CCATCACAAT CTAGTAACGC ACCATCTGAA ATTTTAAGTC TTCTTGATGG ATAACGTTCA 2160

ATTATTTCAT CGTCATGTGT AaCCATGACA ATACTTTGTC CCAAATTAAT TCGCTTTTGG 2220

AAAAGTTTGA TCAACTGGAA TGTATTATGG CTATCAAGTC CAAATGTCGG TTCATCTAAA 2280

AAGATAATAT CAGCTTTAGA ACTTAGTGCG GTAgcTACGC TAAGGCGTCG TTTTTGACCA 2340

ATAGACAACT CATAAGGATG TTGATCTTTT ACATTTTGTA AATCTAAAAG TTTTAAAAGT 2400

TGTATCGTTT CATCATCACT TTGATCTTTA GAAAGGTGAT TAAAATGAAT GTTAATTTCA 2460

TCATAAACCG AATTTGTTAT AAATTGTAAT TCTGGGTTTT GATAAACTAG GTACATGTGT 2520

TTTGCTGCAT GTTTAATTTT TGTTAAACGC TGATTTTCAA AATAAACATC ACCTTGATAT 2580

TTAATCAATT GCATAATTGA TTCAAGCAAG GTTGTTTTAC CACTACCATT TGCCCCTGTA 2640

ATTGTAATCC ACTCACCTAG ACCAATTTCT AAATCTGAGA ATGAGAGCAA TGTTGATTTA 2700

CCGCGAATAA TACGTCCATT TTTAAATTGT AATAAGTGTG AGTTTGTTGT TGGAAAGTCA 2760

ACACGACTTG GTGCGAATTC CCATGCACGT GGATGCCACA CACCATATTC ACTGAGTAAA 2820

TGAACATACT TCTGTAATAT GATTTCAGGA CATTCATCGG CAATGATATT TCCGTTATAA 2880

TCCATCAAAA TGACGCGGTC GACATGATTC CAGATGTGTT TAACTTTATG TTCAACGATT 2940

ACAACCGTTT GATCTTCCCA AAGTTCAATT AGTTTAGTCC ATAAATCTTC TGTTGCTTGA 3000

ACATCTAACA TTGCTGTCGG TTCATCTAAA AACAATGTTT TTGATTGTTG AAGAATGGTT 3060

TCAACAATTG CCAATTTCTG TTTCATCCCG CCACTTAAAT CTTTGATATA CGTTTCAGGG 3120

GTAACATTTA AATTGACCAT ATTTAAAGCA TTGATAATTA ACGCATCCAT GTCTTCACGT 3180

GGTAATTGTC TATTTTCTAA AACGAATGCA AGTTCTTCGT ATACTTTTGG CATACAAAAC 3240

TGGCTATCAG GGTCTTGGAA AATAACGCCA CTTAATGGGT CAACGATTAG TTCATCATAT 3300

TTCATAGGTA ATTCAATTAA ATTAGGAACA ATACCACTTA ATACATTCAG AAGTGTACTT 3360

TTACCGCAAC CAGAAGGACC GAGTAAAAGT ACTTTTTCTT TGTCTTGAAT AGTGATATTT 3420

AAATGATCGA AAATTTTACG TTGACCACTT GGATATTTTA ATCGTAAATC ACTTACTTTT 3480
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ATTTTGTTAC GCCTGTCTTA TCTAAAGCTT TTACTAAAAG GTAAGATAGG ACGCCGGCGA 3600 

CTACTGCACC ACTAATTAAT CTAAATACGA TGAATAATGT TAAGTTCCAA CCTGCAACTT 3660 

5 

CATTTAAATA ACCATAGAAA TAATCTATCG GGAAAGCCGC GATTGCTGTA CAAAAACCTG 3720 

CTAACATAGC TACGATAACT GAACGTGATT GATATTTAAA AATTGCAAAG ACAAGTTCAC 3780 

ACGCTAAACC TTGTATAAAA GCGTAAACGA TTGTCGGAAT ATCGAAACGA CCCATAATGA 3840 

10 

TAGTTTCGCC GGCACCTGCA GCAAATTCAG CCAGTAAAGC AATACCTGGT TTTGGAATAA 3900 

TTAGATAGCA GACAATCGCT GCCATGAACC AAACCCCGTT TGTTAATTGT TCGAGGTGAA 3 960 

i5 GGCCTGTAGC 7TGCACACCA TTGTAAACAA ACCACCATAA ATTGTAAATA ACTGCGAATA 4020 

CTACTGAAAT AAGTACGGTT ACTAGTATTT CAGATAGCTT TAAACCTTTT GACATTTTTA 4 080 

CATCCTCCTA ATAAAAAAAC GCACAACCAT CCATAGGAAA GTTATGCGTT CACAATATAT 4140 

20 ATTAGTAAAA CATATGTATA GTAACACTTT CCTACGCTAG TTCAAGCTAG ATCAGGTTCA 4200 

AAGGGTTTGA GGGCAAGCCT CATCTCAGTA TAAAACACCC CTAGTGTGTG CGATTTATTT 4260 

AATTAATTAT ACTGTAAGAC GTTTGTAAAC TTATGTCAAT AGGTTGTCTT CATGAAATTT 4 32 0 

25 CGTTTAATTC GATTTAAAAT TTATAATATT AGCATTGGAT TTAAATTGAA GATGTAGTAG 43 80 

GAATGTTAGT AATTAAAGAT ATAAAAATAT GTGACATGTA ATAATATTGA GCTGATAAAT 444 0 

GAAGAGGGAT ACTTATCAAT CATACCTCTT TAACAACAGT GAAGAACCCG TGCATAATGG 4 500 

30 

CTTACGAATT ATAGTTTATA AGGAAGAAGA GGGATACATG CGCCGAGCAC ATGCATAAAA 4 56 0 

GCCCCTAACA ACTAAAAGTT GTAAGGAAGG AGAGGGATAC ATGCGCCGAG CACATGCATA 4 620 

AAATCCCCTA ACAACTAAAA GTTGTAAGGG GATTTAAATT AATJTAGTGT ATCTTGGATA .4680 

35 

TCTTGTTTTG kTTGaTTAAT ATCTTCTGTT TTTTCTTCTT TTTTATCTTT TAATTTTTCT 4740 

TCAACTTCTT TAGCTTTTTC TGCTGCTTTT TTATTTTGAT TTTCATTAGA CATGATTAAT 4800 

TCCTCCCAAA TTGGATAATT ATTTATATAT AAATCTTACC CGGTTGTACT TTCGTTAAAC 4 860 

40 

TTTTCTAAGT CTATAGCACT ATTTATTCAT TTATCTAAAG ACAACAACAT TAGATTAATA 4 920 

TATAATGATT TTGAGGTGAA CATAATGTCT TTTCTTAGGA AACACGCCGA AATTATTTTT 4980 

45 AGCTATTTAA TCGGTmwCGT TcACTCTTCA CTGGkcTCAT TATTTTAATT AACTTGCCAT 5040 

TAATTAAACA ATTAAATGGT GGTAAAAAAG TTGATACACA TGTTCATAAT GTGTGGGAAT 5100 

TTCTGAATGC ATTTTTCAGT GAAATTATTA AAGTAATGAG TCGATTTATA GGTAATTTCC 5160 

50 CtATAGTTAG TGCAATTGTG ATAATTATAT TCGGTATTTT AGTTATGTTG ATTGGTCATA 522 0 

CATTACTTAG AACTATTAAG TATGACTATG ATATTTCTAT CTTTTTCTTA GTTATCGGTA 5280 

TTTTCATTAT TCCATTTACA ATTCATATAG GATATATCGT CTATAAAGAT GAATTGAATC 5400 

AGGAAAATGT AAAAAATCAT TTCATGTGGA TAATTGTGAG TTATGGTATA AGTTACTTAA 5460 

TTACACAAAT TGCATTGTAT GGCAGAATTG ATGCTAATGA AATAGAGTCA ATTGATATCT 5520 

TAAGTGTCAA TGCTTTCTTT ATAATTATGT GGTTACTTGG TCAAATGGCT ATTTGGAATT 5580

TCTTGTTCTT GCGCCGAGCT TTACCTTTAA CAAAGCAAGA ATTAGGTGAA GAGGAGCCAG 5640 

10 

AATTATCAAG AACAAGTAAA GGGAATGTCA CGAATCAAAC TAAAATTCAC TTGAAACAAC 5700 

TCCAAGATAA GACTACAGAA TATGCACGTA AGACAAGAAG AAGTGTCGAT TTAGATAAAA 5760 

TTAGAGCTAA AAGAGATAAA TTCAAAAAGA AAGTTAATGA TATTATCGAT ATTCAAGAAG 5820 

ACGATATTCC TGATTGGATG AGAAAACCGA AATGGGTTAA ACCAATGTAT GTCGAACTAT 5880 

TTTGTGGTGT CGTCATCTTT TTATTCACAT TTTTAGAATT TAATAATCGT AATGCATTAT 5940 

20 TTGTATCTGG TGATTGGAAA TTATCACAGA CACAATATGT TATTGAATGG GTTACATTAT 6000 

TAATTCTGTT ATTCATTATT ATCGCATATA TCGCTACAAC GTTAACTTTC CACTTGAAAG 6060 

GTAAGTTTTA TTATTTACAA TTATTTATGG GGAGCATTTT ATTCTTTAAA TTGTTAACGG 6120 

25 AATTTATAAA TATAATGATT CATGGACTAT TACTTTCAGT GTTCATTACG CCAACATTAC 6180 

TATTAATGTT ATTGGCAATC ATCATTTCTT ATTCGTTACA ATTACGAGAG CGACCATAAT 624 0 

TAAAAGCATT ATAAAAGTAC TATCTATTAA ACATTTTGAT GTGTACGCTA TAAGTTAGAT 63 00 

30 

ATATCTCTAA CTTACTTAGA TACAGGTCAA TGAAGTTTAT GGATAGTACT TTTTTTGTAA 6360 

CTAGATTTGA TTGATTCAGG TGATGTGAAT TAAGTATTGA rAATTGTATA CAAAGTTTTA 6420 

GTGCAAATAA AATAGTTGAA AAGTTATCCA TTTGTAAAAT CAAGAAAACT AGTAAATAGT 6480 

35 

TGAAGCGACT TATGGaATTT GCGAAACGAT ATATAGTATT TCCTTTGTAG AAATTTmACA 6540 

TATATCATTC AAATTACTAA TTTGTTAAAA TCAACAGTAA GATTAGAAGT AGATGATATT 66 00 

GAAATTTGGC AAACAaTtTA ATCTATATAA AACTACAACG AAACACAGAA AGGAAGTTGT 6660 

40 

CAGATGAAAA TAGCAACTCT GAACAAAGGC AAAGAAACAA AATATTTTAA TGGATATCCT 6720 

TTAATTGAAG AAGAGGATAT CTATTCACAA GATCATTTAA AAGAAGGAGA TATTTTTCAA 6780 

45 ATTGTGACTG ATAAATCACA ATAT 6804 

(2) INFORMATION FOR SEQ ID NO: 366: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1717 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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•(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 366: 





aaaAGAGACG CATTaAAACA AGCTATACAA ATTATCGATA AATTAACATG GGGTGTTTAG 60

TGGTGGTTAA AGAAATTTTG AGACTATTAT TCTTACTAGC GATGTATGAG CTAGGTAAGT 120

ATGTAACTGA GCAAGTATAT ATTATGATGA CGGCTAATGA TGATGTAGAG GCGCCAAGTG 180

ACTTTGAAAA AATCAGAGCT GAAGTTTCAT GGTAATAGCT ATTATCATTT TTGAATTAAT 240

TATATTAATG TGTGTAGCAA TAGCACTGGA GGTGTTGTAA ATATGTGGAT TGTCATTTCA 300

ATTGTTTTAT CTATATnTTT ATTGATCTTG TTAAGTAGCA TTTCTCATAA GATGAAAACC 360

ATAGAAGCAT TGGAGTATAT GAATGCTTAT CTTTTCAAGC AGTTAGTAAA AAATAATGGT 420

GTTGAAGGTT TAGAAGATTA TGAAAATGAA GTTGAACGAA TTAGAAAAAG ATTCAAAAGC 480

TAAAGAGAGG CGTTGGCTTC TCTGCTCTAT CyAAAATAAT GAAAGGAGCC saACATGTTA 540

GaCtnAAGtCA CTCAAATAGA AACAATTAAA TATGATCGTG ATGTCTCATA TTCTTATGCT 600

GCTAGTCGTT TATCTACACA TTGGACTAAT CACAATATGG CTTGGTCTGA CTTTATGCAG 660

AAGCTAGCAC AAACAGTTAG AACTAAAGAA GATTTAACTG AGTACAATAA AATGTCTAAG 720

TCTGAACAAG CCGATATAAA AGATGTTGGC GGATTTGTCG GTGGATATTT AAAAGAAGGC 780

AAACGGCGTG CTGGTCAAGT CATGAATCGT TCAATGCTAA CACTTGATAT CGATTATGCA 840

GCCCAAGATA TGACTGACAT ATTATCTATG TTTTATGATT TTGCATATTG TTTATATTCA 900

ACACATAAGC ATAGAGAGAT AAGTCCAAGA CTGCGTTTAG TGATTCCTTT AAAACGAAAT 960

GTAAATGCAG ATGAGTATGA AGCTATTGGG CGTAAAGTCG CAGATATCGT TGGCATGGAT 1020

TACTTCGATG ATACAACTTA TCAACCACAT AGGTTAATGT ATTGGCCTTC AACTAGTAAC 1080

GATGCGGAAT TTTTCTTTAC CTA7GAAGAT TTACCTTTGT TAGACCCAGA TAAAATATTA 1140

AATGAATATG TTGATTGGAC TGACACATTA GAATGGCCAA CGTCTTCAAG GGAAGAGAGT 1200

AAGACTAAAA GATTAGCAGA TAAGCAAGGC GACCCAGAAG AAAAGCCGGG AATTGTTGGT 1260

GCATTTTGTA GAGCCTATAC GATAGAAGAA GCTATAGAAA CTTTTATTCC TGATTTATAC 1320

GAAAAACATT CTACTAACCG TTATACCTAT CATGAAGGTT CAACTGCAGG TGGATTGGTG 1380

TTATACGAAA ATAACAAGTT TGCCTATTCT CATCATAATA CGGATCCCGT AAGCGGTATG 1440

C7TGTGAACA GTTTTGATTT AGTACGCATA CACTTATATG GTGCTCAAGA TGAAGAAACT 1500

AAAACAGATA CTCCGGTTAA TCGACTACCT AGTTATAAAG CAATGCAGCa AAGAGCGCAA 1560

AATGATGAGG TTGTTAAAAA GCAA7TAATT AATGACAAAA TGTCTGATGC AATGCAGGaT 1620

TTCGATGAAn GAGAAAATAG CGATGATGCA TGGTCTGAGA CGTTnGAAAT TACTTCGAAA 1680
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(2) INFORMATION FOR SEQ ID NO: 367: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1847 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 367: 





AAGATTAGGC ATCAAGACAG GGTCGCGATT GTTTGAAATC CCACATAGnA ATGATATTTa 60

CaTTATCAAT CCaAGTATGC GTAAATATCT TAATGTTTCA GTTGCTATTT CTAAGATTGC 120

ATTGCGTTAT ATTCCACCTG AAGATTTACA CCAATATAGT ATTGACGAAT TTTTTATGGA 180

TGTTACTGAT AGCTATCATA GATTTAGTTC TACAGTACAT GCATTTTGCG AAAGACTTAA 240

ACGTGAAATT TATGAAGAAA CAGGCATTTA TTGTACTGTG GGCATTGGTT CTAATATGTT 300

ATTAAGTAAA ATTGCTATGG ATGTTGAAGC GAacATAGTc AAAATGGTAT AGCTGAATGG 360

CGATATCaAG ATGTACCAAC GAAATTATGG CCAATTCmGC CCtTGCGAGA TTTTTGGGGT 420

ATTAATCGTC GAACAGAAGC CAAATTGAAT AAAAGAGGAA TTTTTACTAT AGGAGATTTA 480

GCGAAATATC CATATAAATT TTTAAAAAAA GAGTTCGGTA TTTTAGGTGT TGATATGCAT 540

CTACATGCGA ATGGGATAGA TCAGAGTAAA GTACGTGAAA AGCACAAGAT CAGCAATCCA 600

TCGATATGCA AAAGTCAAAT ATTAATGAGA GATTATCATT TTGATGAAGC AAAAGTAGTA 660

ATGCAAGAGT TAATTGAAGA TGTTGCTAGC AGAGTTCGAG CAAGAAAAAA AGTGGCAAGA 720

ACGATACATT TTGCCTTTGG CTATAGTGAT GAAGGCGGTG TACATAAGCA ATATACTTTG 780

AAAGATCCAA CAAACTTAGA AAAAGATATT TATAAAGTAG TAATGCATTT CGCAGATAAA 840

TTATGTAATA AACAAGCACT ATATCGTACG CTAAGTATAT CTTTGAGTCA ATTTATTAAT 900

GAGGATGAGC GACgTTAAGT CTGTTTGAAG ATGAATACCA ACGCAAACGT GACGAATGTC 960

TAGCTAAAAC GATAGACCAA TTACATTTGA AATACGGCAA AGGTATTGTG TCCAAAGCAG 1020

TATCGTTTAC AGAAGCAGGT ACAAAACACG GCAGATTAGG TTTAATGGCT GGACATAAAA 1080

TGTAATGACT ATACGGTTTA AGTAATATAT AACTGTGATT CGTATAAAAT AAGTCTCTAA 1140

AGATAAATAT TTCATATATC ACAATAGATT TTCACAATAA TATCTAAGAA TACATGGAAT 1200

TTATCAAAAG AGACTTAATA ATTATTGGAT ATAACAATCA AAATCACTCA ATGCTTGCAT 1260

ACCGCGTTCT CGGTCAGTAG GGTTTTTGAA ACTAATTTTT AAAGCACCGT ATATATCTTC 1320

GCGTACTTCT AAGATTCTTA AGTTGCTTAT AGATATGTTA TGTAAACTCA GGATATAAGT 1380
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TAGTCCACCT AGTTGTTTAG CGGGTAGTGC GTCGCGATAC GATTTAGCTT GGGCAAAAAA 1500 

TGATAACAAT TTTTCAGAAT CATTGCTTTC AATTAGTCTt TCTAAATCTT GAAACTGACT 1560 

TTTTAGCTGT CGAATCATTT CTAAAATATA CGTTTTATTA CTCAAGGTGA TATCTTTCCA* 1620 

CATTTGTGCA TTACTACTAG CTATACGAGT GATATCACGA AAACCACCAG CTGCAAGTTT 1680 

ATTAACTAAA TGATGTTCTT GACCGTTCTT TTGACTAACA TGAACTAAAC TAGATGCAAC 1740 

GATATGAGGT AAATGACTTA CGACGCTTGT TACGTAGTCG TGTTCTTCAG CAGTAGTTAC 1800

AATAAATTTA GCAAGAGTAG GTGATAACAG TTCTTTTAAC GTGTTTG 1847 
(2) INFORMATION FOR SEQ ID NO: 368:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 368:

AACAAAAGGC ATAAGTTCGT GAATTAATGC GTATACAAGG ATAAAGCTTA TAACAGTAGT 60

AATTGTTGCT ATCAAACGAA CAACATATAT TCTATTTTCA GATAGCAATC GATTCACTAA 120

TCGATAATTT ACGTATACAA GAATTAACAA CAGCACAATA TAAACAATAA TCATATTTGG 180

CCCCATTTAA TTTTTAAATT TGTTTTCACA TCATTTTCTA CTTTATTATC ATACTAATTT 240

TAAAGGCAAA GGTGGACATC GGCAACCTCT CGTAAACTAT TTATCAAAAA TAAACGTATC 300

TCATTGTTAT GATATTTATA AATCAATTCG TTTTTATTAA AGTCTTTTTC AACAAGTTTG 360

TCACTATCTA TTAAATAATC ACGCATGCAT CCTTTTAAGA AATCATCTTT ATAACTTGGT 420

GTGTACCATT TTCCATCCTC TTCAATGACA ATGTTGCCAA TATCAAATTC AAGGACCTTG 480

CCGTCCTCTG AAGT 494
(2) INFORMATION FOR SEQ ID NO: 369: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2518 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 369: 
GTCCATATTT CCCGTCCAAC CAACTAAAAA TGGGTAAATC CCTGGATTTa AATCTACTTC 60 
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GATAATTTCn AATGTAGCGA CCATCATTAC GAGATGATTT GATATAAGCA CAGTTTGGAT 180 

GTTGACCAAT ACTATCGCCT TCTTCTTCGA TGATATCTAT TTTAATACCA TCATCAGCTG 240 

5 CAATTTCTAA TGAAGATTTA ATTCGGTTAT CAAATGTTGA ATATCCCATT GCTCCACCCA 300 

CAATAGCGAC ATCTGTACCA TGTCCTTGGT GTGTTTGAGC AAATGATTCA TAATAATGTA 360 

TTTCAATATT TTT AGGAAT A TCTCCCAATA TTGCGCGTGC TGAATTCCCA ATCTTTACTG 420 

10 

CACCAGCCGT ATGAGAACTT GAAGGGCCCA TCATAACTGG TCCGATAATA TCGAAAGCAC 4 80 

TTTGATAATC ATAGCTCTTT GCCATAATTA AACACTCTCC TTAATATGAT TCTTTTTGCT 54 0 

CGGCATTTTA AAGTTGATAT TCATTAAATT AAACTTATTA ATTAGTGTTC CAATAATATA 600 

15 

GGCTAAGACA ATGCTGACAA TAATCACTGT TGCGATTGTT ACAATTGATG TCACTGCATT 660 

ATTAAAGCCA AACAATACGA TGGCGCCTGC AATTGGTGTT GCCATACCTT TGACACCTAT 720 

zo TACTAGTCCG CTAAATGTCA CGATACATGC GTTGACGACC CCAATCAGTG CATTTGTACC 780 

ATATAGTTGT ACTGGATATT GCGCTATTAA ATCAATTTGC GTCAATGGCT CAATACAAAC 840 

TGCAAATGCT TTTGACGGTC CACCAATGTT TAATTTTCGG AATAAAATAA GGTTAACAAA 900 

25 TGAGCTACCT GTACATGTTA GTGCTCCAAT AGCCATAGGA ACACCTGTCA GTCCTAATAA 960 

ACTTGTTAAT ACCATTGAAC TTAGCGGTGT CATACCTGTA ACAGGAATCA CTAGTCCTAA 1020 

AATGACCGCT AATGCATATG GATTGTTATC ACCTACCGCA GTGACAGCAC TACCTATTTG 1080 

30 TTTTAATGTT GCTAGCACAC CAGGTGTAAT GATTGATGCA AGTCCGAAAG CAATTGCTGG ' 1140 

TGCAAATAAG ATCACCACAA TTAAGTCCAA GCCTTCTGGA ACTTTCTTTT CAATCCATTT 1200 

AATTAAAAAA GcTACGCCAT AAGCTGCGAT GAATGCTGGT AATAATTTAA AGTCATGTAA 1260 

35 

TACTAAACCA ACAATGACCG CAAATACTGG TGCAACGCCT AAGTTTAAGC ACGTTAGAAT 1320 

ACCTACTGcG ATACCGCTTA AACTTCCTGC TAAATCCCCA ATATCTTGTA GAAATTTAAT 1380 

ATCAAATACG CCACCAATAG CATAACTTAA GAATGCTTGT GGTAGAAATG TCGCACAAGC 1440 

40 

TGCACCGGAT AATGCTTGTA GTCCTTGTTT ACCGTACGGT GCATACTTTA AAAATAGCGT 1500 

CATGATCACT AAAACCAAGA CTAATGTGCC TACACCTAAC AGAATATCCA TTTCCCAAAA 1560 

45 ACCTCTCTCT ATGTTTATTT TATTTTCAGA CCATAAACAT CGTACACCCT TAAGAAAACG 1620 

TTTTCAACTT TTATCTGTTA TCAAATCaAA TATTTAAGTG AAATATTTCA TATTTGTGAA 1680 

AGATTTTTAA AATGGATTGT TTCAAAAACT ACTTATGTTG TCGTTAATAT TTACTAATTA 1740 

50 ACTTTTTACT CTATATTTCA AACAGTTGTG TGACAGTTTT TTGATAACTT TTTTACATCT 1800 

GAAAGTAAGT AATATTTCTA AAAACTTTTA ATATTTATAC ACTTTATCTT TCGAGCTATT 1860 
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ACACATATAT TTGCAATAAG ATAATTAAAG TAGGATATTA TTTTTAGTTT TCTGATAGGA 1980 

ATGATGATAG TCTATAGGTT GAATCTTTAC TTTTTTTTAA AGCTAAATTT ACATCAACTT 2040

AACAATGGTT GGTTATACTG AAGATGAAAA TATTTAGTAT AACTTAGTGG AGGCGATAAA 2100

GGTGCAATTG AGTCATTCCG TTAAAGTTGC AATTTCTATC TATTTAGCAC TTATCTTTAT 2160

AACGTTCACT TCTTATTTAG TCATTATTTT ATATACGAGT ATGACTGGAC ATGATGTATC 2220

ACATTTCGTG TTAGATAGTC AGCATTCTCA TCATGGATCT CTTACGCAAA AACATTTGAG 2280

TCTTCCTGAA ATCTCATTTA AATAGTTTAT CCTCTGTGTT TCAACATTCA TTTCCCATAT 2340

CGATTCATTT ATCTATCATC TAGACCACTA CATCTTAGAT GATTTTTTTA TTTTCTCATT 2400

TCACTCTTTC TTAAAAGTCG ATATAATGAA TTAAATCATT ATCATACACC GACATATTTT 2460

ATGTTGTTGG TGTTAAGTTT aAAGGGGTGA GATACTTGGC GAATaATCaT TCAGCTTT 2518
(2) INFORMATION FOR SEQ ID NO: 370:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 790 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 370:

ATACTACTGG ACCGTCnACC AATTGACAAA ACCTGAAAAC TGGATTTTTA TTGCAGGATT 60

ACATCGAACC TGGAAGAACA ATCCCCAGGA GTTGATGATA TTCGAGATAT GGCATACAAT 120

CAAGGTAGTT TAGATAAGAC AATTTATGAA ATTTCTAAAC GCACAGTACT ATTTTTAATA 180

CAGAAAGATA TTACGGTATA TAATAAGACG ATTGACTGTT TAAATTATTA TAACTATAGT 240

GACGAAAGAA TAAAGGATGA TTAAATGAAT TCACAAGAAT TATTAGCAAT TGCTGTGGAT 300

GCAATTGACA ATACCCCAGG CGAAGATACG ATTTCTTTAG AAATGAAAGG TATCAGCGAT 360

ATGACAGATT ATTTTGTTGT AACGCACGGA AATAATGAAC GACAAGTTCA AGCGATTGCT 420

AGAGCGGTGA AAGAAGTAGC CAATGAACAA AATATAGAAG TAAAACGTAT GGAAGGATAC 480

AATGAAGCGC GTTGGATATT AATTGACTTA GCTGATGTTG TGGTACATGT TTTCCATAAA 540

GACGAAAGAA ATTATTATAA TATTGAAAAG TTATATCAAG ATGCACCATT AGAATCATAT 600

AGTCAGGTTG CGTATTAATT ATGTCGCAAT ATGCAGAAAT GAGCCTAGTG TACGATCAAT 660

TGACTCAAGA TCAACCATAT GAAAAATGGT TTGAAATTGT AAAAAATCAC TGCAAAGATG 720

AATCAAATAT TTTAGATATT GGATGCGGTA CTGGTAGTTa ACAGTTCAAT TAGAAGCTTT 780



(2) INFORMATION FOR SEQ ID NO: 371: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1823 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 371:

ATAGATGAAG GTGCAAATAT TGAAaTAGGT TATTTACCTG GACGCTTGAA ATGGTTAGTT 60 

GCTGATTTAT TAACTAAACA AGGATTAAAA GTAGTTAACG ACGATATGAC AGGAAGAACG 120 

75 

TTAAAAGATC GTAAATTATT AACAGGTGAC AGTCCTTTAG CTTCAAATGA GTTAGGAAAA 180 

TTAGCAGTTA ATGAAATGTT AAATGCAATA CAAAATAAAT AATTAAATAT TAATTAGAGG 24 0 

20 AGCCTCATAT GTAAATGTAT GAGGGCTCTT TTTTTTGGCA AAATTTAAGT GATACTTGTA 300 

AAATAGAACC TATTATGAGT ATGATTTAAG AAAACGCTTG CAAAACTAAT AACCGCAACT 360 

AGCGATATGG AGGAAACATG ATGTCTTATA GCATTGGAAT TGATTATGGA ACTGCTTCAG 420 

25 GCCGTGTGTT TTTAATTAAT ACAACTAACG GTCAAGTAGT ATCAAAATTT GTGAAACCAT 480 

ATACACATGG TGTCATTGAG AGTGAATTAA ATGGTTTGAA AATACCACAT ACATATGCAC 54 0 

TTCAAAATAG TAATGATTAT tTAGAAATTA TGGAAGAAGG AATATCATAT ATAGTACGTG 600 

30 AATCAAAAAT AGATCCAGAC AATATAGTAG GTATTGGTAT AGACTTTACT TCATCTACTA 660 

TTATTTTTAC TGACGAAAAC CTTAACCCGG TACATAACTT AAAACAATTT AAAAACAATC 7.20 

CACATGCGTA TGTGAAACTT TGGAAACATC ATGGTGCATA TAAAGAGGCA GAGAAATTAT 780 

35 

ATCAAACTGC TATTGAAAAT AATAATAAGT GGTTAGGCCA TTATGGATAT AATGTTAGTA 840 

GTGAATGGAT GATTCCCAAA ATAATGGAGG TCATGAATCG AGCACCAGAA ATTATGGAAA 900 

AAACGGCTTA TATTATGGAA GCGGGCGATT GGATTGTAAA TAAATTAACT AATAAAAATG 960 

40 

TACGCTCGAA TTGTGGATTA GGTTTCAAAG CATTTTGGGA AGAAGAAACA GGGTTTCATT 1020 

ATGATTTATT TGATAAAATA GACCCCAAAT TATCAAAAGT AATTCAAGAT AAAGTATCTG 1080 

45 CACCGGTTGT TAATATTGGT GAAGCAGTAG GGAAACTGGA TGATAAAATG GCACAGAAAT 1140 

TAGGATTATC AAAAGAAACT ATGGTAAGTC CTTTTATTAT TGATGCCCAT GCTAGTTTAT 1200 

TAGGTATTGG GTCTGAAAAA GATAAAGAAA TGACTATGGT GATGGGAACA AGCACATGCC 1260 

50 ATCTTATGTT AAATGAAAAG CAACATCAAG TGCCAGGTAT ATCAGGTTCT GTAAAAGGAG 1320 

CAATTATTCC AGAATTATTT GCTTATGAAG CGGGGCAATC AGCAGTAGGT GATTTGTTTG 1380 

r 
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CTGTATTTGA ATTAATGAAT GAAAAGATAA AACATCAAAT GCCAGGTGAA AGTGGGCTCA 1500 
TTGCTCTTGA TTGGCATAAT GGAAATCGAA GTGTATTAAG TGATAGCAAT TTAACAGGTT 1560 
5 GTATCTTTGG ATTAACTTTA CAAACTAAGC ATGAGGaTAT TTATAGAGCm TATTTaGaAG 1620 

CTACaGCATT TGGTACtAAG ATGATTATGC mACAGTATCA AGATTGGCAT ATGGaAGTAG 1680 
aAAAGGtATT TGCaTGTGGc gGTAtACCtA AAAAGAATGC TGTTATGATG GATATCTATG 1740 

10 

CGAATGTACT GAATAAAAAA CTAATTGTTA TGGATAGTGA GTATGCACCA GCAATAGGCG 1800 
CAGCAATATT AGGTGCAGTC AGT 1823 
(2) INFORMATION FOR SEQ ID NO: 372: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1600 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 372: 

25 ACGATCATCT GCATCAGCGA ATTCCGATGc AaTTcATATG tCCtAATAAC AAAAGAATAG 60 

GTTTTaAAAG ATATGCATAC CGTAATGaTA GATATAGTTT TAAACGTGAC TTCAAGCTAT 120 

ATGaATGTGA TGACTGTTCA TCATGTTCTT TGAGACATCA ATGCATGAAG CCAAATTCGA 180 

30 ' AATCCAATAA GAAAATTATG AAGAATTATA ATTGGGAATA CTTTAAAGCC CAAATTAATC 240 

AAAAGCTTTC TGAACCAGAA ACGAAAAAAT CTATAGTCAA AGAAAAATTG ATGTAGAGCC 300 

TGTTTTTGGA TTTATGAAGG CTATTTTGGG TtTCACTCGA ATGTCAGTTC GGGAATAAAT 360 

35 

AAAGTTAAAC GAGAGCTAGG TTTTGTATTA ATGGCACTTA ATATAAGGAA AATAgcaGCT 420 

CAACGAGCTG TACATTATAA AATACATATC AAAAAAGCTG ATTTCTATCA AATAAATAAT 4 80 

AGAAATCAGC TTTTTTACAT TGCCTAAGAA CTTTAAGGAA CTTAATGTCC CAAGCTCTTT 54 0 

40 

TTTGTTATAT CTAATTCGTA ATTTATGATT GTTTATTCGG TCCTTTGATG TTCACTAAAT 600 

GTGACTTTAA ATCTTGTTCT AATTGTTGTA ATTCTTTTTC AGCTAATTGT CGTTCTTCGC 660 

' 45 GACCGTGTTG TTGAATAATT AATGTTTCTT CAATTGTCTC AATAATGTTA CGCTGTGTAC 720 

GTTTCAATGT ATCAAGATCA ACAATGCCAC GCTCATTTTC TGTTGCAGTT TCAATCGCAT 780 

TTTGTTTCAA CATTTCAGCA TTTGCTGTTA ATAAATCATT AGTTGTATCA GTGACAGCTC 840 

50 GTTGTGCAGC AACTGCATTA CGCTGTCTCA TTAATGTAAG CGCAATGGCC ATTTGATTTT 900 

TCCATAGTGG AATACTTGTC AAAATTGAAC TTTGTATCTT CTCGGCAAGT GCTTGATTAA 960 

AGTCATATAT GCGTTTATCT AGTCTATCTA TAAATTGCTG CATATCTGCA ACTTGTTGAA 1080 

TATCCATTTG ATTAGTGGAt TgtGCGCTTG CTGTTGCAAT TGTGGTAGCT TTTCATTTTC 114 0 

5 TAATTGCAAC TTTTTTTGcT GTGCAGCAAT GATATGCAAT GATAAGTCAT CAAAGTATTG 1200 

TTTGTTTTTA TCATATAGCG TATCTAATAA TTCAATATCT CTTGTTAAAT GTGTTTGATG 1260 

TTTCTGCAGT TGAATCGTTA TGCGATCGAC TTGAGCACTA ACTGATTGCA TTCTTGAAAA 1320 

10 

GATTTCATTG ATAGACGACT TTGCTCTGCT AAAAATTCTT TTTAACATAG ATGGTTTATC 138 0 

AGTATTTAAC TCATTTGGAT TAACTGACTT TAGTTTTGAC ATTAGATCTG ACAAAGTATC 144 0 

TCCAATAGGA CCAACATCTT TACTTTGTAC TTCATCCAAC ATTTGATGTG AAAATTGAGA 1500 

15 

CATTTGTTTC TGkAAATCAG AACCAAACGC TAATAAACCT TCATTGTCTA AAGGGTTAAT 1560 

TTGTTTACTG ATTGTGTCTA CCTGTTTTTG TTGTTCAATT 1600 
(2) INFORMATION FOR SEQ ID NO: 373:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1227 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 373: 

TACAAGAATA AAGACCAACA AGTTGGTGCA ACAATTCTTG 60 

GTTGCTATAT CTGGTGGACG TGATTTCAAA GACGTCGTTA 120 

CCTCACCCTA CTGGTTCATC TTTAAAACCT TTCTTAGCGT 180 

ATGAAATGGG CAACAAACCA TGCGATTCAA GATGAATCTT 240 

ACATTTAGAA ACTATGATAC GAAGAGTCAC GGTACTGTAT 300 

CAAAGTTTCA ATATCCCAGC TTTAAAAGCT TGGCAATCAG 360 

GATGCACCTA AGAAATTCGC TGCCAAACTT GGCTTAAACT 420 

TCTGAAGTAC TTGGTGGTTC TGCTTCAGAA TTCTCACCAA 4 80 

GCTGCAATCG CTAACGGTGG TACTTATAAC AACGCGCATT 540 

CGTGATGGTG AAACAATCGA ATACGATCAT ACTAGCCATA 600 

GCATACATGT TAGCTGAGAT GCTAAAAGGT ACATTTAAAC 660 

CATGGTGTAT CTGGAGTAAA TATGGGTGCT AAGACAGGTA 720 

ACTTATTCAC AATATAATTT ACCTGATAAT GCAGCGAAAG 780 

ATGTTGATAA TGGTAGCTTC 
aTAGTAAAAC TGGTGGTTTA 
ACAGAAACCA AGCAACAGAT 
ATGGACCTGC CATTGAAAAT 
CATATCAAGT TGATGGTTCT 
CTATTTATGA TGCTTTACGA 
TTAAGCAAAA TGCTGGTAAT 
ACGAAGGCGA TATTGGTCCA 
CACAATTAGC ATCAGCATTT 
CAATTCAAAA AGTAGTTACT 
AAGCGATGAG TGATTACACT 
CATATGGTTC TGCATATGGC 
CTGGTACTTA CGGTGCTGAA 
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AAGTTAAACA ATATGGTGAA AACTCATTTG TrGGACATAG CCAACAAGAA TATCCACAGT 


900

TCTTATATGA AAATGTGATG TCAAAAATTT CATCTAGAGA TGGCGAAGAC TTTAAACGTC 960

CTAGCTCAGT AAGTGGTAGT ATCCCATCAA TCAATGTTTC TGGTAGTCAA GATAACAACA 1020

CTACAAATCG TAGTACACAC GGTGGTAGTG ACACATCAGC AAACAGCAGT GGTACTGCAC 1080

AATCAAATAA CAATACTAGA TCTCAACAAT CTAGAAACAG CGGTGGATTA ACAGGTATAT 1140

TCAACTAATC CACTCAACAT AAAATCCTCA GTTATACCAT ATTTATGGTG TAGCCGAGGA 1200

TTTTnTTAGG TTCTTCATCT TTTATGG 1227

(2) INFORMATION FOR SEQ ID NO: 374:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1953 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 374:

CCATATGGtG CAACATTCTT CGTATTTAGT GATTATTTAA AACCAGCGTT ACGTTTATCA 60

TCAATTATGG GATTAAATgC aACGTTCATC TTCACACATG AyTCaATTGC AGTAGGTGAA 120

GATGGTCCTA CTCATGAACC AATTGAGCAA TTAGCTGGAT TAAGAGCCAT TCCAAATATG 180

AATGTTATCC GTCCTGCTGA TGGTAATGAA ACAAGAGTAG CATGGGAAGT TGCCTTAGAA 240

TCTGAATCTA CACCTACTTC ATTAGTATTG ACACGTCAAA ACTTACCGGT ATTAGATGTA 300

CCAGAAGATG TAGTTGAAGA AGGCGTTCGA AAAGGTGCCT ATACAGTTTA TGGCTCTGAA 360

GAGACACCAG AATTCCTATT ATTAGCTTCA GGTTCAGAAG TTAGTCTTGC AGTTGAAGCT 420

GCTAAAGATC TTGAAAAACA AGGTAAATCA GTACGTGTTG TTTCAATGCC TAACTGGAAT 480

GCATTTGAAC AACAATCTGA AGAATATAAA GAATCAGTTA TTCCATCAAG CGTAACAAAA 540

CGTGTTGCGA TTGAAATGGC TTCACCGCTT GGATGGCATA AATATGTAGG TACTGCAGGT 600

AAAGTTATTG CTATTGACGG CTTTGGCGCA AGTGCACCTG GCGATTTAGT AGTTGAAAAA 660

TATGGATTTA CAAAAGAAAA TATCTTAAAC CAAGTTATGA GCTTATAAGA ATAATTTATA 720

AAGCGAGTAT GTTTAGAAGT CTAGGATGCA TAATCTTAGG CTTCTTTTAA AGTGTTGAAA 780

TTTAGAGTAT AGCACTTAAA CTACATCATA AGTGATAAGT TATGAAAGTA TACTATTTCA 840

GATTAATCTT TAAAAGCTCT GTTATAACAG CATGATTTTT GATATTATTT TTAGTATCGA 900

TATTAAAATA CTTGAATAAA CTAGTTCTTG AAATAATGTG ATGAATTTAG TAAAATTCAG 960



EP 0 786 519 A2 






TGCATTAATT TTAGGTTTAA TTGGAGGTTT CCTTTTAGCT AGAAAATATA TGATGGACTA 1080

CTTGAAGAAA AACCCACCAA TCAACGAAGA AATGCTTCGT ATGATGATGA TGCAAATGGG 1140

TCAAAAACCT TCTCAGAAGA AAATTAATCA AATGATGACG ATGATGAATA AAAATATGGA 1200

TCAAAATATG AAGAGTGCGA AAAAGTAAAT TCGCAATTGA TAGAGGCTAT TTTCCAGATA 1260

TGGAAATGGC CTCTTTTTAT AATCAAATTA ATAAGAATAA ATATGTTTAT TAAAATTAAA 1320

GTTAACAAAA TGACGAATAG ACTGAGAAAT GCTATAATTC ATTTTGTATG ATTTACAGAG 1380

AGTTTATTTA ACGAGAAGGT GTCyGCGTGC TCTATTTAAT ATTTTCAATC ATTGTAGCTT 1440

TATTTATGGG AACTATAGTT ATAGTTATTC GTATGAAAGC TCAAAATTAT CCGGTAAAkG 1500

AGAAAAAAAT AGTTTTGCCa CCgTTTTTTA tGGCgACCGG TGCATTGATG TACGTCGTTC 1560

CaTATTTTAG GCTAACAGGA TCGGAAATGC TAGAAGCCTT TATAATTGGT TTGCTTTTTT 1620

CtACAGTTCT AATTTGGACT TCTCGATTTG AAGTCAAAGG TACAGAAATT TATATGAAAC 1680

GATCTAAAGC ATTTCCAGTT ATTTTGATTT H ATT A PTT & T CATTCGTACT GTGATGAAAA 1740

TATTCATTAG TAATGAAATA GATCCTGGAG AATTAGGCGG CATGTTCTTT TTATTAGCAT 1800

TCTGTATGAT TGTTCCTTGG AGAGCAGCAA TGCTATATAA ATACAAAAAA CTAAAGAAAA 1860

CATTAATCAA TTAATTACTT TTaAAACCAC TTGTGATCGA CTTCTAAATC AGTCAATGAG 1920

TGGGTTTAAT nTTACTTGGA AAAGGnGGAA AGG 1953



(2) INFORMATION FOR SEQ ID NO: 375: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 375:

ACATTTGATC AAnTTATCGA CATTAAAGAT GAATTCAnTT GATCGTnTCA ATGATTATCC 60

TGTTGAAGTA GCACGTTTGC TTGATATAGT GGAmATaAAA GTACACGCAT TACATTCAGG 120

TATCcACGTT GATTAAAGAT AAAGGGAAAA TAATTGATAT TCATTTATCT GTAAAAGCCA 180

CTGAAAATAT TGATGGCGAA GTGCTGTTCA AAGCAACACA ACCTTTAGGT AGAACAATGA 240

AGGTTGGTGT TCAAAATAAT GCAATGrCAA TTACTTTAAC GAAACAAAAT CAATGGCTTG 300

ATAGTTTGAA GTTTTTAGTT AAGTGCATTG AAGAAAGTAT GAGAATCAGT GATGAAGCAT 360

AAAGAAGCAT TTAATGGCGT TGTCGTGTTA ACTGCTGCAT TAATTGTCAT TAAAATTCTG 420
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CAACAAGTGT ATCCAATTGT AGCATTAGGA ATGATATTAT CGATGAATGC CATTCCTAGT 54 0 

GCAATTACAC AAAATATAGG GAAGTATCAT AGTGACGAAG CATATGCAAA AGCAGTCGCT 600 

TATATACAAT TAGTTGGTAT ATTATTATTT ATTGCTATTT TTGTGTTTGC GAACAATATT 660

GCACATATGA TGGGTGATGG CCATTTAACA CCAAtGATTC AAGCTGCAAG TTTAAGCTTT 720

ATATTTATAG GTATGCTTGG CGTGTTAAGA GGTTATTATC AATCTGCAAA TAATATGACA 780

GTTCCGGCTA TTTCCCAGGT TATAGAACAA GTTATACGAG TAGGTATTAT CATTGTTACT 840

ATTGTTATTT TTGTAGACAG AGGTTGGACG ATATATGAAG CGGGAACAAT TGCTATTTTA 900

GCATCAACGA TAGGTTTTTT AGGTTCTTCA ATTTATTTAG TAGCGCACCG ACCTTTTAAG 960

TTTAAAATGG TAAATAACAC TGCAAAGATC GTTTGGAAAC AGTTCGCACT TTCGGTTTTG 1020

ATTTTCGCTA TCAGTCAATT AATCGTAATT TTATGGCAAG TGATTGATAG TGTTACTATT 1080

ATTAAGTCAC TTCAAGCGAT ACGCGTGCCA TTCGATGTTG CCATAACTGA AAAAGGAGTC 1140

TATGACCGTG GTGCATCATT TATTCAGATG GGATTGATTG TAACTACAAC ATTTAGTTTT 1200

GCGCTCATTC CTCTGTTAAG TGACGCAATC AAAATGAATA ATCAGGTACT TATGAATCGT 1260

TATGCAAATG CGTCATTAAA GATTACGATT TTAATAAGTA CAGCAGCGGG AATAGGATTA 1320

ATTAATTTAT TGCCTTTAAT GAACGGTGTG TTTTTTAAGA CGAATGATTT AACCTTAACG 1380

TTAAGTGTTT ATATGATTAC GGTCATTTGT GTATCGTTAA TTATGATGGA TATGGCATTA 1440

TTACAAGCGC AACATGCTGT GAGACCTATT TTTGTTGGTA TGACGGCAGG ATTGGTTATT 1500

AAATTTATAC TTAATATCAT TTTGATTCGT TTAAGTGGCA TTATTGGTGC GAGCATTAGT 1560

ACTGTTGTAT CATTAATTAT ATTCGGTACG ATTATCCATA TTGCTGTCAC GAGAAAATAC 1620

CACTTATATG CGATGAGACG ATTTTTTATC AATGTTGTTT TAGGTATGGT ATTTATGTCG 1680

ATTGTTGTTC AATGCGTGTT AAACATAGTG ACAACACACG GTAGAATCAC TGGACTCATT 1740

GAATTATTAT GTGCAGCAGT ATTAGGTATC ATTGCATTGT TTTTCTATAT TTTTAGATTT 1800

AATGTTTTGA CATATAAAGA GTTAACTTAT TTACCATTTG GTTCAAAGTT GTATCAAATT 1860

AAGAAAGGAA GACGTTGATG GCACATACCA TTACGATTGT TGGCTTAGGA AACTATGGCA 1920

TTGATGATTT GCCGCTAGGG ATATATAAAT TTTTAAAGAC ACAAGATAAA GTTTATGCAA 1980

GAACGTTAGA TCATCCAGTT ATAGAATCAT TGCAAGATGA ATTAACATTT CAGAGTTTTG 2040

ACCATGTTTA 1'GAAGCACAT AACCAATTTG AAGATGTCTA TATTGATATT GTGGCGCAAT 2100

TGGTTGAAGC TGCTAATGAA AAAGATATTG TCTATGCGGT TCCGGGTCAT CCTAGAGTTG 2160

CTGAGACAAC TACAGTGAAA TTACTGGCTT TAGCAAAGGA CAATACTGAT ATAGATGTGA 2220

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10 






ATGATGGCTT CACACTGTTA GATGCGACAT CATTACAAGA AGTAACACTT AATGTTAGAA 2340

CGCATACATT GATTACGCAA GTTTATAGTG CAATGGTTGC TGCTAATTTG AAAATCACTT 2400

TAATGGAACG ATATCCTGAT GATTACCCTG TTCAAATTGT CACTGGTGCA CGAAGCGATG 2460

GTGCGGATAA CGTTGTGACA TGCCCATTAT ATGAATTGGA TCATGATGAA AATGCATTCA 2520

ATAATTTGAC GAGTGTATTC GTACCAAAAA TCATAACATC GACATATTTG TATCATGACT 2580

TTGATTTTGC AACGGAAGTG ATTGATACTT TAGTTGATGA AGATAAAGGT TGTCCATGGG 2640

ATAAAGTGCA AACGCaTGmA AcgCTAAAGC GTTATTTACT TGAAGAAACA TTTGAATTGT 2700

TCGAAGCTAT TGACAATGAA GATGATTGGC ATATGATTGA AGAACTAGGA GATATTTTAT 2760

TACAAGTGTT ATTGCATACT AGTATTGGTA AAAAAGAAGG GTATATCGAC ATTAAAGAAG 2820

TGATTACAAG TCTTAATGCT AAAATGATTC GTAGACACCC ACACATATTT GGTGATGCCA 2880

ATGCTGAAAC TATCGATGAC TTAAAAGAAA TTTGGTCTAA GGCGAAAGAT GCTGAAGGTA 2940

AACAGCCAAG AGTTAAATTT GAAAAAGTAT TTGCAGAGCA TTTTTTAAAT TTATATGAGA 3000

AGACGAAGGA TAAGTCATTT GATGAGGCCG CGTTAAAGCA GTGGCTAGAA AAAGGGGAGA 3060

GTAATACATG AGATTAGATA AATATTTAAA AGTATCACGG TTAATAAAGC GACGTACGCT 3120

AGCAAAAGAA GTAAGTGATC AAGGTAGAAT TACAATAAAT GGTAATGTTG CTAAAGCTGG 3180

ATCGGATGTT AAAGTTGAAG ATGTGCTGAC GATTCGCTTT GGTCAAAAAT TAGTAACAGT 3240

TAAAGTAACT GCATTAAATG AACATGCATC TAAAGATAAC GCGAAGGGTA TGTATGAAAT 3300

CATTGAAGAG CGTCGACTTG AAGAAGCGTA AATTGGAGGT GACAAGCAAT GAAAAATAAA 3360

GTAGAACATA TAGAAAATCA GTACACGTCG CAAGAGAACA AGAAAAAACA ACGTCAAAAA 3420

ATGAAAATGC GTGTTGTTCg TAGGCGTATT ACAGTATTTG CGGGCGTATT aCTTGCGATA 3480

ATTGTTGTTT TATCaATCTT GCTTGTTGTC CAAAAACATC GCAATGATAT TGATGCACAG 3540

GAGCGAAAAG CGAAAGAAGC ACAGTTTCAA AAGCAACAAA ATGAAGAAAT TGCGTTAAAA 3600

GAAAAGTTGA ATAATCTGAA TGACAAAGAT TACATTGAAA AAATTGCGCG TGATGATTAT 3660 

TACTTAAGCA ACAAAGGTGA AGTGATTTTT AGGTTGCCAG AAGACAAAGA TTCGTCTAGC 3720 

TCAAAATCTT CGAAAAAATA AATCCAAATT GATTCAAAAT TATCCGAGTA TAGACATTGT 3780

GAAAAAA 3787 
(2) INFORMATION FOR SEQ ID NO: 376: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1644 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION; SEQ ID NO: 376: 

TAAACCATTT CAAACTGAGG AACGCnAAGA CGGsACgTTT CCAGATTTAG AAGTATTTAA 60

AAATGAATGT GATTTAAGCT ATGACATAAC GTCACTTTAT ACTTTTAAGC AACCTGTATC 120

ACCACACCTT GCATTTAAAA TGACAGATCA AATTTTTCTA AATAAGCAGC GTGTATTAGA 180

TAAGGTAAAA GTTTTAGATA AGGAATTTGA TTTTATCTTA ATTGAGGGTG CTGGGGGAAT 240

TGCCGTACCA ATATATGAAG GTACAGATGA TTTCTACATG ACTAAAGATC TAATCAATGA 300

TTGTGCAGAT TGTGTCATCA GTGTGTTGCC ATCAAAATTA GGTGCTATTA GCGATGCCAT 360

TGTTCACCAA GATTATGTTA ATCAGAATGT ATCGGCGAGT AATTTTTTAA TAATGAATCG 420

CTATACAGAC AGCTATATTG AAAAAGACAA TCAAATGACG ATTGGAAAAT TAACAAATAA 480

AACAGTCTAT ACATTTGAAG AACATGCCAC GTATGAAAAT TTCTCAGAAG CATTTTTAAA 540

ACAATTAATA GGAGTTAAAA ATGAATTACA CACAACAACT TAAACAAAAA GACTCAGAAT 600 

ATGTTTGGCA TCCATTTACA CAAATGGGTG TATATAGCAA AGAAGAAGCA ATCATCATTG 660 

AAAAAGGAAA GGGTAGTTAC CTTTACGATA CGAATGGCaA TAAATATTTA GATGGTTATG 720

CATCGTTGTG GGTCAATGTG CATGGTCATA ATAACAAATA CTTnAATAAG GTAATTAAAA 780 

AGCAACTCAA TAAAATTGCC CATTCTACGC TGCTAGGATC ATCAAATATT CCGTCAATAG 840 

AACTTGCGGA AAAATTAATC GAAATCACGC CAAGTAATCT AAGAAAAGTA TTTTATTCTG 900

ATACAGGCAG TGCGTCTGTT GAAATCGCAA TAAAGATGGC ATATCAGTAT TGGAAAAATA 960 

TTGATAGAGA AAAATATGCC AAGAAAAACA AGTTTATAAC GCTAAATCAC GGTTATCATG 1020 


GGGATACGAT TGGTGCGGTA AGTGTTGGTG GTATCAAGAC CTTTCATAAA ATATTTAAAG 1080 

ACTTAATATT TGAGAATATT CAAGTAGAAA GCCCATCTTT CTATCGCAGT AATTACGATA 1140 

CTGAAAATGA AATGATGACA GCTATTTTAA CGAATATAGA GCAAATTCTA ATTGAAAGAA 1200 

ATGATGAAAT CGCAGGGTTT ATATTGGAAC CGTTGATTCA AGGTGCGACA GGCTTGTTTG 1260 

TTCATCCTAA AGGCTTTTTG AAAGAAGTCG AGAAATTGTG CAAAAAATAC GATGTCTTAT 1320 

TAATTTGTGA TGAGGTAGCA GTTGGTTTTG GGAGAACTGG AAAGATGTTT GCATGCAATC 1380

ATGAAGATGT TCAACCGGAT ATTATGTGTT TAGGTAAGGC GATTACTGGT GGCTACTTAC 1440

CACTTGCAGC TACATTGACA TCTAAAAAAA TATACAATGC ATTTTTAAGT GATTCGCATG 1500 

GTGTGAATAC CTTTTTCCAT GGTCaTACAT ACaCCGGAAA TCAAATcGTT TGTaCGGTTG 1560

cATTaGaAAA TATaAGaCTT TATGaAAAAC GTaAGTtnAT TGTgCACATa TTGaAACGaC 1620 

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(2) INFORMATION FOR SEQ ID NO: 377: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 377:

ATGATTTTtA aAAATCATTA AGTTAAGGTr GATACACATC TTGTCATATG ATCAAATGGT 60

TTCGCCAAAA ATCAATAATC AGACAACAAA ATGTGCGAAC TCGATATTTT ACACGACTCT 120

CTTTACCAAT TCTGCCCCGA ATTACACTTA AAACGACTCA ACAGCTTAAC GTTGGCTTGC 180

CACGCmTTAC TTGACTGTAA AACTCTCACT CTTACCGAAC TTGGCCGTAA CCTGCCAACC 240

AAAGCGAGAA CAAAACATAA CATCAAACGA ATCGACCGAT TGTTAGGTAA TCGTCACCTC 300

CACAAAGAGC GACTCGCTGT ATACCGTTGG CATGCTAGCT TTATCTGTTC GGGCAATACG 360

aTGCCCATTG TACTTGTTGA CTGGTCTGat ATCcGTGAGC AAAAACGGCT TATGGTnTTG 420

CGAGCTTCAG T 431

(2) INFORMATION FOR SEQ ID NO: 378:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2006 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 378:

TTTnTATAAC GTATTATAAA TCGTTAAAAA TTTTGGTTGT GTTTGCGTCA CGTAGACAAC 60 

CTCCATAAAG TTACTTAATC ACTCTCATCA TACAATAATT TTTACTCAAA TTGGAAnAAT 120 

TATAAAAATT AAATATAGAT AGGCTTTGAA AATTAGTTTT ATACAAGGTT AGTAGCTGTA 180

ACTGTAAAAT GTTCTTAATA TTGTCAAAAT GTAATGCTTG AAAGCGCTTT TAAAaAATAT 240

TATTATATAC ATGGTTAGAC AAATAGACAA ATCACTATAC AAATATTGGG AGGAATATTT 300

TATGAAATCA ACACCACACA TTAAACCAAT GAATGACGTC GAAATTGCAG AAACGGTTCT 360

ATTGCCAGGA GATCCGTTAA GAGCTAAGTT CATTGCAGAA ACTTATTTGG ATGATGTGGA 420

ACAGTTCAAT ACAGTGCGAA ACATGTTTGG TTTTACCGGA ACATATAAAG GTAAAAAAGT 480

TTCTGTCATG GGTTCAGGTA TGkGTATGCC ATCTATTGGC ATTTACTCTT ATGAATTAAT 540

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CATTGATTTA TATGATGTGA TTaTTkCACA AGGTGCCTCT ACTGATTCAA ATTACGTTCA 660 

ACAATATCAA TTACCAGGTC ATTTTGCGCC AATTGCTTCT TATCAATTAT TAGAAAAAGC 720 

AGTTGAAACA GCACGTGACA AAGGTGTACG TCATCATGTA GGTAATGTGT TATCAAGTGA 780 

TATTTTCTAT AACGCGGATA CAACAGCGAG TGAACGTTGG ATGCGTATGG GTATTTTAGG 840 

TGTAGAAATG GAATCaGCTG CaTTATACAT GaATGCAaTT TACGCTGGTG TCGAAGCATT 900 

AGGTGTGTTC ACAGTGAGCG ATCATTTAAT TCATGAAACG TCAACAAcAC CTGAGGAAAG 960 

GGAACGTGCA TTTaCAGATA TGATTGAAAT TGCACTGTCA TTGGTGTAGA TGATTATGAA 1020 

TGTTGAATAT TCTAAAATAA AGAAAGCAGT ACCTATTTTA TTATTCTTAT TTGTATTCAG 1080 

TTTGGTTATA GACAACTCAT TTAAATTGAT TTCTGTAGCC ATTGCTGATG ACTTAAACAT 1140 

ATCTGTAACG ACAGTAAGTT GGCAAGCGAC ATTAGCCGGT TTAGTAATTG GTATTGGCGC 1200 

TGTAGTATAC GCTTCATTAT CTGATGCCAT TAGTATACGC ACACTATTTA TTTATGGCGT 1260 

GATATTAATC ATTATCGGAT CAATTATTGG TTACATTTTC CAACATCAAT TCCCATTACT 1320 

TTTAGTTGGA CGTATTATTC AAACTGCCGG TTTAGCTGCT GCAGAGACAT TATATGTGAT 1380 

ATATGTTGCA AAGTATCTTT CTAAAGAGGA CCAGAAGACT TACCTTGGCT TAAGTACGAG 1440

CAGTTATTCC TTGTCATTAG TTATCGGTAC ATTATCAGGT GGATTTATTT CTACGTATTT 1500 

ACACTGGACA AATATGTTTT TAATTGCATT AATCGTAGTA TTTACGTTGC CATTCCTATT 1560 

TAAATTATTA CCAAAAGAAA ATAATACGAA TAAAGCTCAT TTAGATTTTG TTGGCTTAAT 1620

TCTAGTGGCA ACTATTGCTA CAACAGTCAT GCTGTTTATT ACGAACTTTA ATTGGTTATA 1680

TATGATTGGT GCCTTAATTG CGATTATCGT TTTTGCGCTA TATATTAAAA ATGCGCAACG 1740

TCCATTAGTA AATAAATCAT TTTTCCAAAA TAAACGTTAT GCTTCATTTT TATTTATAGT 1800

ATTTGTAATG TATGCTATCC AATTGGGTTA TATTTTTACG TTCCCATTCA TAATGGAGCA 1860 

AATTTATCAT CTGcAACTAG ACACAACATC ACTGTTATTA GTACCGGGgT TaTATAGTAG 1920 

CAGTCATTGT TGGtGgCACT AAGTGGgTtA AAATCGGgCG rAATATCTGG AATTCCAAAA 1980 

CCAAGCGGAT TATCACAGCC AATTAA 2006 
(2) INFORMATION FOR SEQ ID NO: 379: 



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50 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4858 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TGGGGAAAAA AAGACCCAGC AGTATTAGAA 
GAATCGTTAA ATATTTCTAT TGAAGAAATG 60

AATCGTATCA TAAAATTAGT CGAAGAATTA CTTGAATTGA CTAAAGGAGA TGTAAATGAC 120

ATTTCTTCTG AAGCACAGAC CGTGCATATT AATGATGAAA TTCGCTCGCG AATACACTCA 180

TTAAAACAAT TGCATCCTGA TTATCAATTT GATACGGATC TGACATCTAA AAATCTAGAA 240

ATTAAAATGA AACCTCATCA ATTCGAACAA TTATTTTTAA TCTTTATTGA TAATGCAATC 300

AAATATGATG TGAAGAATAA GAAAATTAAA GTTAAGACAA GGTTAAAAAA TAAGCAAAAA 360

ATAATTGAAA TTACAGATCA TGGAATTGGT ATTCCAGAGG AAGATCAAGA TTTCATTTTT 420

GATCGCTTTT ATCgAGTGGA TAAATCTCGT TCAAGAAGTC AAGGCGGTAA TGGACTCGGA 480

TTATCTATTG CTCAAAAAAT CATTCAATTA AACGGAGGAT CGATTAAAAT TAAAAGTGAA 540

ATTAACAAAG GAACAACGTT TAAAATCATA TTTTAATCAT GACTGAGACG TCAATCAAAG 600

TCATAGGATC AATTTTTTAA GTACACATTA GCTGTGACTA ATGTATAAGA ACAACTATAA 660

AACAAATAAA CAGTGGTTCT TTATCATTTC TGTTGTACTC CCAAAATTTA CAATAAAATA 720

CATCTATAAA CCTAGAAGAA TCAACGCTTT TGTTGATTCT TCTTTTTAGC AGATAAATAG 780

GTAAATCTAC TTTAACAAAT AACTAAATAG TGATATTATT ACATTGTAAG CGTTTCAACA 840

TTTTTGTGGA GGGTGTAAAA TGACTAACGA AAGAAAAGAA GTTTCAGAGG CTCCTGTAAA 900

CTTCGGTGCG AATTTAGGTC TAATGTTAGA TCTATATGAT GACTTTTTAC AAGATCCATC 960

ATCTGTACCA GAAGATTTAC AAGTCTTATT CAGCACAATT AAGAATGATG ACTCAATTGT 1020

ACCAGCTTTA AAAAGTACAA GTAGTCAAAA TAGCGACGGC ACAATTAAGC GTGTCATGCG 1080

TTTAATTGAT AATATTCGCC AATACGGGCA TCTTAAAGCC GATATTTATC CTGTAAATCC 1140

TCCAAAAAGG AAACATGTAC CTAAATTAGA GATTGAAGAC TTTGATTTAG ATCAACAGAC 1200

TTTGGAAGGT ATATCAGCAG GAATTGTTTC AGATCACTTT GCCGACATTT ATGATAATGC 1260

TTATGAAGCA ATTTTAAGAA TGGAAAAACG TTACAAAGGA CCAATTGCAT TTGAGTATAC 1320

ACATATTAAT AACAATACCG AACGTGGTTG GTTAAAAAGA AGAATTGAAA CGCCATATAA 1380

AGTAACGTTA AATAATAACG AAAAAAGGGC ACTATTCAAA CAATTAGCGT ATGTTGAAGG 1440

GTTTGAAAAA TATCTTCATA AAAACTTCGT TGGTGCAAAG CGTTTTTCAA TTGAAGGGGT 1500

AGACGCACTT GTACCGATGT TACAACGTAC TATTACGATT GCTGCGAAAG AAGGTATTAA 1560

AAATATACAA ATAGGCATGG CTCACCGTGG ACGTTTAAAC GTTTTAACGC ATGTCTTAGA 1620

AAAACCGTAC GAAATGATGA TTTCAGAATT TATGCATACA GATCCAATGA AATTCTTACC 1680

TGAAGATGGT AGCTTGCAGT TAACTGCTGG ATGGACTGGT GATGTGAAAT ATCACCTTGG 1740
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AAGTCACTTG GAAATTGTTG CAGCTGTTGT TGAGGGGCGT ACGAGAGCAG CACAAGATGA 1860 

TACACAACGA GCTGGGGCTC CGACGACTGA TCATCATAAA GCAATGCCAA TTATTATACA 1920 

TGGCGATGCT GCTTATCCTG GTCAAGGAAT TAACTTCGAA ACAATGAACT TAGGAAACTT 1980

GAAAGGCTAT TCTACGGGTG GTTCATTGCA TATTATTACT AACAATAGAA TTGGATTTAC 2040 

TACAGAACCA ATTGATGGAC GTTCAACAAC TTATTCTACA GATGTGGCCA AAGGTTATGA 2100 


TGTGCCAATA TTCCATGTCA ATGCAGATGA CGTTGAAGCT ACTATTGAAG CAATTGATAT 2160 

TGCAATGGAA TTTAGAAAAG AGTTTCATAA AGACGTCGTT ATTGATTTAG TAGGTTATCG 2220 

TCGTTTCGGA CATAACGAAA TGGATGAACC ATCAATTACT AATCCaGTTC CTTATCAGAA 2280

TATTCGCAAA CATGACTCTG TTGAATATGT GTTTGGTAAA AAGCTTGTTA ATGAAGGTGT 2340 

CATTTCAGAA GATGAAATGC ATTCATTTAT AGAACAAGTC CAAAAGGAAC TAAGACAAGC 2400 

TCATGATAAA ATTAATAAAG CTGATAAAAT GGATAATCCA GATATGGAAA AGCCTGCAGA 2460

TCTTGCATTA CCGTTACAAG CAGACGAACA ATCATTTACT TTTGATCACT TGAAAGAAAT 2520 

AAATGATGCA TTGTTAACAT ATCCGGATGG CTTTAACATT TTGAAAAAGT TAAACAAAGT 2580 

TCTTGAGAAG CGTCATGAGC CGTTTAATAA AGAAGATGGT TTAGTTGATT GGGCACAAGC 2640

AGAACAACTT GCATTTGCGA CAATTTTACA AGATGGTACA CCGATTCGCT TAACTGGTCA 2700 

AGATAGTGAA CGTGGTACAT TCAGTCATAG GCATGCCGTG TTACATGATG AGCAAACAGG 2760 


TGAAACATAT ACACCTTTAC ATCATGTTCC TGATCAAAAA GCGACATTTG ATATACACAA 2820 

TTCTCCGCTT TCAGAAGCAG CAGTAGTTGG TTTTGAATAC GGCTATAATG TGGAAAACAA 2880 

AAAAAGCTTC AATATTTGGG AAGCACAATA TGGTGATTTT GCAAATATGT CACAAATGAT 2940

TTTTGACAAC TTCTTATTCA GTTCTCGCTC AAAATGGGGA GAACGTTCAG GATTAACATT 3000

ATTCTTACCT CATGCATATG AGGGTCAAGG GCCTGAACAT TCATCAGCAA GATTAGAGCG 3060 

ATTTTTACAA TTAGCTGCTG AAAATAATTG CACAGTTGTC AACTTATCTA GTTCAAGTAA 3120

TTATTTCCAC TTATTGCGTG CACAAGCGGC TAGTTTAGAT TCTGAACAAA TGCGACCATT 3180 

GGTTGTTATG TCACCAAAAA GCTTACTGAG AAATAAAACA GTTGCAAAAC CAATTGATGA 3240 


ATTTACTTCT GGTGGATTTG AGCCAATTTT GACAGAATCA TATCAAGCGG ATAAGGTTAC 3300 

AAAAGTTATT TTGGCAACTG GTAAAATGTT CATTGATTTA AAAGAAGCAT TAGCTAAAAA 3360 

TCCAGACGAA TCAGTATTAC TCGTTGCGAT TGAAAGATTG TATCCATTCC CAGAGGAAGA 3420 


GATTGAAGCA TTACTAGCAC AATTGCCAAA CCTTGAAGAA GTGTCATGGG TACAAGAAGA 3480 

ACCTAAAAAT CAAGGTGCAT GGTTATATGT CTATCCATAT GTTAAAGTGC TAGTTGCAGA 3540 
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AGAAATTCAT AAACTTGTTC AAAATAAAAT TATAGAAAAT GCATTAAAAA ATAACTAGGG 3660 

GGAAATAAGT CATGCCAGAG GTTAAAGTTC CAGAATTAGC AGAATCTATT ACAGAAGGTA 3720

CCATTGCAGA ATGGTTGAAA AACGTAGGGG ATAGCGTAGA AAAAGGTGAA GCTATTCTTG 3780

AATTAGAAAC TGATAAAGTT AATGTCGAAG TTGTATCTGA AGAAGCAGGT GTATTATCTG 3840

AACAACTTGC AAGTGAAGGC GACACTGTAG AAGTTGGACA rGCAATTGCT ATCATCGGCG 3900

AAGGTAGTGG CAATGCTTCT AAAGAAAATA GTAACGACAA TACTCCACAA CAAAATGAAG 3960

AAACAAATAA TAAAAAAGAA GAAACAACAA ATAATTCGGT AGATAAAGCT GAAGTAAATC 4020

AAGCAAATGA TGACAATCAG CAACGTATTA ATGctACGCC TTCTGCGCGT CGATATGCTC 4080

GTGAAAATGG TGTGAATCTT GCTGAAGTAA GTCCGAAAAC AAATGATGTG GTTCGTAAAG 4140

AAGATATTGA TAAGAAACAA CAGGCACCGG CATCAAGACA AACAACACAA CAAGCATCTG 4200 

CAAAAGAAGA GAAAAAATAC AATCAATATC CTACAAAACC AGTGATTCGT GAAAAAATGT 4260

CACGTAGAAA GAAAACAGCT GCCAAAAAAT TATTAGAGGT ATCTAATAAT ACAGCTATGT 4320 

TAACAACATT TAACGAAtTG AcATGACAAA TGTTATGGAA TTGCGTAAAC GTAAGAAAGA 4380 


ACAATTTATG AAAGATCATG ATGGTACTAA ATTAGGATTT ATGTCATTCT TTACTAAAGC 4440

TTCTGTAGCA GCTTTGAAAA AGTATCCAGA AGTTAATGCA GAAATCGACG GCGACGACAT 4500 

GATTACGAAA CAATATTATG ATATTGGTGT AGCTGTTTCT ACAGATGATG GATTATTAGT 4560 


ACCATTTGTA AGAGATTGTG ATAAAAAGAA TTTTGCAGAA ATCGAAGCAG AAATTGcTAA 4620 

TTTAGCAGTT AAAGCaCGAG AGAAAAAACT TGGCTTAGAT GATATGGTTA ATGGTTCATT 4680 

TACGATTACA AATGGCGGTA TTTTTGGATC AATGATGAGT ACGCCAATTA TCAATGGTAA 4740

TCAAGCTGCA ATCTTAGGCA TGCATTCAAT TATTACAAGA CCAATTGCGA TTGATCAAGA 4800

TACAATCGAA AATCGTCCAA TGATGTATAT TGCATTAAGC TATGATCATA GAATTATT 4858

(2) INFORMATION FOR SEQ ID NO: 380:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2222 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 380:

ATCAGTCACA CGGTAGGCAT ATAAAATGAG TCGTTTCTAC AACATTTTTA AACAGTTCAT 60 

TCAATATTAT TTTTATCTAA TAATnATATT GGGAGGATTA TACCTTTATA CACACCATGC 120 
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ATGCTATTTA GCTAAAGCTA AAAGACCAGA CACTATGCAT ATTTCAACTG GAAATATGTG 240 

GCGATACTTA GTTGCAATTA TTGCCTGTAT GATTTGGTAC CTTAATAAAG CGCATGTAAG 300 

TATCATCGGT ATAATTATTG GTTTAATGAT TTCATATGTT GTAGTTATCA TACGTCCTTT 360 

ACTAAAGGTG AGCAAATAAA TTAAGAAAGA GGTGAGATTA TGGATCACAA ATCCCCGCTC 420 

GTGAGTTGGA ATTTATTCGG TTTTGaTATC GTTTTCAATT TATCAAGTAT ATTGATGATA 480 

CTTGTTACGG cGTTTCTTGT TTTTCTACTT GcTATCATTT GTACGCGTAA TTTGAAAAAA 540 

AGACCAACTG GCAAACAAAA TTTCGTTGAA TGGATTTTTG ATTTCGTGAG GGGAATCATT 600 

GAAGGTAACA TGGCTTGGAA AAAAGGTGGT CAATTCCACT TCTTAGCAGT AACGCTGaTT 660

CTGTACATTT TTATAGCTAA TATGTTAGGT CTTCCGTTTT CTATAGTAAC GAAAGATCAC 720 

ACATTGTGGT GGAAATCACC GACAGCnGAT GCAACAGTGA CTTTAACGTT GTCTACAACG 780 

ATAATACTGT TAACTCACTT TTATGGAATT AAAATGCGTG GTACGAAACA ATATCTTAAA 840

GGTTATGTAC AGCCGTTTTG GCCATTGGCA ATTATTAATG TTTTTGAAGA GTTCACTTCA 900 

ACATTAACGC TTGGTCTGCG TTTGTACGGT AACATATTTG CAGGTGAGAT ACTATTAACA 960 

TTACTTGCTG GCTTATTCTT TAACGAACCA GCATGGGGTT GGATTATTAG TATCCCAGGA 1020

TTAATTGTTT GGCAAGCATT TTCAATATTT GTAGGAACAA TCCAAGCATA TATCTTTATT 1080 

ATGCTTTCGA TGGTTTATAT GTCACATAAA GTGGCAGATG AACACTAAAA ATTTCAATAA 1140

TTATATACAA TCACAGGAGG AAATTAAATT ATGAATTTAA TCGCAGCAGC AATCGCAATT 1200 

GGTTTATCAG CATTAGGAGC AGGTATCGGT AACGGTTTAA TCGTTTCAAG AACAGTTGAA 1260 

GGTGTAGCAC GTCAACCAGA AGCACGTGGT CAATTAATGG GTATCATGTT CATTGGTGTA 1320 

GGTTTAGTTG AGGCATTACC TATCATCGGT GTAGTAATTG CATTCATGAC ATTTGCTGGA 1380 

TAATTAACAG ATAAAAGAGG TCGGGACAAA GCGCATAGGA CATAATTCAT GATGCATATA 1440 

TAGTAATATC TTTGAACTTT ATTAAATAGT TGAGATATGA ACGCACCATG CCTATCGCAT 1500

AAATTCAGTA GGTCCTAACC TCGTCGTTTT TTTCTATATA ACACTAGCGA TTATTTTAAT 1560 

GAAAGGAGTG TCATGAACCC GTGACTGAAA CAGCTAACTT ATTCGTTCTT GGTGCAGCTG 1620 

GAGGCGTTGA GTGGGGTACT GTGATTGTAC AGGTCCTAAC TTTCATCGTG TTAcTTGCGT 1680

TACTTAAAAA GTTCGCATGG GGTCCATTGA AAGATGTAAT GGATAAACGT GAAAGAGATA 1740 

TTAACAGAGA TATCGATGAC GCAGAACAAG CTAAGTTAAA TGCACAGAAA CTTGAAGAAG 1800 


AAAATAAACA AAAACTTAAA GAAACACAAG AAGAAGTTCA AAAGATTTTA GAAGATGCTA 1860

AGGTTCAAGC ACGTCAACAG CAAGAACAAA TTATTCaTGA AGCAAACGTA CGTGCAAACG 1920 

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TTAATAATCa AGTATCTGAA CTATCAGTGT TAATTGCTTC TAAAGTTCTT AGAAAAGAAA 2040 

TTTCTGAACA AGACCAAAAA GCATTGGTTG ACAAGTATCT AAAAGAGGCA GGCGATAAAT 2100 

AATGGTAAAA GTAGCTAACA AGTATgcTAA AGCATTATTT GACGTGTCAT TAGATACAAA 2160

TAATTTAGAG ACTATTAATG AAGAATTAAC AGTTATAAAT GAAGCAGTAA AAGATAAAAT 2220 

TG 2222 



(2) INFORMATION FOR SEQ ID NO: 381: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1143 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 381:

TAACAATTCC TaTATTCATG TTTAATACGA AACACTACAT TTACATTGTA ATTCACTATC 60 

TTTTGAAGTA ATAAAGTGAT TTGTTCAATC GATAGCTCAT TGCTTGTGTC GATTGTAACA 120 

ATTATATGCA AGTTTTCAGG ATTAACACCT AATCTTTGAA TGATTTGTTT AATAGTATAA 180

TAATCCATCC AATAAAAATT ACTTCCTTTA ATATAAATGT TTTTAGGTTG ATACATTTGA 240

CTCCTTTTTG TAGGCTCAAA AGGTATATCA ATCTCGCGCA TACTTGAAGA ACTTTGATTA 300


GTATCATCAA ATAATTCAAT TATATTTTTA TCAATTTCAA CTAATTGGGA ATGGTTAATT 360 

GAATGAGATG TTGGACTATA TCTTTTTCTA ATTAATTTAG GAGTGTTACC ATACGTTTCT 420 

TTAAAGAGGT GTATAAAACG AGAATAATGA TTAAAACCAT GACTACTTGC GATTTCTTCA 480


ATAGGCTTCT TAGrAGTTAA AATATCAATT AAGCAATGCT CCAGTCTAAT ATGATTTAAA 540 

TATTGAATAA AATTACTATA AGGTGTCGCT TTAAACATGT CACTTAAAGC TTTGTTTGTA 600 

ATACTAACTT GATTAATGAC ATCTTTCCTA TTTATCTTTT TATGGTGGTT GTTTGTTAAA 660 


TAATCGTGCA CTTCTTCGGC TACTAAATGA CGACTACCAT CGTATGTATT TAATGACATT 720 

AATTCAACAC ACATGTAATT AATAATCTTA TCATTAGCAT TATAAGACTG TTGTTTAATT 780 

TTGCTGTAGA TTAAATACTT AATCAAGATT CTAAAAGTGC TAGCAACCTC ATTTGTTAAT 840

ATCTTGCCAC GCATCATATA GTTTGCATTC ATAAACTTCA AAAATGTCTT TGCACTTACT 900

TCAACAACAC AACATACACT ATCATCATTA CCATCAATTT GATACAAATC ATTCAACATA 960

ATAATGGTGA CATCATTTCT TTTTACATCA TATTGTTGTA GATTGATTGA AACTACACCT 1020

GTACCTTGTA GCCAATATAT AATTTTtAAG TTCGCTTCCT TCACACTACT CATTTTCATA 1080 
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ATT 1143 
(2) INFORMATION FOR SEQ ID NO: 382:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 506 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 382:

GGACCACCTT TTTTACCTGT ACGACTAAAG CCAACTGTTT TAAAGGCACC AACAGCATGC 60

CACACTTGAA TAATTTCTTG TGATGGTCTA AAGCGAACCG TATAAATTAA TGGATGGAAA 120

TCATCAACAA AAATATAATC GGCCTTACCA AGTAAATATG GCAATCTAAA CTTGTCTCTC 180

CATTTGCGTC TATCCGTAAT ATTCTCCTTA AATACCGTTT TAATATCATA ATCAAAATCT 240

ACTTTTTGGC GTAGTAACTC ATCATATACA TACTTGAAAT TCCCTGATAA ATTCGGACGC 300

GAATCTGATG TGAATAATAT TGTTTTGCTT CTTTTAATAT GTAGTAACTT TGTAATATTA 360

AAAATAGCTT TAAATAAGAA ACTTCTACTT TCAAATGAAG CTTTATGaCC TTGTTTaTGA 420

AGCCAGTGTG cACTTgTCGC AATGaCCCCT GaTTTCyCyT GagGtAAGGk GaTTTCmATA 480

TCAAATACAA ATTCGTTAAC GTCACT 506

(2) INFORMATION FOR SEQ ID NO: 383:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 421 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 383:

AGCATCTGCA AtGAGTcTCT AATACAACGA TACGTTTTGC ATCTTTAGGT ACTTTTACTG 60

TACCATTTTC ATCTTTTACC GAAATAGTAT CTTTAGTTGA TGATTCTTTT TTACTTGAAT 120

TATCCGTATT ACCACAAGCT GCAACTAAAA GTAAGGCAAC TATTAATCCC AATATACTAA 180

AAGTTTTTAG ACCTCTCATC GTTCCACTCC TTAATATGTA TAACTTCATT TATTATTTTA 240

TTGATAACAA TTATCATTGT CAAGTAGCGT TCAATCTTTT TTATATTTCT AAAATGTATG 300

ACTATATATT TCCTCTAATA ATTATGACTA CAATTAGCAC ATTTCCTTAG ACAAAATACT 360

GATAATGTAT CATTGCTATA TCATCTTTGC ATTAATACAA TTGACACCAT TTAGCATGAC 420
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{2) INFORMATION FOR SEQ ID NO: 384: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 862 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 384:

TGCACTTGAT ACTTTAGCAC ATGAAATTGC TAATCGATTA GTTGCAAATG ACAAGAATGA

AGCAACTTTG GAAATGACTA ATAAAATGGC AACGATTCGT TTTACAGAAC CTACGCTGAT

TGCATTAGCA GGGGGTAATG TCAAAGCTTA CACTGAGCAT ATGACTATAT CTCCATATAA

ATTGTATTTG TTAGATAAAG GCGATGTTTT AAAGTTTAGA GAAACAAGTT ATACATCGCG

AGTGTATTTA GCTGTgGGAG GCGGATTTGA ATTAGATGCA TGGTTAGGAT CTAACTCAAC

CGACTTTAAT GTAAAAATTG GTGGTTTTAA AGGTAGAACA TTACAAGATG GCGATGAAAT

AAAGCTTAAG AGAGATTATA CAGCTCGTCA TCATAAGTTA TTTGAAAACC TTGCTCACAC

GAAACAAACA GATTGGGGTA TTGATGGATA CGCCTTGTCA TTTAATTATA TGTCTGATGT

ATTTCATGTC GTTAAAAATA AAGGTACGGA AGATTTTAAA GAAGATGCCA TTCAAAGATT

TGTGAAACAT GATTATAAAG TAACGAGCAA AGCAAATCGC ATGGGGATGA TGCTTGAAGG

TGAAAAAATC AAAGCTTTTT ATGAAGATAT GCCACCGTAT CAGACTGTCA AAAAAGGAAC

GATACAAATT AAGCGTGATG GCACACCTAT TATCCTATTA AATGATCATT ATACGCTAGG

TAGCTACCCG CAAATCGGTA CAATCGCAAG TTATCATTTA ACGAAATTAG CACAAAAACC

GCAAGGATCA CGTTTGAAAT TTCAATTTAT AGATATTTTA ACGGCTGAAA AGAACCTTGT

TAAgTATAGT AACTGGTTAA AC









(2) INFORMATION FOR SEQ ID NO: 385: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1027 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 385: 
AAATGAGTTG ATGATGGCTC CTATTCCATC TATCTCTAGG TATGACAGAA sATAAACGGT 
CTTCTACCTG TCTsACATTA TCTTTCCAAC GATTAATACC TAAGCGTTTA GAAACACGTT 




CAGTTTTACG TCCTACACCT GCTAAACTTT CTAATTCCTT ATGTGTTTGT GGTATTTCTC 240 

CATTAAATTG ATCAATCAAA GATTGACAAA GTTTCTTAAT ATTCTTAGCT TTGTTACGAT 300 

ACAGACCGAT AGAACGAATA TCATTCATAA GTTCTTCATC ACTGACTGCC AAATAATCTT 360 

CAGGCGTTTT GTATTTTTTA AACAGCTCAG TTGTTACTCT ATTTACTAGA ACGTCTGTAC 420 

ATTGCGCTGA CAATAATACA GCAATAGTTA ATTCGAACGG ATTATCATGT TTTAATTCAC 480 

ATTCTGCATC CGGAAACATA TTTGCTATAA CATCAATCAT TTCTAATGCT TTTTTCTTAC 540 

TTACCATCAA GGTTCTCCCC GTTTAACCAA TCAAATTTAG GTACCGTTTT AACTGTGTGC 600 

GTCATTTTCG GTTTATTGAA TTTTTCTCTT ATTTTTCTAG AATCGTCAAT TGTTTTGACA 660 

TTGTTTTTCT TCCAATTAAG TAAAATACGA TCTATATATT TAAAGCTAAG TTTATTCAAA 720 

CTATTCGCCT CGTCTAATGC CGCTTGTATA ATTGCAGTAT CGTGTTTATC AACATCAATC 780

CATTGATTTA ACGTTTCTAT TTCATATGGA GATAACGGCC TTGCAAATGT ATCCTCTAAA 840

ACTCTAAATA ATTGTTTAAA TTTTTCTTTA CTATTTAGCT CTTTCGTTTC CATACTTTGT 900 

TGCTTCAATA TATGACTTAA TTTTTCGAAA AAAGGATCTA GATTCATATA TTCGGkAAAT 960 

CTACCTTCyT CATCyTTTTG aACTkGtAAT tCTAGCAATT CACGTgTATC AAATTTTGGA 1020

TACCATT 1027

(2) INFORMATION FOR SEQ ID NO: 386:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1006 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 386:

AAGGnTTGGA GGGAATTAAT nGATGGCAAA TCCCAGaTAA AGTACACaAC GCATGGAATG 60

TGATGCATTT AATATTTCCA GTAGTTAGTA CGACGTTTGC AAGCTTTAAA TCTATGTATG 120 

GGGGCATACC AAAAGATTTC ATAGACTACT TATTTATTGA TGAAGCAGGA CAAGCAATAC 180 

CTCAAGCAGC TGTGGGAGCA TTATATCGTT CAAAAAAAGT TGTAGCTGTA GGTGATCCGA 240

TTCAAATAGA ACCGGTTGTG ACTTTAGAAA GTCATTTAAT TGATAACATT CGTAAAAATT 300 

ATCATGTTCC GGAATATCTA GTTTCTAAAG AAGCTTCTGT GCAGTCTGTT GCAGACAACG 360 

CCAATCAATA TGGTTTTTGG AAATCTGATG CTACTGATAG TAATCAAAAA ACCTGGATAG 420

GCATACCTTT ATGGGTGCAC AGACGATGTT TAAAACCTAT GTTCACGATa GCTAaCCAAa 480
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GGTATGACGT TAAAGGAAaC GCAGTTCAAA AACAATTTGT GAAAGAGCAT GGTGAAAAaG 
TAGTGGGATT ATTAGCTGAT GATTGGATTG AAGCAATTAA GGAAGGTAAA AATGAACCGA 
GCTCATTTGT AATATCGCCT TTTTCAGCAG TACAGCAACA GATTAAACGT ATGTTAAAGC 
AACAACTACC GACTAGAATT GATATTGAAC GTACAAAAAT TAATCAATGG GTCGATAAAT 
CCATTGGTAC TGTTCATACT TTTCAAGGTA AAGAGGCTCA GAAGGTGTAT TTTGTAATAG 
GTAGTGATAA TACCCAAGAT GG7GCTGTGA ACTGGTCATG CGAAAAACCA AACTTGTTAA 
ACGTTGCAGT GACAAGAGCT AAGAAAGAGT TTaTGTAATT GGCGACATGC AAAGAATACA 
GATGAaACCA TTTTATGAGA CGATTTTTAn AGnAAGAAAT GTAAAA 
(2) INFORMATION FOR SEQ ID NO: 387: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 652 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 387:
CGTTTCATCA AGTnATCGAT CCTAATTTAC AAAATATTCC GGTTAGACTT GAAGAAGGGC 
GTAAAATAAG AAAAGCCTTT AAACCAACTT CAAAAGATAG CGTTATATTA TCAGCAGATT 
ATTCTCAAAT TGAATTGCGT GTATTAGCAC ACATTACACA AGATGAGAGT ATGAAAGAAG 
CATTTATCAA CGGCGATGAT ATTCATACAG CAACTGCTAT GAAAGTATTT GGTGTAGAAG 
CTGATCAAGT CGATAGTTTA ATGCGTCGTC AAGCAAAAGC AGTTAACTTT GGAATTGTTT 
ATGGGATAAG TGATTATGGT TTAAGTCAAA GTTTAGGTAT TACTCGTAAA AAAGCAAAAG 
CATTCATTGA TGATTATTTA GCTAGTTTCC CAGGTGTAAA ACAATATATG TCTGATATTG 
TAAAAGATGC CAAAGCTTTA GGTTACGTGG AAACATTGCT ACATCGTCGA CGCTATATTC 
CTGATATTAC GAGTCGTAAC TTTAATTTAC GCGGCTTTGC TGAACGTACT GCTATGAATA 
CGCCAATACA GGGCAGTGCT GCAGATATCA TTAAACTGGC AATGGTTAAA TTTGCTCAAA 
AAATGAAAGA GACAACATAT CAAGCTAAAC TATTATTACA AGTACACGAT GAATTAATTT 
TT 

(2) INFORMATION FOR SEQ ID NO: 388: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 669 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 388: 

TCATCCAAAT TTTGGAAATT CCACATTTTA CATATCGTAA TTTTTTAGGA AACTAGTGAA 60

TATAACAAAT CCCTCCTCTC ATTTTTAAAA TAGATATATC ACTTCCCCAC TTTTACTTAA 120 

CTAAACTGCA ACGGTTCCTA ATACCAAAAT CCTGCCCTCT ATTTTTATCA ATTCAAGCAT 180 

ACTTATTGAA AAATGTTAAC GTTTTCTTGA TAATCATTGT AAGCGCATTT ATTTTATAAA 240

CTAACGTTTG AAATATACTA CAGGAGTGAC ACGTAATGAC TCAAATTACT GAAAAAGAAT 300

TAAAAAAGAA GTATTTAGAT TTACTATCCC AAAATTTTGA TACTCCAGAA AAACTTGCAA 360 

CTGAAATTAT CAATTTAGAA TCAATTTTAG AATTACCTAA AGGTACGGAA CATTTCGTCA 420 

GCGATTTACA TGGTGAATAT GAAGCTTTCC AACACGTATT ACGCAACGGT TCTGGGAACG 480

TGCGAGCGAA AATCAATGAT ATTTTCAAAG AGAGACTTTC AACTAAGGAG CTTAATGACT 540 

TAACTGCTCT TGTCTACTAT CCAGAAGrCm AATTtAAAAT TgATTAAAAG TGATTTCCAA 600 

AATTgCGGtC mACtTAATGt CyGGtATATC ACaACmATCG aACATTTAAT TGAGTTAATT 660 

AAATATTGT 669 
(2) INFORMATION FOR SEQ ID NO: 389: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1249 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 389:

CACATGGCTG TTAGAGATAT GAATGGCCAT GCGTTACCTT TAACAAAAGA TGGCAATTTT 60 

TATCAAACGA ATGTAGATGC AAATGGTGTT AATCATGGTG GTAGTGAAAT GGTGCAAAAT 120 

AAAACAGGTC ATATGAGTCA ACAArGCCAT ATGAATCAGA ACACACATGA ACCAACAGCC 180 

ACACATGCAA CAAGGTCATA TGCAATCATC AAACCATCAA ATGATGAGTC CAAAAGCAAA 240 

TATGCATTCA TCAAATCATC AAATGAACCA AAGTAACAAA AAAGTTTTAC CAGCTGCTGG 300 

TGAAAGTATG ACATCAAGTA TTCTTACTGC AAGTATTGCC GCACTACTAT TAGTATCTGG 360 

GTTATTCTTA GCATTTAGAC GACGTTCAAC AAATAAATAA ACATAATACG ATTAATAATA 420

GAAAAATCGT GTGATTATCT GaGGGAGCCT AGGACATAAA TCAATGTCCT AGGCtCnCTA 480

AtGTTATATT GGCAGTAGTT GACTGAATGA AATTGCGCTT GTAACAAGCT TTTCCATTTC 540



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CGATTGTCTT ATACGTGTCA GTGTTAATTC AGATATTTCC TGTGGAATAT ACCACTTATT ' 660 

AATCATAATT GGATAAGGTG tTTGTGCGTA CAGTGTTTCA ATAATCAGCC AACAATGTGT 720 

ATCACCaTCA AACACGTGAC TATGATTTTk GAAGTGGGGC GCTTTGGTAA TAGACATTTT 780

TAAATCTGAT TGATATGCAT TGCTATAAAT CGTTTGCTCA ACGAATGTCT TCATGTCGTC 840 

TTCGTTTTGT GTATTCACTT TAAATGTGTC AATGACATTT AACGGTATAA AGGTAAAGCA 900 

AAATGCATCA GCTTGCTTAG AATGATTGTC CTTTTTTTGA TAATAGCGTT CCATTGCAAT 960

GACGGCAGAA GGATGGTTTG CAAACAAATG ATTTGTATAT TCACTTTCTA AATCAACACG 1020 

ATAATTAATT GATGACATAG ATACGCGAGC TAGCAATATT TGATCAAGTG GATGCTTAAA 1080 

TTGATCCATA CTTGAAGCGT GTTGGGCATT TGTTTGTGGA ATAACAAAGT GTCCCTTCCC 1140 

TCTTGTACTC TCTACGATGC CATCTTCGGC TAACAATTTt ATAGCTTGGC GCAAAGTCAT 1200 

ACGACTGGAC ATCAAAGCGC GCACAAAGTT CCTTTTCAGT AGGTAATGC 1249

(2) INFORMATION FOR SEQ ID NO: 390: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1788 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 390:



AATGATGATT CATTCAAAGA AATTAACGCT CGGTATATGC TTGGTATTAC TCATTATATT 60

GATTGTAGGT TATGTCATTA TGACAAAAAC AAATGGTCGA AACGCCCAAA TTAAAGACAC 120

ATTTAATCAA ACATTAAAAT TATATCCAAC CAAAAATCTC GATGATTTTT ACGATAAAGA 180

AGGCTTTCGA GATCAAGAAT TTAAAAAGGG TGATAAAGGT ACTTGGATAG TTAATTCTGA 240

AATGGTAATC GAGCCAAAAG GTAAGGATAT GGAAACGAGA GGAATGGTGC TCTATATCAA 300

TCGCAATACT AGAACCACAA AAGGGTATTA TTTTATAAGT GAAATGACAG ATGACAGTAA 360

CGGCAGACCA AAGGATGATG AAAAAAGGTA TCCGGtAAAA ATGGAACATA ATAAAATCAT 420

ACCAACGAAG CCACTACCGA ATGACAAGTT AAAAAAAGAG ATTGAAAACT TTAAGTTCTT 480

TGTACAATAT GGCAACTTTA AAGATATTAA TGATTATAAA GATGGTGATA TTTCATATAA 540

TCCTAATGTA CCAAGTTATT CGGCAAAATA TCAATTGAAT AATGATGATT ATAATGTCCA 600

ACAGTTaAGa AAAAGATATG ATATTCCAAC CAAACAAGCG CCGAAACTAT TATTGAAAGG 660

CGATGGAGAT TTAAAAGGTT CATCCGTAGT TcTAGAAGTC TTGAATTTAC CTTTGTCGAA 720
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AGGTATGAGT CAAACTGAAT ATCAAATAAA ACCTGGTAAT ATAACAAGTA ACTCTGAAGA 840 

AACAAGTTCG ATATCTAAAG TGAGCTGTGA AATATAGGTA GCCATTTCAA AAAATTTAAA 900 

GGTGAATTTG ATAATGTAGC TCAAGGAGAT TGGGTTAAAA AGGCGAAGAA TGAACTGGAT 960

GATATTAGTA AGAAATTAAA AAATATTCAA AGAACGGAAG TTTAATAGCT TATATGATTC 1020 

TTGGAGCTAA GACAGCATGC GTTCATTCAT GCCATTATTA ATATAAGCAC CGCAACAAAA 1080

AAGCTTCTAA TGTGATACAG GAACCTCATA TTCCGTATCA TGTTAGAAGC TTTTAATGTC 1140

TAAAGAACAT CTACATTTTA TCATATTTTC TGACTTATTA AACTTTTATA TAATTAAATA 1200 

TTTCTTAATT TTCCAAAATA GTGATAAATT TGTGAAATAC ATCACAAATC CCTTTATTTA 1260 

TTTGGAAATT CATGTAATAT TAGACTTGTA AGAAGTTAAT AAATAGAGAG AGACGAGAGA 1320 

GTTTATATAA ATACTATATA AACATTGGAG TGATGATTAT GAGAAAAGAG ATTGAAGCGC 1380 

TTATTTTCTC AGACGTAAtA GCTATGATAT TTACGTGAAC ACTGGTGTAA ATCAAGGATT 1440 

AATTGGTGAC ATCAAAGATG GTTACCTAAC TATTGATTCT ATGCCTTACA TTGATGCTGA 1500 

GCGTTTGTAT CACTTTGCTA TGGAACGTAA ATCGTTAGTC ACTAACTAGT TCTTATTGCC 1560 

AATGATTACT ACCCCTAGTC GGCGGCAATT GAAGTGTGAT TGATGTAACT TGCCCTCGTT 1620

GGTGAGCAAT TGAGGGCAGA CCCCTTTAAT TAAGTAAACC CTAACTCCCC ACAAATCTGG 1680 

AACGATACTA AAAGCCACGT CCTATATTGG ATGTGGCTTT AGTCAkACTT ATATTATTTT 1740

tAAAACGATT ACCTACAAGA TTTACATATA AAATTCTATC ATGnCTGC 1788

(2) INFORMATION FOR SEQ ID NO: 391:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2407 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 391:
GCAAGTTTAG TTAAACAGAT ATTAAAACCA GATCAATATA CAGATGCAAA TTCTCTATTA 60 
GAGATTAGTT TGCAAACAGG TATGTTTATT GCAGGTGGAT TATCAGGAAT ATTATATAAA 120

ATAAATGGAT TCACTCTAAT TATAGCGATG ACTATAATGA TGTTTCTAAT CAGCATTTTT 180
ATGTTATTTA GATTGCATGT AGATAAACCA ACTCATTCAG AGGAAGAATC AACAAATAGC 240

TTATTGCAAG AATATTTATT AGGATGGAAA TTTTTAAAAG ACAACATGAT GATATTTATT 300

TTTGGAGTTA TTTCAATCAT ACCAATGGTG TTTACAATGA TCTTTAACAT ATCATTACCA 360
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TATGGCATTG GAGGATTATG TGCAGGTTTA ATTTCAGCTA TTCTTTCGAA GAAAATCTCA 480 

ACTAAAGTAT TGATATTTTT GTTATATTTC ATATTAGTCA TAAATTCAGC ACTATTTATT 540

TGGATAAACT CAGCATTTTA CTTATTCATA GGATCATTTA TACTAGGCTA CTCAATTTCA 600

TCAATCAGAA TTTATATGAA TACAGCTATA ATGAACACTG TTTCAGATAA ATATGTCGGT 660

CGCTCATTTA CGATATGGAC GTCAATTTCA TTGTTACTGC AAAGTTTAAT TGCTCCATnT 720 

TTAGGAAGAT GGATTAATGA AATTAATGAT AAATTCGGTT TCTATATTAT ACTCATTTTA 780 

TCCTTACTCA TATTTGTaCA CTGCTGCTTG TTAACAAAAC AGACAAAATA AAATATGCAC 840

ATAAAGAAGA GTGACCGTCA CTCTTCTTTA ACAAGCGACC ATTTATCGaT GGGCTTAGTT 900 

CTCTCTGCAC CCACACTTCA CTACTTCACT TTTTCAAATC ATTTTTTATG GTCTTAAATA 960 

AATCAGTGAG ATTTGTTGCT TCGGTAAAGT CTAGAATTAA TATCATTTCT TTAGAACCTG 1020 

GATATGGCGA AACTAATGTA TTATCTTGCA ATTTCTGCTG GGCACTTTTA GTCGCCTTGA 1080

CCAATAATCT ATTATCATAC AAACCACCTA TAACCACGCC ATCATAATAA ATAATATATT 1140

CTCCCATCAT CTTTCTTGTC TTAACCGCGT TTGAATTCAC ATGATTTAAA AATAAATCAT 1200

GTACATCTTT CTTAGTCGCC ATTGTAATCG CTCCTTCAGT TTTATGTTTA ATCACATTCG 1260

TATTAATGAT TCATTTCGTG TTGCTCTTAA TTTTATCTAT AATTATATGC GTAGTTAAAA 1320 

TCAAACCTAT GGAAAAGAAA ATAATGATAA CAATGTTAAG AAATATAGTT ATAAAATTAT 1380 

AGTTTGGAAG GTATGCGAAT AGCAGAAAAA TAGGTATCGC AAAAAATAAA TCCCACCAAC 1440

CTAAACTTTT TAAAGAATGC TTTAAACCTT CCATAATATC ACCTTTATAA ATTTGTCTTT 1500 
GTTATAAGAT AACTAAAAAA TCGCTTTACT GTAAAAGTAG CCAAAGAAAA TTCTGAATCA 1560

TATTCATAAG TAGTGTATCA TTAATAATGA ACAATTTAAT ACTATAATCC TTGATCTTTG 1620 

TATTGATCAA CTTACCACAA CATTTATTTT AGACTACTCT TAGACTTCCC TTTCAAATGG 1680 

TTGCATCTAT TGAAATTCCT TTTGTATAAG TTAGGCTTTT GTGGTAATAT CATCATGCAT 1740 

AAAAAATCGA GATACTAATT ATAAAGAGGG TATAAATATA TTATGAAAGA AAATTTTTGG 1800 

AGTGAATTAC CACGTCCATT TTTTATTTTG GCGCCAATGG AAGACGTTAC AGATATCGTC 1860 

TTTCGACACG TTGTAAGTGA AGCAGCTAGA CCGGATGTGT TTTTCACTGA ATTTACAAAT 1920

ACTGAAAGCT TTTGCCACCC TGAAGGCATA CATAGTGTGC GCGGACGCTT AACTTTTAGT 1980 

GAAGATGAAC AGCCGATGGT CGCTCATATA TGGGGAGATA AGCCAGAACA GTTCCGTGAA 2040

ACGAGTATTC AATTAGCTAA AATGGGCTTT AAAGGCATAG ACTTAAATAT GGGATGTCCT 2100

GTAGCAAATG TTGCTAAAAA GGGTAAGGGT TCCGGCTTAA TCTTAAGACC TGACGTTGCT 2160 
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GG CTACTATG 


AAATCGATGA ATGGAAAGAT TGGTTGAAGC ACGTCTTCGA ACAAGACATT 2280

GCCAATTTAT CTATTCATCT TCGTACACGT AAAGAAATGA GTAAAGTAGA TGCACATTGG 2340

GAATTAATCG AAGCTATTAA AAATTTACGT GACGAAATTG CACCAAATAC ATTGTTAACA 2400

ATTAACG 2407




(2) INFORMATION FOR SEQ ID NO: 392: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2424 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 392:






ATGATGnATG GCnCCGCCAA GAAGTTGAAC CAGTCTATTG GCTAGAATTG CTCAAAAAGC 60

GAGAcaTGTG GTATTCaTAT GTTaGTAgCt ACGCAAAGAC CATCTGTCAA TGTAATTACA 120

GGTTTAATTA AAGCCAACAT ACCAACAAGA ATTGCATTTA TGGTATCATC aAGTGTAGAT 180

TCGAGAACGA TATTAGACAG TGGTGGAGCA GAACGCTTGT TAGGATATGG CGATATGTTA 240

TATCTTGGTA GCGGTATGAA TAAACCGATT AGAGTTCAAG GTACATTTGT TTCTGATGAC 300

GAAATTGATG ATGTTGTTGA TTTTATCAAA CAACAAAGAG AACCGGACTA TCTATTTGAA 360

GAAAAAGAAT TGTTGAAAAA AACACAAACA CAATCACAAG ATGaATTATT TGATGaTGTT 420

TGTGCATTTA TGGTTAATGA AGGACATATT TCAACATCAT TAATCCAAAG ACATTTCCAA 480

ATTGGCTATA ATAGAGCAGC AAGAATTATC GATCAATTAG AGCAACTCGG TTATGTTTCG 540

AGTGCTAATG GTTCAAAACC AAGGGATGTT TATGTTACGG AAGCAGATTT AAATAAAGAA 600

TAATTATGAG TAAGGAGTTT TATATAATGA CACACTATCA TTTTGTCGGA ATTAAAGGTT 660

CTGGCATGAG TTCATTAGCA CAAATCATGC ATGATTTAGG ACATGAAGTT CAAGGATCGG 720

ATATTGAGAA CTACGTATTT ACAGAAGTTG CTCTTAGAAA TAAGGGGATA AAAATATTAC 780

CATTTGATGC TAATAACATA AAAGAAGATA TGGTAGTtAT ACAAGGTAAT GCATTCGCGA 840

GTAGCCaTGA AGAAAtAGTA CGTGCACATC AATTGaAATT AGATGTTGTA AGTTATAATG 900

ArnrrrAGG ACAGATTATT GATCAATATA CTTCAGTAGC TGTAACTGGT GCACATGGTA 960

AAACTTCTAC AACAGGTTTA TTATCACATG TTATGAATGG TGATAAAAAG ACTTCATTTT 1020

TAATTGGTGA TGGCACAGGT ATGGGATTGC CTGAAAGTGA TTATTTCGCT TTTGAGGCAT 1080

GTGAATATAG ACGTCACTTT TTAAGTTATA AACCTGATTA CGCAATTATG ACAAATATTG 1140



EP0 786 519 A2 



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35 



TGGCACATAA TGTTAAAAAA GGTATTATTG CTTGGGGTGA TGATGAACAT CTACGTAAAA 1260

TTGAAGCAGA TGTTCCAATT TATTATTATG GATTTAAAGA TTCGGATGAC ATTTATGCTC 1320

AAAATATTCA AATTACGGAT AAAGGTACTG CTTTTGATGT GTATGTGGAT GGTGAGTTTT 1380

ATGATCACTT CCTGTCTCCA CAATATGGTG ACCATACAGT TTTAAATGCA TTAGCTGTAA 1440

TTGCGATTAG TTATTTAGAG AAGCTAGATG TTACAAATAT TAAAGAAGCA TTAGAAACGT 1500

TTGGTGGTGT TAAACGTCGT TTCAATGAAA CTACAATTGC AAATCAAGTT ATTGTAGATG 1560

ATTATGCACA CCATCCAAGA GAAATTAGTG CTACAATTGA AACAGCACGA AAGAAATATC 1620

CACATAAAGA AGTTGTTGCA GTATTTCAAC CACACACTTT CTCTAGAACA CAGGCATTTT 1680

TAAATGAATT TGCAGAAAGT TTAAGTAAAG CAGATCGTGT ATTCTTATGT GAAATTTTTG 1740

GATCAATTAG AGAAAATACT GGCGCATTAA CGATACAAGA TTTAATTGAT AAAATTGAAG 1800

GTGCATCGTT AATTAATGAA GATTCTATTA ATGTATTAGA ACAATTTGAT AATGCTGTTA 1860

TTTTATTTAT GGGTGCAGGT GATATTCAAA AATTACAAAA TGCATATTTA GATAAATTAG 1920

GCATGAAAAA TGCGTTTTAA TATGTTTATA ATAGAGTAGT ATGGGTATTT ATTATTAATG 1980

ACATTATTAC ATGTTAATTA GGAGGCGTTT TTAATGGATT GGATTTTACC AATTGCTGGA 2040

ATTATCGCTG CGATTGCATT CTTAATTTTA TGTATCGGTA TCGTAGCTGT ATTAAATTCT 2100

GTTAAGaAAA ACTTAGATTA TGTTGCAAAA ACACTTGACG GTGTAGAAGG TCAAGTTCAA 2160

GGTATTACTC GTGAAACAAC AGATTTACTT CATAAAGTAA ACCGTTTAAC TGAGGATATC 2220

CAAGGTAAAG TAGATCGTTT AAACTCAGTT GTAGATGCTG TTAAAGGTAT CGGTGACTCA 2280

GTACAAACGT TAAACAGCTC TGTAGATCGT GTAACAAATT CAATTACACA TAATATTTCT 2340

CAAAATGAAG ATAAAATCTC ACAAGTTGTT CAATGGTCAA ATGTTGCAAT GGAAATTGCA 2400

GACAAATGGC AAAATAGACA CTAC 2424



(2) INFORMATION FOR SEQ ID NO: 393: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 738 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 393:
CnATAATATT CnTCAAnCCT TTAAATAGAG GAATAGACTG CCGACAGAGT CCGAGACTTT 60

GTGGGTAGTT TTTTTAGTTT TGATAACGGA AGTTAGAGGC TCTCTGTCAA ATTGGGCAGA 120
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AGTGTAAAGT TTTCAACATA ATACTATTAG TTCGGTCATG TATCGGACTG 


ATGGAAAAGC 240

GTTTCACTTT TAATGACTCA TTAAGAACGG CCTGAAAATG TTTGGCGTAT TAAGTGCAAT 300

GATAGTTTTG ACATTTAGTT TCTAATTGGT CATTACTGCC GAGCAAATCT AGTAGAGTAA 360

TCATGTAAAT CTTTAATGTG CCATTTGATT CACTAGCGGT GTTAATAACT ACGGAAATTG 420

CATTTCCGAC TGAAATTTTT GAAAAATATC AACGTACGCT ACAAATAAAA TTTTAAACTG 480

TTATAAATGT GTCTCAATTT CATATGTTCA TCGACGATAT GAAGCGTATT ATGGTAAAAT 540

GAAGAAATAA TAAACTTGTT AATAAATAAA ACATCACGAT TTGACTAAAG CACTTTATTA 600

TTGTGTAGAT AATAGTTTTT TAACGAAATA AAAATGGCGA CTGGTTTTAA TAAATCAGCT 660

AATGAATCAC TACACCTATA AGTATGAATA TAGTGATTAG AATGCTTTGT ATAGTTGGAT 720

TTTGCAAAAT TGATGTTA 738


(2) INFORMATION FOR SEQ ID NO: 394: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1270 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 394:






AAAAGTTGTA ATTAAAAGTG GGATTTTACT TAAGnnAGAA GGAAACTATT TATATGACTA 60

ATAAAAGAGn AGATGTCCGC AATATAGCAA TTATTGCTCA CGTTGACCAT GGTAAAACAA 120

CTTTAGTAGA TGAGTTGTTA AAACAATCTG GTATATTCAG AGAAAATGAA CATGTCGATG 180

AACGTGCAAT GGACTCTAAC GATATCGAAA GAGAGCGTGG AATTACGATt CTAGCCAAAA 240

ATACGGCTGT TGATTATAAA GGTACACGTA TTAATATTTT GGATACACCA GGACATGCAG 300

ACTTTGGTGG AGAAGTAGAA CGTATTATGA AAATGGTTGA TGGGGTTGTC TTAGTAGTAG 360

ATGCGTATGA AGGTACAATG CCTCAAACAC GTTTTGTACT TAAAAAAGCG CTAGAACAAA 420

ACCTGAAACC TGTTGTTGTT GTTAATAAAA TTGATAAACC ATCAGCACGT CCAGAGGGTG 480

TTGTAGATGA AGTTTTAGAT TTATTTATTG AATTAGAAGC AAACGnTGAA CAATTAGAAT 540

TCCCTGTTGT TTATGCTTCA GCAGTAAATG GTACAGCTAG CTTAGATCCT GAAAAGCAAG 600

ATGATAATTT ACAATCATTA TATGAAACAA TTATTGATTc ATGTACCAGC TCCAATTGAT 660

AACAGTGATG AGCCCATTAC AATTTCCAAG TAGCATTGTT GGACTACAAT GATTATGTTG 720

GACGTATTGG TATTGGTCGT GTATTCAGAG GTAAAATGCG TGTCGGAGAT AATGTATCAC 780
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GATTAAAACG TTTAGAAATT GAAGAAGCAC AAGCTGGAGA TTTAATTGCT GTTTCAGGTA 900 

TGGAAGACAT TAATGTTGGT GAAaCTGTAA CACCACATGA CCATCAAGAA GCATTGCCAG 960 

TTCTACGTAT TGATGAGCCT ACTCTTGAAA TGACATTTAA AGTTAACAAT TCTCCATTTG 1020 

CTGGCCGTGA AGGTGACTTT GTAACAGCAC GTCAAATTCA AGAACGTTTA AATCAACAAT 1080 

TAGAAACAGA TGTATCTTTG AAAGTTTCTA ACACAGATTC TCCAGATACA TGGGTAGTTG 1140 

CTGGTCGCGG TGAATTGCAT TTATCAATCC TTATTGAAAA TATGCGTCGT GAAGGTTATG 1200 

AATTAGAAGT TTCAAAACCA CAAGTAATTA TTAAAGAAAT AGATGGTGTA ATGTGTGAAC 1260 

CATTTGAACG 1270 
(2) INFORMATION FOR SEQ ID NO: 395:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1365 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 395:

AAGACCAGGA GAACAnvGTAA AACAATATAC AGTTGaAATC GCTCGTAAAT TAATGGAATT 60 

TGATATAAAA TGCTCGTGAT TGCTTGTAAA TACGCAACTG CTGTnGCTTT AGAATATTTA 120 

CAAAAGACCT TATCAATCCC CAGTGATTGG CGTAATTGAc CAGGTGCTAG AaCAGCAATA 180

ATGACTACTA GAAATCAAAA TGTATTAGTA CTAGGAACGG AAGGCACAAT TAAATCTGAA 240 

GCATATCGTA CGCATATTAA ACGTATCAAT CCACATGTAG AGGTACATGg CGTTGCCTGT 300 

CCAGGTTTTG TGCCACTTGT AGAACAAATG AGATATAGTG ATCCAACAAT TACAAGCATT 360 

GTCATTCATC AAACACTGAA ACGTTGGCGT AATAGTGAGT CTGATACTGT CATTTTAGGA 420 

TGTACCCACT ATCCATTGCT CTATAAACCT ATCTATGATT ATTTTGGTGG TAAAAAGACA 480

GTGATTTCGT CTGGATTAGA AACGGCTCGT GAAGTTAGTG CATTGCTAAC ATTTAGTAAT 540 

GAACATGCAA GTTATACTGA ACATCCAGAT CATCGATTTT TTGCAACAGG TGATCCTACT 600 

CACATTACTA ACATTATCAA AGAGTGGTTA AATTTATCTG TCAATGTGGA ACGTATATCA 660 

GTGAATGACT AGGAGGATTT TTAATGAAAG AGATTGTTAT TGCATCGAAT AATCAAGGGA 720 

AAATAAATGA CTTTAAAGTA ATATTTCCAG ATTACCACGT AATAGGTATT TCAGAACTAA 780 

TACCAGATTT TGATGTGGAA GAAACAGGAT CAACATTTGA AGAAAATGCT ATATTAAAAT 840

CAGAAGCTGC TGCAAAAGCA TTGAATAAAA CGGTCATAGC TGATGACAGT GGACTAGAAG 900 
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40 



GCGATGAAGC AAATATTGAA AAATTATTAA ATAAGCTTGG TAATACAACT GATCGTCGTG 1020 

CGCAATTtGT TTGTGTCATA AGTATGAGTG GCCCTGATAT GGAAACAAAA GTATTTAAAG 1080

GTACTGTTTC AGGTGAAATT GCAGATGGAA AATATGGCGA AAATGGTTTC GGATATGATC 1140 

CGATATTTTA TGTACCGAAA TTAGATAAAA CCATGGCTCA ACTTTCAAAA GAACAAAAAG 1200 

GGCAAATTAG TCATAGACGA AATGCGATTA ATTTACTTCA AGCTTTTCTT GAAGGTGATA 1260

AAAATGTCTA AATGGATTAT TGTGAGTGAT AACCATACTG AATCAGGCGT TTTATATCAA 1320 

ATTTATGAAA TGCACCCAGA TGCAGATGTA TATTTACATT TAGGA 1365 
(2) INFORMATION FOR SEQ ID NO: 396: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1383 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 396: 

AATTCCTGGT GCAATAATAA ATAGGATGAA AAAGATTCGG AAAATATGAT AACTCGTAAT 60

CATAGCAACA TCGGCACCAG TAGCTAATGC AACTAAAACT ATCTGATTAA CCCCTCCTGG 120 

TGCTGCACCA AGaAACAATT CATTAATAGG ATTATTATCA AAGAAATGTA TGATATAAAC 180 

CATGATTAGC GCACCAATTA TCAACATAAT ATTTTGAATT GTAATTGCGA TTGCTAGTCT 240

ACCTTTTAAA TCTGACAATA AATGCGCAAT TTGAACTCCA ATTCTAATCA TATATATTAG 300 

TTGTGCCATG TTCAACAACC AATGATCTAG TGTAAATGTT AAACCTGTAG AAAAATTCCA 360 

AACAATTAAT ACAATGAGTG GTGCTAATAA TTGAAATGTT GGAAACTTTA TTTTAGACAT 420

AATTAGATAA ACTATAAAGA TAGCTATCGC TAAAATAACT ATTTGCCCTA TGTTTAATAC 480 

TTGTGATAAA GGCAAGACTT TTGTTAACTT TCCATTCGCA TGCATGTTAC CATCATGAAA 540 

AAAATATGAA ATGAACGGTA CTAAAACAAC AACAAATATA ATTCGTGATG TTTGCGTTAA 600 

GCTAACAACT AACAAATTAG CACGTTTGTC TTGTTCAGCC ATGACCAGCA TTTGTGTTAG 660

TGCTCCTGGT ATAACACTTA AAATAGCTGT TTCTGTATTA ATACGTGCAA TTTTTTTAAA 720 

AACAAATGCC ATTACTATTG CAATTAATAA TATCGAAATA GATACAACAA TAATCGAAAG 780 

CCAATTGTTT TTAATATCCA TAACGACATT TTTCGTAAAC GTTGATCCGA TTTGCACACC 840

TAATAGTACA ATACCTAATT CACTAAGTAA GAATGGCCAT TTAATATCAA GTTTGAAAAC 900 

TTTTACACAA ATGATTGATG CGATAATAGG ACCAAACATA AATGGAAGTA ATACGTGCGA 960
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TATCATTGCC ATGTTTTCCA CTTCTTTCAA TAAAAAATAA AATGACTAAA TTGCTGCTTG 1080 

AGCTTCACGT TTGTTAAGAT AACAATATCC GCTAGCAGTT tTGACTACAA AGCATATATG 1140

GaCTTTCACT ATCAAGTCGC CGCCCATGCC TTATATACAT TTAAAAnGAG CCTGAACAAA 1200 

GTTCAGGCTC TCAATTTGTC CGTATATTTA TTTTACAATA CGACTTAAAG CCGTATCAAA 1260 

TGCTTGAATC GTTTTTCAAT ATCTTCTTTC GTGTGTGCCG TAGATAAGAA TGTACCTTCA 1320 

AATTGAGATG GnGGnAAAAA CACACCTCTT TGnCATTCTC GGTACATTTC TGCAATAATT 1380

TCC 1383 
(2) INFORMATION FOR SEQ ID NO: 397: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 415 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 397:

TCCACTAAnA TGATTACAAT TGCATTAGTT TGGTGGAGTG CATTTACAAT CTTAACGGGT 60

ATGATTAAGA ACCACGGTTT AATTaTTTAG TGAGATTCTT ATTTGGTGTA GGTGAGGCGC 120 

CAATGTACCC TTCTAATGCT GTGTTTAATT CATTTTGGTT CTCTAAAAAT GAAAAAGGTA 180 

GAGCATCAAG TGCATTATTA GCAGGATCAT ATTTCGGACC TGTATTAGCA CCAATAGTTA 240

CAATTGCTAT TGTTAACGCA TTTAACTGGC AAGCAGTATT TTACATTTTT GGTGCAGTAG 300 

GTATTTTAAT GGCkGTATtA TGGGCGATTA TTGCCAAAGA CTTACCTGaG CrACATAGwa 360 

TGGTTAATGA AGCGGAGAAA CGTTTCATTA TGGAAAATCG TGATATCGTA GCTAC 415

(2) INFORMATION FOR SEQ ID NO: 398:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1141 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 398:

TTTTAGaTaA aTyCAATTnT CyATaCTaAA TgATTnTCTT ATTaCGTCAA TTcGCCTTTT 60 

aTTTTATCGT AATCTTTCCa CTGCAAAGCT AAAGCTTCTC CTATTCTAAG ACCAGAATAA 120 

AATAACAGTC TAGTTAGCTG ACGAGAAGTA TCATTTGTGA TTTGTTCTAC TTTTTCATCA 180 



AATGTGGGGT CGTATAAGAG CTTGTAATGC TTTTTGGCGT AATTGATAAC TGCTTTAAAA 300

CCTGCCCACA CAGATCGTGC ATAGTCAACA GAAAGACCTG CATCGTTTAA CAAATAATTC 360

CTGAAAGCAG TACATTGCGT AGTAGTGATT TTGCCAATAG GGATATTTCC GAACCTTTCT 420

TTTATGTGAG TATTATATTC TGTAGTTCGC TTTTCTATTG AGCGTGCAGA AAGATTTTCA 480

TTTTTTAAAC GATCAAAAAA TATATATTCA AAGGGTTGAT TGTCCGAGTA TCCATATTTA 540

ACATTTTGTA TAAATTCGCT TTCAGCTAGT TTGGCATCTT TCTTACGTTC AAACCCACGC 600

TTCATTTTTC GTTTGTTATT ACCGTATACA TCTTTATATC TAATGGAAAA ATACCATTTA 660

CCTGTATTAT CATCCTTATA TACTGGCATT TTGCTTCTCC CTCCTCAAAA TTGGCAAAAA 720

ATAATAAGGG TAGGCGGGCT ACCCGAAATT TAGTACTAGG TACTAAATGT GATATAATAA 780

AATAAAAAGT AGGTGATGTT ATGACATTTA AAAACAATCA TAATTTCAAT GAATTAGTTT 840

TAArGAATGA AGACATTAGA A'F"i~i " rAAAJVA ATGTCTTAGA AGATGCAGTC AGTGTTTATG 900

ATGAATATTC GGTATGTAAT GAAGAATCCG ATTTTGCTTA CTGTTTATTA AGAGACTTAT 960

ATACATTAGA CAGCTTAGCT ATTTCGTCAA ATAATGTTTG AATTATCGAA TTGTACTCTT 1020

CGATTTTAAT ACCATGCATA ATAGAGTTTC TGTGTTCAAT AGCAGCTTTG ACTGAATGTk 1080

TTAAATGTTC TTCTATTAAA TCGTTGTTTk CCAtTTCGtk TAaAAATGTt CyTATATTCC 1140

T 1141
(2) INFORMATION FOR SEQ ID NO: 399:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 706 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 399: 

ACTGTGGTAG GTTTTTTATT TTGAAGTATT AATCATAACA GACTAATAAT CATGAGGTAA 60

CTAATAACAC ATATTTAACT TGTATTCTTA AACTGGTATA ATAAATTTAT GTTGAAATGA 120

ATATTGTATG ACAGGGTATT CACTTTTATT AAAAGGTAAA ATTAAATAAA GGTTTTATAG 180

AACGTATTTA AATATATGAG GAGTAAACAA ATGGCTGATA GAACGAATAA AGAAATTAAA 240

ACAGGACGCT TTATTGCAAC TGCATCAATC GTATTCTCAA TATTATTGAT TATTCATTAC 300

TTTGTTTCGT TGGATAATGC GACTGCCAAA GCATTACTTA ATTTAACGAA TCAAAACACT 360

TCAGATAAAG CGATTGATTA CATTTTAAAC AGCTTTAGAT TCACTGGTAT TATGTATATT 420
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ATGTTTGCAG TTTATGTATC .AAATAGTTTG TTTACGTTGA TTAATTTATC AATCACAATT 540 

CAAGCAATAA AAGCTGCACA CGGTGCGTAC TTAACATTGC CAATTTTAAT TGTTATTATA 600 

GGTTCGGTTG CATTAGCGAT TTATATGCTT GTTGTTTCTA TCAAACGTAA AAGTACATTT 660 

AATCGCTAGA AAATTGATTT TAACAATAAA AATATGAAAA AAAAnn 706 



(2) INFORMATION FOR SEQ ID NO: 400: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1187 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 400:

ACACAATCTG AAGATTCACG TTG7GGTGCT GGACATGATC GAAAAATTAG AGCTGAACAA 60

ATGAnAGAAA TCAGTGATTT TGTTAAAAAG AAAAATATCC CTAAAGATGA AACGGTATAT 120 
ATAGGTGGCG ACCTTAATGT CAATAAAGGC ACTCCAGAGT TCAAAGATAT GCTTAArAAC 180 

TTGAATGTAA ATGATGTTCT ATATGCAGGT CATAATAGCA CATGGGACCC TCAATCAAAT 240

TCAATTGCGA AATATAATTA CCCTAATGGT AAACCAGAAC ATTTAGACTA TATATTTACA 300 
GATAAAGATC ATAAACAACC AAAACAATTA GTCAATGAAG TTGTGACTGA AAAACCTAAG 360 

CCATGGGATG TATATGCGTT CCCATATTAy aCGTTTACAA TGATTTTTCA GATCATTACC 420 
CAATCAAAGC CTATAGTAAA TAGTGCTCAA CTAACTAATA ACTTGCTtCG TTCTAAAAGG 480

ACGAAGCGAG TTATATTGTT AAAATTTGAA TTGACTTACA TTTTAATAAA ATCATCTTAA 540 

CAACTTTAAT TTTTCaTTAA TACAaGTCTT TACTCTACAC TCAAACnAGA TTCATACACT 600 
GCACGTCATA ATAAATCTAT CTATTCAAAT ATAAATAAAA GTTACCTACT ACATTCTATG 660 
TAGCAGGCAA CTTTTATTAC TTATTTCTTT TCATTATCAT TAAGTACTTT TACAAACTTC 720

ACATTATGTG TCTTCCAATC AACTTCATAT AATGCTGATA ATTTTTCTTC TTTTTTATCT 780
ACATGGTTTT CACCAGACCA ATAGCCCCAG AAACCATGGC GATTCCAATC TATTTTAAAC 840

TCATCCATTG ATCTTTTATA ATGAACAACA AATTGTGATT TACCTTTGTC TTTTTTATCA 900 
TGTGACATAA CAGCTAAAAA TTCTGGATTA AACCCTTCAG ACACAGTTAC AGGCATTTTG 960 
TCTTTAGGTG TGAAATTATC TTTCGCCCAT AAATTTCCAT TTCGTGTTAA AGAAAAAATT 1020 

TCACTTTTAG TTCTATTATC ACTATCATTA GTTAATTGTC TCGTATGGTC ATGTCCCATA 1080 
TTATTTATCA AATGTQCTTC TACTTTCCAA CCTACACCTT TATGTGACGT AGATTGATCA 1140 

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(2) INFORMATION FOR SEQ ID NO: 401: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 847 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 401:



CCAGAATTAT TTTTTCAAAA AGGACAATTT AACAATGTCG ATAACGTTAT CATAAGCAAA
CCGATGAAAG GGACAATGCC TAGAGGTAAA ACGGAaGCTG AAGATCAACA GTATTATAAA
ACATTGCAAA CTTCTTCGAA AGATCGTGCA GAAAATGTCA TGATTGTTGA TTTACTAAGA
AACGATATAG GGAGAATATC ACAGAGTGGC TCAATTAAGG TGTATAAACT ATTTTTTATT
GAGGCATATA AAACTGTATT TCAAATGACT TCGATGGTAA GTGGAACTTT AAAAAATAAT
ACAGACTTAA CTCAAATTTT AACATCGTTA TTTCCTTGTG GTTCGATTAC AGGTGCACCG
AAACTGAATA CAATGAAATA TATTAAACAA TTAGAAAGTT CACCTCGTGG TATATACTGC
GGACAATTGG ACTATTACTT CCAACTGAAG ATGATAAAAT GATTTTTAAT ATTCCGATTC
GCACTATTGA GTATAAATAT GGACAAGCGA TTTATGGAGT CGGAGCAGGT ATTACAATTG
ATTCTAAGCC AAAAGATGAA GTGAATGAAT TTTACGCAAA AACCAAGATT TTGGAGATGT
TATAATGCAA TTATTTGAAA CAATGAAAAT TGATAATGGA CATATCCCTA GACTTACTTA
TCATACTAAT CGCATAAAAT GTTCTTCTGa GCGATTAAAC TTTAAATTTG ATGAACATGC
ATGGCGAAAT GAATTAAACG ATGTAACAAC AAAGTATCAC AGTGGTCAAT ATAGACTTAA
AATCGTATTA AATGCTGAAA GCAAATTTGA AACGATAGTG TCACCTTTAC CTGAGAAAAG
TAGTTTT

(2) INFORMATION FOR SEQ ID NO: 402: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 740 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 402: 
TGAAGATGAA GCAGAAGCAG AAGACAATCT GCTACGAGTA CAATCGAAGA AAGAAGAAGT 
GTATCGTCGA TTACTTGCTT CTAACTTAAC AAGCGTTCCT GAAAGtTTAT CATCATGAAA 



GTTAAACAGT TAAAAGATAA AGTGTCTAAA ATTGTGATTC AAATGAATAC ATTTGAAGAT 240 

GAAGCAAATG ATGTTCTTGT TAATGCTGTT TATGCAGAGA AATTAATTCA ATATGGAAAT 300 

AGATATCGTA AGGACTATAG CAATGTTGAT AAGAGCTTAA ATGAAGCTGA ACGATTATTT 360 

AAAAATAATC GCTATAAGCg TGCGATTGAA ATTGCAGAGC AAGCTCTTGA AAGTGTTGAG 420 

CCAGGTGTTA CTAAACATAT TGAAGAAGAA GTTATTAAGC AATAGAAACT AGTATGTAGT 480 

TATACTTAAA TAATATGAGC ACTCTGTCAA ATTGGACTGA TGAGTTTAAT AATTGAAGTT 540 

AGCCAACGAT ACGTTGTCTA GCTTCTTTTT TATATGGATA AATGaAAGGG ACAATAAATA 600 

TAAATAGCAA TTGTTTAAAG ATAAACGTAA TCAAATGTGT TGTTTTAATT AATATAAGTA 660 

GTGAAAAAAG CATAATCACA CAGCTGTTTA AATAGAGTGA AATAGTCTAA TTCTTATTTA 720 

ATAAGTAGAA ATAAGATTAT 740 
(2) INFORMATION FOR SEQ ID NO: 403:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 630 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 403:

ATGCCAATAA TTAAACCATG TAATAAATAT ACGTATAGCG TACGACTACC AATATAAGTA 60

TATAATTTTT TCTTTGTTGA CATTAAATTT AGAAACGCAG TCATTGCGAT TAATATAATT 120 

CCATATAATA TAAGTCGTTT AAAAGGACTG AATATACTCT GTCCTTCATT TTCAAGTGAA 180 

GTATATGGTG AACTTCCCAA TAACCAATCT GCATTGATAG GATGAATCAC GTAAACGATA 240

AAAAACAAAA TAAAGGTAAT GATAGATACT GGTATTAGTT TTTTATTTTT AAAAATAGCC 300 

GTATGTTTTT TGGTGAAAAT GTAACCTAGA TAAAATATTG GGAAAAATAC GATTGTCCTT 360

GAAATGCTTA AGTAGCTATC GATGTTATCT GAAAAACCTG CTCCAATAGA TATAATAATT 420 

GAAACTGATA GCACTTTATA TGGATTAAAT CTTCTAACTA TTACTAAAAT GACATGAAAG 480

AAAAATAGCG TGATCAAAAA CCATAACGCA AATACTGGGT TAAAAGGATC AAGTTGTAAT 540 

TCGTCACTTT TACCTGTTAA GAAATAATAA ATTGAAAAGA ATGCAAAAAA TATCATATAA 600 

GGTACTATCA AACGTTTTGA AATTTTTTCT 630

(2) INFORMATION FOR SEQ ID NO: 404: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6254 base pairs 
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10 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 404: 
ATTTAAACGT TCAGTTTCTA AAAGTGGAAC AATCCAAGAA GTACGTAAAC GTGAATTTTA 60 

CGAAAAACCA AGCGTAAAAC GTAAAAAGAA ATCAGAAGCT GCACGTAAAC GTAAATTCAA 120 

ATAATTAATA CCTCTGTTGA CTCCCTCAAC ACGAATATTA ATTATATAAA ACAAACATCA 180 

CAAGTTAGTG TCTGACACTA ATATGTGATG TTTTTTTGTT GTCAATTTTT AATTAAAAAA 240 

AGTTATATAG TTTATAAATA ATCAAATTGA TATTCTATAG GTTCTTATAA CTATAAAGTA 300

TATTCAATTT CATGTATAAT TAATGTGAGG GCGAGGTGAA ATTGTGAGTT ATAATAATTT 360

TTTACAAATG ACAACTATCT TGGAATCAAC GGCTGGAGAT ACTTGGGTTG AACAAGTTAG 420 

CAATATAATT GTTCAACCTA TTTTTACGTT AATATTAACC TGTTTGACAT TCTTAGGATT 480

TGTATATCAA CTTTACTCTA AAAAAATCAA TGCAGCTGGT ATTATCGCCA CATTATCATT 540

ACTTATTTTA TTTTTGGGAT TTCTAATCCA AGGAAATGTC AATATGCATT CTATCTTAAT 600 

ATTCTCAATT GGCGTTATAT TAGTTGTAAT TGAATTATTT GTAGTTGGTG CAGTAATTGG 660

TATTATTGGC ATGATACTGA TAACTATAAG CATAACAACG CTCGGTGATA ATTTGCTATT 720 

TATGCTTGCG AATGTTATCG TTGCCTTGAT TTTAACGATT GTAGAATGGG TGATATTAGT 780 

GAAGATTTTC AACAGAAAGA TTCCGTTTTT GGATAAAGTT ATCTTAAAAG ATTCAACTAA 840 

TTCTGAGTCA GGTTACAATT CTCATGATAA CCGCTCGCAC CTCGTAGGAA AGACTGCTCA 900 
AACAGTTACA GATCTTCGAC CTGCAGGGAT TATTTTTTGT GAAAATGAAC GTATTGATGC 960

TGTTTCAGAT GGCAACTTTA TTTTGCGCAA TAAAACGGTA AAAATCCTTG AAGTTGAAGG 1020 

AACAAGAGTA GTTGTGAGGG AAGTAGATTA ATTAAAAGGA GCGATACCAT GTTTAGTTTA 1080 

AGTTTTATCG TAATAGCAGT TATTATAGTA GTTGCATTAC TTATTTTATT CTCATTTGTA 1140 

CCCATTGGTT TATGGATTTC AGCGTTAgcA GCTGGCGTTC ATGTTGGTAT AGGTACATTG 1200 

GTTGGTATGC GTTTACGTCG TGTATCTCCA AGAAAAGTTA TAGCGCCATT AATTAAAGCG 1260 

CACAAAGCAG GACTAGCATT AACAACAAAC CAATTAGAAT CGCATTATCT AGCAGGAGGA 1320

AATGTTGACA GAGTTGTTGA CGCTAATATT GCTGCACAAC GTgcTGACAT TGATCTTCCT 1380
TTCGAACGTG CTGCTGCAAT TGaCCTTGCA GGACGTGACG TATTAGAAGC GGTTCAAATG 1440

TCTGTTAATC CTAAAGTCAT TGAAACACCA TTTATCGCAG GTGTAGCAAT GaACGGTATT 1500 

GAAGTGAAAG CCAAAGCTCG TATCACAGTT AGAGCTAATA TTGCTCGACT TGTTGGTGGT 1560 

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AGTAAGCATC ATACAGAAGT ACTTGAAAAC CCAGATAATA TTTCTAAAAC AGTTTTAAGC 1680 

AAAGGTTTAG ATTCAGGTAC TGCATTTGAA ATTTTATCAA TTGATATTGC TGACGTTGAT 1740 

ATTAGTAAAA ATATTGGTGC AGACTTACaA ACTGAACAAG CATTAGCAGA CAAAAATATT 1800 

GCACAAGCAA AAGCTGAAGA ACGTAGAGCT ATGGCTGTAG CAACTGAGCA AGAAATGAAA 1860

GCGCGTGTAC AAGAAATGCA TGCTAAAGTA GTTGAAGCCG AATCTGAAGT ACCATTAGCT 1920 

ATGGCTGAAG CATTACGTTC AGGTAATATC AGTGTTAAAG ATTATTATAA TTTGAAAAAT 1980 

ATCGAAGCTG ATACAGGCAT GAGAAATGCA ATTAATAAAC GAACTGATCA AAGTGATGAT 2040 

GAGTCACCTG AACATTAAGT CGAGAGGTGA TTAAATGAGT GTCGGTATTC TAATTTTTGT 2100

CATATCAGTG ATCATTTCTA TCATTACTAC TATGCGCGAA AATAGTCATA AAGATAGACA 2160 

AAATCAAAAG CCACCTCAAA AAACATCTAC CGATAATGAA CCAAAAAAAG GTGGCTTTTT 2220

TGAAGAAATT GAGCGAACGT TTAAAGAAAT AAGTGAAGAA TTAAATGAAG AAGAAAAGAA 2280

ATCATCGAAA CGAAAATATG ATGATACGTT ACCACCTTTA TTCGATGAAC TTCCAAAGGA 2340

AGAGCCTAAA TCGAAACCTG TTGTAGAACC TATGGCACCT AAAAAACAAC AAGAAACAAA 2400

ACCGATGACA GAGAAACCAA TCACAGTGCC TAAAGCAGAA CCGGTGGAGC AGAAACATAG 2460 

ACCTTCTAGA CAAGATAATT CTGACGAAAT TAGACGTCAA TTAGAAAAAT CACTTAGAGA 2520 

TGATATTAAA ACGATTCGTA CTGACATTGA TAGAGAAAAA GAAAAGCAAA TTGCTAAAAT 2580 

GGAAAAACGT GCTAGAGATA TTATTGAGGA TAAATACTTA TCTGAACGTA CAAAACGTTT 2640

GAAATTAAAG CAGCTGCTTA ATTCTCAAAA TGTCGAAAAA GATTTGACTA AATCAGCGTT 2700 

CCAATTTGAT AAAGATGAAG TAATCAATGG TATGATATGG TCAGAAATTT TAGCTAAACC 2760 

AAAACAATTA TAAAATTTTT TGAAAACAAG CACTATCGTA ATGGTAGTTG CTTGTTTTTT 2820 

TACGTTAAGG AAAATTAAAA AACAAAGAGA ATTTTTCGAG AAATATTAGT TATTTAAATT 2880 

ACAGCAAAAA ATTGATTAGT CTAAAATTGA ATCTGCTTTT ATGACAAGGT GAAAAGTATA 2940

AATGATTATT TTAAATTAAA GAAAAATGAG TAAGTCAATG CAAAGATGTT TAAATCAATC 3000 

AATTGCATGA TATAATTAAG TAGATATTAA AGCATCATAG AATGAATATA AATGATATAT 3060 

GAAAAGGAGC GCGTGTATGC CTGGAATTAT ACAAATAGAC GATATGAACC AATCTCAAGC 3120

TTTAATTGGA AATAATGATG AACATTTAAA AGCAATTGAA GAGAGTTTCG ATGTTGTCAT 3180 

CCATGCAAGA GGACAAGaAG TTGCCGTTAA AGGTACAAAA ATAGAAAACG TAGAAAAAGC 3240 

GGAATCAGTA TTAATCAATT TGCTGAAGGT TATTGATTTA GGTAATAATA TTACAATTAA 3300

AGATGTTGAA GCAGCTATTA AAATGGCGCA TAATAACACA ATTCAACATC TGTTAGATTT 3360
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10 



GCAACGTATA TATGTTAATG CCATGAAAAA TAATGATTTA GTATTTGGTA TAGGTCCTGC 3480

TGGTACAGGT AAGACATTCT TAGCTGTAGT TTATGCAGCA AAGCAACTCC GTAAAGGTGC 3540 

TGTTAAACGT ATTGTATTAA CAAGACCTGC TGTTGAAGCA GGAGAGTCAC TTGGATTTTT 3600 

ACCAGGAGAT TTGAAAGAAA AGGTAGATCC ATATTTAAGA CCTTTATATG ATGGTCTATA 3660 

TACTGTTCTT GGGCGTGAAC AAACAGAGCG ATTTATTGAA AGAgGCATTA TCGAAATAGC 3720 

GCCACTTGCA TATATGCGCG GACGAACATT AGAAGATGCA TTTGTAATTC TTGATGAGGC 3780 

GCAGAATACG ACACATGCGC AAATGAAAAT GTTTTTAACA AGACTAGGTT TTGGCTCAAA 3840 

AATGGTAGTT ACTGGTGACC AAACTCAAAT CGATTTACCT AAAGGTGTTA AAAGTGGACT 3900 

TAAGGAAGCG GTCAGTAGGT TACACAACGT TAAAGGTATA AGTATATTGA AATTAGATCA 3960

GAGCGATGTA GTCAgACATC CATTGGTAAG TAAGATCATT GAACATTATG AAGGAGAGAA 4020

TTAAATGTTT ACGATAGATT TTAGCGATCA CACAGGCTTA GTTAAAGATG CTTGGTATAA 4080

ACAAATTGAA GATTTATTAG AATTTGCTAA AAAAGAAGAG CATATAGAAG ACGATGCTGA 4140 

GCTTTCTGTT ACATTTGTAG ATAAACAAGA AATACAAGAA ATTAATCGAA CATATAGAGA 4200

TAArGwTAAr GTTtnCaGATG tAaTCyCaTT tGCTTTAGrA GrAGATGAGC CmGaGATkGA 4260 

TtTTAGTGGT CTTGATATAC CACGTGTTTT AGGGGATATA ATTATCTGtA CGGATGTAGC 4320

GCAAGrACAA GCAAACAATT ACGGACATTC TTTTGAACGA GAATTAGGAT TTTTAGCATT 4380

ACATGGATTT TTGCATCTAT TAGGTTATGA TCATATGACT GAAGCGGATG AAAAGGAAAT 4440

GTTTGGTCGA CAAGATACAA TATTAAACGC ATATGGATTA ACACGAGACT AATTATGAAA 4500

AGGTTTAAAT ATGCACTTGA TGGGCTGAAA ATCTTAATTC AAAAAGACTA TAAATTTCTT 4560

TTACATGTGT TTGCAATGAT TGTTGCTATT GTCTTTGGTC TCGTACTAAA TATTAATCGG 4620 

ATTGAGTGGA TATTTATACT CATTGCTATT GCATTAGTTC TCACTGTTGA AGCTTTAAAC 4680

ACTGCTATTG AATATGTTGT CGATTTAGTG ACCGTTGAAT ATCATGATTT AGCTAAATAC 4740

GCTAAAGATA TTGCGGCTTT TAGTGTACTT ATAGTTTCAA TATTAGCATT TATTATAGGT 4800

TTAATAGTAT TTTTACCACA TTTTATAGCG TTATTTTAGG GAGGCATATA TGAGTTATCA 4860

ACCTCATTAT TTTCAAGAAG TTAGAAAAGC ACAACAAGAA TCATATTCGC CATACAGTCA 4920

ATTTAAAGTA GGGGCTTATT TAAAAmCGAA AGACgGTAGA ACTTTTTATG GTACCAATGT 4980

AGAAAATGCT TCTTATCCAT TATCGATATG TGCTGAACGA GCTAGTTTGG TATCGGCAAT 5040 

TTCTCAAGGA TACAGACCAG GTGATTTTGA ATCAAtAACT GTAACCGTAG ATGCAGATAA 5100 

ACCGTCATCA CCTTGTGGTG CATGTCGTCA AGTTTTGAAG GAATTATGTG ATGATGATAT 5160 
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ACCATTTGGA TTTTCAGGAA AGGATTTAGA ATAAATGACA GAACATAAAT CAGGATTTGT 5280 
TTCAATTATA GGTAGACCAA ATGTAGGAAA GTCAACATTT GTTAATAGAG TGATCGGCCA 5340 
TAAAATAGCA ATCATGTCCG ATAAAGCTCA AACAACTAGA AATAAAATTC AAGGTGTTAT 5400
GACAAGAGAT GACGCGCAAA TTATATTCAT TGATACGCCA GGTATTCATA AACCTAAACA 5460 
CAAATTAGGT GACTATATGA TGAAAGTCGC TAAAAATACA TTATCTGAGA TAGATGCAAT 5520 

CATGTTTATG GTTAATGCCA ATGAGGAmAT TGGACGAGGC GATGAATATA TTATAGAAAT 5580
GTTGAAAAAT GTTAAGACAC CAGTATTTTT AGTATTAAAT AAAATAGATT TAGTGCATCC 5640

AGATGAATTA ATGCCAAAGA TTGAAGAATA TCAAAGTTAT ATGGACTTTA CAGAGATTGT 5700

ACCTATTTCA GCATTAGAAG GGCTAAATGT CGATCATTTT ATTGATGTTT TAAAGACGTA 5760 
TTTACCCGAA GnACCTAAAT ATTATCCAGA TGATCAAATT TCAGACCATC CTGAACAATT 5820 

TGTAGTGGGT GAAATCATTC GTGAAAAAAT CCTTCATCTT ACAAGTGAAG AAATCCCTCA 5880
TGCGATTGGT GTTAATGTGG ACCGTATGGT TAAAGAAAGC GAAGATCGTG TTCATATCGA 5940 
AGCAACTATA TATGTTGAAA GAsGTTCGCA AAAAGGAATT GTCATTGGAA AAGGCGGTAA 6000 

AAAGTTAAAA GAAGTAGGaA AAcGTGCGAG ACGTGAtATA GaAATGCtTC TAGGCTCTAA 6060
AGTTTACTTA GAATTATGGG TCAAAGTTCA AAGAGACTGG CGAAACAAAG TTAACTTTAT 6120 
TCGCCAAATT GGTTATGTTG AAGACCAAGA TTAATCTTAA AAGTGGTGAA GATAATTGTT 6180 

AATGCGCCAA AAAGGGATTA TCATCAAAGC AGTTGATTAT GGTGAATCTG ATAAAATTAT 6240 
CACGATTTTA AATG 6254 
(2) INFORMATION FOR SEQ ID NO: 405: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3710 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 405:

GTTGTTCTAA ATGTTTCTTG nATGAAACGA GTCAATGTAA ACTGATATTG CTGTATTTGT 60

TGCAGCAATT CATATTGGTC TGGTGTTGCA ATAACAGCAG CTTGAGTTGG AGTCGCAGCT 120

CTGATGTCTG CAGCAAAATC ACTTAATGTA AAGTCTGTTT CATGACCAAC TGCTGATATA 180

ATCGGTGTCT TACAATTATA TATTGCACGG ACGACAGCTT CTTCGTTGAA ATTCCATAAA 240

TCTTCTATGG ATCCACCGCC TCGACCTACA ATAATGGTAT CTACACCTAA ACTATCTGCA 300
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TGTATTTGTT CAGCTAATGG AAAACGACTA TTTATCGTTG AATGGATATC TCGAATTGCG 420 

GCACCTGTAC TCGCTGTTAA AACTGCAATT TTTTTAGGAA ACTTAGGTAT TGATTTCTTA 480

TTCGCTTTAT CAAAACAACC TTCTTCAGTT AATTTTTTCT TTAATGCTTC TAATTTTTGA 540 

TATAAGTTCC CTATACCATC TAATTGCATT TTATTTACAT AAATTTGATA GTTTCCACGA 600 

CGTTCAAAAA CAGAAACACG TGCTTCTAAT AAGACTTCAT CTCCTTCTTT AGGTTCGAAG 660 

TTTAATTTAG AAGCACTACC TTTGAACATC ATGGCAGTTA TAACGCTTTC TTTATCTTTC 720 

ACATTAAAGT ATAAATGaCC ACTTGaATGc TTTTTGAAAT TTGAAAGCTC ACCTTTAATC 780 

AATACAGATT GGAGATGTGG ATCTTGATCA AATTTATATT TAATATATTT CGTTAAAGCT 840

GAAACACTTA AATAATCTGA CATATAACAT CACTCAATTT TATTTTTTTA TATTACTCAA 900 

TACACCATTT ATAAATTTAT AATGATCATC ATCACTGAAT TGTTTTGTTA ATTCAACTGC 960 

TTCATTCATT ACGACTTTAG CAGGTGTATC ACTGTGTAAT ATTTCATATG TTGCCATTCT 1020

TAAAATAATA CGATCCGTTT TTAATAAACG TGCAATAGTC CAATCTTTTA AATAAGGACT 1080 

AATTGTCTCG TCTAATACAG GTTCGTGATC TTTAACGCCA GAAACTAGCC AATGAATAAA 1140 

TTCGAAGTCT AAATCTGGAT TATCGTCTTT AATAAAGCTT ATCGCTTCAT TTATCGTTAA 1200

ATCACTGTCC TTCATTTCTA ATTGAAATAA AGTTTGAAAA GCTTGCACTC GGGATTCTTT 1260

ACGACTCATT TTTAACTCCT TCAAACGTTT GTATTTTTCT TTATTTAATT ACTGAATTAG 1320 

GTATGACATT ACTTTTCAAT AACGATTTGT GTAATGTGAA TATTAATTTG CTTAGGTTCT 1380 

ATCGCTGTCA TATTAGAAAT TGAATTAAAA ATTGACGTTT GAATTTTGTT TGCAGTTTTT 1440 
GAAATATTAA CACCATGTTT TAATGCACAA TATACATCTA TATATATGCC ATCTTCTTTA 1500

CTCTCGATTT TTAAATCACG GCTTAAATTT TTACGACTAA CTTTTTCTAA ATTTGTTTCT 1560 

TTTAATTCAG CAAAATGGCC AGTGATGCCT TCGACTTCCG AAGTAGCTAT ACTTGCAATA 1620 

ACAGATAGCA CTTCTGGCGC TATTTCTACT TTACCTAATT TTGAATTTGA ATAATCAGTT 1680 

ACTTTGACCA TGGATTGACC TCCTATTAAC CTTCATCATT CATAATGCTA TTTTGCTCTA 1740

AAAAGTTTGT ATTAAATTTA CCGCTTCTAA ATATATCGTT ATTCAATAAT TTAATATGGA 1800 

ATGGAATAGT TGTATCAATA CCAAGAACCA CAAATTCACT TAGTGCACGA ATGCCAGCCA 1860 

TAATCGCTTC ATCTCGTGTC GGTTCATGTA TGATTAATTT CGCTACCATC GAATCATAAT 1920 

ATGGCGGTAT CGTATAATTA GTATAACATG CTGACTCTAT TCGAACACCA TATCCACCTG 1980

GTGCAAGATA TTGCTCmATT TTACCTGGTG ATGGCATAAA GTTCTTGTAA GGATTTTCAG 2040

CATTAATTCT AAATTCAATT GCGTGTCCTG TTAATTTAAT ATCTTCTTGT TTATACGGTA 2100 
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j 

CAGTTACAGG ATGTTCTACT TGAATACGTG TATTCATTTC CATAAAATAA AATTTATTAT 2220 

CATTTAAATC ATATATAAAC TCAATTGTTC CCGCATTTTC ATAATTTACA GCTTTCGCTG 2280 

CACGAACTGC GGCATTTCCC ATTTCACGAC GTGTTTCATC ATCTAAAATT GGGGAAGGTG 2340 

CTTCTTCCAC TAATTTCTGC ATACGTCTTT GAATTGTACA ATCACGTTCT CCTAAATGAA 2400 

TTACATTACC ATAGCTGTCC CCAACAATTT GGATTTCAAT ATGGCGGAAG TTTTCGATGA 2460

ATTTCTCCAT ATAAAGTCCA CCATTACCAA ATGCAGTTTG AGCTTCTTGT TCTGTCATTC 2520

GGAAGCCAGT TTCAAGTTCT TTTTCATCAC GAGCAACACG GATACCTTTT CCGCCACCGC 2580 

CAGCAGTAGC TTTAATGATG ACCGGATAGC CAATTTTTTT GGCGATTTTC TTAGCTTCTG 2640 

AGACGTCTTT CATTAAACCG TCACTACCAG GAACAACTGG AACATTGGCT TTGATCATTT 2700 

CTGCCTTAGC AACATCTTTG ATACCCATTT TTTGGATAGA TTGATAACTT GGTCCAATGA 2760 

ACTTCAATTG GcATgctTCG CATAATTCTG CAAAATCAGC ATTTTCAGCT AAAAAGCCAT 2820

AACCCGGATG AACGCCATCA CAACCTGTAG AAGTTGCAAT AGATAAGATG TTCGGAATAT 2880 

TTAAATATGA ATCTTTAGAC AAAGTGGGAC CTACGCAATA TGCTTCATCA GCAATTTGAG 2940 

TATGTAGCGC ATCTTTATCC CCTTCAGAAT AGATTGCAAC AGTTTGGATG CCTAAATCAC 3000 

GACAAGCGCG AATAATCCTA ACTGCGATTT CACCGCGGTT TGCAATTAAA ACCTTTTTCA 3060 

TTATTTCACC TTAAATAACG GTTGGCCATA CTCTACCATT TGTCCGTCTT CTACTAAGAT 3120 

TTCAACAATT TCACCTGAAA TTTCTGCTTG AATTTCATTA AATAGTTTCA TTGCCTCTAA 3180

AATACACACT GTTGTTTCAT TTGAAACAGT GTCCCCAACT TGCACATATG CTTCTTCGTC 3240 

TGGAGATGGC GATTTGTAAA ATGTACCTAC CATAGGTGCA TTAATTGTTT TGTGATTATC 3300

TGAAGTTGGC TTTGGAGCTT CAGTTTTATT GCTATCAGTT GATTGTGCTT GAGGCATAGG 3360 

CATTGCCGCA GCTTCAACTG GCATTTGTGA GATTTGTGGC GTGATAATCT CAGTTTCTTT 3420 

TTCTTTCTTA AGCGTCACTT TGCCTTTAGT ATCTTCAATA TTGATTTCCG TTAAAGTTGA 3480

TTTATCCAGA ATTTCAATTA ATTCTTTGAT TTCTTTAAAG TTCATTATTA CTGACTCCTT 3540 

CAGTTTGTTT TCATCTACCC GTCTATTTTA CTTGAGACAA CTCTTCAATT CAAGCATGTT 3600 

CATATTGCTG GCGACATTAT AAGTCTATCC CAAAGTTATA ATAAAACCAC ATTTTAAATT 3660 

AAAAACACTT GTGTATTTAT TACTTAACAT TGACTCATCT TAACTCTTGA 3710 
(2) INFORMATION FOR SEQ ID NO: 406:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1705 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 406:
GCTGATGTTC GTTGCCTTTn TCCACCAGAC AATTCAGAGG GATATTTATC ACTAATATCC 60

AATATATTTA ATGCTTCTGC TACTTTTTCA TAACGATTTA ACATATGTTC TTTATCTAAC 120 

TTCTGTACCG TTAGTGGTAA CATTATGTTT TCTTTAACAG TCAATGTATG CAGTAAATTA 180 

TACTCTTGAA AAATAAAACC AATATCATGC TTGCGTATAT CAGATwATTC CTTGTTTGAA 240

AGCTTTTCTA ATTTTTTTCC TTTTAATGTA ATAGAACCTT GTGAAATATA ATCAATTGAA 300 

CTTAAAACAT TTAATAATGT CGTTTTCCCA GATCCAGAGG GACCCATAAT AGCAATAAAC 360 

TCGCCTTCTT CAATAGACAT ATTGATATCT CGCAACACTT CTTGTGCCAT TTTTTTAGTT 420

CCATATATTT TTGTTAATTG TTTTACTTCT AAAATTGCCA CTTTAACACT CCTATAATTT 480 

ATCTTAACTT CATTTCCTTT AGGCTTTGGC ACTTGTATCT TCAATTTAAC ATATGACTAA 540

CATCTATCTT ATTATAACGT TGAAGCTGCA TTGATGTATC AATTCTAAGT AACAAAACGC 600 

ATGTTTAAAA TGACAAATTT GTCACTTCCG ACATGCGTTC AACAATTTCA TTTTGTAATG 660 

GGAAAATCAA TCTGACAGTT GTCCCCTTAC CAACAGTCGA CGTGACTTGC AGGTGAATAC 720 

CTAATTGATC CTTTACACTA TTTACTAAAT ATAGACCCAT ACCTGAAGAC GTCGTTTCAT 780 

TTCTGTTAGC CGTTGACGTA AATCCTCGTT CAAATATTCG CGGCATATCT TTTTTACTAA 840 

TACCTCTGCC ATAGTCTTTA ATATATAACG AAACATGTTG ATCATTTAAT TCTGTCCCAA 900 

TTTCAATATT AAAATTCTCA CTATATTTCA ATGCGTTTGA CAAAATTTGT CTAATAATCA 960 

TACGACACCA TTTTATATCT GTATAAACAT AATCATCCAC TTTAAAGTCA ACATCAAAAC 1020 

CAATACCTTT AACCTGACTA ATATGTCTTG TTAATTGTAT TTCATCAATG ACCATGCGTT 1080 

TAAGTGACAC GTAATCAAAA TACATATCTT TACGTTGAGA TTCTAATCTA GTAATATACA 1140

GCTGTGTATC TAGCATCGAG TTTATACGAG ACCATTCATA TAGTAATGCT TGTwTTCTTT 1200 

CTTGATTTTT TTCTTGATCA ATTAATAATT TCATAGCTGT CACAGGtGTT TTTATGTCGT 1260 

GCACAAATTC TGTAATGGTT TGTTCATGCA TGTTCAATTG CAACTGTTGC TCAACAACCT 1320 

TTTCTTTGTG CGCTGAGATT TGACGATATA AATAATCAAC TGTATGACGT TGAAATGGCG 1380 

TTTCCGCTAA ATCTTTATGT TTAATTTCTT CTATTTCTTT ATCTTTGTCA AAATGCTTAT 1440 

ATAATTTTAC TTCTTTAAAA TATGTCAATA AAAGAAAAAT CATTGTTAAA CTTAAATTCA 1500 

AAGAAACAAT ATAAAATAAA CTGTCTATTG GAAAATCATA ATCGATTAGA CTArTGCCTA 1560 

ACATAAGGAA GTTTAAAAAC AATATCCAAA AtATCCAGTT CATGCGAGAT TTCAAAAAAT 1620 
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GCACTATCCA TACTAATTTC AGATA 1705 
(2) INFORMATION FOR SEQ ID NO: 407: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 407:



TCAATATATC TTAAAATTCA ATGATTAAAT CnATTATCAC TAGACATnAA ATACATAAAT 60

CCTATTCCCC ATTTTCATTT HTTAATTCAT AAATGAATCA ATAACCACCT AATAACAAAT 120

CATATTATAC ACCTTTGTTC TCTATTTTTC TAAGGTTTAA AAAATATTTT TAGGTAAACC 180

TAAAAATAGA TGTAATAAAA ACGCCTCCTC AGATATTTAT ATATCTATGA AGACGTTTAA 240

ATACATTATA GATGGTCTGG TTCTGGGTGA ACGTATACTG AGGAAAtACC TTTTTTGTGC 300

AAATGATGTT CGACATTGTC ACAAATTTGA TGCGCTTCTA CTAAGGAAAG GTTAGCATCT 360

ACAACAATTG TGACATCAAT AAACACACTA CTTCCATGGT AACGCCCTTT AATACTTTTA 420

ACTTCTTGTA CTTCATCAAC TTCTAAAATA TCATTGCGAT ACGCTTCTAA TTCAGTTTCA 480

TTGAAACCAT CACTCAACAT AAAAATTGCT TCTTTAAAAA TACCAAAACC AGTATAAACG 540

ATTAGTAAGC CTAGTAATGT TGCTAAAATA ATATCGACAA TTGGGAAACC GATTTGCGTA 600

AAAATTAATC CTATCGCTGT TCCAATGCTG ACTAAACTAT CCGATAAATT ATCTTTGGCA 660

GCCGAATTTA AAGAACTACT TTTCGTTCTT TTCGCTAGTC TTTGATTGAC TGCAAATACA 720

ATCAACATTA CAAGACCACT GATTAAGCTG ACGATAATTG TTATTGCGTT AGGTACAACG 780

TCATCTTCTT TGAACAAACG AGGTGCATTT TGAATAACTA CTTGGATACC TACAAACATA 840

ATGACAAATG ACACCAATAA TGAAGAAATA TTTTCAGACT TCAAATGGCC ATAAGGATGA 900

TTTCGATCGG CAGGTTTAAT TGAAATTTTC AATCCAATAA TAACAGCTAA AGAAACGATA 960

ATATCTGTCA TATTGTTTAA TGCATCGGCT CTTACAGCTG CAGAGTTAAA GACAAAACCC 1020

GTGACATACT TAACAATAGA TAAGATTATA TATACAATTA AACTCAAATA AGCACCGCGT 1080

TGCGCCAATT TAAGATTTTC ATTATGAGAC ATGCGTTGAA CCACCTTGAA TTAGTATAGT 1140

AACAATATTA TGAATGATTC ATTTTAATTT TACAACGTTT TTAATTTTTA TAAATTTTTA 1200

TAAAATTAAA CTAATTTATT CATTCGCAAC CCCTAAAAAT AATTTTTAGC CTTTCTGCGA 1260

ATTTTATGAG CTAGAAAGGC GCCCAACTCT CCCTGTTTGT TAACTTTCGC CTCGAAAGTT 1320
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CGAAvwTTAT GAGCTAGAAA GGCTTATGCA GTTGACGTTT TACGTCCAAC TGCGTTCCTC 1440 

CGTCTTCTTC AAATTTATTT GTnAGAAAGG CACCCAACTC TCCCTGTTTG TTAACTTTCG 1500 

CCTCGAAAGT TTCTATGTTA GAACCCTATG CATGAGTTGC GAAnTATCTA ATGTCGTGAA 1560 

CTAATTATAT AGAAGAAAAA GTGCATCAAT GACAAATTAA ATGAGATTTC TACTCTACCA 1620 

AACTCTCTTC GAAAGACAAT TTTCTCCTCT ATTTATTAGC AACTATTGCA TTTCTCCATA 1680 

TAGTACTTCC TTACTTAAAA TACGCTGAAT GTCTGAATTA AA 1722 
(2) INFORMATION FOR SEQ ID NO: 408:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5521 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 408: 

GCGAGACCCC CTGAGGGAGC AGTGCCCAGT CGAAGACCCG AGGCTGAGAC GGCACCCTAG 60 

GAAAGCGAcC ATTyCAATAC GAaTTgTGAt AAATAGAGAA CAGCAGTAAG ATATTTTCTA 120

ATTGAAAATT ATCTTACTGC TGTTTTTTTA GGGATTTATG TCCCAGCCTG TTTTTTGTGA 180 

TTTTTAATAA TTTGAATATG GAAAATGTAT TATTCTCTCA TTTGTATAGA TTGTATTTAA 240

TAAGTTAATG TAATCCTTGA GCTCACGATT AATAAAATTC TATAACCTTA ATTATTTTCT 300 

CGATACAAAG GGTTATTAAC TTTAATATAA GTATAATGAT GTGCCTCATC TTCAAGACGC 360 

ATTGTTGTAA CACACTTATA ATCTATAAAT GGCGCGAACA TGGTATCTTT AATTTCaTTT 420 

ATGCGATTCT CATTTACTTG ATTAGATTTG TGTGTCGAAA GTACAAGTTG ATCaAAAATG 480 

TTATCTAGTA CATCACGAAC GATATACCAC ATATGTCTTT CTAAGTTTGA ATCATTTGAT 540 

GCTTTAGAGA TTGTAAGAAT TAATTCGCCT AAATGGTTTT GAACGGTAGA ATAAAAGGCT 600

TTGTTAAACA CAGACGTTTT TGAATCAGTA AGAATTCTTG ATTTTTCATG GAAATGAGAT 660 

GTACTGTATA CCATTTCATT TAGTTGTGCT TTATCAATTC TTAAACCTTC AAAGTCTCTA 720 

ATATACATCG TGTCCAATAG ACCATCTTTC CGAAATGTTG CAATAGCATT TTGCAAGTGA 780 

GCCTCTAATG CAATGCCATA TTTAGTAACT AGTGGGATTA CGAGACCAAG CAATGCTTTA 840

CTATAAGTTT CAACCCACGA TTTCGCCGAT GATTCAAAAT CAGACAATGA TGCAGCTGAT 900 

TGATAACGTT TAATCAATGT CACGATAGGT GATTCATTGT TAAATGGGTA GGTTGCAACT 960 

AAGCTTGAAG GAATCAATGG TGTGACTTCT TGTGGAATCA TTTGGTATAT ATTTTTTCTA 1020 
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AAAAATGAAT ACCAGCAACT TCATCAATAA TTGTTGATGC ATAGGACTTA AATATGACAT 1140

CTTTCTCCAA AATATCATTT AAAATACGTG TCATTAGTGG ACCATTGTGC GTCGTTTGTT 1200

CTGATAATGT ACGAATCTCA CCTGTAATAT GAACGTTTGT CGACAATTTG ATGTGTGGCG 1260

ACATAGCTGG GTATTTAGGA ACTAATGTTC TGAAAGATAA ACCAGCATAA TAATCCaACG 1320

TATGTTTTGC TTCAATGATT AATTCTTTAT CTACTTCTGC TTGATAATCA GAATGTAATA 1380

CGTCATCTAA TTGCCATGGA TGAACAATCA TAATGTGATA ATCATTAAAG TTAAACTTTG 1440

GCGTAAATTC ATTTTCTAAT TGTTTAATTA AGTCCGGAAA TAGTTGATGA ACAGTTGTAT 1500

CATAATCTTT AGACAGTGAC ATAGTACGGC TTAATTTACT GTGAATCAGT ACTATTTTCA 1560

ACTTAATAGG TTGATTGAAT TCTGAAGAAT ATAGGAATGT TTGTAATGCA TTTAAACCTT 1620

TACGTAATTT AGCCCCAGGA TGTAGCGGAT GACCTTCAAT AACGGCTTGC TCTGAACGCA 1680

AGTAACTATC TTCGCTATTT TCGATAATAT TAAATAAAGG TGCAGAATCA TGTTGCATTG 1740

ACAGTGCTTG ATAGCTAATT GCAAATGTCA TATtAGTtGC ACTGTTTATT AAATCTTGCT 1800

GAAATTGATC ACTAGCAGCA TTTTTTAAAT CTGGTGCTTC AATTAAAATA CACTCAAGAA 1860

TTTCATTTGG ATGGTGTACT CGTGTAATCG TATTTGTAAT GTCATCTTTA ATGTAGAAAG 1920

GGCCTTCAAC ATCAATTCGA TCAAAGGCGT GTTCTCCAGT GATAGGAGCA TATAATGTTT 1980

GCTTAGCTTG TGGGAACTGG ATTTCTAGTA TATGAGTCGT TGAGATATCT AACATAATCA 2040

AATCACGACT CAATATTTTC TTACTTTGAG TGCGCGCTTT AACTAAGTTT TCGCGATGCA 2100

TTGATGTGAC CAATCTCTGA GTGACTTTAT CTCTTCCTTG TAAAATCATC TCTTTAAAAA 2160

TATTAGCCCA ATCGCTATTA TGTTGTTGTA AAAATAAATA TGTTTCTTGT TCTTCTTTGT 2220

TAAATTTTAA TGTCTGTTCT TTAAAAATTA AGTTCAAGTT CATAATTCAC CTCTATGAAA 2280

TATTTTACAA AAGCAAGATA GATTTGTATA ATCCATATTA ATGATAATGA yTCTTATTAT 2340

CAACAGAATG CGGGTGTAAG TTTTATGACA AAATATTTTT TTAGCAGTTC TTTTCTACTA 2400

TTTCTAGGTA ATTGGATTGG ACAAATAGGG CTAAATTGGT TTGTACTTAC CACTTATCAT 2460

AACGCAGTTT ATCTGGGGAT TGTCAATTTT TGCAGACTTG TACCAATATT ATTACTAAGT 2520

GTGTGGGCAG GGGCAATTGC CGATAAATAT GATAAAGGGC GATTGCTGAG AATTACAATT 2580

TCATCATCAT TTTTAGTAAC TGCAATTTTA TGTGTGCTCA CGTATAGTTc ACTGCAATTC 2640

CAATTAGCGT CATTATTATA TATGCGACAT TAAGAGGGAT TTTAAGTGCG GTTGAAACAC 2700

CTTTAAGACA AGCAATCTTA CCAGATTTAT CAGATAAAAT ATCTACTACA CAAGCTGTmw 2760

CATTTCATTC ATTCATCATT AATATTTGTC GTTCAATAGG GCCTGCCATT GCTGGTGTCA 2820
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CAGTTTTATT ATGCTTACCA TTACATTTTA AAGTAACTAA AATACCTGAA GaTGCATCAA 294 0 

GaTACATGCC GTTAAAAGTT ATTATAGATT ACTTCAAATT ACATATGGAA GGTCGACAAA 3000 

TATTTATAAC ATCATTATTG ATTATGGCGA CAGGTTTTTC ATATACGACA CTTTTACCAG 3060

TTTTGACAAA CAAAGTATTT CCGGGGAAAT CTGAAATATT TGGTATCGCT ATGACGATGT 3120 

GTGCCATTGG TGGTATTATT GCAACGCTAG TTTTACCTAA AGTACTTAAA TATATTGGTA 3180

TGGTAAATAT GTATTATTTA AGTTCATTTT TATTTGGCAT TGCTTTGTTA GGTGTGGTAT 3240 

TTCACAATAT TGTCATCATG TTCATTTGTA TTACATTGAT TGGGTTATTT AGTCAATGGG 3300 

CACGTACGAC AAATCGCGTT TATTTTCAAA ATAATGTTAA AGATTATGAA CGTGGTAAAG 3360 

TACTGAGTAT TaTTATGATG GgATAGAGGT ATGaTTCCAT cGGGAAGTCn ATTAATGAGT 3420

ATATGTGCAG ATGTGTTTGG CATTGTTAGA ACTTTTTCAA TAATGGGAAT AAGTACTATA 3480

TGCATTACAA TGGTATTCTA TTTTATAAAT AGAAAGTTGA AGTTAAAGTT GGAGGAAAGT 3540

AATCATGGTA TATCTTGAAT GGGCAAAGGC AGATAGAAAT ATTCAATATC GTGTAATTAA 3600 

CGCCATTATT AAAGAACGTA TTTACCCCGA GCAAACATTT ATTTCGCAAA AAGGATCTTT 3660 

AATTGAAATT CAGTATCATA TGCATGTGTT GACTATTGAA GTTGTTAGAA AAAGTGCATT 3720

AGAACGCTAT GAGTTTACAG GTGATATTAC TTATTTAAAT AAAGGTGAAA CGTCATTAAT 3780 

TATAACTTTA GAAGGTTTAT TAGATGTGTT GAATCATGAC TTTGATATCC CTATTTCAGA 3840

GCGACTACGC GAAGAGTTAA TACACAGTCG AGATAGTTTA GTTGAAACAT ATAAGCAAAT 3900

GTCTCACAGA CAAACGTTAA TAAGTCmAAG TTTTAAATTT TCAAGGTTAC CACAAGATAT 3960

TAACTTTTTT TCakGGTtAC AACATGTAAA AGATAGTGAT AAGACAGATG ATTTAACTTA 4020

TTCTGAGAGT TTGGTACCAG AGGGGCATCC AACACACCCT TTAACCAAAA CGAAATTGCC 4080 

CTTAACTATG GAAGAAGTAC GAGCATATGC ACCTGAGTTT GAAAAAGAAA TCCCTTTGCA 4140 

AATTATGATG ATTGAAAAAG ACCATGTTGT GTGCACAGCT ATGGATGGTA ATGATCAATT 4200

TATTATTGAT GAAATAATTC CCGAATACTA CAATCAGATT CGTGTGTTTT TAAAGAGTTT 4260 

AGGTTTGAAA AGTGAAGACT ATAGAGCGAT TTTAGTACAT CCTTGGCAAT ATGATCATAC 4320 

GATAGGGAAA TATTTTGAAG CATGGnTTGC TAAAAAAATA TTAATTCCAA CGCCGTTTAC 4380

AATACTTcCA AAAGCaACTT aTCatTTaGG ACGATGTCTT TAATTGATAA ACcATACCAT 4440 

GTTAAgTTGC CCgTCGATGC aCAAGCAACA AGTGCCGTTA GAACAGTCTC AACTGTGACT 4500 

ACTGTAGATG GACCAAAGTT AAGTTATGCT TTACAAAACA TGTTGAATCa ATATCCaGGA 4560 

TTTAAAGTTG CTATGGAACC GTTCGGTGAA TATGCAAATG TTGATAAAGA TAGGGCACGT 4620 


AGTGCAAGTC TAGTTAATAA AAATCCAATA GATCAAAAAG TTATCGTGGA TAGTTACTTA 4740

GAGTGGTTAA ATCAAGGAAT TACTAAAGAA AGTATTACGA CATTTATTGA ACGATACGCT 4800 

CAAGCATTAA TCCCGCCTTT AATTGCTTTT ATTCAAAATT ATGGAATTGC TTTAGAAGCA 4860

CACATGCAAA ATACAGTAGT GAACTTGGGG CCACATTTTG ACaTTCAATT TTTAGTGAGA 4920

GATTTAGGTG GTTCTAGAAT TGATTTAGAA ACATTACAAC ATCGTGTATC AGATATTAAA 4980 

ATTACAAATG ATAGTTTAAT AGCTGATTCT ATAGATGCAG TGATTGCAAA ATTCCAACAT 5040

GCTGTTATTC AAAATCAAAT GGCAGAATTA ATCCATCATT TTAATCAGTA TGATTGTGTT 5100 

GAAGAAACCG AATTATTTAA CATAGTACAG CAAGTAGTAG CGCATGCCAT TAACCCAACA 5160 

CTACCACATG CAAATGAGTT AAAAGATATT TTGTTTGGAC CAACAATTAC TGTCAAAGCG 5220 

TTGTTAAATA TGAGAATGGA AAATAAAGTA AAGCAATATT TAAATATTGA GTTAGATAAT 5280 

CCGATAAAAA AAGAGGTGTA GTACTACATG GCACACGTTA ACATAAATAT ATCGAAGATT 5340

AAaTATAACG CCAAAGTACT TCAAACAGTT TTTCAAAGTA AAAATATGCA ATTCACACCA 5400 

GTAATTAAGT GCATAGCTGG TGACCGTACA ATTGTAGAAA GCTTAAAAGC GTTAGGTATC 5460

AATCATGTTG CAGAATCCAG ATTGGATAAC ATAATTAGTA TTGCAGATAC AGGATTTAAC 5520 

A 5521 
(2) INFORMATION FOR SEQ ID NO: 409:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1261 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 409:

AGGTCGTCTr GCaGmAGmGm TATTTGACGA TCTTGCTTTC CCAAAACACG ATGATGATTT 60

TAACATACTG TCTGATTATA TTGAGACACA TGGTGATTTC aCATTGCCAA TGTCTGTATT 120

TGATGATTTA TATGAAGAAT ATACGGAATG GCTAAAATTT TAATATAATT TTTAATAATA 180 

ATAGTTAGAA CCAGGGTGAT GCAATTCGTT ATCCTGGTTT TAATTTAAAA TAAACTAAGT 240 

TGTGACTAAA AATTAATCAA TTATAGTGAA ATATGGTGCG CTATCTTGCA TAAATTGATA 300 

TGATTAACTA CACAGAATTT AAAAGTACAT AATACATAAT AAGGAAGTGA TACAATGGAT 360 

GATAAGCAAC ACACATCTTC ATCCGATGAT GAACGCGCTG AAATTGCAAC AAGCAATCAA 420 

GACCAAGAAA CTAATTCATC GAAACGCGTT CACTTAAAAC GTTGGCAATT CATATCAATA 480 

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CAAAAAATAA GTGGTTTAAA CAAAACTGAT CAAGCAAACT TAAATAAAAT TGAAAATGTG 600
TATAAAATCT TAAATAGTGA TTATTACAAA AAACAGGACT CTGACAAGTT AAGTAAAGCT 660
GCAATTGATG GCATGGTCAA AGAATTAAAA GATCCTTATT CTGAATATTT AACAAAAGAA 720
CAAACGAAAT CCTTTAATGA AGGTGTTTCA GGTGATTTTG TAGGTATTGG TGCAGAAATG 780
CAAAAGAAAA ATGATCAAAT TATGGTTACT AGTCCTATGA AGGGATCTCC AGCAGAACGT 840
GCTGGCATTC GTCCTAAAGA TGTCATTACT AAAGTAAATG GAAAATCAAT TAAAGGTAAA 900
GCATTAGATG AAGTTGTCAA AGATGTTCGT GGTAAAGAAA ACACTGAAGT CACTTTAACT 960
GTTCAACGAG GTAGTGAAGA AAAAGACGTT AAGATTAAAC GTGAAAAAAT TCATGTTAAA 1020
AGTGTTGAGT ATAAGAAAAA AGGTAAAGTT GGAGTTATTA CTATTAATAA ATTCCaGAAT 1080
GATACATCAG GTGAATTGAA AGATGCAGTT CTAAAAGCTC ACAAAGATGG TTTGAAAAAG 1140
ATTGTTTTAG ATTTAAGAAA TAATCCAGGT GGACTACTAG ATGAAGCTGT TAAAATGGCA 1200
AATATTTTTA TCGATAAAGG AAAAACTGTT GTTAAACTAG AnAAAGGTAA AGATACTGAA 1260
G 1261

(2) INFORMATION FOR SEQ ID NO: 410: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2488 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 410: 
AAATATATTG AAnAGAnAAT TACTAAGATT AAATCnTCTT AAAATATCCC TGAAATAACG 60
TCCTAAAGAT TAAAGGAAAG AGGTTATAAG TTATGCCAAA ATTAATTTTA TGTCGTCATG 120
GACAAAGCGA GTGGAATGCT AAAAACTTAT TTACTGGATG GGAAGATGTT AATTTATCTG 180
AACAAGGTAT TAATGAAGCG ACTAGAGCAG GTGAAAAAGT AAGAGAAAAT AACATTGCCA 240
TCGATGTAGC TTTTACATCG TTATTAACAC GTGCTTTAGA TACAACGCAT TATATTTTAA 300
CTGAATCTAA ACAACAATGG ATTCCTGTAT ATAAAAGCTG GCGTTTAAAT GAACGCCACT 360
ATGGTGGATT GCAAGGCTTA AATAAAGATG ATGCTAGAAA AGAATTTGGA GAAGAACAAG 420
TACATATTTG GCGTCGTTCT TATGATGTGA AACCACCTGC TGAAACCGAA GAACAACGTG 480
AAGCTTACTT AGCTGATCGT CGATATAATC ATTTAGATAA ACGTATGATG CCTTATTCTG 540
AAAGTCTGAA AGATACTTTA GTTCGAGTGA TACCATTTTG GACAGATCAT ATTTCACAAT 600
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TTAAATATCT TGAAGATGTG TCAGATGAAG ATATCATTAA TTATGAAATT AAAACAGGTG 720

CACCGCTTGT TTATGAATTA ACGGATGATT TAGAAGTTAT AGATAAATAC TACTTATAAA 780

AaAAGAGCTG CATGTACACA AGGAGTGAGT GTATATGcAG CTCTTAAAtA TGTGAAGTAA 840

TGTAAGGAAA TAGTTAAGTA TAGAGTTTAT ATTAACGAGC TAGGGATACT CGAAAATATA 900

GTTAGACATA CAATATAGTC AAATTAAAAC AATTATTTCG CTCTTTTATG TTGCTTAATA 960

ATCTTTAAAG CACGCTTTCT TGTTTTAATG TTAGGGCTAT TTAAATTACG ACGAGCAGTC 1020

TGTAAATCTA ATTTCATCTC TATCCCTCCT TGTAAATATA TTATGACCGA TAACTACTCA 1080

TATGTAAATA GTAATGATTA CGTTTTAAAG AAATTGTAAT AAAGTCGTGC TAATTTTTTG 1140

GAAAATGGGT ATAATTACCG GATATCTAAA AATGTGTGTC GTTTTTTAGA TGGTGAGGGG 1200

GAAGCTTTAA ATGTCGAAGA AACAAAAATT AACGATGATT ATTACTATGC TGATGGGTGG 1260

ATTTTTTGGA TTATTAAATG AAACACTATT AGTGACGGCT TTACCAAGTA TTATGAAAGA 1320

TTTTGAAATT TCATATACAC AAGTTCAATG GCTGACAACA GCTTTTTTAT TGACTAATGG 1380

GATTGTTATT CCTTTGTCCG CGCTTGTTAT ACAACGTTAT ACAACAAGAC AAGTGTTTTT 1440

AGTGGGTATT TCTATCTTTT TCTTAGGTAC ATTACTGGGG GGCTTGAGtC CGCACTTTGC 1500

AACATTATTA GTTGCTAGAA TTATTCAGGC GTTAGGCGCA GGTATTATGA TGCCATTGAT 1560

GATGACAACG ATTTTGGATG TTTTCCAACC ACATGAACGC GGTAAATATA TGGGGATATT 1620

TGGTTTGGTA ATTGGTTTAG CACCAGCTAT TGGACCTACT CTTTCAGGTT ACCTTGTTGA 1680

ATATTTTAAC TGGAGATCGC TTTTCCATGT TGTCGCTCCA ATTGCAGCTG TGACATTTTT 1740

AATTGGaTTT AAAAcGATAA AAAATGTTGG AACTACAATT AAAgTACCTA TTGATTTTAT 1800

TTCTGTCATT TTTTCTGTAC TAGGTTTCGG CGGGTTATTG tATGGAACGA GTTCaATTTc 1860

AGAAAAAGGT TTTGATAATC CTAcGtATTA GTATCTATGA TTGGAGGCX3T TGTTTTAGTC 1920

GCATTATTTG TAwTACGTCA ATATCGGCTA TCAACACCAT TATTaAATTT TGCTGTATTT 1980

AAAAATAAAC AATTTACAGT TGGTATCATT ATTATGGGTG TCaCAATGGT ATCGATGATT 2040

GGTTCGGAAA CGATTTTACC TATCTTTGTG CAAAATTTAT TGCATCGTTC AGCTTTAGAT 2100

TCTGGATTAA CTTTATTACC AGGAGCAATT GTTATGGCAT TTATGTCGAT GACTTCGGGT 2160

GCTTTATATG AAAAGTTTGG TCCTAGAAAT CTTGCTTTAG TAGGTATGGC GATTGTTGTT 2220

ATTACTACGG CTTATTTTGT TGTAATGGAT GAACAAACAT CAACAATCAT GTTGGCAACA 2280

GTTTATGCGA TTCGAATGGT TGGTATCGCG TTAGGATTAA TTCCAGTAAT GACCCATACG 2340

ATGAATCAAT TAAAGCCAGA AATGAATGCA CATGGTTCAT CTATGACAAA CACAGTACAA 2400
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AACTTTTCTC CAACTATGTC AGACTATA 2488 
(2) INFORMATION FOR SEQ ID NO: 411: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 411:



TTTTACAATT TCAGATATCT CTAAAACACA TACCGATCCA ATGGCCAAAA TTATTCGTCA 60

AAAATTGAAA AAACTAGGAA TTCGTAAAGG GATTCCAGTT GTATTTTCAG ATGAAAGTCC 120

AATTGTCATA AGAGAAGATG TAAAAGATAT AGTTGGAGAT AAAAATGCTA TCAATCGAAA 180

AGGACAAATG CCACCTTCTT CAAATGCCTT TGTGCCAAGT GTTGTTGGAT TAATTTGTGC 240

AAGTTATGTG GTGAATGACG TATTAAAAGA TATTCCAGTT CGTCGCATTA AAGACAAAGG 300

GCAATAATTC ATTTTGAAAG GGATAATTTT CAACGTAAGG CAAGTGTAAC CACACCATAA 360

AACTAAAACT GACTAGTTCG CAAGCATAGT AGAATATGCC TCGTGTACTA GTCAGTTTTG 420

ATTTGATTAT AGCTAAAATA TCAAAGGTAA TCGATTTAAT CTTATTTTTA CGGCCTTTGT 480

TTAAGTAAGT TGTTATATAT TTCTTTGAGT TGTTGTTCAC TTTTAGATGT CGTTTTTGGT 540

TCGTAATAAA TTTTGTTTTT TAGTTTATCA GGTAAATATT GCTGTGAAAC ATAGCCATTA 600

ACATATTGGT GTGGATATTT GTAACCAATA GATCGGCCTA GATCTTTAGC ACCTTGATAA 660

TGTCCATCTT TTAAATGGTT TGGAATTTGG CCCACATGAC CGTTTCTAAT ATCGGACAAT 720

GCACTATCAA TGGCACTCAT TGCTGAATTT GACTTAGGTG ATAAGCATAG TTCGATTACT 780

GCTTGGCTTA GTGGAATTCT AGCTTCTGGT AAACCTAGAC GTTCTGCTGA TTCAATAGCA 840

GCAAGTGTTC TCTGACCAGC ATTAGGTGAG GCTAAGCCTA TATCCTCATA GCTAATTACA 900

AGTAATCGTC GAACTATTGT AGGTAAATCT CCAGCTTCAA TTAATCGTGC TAAATAATGT 960

AAAGCGGCAT TGACGTCGCT ACCACGGATA GATTTTTGGA AAGCGCTCAT AACATCGTAA 1020

TGCATGTCAC CATCCTTGTC ACTTACAAAT GCACCTTTTT GTAAACAGTC TTTAGCATCT 1080

TGCAATGTAA CATGTCGATA ACCGT 1105



(2) INFORMATION FOR SEQ ID NO: 412: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 579 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 412: 

TAACGTCGAA CTTGAGCTGT TACGTTATGA CTCATAATTA TTTTAGCATA GTCATTTAAA 60 

TAAACTTCTG TTCTGTCTGT TGGATAAGCA AATTCAAGCA ATTGACTGTA ACTATCATTA 120 

ATGACTTCTT GATCAACATG ACTATCAAAA TATACAGCAT AATAATAAGT ACCATCAACC 180 

ATATATAACA AATCTTCAAA CTCTGTAGTT ATTGGATTGC TATGATATGC ATAATTAATA 240

ACATCTTCTA AATCGTTAAA TTTCACAATG ATTGTTCTTG TATTTTTACG TGCTGAAGAC 300 

TTTTGACGTT TAGAACCTTG AGCTTCTTTT TCTTTTGTTC GTTGCTCGAA TAATTCTTCT 360 

AATTGATCTT CACCTTCTAA TGTTTGAGCT AACAATTCTT GAACTTGTTC ATCAAATkGA 420

TCAGTTGCAT CATCATCAGA CATATTCATC ATATCTTCAT TTTTAGATTT AGAAATTGTG 480 

ACTTCGACAC CTTTTTCAAA GGCATGTACT TGAATCCATA ATGGACCTnC TACAACAAAA 540 

TCTTCTACTT CGTTAATTTC ATCCATCATT GAnCAAAAG 579

(2) INFORMATION FOR SEQ ID NO: 413: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1342 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 413:



GTGGTGAAAC TTCTTGCTTT GTAATTTTAT AAAGTGGATC AATATCACTC TGACTTACAT 60

CAGGTTGAAC TGTCATCTTT TTAGTTACTT TGTTTTCAAG CGTAATATTT GCTAATTcTA 120

GGtTTTACAG AATAATGAAT GTCATCAGCT AmCCCTTTmm CtTGATACTy CTCTTGATAT 180

TTTCCAGCTT TAGCATTTGA TAAATCAATC ACTACTCTTA AATCTTCTGG ATTTTCAATT 240

TTTATTATCT TTGATTGTGG TCCTGAAATA GTCACATTAA CTGTTTCAGG CGCTTTTGTT 300

AAATGCAAAT CTTTAGTGTT ATAAAGAATT TCAACGGGTA CATCTTGAAT CGTTTTACTA 360

GACTTTTGAC CAAGATTACC AGTGTTAAAG ATATTTCCAA AAACATTGTT AACAGATAAA 420

AAGAAAAACA ATGCCAAAAG AAAGGCAATA AATCTCAAGC CCCATTTACT TTCTAGCATA 480

TTATTTCACA CCTTTCTTTT GAAAGCGTGT GCCAAACCAA TGTTCAGCAA GCAACTCTTC 540

AAAAATTTCG TTTGAAATGT CTCGTCGTAA TTTTCCATCA AATGTTACCG AAATATCACC 600

AGTTTCTTCA GATACAATAA CGGTAAATGC ATCAGATACT TCTGAAATAC CAACCGCAGC 660
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10 






TGCTGCTGCT GCAATCTTCG TGCCTTGAAT AATCATTGCA CCATCATGTA AAGGTGTGTT 780 

AGGTATAAAG ACATTAATTA AAAGTTCTTG CGAAATATTT GAATCCATTG CAATACCTGT 840

TTCAATATAA TCTTGAAGAC CTGTTTCTTT TTCAAAGACA ATTAATGCAC CTATACGTCT 900

TTTAGCCATA TATTGCACAG CCTTTGAAAC CGATTGAATC AATTTCTCTT CATCTTTACT 960 

ATACGTATTA GAAGTATAGC GTTTTAAAAA GCTACCTCTA CCAAGTTGTT CTAACGCACG 1020

TCTAATTTCT GGTTGGAATA TTACTATTAA AGCTAATACC CCCCATTGAA TAACGATATC 1080 

GAATAATTTA GATGTTGCAG TCAAGTTCAA TATCATACTT ATCTGCTGAC CAATAACAAT 1140

TACTAATATC CCTTTAAGTA ATTGTATCGC TTTAGTTCCC TTAAAGACCG TGATGAGAAG 1200 

ATAAAGTACA TACCAAACTA TCAGTAAATC AAGGATACTC GTTACAATTT TTAACGTACT 1260 

GAGGTTTTGA AAAAAGTTGG AAAAATCCAT AACATCTCCT CCGGGTAATA TTTTTCCATA 1320

ATACCCATTA TACCAATCAT TT 1342
(2) INFORMATION FOR SEQ ID NO: 414: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1073 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 414:

CTGTATAAAG ATGGAGGTGT TTTGCATGGT AAAACGTACT TATCAACCAA ATAAACGTAA 60 

ACATAGTAAA GTTCATGGTT TCAGAAAACG CATGAGCACA AAAAATGGCC GTAAAGTTTT 120 

AGCGCGCCGT CGTCGTAAAG GCCGTAAAGT TTTATCTGCA TAAGATCACT GACTCATCAG 180 

TGAtCTkTTT TTTCGTTTAA ATTAAGAATA AATAGAAATT TATGTTATAA GCTCAATAGA 240 

AGTTTAAATA TAGCTTCAnA TAAAAACGAT AnATAAGCGA GTGATGTTAT TGGAAAAAGC 300 

TTACCGAATT AAAAAGAATG CAGATTTTCA GAGAATATAT AAAAAAGGTC ATTCTGTAGC 360 

CAACAGACAA TTTGTTGTAT ACACTTGTAA TAATAAAGAA ATAGACCATT TTCGCTTAGG 420 

TATTAGTGTT TCTAAAAAAC TAGGTAATGC AGTGTTAAGA AACAAGATTA AnAGAGCAAT 480 

ACGTGAAAAT TTCAAAGTAC ATAAGTCGCA TATATTGGCC AAAGATATTA TTGTAATAGC 540 

AAGACAGCCA GCTAAAGATA TGACGACTTT ACAAATACAG AATAGTCTTG AGCACGTACT 600 

TAAAATTGCC AAAGTTTTTA ATAAAnAGAT TAAGTAAGGn TAGGGTAGGG GAAGGAAAAC 660

ATTAACCACT CAACACATCC CGAAGTCTTA CCTCAGACAA ACGTAAGACT GACCTTAGGG 720 



TTTAGATACA ATTACGAGTA TTTCAACACC AATGGGTGAA GGGGCAATTG GAATTGTTCG 840

ATTGTCTGGA CCGCAAGCCG TTGAAATTGC TGACAAATTA TATAAAGGAA AACATCTTTT 900

AAATGATGTT CCATCACATA CGATTAATTA CGGTCATATT ATTGATCCAG tCTAAAGAAG 960

TGGTTGAAGA AGTTATGGTG TCTGTGTtAA GAGCGCCAAr AACATTTACA CGCGArGATA 1020

TTATAGAgAT TAATTGTCAT GGTGGtATTk TAACGATTAA TAGAGTGCTG GGA 1073

(2) INFORMATION FOR SEQ ID NO: 415: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3176 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 415:



CTTTACCAAT GCCAAATCCG AAGTAAAGTA TAGCAATAAA GATTACTAAT ACAATTCTGT 60

AAATGGCAAA TGGAATTAGT TTGATTTTGT TAATTAGATG CAAGAATGTT TTGATTGCAA 120

AGTAAATGCA GCTAAAAAGC CTAAAATATA AAAAGGTATA TCAGCAATCT 180

GAATATCTTG ATAATGTTTT AATAAAGATA AACCACTAGC TGCTAACATA ATTGGAACAG 240

CCATAATAAA TGTAAAGTCC GATGCTGCTT TATGATTTAA TTTCATTAAT ACCCCAGTTG 300

AAATTGTTGA GCCTGAACGG CTGAAACCAG GCCACATAGC TACTGCTTGA gAAATACCAA 360

TTACAAATGC TTGGAAATAA CTGATTTGAT CTACTGTTTG TGGGTTTTTA ACTTTAGCTG 420

AGTATTTATC AGCAATAATC ATATAGATAG CACCTACGAA TAAGCCAATC ATAACAGTTG 480

GCACACTAAA TAAATGTTCT TCGATGAAAT CATCAAATAG TAAGCCTAAA ATACCTGCTG 540

GCACCATACC CACTAATACA TGTAATAAAT TTAAACGTCT TGGCTTTGAA CGTCTTTGTT 600

GATCGTTATC TCCTTCAACA TACCAATATG TAAAATCTCT AAGAAGCGTT 660

CGCGGAACAC CCATGCTGCT GCAAAGACGG ATCCTAATTG GATGACGATT TTAAATGTAA 720

ATGCTGACTG AGAACCTAAA AATTCAGATG ATTTTAACCA CATATCATCA ACTAGGATCA 780

TATGTCCAGT AGAGGAAACA GGTGCAAATT CTGTTAATCC TTCGACGACC CCTAAGATAA 840

TACCTTTTaT TAATTCAATG ATAAACATAA TGTACCCACT TTCATTACTC AATTTAATTT 900

ATTTAAATAT CAAAATTACC ATATCATGAT AGCATATTCA TTTAAAGACA TGCTAGTTAT 960

AGTTATAATA CTAGACTAAA GATGTATATA TTCATTTTCT TTTACATGTA AAACTACAAT 1020

ATTTTATTGA GCTATTTAAT TTGATTTTAA GGAAAACCTT TTATAATAGG TTTAGGTGAT 1080






TTCTTGGTCA GTACTGGTCT CGGCATAATC GTTATAACGC AAAATATTTT AATAGCAGAT 1200 

TTTTTAGCTA AAATTATAAG ACATCAATTT CAAGGTTTAT GGATTGTATT ATTTATTTTA 1260 

TTAGGTGTTT TACTTTTAAG AGCAACTGTG CAATTTCTAA ATCAATGGTT AGGTGATACA 1320

TTAGCATTTA AAGTTAAGCA TATGCTTAGA CAGCGGGTTA TTTATAAAAA TAATGGTCAT 1380

CCAATCGGTG AACAAATGAC TATACTCACA GAAAACATTG ATGGTCTAGC ACCTTTTTAT 1440

AAGAGTTATT TGCCTCAAGT GTTCAAATCA ATGATGGTTC CGCTCATCAT AATCATTGCA 1500

ATGTTTTTCA TCCATTTCAA TACCGCATTA ATTATGTTAA TAACTGCACC ATTTATTCCT 1560

TTGTTTTATA TTATTTTCGG TTTGAAAACG CGAGATGAGT CAAAAGATCA AATGACTTAT 1620

TTGAATCAAT TTAGTCAACG GTTTTTAAAT ATTGCTAAAG GTTTAGTGAC GTTAAAGCTA 1680

TTTAATCGTA CAGAGCAAAC AGAGAAGCaT ATTTaCGACG ATAGTACTCa GTTTAGAACT 1740

TTAACAATGC GCATTTTaCG CAGTGCTTTT TTATCGGGAT TAATGCTCGA ATTTATAAGT 1800

ATGTTAGGTA TTGGATTGGT TGCATTGGAA GCAACGCTAA GCTTAGTAGT ATTTCATAAT 1860

ATTGATTTTA AAACTGCGGC AATTGCGATT ATTTTAGCGC CTGAATTTTA TAATGCAATT 1920

AAGGACTTAG GGCAAGCGTT CCATACTGGA AAACAAAGTG AAGGTGCCAG TGACGTTGTG 1980

TTTGAGTTTT TAGAACAACC GAACTATAAT AATGAATTTC TATTAAAGTA TGAGGAAAAC 2040

CAAAAGCCAT TTATTCAGTT AACAGACATA TCATTTCGAT ATGATGATTC TGATAGATTG 2100

GTATTAAATG ATTTAAATTT GGAAATATTT AAAGGTGATC AAATTGCACT TGTAGGTCCA 2160

AGCGGGGCAG GTAAATCCAC TTTGACACAT CTTATTGCAG GTGTTTATCA GCCAACAATA 2220

GGTACTATAA GTACAAACCA GCGTGATTTA AATATAGGAA TACTTAGTCA ACAGCCATAT 2280

ATTTTCAGTG CTTCTATAAA AGAGAATATT ACGATGTTTA AAGATATAGA AAATAATACT 2340

ATTGAAGAAG TGCTAGACGA AGTAGGTTTA TTAGACAAAG TGCAATCTTT CACAAAAGGC 2400

ATTAACACAA TAATAGGTGA AGGAGGCGAA ATGTTATCTG GTGGACAGAT GAGACGCATA 2460

GAACTTTGCC GTCTTTTAGT TATGAAGCCA GATCTCGTTA TATTTGATGA GCCTGCAACT 2520

GGTTTAGATA TTCAAACAGA ACACATGATT CAGAACGTTC TGTTTCAACA TTTTAAAGAT 2580

ACAACGATGA TTGTCATTGC ACATAGAGAT AATACAATTC GCCATTTACA ACGACGCTTG 2640

TATATAGAAA ATGGAAGACT GATTGCTGAT GATCGCAATA TTTCAGTAAA TATAACAGAA 2700

AATGGTGATG ACTTATGAAA ACACGACTAA AATTTCAAGT AGATAAGGAT TTATTGTTAG 2760

CTATAGTTGT TGGTGTTTGT GGAAGTTTAG TTGCGCTCGC CATGTTTTTC TTAAGTGGTT 2820

ATATGGTGAC ACAAAGTGCA CTTGGTGCGC CACTATACGC TCTGATGATT TTAGTCGTTA 2880

ATAAAGCTAC ATTTACAATG CTACGTGATA TTCGGGTACA GTTTTTCGGT AAATTAGTAA 3000

ATGTCATTCC TAATGTTTAC CGTAAACTGA GTTCTAGTGA TTTAATTTCA CGTATGATTA 3060 

GTCGTGTTGA GGCATTACAA AATATACATT TACGTGTTTA TTATCCACCA GTCGTCATCG 3120

GTTTGACAGC GCTAGTTACA GTCATAGTTT TGGCGTTCAT TTCAATCGGC CATGCG 3176 



(2) INFORMATION FOR SEQ ID NO: 416:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2109 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 416:

TCTTTATTTC AATTTCCAAT TGAATTTTTT TATTATTTAC GCATAGCTCT TAAAATTAAC 60

GTTACGATTG CAATTAAGAT AATTGAACCA ATTAATGCTG GCAAGATGTA AATACtTCCT 120 

AATTCAGGAC CCCATTGTCC GAATAGTTTG CCACCTACCC ATGATCCAAT AATACCTGCG 180 

ATAATATTGC CTAAAATACC ACCTGGGATA TCTTTACCCA TAATAGCACC AGCAGCCCAT 240

CCAATTAAGC CACCGACAAT TAACATTCCA ATAAATCCAA ACATAATTTT CAGTCTCCTT 300 

TTTCTATTTA TTTTGCGTTA TTCTAAGTAG TACCCCTTAT TTACAATTCT AAAACAAATT 360 

CAAATTATTT TTATCCAAAT ATTTTTAAAA GTAGTAATTG AATATCAATT TTATTCAATG 420 

TAGCTATCGT TATTTAAAGT CTCTGTACCG ATAATATCAT ATACATTTAC ATTATTTTTT 480 

CTGCCGAATT CATAGCTTGA TTATTTTATG TTATAGGACT AgAATATACA CATATTATTA 540

GAGCATCTTT GAATTTTAAA TCAAGAAGCG AGGTTAATGA ACAATGAATA TGCATATTTT 600 

ATATaACTTA CGAACTAAAC ATAATTTAGA AATTGACGAA TTAGCACAGC AATTAAATGA 660 

GAAATATGGT ACTAAATATG AAGCACATCA AATTTGGGAA TGGGAGAATC ATCACCATGA 720 

ACCTAAATTT AAAGATGCCA TGCATTTAGC TGACTtCTTT GATGCACCAT ATGAAATGTT 780 

TTTAGAAAGT AAGGTTAAAG AATATCAGAA ACATTTAGAA GAAGTCGATA TTCGCATGGA 840 

TAAATAGATG CAAATAAACC CTCACAACAC GTTTGGCATA TATCCTTTCA AATCTATACT 900

GGATATATTA CATTACGTTG TGAGGGTATT TTATTAATTA ATATGAATTA AGACATTTTA 960 

CAAGCGTTAA TGCAACGAAT CTTTTTAGTG ATCTTGCTCA CTCTTTAATA CTTTACCGTT 1020 

CTTAGCATCA ACAGTAACTT CTTGTTTTTT ATTACCTTTT TTCAAATCGA TATTGTAAAC 1080

AAGTTTGCCA TCATCTTTTT CAAGTGACCA TTCTTTAATA TCACCATCAA ATTCTTTTTG 1140

ATTCATTGTA TCTTCTTTTT CAGTCTTTTT GTTAATCACT TTTTTATTTT TATCAGCAAC 1260

AAGTACTTCT GACTCTTCAC CAGATTTTTG TTGCGTCACT TTATAAGCCC ATTCACCATT 1320

AGAATTTTCA AATGAAATTC CTTTCAACTT TTGGCCTTTG TAAGTTTCTT CAGCTTTTTT 1380

CACAGCATCT TCTGGGCTTG TTTTAACATC TTTTAAAGCA ATAACATCTT TTGTTGTATT 1440

AGTGTCTTGA TTAGTATTTG ACTCTGTTGA TTTTGTTTCA TCTTTTGGAG TATCATTGCC 1500

ACATGCAGTA AGCACCACCG CTGACATTGA TAACACTGCT AATGATTTTA ATTTCATAAT 1560

ATCACTCTCT CTTCCTATTT TTGAAACTCA TAACAAAAGC TTATATGCTA TATAGATTGT 1620

ATTACCCCTT GTTTTTAATT TTATTCATAA TTATTACAAA TATTTTTAAA TTAATCGTCA 1680

TGCGTTACTT TCGTTCGTAC TCCTTTTATA AATGAACCAT GTAATATAAG CATGCTATTA 1740

TCGACTCAGT TTGTCTAATG CTTTCTTTGG TACTTCTTCC TTTTCAACTT CTTCAAAAGT 1800

TTCTACATGA TGACCTTTAT GTGTAATTTT CAAGTATCTA TGCGGTTTAA CATCAAATGT 1860

AGCAGTATAC ATTAATTGCG TCTCTTTCCC TTCTTTATTA AAAACACTTT GACTATAACT 1920

TCGGAGTTGA TCATCCATTC TAGTAGACAC CTCTGTCGTT TTAACATAAG AATCATCCTT 1980

CTTTACTAAT GGGTTAAATT GATCTGTTAT ACCATGGGAG TCTATTGTTT TTAAAATGAA 2040

TAAAGCAgCA TAAGCGCCAA TGATAGTAAG TACAAGATAt GTkATTGTTT TTAAAATCGT 2100

TTCAAAAGA 2109



(2) INFORMATION FOR SEQ ID NO: 417:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 813 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 417:

GTTACATAAA TGAAAACAAA AAAGATAATT TTACTGCTTA TGCTACACCA GAACATAATT 60 

ATCAATTTGG TGGTGCTATG ATAGAAAGTG AAAAATTAAG CGAGTTACTA AAGCCAGCCA 120 

ATCAGTTAAA ATCACCAGAT GATATAAAAA AAGAACTAAA TAAAAAGAAG AGTCACTAAA 180

GTTAGGAGTT ACTTTAATGT CCAAAAAACA TGTTTTTATA ATTATTGGTG TCATATTGTG 240

TATATGTACA GTTTCTACGG TCATGCATTT TAAAATGAAA TATGATGAAA AAGAAAAACA 300 

AAAAGCGATT TACTACAAAG AACAACAAGA ACGTATTACA CTCTATCTTA AGCATAATAC 360

TAAAGAAACG AACACGATTA AATCTGTACA TTTCACAAAC TTGGAAACAA GTCCTATGGG 420

ATCGCCTGAA CATAATTATC AATTTGGTGG C6CTATGATA AAAAGTGAAG GAGTAGATAA 540

ATTATTAAAA CCAGCACATG AAAGAAAATC ACCAGAAAAA ATCAAAGAAG AATTAGATAA 600

AAAAGAAGGC CACTAGGGTC TTCTTTATTT TTGATTTAAT CTTCCAATAA TCTATGTCAT 660

TGCTATCGAA GGTGTTTCGC AATTAATATA AATCACTTCA TCATCACCAA TACTTCCCCA 720

GTTTTGTACA GTACATTAAC ACAAACGAAC CACGTTAATT TAAATGGAvT AaTAGTTTGG 780

CCATTATAAG AACAATATAT ATCGAnTAAC AAT 813
(2) INFORMATION FOR SEQ ID NO: 418: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 640 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 418:

AAAAGCAATC GTTGGTGGTG CTAAATTCAT CGGCAACTCA TATGTAAAAG CTGGTCAAAA 60

TACACTTTAC AAAATGAGAT GGAATCCTGC ACATCCAGGA ACACACCAAT ATGCTACAGA 120

TGTAGATTGG GCTAACATCA ATGCTAAAAT CATCAAAGGC TACTATGATA AAATTGGCGA 180

AGTCGGCAAA TACTTCGACA TCCCACAATA TAAATAAGCA ACATGAACAT AGGATCAAAA 240

GTCATCCCCC ACTATCAATC ATGGGGGATG ACCTTTGATC CCTTTTTTAT ACATACACAA 300

GCAAAAATAG CGGTGATTGT TTACCATCAA TTTTAACAAT CACCGCTACT TTTGCTTGTA 360

ATTCATGATT CAATTTTTGT TGTGTGCACA ACGACACTAA ATTATGTGTT TGCTATTGTC 420

GTGTTACAAC GATATGCGTC GTTGATTTAA CTTATCAAGT AATTGATTTA AATTGTCTAA 480

TTCGACTTCC GATAAACATT GACATCTTGC TTCAATCAAT TCGCAACGTG CATTATTTAT 540

TTGTGAAATT AATGTACGTG CTTGATCAGT CAAAATTAAT TCTTTACATC TTAAATCTTC 600

GCTAGATTGT TGACTTGTGA TGTACCCnnT CAAAACTAAT 640

(2) INFORMATION FOR SEQ ID NO: 419: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1159 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 419:

TTCTTGGATA CGTAATAAAT TACCTTTTGT CATAGGATTA AACATAGATC CAGGTAAAAC 120

ATAAACATTT CTATTTTGCT CTAATGCAAA ATCTATCGTG ATGTGACTGC CACTTTGTTC 180 

CTTAGCCTCA GTAATTAAAA CCCCTTTTGA CAAACCGCTG ATAATTCTAT TGCGCTCAGG 240

AAATCTATAT TTAGCGATTG GTGTATGTGG TGGATATTCA GATATAACTA AACCTTTTTC 300 

TTCTATTTTA TTTCTTAATG CTAATGTACT TTTGGGATAA TGTGTTTGAT GGCCAAAGGC 360 

TAAAACTGCA ATTGTAGGGA GATTGTATTT TAAAGCTATT TGATGTGCCA TTGCATCAGC 420 

TCCTTGAGCA AGGCCGGAAA CAATTGTTAA ATATTTGCTT TTATCATTTG ATAATAAAAA 480 

TTCTAAAGAC TGTTGGGTAT AACTTGTAGA ATCTCTTGCA CCTACTACTG CCAAATGATG 540

CATATTATTT ATTAATTTGA TGTTCCCTTT ATAGAAAAGA AGTAATGGAA ATTGATATAT 600 

TTCTTTTAAT AGCACTGGGT ATTCAGAATC CATATATGTA ATGTAACTTA CTTTTAATTT 660

CTGCAGTTCT TTAATAATAT CGTTATGATC AAGTTTTATA AAAGCATAGT ATTTACGTAA 720 

TAAATGAACA TTTTCTTCCC TATTCACCCA TTCACATAAA TAACTATCTT TTTTTCTTCC 780 

CTCCTCTTTA ATTACATTAG GATATGCCAT TAAAAATTGA TGAATTTGTT TAGTCGAAAA 840

GTGTGCCCAG TATAACTTAA GCAAAAATAG TTTAATCAAT AAATCAACTC CTTTTTGTAA 900

TCATACAAAA TCATATTCTA TTTTTGTTTT ACATTTCTAA TACAAAAACA TTGTCGATGT 960 

AATGTTATTT TAAGGAGTAA AAATACTGAC TTAAAAAGTG AAAAGTATGT TGGAAAGAAT 1020 

30 TTAAAATTTC AATTTCCAAC ATACTTTATA ATTAAACCTT ATAAATAAGT TTTGCAAATT 1080 

TATATAGAAT TGGTCTTACT GGTTTGATGA AATCACCAAT TAATTCTTCA ACATGTGCAT 1140 

TAAAACCCTT TTTAAATTG 1159 
(2) INFORMATION FOR SEQ ID NO: 420: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1879 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 420:

TCAATCAGCG ACTACAAGAA GTGCGGGTCT TCAAACAATT GATGTGACAA CACTAAGTGA 60 

CCCCACTAAT ATTATCATGG GTATTTTAAT GTTTATAGGA TCTTCGCCAA GTTCGGTTGG 120

TGGCGGTATT CGTACAACAA CTTTCGCTAT TTTAATTTTG TTTTTAATTA ACTTTAGTAA 180

TAATGCCGAT AAAACATCCA TTAAAGTTTA CAATAGAGAA GTACACATTA TGGATATTCA 240



ATCAGCTACT GAAAATGGTA AGCTTACATT TTTACAAGTA TTTTTTGAAG TCATGTCTGC 360

ATTTGGAACT TGTGGACTAT CGCTTGGTGT CACAAGTGAT ATTAGTGATA TTTCTAAGGT 420

CGTACTAATG ATACTCATGT TTATAGGACG TGTTGGCTTA ATATCATTTA TCATTATGAT 480

AGCAGGACGT CGAGAACCAG ATAAATTCCA TTATCCAAAA GAACGTATTC AAATAGGATA 540

ATATAATAGC AATCTAAGTT TAGTTAATGT AGATTTTAAC TGGAACTTAG ATTGCTTTTT 600

TAGTTTGTAT TTTTAACTTA TTTTATAAGA CGATTGGTTT CGAAAATGGT AAACTAGTAA 660

CAATGAGAGG TGTAACATGA TGGAAAAAAA TGAAAACATT AATGTAGAGA TTTTAACTAC 720

GTCAGATATG CATAGTCATT TCTTAAATGG TGATTATGGT TCAAATATTT ATAGAGCTGG 780

TACTTATGTT AACCAAGTAA GAGCACAAAA TCATCGCGTC ATTTTATTAG ATAGTGGCGG 840

AAGTTTAGCT GGCTCGTTAG CGGCCTATTA TTATGCTATT GTTGCACCTT ATAAACGACA 900

TCCAATGATA AAGTTAATGA ACAGAATGCA TTATGATGCT AGCGGTGTGA GTCCAAGTGA 960

ATTCAAGTTT GGTTTATCAT TTTTAACTCG TTCAATTGCT TTGGCACGTT TTCCATGGTT 1020

ATCAGCAAAT ATTGAATACA ATGTTACTAA GGAGCCTTAT TTTTCAACTC CATATTGTAT 1080

TAAACATTTT GGTGACTTAA AAATTGCTAT CGTAGGCGTC ACAGCAGATG GTTTAATGGA 1140

AAATGAGTAT TCTGAAATGG AGCAAGATGT ATCTATTGAA AAGACATTAG TGGCATCAAA 1200

ACGTTGGATT AGATATATCC ATGAAGTTGA AGAGCCAGAT TTTTTGATTG TAATTTATGA 1260

TGGTGGATTG AATAAAATTA GTAATAGTAC GAAAAATAAA AAGGCAAGTT CGAATGAAGC 1320

TGAAAAATTA ATGGAAGAAC TCGGTGTTAT AGATTTAATG ATTACAGCTC ATCAGCATCA 1380

AACAATAGTA GGTCAAGATC ATGAAACGTA TTATGTTCAG GCTGGTCAAG ATGCCAAAGA 1440

GCTTGTACAT CTTTCGATTA ATTTTAAAAA GAGAACAACA ACTTATGATG TTGAAAGCAT 1500

TGATTCTAAA GTGATTGACT TAAATGAGTA TGAAGAGGAT CAAGAATTAT TAGATTTAAC 1560

ATTCTATGAT AGAAAAGCAG TGGCTTATTG GTCACAGGAA ATCATAAGTG ATAAAGGTTT 1620

GATGTTATCA GTAAATGGGT TACAAGATTT AGTCTGTCAA ACACATCCAT TTTCGCAATT 1680

ATTACATGAT GCAATTCACC TTGCATTTGA TAATGATATA ACATGTGTCC ACGTGCCTAT 1740

GaACGGAGAG AAGGGGTTGA GTGGACAGAT TCGAAATGAA GrTTTGTATC aTGCATACCC 1800

ATaTCCAGAT aAGCCAtGGG tATGACAATT aGTGGTCAAA ATaTCAAGGT ATnTTGGGGT 1860

ATAGTTATTC ACCATTAGG 1879


(2) INFORMATION FOR SEQ ID NO: 421: 





(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2710 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 421: 
AATTCCCCCT TTTTCCCCAT CCATTTTnCC ATCCACTGGT CCTTACGGGA CCATATTATT 60

TnAAATTGGA IlAAAATTATt TTAAAGaTTA TwACtACTCT TtAATCATTT TrGTGAATTA 120

AAAAAAGTAG TGCAAAAAGC AAAATATACT TTATACACTA CAAATCATTT ATTTATAATA 180

AAGTTTCACC AAAAAATGTT CCAACTAATG AAACCGCTTG TTCAGCAGTA TGATTATTAC 240

TGTCAATCAA TGGATTTACT TCAACTAAAT CCATTGAGGA AATTAAATGT GATTGATGCA 300

GTAATTCCAA TGCAAAATGG CTTTCTCTAT AACTAAGACC ACCCAAAACT CTAGTACCAG 360

TGCCTGGCGT TTCAAGCGGA TCTAAAGCAT CAACATCTAA AGATAAATGA ACGCCATCAA 420

CATTGCGTGA CTTCAAATAT TCTATTGTAT TTTCAATTAC TTCCTTTATC CCCAATTTAT 480

CAATATCTGA CATAGTAAAT GTTTTAATAT TATGATCTTT GATAAATTGT CTTTCACCTT 540

TATCTAAATC TCTCATACCA ATTAGTACGA TGnTTTCTGG CTTGATTACA TTACnATTTA 600

ATTCTAAAAG TTCTTTGGGA CCTTCGCCTG TCAAAATCCT TAGAGGCATA CCATGAATAT 660

TTCCACTTGG TGACTCTTCA GGTATATTTA AATCACCATG TGCATCATAC CAAATAACAC 720

CTAAATTATT ATAATGTTTA CTTATTGCTG ATACTGAACC TACCGCAATA GAATGATCTC 780

CACCAAGAAC TAGAGGAAAT CTGTTATTTT CAATTGATGC TGAAACCTCT TTATTTAATT 840

TTTGATTAAC ATCTATAATT TCATCATAAT TTCTTAATCC TTTTTGTTCA CTATGAAATT 900

TTTCAATGTT CACAGCAGGt ACCTTAATAT CCCCCTTGTC ATATACATCA AGGTCTAATT 960

GCTTTAATCT TGAAATTAAT CCAGCATATC TAATTGCTGT TGGTCCTAAA TCAACACCTA 1020

ATTTTCTTTG TCCAAATGTT GATGGTGCAC CTATAATATC AATTGCTTTT GTCTTTGTCA 1080

TAAGCGTCCC CCTTTGCTCT ATGTAATTAA AGAATAATGT ATGCGCTTAC CATTATCAAG 1140

CAATAGCTAC ACATATAATC TGTTTATCTT ATTACTTCAT AAAAAAAGGT TCTTCATCTT 1200

TTATGGTGGG AAGGTAAAAC TTCCTGCTTT TTTTAATACA CAAAAAGCGC AATTGCCTCT 1260

ATAATTTAAA GTGACCAAAC CCAAACTAAA GGAGACAAGT GCGCCTATGT GTAATGATAC 1320

CTTAGAATTA CTAAGAATAA AAGATGAAAA TATAAAATAT ATAAACCAAG AAATTGACGT 1380

CATTATCAAA GGAAAAAAAG CAACAGTGGT TAATGCTGTA CTAACGTATA AGCCTTCGGC 1440

CTGTTATTGT TGTGGAGTTA AAAATGAAGG ACAAATTCAT AAACATGGTA AGCGTGTTTC 1500

TCGTATTACT TTACTTAAAA CTCAAGGGTA TAACACATAC CTCAACTTAG CTAAACAACG 1560

GTGTTTTATC TCAAGATGTG TTACTCAAAA AGTTATAGAA GAAGCTACTA AAGTTAAAAC 1680

AGAGATTGAT ACTGCAGAAG ATAACTGTAT CTCTCCATCT ACTGTAAGTC GTATTAGAAC 1740

TAAAGCGGCT AATTCATTAC GAATTAAACC CTTTAATTGT TTGCCAGAAC ACATCGCTAT 1800

GGATGAATTT AAAAGCGTTA AAAATGTAAC TGGATCAATG AGTTTCATTT TTATAGATAA 1860

TGATACTCAT GATGTTATAG ATATTTTAGA AAATAGAACT ACAAGATTCT TGCGTGCCTA 1920

TTTCGAGCGA TTCGATTTAA AAAATCGACA ACAAGTTAAG ACGGTTACTA TTGACATGTA 1980

TGAACCCTAT GTCCGATTAT TTCGCGACCT ATTTCCTAAT GCAGCTATTA TTTTTGACAG 2040

ATTCCATATC GTTCAACATT TAAATAGAGA ACTTAATAAG TATCGTGTAC AAGTTATGAA 2100

TGAATACCGT AATAAAAAAG GACCTGATTA TACAATTTTT AAGAATAACT GGAAAGTCCT 2160

ATTGATGGAT ACTAGTAAAA CCATATTTAG TAAATACAGA TGGAATAAAT CTTTTAAGGC 2220

TTATAAACGC TCATCTGACA TTGTAGAATT CATGCTTTCA AAAGACGATA TACTACGACA 2280

CTCCTACGAA CTTGTCCAAG GATTACGAAA AGACCTAAGG TTATGTAATT GGCCTAAATT 2340

TATTAATCGT TTAAATTCAG TTAGTAAAAA GTCTGTGAGT AAGGGTGTAT GGAAAGTGGT 2400

TAAATATTAT AGAAAACATC AAAGGATGTT AAGAAATACA ATTTATTACC CAGCATTTAA 2460

TAATGGTGCT ATAGAAGGAA TTAATAATAA GATAAAATTA ATCAAGTGAA TTTCTTTTGG 2520

TTACAGAAAT TTCAACAACT TTAAAGCACG TATAATGATG aTTTTCAGCT TGTACAAAGG 2580

AGaAAAAAAG AAGACAACCA AGCCCAATAA TGGACTGGCC GCCTAATAwA nGGGskCTAA 2640

AAGTTgTATT TTTAAAAATA GTTCCTTTAA ATTATATACC CnCCACATTT GGGGGAGGAC 2700

CTAAAAAAGC 2710



(2) INFORMATION FOR SEQ ID NO: 422:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1027 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 422:

CATTTTAATT GtTaAAATTC CAAAAAAtGT ArGTGGATTw AAAGrAAACC CtGTGTTTTT 60 
AAAAGGtACC ATTaAAATAg TTCCGATTGT ACCATCCCAC GTGAAAtTTT TAgTAtTGCC 120 
GGTGaGAGAA AATGCCAATG CAATCATCGC AGTTCATAAT CATCCATCCG GTGATGTAAC 180

GCCCTCACAA GAAGATATCA TAACAACAAT GAGGTTGAAG GAGTGTGGTT TGATTTTAGG 240
GGGTTACTTT GATGAAAATG ATTGAAGTTG ATAGAATTAA TGACGTATCT TGTGTATAAT 360

ACCTACGAAG TACTTTCATT GGAGGAAAAA TAGTGACTCT ATTTATTATT ATCGGGGTTC 420

TCGTGCCAAT GGTTTATACC ATGCAGTTAA ATATTAAAAA TGAACCTGTa ACAAAGCGCA 480

ATCTTTTAAT AACATTAGCT TTATCTACGT TAGGTATTTT AGTAACCGCG TTAGCAGGTG 540

TAATCGTTAC GAAACAAGCT TTTCCTTTAT TAAGTGTAGC AATTGGCTCA ATTTTTACTG 600

GAATCGTTTG GGGCCTTTTA CTAAGTGGTA GcTACGCGCT GATACGATTT TTATCTAACG 660

CATTTGGGCG TAAGTAATAG TAATCTGTTC ATCAAGTAGT ATCCGTGCTT GAAAACAAAC 720

TAAAACTCCT AATGTGGAAC TAGTTAAAAA ATTCTAAACT AGTACATTAG GAGTTTTGTT 780

ATGCAGAATA AATTTAATTG TTAAATTGAA AGTGCGGTAA AAATCCACTA TTTATTTGAA 840

AAAAATCGAG CAACCAAATT AAACCTTGAT ATCCTAAGTA AATACATAAT AAAACAAGTC 900

CGACATAAAT TAAAAATCGC AAAATAGAAA GTCCAACTCT AAAAAGGAGG ATGACTAATA 960

GTGCTATTAA AATAATTGTt AGTATACTCA ATGCTACAAA CCTCCTArTA CGCTTTTAAA 1020

TCCATAA 1027



(2) INFORMATION FOR SEQ ID NO: 423: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 569 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 423:

CGGCACATTT TTAATTTATA CATATTTTAA AACTAAGTAA CAGTTTGAAG AAATCGTAGT 60

TCAATAATGT TAATTGTGAA AATGTATATA AACATAAAAA AATCATGTAT AATATATGTT 120

GTTAATTAAA CAGTTCGAAA GCGAGATGAC ATTATGGGAC GTAAATGGAA TAACATTAAA 180

GAAAAAAAGG CCCAAAAAGA TAAAAACACA AGTAGAATAT ATGCGAAATT TGGTAAGGAG 240

ATTTATGTTG CAGCAAAATC TGGTGAACCC AATCCAGAAT CTAACCAAGC TTTAAGGTTG 300

GTGCTTGAAC GCGCTAAGAC ATATTCAGTG CCGAATCATA TTATTGAAAA AGCAATAGAT 360

AAAGCTAAGG GTGCTGGAGA CGAAAACTTT GATCACCTAA GATATGAAGG ATTTGGCCCA 420

AGCGGATCAA TGCTAATTGT TGATGCGTTA ACAAATAATG TAAATCGTAC TGCCTCTGAT 480

GTGCGAGCTG CTTTTGGTAA AAACGGCGGT AATATGGGTG TATCTGGATC AGTTGCTTAT 540

ATGTTGTCAT GTGGCCACAT TTGGTATTG 569
(2) INFORMATION FOR SEQ ID NO: 424:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3264 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 424: 



TGGACCTATA AAATGGATGC ATCCAAATAT GGATGnGGGG TAnGGCGGGa AAATAGGaGG 60

TTATATAGTA AGTCATTTAA GTCGAATAGC CATCCTTTTT AAAATGAAAA AAATAGAAAG 120

CTCAATAGTT TGTTAAAGCC TTTAAATAGC GTCGTTACGC AATTTTAGAA TGCTAAAAAT 180

TGTCACAAAT TTCAAATTAC GTGCTAAAGC TTGAGATATC AATATTTATT GGCGATAGAG 240

TGTAATTTGA CTCCGCTTAC ATCTCAGTTT TATGTTTGTA ATTTGGTAGC ATAATATTAT 300

AATAAAATAA AATTGTTAAT CTTTAATTTC AGTATAGATA TTTTTACGTG TAGTCACGTG 360

TAAAATAAAT TCAATTAGGT TAGGAGACAT AATTATGAAA ACATTTGGTA AAAAGGTTGT 420

ATTAATCGGA GATGGATCTG TAGGATCAAG CTATGCCTTT GCAATGGTTA CGCAAGGTGT 480

TGCTGATGAA TTTGTAATTA TTGACATTGC AAAAGACAAt GTAAAAGCAG ATGTTCAAGA 540

TTTAAACCAT GGTACAGTCC ACAGTCCTTC ACCAGTTGAT GTGAAAGCAG GTGAATACGA 600

aGaCTGTAAA GATGCAGATT TAGTTGTTAT TACAGCTGGT GCACCTCAAA AGCCAGGTGA 660

AACACGTTTA CAATTAGTTG AAAAAAATAC TAAGATTATG AAGAGCATCG TTAAGAGTGT 720

TATGGATAGT GGCTTTGATG GATATTTCTT AATCGCGGCA AACCCTGTAG ACATTTTAAC 780

AAGATTTGTA AAAGAATATA CTGGATTACC AGCAGAGCGT GTTATCGGTT CAGGTACTGT 840

ATTGGACAGT GCACGTTTAC AATATTTAAT TAGCCAAGAA CTTGGTGTTG CACCTTCAAG 900

TGTTGACGCT AGTATTATTG GCGAGCATGG TGATACTGAA CTTGCAGTTT GGTCACAAGC 960

AAATGTAGCA GGTATTTCAG TATATGACAC ATTAAAAGAA CAAACTGGTA GCGAAgcTAA 1020

AGCGGAAGAA ATTTATGTGA ATACACGTGA CGCTGCTTAT GAAATTATCC AAGCTAAAGG 1080

GTCAACATAC TATGGTATTG CATTAGCATT GATGCGCATT TCAAAAGCCA TTTTAAATAA 1140

TGAAAATAAT GTCTTAAATG TTTCTATACA ATTAGATGGT CAATATGGTG GTCACAAAGG 1200

CGTTTACCTA GGTGTACCAA CATTAGTTAA CCAACATGGC GCAGTTAAAA TTTATGAAAT 1260

GCCATTAAGT GCCGAAGAAC AAGCGTTGTT CGATAAATCT GTTAAAACAT TAGAAGATAC 1320

ATTTGATTCA ATTAAATATT TATTAGAAGA CTAAGCCTAT TTTAAGTATT AATTAGAAAT 1380

ATATTAATGG TAAGAGGATC TATTAGTGTT GCAGTAACAC GTGGCACTGA TAGGTCCATT 1440




TATGACTCGT GAAAATTTAG TGACATGAAT AAAAATGTTG AGGCGTCATT GAGTATAAAG 1560 

GAAAGTAGTT CTGCATTAAT CACGAaGTAG AGCATGACraA CGAGGAATAA CTATAGGGaG 1620 

ATGGTTTTGG AATGACGAtG TCTTGTATCA ACATGGTACA TTAGGTACGT TAATGGCTGG 1680 

CTTACTAGAA GGCACAGCTA CAATTAATGA ATTGTTAGAA CATGGGAATT TAGGGATTGC 1740 

AACGTTAACA GGGTCTGATG GCGAAtAATA TTTTTAGACG GAAAGGCATA TCATGCTAAC 1800 

GAGCATAAAG AATTTATAGA ATTAAAAGGC GATGAGAAAG TACCGTATGC ATCGATTACT 1860 

AATTTTAAAG CGAGTAAGAC ATTTCCATTG CAACAATTAT CACAAGATGA TGTATTTGCA 1920 

CAAATTAAAA ATGAAATGTT AAGTGAGAAT TTATTTTCGG CTGTTAAAAT TTATGGCACA 1980 

TTTAAACATA TGCATGTACG AATGATGCCT GCTCAGCAAC CGCCATATAC ACGTTTGATT 2040 

GATTCAGCAC GCAGACAACC TGAGGAAAAA AGACAAGATA TTCGTGGTGC CATTGTTGGA 2100 

TTTTTTACAC CAGAATTATT TCATGGCGTA GGGTCTGCTG GTTTTCATAT ACATTTTGCG 2160 

GATGATGAAA GAGCTTATGG TGGACATGTT CTTGACTTTG AAGTGGATGA CGTTGTCGTT 2220 

GAGATACAAA ACTTTGAAAC ATTCCAACAA CATTTCCCGG TAAATAACGA GACGTTTGTT 2280 

AAAGCGAAAA TAGACTATAA AGATGTGGCA GAAGAAATTA GAGAAGCTGA ATAGTCTAAT 2340

ATAATTAAAA GACCTTAGCG ATATTAGGAA CAGGTGGTTC TAAATGCATC GTTAAGGTCT 2400 

TTTTATATTA TATATGTGCT TACATATTTT GTTGATACGC GCGTAAAAAG CTATAAATAG 2460 

TATCAAGATA TGGGGTGTCA AGAATGTGTT CACGTGCACG TCGATAAATA AAGCCTTGAA 2520

TCGCTTCGAC TTCAAGAGGT TGCTGATGCG CAATGTCGTA ATACATGCTC GTTCCCATTT 2580 

CGTCGGGATA ACCTTGATAT ATAGTCATAA TAGTATCGAC AGTTTGTTCT GAAAAGTTTA 2640 

ACCCCTCTGC CTGTGCAACA CGACAACCAT CTAGTAATAG CTGTCTACAC AGTGTACGTA 2700

TTTCAGGATT ATGCATGATT GCAACGGTCT GTCTACCGAG TGCTGTGATA GAATTAATGC 2760 

CCAAGTTAAC TAACAATTTA TACCAAATAG CTTGTTGAAT ATTTGCTTCT AATACAATGT 2820 

CGATTTGACT GTCTTGGACC AAATCTCTAA ATTGTCGAGT TAATGCATTA TCTTGTATAC 2880 

GTAATTGATA ATCTCTGAAG TGCGTAACAA CATCGCCTTT CTTTTGACCA CTTATATAGA 2940

CAACTGCTTG GCATACGTTT TTAAATGAAA TATGTTCGAG TTGACCATAA CCATTTTGGG 3000 

CTAAAATGAT GAGCGTGTCC TCATGTGCAA GATGAGTTAA ATGAGGAATA ACTGCATCAA 3060 

GTTGATGTGT TTTGACTGCA ATAATAATAA CATCAAATGT GTTTGTGACA TCTTCATAAC 3120 

CTTTCACAAC AATATCTTGT GCAGGTGCAT GTGGTACAGT ATAATATGTT ATTGTTTTGG 3180 

CGTGTCTCCC GATAAGCGTT GTATGAGGCA ATGATTGTTG TAATTCATAG GCAATAGTTG 3240 



(2) INFORMATION FOR SEQ ID NO: 425:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 796 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 425:



AGAACGAACC TTTAATACGT ACGTGTCGAC CACGAACTGT CATGACAAAT TGATAAAACG 60

AATAGTTTTT CATTAGTTCA TTGTCACATC AATCACTTTT GTkTCACCTT TAATCACAGC 120

ATTTTCATCA TAAATATTAA TTGAAGCTGC TTGATCAGTG TTAGTAATTA TAATTGGTGA 180

AATTACAGAT TTAGCGTTAT TATTAATATA TTCAAGGTTG AATCTTACTA ATGGATCTCr 240

GACGTTAACT TCGTCACCAC TAGACACTAA CACTTCAAAT CCTTCACCGT CTAATTGAAC 300

TGTGTCTAAA CCGATATGAA CTAATAATTC TAATCCGTTA TCTGCTTTTA ACCCAATCGC 360

ATGCTTAGTT GGAAAGACAT TGTCAACACG TCCTGCAATT GGAGACACAA CTTCTCCTTC 420

AGTTQGATTA ATACCAAAAC CTTCGCCCAT CATTTTTTGT GCGAATACAG GATCTGGAAT 480

ATCTTCAATT TTCACGAATT CTCCAGTTAA TGGTGCATAA ATTGCGATAT CTTTCTGAAC 540

TTCTTTGCCT TTTCCGAATA ATTTTTTAAA CATACTTTCC ACTCCTACTT ATCAAAATGT 600

GATATTAAAT CGCCATAACC CAATTCTTCT AACTTTTCAT ATGGAATAAA TTGAATTGCA 660

GCGGAATTGA TACAGTATCT TAAGCCGCCA CTTTCTTTAG GTCCATCATT AAAGACATGT 720

CCTAAATGAC TATTTGATTC TTCTGAACGC ACTTCAGTTC TCAACATACC AAATGATTTG 780

TCGACTAATT CTATAA 796



(2) INFORMATION FOR SEQ ID NO: 426:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3393 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 426:

ATCCAGCAAA TCTTTCTTAT CACGTTCTGT AATTTTCTTA GTATCCATCT TGATTAGCTT 60 

TGATAACTTT TCAGCCGTAT CCAACATTTC CGATTGTGTT GTTTTTCGAC CCCTAGTATA 120

TGTAATAGCC ATTTTAGAAG CATTATCAAC TAAAACTTTC CCATTTCTGT CTAAAATACG 180

ATGTGAGCCT TGTGCGATTT GTAAATAACC TAAACGTAGT ACTAGTACTG CAAAAATAAA 300

TACAATCACA CCAAATATAA AGTTAATTCT CTTGTTAATT GTATTTTGAA CGATTTCATC 360

ATTTGATTTT TCTTTTAGTC TTTTTAACAA AACTACCTAC CTCTATTCAA AGTCTTTCAC 420

TTTAAATCAT ATATGAATTT AGAAATTATT TCTATCTTTT TGACAAAAAA ATAACGGTCT 480

CATTLAAGAG ACCGAACAAG TAATCATACT TTATTTTGTT GCATTATATA ATTCGTCAAC 540

TTTTTCCCAG TTAACTACAT TCCAAAATGC GCCAATGTAG TCAGGGCGTT TGTTTTGATA 600

TTTTAGGTAA TAAGCGTGTT CCCATACGTC TAAACCTAAA ATAGGTGTTT TACCCTCAGT 660

TAATGGATTA TCTTGGTTTG GTGTAGTCAC AATTTCTAAC TGGCCATTGT TTACGACTAA 720

CCAAGCCCAA CCTGAACCAA AGCGTGCAGC TGCTTTGTCA GCAAATTCTT TTTTAAATTC 780

TTCTAAAGAA CCCCATTGTT CTTTAATTTT TTCTACTACA GTACCTTTTT CTTCTGAGTT 840

TGGTGAAAGT AACTCCCAGA ATAATGAATG GTTTAAATGT CCACCGCCAT TATTACGTAC 900

AGCAGTTTGG ATGTTAGCTG GTACACTGTC TAAATTAGCA ACAATTTCTT CAATAGATTT 960

AGATTCTAAA TCTGTACCTT CTACTGCAGC ATTTAATTTC GTAACATACG TGTTATGATG 1020

TCTGTCATGG TGAATTTCCA TAGTTTCTTT GTCAAAATGT GGTTCTAATG CATCAAATGC 1080

GTATGGTAAT TTTGGTAATT CAAAAGCCAT AAATAATCAT CCTCCTAAAA TGTCTGTAAG 1140

TAAATAATAA CAAGCAGTGA CTGGTTCAAC AAAGAATTTG CTTAAATTCT ACTACTTATT 1200

ATTTTCTCTA CTCATTTAAT ATAACTCAAA TCAAAAATAA TTAAACATTT TGTATATAAA 1260

AAGTTAACAG ATTTGCCATA AAATCATACG AACGGAGTAT GAAATGAACC TTTATCTTCT 1320

ATAATTTAAA AAATGAGATT TATGCATACA TCGGACCAAA TGTGCATAAA TCTCATTTCT 1380

TTATATTAAT CTTGGcAAGA CTCACATACA CCATAAACTT CAAGTTTGTG TTTGTGAATA 1440

TTAACACCAG GTAGTGATAA TTTTATCTGA TCTATTGGAC AATAATCTAT TACCTTTGTA 1500

TCTCCACACT TTTCACAGAT AAAATGATGA TGATGATGGT TTGTACAAGC GATTCTAAAC 1560

TTCATTTCAC CATCAAGTTC TGTATTTTCA ATAATTCCTA AATCTTTAAA TAAGTGCAGG 1620

TTTCTATATA TTGTGTCGAA TGAAATTCCA GGATAATTTT CATCCATAAC TTGTTGTATA 1680

TACTTTGCGT TTATATACTT ATCTTCTTCG ACAAAAATAT CTAACATATC TTTACGTTTA 1740

TCTGTATATT TTAAACCGTT CTCTTTTAAA ATTTTAATAG CATCATTTGT ATTCATTGAT 1800

ATTAGCTCCC TTTTTAAACT TCATTCGCAT TTTCTGATAA GCCATTGTAA TCATAAGTAA 1860

AATAACAAGT AGAACTACAA TTACACCACC CGGAGAAATG TCCATATAGA AAGCTAGGAC 1920

TAAGCCTAAT ATTACTGATA ATTCACCTAA AAATACACTT AGTAATATCA ATTGCTTAAA 1980

TATCCCTACA ACACGCATTG AGGCAGAAAT AACCATCGCT ACAATAACAA TAAATAAAAA 2100

TTGAATCCAT TTAGGAATGC CAATGACTTT ACTATATTCC TCATCAAATG ACAATATAAA 2160 

TAATTCTTTA TAAAACAATG TAATAAACAG AACAACTATG ATGGCAATGA CAATAATCGT 2220 

TGTTAAATCA CTTATATTCA CTGCGCTTAT TGAGCCAAAT AGCAATCCAA CAATTTCTTG 2280 

ATTGAACCCA TCAGCTAATG AAATGAAGAT TGCACTCAAG GCGATACCAG CACTCATTAT 2340 

AATTGGAATA GCAATTTCTT GGTAAGCAGT GTATGACGTT CTTAATTTTT CAATTAGAAG 2400

CGCACCTACT ATTGCGAATA AGATTCCAAA CCACATTGGA TTAATAAATA CTAGTGTTGG 2460 

CATAATAGTA AGTAAAAACA TACCGAAAGA TATACCACCT AAAGTTACAT GACTTAGAGC 2520 

ATCAGCTwTA AGTGATAGTC GTCTAACAAc GATAAAAGCA CCGaTTAGAG GCGCAATAAA 2580 

ACCTAtCAAG ATAcCACTAA TTAAAGAGTA CCTCATAAAA TCAAAATTCA ATAATGCATC 2640 

TATCAATTGT GACACGCCTT TCCATTTTAA ATAAACTCAA ATCTTTATTA ATTACAACAT 2700 

TCTCGATTAT GCTGATGATC GACAAAACGT ACAGGATGTC CATAAATTTT TGAAATTTCA 2760 

ACTTCATCAA GTGATTTAAA CTCATCAGTT GTACCATGGA AATGCAAATG CTTATTTAAA 2820 

CATGCTACTT CAGTAGCAGT ATCTGCTACA ACACCGATAT CATGAGTAAC TAAGATAATG 2880

GTGATACCTT CTTGTTTTAA TTGATCTAAA GTATTATAAA ATTCACTTAC ATGTTTTGCA 2940 

TCAATACCAT TCGTTGGTTC ATCAAGTACT AATACTGcAG GTTCTGAAAT CAATGCTCGA 3000 

GCAATCATTa CACGTTGTTG TTGACCACCT GATAATTCTG CTATATTTTT ATGAATTAAA 3060

TCACTTATAT TCAGTCTTTC TAGTACTTTA ATCACTTTTT CATTATCTTT GCTATTAAAT 3120 

GTTTGGAAAA GACGTTTTGT CTTTGTTAAT CCGCTTAAAA CAACTTCTTT AACACTTGCT 3180 

GGGAAACCTG AATTAAAGGC ATTTGCTTTT TGTGATACAT AGCTTAATTT AATTGATGTT 3240

TcTTATTTTT AAAATCAATA CCTTCAACAA AAATCTCACC ACTTTGTAAA GGTAATAACC 3300

CTAGAATCAA CTTCAATAAT GkTGATTTAC CAGCACCATT TGGkCCAACA AwTGCtAAAA 3360

ATTCACCTTT ATTTATTTTG AATGnnnTAT ATT 3393

(2) INFORMATION FOR SEQ ID NO: 427:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1123 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 427:

CCCTGTTTCT TAGCCTTTTC GTATATGCGT TCATAAGAAG AATTCTTTTT GATTTCAGAT 120

TGTAAAGCAC TGTTTTCACT TGATTGTTTA TCTATTTTAT AATCTAAATC TGCAATCTTT 180

CCTCGCGTAT CATACGCATC CATTTTTAAA GATAGCATAT AAATACTTAA CATAGCAATT 240

ACAGTAATCA AAGTTATGTA TAAAACTTTT TCAAATTTAG TTAATTGTAC AACCACTTTT 300

CTCGAAACAG TCTTCTTTTC GGGCTTAGTT TGTGGTTGTT GCTTCGGTAT ACTATTATAA 360

ACTTGTTCGT CATATGGTTG GTACACTTTT TCTACAGCCA TTATAAATTG CTCCTTATTT 420

AAGTATTTCA GCTACACGTA ATTTCGCGCT TCGTGCTCTG TTATTGTCAT CTAAATCTTC 480

TTCTGTAGCG GTAATCGGTT TACGATTAAC ACGCTTTAAC TTAGGTGTAT ATGCTTCTGG 540

TATAACTGGT AATCCTCTTG GTACCTCTGG ACCTTTTTCA TATTCTTGGA ACACCTGTTT 600

ACATAAACGA TCTTCTAAAG AATGGAAAGT GATTACCGAA ATCCTGCCAT CTACTTTCAC 660

TAATTCAATC GCTTGTTCTA TTGAATCTTC AAAAGCTGAC AATTCATCGT TTACTGCAAT 720

TCGTAGTGCT TGAAATACTC GTTTTGCAGG ATGTCCGCCT TTTCTTCTTG CTTTTGCAGG 780

AATACCTTCT TTTATAATGT CAACTAATTC TAATGTTGTT GTTATTGGTT GTTGTTCGCG 840

ATGTGCTTCG ATTCTTCGAG CTATCTGTTT TGAAAATTTC TCCTCGCCAT AGCGATAAAA 900

AATCTTCACT AACGCTTCAT ATGACCAATT GTTAACAATT TCATATGCTG TTAGTTCTTG 960

TGTTTGGTCC ATACGCATGT CTAATGTTGC GTCATGGTGA TAACTGAATC CTCGTTCTGG 1020

AATGTCGAGT TGTGGGCTTG AAACACCCAA GTCGTAATAA ATTCCATCTA CTTTTTCAAT 1080

GTTTAAGTCT TTTAATATTT GAGTTAATTC ACGGAAGTTG CTA 1123



(2) INFORMATION FOR SEQ ID NO: 428: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 838 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 428:
TATATCCTCC AnACCATAAA AATGGAATTA TAGTGAAGAC CGCTATGATC CAGGATGACC 60
ACGTATTATA GAATCTTTCg ATAGAGAAAC GAATCaACGT ATCCgACATC aCGAAATGAA 120
ATTAGAAGAT TATAAAGATG AGTTAAGAAG AGAATATCTA AAACAATCTG ACAGAATTGA 180
AGGAGATGAA TAAGCGTGGG ACTTGATTTT AGTGGTTTAC CAGATTTAGC AGTATTGGAA 240
CAAATGAAGG AAAAAGAACA GATTAGTGAG GTTATTGCGC CTGAACATGT TCGTATGCAT 300
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CATTTCAAAA AATTTGAAGA TGATTTTAAA AATGCGGCAC AAGGGGCTTG GGTGAAAAAT 
GCCACAGACG AATTAAAAGA TATTAGTAAT GATTTAGAAA AAATTCAAGA TATTAAAGTA 
TAAAAAGGTA TTAAGAAACA CAATAAGTAT ATAATCCATT TAATAATAAA TGAATTATAT 
AGTTCATAAT TTCGACTATA AGTGGCTATT AGCCATTATT TTTCGGGATC TATGTCAAAT 
CGGACTAATG AATTCAATAA TGGAAGTTAA GCAACCAATC TTTGTTTAAC TTCTTTTTTA 
TTTTTTGGAA AATAAAGTTT TGAACATAAT AAAATTTGAT TATGTTTTAA CGAATTTTGA 
TGTTTCTTGA ACTATATATC ATCTAGTCGT CATTTACAAT TGGTAAATAT GACTTCAAAC 
TGkATGAAGG TGATGGCGAT TAAAAGGCTC ATCCGTAGGT TCTAAAGAAC TAGAnTTT 
(2) INFORMATION FOR SEQ ID NO: 429: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 429:
nGTGTTCAAA TAATAGAATG GCTAATACCC CAACAATAAA TAGGAATAAA TATGTTGATG 
TCGTAATTTT AGTAAATAAT GAAAATCTAA AATTAGTAAC CCTATTTTGA ATATAAGCTT 
TAACTTCTAA TAATACTGGG AAGCCGATTG ATCCTAAAAT TATTAAAAAT ATAACTATCG 
CTTGTACAAA ATAATCATGT GCATAAGGAA TTAATGACTT ACCTGTAATA TCTAATCCAC 
CATTGGTAGT GGCAGAAATA GAAACAAAAA CACCTTGCAT GATTGCATAT TTTAAATCTG 
GATTATCTCG ATAAAAGTAA AATGCTAATA ACATGGCTCC TACGAGTTCG ATTACAAATA 
TTGATTTTAC AATATCAATA ATCAATTTGA CGGTACCACT CATTGTGTTT TTGTTATTAT
CTAACATAAT TAACTGACGT TCTCTAATTC CAATATGTTT ACCTAGTACC ACCCATAACA 
TCGTACCAAT TGCCATGACG CCAATTCCAC CAATATTTAA TATCACGAGG ATAATTAATT 
GTCCAAATGT AGAATAGGTA TCGACAATAC TTATCGGAGA CAATCCAGTA ACACTAATTC 
CGGATACGGC AACAAATAAT GTGTCAATTG GATTTACTTC TACACCTGGT TTATGaACAT 
ACGGTAAGTT TAATAATAAA AATGCAATGA CAATTGCGAC GATATAGTAC AATACAATAC 
CTTGTTGAGG GCTTGATCTT TTTAAAAACT GGCTAAAAAT TGACACGTTG TTCACCTCAA 
CATTATTTTA GTTTAATATT AATTTCTTTC TCTTTACCAT CTCTATAAAT CTTCGCTGTA 
ATTGATTTCA AGTCATCTTT ATGACTAAAT ATAATCTGCC TAAAGCGTAA ATCATCTTCT 
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AAACCATTGT TGTCAACTTG ATCTACAACA ACACCGTTCT TAACTTTTCC TGGCAATTTA 1020 
ACTGCTTGTC TTTCAAAACT ATTTAGACTG ACAATATTCT TCATTTTAAC ACCTACATCG 1080 
GGATAGTCAA TTTTACCTTT tGTTTCTAAA TCTTTTACAA TCTTTtGTAC TTCATTAaCA 1140

GGtATTGCAA 1150

(2) INFORMATION FOR SEQ ID NO: 430: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 797 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 430:

AAGAATATTT AGGATTTTAT GTTTCGCAAC ACCCAGTAGA TAAAAAGTTT GTTGCCAAAC 60
AATATTTAAC GATATTTAAA TTGAGTAACG CGCAGAATTA TAAACCTATA TTAGTACAGT 120
TTGATAAAGT TAAACAAATT CGAACTAAAA ATGGTCAAAA TATGGCATTC GTCACATTAA 180
ATGATGGCAT TGAAACTTTA GATGGTGTGA TTTTCCCTAA TCAGTTTAAA AAGTACGAAG 240
AGTTGTTATC ACATAATGAC TTGTTTATAG TTAGCGGGAA ATTTGACCAT AGAAAGCAAC 300
AACGTCAACT AATTATAAAT GAGATTCAGA CATTAGCCAC TTTTGAAGAA CAAAAATTAG 360
CATTTGCCAA ACAAATTATA ATTAGAAATA AATCACAAAT AGATATGTTT GAAGAGATGA 420
TTAAAGCTAC GAAAGAGAAT GCTAATGATG TTGTGTTATC CTTTTATGAT GAAACGATTA 480
AACAAATGAC TACTTTAGGC TATATTAATC AAAAAGATAG TATGTTTAAT AATTTTATAC 540
AATCCTTTAA CCCTAGTGAT ATTAGGCTTA TATAACTTTy ATCAACTAAT AAATTATGAT 600
ATAGTAAACT GATGGTTAGA TATTTyTyAA CCATCAGTTT GCGTktATAT TAGTTTTTTA 660
TGCTTATTAT TTTTATGAGT TTCACTTTAC ATTATTGATT AATCCATAAG AATAATTAGC 720
ACAAAAAGCA GTATACATAA ATTGAGTAAA GAATTTTGTC GATATAACCG nGCGGAAAAA 780
TAATAACnTT TTGGATn 797




(2) INFORMATION FOR SEQ ID NO: 431: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTCCAAGAG AAGTTGCGGA AGTAATTAAT AAAACGCATC ATAATAAATT GGTCATTTCG 
ATGATTTCGk CACAAATCGA TGCGGATAGA ATGGATTATT TACAACGTGA TGCGTATTTC 
ACAGGTGTAT CATATGGTGC TTTTGATATG GAGCGTATTT TAAGATTAAT GCGACCTTCT 
AAAGATGAAG TACTAATCAA AGAAAGTGGT ATGCATGCAG TTGAAAACTT TATTATGAGT 
CGTTATCAAA TGTATTGGCA AATTTACTTC CACCCAGTTA GTCGTGGTGG AGAAGTGCTG 
CTTAATAATT GtTTGAAACG CGCAAAACAG CTTTATAATG AAGGCTATGA ATTTAAGTTG 
CATCCACATG ATTTTATTCC ATTTTTTGAA GAGACAGTTA CGATTGAACA ATATGTTGAA 
CTCGATGAAG CGGTAGTTAC GTATTATTTG GAAAAATGGA CAAAAGAAGA TGATGCTATT 
TTAAGTGATT TAGCAAGTCG ATTTATTAAT CGAGACTTAT TTAAATATAT TCCATTTGAT
GGCTCAATTA TTACAATATC AGAACTGCAA GAACTGTTTG AAGCAGGTGG TATTAATCCA 
GATTATTATT TTGTGAGTGA AGCATTTTCT GATTTGCCAT ATGACTATGA TCGACCGGGG 
TCAAATCGCA AACCGATTCA TTTATTAAGA CAAGATGGTA CGATTAGAGA AATAAGCAAT 
CAATCATTAG TCATTCATAG TATTACAGGC ATTAATCGCC AAGACTATAA ATTATATTAT 
CCTAGAGAAA TGGTTGCAAA GATTAAAGAT AAGACAATTA GAGAAGCTAT TGAAAATTTG 
ATTAATGAGC TTAATTAAAC AGGGCTAAAA TTGTTATCGT TAAATATGGA GGTTATATCA 
TTGTCTGAGA AAAAAGGCTT TAATTTTAAT ATCATAAAAA ATGACCCTCT AGATGGTCAT 
AAAGGTACAA ATATTGGTTC AATTAGCTTA GACAATATTG CACCAGTTTT TATCGATGTT 
GCTAACAAAG AAGCATTTAT TGATATTGGA GGCATGCATG CTCGTGCCAA AGTTGAAAAA 
GGTGTGAAAT GGATTACTGA TAAAGCTGCT GTTGAAGGCG ATGAAGCTAA AGaATATTGG 
TTGTGTTGGG TAACAACAGA ACGTAATGAA CAAGGACCAT ATTACGCTGG TTTAACAGCG 
TGCTATTTAT TAGTGAATAA AGCAATTCGT CGTGGTTATA AAAGTATGCC TGAACATGTT 
AATATGATGG ATAAATCAAT GAAACATCAT ATTATCATAG ATCAAATTGG TGACGAGAAT 
AAAGCTATTT TAAAAGACTT TTTAATGAAC CATGATGAAG GTATGTGGAA GCATTCTTCT 
GATGCTTTAC ATCAAGCATT TAATTAAATA TTAGAAACTA AAATTTCCCA ATTAATCTAT 
AAAGATATGA TTCATTTCTC AATGAC 
(2) INFORMATION FOR SEQ ID NO: 432: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2304 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 432:





TTGTACGAAT ACAAAGTTGC ATTAATATAG AATCATTTAT GCTTTTTAAT ATCACAAAAC 60
AAATTGATTA AACTAGCTAA AATAGTCAAA ATTGGAATCA AGATTACATA AATATCATAA 120
CCTCTAGATA TTAGTATACA TAATATAATT AGCGACCCTA TTATAATTGC TAAATTATAA 180
GCCAAACTAA TTTTATATTT CATTTCAATT CCCACCTTAA TAGCATTTAT TAAAATTCTA 240
TAATGTTTAC ATAGACTTCT TAACAAACTT TAACCCAGCT AAGCTAAGTA TTGAAATAAT 300
TACACCTCCT AATATAAATA TATTACTTAA GCTTAGTAAA GGTATAATTA TACTTATCAG 360
TCCTAAAGAC AATGTATCCG CTGCATAATT CGaTGtAGAT GAGATACTAA ATACTTTCCC 420
CATCAAATGA TTTGGCGTTT TTATTTGAAT GGCAACTGAT CTTGTTAGTC CCTCTATAGA 480
TTGTCCAAGT CCCAATAATG TTGCACCTAT ATATAATATC GCCACACTTG GAAACACATT 540
AATAATCGAT AAGCCAATTC CCCAAACTAA AACACCAATA CTAAATTTAA AGATTAATCG 600
CTTTTCTGAC AGCAAACCCA TAATCAATGA CATTAATAAA GATGCTATAC CTAAACATGA 660
TGTAGCTAGT CCATATACAC CAACGCCCTC TTTTAATATA TTGGAAATAA ACAATGGTAA 720
TACAACACGC CAAAGGCCAG TATTAATCAA TATGCAGGCA AATTGGATGA TTATAATAAA 780
TGGAATTTCT TTAGATTGTT TCAAGAATTC CCAAGTTTCA GAAAAATCTT CTTTTGAGTG 840
TCTATCAATC ATGTTGTTAT TTGTATATTT TAAAAGTGCA TTAAAAATAA ATCCTAAAAA 900
TAGCAATATA CTACAAATAA AAAAGACGCC AACATTACCA ACTAGTATTA CAATGACACC 960
AATTAAAGCA GGTAAAATAA TATTTGAGCC TCTTTGCAAA CTATCGATTA ACGCATTACT 1020
TGTTGCTAAA TGCTCCTCAT CAATAATTTC AGGAAGAATT GCCCTAAACG CAGGATCCGT 1080
ATAGCAGTTA ATAATGGTGA TAGCTGTAGA TATGGTTAGA AGCGTCAGAT AATTTAAATT 1140
TGATGTTATT GCAAGTAAAG GAATAATTAT TATAATCAAA CTTAGTATAA GATCAGATAG 1200
ATAAAGTATT TTCTTTTTAT TATGTTTATC AGAATATGCG CCACCGAAAA TACCAAATAT 1260
AATAAATGGA AGTGTTTGAC TCATAACCAT CATTGATAAT TTTAAAGATG ATTGGTTTGT 1320
CAATTCAACA GTAAACCAAA TTATTTGTAA CGAAAACAGC ACAAAACAAC TCCGACGTAA 1380
GArATTACCA ATCAATAAAT ATGTAAAGTT TCTATTTTTC AAAACTTCTA AATACAACAT 1440
ATTTATCACC TCTCATAAAA ATAATTGAAT GCATCCACCA GCTTTTTTAG ACCTTCTTCT 1500
AAACTCTCTT TATCCAAAGC GCAATTAATT CTAATATAAT TTAGTCAGTT AAATATCAAT 1560
TATTTCGAAA TATACATACT ACTTGAAACA CCATACATAA CCCCCAAAAT GACTACTCAG 1620
AGGTTATATT CTACTAATTA TGATTATATT AAATATGAAA ATATTATCAA AAAAATCAAA 1680
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TCATTCTCAT CATATTCTAG GTTGTTTTTA ACAAACTAAA TATAGTGAAT GCAAATCAAC 1B00 

TATTATTTAA ATTATGAATT ATTTTAATTC TTTCTTCTAC GAGCCAATAA CATTAATCCA 1860

GCAATTCCAA TTATACTACT AAAGATCAAA CCTTTTTGCG TGCTTTCTAA ACCTGTTTTT 1920

GGTAATTCTG CTCGTTTTTT CTCTTGATTA GCTACTGATT CTTTAGCAAT TTTAGATTTT 1980

TTAACTTTAT CATTTTTATC CATTGAATGA ACTGGGCCAT TTGGTTTTGC TCTGTCTTTC 2040

GATAATCCTG GATTGTTAGG ATTTACTGGG CCACTTGGAT GAGTTGGTCT GCTCGGCTTC 2100

TCTGGGTTTT CAGGTCCTTT TGGATCTTTT GGTTTCTCTC CACCGAACTC TACAATCTTA 2160

TCTACTGGTT GTTTtGTGAT CTCTTCTGTT GGTTGACCCT CGCCAACTTT TTCACCTGTT 2220

AATGGGkTCA CTGTGAwTGG TGkTGTGAwT GTCyTACTTC CTGGkTGTCC TTCTTGTTTC 2280

ACTCGCTCTT CACCAGGTTG TAAT 2304

(2) INFORMATION FOR SEQ ID NO: 433:

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 4733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 433: 




GTCTTAATAT TTTACAAGAT ACAGCCTGAC AATATTTATA TTACAAACCA ATATGGTAAA 60
GAGGCTGTGT CTCGATTAAA AAGAATTTTA GAGCAAAGTA TACCTAATAA AGAATATATT 120
GATTTATCAG AGATATCATT AATTACATTT TTTAAACCAG AATATAATAA ACAACATGTG 180
AATGAGGAGT TTAATTCTTT ATCTTCTTCG AAAATAAAGA ATTTAGCCAA AGAAAATGAT 240
GGTATTATTA TTTATTTAAA TTTTAAGGAT GTTAATTATC AACTGaTAaG TGAAGGCAAT 300
ACTITTTTTT CTGCAAAACC ATATTTGCAT TGTGTGTTCA ATAAAGATAT TTTAAATATG 360
GATAAAGTTG ATATAGAAAA TTTTTTCAAA AGTATAGAAG TCAAATACAG TAAATAAGAT 420
CAATTATTGA ATCCACCGAC CAGCAGATTT TTTTAAAAAA TTAATACCCC GTTAATACCC 480
TTTGCTTCAA TTTGATGAAA ATCAATGAAA TTCAAAATTG AAGAAATCCT TTAATATCAA 540
GGTGTACGAC AGTCTATATA ATCATGCGAA ATTCTAAAAT TTTCtGAcGT AAAAAAACAA 600
ATTCyTAAAG CAGCTCGTCG TTCACCTCAA TTCTCAAAAC GTTAATTGTC GGACGATATA 660
TATACAAAAC ACCTCGATGT TATGTCGAGG CGTTTGTGTG GGGAATATGG 720
AATAGAGTAT AAATGAATTA ATGTCTCAGG TATAGAATTA ATTCAACTAT GAATTATTGA 780
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TTAGCTTGCG GAGACCGTAC GATAGTGTTC TAATAATAGA TATTCAAGGG AAAGAGCTAT 900 

CGGATGATAA CCAGATGATA ACTCAAGAAC CATGrTTTAA GAGTGTTATC AAAATAAGGA 960 

GTGACTTAAT TGAAAGATTC TATATTTTGG AAGAAAGCTT TTATTyCTGT TTATTTTATT 1020

GTTGsGATGC TGGTGTTTCT ACTTTTTAGG TTTTATATTA AaACAGATAA CTTTTCTATA 1080

TATTTAATGA GTATCTTCTT AATTTGTTTA GGAACTGCTT CTATCATTTA TAACTATAAA 1140

ACCAATCGAT AAATATAAGT ATGAAATGTA TAAGTCACAG TTAGATCTAA GTCTTGCTGT 1200

GGCTTTTTAT ATCTCCATAA AACGACAAAT TCAAGCCCGA CATAAAACAG CATTTTCAGC 1260

CCACCATAAA ACGACAATTT CAGCCCGCCA TTGACTAAAC ACCACATCCC AAAAATATCG 1320

TAACAATCCT CTACATCAAT CAATCCAACA TCCCTCATAA TCACAACGCA CAAAATCTAT 1380

TCATGCATTT TTGGAATACT TAGTATTACA AATAACGATT TTTATTCATC TAACAAAGGT 1440

TATATAATGT ACTGAAGGCA ATTTTTATGT ATTACAAATC TAATCGTACA TGTAAAATTT 1500

TGATAAACAT CATTAATTTT GCGTAACTAT CATTAGATTA CAAATCACAA AGTAATTACA 1560

TGTAATACAC ATCTATACAT CACATTTGAA GGGAAATGaA TATAAATGAC TGATAAAAAG 1620

TACACTGCAG CCGATATGGT TATTGATACT TTGAAAAATA ATGGGGTAGA ATATGTTTTT 1680

GGTATTCCGG GTGCAAAGAT TGACTATCTA TTTAATGCTT TAATTGATGA TGGTCCTGAA 1740

CTTATTGTCA CTCGTCATGA ACAAAACGCC GCAATGATGG CGCAAGGTAT TGGAAGATTA 1800

ACAGGCAAAC CGGGTGTAGT ACTTGTTACA AGTGGTCCTG GTGTAAGTAA TTTAACTACT 1860

GGATTATTAA CCGCAACATC TGAAGGGGAT CCTGTATTAG CGTTAGGTGG CCAAGTGAAa 1920

CGnAnATGAT TTATTACGAT TAACGCATCA AAGTATTGAT AATGCTGCGC TATTAAAATA 1980

TTCATCAAAA TATAGTGAAG AAGTACAAGA TCCTGAATCA TTATCAGAAG TTATGACAAA 2040

TGCAATTCGA ATTGCTACTT CAGGAAAAAA TGGCGCAAGT TTTATTAGTA TTCCGCAAGA 2100

CGTTATTTCT TCACCAGTTG AATCTAAAGC TATATCACTT TGCCAAAAAA CAAATTTAGG 2160

AGTACCGAGT GAACAAGATA TTAATGATGT CATTGAAGCG ATTAAAAATG CATCATTTCC 2220

TGTTTTATTA GCTGGTATGA GAAGTTCAAG TGCAGAAGAA ACAAATGCCA TTCGCAAATT 2280

AGTTGAGCGC ACGAATTTAC CAGTTGTAGA AACATTCCAA GGTGCAGGTG TAATTAGTCG 2340

TGAATTAGAA AATCATTTCT TCGGTCGTGT GGGCTTATTC CGCAATCAAG TTGGTGATGA 2400

ATTATTACGT aaaagtqatx TAGTTGTTAC AATCGGTTAT GATCCAATTG AATACGAAGC 2460

TAGTAACTGG AATAAAGAAT TAGAAACACA AATTATCAAT ATTGACGAGT TCAAGCTGAA 2520 

ATTACTAATT ATATGCAACC GAAAAAAGAG TTGATTGGTA ATATTGCTAA AACGATTGAA 2580 
CAATTAAGAA CACATATTGA TGAAGAAACT GGTATTAAAG CGACGCATGA AGAAGGAATT 2700
CTACATCCAG TGGAAATTAT TGAATCTATG CAAAAGGTAT TAACTGATGA TACTACTGTA 2760
ACAGTTGATG TTGGAAGTCA CTATATTTGG ATGGCACGTA ATTTCAGAAG TTACAATCCA 2820
AGACATTTAT TATTTAGCAA TGGTATGCAA ACGCTTGGTG TAGCATTACC GTGGGCAATT 2880
TCAGCTGCAC TTGTGCGCCC TAATACGCAA GTTGTGTCCG TTGCTGGCGA TGGTGGCTTT 2940
TTATTTTCAT CACAAGATTT AGAAACGGCC GTACGTAAAA ATTTAAATAT CATCCAGCTT 3000
ATTTGGAATG ATGGAAAATA TAACATGGTT GAATTCCAAG AAGAAATGAA ATATAAACGT 3060
TCGTCAGGTG TAGACTTCGG TCCTGTAGAT TTTGTAAAAT ATGCAGAATC ATTTGGCGCG 3120
AAAGGTTTAC GAGTTACTAA TCAAGAAGAA TTAGAAGCGG CAATTAAAGA GGGCTATGAA 3180
ACAGATGGTC CAGTATTAAT TGATATACCT GTAAATTACA AAGATAATAT CAAACTTTCA 3240
ACAAATATGT TACCTGACGT ATTTAACTAA AATAAAGATA AATGTTAAAG AGGAGTGGGA 3300
GATTTTATGA CTAATGTTTT ATACCAACAT GGTACATTAG GCACATTAAT GGCAGGATTA 3360
TTAAAAGGAA CTGCATCAAT AAATGAATTA TTGCAACATG GTGACTTAGG TATCGCTACA 3420
CTAACAGGTT CAAACGGTGA GGTAATCTTT TTAGATGGAA AAGCTTACCA TGCAAATGAA 3480
CATAAAGAAT TTGTAGAATT AAAAGGTGAC GAGTTAACAC CATATGCAAC TGTAACTAAA 3540
TTTGTAGCAG ATACAAGCTA TGAAACGAAA GATAAATCTT CAGAAGCAGT TTTTGCAGAA 3600
ATTAAGGAAA AGATGTTGAG TGAAAATTTA TTTTCAGCAG TAAAAATTTC AGGCTTATTT 3660
AAAAAAATGC ATGTACGTAT GATGCCGGCT CAAGAACCAC CTTATACACG TTTAATCGAT 3720
TCAGCTAGAA GACAACCTGA ACAAACTGAA ACGTATGTCA AAGGTTCAGT CGTTGGTTTC 3780
TTTACACCAG AATTATTCCA TGGTATCGGA TCAGCAGGAT TTCATGTACA CTTTGCGAAT 3840
GATGATCGTA ACTTTGGTGG ACATGTCTTA GATTTTGAAG TAGAAGATGT TAAAGTAGAA 3900
ATCCAAAATA TAGAAACATT TGAACAGCAT TTTCCAATTC AAGATAAAGA TTTCACTAAA 3960
GCAAATATTG ACTATAAAGA TATTGCAGAC GAAATTAGAG AAGCTGAATA ATGAATCCAG 4020
AAATATAATG ACGGTTTATG AAAATTGACT TCATAATGCG CGATTTAGAA ATGATAGTTT 4080
GTAAATATGA TTAACCATGA CTACAATAGA ACAAATATAT TTATAATTAC GTCTAAGTAA 4140
TAAAATAAAT CCCTTCACTA TTAGCAGTAG TGAGGGGATT TATTAGGTTC CAGATATTTG 4200
AGATTTGCTG TTATGTTTAG ATTATAAATT GTGGTACACA CTCATATAAA ATTTACTATT 4260
GTATAGGCCA ATCTGTTACT ACGAGAAGCA AACAACAATA ATTTACAAGT TCAATAACTA 4320
AAAAGACAAA CGCCAATTTT TCAGCGCTTG CCCTATAAAA CTATTTTCAA ATTATTATTT 4380
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TTTCTTTAGA TTCACTACTT TTTTATTACC ATCATTCAAA GTAAGCGTAT AAGTTGCTGT 4500 

TTGGGCATTA TTAATTTTTT CTGTTGTAAC ACCACGTTGA GAAGCTAATT CATTTTTTAC 4560 

TTTACTGTCA ATTTCTTGAT AAAGAACATT TTTATTTTCT GGGAAGATAA AGTAAGTTCG 4620

ATGTAATGCA GTAATACCAT CTACTGAAAT TGTGTAAGGA ACAGTGTGAT AACCATCCAC 4680 

AGTCATTTCT TTATAGCCGT TATTACTATC TGCAGATGCT TCGTGACTCG GTA 4733 
(2) INFORMATION FOR SEQ ID NO: 434: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 434:

AATAAATGGT AATTTTATAT CATCAACGGC TACAAATGGG GCAGTTTCAT ATATAAATAA 60 

ACACATCTAC TCAACTGnTC TTAATTAATT AATAAATGAT TTACCAGATG ACATATAACA 120 

GCCATTATTA GnTTAATTCT AATAGTTTAT TTAAATTTTC TTCGGTTGTC GCCCAACTGG 180

TTGCGAATCT AACAACACGA TGTTGATCAT CGTATTTTTC CCAAACAGCA AATTTAACTT 240

TTTGTTCTAA CTCTGCTATT TTCTCGTTAC TTAAAATAAA AAATTGTTGA TTGGTTGGAG 300 

AATCAAAGTA AAGACGATAG CCTTTATTTT TAAACCCGTC TTTCATCTTA TTTGCCATTT 360 

CGATAGCATG TCTGCTTATA TTAAAATATA AATTGTCCGT AAATAATTCT AAAAATTGTA 420 

TGCCTGTTAA CCGTCCTTTT GCTAAAAGGG CACCGTGGAT GCTTGATTCG AGTGGTAAAT 480

TGTTTCGGTT CATTATTTTT CGTAAAAACA ATGGCTTCCC CGCATAATGC ACCTATCTkC 540 

GTACCACCTA TATAAAATAC ATCACAATAT TTAGCGrTGt CTTTAATAGT CATATCTGAT 600 

TGGTCACTCA TCAATCCATA CCCTAATCGT GCACCATCCA TAAATAATGG AAGCTGATAT 660 

TGCTTACATA CTTTGCATAA CTCTTCCAAT TCTGaTTTAG AGTATAATGT GCCATATTCT 720 

GTAGGATGAG AAATATATAC CATTCCTGGG AATACCATAT GGTCCTTTTT AAAATCACTT 780 

TTAAATGTCT CCATGTAAGT TTCAACATCT GAAGCACTAA CTTTTCCTTC CTTAGAGGGT 840 

ATAGTAATTA CTTTATGTCC ACTATATTCA ATTGCACCGC CCTCATGCAC AGCAACATGA 900

CCAGTGTCTG CTGAAATGAC CCCTTCGTAA CTTTCTAACA TTGAATTAAT AACAACCTGA 960

TTGGTTTGTG TTCCACCTaC TAAAAAACGA ATTGTAGCAT TTGGaCAGTC AATTGTATCT 1020 

TTAATCTTTT CAATTGCCTG AGCTGTGAAT TGaTCAAAGC CATATCCCGA AGCTTGTACA 1080
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TCGAATGAAA TCACTACATT TCCCCCTAAA ACTAATATCA ACATTTTAAT AAGATAAACC 1200 

AATTTCAAAA CTAGTTCGAT ATTTAAAATG TATTATGGAT GGnTAAAGTT TGTATCGCAT 1260 

TATCGCGAAG TTGnATAAAT ATAT 1284

(2) INFORMATION FOR SEQ ID NO: 435: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1072 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 435: 

ACAGCTTTTG GTAArGGAGA AAwTCAaTAt GAAACAGTAT AArGCGTATT TaATCGATTT 60 

20 AGATGGCACA ATGTATATGG GAACAGATGA GATTGATGGA GCAAnACAAT TCATCGATTA 120 

TTTAAATGTA AAAGGCATTC CTCATTTATA CGTAACTAAT AATTCAACAA AAACACCTGA 180 

GCAAGTAACT GAAAAATTAC GTGAAATGCA CATTGATGCT AAACCAGAAG AGGTTGTAAC 240

GTCAGCGTTA GCCACTGCTG ATTATATTTC AGAACAATCA CCAGGAGCAT CAGTATATAT 300

GTTAGGTGGG AGTGGTTTAA ATACTGCGTT AACCGAAGCG GGACTTGTCA TTAAAAATGA 360 

CGAGCATGTT GATTATGTAG TTATTGGACT TGACGAACAA GTTACATATG AAAAGCTTGC 420 

GATTGCAACG TTAGGTGTAA GAAATGGtGC AACATTTATT TCTACAAATC CTGATGTATC 480 

AATTCCTAAA GAGCGTGGTT TATTACCTGG TAATGGTGCT ATTACAAGTG TTGTAAGTGT 540 

ATCGACAGGT GTATCGCCAC AATTTATTGG TAAACCAGAA CCGATTATTA TGGTTAAAGC 600 

ATTAGAAATT TTAGGATTAG ATAAATCCGA AGTTGCTATG GTAGGCGATT TGTACGATAC 660 

CGATATTATG TCTGGTATTA ACGTAGGTAT GGATACGATT CATGTACAAA CAGGTGTATC 720 

TACGTTAGAA GATGTGCAAA ATAAAAATGT GCCACCAACG TATTCTTTTA AAGATTTAAA 780

TGAAGCAATA GCTGAATTAG AAAAATAGAT ATAGTCATTT TATAAAGTAG GTGAATTGAT 840 

TTGGTAAAAA TAGTTGTTTC GAGGAAAATT CCAGATAAAT TTTATCAACA ATTAAGTAAA 900 

CTTGGTGACG TTGTTATGTG GCAAAAATCA TTAGTGCCTA TGCCTAAAGA TCAATTTGTG 960 

ACaGCcTTCG TGACGCAGAT GCTTGTTTTA TTACATTAAG TGAACAGATC GATGCAGAAA 1020 

TTTTAGCGCA ATCACCAAAT TTAAAAgTAA TTGCGAATAT GGCTGTAGGA TA 1072 
(2) INFORMATION FOR SEQ ID NO: 436: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3271 base pairs 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 436: 
TAAAAACTTA CTTTAACACC ATTCCTTTTT AACTTTTTTC GTGTTTCnTT TnCTTAAGTC 60 

CATCCATATT TTTAATGATG TCATCTGCTG TTTTATCTTT TAAATCTAAC ACTGaGTGAT 120

AmCGAATTTG TAGCACAGGA TCAAATCCTT TATGGAATCC AGTATGTTCA AATCCTAAGT 180 

TACTCATTTT ATCAAAGAAC CAATCATTAC CAGCATTACC TGTAATCTCG CCATCATGAT 240 

TCAAGTATTG ATATGGTAAA TATGGATCGA TATGTAGGTA TAGACAACGA TGTTTTTTAA 300

CATATTTTGA TAATTCATTA AAGAAAAAGT GTACGAGTTC TTGATTTTCA TAATCAATCA 360 

CTGGACCGCG ATTTGAATAA AAATACTTGA ACACTTTCAT AACAGGTACA GCAGTAAGtA 420 

AGCAAGCTGC AATGACCTCG TTATTATTGT TTTTTATTCC CACTAAATGT GTTTCATAAC 480

CTTCAGCAAG CTTTAACTCA TAGTGGCCAA CAGTTTGCGT GaAATGACTG TATGGCATGC 540 

TATCTGTAAa GGCACCAAAC TCTTTAGCTG TTAAATTTGT AAACTTCATT ATCATTACTC 600 

CTATTTGTCT CTCGTTAATT AATTTCATTT CCGTATTTGc AGTTTTTCTA TTTCCCCTCT 660 

GCAAATGgCA AAAATAATAA ATCTAATCTA AATAAGTATA CAATAGTTAA TGTTAAAACT 720 

AAAACATAAA CGCTTTAATT GCGTATACTT TTATAGTAAT ATTTAGATTT TTGAATACAA 780
TTTCAAAAAA AGTAATATGA ACGTTTGGGT TTGCTCATAT TACTTTTTTT GAAATTGTAT 840 
TCAATTTTAT AATTCACCGT TTTTCACTTT TTCAAACAGT ATTCGCCTAA TTTTTTTAAA 900 
TCAAGTAAAC TTAATTATTC AATGTTTGTT GGATAGATTG TAAATATTTA ATGATTTCCT 960 

CACGCGTGTT AGATTTAAAT CGCTTAACGA TTTCGCTACC AATGACAATG CCATCTGCAA 1020

CCTCTTTTAT ATCTGCAACA TGTTGTGGTG TTCTTATACC AAATCCTGCG ACAACTGGCA 1080 

CATTGGCTAT CGCTTTAATT GACTCAATTT TTCGTTTTAA TTCTGGATGA AACGCACCGT 1140

TTTGCCCTGT TGTCGCATTC ATCGTCACAG TATAAATAAA GCCTTCCGCA TGGGATACGA 1200 

TATCTTTTAT ACGTTTGTCA TCAGTAGTCA TCGCAACTAA CGATATGATT TTGACGCCAT 1260

AGTGACTAAA TTGTTGTTTT AAACGCTGCG ATAATTCATA TGGTAAATCA GGAATAATTA 1320

AGCCGTAGAC ACCAGTATCT CGACATTTTT CAAAAAACGC TTGTTCTCCA TAATGACAAA 1380

TAATATTATA ATACGTCATT AATACATAGT TACACTTAAT TTGATCACCA TGTTTTTCTA 1440

ATTGATTGAA AATATAATCT ATCGTGATGC CTTGTTTAAT CGCTTGTTGA CCTGCTTCCA 1500 

TGATAACTGG ACCATCAGCA ACCGGATCAG AGAAAGGTAC TCCAATTTCA ATTATATCTG 1560 
GTATAAATAA TTTAGTCATT TGCAAGACCT CGCTCTACCA TATATTGTCT AATTGTTTCC 1680
ATATCTTTAT CGCCACGTCC AGAAATAGTT ACTACAATAA TATCTTCTTT CGACATCGTA 1740
GGCGCTAGTC TTTCAACATA ACTCAGTGCA TGTGCACTTT CAATTGCAGG TATAATACCT 1800
TCATGTTTTG TAAAGTTGAT TAAAGCATTC ATTGCTTGTG TATCACTAGC ATTTTCAAAA 1860
GTTACTCTAC CAATGTCGTG GTAATAAGAA TGTTCTGGTC CAATACCAGG ATAATCAAGT 1920
CCTGCTGAAA TAGAATGTGC TAGTTGCACT TGCCCATCTT CATCTTGAAT TAAATACATT 1980
TTAGTACCAT GTAATACGCC AGGTGATCCT TTGCCAATTG CAAGTGCATG TTTATCAGTA 2040
TCATCGCCTT GACCTGCGGC TTCAACACCG TATAATGCAA CATCATCTTT AATAAATGGA 2100
TAAAATGTAC CGATTGCATT TGAGCCACCA CCGATACATG CTACAATTGC ATCCGGAAGT 2160
CGACCTTCTT TCTTCAATAT CTGTGATTTT ATTTCTTTAC CAATCACACT CTGAAAATCT 2220
CTAACAATCG TTGGGAACGG GTCTGGACCT AATGCAGAAC CTAATAAATA ATGTGTATCA 2280
TCTACATGAC TTACCCAATA TTGCAATGCT TTATTAACTG CATCCGATAA AGTCCCTTGA 2340
CCATCTTCAA CTGCCACAAC CTTTGCACCA AGTAATTCCA TTCTAAATAC ATTAAGTTGT 2400
TGTCTTTTAA TATCTTCACT TCCCATAAAG ACAACAAGTT CCATATCAAA TAATGCAGCA 2460
ACCGTAGCAC TAGCTACACC ATGTTGACCC GCACCAGTTT CAGCAACAAG CTTCTTCTTG 2520
CCCATTCTTT TAGCAAGCAA CGCTTGACCT AACGCATTAT TAATTTTATG GGCGCCTGTA 2580
TGATTTAGAT CCTCTCGTTT CAAATATATT TTAGCGCCAC CTAGGCTTTC AGTATATGAT 2640
GCAGCATATG TAAGTGGTGT CGCGCGTCCT ACATACTCTG ATAAATAGTA TTCCAGTTCT 2700
CTTTGAAACT CTGGGTCTGC TTTTGCCTCT TTATAAGCTT TTTTCAACTC AATAATTGCT 2760
GGCATTAATG TTTCTGGAAC ATATTGCCCT CCATATTCAC CAAAGAAACC TAATTCATCT 2820
GCTTCTGTTT GTATTTGTTT ATTCATTGTC TCTATCTCCT TTCACAATAT TTACAATTGC 2880
TGTCATTTTT TCTATATCTT TTCGCCCATT TACTTCTATA CCTGATGCAA GATCATAACC 2940
TTGATGTGAT AATTTAAGTT GATTAACTGT TTGAATATTT TCAGAGTTAA TGCCTCCTGC 3000
TATCAAATAA GGTATGTCTT TTATGTGCTT CAAAATAGTC CAGTCATATG TTTGACCGGT 3060
ACCACCATAC GACACTGAGG GTGTGTCGAT AATAAATAAA TCTACGAACC CTTTATATTT 3120
ATTTATGTTT TGGATTATGT TTTCATCTnG CAGtAAAGCT TTAGTGATTT TAATGCTTGA 3180
ATATkTCTTT TTAATTTCCT GTAtAAAAtC AATAGATTCt GtGTAACTGT ATTGTGkTAA 3240
wTGaCGwATg CtTAAwACgT GTGCCAATGG T 3271

(2) INFORMATION FOR SEQ ID NO: 437:
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(A) LENGTH: 1553 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 437:

ATAATGAAAT AATACTGTGT TTTATCTGCG AAATGTAtCA TTTTCTAATc GTTTCACAGT 60

AAAATGAAAA GATAAAGTGT GTTTTTACTT GAATTTTGAC TAAAATTACT CTATATTTAT 120 

TAATTGAGCT ATGCTTATTA TTACAATTTG ATTACAAATT TTAAATTTGT TAATTGAATG 180 

ATAATATTAA ATAAAGAAAC TTACACAAGC AAATATGAGT TGTAGCCCAA AATACTTGTT 240 

AAATCAAAGT TGAAAGCTAC AAATAATGAA AATTATAAAC TTGAATCTGA AAGTAATTAC 300 

TATAATTATG ACAATGTTAA CTTTTAAACG CACTTATTAA TTAACTACAT AATGTTAATA 360 

TCTAATTTAT TCAAGTACTT TCGCAAGATT TATTATCTAA ATAACGGGGG AAAGAATCAT 420 

GAGTtCACAA AAAAgAAAAT TAGTCTTTTT GCGTTCTTCT TATTAACCGT AATAACGATT 480 

ACCTTGAAGA CGTATTTTTC TTATTATGTT GATTTTTCTT TAGGTGTTAA AGGTTTAGTA 540

CAAAACTTAA TATTATTGAT GAATCCTTAT AGTTTAGTAG CACTGGTTTT AAGTGTGTTC 600 

CTATTCTTTA AAGGCAAAAA AGCATTTTGG TTCATGTTCA TAGGCGGCTT CTTATTGACG 660 

TTCCTATTAT ATGCCAATGT TGTGTACTTT AGATTCTTCT CTGATTTTTT AACGTTTAGT 720

ACTTTAAACC AAGTAGGTAA CGTAGAATCT ATGGGTGGTG CGGTTAGTGC ATCATTCAAA 780

TGGTATGACT TTGTTTATTT CATTGATACG TTAGTTTACT TATTCATTTT AATATTTAAA 840 

ACAAAATGGT TAGACACAAA AGCATTTAGT AAGAAATTTG TTCCTGTCGT AATGGCGGCT 900 

TCAGTAGCAT TATTCTTCTT AAACTTAGCT TTTGCTGAAA CTGACAGACC AGAATTATTA 960 

ACACGTACAT TTGACCATAA ATATTTAGTG AAATATTTAG GACCTTATAA CTTTACAGTA 1020 

TACGaTGGTG TTAAAACTAT CGAAAATAAT CAACAAAAAg CGCTAgCATC TGAAGATGAC 1080 

TTAACaAAAG TATTAAAtTA TACGAAAcAA CGTCaAACmG AGCCTAACCC rGAwTATTAT 1140 
GGGGTGGcAA rGAAGAAAAA TATTATTArG ATTCATTTAG AAAGTTTCCA AACCTTCTTA 1200

ATTAATAAAA AGGTTAATGG TAAAGAAgTA ACACCGTTTT TAAACAAATT ATCAAGTGGG 1260 

AAAGAGCAAT TCACATACTT CCCTAACTTT TtCCATCAAA CAGGTCAAGG TAAAACATCT 1320 

GACTCTGAAT TTACAATGGA TAACAGTTTA TACGGTTTAC CGCAAGgTTC TGCCTTTTCA 1380

TtaaaaGGAG ATAATACGTA TCAGTCATTA CCAGCAATTT TAGATCAAAA GCAAGGCTAC 1440

AAATCTGATG TCATGCACGG TGACTATAAA ACATTCTGGA ACAGAGACCA AGTATATAAA 1500
(2) INFORMATION FOR SEQ ID NO: 438:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1419 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 438: 





ACAAAATCAA TCAGCAAAAT GATTGGATTA AGACGCAAAT TGAGCGTTCA ATGGAAGGCG 60
AAACAGTTGG CATTAATGAT CAAAATATAG AAATATATAG TGAAcGtTgA GATTTATACC 120
ATACACTCGT ACCTTTAAAT CAAGAATTGC ATAAGTTGCG ACTTAAAACT CAAAACTTAA 180
CCAATGAAAA TTATAATATT AATGATGTGA AAGTTAAAAA GATTATTGAA GATGAACGTC 240
AAAGACTAGC ACGAGAACTT CACGATTCTG TTAGTCAGCA ACTTTTTGCG GCmAGTATGA 300
TGCTATCTGC TATCAAAGAA ACGAAtTAGA ACCACCATTA GACCAACAAA TTCCTATTTT 360
AGAGAAAATG GTTCAAGATT CGCAGTTAGA AATGCGTGCT TTGCTGTTAC ATTTAAGACC 420
GCTTGGTTTA AAAGACAAAT CITTAGGTGA GGGTATTAAA GATTTAGTTA TTGATTTACA 480
AAAAAAAGTG CCAATGAAAG TTGTGCATGA AATACAAGAT TTTAAAGTGC CTAAAGGTAT 540
TGAAGATCAT TTGTTCAGAA TTACACAGGA AGCAATTTCG AATACATTGC GTCATTCAAA 600
CGGTACAAAA GTGACAGTAG AATTGTTTAA TAAAGACGAT TATTTATTGT TGAGAATTCA 660
AGATAATGGT AAAGGTTTTA ATGTTGATGA AAAATTAGAA CAAAGTTATG GACTTAAAAA 720
TATGCGTGAA AGAGCTTTGG AAATTGGTGC AACGTTCCAT ATTGTATCAT TGCCAGATTC 780
AGGTACACGT ATCGAGGTGA AAGCACCTTT AAATAAGGAG GATTCGTATG ACGATTAAAG 840
TATTGTTTGT GGATGATCAT GAAATGGTAC GTATAGGAAT TTCAAGTTAT CTATCAACGC 900
AAAGTGATAT TGAAGTAGTT GGTGAAGGCG CTTCTGGTAA AGAAGCAATT GCCAAAGCCC 960
ATGAGTTGAA GCCAGATTTA ATTTTAATGG ATTTACTTAT GGATGACATG GATGGTGTAG 1020
AAGCGACGAC TCAGATTAAA AAAGATTTAC CGCAAATTAA AGTATTAATG TTAACTAGTT 1080
TTATTGAAGA TAAAGAGGTA TATCGTGCAT TAGATGCAGG TGTCGATAGT TACATTTTAA 1140
AAACAACAAG TGCAAAAGAT ATCGCCGATG CAGTTCGTAA AacTTCTAGA GGAGAATCTG 1200
TTTTTGAACC GGAAGTTTTA GTGAAAATGC GTAACCGTAT GAAAAAGCGC GCAGAGTTAT 1260
ATGAAATGCT TACAGAACGA GAAATGGAAA TATTATTATT GATTGCGAAA GGTTACTCAA 1320
ATCAAGAAAT TGCTAGTGCA TCGCATATTA CTATTAAAAC GGTTAAGACA CATGTGAGTA 1380
(2) INFORMATION FOR SEQ ID NO: 439:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 439:

ATGTUGGCGG ACGTAAAGGA CATGTTTATA CTGATGATCG AGCATTAGAT ATTGATATCG 60

TTCCGCCTGC TCAAGCAGAT GGTAAAGCTA CTAACCCCGA ACAATTATTT GCAGCAGGTT 120

ATGCATCTTG CTTCAACGGT GCTTTCGACC TAATTTTAAA GCAAAACAAA GTGCGTGATG 180

CTCATCCAGA AGTAACACTA ACAGTGAGAC TAGAAGATGA TTCAGACTCA GAAAGTCCTA 240

AATTAAGTGT TTCAATTGAT GCGACAATTA AAAATGTTAT ATCTCAAGAA GAAGCTGAAA 300

AATATTTACA AATGGCTCAT GAATTTTGTC CATATTCAAA AGCGACTCAA GGAAATATTA 360

ATGTCGATTT AAATGTAAAT GTTGTAGATT AGCATTAACT TAAAGAGATT ATTCAACGTT 420

ATTAATAAAA TTCACATAAA ATTCAAATTt stCrAcCAAA AATTTTTGGT TGGyTATTTT 480

TTCTATTCGT GATTGAAATT TCTGGCAATA TTTAACTGAA AATGATTGTA CCTTAGTCAT 540

CATAAATGTG ACCGGTTCCA ATACTGGCTT GACTTCTTCG CATACCGTCT ACAAATAAAA 600

GTCCAGTG 608

(2) INFORMATION FOR SEQ ID NO: 440: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 682 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 440:

CCTTTCAAtT TCTCCCAAgA TTTACGCATG TCTGACCAAA TGTCAATACA ATACCTGAAT 60
AATGATTTCT GGAGTAATTG CGAAAAATAT TTAAAAGTTG CTATAGATCA ATTTTCAAAT 120
TATAGTATCT CATCTCAAGT TTCTAACTAT CATTTCACAG TATTACTTGG AGATCGCCAA 180
AAACCACTTA TGTATCTAAA TAAAAATCGC GGTGGTGATG GTGGCATACC AGGTTATATT 240
ATGATTTATT TAGTGCCGAG TACAAGTACA ATTAATTCTA TGAAAAGCTT AATTGCACAT 300
GAAGTAAATC ATAATATGCG CTATCAATAT ATTGATTGGG ATGGCGGAAG TTTGATTGAA 360
ATTGGACCGT GGGTAACAAA TACTAATTGA AGTCGTGATA ACGTAAAAAT TAAAAATACT 480
ATTTATAATC ATTTACATTT AAAGCATATA TTTGAATCGA tGCCTTATCT CTATGGTGAT 540
GATATTAATA AACTTCAAGG TAGGCCTATC GTTGGCTTAT CTCATGCTGC CGGGTATGCA 600
TGTGGCTATC ACTTGGTAAA ATACTTTTTA CAAAAAACAA ACATACCTAT TGAAGTTGCT 660
ACAACACTTC CAGCACAAAA AA 682


(2) INFORMATION FOR SEQ ID NO: 441: 










(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 441:

ATCTTCTTTT AACATGACAA ATTGCAACAA AATATTAATG ACGCCAAAAG ACATTTTTTC 60
ACGTTCAATT AATTCTTCAA CCATTGTCTT TTGCGATATA GTTGGTTCTG ATTCAGACCA 120
AGAAGCTAAC ATATCAATTG GACTCGTTTG TTCAAGTAAC TCAAACCATT CATCACTTTG 180
TGGCTTTGGA TTCACTTCTG AAGATTTGCC CGCCGAAGAT GATGTAGCAG GAGATTTCAC 240
CTGTAATTTA GGCATTTGAT TTTCGTGTTC CATTAAGTAA TACGAGCGTG CTTGTTTACG 300
CATTTCTTCA AAGGATAACT GTTGTCCACT TGTAATTGAA TTTAAAATAA CATGCTTCAT 360
GCCATCTGCT GTTAAACCAT ATAAAGTCGC GAGTTGTGTA ATTAAACGCT TTGCATCTTT 420
GGTAACAATG TCTTGACTAA TAAAATGTTT ACCTAACATT TGTCTCAACA TTTCAAAGTC 480
AAAAGATTCA TTTGATAAAT CGATACCTTG GTACGGTTCA TTAATCGGaA TATCACTTGT 540
ATCGATATCT ATTTTTGTAG ACGGCACTTT AAAAACATCA GTAAATTGTC TTGTTACCTG 600
TTTAAATTCA CTCAAATCAA TTTGTTGATA CTCAAAGTAT TTCTTCAACT CATGAAATCG 660
ACGATGCTCG ACTTCACTAT ATAAAAAGAT TGACAACATT GGATCATTAA AAAATAAATG 720
TGCTGAAGGG GtTGaATTAA TTGGTAAACA AATTGTGTTT CTTGTTCATC ATGTTTGACA 780
AACGCCTTTA ACAATCCAAT CGCTTCAAGT AAGTCCATTT GTTGTCTAAA CTCTAGTAAA 840
TTAATTTTAA GTTCATTCAT AAAAATATAA TGAGAAAGAA TCAATGTTTC ATTATGACTT 900
TCTTTAACGA ATTGAGTCAT AAAATGATAT AAACCCACTG CTTGCGTTCC AATTAGCGGT 960
GTATACAGTC GATTCAATAC CTCTAAATGA TTCGTATTTA AATCAAAGTG TTGCATAACT 1020
TTGAATTGAT CCTTTGGTCT TAAGCCGAAT TCGAAGGCTT GTCGTCCCaT TTAAGCnATC 1080
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10 



15 



AAATTCTTTA TAGACTGATG CAAATCTAAC ATATGAAACT TGATCAACAT GCATTAACAA 1200 

GTTCATAACG TGTTCACCTA TATCTCGTGA AGACACTTCC GTATGACCTT CATCTCGTAA 1260 

TTGCCATTCA ACCTTGTTAG TTATGTCTTC AAGTTGTTGA TATCTAACTG GTCGTTTCTC 1320

ACAAGAACGC ACAAGTCCAT TAAGTATCTT TTCTCTTGAA AACTGCTCTC TTGTGCCATC 1380

TTTTTTCACA ACTATAAGCT GACTAACTTC GATATGTTCA AATGTAGTGA AACGTGTTCC 1440 

ACAATTTTCA CATTCTCTTC GTCTTCGAAT GGCATTTAAT TCATCGGCAT GCCTTGAATC 1500 

TACAACTTTA GATTGTGTAG AATTACATTT CGGGCATTTC ATTACATCAC CCTCTTTATT 1560 

TTGATTATGC CTAATTATAC TATAAATCTA GAGATGAAAA AAGAATCCCT CAATTTAATT 1620 

CATTTAACCA AATAATGAAA CAATAAAAAA CATTATATCG TTACTTATTA AGTAATTTGC 1680

ATGACAATAT TATTGTATTA AAAATAAAAA ACCTAACTCC GAAGTCAGAG TTAGGCTATA 1740 

AATTAATTGT ATTAACTTGC ACTTACAGTT TCTTTTGATG TCAAAAGTGC TCCAATTTGC 1800

TCAGCAACAT CTACAACTCT ATTTGAATAA CCCCATTCAT TATCATACCA AGCAATAACT 1860 

TTTACTTTAT TCCCTGACAT GACCATTGTT GATTTTGCAT CAATAATAGC TGAATTTGGA 1920 

TTAGTATTAA AATCAACAGA CACTAGTGGT TGATGTTCGA CTTCTATGAT ACCTTCTAAA 1980

CCTGCATTTT CAAAAGCTTG GTTTACTTCT TCTGCAGTTA CTTCTTTTTC TAAATCAACA 2040

ACTAAATCAA CGAGCGATAC ATTCTTTGTT GGTACACGTA ATGCCATGCC GTGTAATTTA 2100

CCTTCTAATT CTGGTAATAC TTCTTTTAAA GCTTTCGCCG CACCAGTAGA AGTAGGAATA 2160

ATGCTTTCAT TACATGAACG TGCACGTCTT AAATCTTTAT GTGGATTATC AATATTTTTT 2220 

TGGTCATTTG TAATAGCGTG AACAGTAGTC ATTAAACCAT TAACTATTCC AAACTGATTA 2280 

TTTAAAACTT TTGCAACTGG ACCAATGCAA TTAGTAGTAC ATGAAGCATT ACTAAAAATG 2340

TCAAATGCTT CTATATCTAA TTGGTTATCA TTTACGCCTT TAACTACCAT TTGAACATGT 2400 

CCACCTTTTG nAGGACCAGT TAACAAAAsT TTTtTGGCAC CTGCTTTAAT ATGTGCGATG 2460

GCTTTATCAC CATGaTTAAA TTTACcAGTT GCATCTATAG CAATATCGAT ATCTAATTCT 2520 

TTCCATGGCA AGTTTTCAGG ATTGCGATCA GCAACCAATT TAATTTTATG ATCT 2574 
(2) INFORMATION FOR SEQ ID NO: 442:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 
CAAGGTACCG GTTTAGGTTT GTTCATTTGT AAAATGATTA TCGAAGAGCA TGGTGGTTCC 60

ATAGATGTTA AAAGCGAATT AGGGAAAGGC ACAACATTTA TTATTAAACT ACCAAAACCA 120 

GAATAAAATT GAATATAGTT ATTTCAGAAC GCATGTTATT GATTAGAGAC TCTAATTTAT 180 

AGCATGCGTT TTTTGATTGA TGTGGGGAAT TTTGTATGTG GATTAGAACT TAGGGTTTTT 240

GCGAATATCA ACTATTAAAT ATATTACTAA TTTATACAAA AATATAAAGT TTGATAAAGT 300 

TATTTATTTG ATTATAAAAA TAGGGTAAAA TATAGATATA TTGTATTAAT TAAATTATTC 360 

GAGGTGTCAT ATGAAAAAAT TCATTGGATC AGTTTTAGCT ACGACATTAA TTTTAGGGGG 420 

ATGTTCCACG ATGGAAAATG AATCAAAAAA AGACACAAAA ACAGAAACAA AATCTGTACC 480

AGAAGAAATG GAAGCTTCAA AATATGTAGG CCAAGGCTTC CAACCGCCTG CAGAAAAAAA 540 

TGCGATTGAA TTTGCGAAGA AGCATCGTAA AGAATTTGAA AAAGTAGGTG AACAATTCTT 600 

TAAAGATAAC TTTGGACTAA AAGTTAAAGC TACAAATGTT GTAGGTAAAG ATGATGGTGT 660

AGAAGTTTAT GTGCATTGTG AAGATCATGG CATTGTATTT AATGCAAGTC TACCTTTGTA 720 

CAAAGATGCC ATCCATCAAA AAGGATCAAT GCGCAGTAAT GACAATGGTg ATGATATGAG 780 

TATGATGGTG GGTACAGTGC TGAGTGGCTT TGAATATCGA GCGCAAAAAG AAAAGTATGA 840 

TAACTTATAT AAATTCTTCA AAGAAAATGA AAAGAAATAT CAATATACAG GCTTTACAAA 900 

AGAGGCAATT AACAAGACAC AAAATGTCGG ATATAAAAAT GAATATTTTT ATATTACATA 960 

CTCTTCTAGA AGTTTAAAAG AATATCGAAA GTATTATGAA CCACTGATTC GAAAAAATGA 1020 

TAAAGAATTT AAAGAAGGAA TGGAACGAGC AAGAAAAGAA GTGAATTACG CTGCAAATAC 1080 

AGATGCTGTT GCTACACTTT TTTCTACTAA GAAAAACTTT ACTAAAGACA ATACAGTAGA 1140

TGATGTAATC GAACTAAGTG ATAAATTATA TAATTTAAAA AATAAACCAG ATAAATCTAC 1200 

AATCACAATA CAAATAGGGA AACCCACTAT TAATACTAAG AAAGCCTTTT ATGATGATAA 1260 

TCGTCCAATA GAATATGGGG TGCACAGTAA AGATGAATAA AATTAATGAT AGGGATTTAA 1320 

CAGAATTAAG TAGTTACTGG GTTTATCAAA ATATTGATAT AAAAAAAGAA TTTAAAGTTA 1380 

ATGGAAAAAG GTTTAAACAA GTAGACAGTT ATAATGATGA TAAGAATAGT AATTTGAATG 1440

GTGCTGCTGA TATTAAAATA TATGAGTTAT TAGATGATAA AAGTAAACCA ACTGGTCAAC 1500 

AGACAATAAT TTATCAAGGA ACATCTAATG AGGCAATTAA TCCAAATAAT CCATTAAAAT 1560 

CATCGGGGTT TGGAGATGAT TGGCTCCAAA ATGCTAAATT AATGAATAAT GATAATGAAA 1620

GCACAGATTA TTTAAAGCAA ACAGATCAAT TATCAAATCA ATATAAAATA AAGTTAGAAG 1680 

ATGCAGATAG ATTATCAAAT AGTGATTTTT TAAAAAAATA TAGAATGGAA TCAAGTAACT 1740

ATCAAGGAGC GAAACATCCG AATGAAAAAG TTGTTGCTAC TGACTCAGCA ATGATTCCTT 1860

ATGCTGCTTG GCAGAAATTT GCTAGACCAC GCTTTGaTAA TATGATTAGT TTTAATAGTA 1920

CCAACGATTT ATTAACATGG TTACAAGATC CATTCaTCAA AGATATGCCA GGAAAACGCG 1980

TTAACATTAA TGATGGTGTG CCCAGGTTAG ATACTTTAAT AGACAGCCAT GTAGGTTATA 2040

AAAGGAAGTT AAATAGAAAA GATAACACAT AOGATACTGT ACCACTAATC AAAATAAAGT 2100

CGGTAAAAGA TACAGAAATT AAAAATGGAA AAAAAGTAAA AAAGACTATT AACATAACAT 2160

TAGATATGGA TGGGCGAATT CCAATAAATG TTTGGACAGG AGATTCGATT GCACGTTCTG 2220

GAAGAGGAAC TTTAATTAAA CTTAATTTAG AAAATCTTGA TGCGTTGAGT AAACTGATTA 2280

CTGGTGAAAC AAGTGGTATG TTAGCAGAAT GCGTAATCTT TTTAAATGAA AGTTTTAACA 2340

TCTCAGAAAA TGAAAATAAA AATTTTGCAG ATAGAAAGAa ACAATTATCA GAAGGATTTA 2400

AGGATAAGAT TAACTTATTT CAGTTAGAAG AAATGGAAAG AACTTTAATT AGTAAAATAA 2460

ACTCACTTGA AGAAGTTGCA GATGAAACAA TAGAAAGTAT TAGTGCTGTT AAACACTTAT 2520

TACCTGATTT TGCATTGGAT GCATTAAAAG AAAGAATTAA TGAGTTGTTT AAAGGTATAA 2580

AATCTTTTAT AGAAAAAGTG TATGATAGTA TAGATAATGA AATTTTAGAA ATTTTCAAAA 2640

ATATAGATCA CGACTTCAGA GATGGAGTAT CTGAAGAAAT GATGAAACAT TTGAAAGTAG 2700

TGAAACAGAA TATAGACCAA ATAAAAAATC AAAATGATAT TTATGGTAGG CAAATTGCAG 2760

ATATTAGAAG TATTATGAAA CAACAAGATG CAACAATTTT AGATGGAAAT TTTCAAATTA 2820

ATTGTAGCGG CGAAAATATG GTACAGGGTC TAGTTATACC TTCTAATTAT TTAGGAAGAA 2880

AAATGAAAAT ATTAAAAGAC CATATCGATG ATGGTATTAA AAAAATAGCA GACTATGTTC 2940

AAGGTATATA TGATGAATAT GCATCGAAAA TTGTCGATGT AATAAAATAT TTGATTAATA 3000

CAATTCCCAA AATACGTAAG AATTTAAGAC ATGCAATTGA AATGTTAAAT GTAAAAAAGA 3060

AAGAATTTTT GTCCCTGATT CCTAATGTAA CTTGTAATTA TATTAAAACT AAATTAGAAG 3120

AATTAGATAA TACTTTAGGC AAATGGGAGC CTTTTCTTAA TGATTTAAAA GCAGTGTCAC 3180

CAATTTTAGA TAACCATTTA GATGATATTG TTAAGAACAT GAAGCCTTTG ATTGTACAAA 3240

TGAyATwTGA ACCATCACAT TATGaCGATA TGTTTAATTC aAGAAAAGCT TTAACGcCAG 3300

TGTTCTCAAG CGTTTTATAA AGGTTG 3326



(2) INFORMATION FOR SEQ ID NO: 443: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5301 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 443:



ACCTATAGCT GGCCCGTCAA GTCCTGAATC ATATGTCGCT ATAAACCCAG TTGCATGCCC 60

AGCAATCTCG GTTTCAATTT CAAAATCATG CTCTTTCAAA CGATCTATTA AAGTTCGAGA 120

CGCAAATATT TCTTCATTAC CAAGTTCAGG ACGTTCATGA ATTCTATGAC TGATTTCGAT 180

ATAACTATAT TTATTTGTTT CTATATAATC GAGAATTTGT TGTTTTTCAC TCATTTTTTA 240

CTATCTCCCT TTTACCCTAC ACTCATTTTA ATCATCCACA ATATTTTGTT CTTTCAAAAT 300

GAATTATTAC TTATTCTATC GGTTTTATCT CATGATGTCA TCTAGTTTTT CTTTATTTAC 360

AAAATTTTCT AATAACTAAA aGCCTTTCAT AAATTTATAA AACAGTTTCA AATTGTAAAA 420

CATAAGCCCT ATTGTTACAA TTTAAGTATT GTTAGAAACA CAACTTACAA CAAACTTAAG 480

TTTTATAATG CAACAAATCA TAAGCGTTTT ATAGTTTTGA GTAGTTAATA GGAGGAAATT 540

CAAAATGACA AAAATGAATG TTGAAAGTTT cAATTTAGAT CaTAcTAAGG TGGTTGCCCC 600

ATTTATTAGA TTAGCGGGAA CGATGGAAGG ATTAAACGGA GATGTCATTC ACAAATACGA 660

CATTCGTTTC AAACAACCAA ACAAAGAACA TATGGATATG cCCGGACTAC ATTCATTAGA 720

ACATTTAATG GCTGAAAATA TTAGAAATCA TAGTGACAAA GTTGTTGATT TAAGTCCTAT 780

GGGTTGCCAA ACTGGTTTCT ATGTATCATT TATTAATCAT GATAATTATG ATGATGTATT 840

AAATATTGTT GAAGCAACTT TAAATGATGT GCTAAATGCT ACTGAAGTGC CTGCTTGTAA 900

TGAAGTACAA TGTGGCTGGG CAGCAAGTCA TTCATTAGAA GGTGCTAAAA CTATCGCTCA 960

AGCATTTCTA GACAAACGAA ACGAATGGCA TGATGTTTTC GGTACAGGAA AATAAATCTT 1020

AGTCAATCAA GTTAATCAGA AAaGCAGTCG AACAATGATT TTACAATCGC CATTGTCCAA 1080

CTGCTTTTTA TTATGCTTCA AAGTCAAAAA ATCGAACAAA TGAAAAAGTA AAATCTTTAA 1140

CATTTGTCCG ATTTATTTGA GAACCACTAT AATTTCTTAA TTAGTCCCAT TAACACGAAC 1200

TGCATAGGTA ACCTTAAATA TAGTTGCCAT GTTGGCAATT GTTTATCTCC TAAAGGTAAC 1260

TTTTTAACTG CCATATAGAT ATTAGCTGGG AATACAGCTA GCAAGAATAG ATTGATTGTA 1320

TTTTTCAAGC ATTGAGATGG TCTTTTAATT AAAAGTGCAA GTCCAAATAA TATCTCAAAG 1380

ATTCCTGTAA CAAGAACCGC TGTTTTTCGA AGTGGCAAAC ATTTCGGTAT GATATTTCTA 1440

AATTGTCGTT CTCGTGTAAA ATGCAATACA CCTATTACAC TAAAACCTAT TCCTAATAAA 1500

TATCTTAGTA TGTTCAATCA GCTTCAACTC CTATTCTGTA ATGATTTTAT GAATTAATGT 1560

AGGCGATACA ACATGaTCAG CAATTGTTAT GCTTGAATCT AA'IT'ITITAA CAACATCGTC 1620

AATTTTTTTA TTTAAAACAA TACCAACCGC TAAATCAATA TCATCCTCTT TTGTTAAACG 1740

TCCCGCTCCT AACATCATCG AAGCGACACC TATATCGTTA GAGACTAATT CAGTCACATA 1800 

ACCTGATTTT TTAGCTTTAT ATTCAATTTG ATATTGAGCT TGTGGCAAAC GCTCTGGATG 1860 

GTCAATAACA GTTTCGTCGC CACCTTGGTT TTTAATAAAT GTTTTGAATT TTTCTAATGC 1920 

TGCACCTGAA TTAATTGCCT CAATTAGCAA CGCTCTCGCT TCTTCAAGCG TTTCAGCTTT 1980 

GTTTGCAAGT ACAACCATTT GAGAACCTAA TGTTAATACA AGTTCTGTTA AATCTTTCGG 2040

ACCTTGTCCT TTCAACGTAT CAATTGCTTC TTGTAACTCA AGCGCATTGC CAATCGCACG 2100

TCCAAGTGGC TGATTCATAT CAGAAATAAT CGCCATCGTA TTACGTCCCA CATTATTACC 2160 

AATACGTACC ATTGCGTGCG CTAATGCTTC AGCATCTTCT AATGTTTTCA TAAATGCACC 2220 

GCTACCAGTT TTTACATCTA ATACAATTGC ATCTGCACCA GCAGCAATCT TTTTACTCAT 2280 

AATTGAAGAG GCAATTAATG GTATTGAATT GACAGTACCA GTAACATCCC TTAAGGCATA 2340

TAATTTTTTG TCTGCAGGAG TTAAATTTCC TGATTGTCCT ACAACTGCCA CTTTATTTTC 2400

ATTAACCAAT TTCACAAATG TTGCTTCATC TATTTCAACA TGAAAACCAT CAATTGCTTC 2460 

TAATTTATCA ATCGTACCAC CTGTATGACC TAATCCACGC CCACTCATTT TTGCAACAGG 2520 

AACATCTACA GCTGCTACTA ATGGTGCTAA AACCAATGTA GTTGTATCTC CTACACCACC 2580 

TGTTGAGTGC TTATCTACTT TGACACCTTT AATATCACTC AAATCTATCA TATCACCAGA 2640 

ATTAACCATA GCCATCGTTA ATGCTGCACG CTCATCATCA TTCATATCTT GGAAATAAAT 2700 

CGCCATTGCT AAACTTGATG CTTGGTAATC AGGAATATCC CCTTTAACAT AGCCGCCAAT 2760 

AAAGAAATTA ATTTCTTCCG TTGTTAGTGT ATGACCGTCA CGCTTTTTCT CAATAATGTC 2820 

TATCATTCTC ATTTTTATCA TCCTTTTCTT AAAAAGCTTA GGACAAAGCA TCTGCGCTTT 2880 

CTCTAGTCCA TTTTTAAAAG CACAAGCGAA AATTATTATA GCAAGCTATC GATTTCATTT 2940 

TTAATATCAC AATTTCAATG CGATGTTATT ATTCTTAAAT AGATTGGTTA TAACGTTAAA 3000

GTCCCTATTA AATTATCTTA GAATCATCAT GGCATTTATG ATGTCTTAAA GCTGATATCG 3060 

ACATACTTAT ATATGGTTAC GATGTCCCAT GCTTACATAT TTTTATAAAA TTAGTAATCT 3120 

GAATCTGCTT CTAAACCTTG CATAATTTGa ACGCCTGCGC TCGCACCAAT ACGTGTCGCA 3180

CCTGCTTCAA CCATTTTATT GAAATCTTCT AAATTACGTA CGCCACCTGA TGCTTTTACT 3240

TCTATATCAG CACCTACTGT ATCTTTCATT AATTTAACGT CTTCTGCAGT CGCACCGCCA 3300 

CCTGCAAAAC CTGTTGAAGT TTTAACGAAG TCCGCACCAG CCGCTTTTGT TAATTCACTC 3360

GCTTTTACAA TTTCGTCATG GTCCAACAAT ACCGTCTCAA TAATCACTTT TACTGTGTGA 3420 
TTTAATGCGC CGATGTTGAT GACCATGTCA ATTTCATCTG CACCATTTTG AATCGCATCT 3540

TCTGTTTCAA ATGCTTTCGT CGCAGTTGTC GATGCACCTA ATGGGAATCC TATTACCGTA 3600 

CAAACTAACA CCTCTGAATC AGCTAGTCGC TCTGCTGCAT ATTTAACATG CGTTGGATTC 3660 

ACACATACAG ATTTAAAATG GTATGCTTTC GCTTCATCGA TGATTTGATC GATTTGCGTA 3720 

CGTGTTGACT CAGGCTTCAA TAAAGTGTGA TCAATCAATT TTGCACTATT CATTTTCTAT 3780 

CTCCTCCTTT ATGGTTGATT ATAAAAATAC GGTTGTAAAT TAGTTGATTG AGCGTCAGGT 3840 

TCATTTAAAT ATCAGGTTAG ATGTTCGCTT TTTATGTAAC CGCATACATA TACTATTACA 3900

TTAATTCATT TCCCATAAAC AAACAATACA ATTGAACGTG ATATCTTCAT TATGAACGAT 3960 

GACTTGACAA CAAGCTAATC AGGATTATAT TTTTATAATT CTTTAATTCT ATAGTACAAA 4020 

AATTCGCAAA AAAGGGAAAC AAATGTTATC TTAAAATTAT TAATGAATAT TAAGGAGAAG 4080

ATAACAAATG ACAAAAGGTA CACCACATAT TCAACCAAAT GGAGTAAAAA TTGCTAAAAC 4140

AGTATTAATG CCTGGCGATC CGCTACGTGC AAAATATATT GCTGATAATT TTTTAGAAAA 4200 

TGTTGAACAA TTTAACGATG TACGTAACAT GTTTGGTTAC ACTGGTACAT ATAAAGGTAA 4260 

AGAAGTTTCT GTAATGGGTT CTGGTATGGG TATTCCAAGT ATTGGTATTT ACTCATATGA 4320

GTTATACAAC TTCTTTGATG TAGATACAAT CATTCGTATC GGTTCTTGTG GCGCATTACA 4380

AGAAAATGTT AACTTATACG ATGTTATTAT TGCACAAGCT GCATCAACTA ATTCAAATTA 4440 

TGTAGATCAA TACAATATTC CAGGTCATTT CGCGCCTATC GCTGACTTCG AGTTAGTAAC 4500

TAAAGCTAAA AATGTCGCTG ACCAAATCGG TGCTACTACA CACGTAGGTA ACGTATTATC 4560 

TTCTGATACA TTTTACAATG CCGATCCAAC ATTCAATGAT GCTTGGAAAA AAATGGGTAT 4620

TTTAGGTATC GAAATGGAAT CAGCTGGTTT ATATTTAAAT GCGATTCATG CTGGTAAAAA 4680 

AGCACTTGGT ATTTTCACAG TAAGTGATCA TATTTTACGT GACGAAGCTA CTACACCTGA 4740 

AGAACGTGAA AATTCATTTA CACAAATGAT GGAAATCGCT TTAGAAATCG CAGAGTAACT 4800

TATTTAAATT GACTTTAATT GCTCTTTAAC AATGCGATTA AACTCAAAAA GCCAACACAT 4860 

TCTGGGCGTA TCCCCATTTA TGTGTTGGCT TTTATTTATA TTATTACTTA TCTGTAGATT 4920 

AGCTTAAGTA AGATTTAAAC ATCCAATTAT GTTTATCTAC TGATGTTTGC ATACCTA7AA 4980 

ACATATCTTC TGATACATCA TCGCCAGCAT TACCAGCAAT TTCGATTGCG TTTTCTAATT 5040

GTTTTGAGAT ATTTGTGAAG TCTTGTGATA ATTCTTCAAC CATTTGTTCT GCAGAGTAAC 5100 

CTTTCGCAGC TTCTTTAACA ATTGATTGCT CTAAGCATTC AGTTAATGTA CCTACAGGGT 5160 

TTCCTCCTAC CGCTAAAATT CTTTCAGCTA ATTCGTCTAC ATATTGGCTT GCTTCATTAT 5220 
AATTGTGTAG CTTTGTGTAA G 5301

(2) INFORMATION FOR SEQ ID NO: 444:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 11466 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 444:

GAAGATGAAA GCTATATTGA AGATGATGTT ACTAAAAAGG AAGCTATTTT AAGTATGCAA 60 

ATTCCTAAAG GTTTCTCTCA AAAATTAAAA GAGAACCGTT TAAAAGAAAC GATACAGTTA 120 

TATGGTAGAG ATGACTTTAT AGGTGGTATT GCTGTAGAAA TTGTTAGTAG TTCATTATAT 180 

GAGCAGCAAA TTCCTAACAT TATTTATGAA CACCTTGAGG ATATGAAACA GCATCAATCC 240

ATCGATGCTA TCAACAAGTC CTATCATAAA CATACACCTG AATCTAAAAT CAAATTTGTG 300 

TCGCTTACTA AACAAGCACA ACACTCTATT TCAATTAGCT TAATCTTTGC GGTGATTTTA 360 

TTTGTTAGCG CTGTTCAAGT AGTACTTCAT TATCGTTTAA ACCAACAAGC AGCATTGCAA 420 

CGATTATCAC AATATCATTT AAGCCGTTTC AAACTATATA GTACTTATGT AATGACACAT 480 

ACGATTTTGT TATTGTTGGT ACTATTGGCA GTTAGTCTAT ATTTGTCTCA ACCACTCAGC 540

TTAATATTTT ACTTAAAATC ACTGTTACTT ATATTGATTT ATGAGATAGG TATCGTCTTT 600 

ATCTTATTCC ATATTCAAAC AATAAGTCAT CGATTATTCA TGACATTTAT ATATGCACTT 660 

GCTATGGGTA TCGTATACTT GATTATTTTC ATGTAAAGGA GCGTAACTGA TGATAGAAAT 720 

TAATAACCTT TCAAAGCGTT ACCGTAACAA ACAGATTTTC AATCATTTAA CTATGTCCTT 780 

TGATAGTAAT CGTTTAACCG TATTACTTGG TGATAATGGT GCTGGAAAAT CAACATTACT 840 

TCGTATGATT GCTGGTATTG AAAAAGCTAA TGATGGAACT ATCAACTATT TCGGCGAAAA 900

ATGGAATCAA AGACaAATAC aAAATCACAT CGGTTATGTG CCACAAGACA TTGCGTTATT 960 

TGAACACATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 1020 

TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 1080

AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 1140 

TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 1200 

TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 1260 

TCATTTAGAT GAAGTTGAAG CACTTGCAGA TGATATCAAG TTAATTGGCC AAGATCCTTT 1320 
CCCAAGCTGC GTATGATATC GCAACTTGGG ATTTTCTGTA TTATCTACTT TGCAAGTATG 1440

ACGTTGGGTC TACTGCATAT TGATTACCGA TGCCACCAGA CATACGTTGG AAGTGTACGT 1500

GAGGCGCTGT TGAATTACCC GTACTACCTG AATATGCAAT TTGGTCACCA GCTTTGACTT 1560

TATCACCAGC TGAAACAGTT AAACGATTAT TATGCATATA CCATTGGTAG TTATTACTGT 1620

TCGCTTCTTT AATCGTTACT TGATTGCCGC CACCATAGTT ACTCCAACCT GCTTGTACTA 1680

CTGTACCATC AGTTAATGAG TAAACTGGTG AATTTTCAGG CATTGCATAG TCGACACCGT 1740 

AATGCGCACC ACCACCGTGA TATTGTCCAT ATGGTTGTAG TTGTTTACGA CTTGTTAACC 1800 

AGCTTGCGTC TTTCGCATGA CCACTAGcTG TCGCTTTACT TGCTGATCCA CCATTTTGAT 1860

TAGATGTGCC ATTAGGATAA TTGACCTTTC CATTACCATC ATGGCTGTTA TACGCTTGGT 1920

TGTTGTTACT ATGTGAATAA TAGCTCGCGT CTGGACCTAC ATTTGATTGA TAACCATATT 1980

GATTAATATG CTGTTGGCTT TGACTCGCTG TGTAGTCATT GTTATCTCCT GCTGTTGCTG 2040

GATTCACATA TGTTTGGCCG CTTCCATTGG CATTTGCATT TTTTGGATAA CAGTTATAAA 2100 

AATAATGCGT ATGTCCTTGA GCATCTACGA ATGTATAGCT ATATTCTTTA TTATCAAACA 2160 

TTGCTTGATT CCAGTTACCA TCAGGTGTGT GATGATAATC CCCATTAGAA TCAATTGTAT 2220 

AATAAGTACC ATAAGATACG TCTTGTGATT GTGTTGACAT TTGTGTATGT GCTTGTTGGG 2280 

TGTTTGTCGT TTCTGCTGCA TCTGCTTGAT GCGCCATTGT AAATGTAGCG AAsCCATCGT 2340 

TGCAATCGCT GCTGCTGTTA ATTTTTTCAT GTATAAAACA TCCTCCATTA AAGTTAAAGT 2400

TAGTTTTCAA TTAAACTGTA CTGCACATAC TAAAAGAATT AGACAACTGA GTAAAGGATT 2460

TAATTCTCAT TTTCCAACTA TTTAATATTC CCGAAATGTT TTACTAAACT CATTACATTG 2520

TCATTACAAA ATAGCCATAC ATTGATATTA AAATGACATC TCTCACTGCA TTCGTTTAAC 2580

CTTTTATAAA TTTTCAAAAT TAACAACTAA TCGTTCGTCC ATGTTCGCGA TTCAACGCTA 2640

ATGCATGATA GTAATCATCC ATCAAATCAT ATCAACCAAA TTCCATTATC AATCGCTATT 2700

GATTGTCATT CAACTTTCTA ATAGTGATAT GCTTCTCAGG CTTAAAAATC GTCATATCTT 2760 

TTCTATTAAT TAAATCATCT GTGAGCTTTA ATGCTACTAA TTCATTGCTG CCATAATACT 2820 

TAATATATAA CGTTCTTGTA GTTAAATTTA TTACGGTCTG ATACATCGTA TAGTGATTGT 2880

CAGCATCATG CGGACGTACA ATTCCAATCG GTATATTTAC CGCATCTAAT AAATAAAATG 2940

CATTCATTAA ATCCATTTCT TTATCATTGT TTTGAGCAAT GTTTGCTTTC ATAAATGCCA 3000

TTCTCACAAA GCGCTCAGTT GAAGTAAATC CACCTGGCAA TCCAAATGTA CCTGCTTCAT 3060

TGCCTAAAGG TTCAATCGTT ACACCTTCCA ATAAATTTGC TGTTGCTGGA TAAGGAGAAA 3120

CACCAATAGG ATTATCTTTT ATAACCACTT CACCCTCTTT AAATGAAACT TCGACTGTAT 3240

GTCCAGTTGC ATCGGAAACA TGATAATGCA ATGGCGGAAC TTCACCGATG TCATTTAAAT 3300

ATACAGCTAC AACATGTATT TGGGATGCTT GTTGTTTCAT ATCTTCAATG CTTGTTGTAT 3360 

ATCCCAAAAT CCATGTCACA ATTTCATTTT GCGTAATATT CATCGCGTCC GCTTTGTGTG 3420 

TTGATCCATA TGAACTATAA CCTCGGAAAT ATTGTGTTGA AATGGCAACG CcATGTtCAT 3480

TAACACCATC ACCATAAATA AAACCTTCCA TATCTGTTCC TGTGCCAATA AAGCCATATT 3540

GCGTTTGGCC TGTCGTGCCA GTGCAAGATT TCCAACGATA ATTTCTAGGC GTCACTGCTG 3600 

GCGAACCATC TAATGGATAA TCATAATCCA TCGTGCGTCC AAGAAGTACT TGATTATTTA 3660 

AAGTTTGTAT TGTGAATCCT GTGCACATTG TTCTCACTCC TCTGTACCTT CATTTACTTT 3720 

AATCACTTTC AAATAAAGCT GTTTCACTTA AACATACTAT AAAAAATCAA TTATACAAGC 3780 

AATTAATTGA TATTCATTCT CAATAACTGT GGTATGATAT GTAAGGAAAT CATGACTTAT 3840

GTGTGAGTGA ACGATCATCT ATACATCCGT TCACTTCATC TCATGACTTT CTATATTTAA 3900

TTTTTACAAG GAGTGACATC TGTGAATAAC ACACAATCTT CACCACGCAG TAATATTATT 3960

ATTGCGATTA TGTTGTCTGC ATTAACATAT TGGTTGTTTG CACAATCATT TATTAATATA 4020

GGACCTCTCG TTGGTCAAAC ATATCAAACC TCTCCTGCCG TGTTAAATTT ATCTATTAGT 4080 

TTAACTTCCT TCGCCACAGG TATCTTCATG GTGGCTGCAG GTGATATTGC TGATAAAATA 4140 

GGACAACTGA GAATGACATA CATGGGTCTC ATAATCAGTA TGTTTGCATC TCTTCTATTA 4200 

ATTATATCGG ACATCACTGC ACTGCTCATC ATCGGTAGAA TTTTACAAGG TCTATCAGCA 4260 

GCTATCTTGT TACCTTCAAC AGTTGGCGTG TTAAATAATC AATTTAAAGG AGAACATTTA 4320 

AGACGAGCGA TTAGTTATCT AATGATTAGT ACTGTTGGTG GCATCGGCCT AGCTGGTGTT 4380 

ATCGGCGGTT TAATTGCCTC AAATTTCGGA TGGCAAACGA ATTTCATCAT TAGTATAGTC 4440 

ATTGCTTTCA TTGCCATATT GCTTCTAAAA GGCACACCTG AAAAAGTAAG TCAACATAGC 4500

CACCGTCATC CATTCGATTA CAAAGGTATG TCGATTTTCG CTGTTATGAT TGGTAGCTTT 4560

ACATTATTGT TAACACAAGG ATTCGAACAA GGTTGGTTTA GTACATTTTC AATCATTTGT 4620 

CTGAGCATTT TTATCaTCAc TACGTTGATA TTCATCATCA TCGAACGTCG ACATGAAGTA 4680 

CCTTTTATTG ATTTCTCAGT ATTACGCAAC CGTCCGTTCA TTGGTGCATT TTTAAATAAC 4740

TTTGTTTTAA ATAGCGGTCT AGGCGTAACA GTGGTCTTTT TCATATATGC TCAAACACAC 4800

CTTGGTTTAt CAgcTGCGCA ATCTGGACTT GTTACATTGC CATATGCCAT TGTGGCAGTT 4860

GCGATGATTC GTTTAGGTGA AAAAGCAACA TTACGTTTCG GTGGCAAATT GATGCTCATC 4920

TCACAATATG TCATTGCAGT TATCATTGGT TTCGTCATAT GTGCGATAGG TAATGGTTTA 5040

GTCGCAACAC CTGGACTTAC GATTGCAATT TTCAGTATGC CTAATGAAAA AGTTGGTTTA 5100 

GCTACAGGAT TATATAAAAT GAGTGGTACA TTAGGTGGCT CCTTTGGTAT AGCACTAAGT 5160 

ACTACAGTTT TCAGTATGTT ACAACTAAAC TATGCACCAA GTGTAGCTGC AACCGTAACA 5220 

TTTATAGTCA GCATTGTATT GATGATCCTT GGCTCATTGT CTGCATACAT GATCATTCCA 5280

AAAACAGTTA AATCTTAAAT ATAATAGAAG AATTATGTTT CGAAATATCT TTATCACTTT 5340

AAAATGATAT ACAAGAAATC CAAGAAAAAT AAGCGAACTG AATAAATAAA GATTCAATTA 5400

ACGCATCAGT ATTAGGATTC ACTCTAAAAC GATTAATAGT TTTATAAGAA GGTGTTTGAT 5460 

CTTGAGCTAA CCACATCATT CGAATACTGT CATGAAGTAA TTTCTCTATT CTACGACCAG 5520 

AAAATACAGA TTGAGTATAT GCATATAAGA TGATTTTTAA CATCATCTTT GGATGATAGG 5580 

ATGTTGCGcC ACGATGATGT CTGAATTCAT CGAATTTGCT ATCAGGTATC GTTTCAACAA 5640 

TTTCATTAAC ATGTCGCGAA ATATCATTTT GAGGAATTCT AACAGAAGTT TTTATTGGTA 5700 

GTGTAAGTTG GGCAAAGTGT CTTATTTTTT TAAAGTATTT CAAAGTAAAA TTACATGTTA 5760 

ATACGTAGTA TTAATGGCGA GACTCCTGAG GGAGCAGTGC CAGTCGAAGA CCGAGGCTGA 5820 

GACGGCACCC TAGGAAAGCG AAGCCATTCA ATACGAAGTA TTGTATAAAT AGAGAACAGC 5880 

AGTAAGATAT TTTCTAATTG AAAATTATCT TACTGCTGTT TTTTAGGGAT TTATGTCCCA 5940

GCCTCTTACT CTAATTATAT TCACTATCAA TTAGACAAAA TGGCCATTTT CAAATATCAC 6000 

GCGTTGTTTC TGACCTTGAA TATATTTATT ATAATTCTCT TTTTGAAAAT CAGTTAACAT 6060 

TAATTTAGAT GTACCGTATT TTAACACTTT TTGCATTGTT TCTATTCTCA TTTTTCTAAA 6120 

TAACCATCCA TCTTTTAACA CAATACGATT AACAGCATCA TATGATAATT CTACTGTTTC 6180 

TTTAATTTCA AATGTCTTGA ATGAAATAAT CGTGCACATT AAAAACGTAT CACCAAAGTA 6240

ATAAACATCT AAATCATCAC GTTTATGTTG TCCAACTAAC AAACGACCAT ATTCGAACTC 6300

TTTTTCTGGA TATTTCAATT CTAAAAAACT AATAATCTCT TCTTCTTTTA ATTTGAATTG 6360

CATTTAAAAA CATCCTCTCT TAAGTTTTAA CAAGCCTTAA TTAAAAATTT TTTCAATCAC 6420 

ATAGTTCAAT ATACATCATT TCGTTATGTT TTTTAATACT TTGTTCAAAA ACAAATATTT 6480

TATTCTTTAA AATAATGACT TTTGTATTTT TAATCACAAT AAACATTTTA AAATTCTTGT 6540

TATCATAATC ATTAAAAGGT ATTAACCTTA ATAATATATT CTCTCGTCTC AACCTTAATC 6600

GTATACTTCA GACGTCTGTT TGTAGACAAT AAAAGTCATT CACGTCTTCA TATGTCATCA 6660

AATGTTTATC ATGATATGAT GAATATAATA ATCGGGTATA TAACTGTATG ATTAATTACA 6720

TGGTGTATCA GTTACAACTG TGTCACATAT TTTAAATCAT AATGATAGTC GTTTTTCCGC 6840

AACAACGATA AAAAACGTAC ATGCTGTTTC AGAACGTTTA GGCTATGCCC CTAATAAACA 6900 

TGCAAAACAA TTGCGCGGCA GTAAAATTCA AACTATTGGC GTCATTTTGC CTAGCTTAAC 6960 

AAATCCGTTT TTCTCAGCAC TGATGCAAAG TATTCATGAC CATAAACCAT CTGATGTTGA 7020 

TTTATGCTTT TTAACATCTA CAGCAACTGA TTTGTATGAC AATATTAAAC ATTTAATTGA 7080

TCGAGGTATT GACGGATTAA TTATCGCACA ATACATATCA TCCCCGGACG CCCTAAATAA 7140

CTATCTAAAG AAACATCATG TACCTTATGT CGTACTGGAT CAAAATGACC ATCAAGGCTA 7200 

TACAGATTTT GTTCGGACAA ATGAATATCA AGGTGGACAA CTTGCAGCAC AACATTTAAT 7260 

AGAACTCGGT CACAACCATA TGATAATTGT TGCACCATAT GACATGATGG CGAATATGTC 7320 

GACTCGTGTC GCTGGATTTG TCGATACTTT GCGCGCGAAT CAATTGCCAG AACCACAAAT 7380 

CGTCCATACT GAATTATCTA AGCGCGGTGG GCTAACCATT GTTGATGACA TCATGGTTCA 7440

ATCTGCCACT GCAATCTTCG CTATTAACGA TGAACTCGCT ATTGGCATTT TACGAGGACT 7500 

AATTGAACAT GGCATCAGTA TCCCGAAAGA TATCTCATTA ATAGGTTATG ACGACATTGA 7560 

TTATGCAGCG TACGTCTCGC CACCTTTAAC TACTGTGGCA CAACCTATAA CTGATATTGG 7620 

CAAAACATCT TTAACCTTAT TACTTCAACG ATTACAGCAC TTAGATAAAT CCATTGATAT 7680 

GATTGAATTA CCAACGACTT TAAAAATTCG TGCAACAACT GGCTATCATC TTTCAAACTA 7740

ACTACGTATC TTCCGAAATA TACTCATCAT TGTTAGGCCC TTAGCGTTGC TTTAATGCTG 7800 

AGGGTTTTTA ATCATAATTA TTTTACTAAG AAATTAAAAT AATAATGTAT GAATTTTTAA 7860 

ATATGATTTA AACGTTTTCA GTTTTTATGA AAACGCATGC ATTTTACAAA TAAAAATGGT 7920 


ACGATGGCAC TGGTAAAACG TTTTACTAAA AACAAATCAT GAGGTGTATA ACATGAGCAT 7980 

TGTTGCATTA CTTATCGGGT TAGGCCCCTT AATTGGCTGG GGCTTCTTCC CAACAGTCGC 8040 

TTCAAAGTTT GGTGGTAAAC CTGTACATCA AATTATCGGT GCTACTGTAG GTACGTTAAT 8100

CTTCGCTATT TTATTAGCCG TAGTCACATC AAGTGGCTTC CCTACTGGAA CCAATTTGCT 8160 

ATTCGCCTTA TTATCAGGTG CAGGATGGGG ATTCGGACAA ATCATTACAT TTAAAGCGTT 8220 

CGAATTAGTC GGCTCATCTC GTGCCATGCC AGTCACAACA GCATTCCAAT TATTAGGCGC 8280 

ATCTTTATGG GGTGTCTTTG CATTAGGAAA TTGGCCAGGC ATTGGTCATA AAATCATTGG 8340 

ATTTACAGCT TTAGTCGTTA TTCTAATTGG AGCGCGTATG ACAGTTTGGA GTGAACGCAA 8400 

AGAAGCAAGT AACGCCAAAA ATTTACGTCG TGCAGTGGTA CTTCTGTTAA TTGGTGAATT 8460 

TGGATACTGG TTATATTCAG CTGCACCGCA AGCAACTTCT ATTGATGGCC TAACTGCCTT 8520

AGCAGAGAAT CCATTCCGTA ATAAAATTAC GTGGTTACAA ATTATTTCAG GTTTCTTCTT 8640

TGCATTTGGT GCTTTAACAT ATCTTATTTC AGCACAACCT AATATGAATG GTTTAGCAAC 8700

TGGATTTATT CTTTCTCAAA CATCCGTTGT GCTTGCTACA TTAACTGGTA TTTATTTCTT 8760

AAAACAACAT AAAACGTCAA AAGAAATGGT TATTACAATC ATCGGCTTAG TACTCATTTT 8820

AGTAGCCGCT TCTGTTACAG TATTTATAAA ATAAGGAGTG TAGATGTCAT GAAAAAATCA 8880

GCTGTTTTAA ATGAACATAT TTCAAAAGCA ATCGCGACAA TTGGTCATTT TGATTTATTA 8940

ACGATTAATG ACGCTGGCAT GCCAATTCCA AATGATCATC GTCGTATCGA CCTAGCTGTA 9000

ACTAAAAACT TACCACGCTT TATTGATGTC TTAGCTACAG TGTTAGAAGA AATGGAAATC 9060

CAAAAAATAT ACTTAGCAGA AGAAATAAAA GAACATAACC CTACACAATT GCAACAAATT 9120

AAACAATTGA TTTCATCGGA AATCGAAATC ATTTTCATTC CTCACGAAGA AATGAAAAGT 9180

AACTTAGCTC ACCCATTAAA TAAAGGTAAT ATTCGTACTG GTGAAaCAAC GCCCTACTCT 9240

AATATTGGAT TAGAATCGAA TGTTACTTTT TAAAAGTTAT AACTTGAAAG GAGCGTACAC 9300

ATGACCAACA AAGTTGTTAT TTTAGGTTCA ACGAATGTCG ATCAATTTTT AACAGTTGAA 9360

AGATATGCAC AACCAGGCGA AACATTACAT GTTGAAGAAG CACAAAAAGC ATTCGGCGGA 9420

GGTAAAGGTG CCAACCAGGC TATTGCCACT GCACGCATGC AAGCAGACAC AACATTTATT 9480

ACTAAAATTG GCACTGATGG CGTTGCTGAT TTCATCTTAG AAGATTTTAA AGTAGCTCAT 9540

ATTGATACAT CATATATTAT CAAAACAGCT GAAGCAAAAA CGGGCCAAGC CTTTATCACT 9600

GTGAATGCAG AAGGACAAAA CACCATCTAT GTTTATGGTG GTGCGAATAT GACGATGACA 9660

CCTGAAGATG TTATTAACGC AAAAGACGCT ATAATCAATG CAGACTTTGT CGtTGCACAA 9720

TTAGAAGTAC CCATCCCGGC TATTATATCT GCATTTGAAA TTGCCAAGGC ACATGGTGTG 9780

ACGACAGTAT TAAATCCTGC ACCAGCGAAA GCATTACCTA ATGAATTATT ATCATTAATC 9840

GATATTATTG TGCCAAACGA AACAGAAGCC GAATTGTTAT CTGGGATTAA AGTAACTAAT 9900

GAACAATCTA TGAAAGACAA TGCCAATTAC TTTTTATCTA TAGGCATTAA GACTGTTTTG 9960

ATTACGCTAG GTAAGCAAGG TACATATTTT GCTACTAAAA ATCAAAGCCA ACACATCGAA 10020

TAAATGCGAT TGATACAACT GCTGCAGGCG ACACATTTAT TGGTGCATTT 10080

GTCAGTCGCT TAAACAAGTC GCAAGATAAC TTAGCAGATG CTATTGATTT TGGTAATAAA 10140

GCGAGCTCAC TCACTGTACA AAAACACGGC GCGCAAGCAT CTATTCCTCT ACTAGAAGAA 10200

GTAAATCAAG TTTAAATGAA TCAAACACAG CTATGATATG AAGGTTTAGC ATATAACATG 10260

CAACATTCGT ATATCATGGC TGTGCTTTTT TATCTTTATA AAACATCATC TATTAGAAAT 10320

TTTGTAATCT TTTTAACTTC CAAATTaTCG CATATAAATA TGCTATATTA ATGATAATAA 10440

TTATCAATTA AAAGGAGGTT ATGCTATGTC TAAAGAAGCT GGTCATACAT TTTTAGCTAA 10500

ATTAGGAAAA ACTCGTCTAC GCCCCGGTGG TAAAGAAGCA ACAGATTGGT TAATACAACA 10560

AGGGGCATTT TCACAAGATA AACAAGTGTT AGAAGTGGCA TGTAATATGT GCACAACATC 10620

TATTTATCTA GCTCATACAT ATGGCTGTCA CATTCAAGGC GTTGATATAA ATAAGAAAGC 10680

ATTAGAAAAA GCACAGGAAA ACATTTCAGC AGCAGGTCTT GAATCATATA TTCAAGTTCA 10740

ACAAGCGAAT GCTGTTAAAT TGCCCTTTGA TGACAATCAA TTCGATATCG TTTTAAATGA 10800

AGCAATGTTA ACAATGTTAC CCATCGCCAT AAAGGAAAAA GCATTACGCG AGTACTACCG 10860

AGTCTTAAAG CCTGGGGGTA TCTTGTTAAC ACATGATATT GTCATCGTTA ATGAATCACA 10920

TGCCACACAT GTTGTTAAAT CATTATCTGC AGCAATTAAT GTCAATGTCT CACCGCAGAC 10980

GAAACTTGGC TGGTTAGATT TATATAATCA AGCTGGTTTT AATCATGTGC ATTATCATAC 11040

TGGTCCAATG AGTTTAATGA CACCAAAAGG TTTAATTTAT GACGAAGGTA TTGTTGGAAC 11100

TATAAAGATT ATCAACAATG CTTTGAAAAA AGAAAATCGA CCAATGTTTT GTAAAATGTT 11160

TAAAACGATG ACTAAATTGC GTAAAGATAT GAATTATATT ACTTTTGTCG CTAAAAAAGA 11220

GCACTAAATA TAATGCCACT AACTGTACTT TGTATCTATG TTTGACTATC ACTTTAATTT 11280

CTTTGTGACA CTAATCATCT ACTTAACAAT ATCGTTATCG TTGATTAGTA AGTCATCAAT 11340

TTTGGTTAAA GACTTTCATA AACACTCAAA CATTAACACT ATACATAGTT AGTGGcATTA 11400

TTrmyCTn AAAATTTTAA CmTCmCGGGr TtGGGAmCrG AAaTGrtAwT TcGCrtnAACT 11460

TAwTcT 11466

(2) INFORMATION FOR SEQ ID NO: 445: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 445:

TTACATAGTT AACACTAGTT AATCTATTAG TTAACATTAG TTAATAATTA GTTAATTTCC 60

ATTTGTATTC TCATGTGATA AATTCTAAAA GCATACaATA AATTTAATAT GTAAAAAGAA 120

AGGGAATACA CATGAAAAAT AAATATATCT CGAAGTTGCT AGTTGGGGCA GCAACAATTA 180

CGTTAGCTAC AATGATTTCA AATGGGGAAG CAAAAGCGAG TGAAAACACG CAACAAACTT 240

AAGTATTACA TCTAAAAGGT ATCACAGAAG AACAACGTAA CCAATACATC AAAACATTAC 360

GCGAACACCC AGAACGTGCA CAAGAAGTAT TCTCTGAATC ACTTAAAGAC AGCAAGAnCC 420 

CAGACCGACG TGTTGCACAA CAAAACGCTT TTTACAATGT TCTTAAAAAT GATAACTTAA 480 

CTGAACAAGA AAAAAATAAT TACATTGCAC AAATTAAAGA AAACCCTGAT AGaAGCCAAC 540 

AAGTTTGGGT AGAATCAGTA CAATCTTCTA AAGCTAAAGA ACGTCAAAAT ATTGAAAATG 600 

CGGATAAAGC AATTAAAGAT TTCCAAGATA ACAAAGCACC ACACGATAAA TCAGCAGCAT 660 

ATGAAGCTAA CTCAAAATTA CnTAAAGATT TACGTGATAA AAACAACCGC TTTGTAGAAA 720 

AAGTTTCAAT TGAAAArGCA ATCGTTCGTC ATGATGAGCG TGTGAAATCA GCAAATGATG 780 

CAATCTCAAA ATTAAATGAA AAAGATTCAA TTGAAAACAG ACGTTTAGCA CAACGTGAAG 840 

TTAACAAAGC ACCTATGGAT GTAAAAGAGC ATTTACAGAA ACAATTAGAC GCATTAGTTG 900 

CTCAAAAAGA TGCTGAAAAG AAAGTGGCGC CAAAAGTTGA GGCTCCTCAA ATTCAATCAC 960

CACAAATTGA AAAACCTAAA GTAGAATCAC CAAAAGTTGA AGTCCCTCAA ATTCAATCAC 1020 

CAAAAGTTGA GGTTCCTCAA TCTAAATTAT TAGGTTACTA CCAATCATTA AAAGATTCAT 1080 

TTAACTATGG TTACAAGTAT TTAACAGATA CTTATAAAAG CTATAAAGAa AAATATGATA 1140

CAGCAAAGTA CTACTATAAT ACGTACTATA AATACCAAGG TGCGATTGAT CAAACAGTAT 1200 

TAACAGTACT AGGTAGTGGT TCTAAATCTT ACATCCAACC ATTGAAAGTT GATGATAAAA 1260 

ACGGCTACTT AGCTAAATCA TATGCACAAG TAAGAAACTA TGTAACTGAG TCAATCAATA 1320

CTGGTAAAGT ATTATATACT TTCTACCAAA ACCCAACATT AGTAAAAACA GCTATTAAAG 1380

CTCAAGAAAC TGCATCATCA ATCAAAAATA CATTAAGTAA TTTATTATCA TTCTGGAAAT 1440 

AATCAATCAA AAATATCTTC TCTAGTTTTA CATCATTTTT TAAATAATTT TCGTAACAAA 1500 

CCGTGATTAA AAAGAACCGT TGATTCTCAA TCGAATCTAC GGTTCTTTTT TCATTTTCCA 1560 

TCAATTAAAT GCTTCTTCGC TATTTGTCAG CCCACTTTTT TACCTGCAAC TTGTTAAATA 1620

ATCCTTACAT CGTTAACGAA TAGTTCATCA TTTAGTTGAA TCAGCTCAAC TTTATTAACT 1680 

TCATATTTTC ACAAACTATT GCGCAATCCA TTCCTTTTCC ACTACAAGCA CCATAATTAA 1740 

ACAACAATTC AATAAAATAA GACTTGCAAA GCATAGTTAT GTAGCTATAT AAACGCCTGC 1800 

GACCAATAAA TCTTTTAAAC ATAACATAAT GCAAAAACAT CATTTAACAA TGCTAAAAAT 1860 

GTCTCTTCAA TACATGTTGA TAGTAATTAA CTTTTAACGA ACAGTTAATT CGAAAACGCT 1920

TACAAATGGA TTATTATATA TATGAACTTA AAATTAAATA GAAAGAAAGT GATTTCTATG 1980 

ATTAAAAAtA AAATATTAAC AGCAACTTTA GCAGTTGGTT TAATAGCCCC TTTAGCCAAT 2040 
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CmTyCAAArG AcACaGACAT TACTAGCCAA CGATTTAGCT ATnACTCCAA ACCTTCCATT 2160 
GGATTTGGTA AAGGnT 2176 
(2) INFORMATION FOR SEQ ID NO: 446: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1557 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 446: 

AAAAGCATGG CTTAAATGAA GTACGCTATA ACAAATTACA AGAACATGCT ATTGTTATGC 60

ATCCGGCACC TGTGAATAGA GGAGTAGAAA TACAAAGCGA TTTAGTAGAA GCTTCAAAAT 120

CAAGAATTTT TAAGCAAATG GAAAATGGCG TTTACTTAAG AATGGCAGTC ATTGATGAAT 180

TATTAAAATA GGTAAGGGGA CGAAAATGAT GAAATTAATT AAAAACGGTA AAGTATTACA 240

AAATGGCGAA TTACAACAAG CAGATATTTT AATTGATGGT AAGGTAATTA AACAAATTGC 300

ACCTGCAATT GAACCAAGCA ATGGTGTTGA CATCATAGAT GCGAAAGGTC ACTTTGTGTC 360

ACCTGGATTT GTCGATGTTC ATGTTCATTT ACGTGAACCT GGTGGTGAAT ATAAAGAGAC 420

AATTGAAACT GGTACTAAAG CTGCTGCTAG AGGCGGATTT ACAACTGTAT GTCCAATGCC 480

TAACACAAGA CCGGTACCAG ATTCTGTAGA ACATTTTGAA GCTTTACAAA AATTAATCGA 540

TGACAATGCT CAAGTACGTG TATTACCTTA TGCTTCAATT ACAACACGTC AATTAGGTAA 600

AGAATTGGTT GATTTCCCAG CACTAGTAAA AGAAGGTGCC TTTGCGTTTA CAGATGACGG 660

TGTAGGAGTA CAAACTGCAA GCATGATGTA TGAAGGCATG ATTGAAGCTG CAAAAGTAAA 720

CAAAGCCATC GTAGCACACT GTGAAGATAA TTCATTAATC TATGGTGGTG CAATGCATGA 780

AGGGAAACGC AGTAAAGAGT TAGGTATACC AGGTATTCCA AACATTTGTG AATCTGTTCA 840

AATCGCAAGA GATGTACTAT TaGCTGAAGC AGCAGGTTGT CATTATCATG TATGTCATGT 900

TTCTACTAAA GAAAGTGTTA GAGTCATTCG TGACGCTAAA CGCGCAGGCA TTCATGTTAC 960

AGCTGAAGTT ACACCACACC ATTTATTGTT AACAGAAGAT GATATTCCTG GTAATAATGC 1020

CATTTATAAA ATGAATCCAC CATTGAGAAG TACTGAAGAT AGAGAGGCTT TGTTAGAAGG 1080

GTTACTAGAC GGTACAATTG ACTGTATCGC AACAGAcCAT GCACCACATG CACGTGATGA 1140

AAAAGCACAA CCAATGGAAA AAGCaCCATT CGGAATTGTT GGTAGTGAAA CAGCATTCCC 1200

ATTATTATAT ACGCATTTTG TAAAAAATGG TGATTGGACA TTACAACAAT TAGTAGATTA 1260
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TTATGCAGAT 


TTAACAATCA TTGATTTAGA TAGTGAACAA GAAATTAAAG GAGAAGATTT 1380

CTTATCAAAA GCAGATAATA CACCATTTAT CGGCTATAAA GTTTATGGAA ATCCGATCTT 1440

AACAATGGTT GAAGGCGAAG TTAAATTTGa GGGGGATAAa TArTATGCAA gCAAACGTTA 1500

TCTAGTGTTA GAAGACGGTC TTTTACGAGG CTACCGTTAG GTCTGATAAC TTACTGA 1557




(2) INFORMATION FOR SEQ ID NO: 447: 










(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1799 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 447:






GCTAGAAATm TTGmATGaCA ATACAACTCT GTTAAAaTGA TGGACGTAGA CAAATATGCG 60

TATTGACGCT TTATTTTAAA AATTaACATG CTTATAACAT GTTTATAGAA GGAGATTAAC 120

CTATGAACTA TCaAGTTCTT TTATATTATA AATATATGAC GATTGATGAC CtGAACAGTT 180

TGCTCAGGAT CACTTAGCCT TTTGTAAAGC ACACCATTTA AAAGGTAGAA TTCTTGTTTC 240

TACAGAAGGT ATTAACGGCA CATTATCTGG TACAAAAGAA GAAACCGAAC AATATATGGC 300

ACATATGCAT GCCGATGAAC GATTCAAAGA TATGGTGTTT AAAATTGATG AAGCTGAAGG 360

ACATGCTTTT AAGAAAATGC ATGTACGTCC TCGAAAAGAA ATCGTTGCTT TAGATTTAGA 420

AGATGACGTC GATCCAAGAC ACACAACTGG CCAATATTTA TCACCTGTAG AATTTAGAAA 480

AGCTCTTGAA GATGATGACA CAGTCATTAT TGATGCACGT AATGATTATG AATTTGATTT 540

AGGTCATTTC CGAGGTGCAA TTCGTCCAAA TATCACACGT TTTAGAGATT TGCCTGACTG 600

GATTAAAGAG AATAAAGCGT TATTTGCAGA TAAAAAAGTG GTTACGTACT GTACTGGTGG 660

CATTCGATGC GAAAAATTTT CTGGATGGCT TTTAAAAGAA GGTTTCGAAG ATGTAGCTCA 720

ACTTCATGGC GGTATTGCTA CATATGGTAA AGATCCTGAA ACAAAAGGTG AATATTGGGA 780

CGGTAAAATG TACGTATTTG ATGACCGTAT CAGTGTTGAT ATCAACCAAG TTGAAAAnAC 840

AATTATTGGT AAGGATTGGT TTGATGGCAA ACCATGTGAA CGTTATATTA ATTGCGCTAA 900

CCCAGAATGT AATAAACAAA TATTAGTTTC TGAAGAAAAC GAAACTAAAT ATTTAGGTGC 960

ATGCTCTTAT GAATGTGCTA AACATGAGCG TAATCGTTAT GTTCAAGCAA ATAATATTAG 1020

TGATAATGAG TGGCAACAAC GTTTAACAAA CTTTGATGAT TTACATCAAC ATGCTTAGTT 1080

TTAATTAAAT ACCTTTCAAA ACACGCTTTG AAAATCCGAT TTATAAAGGT TTTTCAAGGC 1140
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TAAATTTTAA TACTGCGGGG TGTCTTAAAA TGCACATTTT AGTAACAGGG TTTGCGCCTT 1260 

TTGACAATCA AAATATCAAT CCCTCATGGG AAGCTGTGAC TCAACTAGAA GATATTATTG 1320 

GCACACATAC AATCGATAAA TTAAAACTAC CAACCTCTTT TAAGAAAGTA GATAATATTA 1380

TAAATAAAAC GTTGGCATCT AATCATTATG ATGTTGTACT AGCTATAGGA CAAGCTGGTG 1440

GTAGAAATGC CATTACCCCA GAACGTGTCG CCATTAATAT TGATGATGCA CGTATTCCAG 1500 


ATAATGATGA TTTTCAACCT ATTGATCAAG CCATTCACTT AGACGGTGCG CCAGCTTATT 1560 

TTTCAAATTT ACCaGTTAAA GCAATGACTC AAAGTATTAT TAATCAAGGA CTTCCTGGAG 1620 

CACTTTCAAA TAGCGCAGGT ACATTTGTTT GTAATCACAC ACTTTATcAC TTAGGTTATT 1680 


TACAAGATAA GCATTACCCT CACCTACGAT TCGGATTTAT TCaTGTGCCA TACATACCAG 1740 

AGCAGGTCAT TGGTAAACCC GATACACCAT CTCATGnCCA TTGAGGAAAA GATnAGTTG 1799

(2) INFORMATION FOR SEQ ID NO: 448:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 448:



ACTTGGTTTT TTATTGTTTA TAAATAAAAC TCACTTAATA ATGTTTTCAT AATCTTCTTC 60

GACTACTTAA TTCTTTAAGA TATTCGTGAA AAGAGACATT ACACTAGTTA ATTTTCAAAC 120

AATACAAAAA GCGTCTACCT CCTACATATA ATTGTAGCGG AGATAGACGC TTAATATTTA 180

TTTAAAAATT ATTTTAAACC ACCGAATGTC ATAACATCAC GGGCAATCAT ACTTTCTTCA 240

TCTGTTGGAA TAACGACAAC TTTAACTGGT GAATGAGGAT AGTTAATAAA TCCTTCTTTA 300

CCACGTAGTA AGTTTTCATT TTTCTTAGGA TCCCAGTAAA CACCCATAAA TTCTAAGCCT 360

TCAAGAACTT TCGCACGAAT TTCTACTGAG TTTTCACCGA TACCTGCTGT AAATACGATA 420

ACATCAACAC CATGCATTCT CGCAGCATAT GATCCAATAT ATTTGTGAAT TTTAGAAGCA 480

AATACATCTA AAGCCATTTG TGAACGTGCT TTACCTGATT CAGCTTCTTC TGATAAGTCA 540

CGTAAATCAC TAGATGTACC TGATAATCCT AATAAACCTG ATTCTTTGTT TAAGATTTCC 600

AATACTTGTT CAGCAGTTTT ACCTGTTTTT TCCATAATAA ATGGAATTAA AGCAGGGTCA 660

ATATTACCAG AACGAGTACC CATTGTTACA CCAGCAAGTG GTGTGAAGCC aTTGATGTAT 720

CAATAGATTT ACCGCCATCG ATAGCTGCAA TTGATGCTCC ATTACCAATG TGACATGAAA 780
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TATGGCTTGT 


ACCATGGAAA CCATACTTAC GAATGCCATA ATCTTTATAA TAATGATATG 900

GCAAGCTATA TAGATATGCT TTTTCAGGCA TTGTTTGATG GAATGCTGTA TCAAAAATTG 960

CCACATGAGG GATATTTGGT AATAATTTAC GGAAAGCACG AATACCCAtC AAGTTaGCTG 1020

GGTTGTGaAG CGGTGCTAAT TcGCTTAATT CTTCAATTTC CTTTTCAACC TCATCAGTAA 1080

TAGCTACTGA TTCAGGGAAT TTTTCACCAC CATGTACAAC ACGGTGACCT GTTCCATCGA 1140

TATCGTTAAT ATCATTAATA ATATTGTGCG CTTTAAAAGC ATCCAACATG ATATCAACTG 1200

CCTCAACGTG ATCCTTGATA TCTTGTACTG TTTTAACTTT TTCCCCGTTG ACTTCAATTG 1260

TAAAAATTGA ATCCTTCAAT CCGATTCTTT CTACTAAACC TTTTGTTACT AATTCCTCTT 1320

CAGGCATTCT AATTAATTGA A 1341




(2) INFORMATION FOR SEQ ID NO: 449: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1529 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 449:








TTTTGAAGAT ACTACCGATG AAAATAGACA AAAGATTTTT CAATATTTAT CACCTGAAgA 60

AGTTGCAAAT TTCTTTGATC AATTAGATAT TGATGACGAT GAATATGAGT TGCTATTTGA 120

TAAGATGAAT GCGACATACG CAAGTCACAT ATTAGAAGAA ATGTCATACG ACAATGCAGT 180

AGATATTTTA AATGAGTTGA CTAAACCAAA AGTTGCTAGT CTTTTAACAT TGATGAATAA 240

AGATGACGCG AATGAAATCA AAGCATTACT TCACTATGAT GAGGATACGG CCGGCGGTAT 300

TATGACGACG GAgTATTTAT CACTTAAAGC GCATACGCCT GTTAAAGAAG CATTATTATT 360

GGTCAAAGCG CAaGCACCAG ACGCAGAAAC AATATATGTT ATATTTGTCG TTGATGATGA 420

TGGTAAATTA GTAGGTGTTT TATCGCTAAG AGATTTAATT GTAGCTGAAA ATGATGCTTA 480

TATTGAAGAT ATTATGAATG AACGTGTCAT TAGTGTGAAT GTAGCAGACG ACCAAGAAGA 540

TGTTGCTCAA GTTATGAGAG ACTATGATTT CATGGCTGTA CCTGTTATAG ATTACCAAGA 600

ACATTTGCTT GGTATCATCA CGATTGATGA TATTTTAGAC GTTATGGATG AAGAGGCTAG 660

TGAAGACTAC TCTCGTTTAG CCGGGGTATC AGATATCGAT TCGACTAATG ATTCAATCAT 720

TAAAACAGCA TTAAAACGTT TACCATGGTT GATTATTTTA ACATTTTTAG GAATGATTAC 780

TGCGACAATT TTAGGGAGAT TCGAAAAAAC ATTAGAAAAT GTAGCGCTAC TCGCAGCGTT 840
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TCGTAACATT ACGAcAGGGG 


AAATTAATGA GCAAAGTAAA TTTAGAATTG CATTAAGAGA 960

AGCAGGAAGT GGTGTATTAT CGGGTGTTGT ATGTTCAACA ATATTATTTA CAATTATTGT 1020

TGCAATATAT CATCAGCCAC TTTTAGCATT AATCGTTGCA GGAAGTTTAA CTTGTGCGAT 1080

GACGGTGGGG ACGTTTGTAG GTTCGATGAT TCCATTATTG ATGAATAAAT TAAATATCGA 1140

TCCAGCAGTG GCTAGTGGAC CATTTATTAC AACAATTAAT GATATTATTA GTATGTTGAT 1200

TTATTTTGGT TTAGCTACAT CATTTATGGC TTACTTAATT TAAGGAGGAG TTATGGAGTT 1260

TTTATCTTTA GTTATTGTTG TTTTAGCAGC GTTTTTAACT CCAATAATTG TCAATCGATT 1320

AAATATTAAT TTCTTGCCAG TTGTTGTTGC AGAAATTTTG ATGGGGATTG TGATTGGAAA 1380

TTCATTTCTA AATATAGTAG AAAGGGATTC AATTCTAAAT ATTTTATCAA CGTTAGGCTT 1440

TATCTTTTTA ATGTTTTTAA GTGGTTTAGA AATTGATTTT AAAGCTTTTA AAAAAGATAA 1500

ACGCGCACGT CAAGGACAAA ATGATGATG 1529




(2) INFORMATION FOR SEQ ID NO: 450: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 450: 






TTCTGGAAAC CAAAGTATTG TCATCTTCTA CTAGTAGTAT nGGCATCCAT ATCACCCAAT 60

ATCATTTAGT ATATTTCATA TTTTCTCCTG ATTTTAATCG ACTTTGAAAA TCTTTAATCC 120

GGCAGTCAAC TTCAAAGCCA TGAATCATCA TTTTTTGCAT TGGTGCTTGT ATAAAGTAAT 180

AAATCGGCCA AAATAATCGA GGGATATAAT CGTATAGATG TATATAAACG ACTGCCGACT 240

CTTTGATTAA TCTAAATTCT AACTTCCCTT GATTAACTGT ATATTTTTTC ACTAAACTTC 300

CACTCAATAA AATTAAAGTT ATTATTCCAT CAGCTGTTTC TTCTATTTTA AATACTGCGA 360

GCGGTCGCAC CTTATTCTTA ATATATATCT TAAATTGATC ATGTGATTTT TCTGTTTTCA 420

CAAAAGTTCC TTTAGTGTAA CCCATCCATG CAATAAAATG GTTTACAACG TTCTTTAATG 480

TCCATCCCTT TGGTAAAACT ACCTTCATCG TTGATCTAAC ATCATCATAC TTTGAAACTT 540

GTAATTCTAC ATTAACTAAA GAACGTTTAA AAACTAAATT TGTTGTTTCT ACAGGTGTAC 600

CATATGCACC TAGGCGTTCA ATTGTTTCAT TATCATAACG ACTCCCAGGT ATGTAGATTA 660

CTTTTTTTAC TTGATTGATA GCCGCTGCTC GACCAAAATT ATCTGCTGCG ATTAATGTTA 720
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5 



CAGCAATATC TATTTGATTC ATTGCTGCAA CAACCTGTTC GTAATGAAAT ATATCACACT 840

GAATCCAAGT CATTTCAACA TCATCTGTTT kTTTATTGTC TGGATATTTT GATATAGCAA 900

AAAGTTCAGC ATCATTTTCA ATCACTTCAC TTAAATACTT ACCAATATAT CCTGTTCCAC 960

CTGCTAATAA AACTTTAGGT TTCATCTAAA ATACTCCTTT AAACTGTAAC CAAAAAACAT 1020

ATTACTCCAC CTTTTAGTTA CATATATATT ATAATAGTAG CAAATGTTTT AAAATTTCAA 1080

AATACTGGAG GCTTTTTATG GCCCATATTA TACGTAGAGT TAGTATCAAA GATGTAGAAA 1140

ATTTCATTTC AATGTTAGCG AACATATACG ACGAATCTCC GTATATGTTC TACACACCAG 1200

GAGAATATGA TCCTAGCGTC ACATCGGCTA GTAAACAATT AGAAGAATAT ATCACTTCTC 1260

CGCATAAAGT CATCTTCGTT GCTGAAAGTG ATGAACAACT CGTTGGCTTT GCCTTTGTTA 1320

ATACGACACC ATTTCAACGC ATTAAACATG TTGCTAAAAT TGATTTAGGT GTAAAGAAAT 1380

TATATCAACA TCGTGGAATT GGCCAAGCAC TTCTTGATGC CATTATGGCT TGGTGTTTAA 1440

ACAATCAAAT ACACCGAATT GAAGCAAATG TACCACTCAA TAACCAACCT GCCCTCGAGC 1500

TTTTTAAAAG TGCCGATTTT CAAATCGAAG GCGTTTTAAA AGATAAGTTA TTTATCGATG 1560

GTAAATATTA TGATGACTAT ATGATGGCTA AAATTCTTAA TTAAAGCTAT TTTATCATAA 1620

TCTTGTATCA GAATCGTATA ACAACGAATT TAATGGTTAC CTAATACATT ACTCATACTT 1680

ATCAATGTTA TCTAATCTCA AATAAATACG TACACTCTTA TTCATTTATC AAATTTAAAT 1740

TCAAAATAnA ACACCACTAA TGTGTAATTG ATTAACTATC AACTACGATT AGTGGTGCTT 1800

TATATATGTG GTTAGTTTTC CTnACTA 1827



(2) INFORMATION FOR SEQ ID NO: 451: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 616 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 451: 

ATATAGATTA ATGTTTGTTA TTTGTACTGT GTTTGCCAGT GACATTTATT ATAAAACATA 60 

TGACGTTATT CTATGkTGTA CTCATTtTCT ATATTGTAGG TTTATTAACG ATTAGAAGTA 120 

TTATTAAAAA GTTGAAATAT CAGGAAACAT TATTACGAGA CTAAAAAACT TCCATTGGCA 180 

TGTATGTTGT AAAGGTGCAT GTAATGTTGA ACGCCAAATG ATACGGCGTT CAGATTACAT 240

TAGCATCTAT ACGTTAACAG CATAACCAAT GGAAGTTTTC TTCGAATCTA TTCTTTTATT 300 
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AATGCGTCTT TTGAAAAATG GTCATTAAAG GCATCAGATT GCTTAAAGTC TTCGTATGCA 420 

TGTCGATCAG CAAATCCGAA ATAAATTTTG TATGTTGTAC CTTTAGCAGG TCTTAACAAA 480 

CGATAGCTTT TAAAGCCACC AAAGTTTCTG AAATTATCGT CTACACTAAT CAGTTTCTTT 540

TCAAGTTGAT ATGCATGATC TTCTGTTGAT GGaATGAAgA TTGCACaATA GAAATGaTGT 600 

TCACtGAATT CACCAA 616 




(2) INFORMATION FOR SEQ ID NO: 452: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 944 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 452:



GCACGAGTGA TTAAACGGTT AATCAATGAA ACATTTGATG CAAATTACAT TGAAGTTATT 60

GAGGGAGGAA TTGAAGAAAC GCAAACGTTA ATTCACTTAC CTTTTGACTA TGTCTTCTTT 120

ACAGGAAGTG nAAATGTAGG CAAAATCGTT TATCAAGCTG CCAGCGAAAA TTTAGTTGCT 180

GTGACATTAG AAATGGGCgG TAAAtCTCCa GTCaTCGtTG ATGAAACAGC GAATATTAAA 240

gTTGCTAGTG AGCGCATTTG TTTTGGGAAA TTCACTAATG CCgGCCaAAC ATGTGTTGCa 300

CCAGATTACA TTTTAGTACA CGAATCTGTA AAAGATGATT TAATCACAGC CCTATCAAAA 360

ACGTTGCGTG AATTTTATGG TCAAAATATA CAACAAAGTC CAGATTATGG CCGCATTGTA 420

AACCTTAAAC ATTATCATCG TCTGACTTCA TTACTTAACA GTGCACAAAT GAATATTGTA 480

TTTGGTGGTC ATAGTGATGA GGATGAACGT TATATAGAAC CAACATTGTT AGATCACGTT 540

ACAAGTGATT CAGCAATTAT GCAAGAAGAA ATTTTTGGTC CTATCTTACC GATTTTAACG 600

TATCAGTCAT TGGATGAAGC AATAGCCTTT ATTCACCAAA GACCAAAACC TTTGAGTTTA 660

TATTTATTTA GCGAAGATGA AAATGCTACA CAACGTGTAA TAAACGAGCT ATCATTTGGC 720

GGCGGCGCTA TTAATGATAC ATTGATGCAC CTAGCGAATC CTAAATTACC ATTTGGTGGT 780

GTTGGTGCCT CAGGTATGGG ACGCTATCAT GGTAAATATT CATTCGACAC TTTTACACAT 840

GAAAAAAGCT ACATTTTCAA ATCnACACGA TTAGAATCAG GTGTCCATTT ACCACCATAT 900

AAAGGTAAAT TTAAATrCAT CAAAGCTTtC tTTAAAnATT AATT 944



(2) INFORMATION FOR SEQ ID NO: 453: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4820 base pairs
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(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 453:



TCTCCAGTAG ACCTTGTGTA TGAACAGTTT CTTTCATATG AATGAACATC GTTTTTAAAG 60

TTTGTTTCAA CTCAGCCTTA TCAGGATAAT ATCTAGAGAC AGTCGTCTCT GGCATCTCCA 120

TTGTATGATA TTTAACCTTA TGCAGCGACC CATGATTTTC GTTATAAACA AATGTATGAT 180

TCACTTCATC GAAATCATGA TCTTCTCCTG CAATCCAAAA AACTGGTACT ACTTGTTGCT 240

TATGTGTATC CGTTAATTCC TTAGATAAAG TAATGATTGa AAATATTTTA TGGAATGTAT 300

ACAATGGTCC CCCGAAAAGC CCTGCTTGTT GTCCACCAAT CACAACTTTT GAACCATTAG 360

CTAAATGTTG TATGTTTAAT TCTTGTTCAC TTGAAAGCTT GAAAGCTTTA AATCACTCAT 420

ATATTCACGA ATAACATTCG CTAATGCCGC TTCTCTTCCA TTATTTTCTT TAGACATCCT 480

TTTTTCAAAA CTAGTTTGTT GAGCTGCATC ATATTGAAAT AATCCTGTTA TTACAGGGTC 540

ACTGCTCTTT ATTTTTGGTA TAAACTGATC TTTTTCATTT AAACTAACTA CTTTACAGTC 600

CATGlTmT CTCCTTAAGT ACGCGATTAC AAACTATAGT ATAAAGTCTA TACCGGTGAT 660

TGACAATTTT ACGGCTTGAA AATCAATTTA ATCATGGAAA ATTTATAATA TTCATTGTTT 720

TACATTTTCA AATCAATGAA AAACACAAGT GGTTTAATGT ATAATAATAG TAGTAAACAA 780

ATAAGGGGTA GATAAATATG AGTGAAATCA AACGTCTTGA AATTAATTAC AAAACTGACG 840

AATTATTCGA AAACTTTAGA GCGTTTGGCA ACAAAGACTT ATACATGGTC AATGAGTTAA 900

ACGGTCAAAT GATTGATGCA AGTTCAGATT CACCATTTTA TGGCATATTT GTCGaGATCA 960

ATTAGGAGCT AGAATGGCAT TACTAAAAAA AGGTGATGTC GAAGAAATCT ACTTCCCAGA 1020

TTTTGAAGAT TATATATTAT TATGGAAGTT AGAAGTATTA CCAAAATATC AAAACAGAGG 1080

GTACGCTTCA GAATTGATTG ATTTTGCAAA GAGTTTCAAT ATGCCAATTA AAGCCATTGG 1140

CAGAAATGAT TCTAAGGATT TCTTTTTACA TCATGGATTT ACAGATGTGG AAGCTAAAAA 1200

TATAGAGGGA CATGATGTCT TATTGTGGAA ACCATAAGAT AATAATATTC GACACTACGA 1260

GCATGAAAAT GCATCTTTTC GTAGTGTCTT TTTTACAATT ACTTTCTTAA GCTAATATAA 1320

GTaAATCATT TTCAAATTAT TTGTCTTAAC GTACAATATC ATTTAGTTGT TTCCATGrAT 1380

TAATTTCATA ATCAGGTATA ATTCCTGGAT TATGATCAAA TCCTCTAAAa TTAAACCAGC 1440

AAGTAGCTAT ACCcGCATTG ATTCCACCTA GAATGTCAGA TGTTAnAGAA TCTCCaACTA 1500

TAATCGAGTG CTGtCTTTCA TCCTCACCAA TATCATTAAA AACATAATTA AAAAATTCCG 1560
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ACGGCGTCTG ATTTAACCTT CTCTTTTGCG TTTCGGTTAC ACCATTAGTA ACAATATATA 16 BO 

AATCATGTCT TTTCGATAAT TCGACAATTG TTTCTAATGT TTGATCAAAG TATTTAACTT 1740

TAGCTTCTGC TAATCCATTT CTAAATAACA CATCTGCACG ATGCCCATCA ACTTCCATTT 1800

GATGATGTiT GAAGTAATTC ACAAATCGTT CTGATAATAC TTCAGACTTC GkTAATTTAT 1860 

TTTGkTGAAA AGCTTCCCAA TGTTGGTGaT TGaTTTTTTT AAATGkTAAA AAATCATCCy 1920 


TTGTTGCTTT ATGATTAAAA ACATTCGCCA TATAGTGAAn CGCCCATTCT TCTGcATCAT 1980 

AAAAATCAAC AATTGTATCA TCAAAGTCTA TCAAAATATT TTTATATCCC AATTTCCCCA 2040 

TCTCCTATAT TGTCTATGTA TCTAAATCTT AACAGAGGCT CAAATTTCTG CAAATAAAAT 2100

AAACTGAGTG CATAACATTA AAGTATGCTC ACCCAGTTTA TTTTAAAGAA TATTAGTTAT 2160 

TATATTAGAA TCCAAATAAT TTACCTAGTA AACCCACACC GTTAGCAACG ATGTCTACGA 2220 

TACTTGTGCC TAATTTCACA CTATCATGTT GTTGTGCAGC TTGCACAGTA TTTGCGATTG 2280 


CTTCTGCTAG TCCAGTCATT TAAATCTCTC CCTCACCTTT GAAATAATAC TGATTACTTA 2340 

CATAACATAT TGAAATTAGA ATCCGAATAA TTTACCTAAT AAACCTACGC CATTTTCAAC 2400

GATGCTCACA ATGCTTGTGC CTAATTTTGC GCCATCATTA TTAATTGCTG CAGTTACGGT 2460

ATCTTTAATT GCGTTAAATA AACCTTCCAT TGAAAACACT CCTTAAAATT TAAATTTGAA 2520 

GATAACAAAA ACGTGCGTAg yTTTTAAATC ACCGAAATGT TATTCGCTTA ACGTTTTGTT 2580 

GTTGTTATTT TAAAATAAAT TTGATGCAAT TAGTTTGTTT ATCCGCACAA CATCTTATAA 2640

TGTACTTAAC TGTATTTTAA AGAGAAAAGA AATACAGTTA GGCATTCAAA ACTGTATTTA 2700 

ACACAATTAA GTTGCCTGAA TTCGTATTTA AGTCTTATTG AACCTTTTTA GATAAATAGC 2760 

TCTATAATAG TGAAAAATAT AAACATTTTT TATTTACAAG GTATTGCTAA TTTAAGTTCA 2820

TTTAGATATA ATAATTCTTG TGTTGTTAAA CGTGTCCTGG TAGCTCAGCT GGATAGAGCA 2880 

ATGGCCTTCT AAGCCATCGG TCGGGGGTTC GAATCCCTCC CAGGACGTTT ATAGGTATTT 2940 


TTATACGCAT TACCAAACAA AAGAGTTCCG TGATTACGGG GCTCTTTTTG TTTTGAATTT 3000 

CAGTAATATA GTATGATGCG TCACCAAAAC GTCCCCCGCA TAAGCCCCGA AAATACAGTA 3060 

ATTAAAACAA GCATGCTTAT TCGTTATAGA ATTTTTTGAC ACACAATTGA CACGCGTCTG 3120 


ACACTTGTTT ATACATTTTT AATTAAGTAA TTTTGTGCTC AAATTTCATC TATACTGCAC 3180 

CTGAACTACA CCAACACTAC ACCAAGATTT TTAACACTCA CCATTTGCAT GCGTAGAGAT 3240 

TTTTATTATT ATATTATTCC TATAGATTTT GATACTATTC AAAATTTTAG GGACTTTTCA 3300

GGGGCCCGAA ATCCTATAAT TATAATTATA TACATCTAAA AAAAATAACC ACGTCCATCG 3360 
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5 



ATACTATTGG CAAATTTATA AAGTAGTTCA GCGTTTTTCA ATGACATATT GTCTAATGAT 3480

CTTTCATTTT TTCTCATTCT GTGTATTGTG CTTTGTGGAA CTCCTGTTTG TTTCGATATA 3540

TGTAAACTGC TCAAATCACT GTCTAATAGT TTTTGAATTT GATTTCTCAT TTTGTACACC 3600

CCCTTGTAAA TCTTCAATCA TCATTTAAAT TAAATAATTA CTTTTCCACA TATTCCAATA 3660

TTTAGGTTGC AAAGCATACC TCAAATATCA TTAATTTTGA GATTTAAATG TCAATTATGT 3720

GTCTATCAAT CCAATATACA TACTCTAATA ACGTAATAGT ACACACTCTT CTTATTAATT 3780

GGCCATAGCT ATCATGATAT AATTAGTGAA GAAAATCACA TAAGAAAGGT TGTAAATCAT 3840

GAGACTTCAA AAAGCACCTC TAGTAACGTC AGGACTAGTC TTAGGATTAT TAGGCCTGGG 3900

TAATCTATTA AAAGACTTAT CTCTTACTTT AAACGCTGTT TGCGGAATCT TTGCTTTCTT 3960

C3ATT7Y3GATT CACCTTTTAT GTACTATGAT CAAATATTTT AATAATGTGA AAGAACAATT 4020

AAAPAGTPrT [illegible] PAGTGTTCAC AACATTTTTC ATGTCTGGCT TTTTAGGTAC 4080

[illegible] [illegible] TTAGTAAPAT AACTTTTATC AATAGCTTAA TAACGCCTAT 4140

[illegible] TGPPTTGTGG GAATTATGAP GCATATGATT ATTTTTTCAA TAAAATATTT 4200

[illegible] TPAPTTGAAA ATGTTTATCC TTCGTGGACT GTACTTTTTA TTGGTATTGn 4260

TATGGCAGGA TTGACGGCAC CCGTTAGCGG ATATTTTTTC ATAGGTCAAT TAACAGTAAT 4320

ATATGGPTTT GTAGCTACTT GTATTGTCTT ACCTATAGTT TTCAAGCGAT TAAAAGCATT 4380

TCCATTGCAG ACGTCAATCA AACCGAACAC ATCGACAATT TGTGCACCAT TTTCTTTAGy 4440

CGCTGCAGCA TA^TTATAG CTTTTCCTAA GGCGAATGCT TTTATCGTAA TTATATTTTT 4500

ACTATTAGCT CAAATATTTT ATTTTTATAT CATTATACAA TTGCCTAAAT TACTAAAAGA 4560

ACCTTTTTCG CCCGTATTTT CAGCTTTCAC ATTCCCTTTA GTAATCTCAG CAACTGCTTT 4620

AAAGAACAGT TTGCCTGTAC TTATGTkTCC AGACATTtGG AAAGGkCTTT TGTTTATCGA 4680

AGTGTTATTA GCCACTGTAA TAGTACTTAG AGTCTTTATA GGATATCTTC ACTTCTTTTT 4740

AAAAAAGGAA AAACAAGATA AATTTCTnCG TAATGCGTCT CAGTAACACT ATTACCAAGA 4800

ATTAACACGT ATATTTAATA 4820



(2) INFORMATION FOR SEQ ID NO: 454: 


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4358 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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ATTTGGATCT TTAATATCAC CAATATTTTT AATATCTTCC GGATTCAATC CATATACTTG 60 

TACTGTATCT GAGTATTTAA TTGTGAAATA ATCACCTGAT TTAACTTTGT CATCAACTGT 120 

AATTTGTGAT TTTAATGATA AATAATCTTG GGCTGGTACG ATTTTATTGT TTTTATCTGC 180 

ATCAACGACA GTtAATGTTG TATTTGATGT GATTAAATCA TTAACATTTT TAGCCTCTGT 240 

TGATGATGGC TGTACTGCTG CTATACGCAT TCTTGTATTC AAACGTTTAG GTGCTGTACT 300 

TTTTGGCAAA ATGATATCTG CATTATTTTC ATTATTTGAA TTACTATTGT TATCAACAAG 360

AGTTTCATCA TTACTCTTGA TAGCATCACT TTTAACATTT AATGTAGTTG ATTCAGTTTT 420 

GGCATCTACC TTTTTGTTTT CCTCATTAGT TGGTTGAACA TTTACCACTG ATTTATTCTC 480

TTGCAAATCA GGTTGTAACG CTTCTTGATT ACTTATAGTT TGTTTAGTGT TTAAATCTTC 540 

ATTCGTAGAT TTTGGTGAAG CTTGCTCATC TGATTTGGCA GTTGAAACTT CAACTTTATT 600 

TCCAGTGGTA GATTGTACAC TTTCTTTTTC TATTAATTTA TTCCCATTTG AAGTCGTTTC 660 

ATTACCTTGa GATGATACCA TTTCTTTTTG ATTATCATTT TTAGTATTGT CTTCTTGATT 720 

TAGTTGCTGC ATATCAACTT TATCACTCGA TTGATTATCA CTTGCTGAAG TTGTCGCTcG 780

TTCAATTCTT TATTAGTACT TTCTGCAGCC TTTGCTTCTT GGTTCCCCAG ACCAAAAATT 840

AATGTTGTAC CTACTAAAAT TGATGCTGTT CCCACTGTGT ACTTTCTAAT CGAAAATTTA 900 

TTTAATCGAT TGGATACCAT GCCTTTCCTT GTTATTGCCG TTTTATTTTC TCTGTTTAGC 960 

ATTAGATTAC TCCTAATTCA TCAAATTTTT AAATAATACA ATTGTTTTAA ATACAAAAAT 1020 

GTATATCAAT ATAGTATTAC ATTTTTAGAT AAAGCACAAT ACTTTAATTA TTTTTCTTTA 1080 

TCGTAAAACG TTATTTAACA TTTGTGTTTA AATAAAAGTT TTTATGAGTt TTGTAATCTT 1140

TATTTAATCA TCATAAAAAA TAGTATTATT TGCCCTTGAA ATTAATATCT TAGCTTTTCT 1200 

AATTCATAGA CAATTACATT TCTGTAACAA ATTAAATTGT ATCTATTCCT TAAAGATTTT 1260 

TTGTTTTATA TCTGGGAATT TCTAAACAGA AAAAACCAGG CCACATGGAC CTGGTTAAGT 1320 

TAATCATATT ATTTATTTTG TTTTTTACGA CGACCGAATA ACAATAATGA TCCTAATGCC 1380 

GCGAATAATC CACCGAATAA TGTGCCATTA TTTGAATTAT TATTTTCACT ACCTGTTTCT 1440

GGTAATGCTT TAGCTGTTTT ATGCTGATCT TTAACCGTAC TCATTGGTTT AGCCGGAGTA 1500

TGTTTACCTG CATCTGAATC TGAATCGCTA TCTGAATCTG AGTCGTTGTC TGAGTCCGAA 1560 

TCGCTATCTG AATCTGAGTC GCTGTCTGAA TCTGAATCGC TATCCGAGTC TGAGTCGCTA 1620 

TCTGAGTCTG AGTCGCTATC TGAATCTGAA TCGCTGTCTG AGTCTGAATC GCTATCTGAG 1680

TCTGAATCGC TGTCCGAATC TGAGTCGCTA TCTGAATCTG AATCGCTATC TGAATCTGAG 1740
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TCTGAATCTG AGTCGCTGTC TGAATCTGAA TCACTGTCTG AGTCTGAGTC GCTGTCTGAG I860 

TCTGAATCGC TGTCAGAATC TGAGTCGCTA TCTGAGTCTG AATCTGAATC ACTGTCTGAG 1920

TCCGAATCGC TATCTGAATC TGAATCGCTA TCTGAGTCTG AGTCGCTATC CGAATCTGAG 1980

TCGCTATCTG AGTCTGAGTC GCTATCCGAG TCTGAATCGC TGTCTGAGTC TGAGTCGCTG 2040 

TCTGAATCTG AATCGCTATC TGAGTCTGAG TCGCTGTCTG AATCGCTGTC TGAATCTGAG 2100


TCGCTATCTG AATCTGAGTC GCTATCTGAG TCTGAATCGC TGTCAGAATC TGAGTCGCTA 2160 

TCTGATGTTT CTTCTTCGTA GTAGCCATTA TCAAGTGTGA AATCATCATG ATCCGTAATT 2220 

GTTACATCAA CTTCGCCACC ATCGGCATCT TTATCATCTT CAGTTGTATT TGTACCTGTT 2280

TGAGTTAAGC CAGCAGGTTT TTCAAAGATA ACTTTGTATT TACCACTATC TAAATTATCA 2340 

AAGCGGTATT TACCATTTTC ATCTGTyTCA GTTGTACCAA TTACTTCGCC TTTTTCGTTT 2400 

TGCAAAGTAA CTTTAACACC TTTAATTCCT TTTTCAGTCG AATCTTGTTT ACCATCTTTA 2460

TTACTGTCGT ACCAAACATA ATCACCTAAA CTATATTTTG GTGTTTTGTA GAATCCACTA 2520 

TCTAATGTCA TGTTGTCAGC GTCTTTAATG ACACCTGTTG TAGTTAGTCC ATCAGAATCT 2580

ACAGCATCAT CTGTACCTAC ATTTGCAGTT QTCGGTGTAT AACCGGCTGG TGTTGAAAAC 2640

TCTACACTAT AAGTTCCATT GCTTAAACCA GTGAACTGAT ATTTACCATT TTCATCTGTT 2700 

GTCGTACGAT CTAATTCTTT ACCGTTACTA TCTTTAAGAA TGACATAAAC ACCTTTAATC 2760 


CCTTTTTCAT TGGCATCTTG TTTACCATCT TTATTTGTAT CTTCCCATAC ATAGTCACCT 2820 

AGATTATATT TCTTTTGGTC GCCATTAGCA GTTGATGAGC CATTCACATT TGAATAACTA 2880 

TTTGACCAAC TATATTTAGT TTTGTCAGTG TCTAAAGTAT AATCAATTTT TCCATTATCT 2940

GTTGAACTAT TATCTGGATA AGCAACTTGT TGAATGATGT ATTGTTTATT GCTGCTTGTT 3000 

TGGCCTTTCA TTAAATCGAC TGTAGCTGTT TTATTATCAT TACTATAAAT AACATCGAAT 3060 

TGATCAGTAA CATCTTTAAG TTTTGAAGTA TCAGGGGTGA AACTATCCAC AAATTGATTT 3120

TGATCTGtCA CTtCGTAAAT TTTGAAGTTT TTTGCATTTG GATTAAATTT ATATCCAGTT 3180

AAATTAGTAA CAAACGTTTG TTTAGTATAT GTATTTTTAG GTTGATTTAC ATATGCAGTC 3240 

ATATTACGCG ATAAATCTTC ATTGTTAATA TAGTTTGTAC TTGAAATAAG CGGTTGTGCT 3300

TTTTTATTAC CATAATCGAC AATGATTTCT TCGCTATATG TATCATTACC TAAAGTTACT 3360

TCCATTTTAT AAGCTGTTTT ATCAGTTGTT GCATTTTTAC GTTTCGCAAA TGCAACTTGT 3420 

TCAAAGCTAC CTCTAACATT TGTATATTGA TCTACATAGT TCGTAAAAGT ATATGTTGTT 3480

gtGTTTGTTG TACTATCATA AATACCTTTT GCAATAATAT TACCTTGGGC ATTATATAAA 3540
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35 



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50 



GTAAATGTAT CGCCCTCTTT AACAGAATCA TCGATTGTGT AATTTGCTTT TAATTTTAAA 3660

ACATCACTTG AAGTTGCCCA AAATTCAGTT TTACCAGTAG TCTGATTAAC ATGTCCTTTA 3720

TCAATCGCAA TGTCAATATT TGAAAAATGT ACTTTATCAT TAACATTTGT TCCTTGTTGT 3780

GGAGCTGCAA CAGTATTCAC TGCCATGCGA TTTAAAGTTC TTGGTTTAAT AGTCGTTGTT 3840

TTAGGTGTAG TTGAAACATC TTTTGCTTGT GTTAAATTAC TTTTATCAGT TTCATTACTA 3900

TATGTAGTTG ATGATTTATC ATTTGTTGTT ACATTGCTAG TTTTTGTAGT AGATTGATTA 3960

GCTGTAGCGT TTTGTGGTGA TTGCATGTTA CTACTAGTTT CTTTAACTGT TGCACTATCA 4020

CTCATTGTCA CTTTAGGCTG ATCTGCAGTT GCAGTTTGCG TATTGTCTTT TAGTTGACGA 4080

CTATCAACTT TTTTAGTTGT TTTATTCTCA CTTGGGGCTG TCGTTTCATT TTTTGATTGA 4140

TTTAATTCTC CATTCGTATG TTCTGCCGCT TTAGCTTCAT GACCACTTAA CCCAAAAATC 4200

AATGTTGTCC CTACTAAAAT TGAAGCAGTA CCTACAGAAT ACTTTCTTAT CGAAAATTTG 4260

TTTAATCGAT TTGGTATCAT GCCTTTnCTA TTTGTnGCTG TCTTTTTATA ATTCATTTAA 4320

TAATACTCCT TTAAAATATC AAAATTTGAT AAATATAA 4358


(2) INFORMATION FOR SEQ ID NO: 455: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1060 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 455: 






TTGACTTCTT AATTCAGCAT TTTCTGCACT TAATGCTTTG TTCTTTTTAA TAAGTTGCTT 60

TCTTGCATAA ACTTCGGTAT CTATTTTACT ATTACTATAC CTTTGATTTA AAACTAATAT 120

ACCAATTAAT GCTACAATGA TAATGATAAG TACAACATAA AAAGACATTT TTTCACCAAT 180

CCTTTTTGAC TTCTTTAACT TTGTATACAA TAATAATTAA TAAAGATTAA TTGTTATTCA 240

ATTTCCCACA TTTTTATTAG TTGATTTTAG TTCATCATTG TTATAATCAA ATTATAAACT 300

GACAGATATT GATGTTCAAT GAATATGACG TGAAAGATTC GTGAATTCAA GTTTATGTCG 360

AATTTATGTT ATAACGGTCA TTTAAATGAC AGAATTAGGT CACTCATAGT ATTTTGAAGA 420

TTGAATTCAT TAATTTTAAA ATGTATAATG ATATTTGTGA AAGCGCTTGC TTAGGAGGTG 480

TATTTGAGAG TGAATGAAAT GAATGCTAAA GAACAATTAG TGGACAATTT AATGAAAACA 540

TCATCGCAAT TATTTAAATT TCACGGTGAA GTTGCCATGC AGCTTTTCTT AAATGATGAA 600








AAAGTTATTC CGCAATCATA TGCGTTACTA TACATAGATA AGCAAGATCA AGCAATAGCT 720

AAAGAAGATT TATCACTTTC AAAAATTGCA AAAGTTTATG TGCAATATGA TGATACAACA 780

ATAATGAGTA TTTTCGTTTA TGATGTAGTA AACGATGAAT GGATTTTTAG ATTGGATCCG 840

AATATACGTA TACCTAAGAG TAACATATAC TTCCATAGTT TAAATTGGGA TGTGGATATA 900

TTAAACCGGA GtCGTCTAAT GTATGTCTAA TGCaCACCAT CAGaTCATCA TtATCCATTA 960

TAAcGrGCAT AGTGCATAAn yACTwCAtTT TaTTaaATTG AGaGGgGCAC GATAGGTGCA 1020

TCAGGACATA ATATAGGAAG CATCAACGCG TGAnCAGGTC 1060




(2) INFORMATION FOR SEQ ID NO: 456: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1262 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 456:






ATAATnACTA AATACnAAAG TTTAACTGTC TTACTAATAA TGACTATGTT ATAATTTTAA 60

AAGTGATATT TTGGGTAATC GCTATATTAT ATAGAGGAAA GTCCATGCTC ACACAGTCTG 120

AGATGATTGT AGTGTTCGTG CTTGATGAAA CAATAAATCA AGGCATTAAT TTGACGGCAA 180

TGAAATATCC TAAGTCTTTC GATATGGATA GAGTAATTTG AAAGTGCCAC AGTGACGTAG 240

CTTTTATAGA AATATAAAAG GTGGAACGCG GTAAACCCCT CGAGTGAGCA ATCCAAATTT 300

GGTAGGAGCA CTTGTTTAAC GGAATTCAAC GTATAAACGA GACACACTTC GCGAAATGAA 360

GTGGTGTAGA CAGATGGTTA TCACCTGAGT ACCAGTGTGA CTAGTGCACG TGATGAGTAC 420

GATGGAACAG AACATGGCTT ATAGAAATAT CACTACTAGT TTAGCTCTCC TAGATGATGG 480

AGAGCTITIT TCATGAAAAG AACACTTAAA ATTAACGCCy TGTCTTGaTA tAATGACaCT 540

GCcTTGTTTT AAAATAGTAA GCGGATGCgT TAATGTATCA GCGATTAAAT TTGTTGGAAA 600

TGTATAAAAA ACACAAGCTA AGAATAAAAT ACCTGTATAA AAGGAGAATC ATATATGTTT 660

CAATTACTTG CAGTTTGTCC GATGGGATTA GAAGCTGTTG TTGCTAGGGA AATTCAAGAA 720

TTAGGCTATG AAACAAATGT TGAAAATGGT CGTATATTTT TTGAaGGAGA CGCAAGTGCA 780

ATTGTAAAGG CAAATTTATG GTTGCGCACA GCAGACCGAA TCAAAaTTGT TGTTGGACGT 840

TTTAACGCAA CAACGTTTGA CGAATTATTC GAACAAACCA AAGCGCTCCC TTGGGAATCT 900

ATAATTGATA AAGAGGGTAA CTTCCCAGTT CAAGGTAGAA GCGTTAAATC AACACTACAT 960
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TATAACGAAA AAGGTTGGTT AAATGAATCA GGTGCCAAAT ACCCTGTTGA AGTTGCCATT 1080 

TTAAAAGATA ATGTATTATT GACTATCGAC ACATCAGGTT CTGGTTTGAA CAGACGTGGT 1140 

TATAGATTAG CACAAGGTGA AGCACCAATT AAAGAAACGT TGGCAGCAAG TTTAATCCGT 1200 

CTTGCCAAAC TGGGAAAGGT GATTACACCT TTnAATTGGT CCCATTTGCG GTTCnGGTTA 1260 

CA 1262 



(2) INFORMATION FOR SEQ ID NO: 457: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1142 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 457: 





CCTGGCTGCT TTATCAGCAT CTACTACTTT AAAACCGAAT ACGGATAAGA GTTCTGATAC 60

TGTTGATTTT CCTGAGGCGA TTCCACCTGT TAGACCAATA ACTTTCGGCA TAATTTCACT 120

CTTTCTTTAT TTTTGACATA CTGGACAATA ATGACTATTT CTTGTCGCGA TGATTTTTGT 180

TTCAATTTGA CTTCCACACA CTTTGCATAC CGGCTGCTTA TATACATTAA GATGCAATTG 240

CATCTCACCA GTTTTTCCAT CAGCATGACG ATAATCTGAA ATACTTGTAC CGCCATATTT 300

AATACCTTCT TCTAGTACTT CTCTAACATA ATAAAAAACC ATTTCTTGTT GTTGGTGTGT 360

TAAGTCTTTT ACTTTTTTAT CTGGTAAAAC ACCTGCACGA AACAACGCTT CACATGCGTA 420

AATATTTCCA CAACCTGCGA TTACTTTATG ATCCAAAATC ACTTGTTTGA TTGGTTTATT 480

CTTATTAGAC TGTTGATGAA TTCGATTTAA ATAATACGTC AATGCTTCAT TTGAAAAAGG 540

TTCAGGCGCT ATTTCTAAAA ATGAAGGATA AGATGCTACA GACGCAACAT TTCTAATTTC 600

TCCAAAACGA CGTATATCTG AATAAATTAA CTTTTTGTCA TTTGACAACT CAAAAATAAC 660

ATGCCAATGC TTACGATAAT TAGGTATCAT AATATCTTCA AGTTCATCTA CAATGAAAAA 720

ACCGCCCGCC ATACCTAAAT GACTAATTAA TGTACGTTGT TCTCGTTTAT TATCTAGCTG 780

AAAAACGATA TATTTACTTC TTCGTTCTAC ATTTGTAATG GTATAGCCTT CCGATAAAGT 840

TTTAAAAGTA TCTAATTCAA TTCCTTTTAT AATTGTTTCC TTGCCTTGAG CTTTACCTTC 900

GATTACTTTA TCCGAAAATA TAACGTGTTC AATTTTTTGA TTTATAACGT AGGGTTCAAT 960

TCCTCTTTTT ACATGTTCTA CTTCTGGTAA TTCGGGCATA CCATTAACCT CACTTTATTT 1020

TGCATCATAC CAGGTTGCAC CATAACTTGA GTCTACTTTT AATGGAACAT CTArTTGCAA 1080
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TT 

(2) INFORMATION FOR SEQ ID NO: 458: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1814 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 458:
CCTTTAGTAA ACAATCCTTC TTTAGTTTTA GTACGTTGTT CCAATCCAAA TAATTTGTAT 
TTCATTGCCT CGCCCGATTG AGTGCCGCTA AAGTTATCAT CTTTCATGTT AGGCGTGTTG 
GTAAACATGT GTATATCACT GTTTAAACGG TCTTTATAAG CTTCGGTACC TTGTACATCG 
TATTGCTTAT AAATATAACC ACCATCAACA GAGCCTTCTG TTTCTCTACC TTCGCTATCA 
GCATAAACAG TCGGTTCTAA AAACAACACG TTAGCTTCCT TTTGTTTTCT AACTTCTACA 
GGATCTAAAT TTAAATTACC TTTAATAAGT AACATAGCGT CATTTAAATC ACTCATATAG 
TTAGCAsymy CTGATTCAGC ATTATCATAC AAATCAATTA AAGTGATTAC TTTCTCATAA 
TCCCCTTTTC TTCTTTCGTT GTTGCTAAAT TCTGTAATAG GCATACGTTC GAAAGAGTGT 
GATTCAAAAC CGTTTTCACG TGGTGTGAGC TTCAATCCAT TTGTTCTACT GGTAAGATAT 
CTATAAACAC CGTGTGAAGT GAATAAATCA ACTGTAAACA CTTCATCTTC GTCAGTCTTG 
TCTATTGGTT TAGTTCTTAA ATATCTAACG CCTGCGATAC TATTACGTTC AATTGTATTG 
TCGTATATGA CAAAAGTACT CATTGCATCA CTCTTGTATA AACGCGTTTC ATCATCTTGG 
TTTCTAATCA TTAACTCATA AGCTTTGCCA TAAATTGACA AATCTAATCC TAAAGATCTA 
TTGTGTGACT CAACATCATT TAAATCATTG AACGCCTCAA TAACTTCTAA TACATCTTTG 
TCATCATCTT GATATTGAAT TGGATTACCC AAGAAATAGC CGTTGATAAA ATCGCTAATA 
TAAGATGCGT AATCATGCGC TACACGGTTA TCTGCCATGT ACTCTTCTTT GCGTCGTGTT 
AACTCAACTA AGTTCTTAGT TTTACCTTCG TAATAATCAC TTAACACTTT CAATCTAGGT 
CGTTGGTAAT CCATGTGATG TTCAATGTAT TTACTTACTT CATTAACGTT TTGTAATAAA 
TCGGATTCCG TCCCGTCATA TGTGTAAACA ACATTGGCTT CATCATTAAA TAAGTAATTT 
ATGTTTCCCC GTAGATCTGT ATCTGTTTCA AATTCGTTTA CTTTTAACAT TTGTTCCCTC 
CTATAATCCT AGAGATTTTA TTGTGTCAAC TTTCGAACTG AGATTTGTGC GTTTtCTAAC 
CGGTCTGTAG AATCGTTCCA CTGAATAACG CAACGAATCG ATACAATGAT TGTATGTATC 
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CTCTTCAATA GTCTTGAAAC AACGTTCATC AACAATGATT TCAAATTGCA TTAAGAATTG 1440 

TAACCCTTGT ACAACCGAGC CCTTCCCTTT TTTGGTTGGT AAAATCCTTT TAAGCCCTAG 1500 

ATTCCTTAAT TCAGCTATAC TTTTTTGTTC TGCACTATCT GCTGTAATTT CTTCTTTAGC 1560

ATAACCAAGT TGCTTTATGA CATTAGCTAT TTCATCATTC AGCATACCTT GTTTAACATA 1620 

CTCTTCAATG ATGTATAACT TCTTTTTCTT TACATCTATT TTAGAATGTA TAAAAGCACT 1680 

AGGATCATTA ACGTAGCCAA AGTCCAATCC AAAATAAGAA GGTAAATGTC TTAACTCATC 1740

TTTATTTATT AAACGTTTTT CATACTTAGG GAAAACCAAT TTGTCTAGTG TAGCAAATTC 1800 

ACCTAACGCA TAAA 1814 




(2) INFORMATION FOR SEQ ID NO: 459: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 686 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 459:

AATTnAGATT ATTACCCTCC TTTAAAAATA TTTGTTTCAC AATTTTTTAT TACCTATTTA 60

CTGGTTTCAT GTCTTATGGG GCATTTTACT TGTTGGTATT TGAAAATGTG CAAAATTTAA 120

TCTTATATGT TTCTTGGCTT TTCATGACTA TGCTATTTAT GTTTATGAAT ATGCATTCAA 180

TTATAGATAA AAAAGTACAT ATATTCTTAA AGTCTAATAA ATAGTTACAA ATTTAGTTAG 240

TTTTCAATTG TTAATTAGGG GTGGTAAACA GTGCTTTGTG AATCTAGACA AATTTATAAA 300

AATCCTAAAT ATCGAGTTAT TAGATATAAT AATGAATATT TCATGGTCGA TTTAGTAAGT 360

ACTTGGATTA CTTATTTTTT CCCTATGATT AATTGGTTTT TGCCCAAAAA ATACGCAAAA 420

ATTAGCGAAA ATGAATTTGA AAGGTTAAAT ATAGTCGAGC CTGTTAAAAA TAATGTTTTT 480

TGGCCGGTTG CAGGAAGTTC AGTTCTATTT GGAATTATAT TGAGAAAGTA CGGTAACTTC 540

TTTAATGTTC AGTTTGAAAA ACAACTAGCA ATCACTGTAT TTTTTATCAT GTTAATAGGG 600

ATGTTAATTT TTTATTTTTA TCTAAATAAA AAATTAACAT TAAAAATTTT TAATACCAAC 660

GTGGGTAATA AGAATAGGAG TTGTAT 686

(2) INFORMATION FOR SEQ ID NO: 460:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1300 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 460: 



ATCTGCAATT ATGGGCACAC CCAAGCTnAT GCAAGTAACT AAAGGAGAAG TACTTTTAGA 60

CGGTGTAAAT ATTTTAGAaT TAGAAGTTGA TGAAAGAGCA AAAGCAGGAT TATTCTTGGC 120

AATGCAATAT CCATCAGAAA TTACAGGTGT TACAAATGCT GATTTCATGC GTTCAGCAAT 180

CAATGCGAAA CGTGAAGAAG GACAAGAAAT CAACTTAATG CAATTTATTA AGAAATTAGA 240

TAAAAACATG GATTTTCTAG ACATAGATAA AGACATGGCA CAACGTTATT TAAATGAAGG 300

TTTCTCAGGT GGAGAGAAGA AACGTAACGA AATCTTACAA TTAATGATGT TAGAACCTAA 360

GTTTGCaATC TTAGATGAAA TCGATTCAGG GTTAGACATC GATGCATTAA AAGTTGTATC 420

TAAAGGTATT AACCAAATGC GTGGGGAAAA CTTTGGTGCA TTAATGATTA CACACTATCA 480

ACGATTATTA AATTACATTA CTCCTGATAA AGTACATGTA ATGTATGCTG GTAAAGTCGT 540

TAAATCTGGT GGTCCAGAAT TAGCAAAACG TCTTGAAGAA GAAGGATATG AATGGGTTAA 600

AGAAGAGTTC GGTTCAGCTG AATAATCTTA TTAATACAGT ATCCATGAGA TGTTCATCTA 660

TATATGATGA AAATGAACAT TTATACGAAA TAGTAAATTT CATCAAGTAG GAGGAAAAAG 720

TTATGACAAC TGATATTTTG rACaTTyCTG AAGAACAACT TGTTGATTAT TCTAAAGCCC 780

ACAATGAACC TTCTTGGATG ACAGAATTAC GTAAAAAAGC TTTGAAATTA ACAGAAACTT 840

TAGAAATGCC AAAACCTGAT AAAACAAAAT TAAGAAAATG GGATTTTGAT TCTTTTAAAC 900

AACACGATGT AAAAGGTGAT GTTTATCAAT CTTTATCACA ATTACCTGAG TCAGTAAGAG 960

AAATTATTGA CGTAGATCAT TCTAAAAACT TAGTAATTCA ACATAATAAT ACGATTGCGT 1020

ACACACAAGT TGATGATAAT GCATCGAAAG ATGGCGTTAT CGTTGAAGGT TTAGCAGACG 1080

CTCTTATGAA CCATAGTGAT TTAGTACAAA AGTACTTTAT GAAAGATGCA GTAACAGTAG 1140

ATGAACATCG TATCACAGCG CTACACACGG CATTAGTTAA TGGTGGOGTA TTTGTTTATG 1200

TTCCTAAAAA TGTAGTTGTA GAACATCCAG TACAATACGT TGTGTTGCAC GACGACGAAA 1260

ATGCAAGCTT TTATAACCAT GTTATCATCG TTACTGAAGA 1300


(2) INFORMATION FOR SEQ ID NO: 461:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3135 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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GACAGCAAAT 


TCAAGATACA TTAAATAAAG ATATTGTCAT AAAGCATATT CTTGTTCGAG 60

ATAAATCTAA AAAGAGACCG CTAAATATTA GCCAATATCA TTTAACTGAA GATGTTAATG 120

AAATTTTAAA TGATGATTCA TTAGATATTA TCGTTGAAGT CATGGGAGGA ATTGAACCAA 180

CTGTAGATTG GTTAAGAACA GCACTTAAAA ATAAAAAACA TGTTATTACC GCAAATAAAG 240

ATTTATTAGC AGTACATCTT AAACTTTTAG AAGATTTAGC AGAAGAAAAT GGTGTAGCTT 300

TAAAGTTTGA AGCGAGTGTA GCAGGTGGTA TTCCGATCGT AAATGCCATA AATAATGGTT 360

TGAATGCGAA TAATATTTCA AAATTTATGG GAATTTTAAA TGGTACCTCT AATTTTATTT 420

TATCTAAAAT GACTAAAGAG CAAACGACAT TTGAGGAAGC ACTTGATGAA GCGAAAAGAC 480

TTGGTTTTGC TGAAGCGGAT CCAACTGATG ATGTAGAAGG GGTAGATGCA GCGCGTAAAG 540

TTGTCATTAC ATCATATTTA TCATTTAACC AAGTCATTAA ATTAAACGAC GTTAAACGAA 600

GAGGAATTAG TGGCGTAACT TTAACTGATA TTAATGTAGC CGATCAACTG GGGTATAAAA 660

TTAAATTGAT TGGTAAGGGA ATATATGAAA ATGGCAAAGT TAATGCATCG GTAGAACCAA 720

CGTTAATTGA TAAAAAGCAT CAATTAGCAG CTGTAGAGGA TGAATATAAC GCGATTTATG 780

TTATTGGTGA TGCCGTTGGT GACACGATGT TTTATGGAAA AGGAGCAGGC AGTTTAGCAA 840

CAGGTAGTGC CGTTGTCAGT GATTTATTGA ATGTAGCATT ATTCTTTGAA TCAGATTTAC 900

ACACATTGCC ACCACATTTT GAATTAAAGA CAGATAAAAC ACGGGAAATG ATGGATTCAG 960

ATGCAGAAAT TAATATTAAA GAAAAATCCA ATTTCTTTGT AGTAGTGAAT CATGTCAAAG 1020

GTTCAATTGA AAATTTTGAA AATGAGTTAA AGGCAATATT ACCATTTCAC CGATCATTAA 1080

GAGTTGCAAA TTACGATAAT CAATCATATG CCGCTGTTAT AGTTGGATTG GAATCATCAC 1140

CGGAAGAATT AATCACTAAG CATGGATACG AATTGACAAA GTATACCCAG TAGAAGGAGT 1200

TTAATTATAA TGAGAAGATG GCAAGGATTA GTAGAAGAGT TTAAAGCACA TTTACCAGTA 1260

AATGAAAATA CACCAAAATT AACATTGAAC GAGGGAAATA CACCACTCAT TCATTGTGAA 1320

AATATGTCTA AAATACTAGG CATAGATTTA TATGTGaAGT ATGAAGGTGC CAATCCGACA 1380

GTTCATTTAA AGATCGCGGT ATGGTAATGG CTGTGACAAA AGCAAAAGAG CAAGGTAAGA 1440

AAATTGTAAT ATGCGCTTCG ACTGGAAATA CATCAGCGTC TGCAGCAGCA TATGCAGCGA 1500

GAGCAGGTTT AAAAGCTATC GTCGTAATAC CAGAAGGTAA AATTGCATTA GGTAAATTGT 1560

CGCAAGCAGT AATGTATGGT GCAGAAATCG TTTCTATTGA AGGAAACTTT GATGAAGCTT 1620

TAGAAATTGT AAAAGAAATT GCAAAAAGTG GCGAAATCGA GCTTGTAAAC TCTGTCAATC 1680

CATTTAGAAT CGAAGGACAA AAGACAGGCT CATTTGAAAT TGTACAACAA TTAGACGGTG 1740
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AAGGCTTTAA AGAATATCAT GAAGCTAAAG GATCACAATT GCCGAAAATG TTTGGCTTCC I860 

AAGCTGAAGG CGCATGACCA ATTGTTCAAA ATAAAGTCAT TAAAAATCCT GAAACGATTG 1920 

CAACTGCTAT TCGAATTGGT AATCCTGCTA GTTGGGATAA GGCGACTAAT GCTCTTAAAG 1980

AATCAAATGG ATTAATAGAT AGTGTTACTG ATGATGAAAT TCTAGAAGCA TATCAGTTAA 2040 

TGACAACTAA AGAAGGTGTC TTTAGTGAAC CAGCGAGTAA TGCTTCTATT GCAGGTTTAA 2100 

TTAAATTGCA TAGACAAGGT AAATTACCTC AAGGTAAAAA AGTAGTTGCT ATTTTAACTG 2160

GTAATGGATT AAAAGATCCT GATACTGCTA TTTCACTACT AGATAATCCG ATAAAGCCAT 2220

TGCCAAATGA TAAAGATAGC ATTATCGATT ATATTAAAGG AGCTTTATAA CATGTCGAAT 2280

GTTTTGGAGT TAACAATTCC TGCATCAACA GCCAACCTTG GAGTTGGCTT TGATTCTATA 2340

GGTATGGCTT TAGATAAATT TTTGCATCTG TCTGTAAAGG AAACATCAGG GACAAAATGG 2400

GAATATATTT TCCATGATGA TGCATCTAAG CAATTGCCTA CTGACGAAAC AAACTTTATT 2460

TATCATGTAG CACAACAAGT TGCTTCTAAA TATAGTGTTG ACTTGCCTAA TTTATGTATC 2520 

GAAATGAGAA GTGATATTCC ATTGGCAAGA GGGTTAGGTT CGTCAGCTTC TGCTTTAGTA 2580 

GGAGCTATAT ATATCGCAAA TTATTTTGGT GATATCCAAC TGTGTAAACA TGAGGTATTA 2640

CAATTAGCGA CTGAAATCGA AGGACATCCT GATAATGTTG CGCCGACCAT TTATGGTGGT 2700 

TTAATCGCTG GATATTATAA TGATGTCTCG AAAGAAACGT CaGTtGCACA TATCGACATA 2760 

CCAGACGTGG ATGTGATTGT AACGATACCA ACTTATGAAC TAAAAACAGA AGCATCAAGA 2820

CGTGCTTTAC CACAAAAATT AACACATAGT GAAGCGGTTA AAAGTAGTGC AATTAGTAAT 2880

ACAATGATTT tGgCATTAGC ACAGCACAAT TATGAATTAG CAGGTAAACT CATGCAACAA 2940

GATGGCTTTC ATGAACCGTA TCGTCAGCAT TTAATTGCTG AATTTGATGA AGTGAAAACA 3000 

ATTGCTAGTC AACATAATGC CTATGCAACT GTAATTAGTG GTGCTGGACC AACTATTTTA 3060 

ATATTTAGTC GTAAAGAAAA TAGTGGGGAA TTGGTTCGCT CTTTAAATAG TCAGGTAGTA 3120 


TCATGCCATT CTGAA 3135 
(2) INFORMATION FOR SEQ ID NO: 462: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 462:
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AGnTCAATAT 


TTAGATCAAG CCGTTTTAAG TAATTACGAA CAAGTTTATA TCATTCATGG 120

TAAAGGTACA GGTGCACTTC AAAAAGGTGT ACAACAACAT TTGAAAAAGC ATAAAAGTGT 180

TAGTGACTTT AGAGGTGGTA TGCCAAGCGA AGGTGGATTT GGCGTTACCG TTGCAACACT 240

AAAATAAATT ATAATTTGAT AAATTAAATA GCTGCAGTTA AAATAATGTA AAGCAACAAG 300

AATACATTTC AAACATGTTA TTTGAAATAA GCATAAAAAT TGAGCAAATA GAAATACATG 360

AAGCATGTTA TCTGATATAA TTTGAACATC ATAATAATAA TTAAGGAGGA TTGGCATTTA 420

TGGCAATCGT AAAAGTAACA GATGCAGATT TTGATTCAAA AGTAGAATCT GGTGTACAAC 480

TAGTAGATTT TTGGGCAACA TGGTGTGGTC CATGTAAAAT GATCGCTCCG GTATTAGAAG 540

AATTAGCAGC TGACTATGAA GGTAAAGCTG ACATTTTAAA ATTAGATGTT GATGAAAATC 600

CATCAACTGC AGCTAAATAT GAAGTGATGA GTATTCCAAC ATTAATCGTC TTTAAAGACG 660

GTCAACCAGT TGATAAAGTT GTTGGTTTCC AACCAAAAGA AAACTTAGCT GAAGTTTTAG 720

ATAAACATTT ATAAGTTACA ACCAATGACG ACTGGGGCAT TTCTTTAATG AATTGCTCCA 780

GTTTTTGTTT GTGTTTTTAA TATAAAAAGT TGAATGATAA GTCATCATAT TGTTTACGAC 840

TTGAGAATGG TGGGATTAAT AAATCTATGA ACGTTAAATG ATAATCTAGC ATGCTGATAG 900

ATTTGTAGCA GTTGGTTTGA TAAAACCATG TTCAATATTA CATGATGTGC ATGAAAAGTC 960

ATACTCGAAG ATGTTGATTA TTAACTAGAA TTAGTGGTGA TAAATTTGAA GCACTTTTGT 1020

AGCATCATTC ATTTTAAAAT TAGAAGGGGG GATATTTTTG GAAGACTATA AGCAACGAAT 1080

TAAAAATAAA TTAAATGTCG TACCTATGGA ACCAGGATGC TATTTAATGA AAGATCGTAA 1140

TGATCAAGTG ATATATGTTG GCAAAGCTAA AAAGCTAAGA AATCGATTGC GATCATATTT 1200

TCACGGGTG 1209



(2) INFORMATION FOR SEQ ID NO: 463: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2410 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 463:
AGTTCAACAC GACGAATTTT ACCTGAGTTT GTTTTTGGTA AGTCGTCAAC GAATTCAATC 60 
TCTCTCGGAT ATTTATATGG TGCAACTTCA TTTTTAACAA ATTGTTGTAG TTCTTTAACT 120

AACGTATCAT CACCCGCAGT ATGGTCCTGT AAAATAACGA ATGCTTTAAC AATATTTCCT 180 

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GCATCTTCAA CTTCAAAAGG CCCAATCGTA 
TAGCCTGAAC TAATAATAAT GTCATCTCGA 300

CGTCCTTCAA ACCAGAAATA ACCATCATCA TCTACATGAG CTAAGTCACC AGTGATGTAG 360

TATTtACCTG TTTGCGCTTT CGCCGTACGT tCTGGCTCTT tATAATACCC TTTGAAAAGT 420

GCTGGCAAAT CAAGTGGTAC TGCAATATTC CCTTTCGTAT TAGCAGGTAC GCTATTCCCC 480

TCATCATCTA CTACAGTGAC CGAACTACCC GGAATGCCTT TACCCATTGA TCCAATCCTC 540

TGTGGTGTAT CTTTTAAAAA GCCTATAAGC AAGGTACTTT CAGTCTGGCC ATATCCATCT 600

CTTACAGTTA AATTAAAGTA TTTCTTGAAT TGTTCAACTA CTTCTCGATT TAGTGGCTCA 660

CCTGCAGAAA CGGCACTATG TAAATGCGTT AAGTCATAAT CATTTAAGTT CTGTAATTTA 720

GCCATCATAC GATATTCTGT CGGTGTACAA CATAAAACAT TAATTTGATA TTTTTGAAGC 780

AATTCTAAGT ATGTTTCAGG ACTGAACCTT CCATTAAATA CAAAAGCAGT TGCACCTGAA 840

CCTAATACAG ATAAGAAAGG ACTCCATACC CATTTTTGCC AACCTGGTGc TGCTGTTGCC 900

CAAACTAAGT CATCTTCATT aATACATaAC CAATGTTTTG GTGCCATTTG TaAATGTGcA 960

AATCCCCaTC CATGACAATG TGTAACGGCT TTAGGATTGC CAGTTGTACC AGATGTATAT 1020

GACAGAATCG GCATATCATC ACGCGTCGTA TCTGCCATTT CTAGTTTGTT ACTTGCGTTT 1080

TCTTTTTCAG CTTCAAGTGA AATCCATCCA TCTTTTTGAC CGGCAATAAC AAATTTAGTT 1140

AACGCATCAT ATTCTTTAAT TTTTTCAAAT TCAACTGTGA ATGGCTCTAG TGCAATAACT 1200

GCATTAATTT CACCATGTGT GATACGGTAT TGTAAATCTT TAGTTCTTAG CATTTCAGAA 1260

CATGGAATGA TTGCAACACC TAATTTTAAA GCAGCAATAT ATAATTCATA CGTCGCAATA 1320

GATCGTGGCA TCATAATGAG TACTTTATCG CCTTTAGATA AACCGTGCGA TGCTAAAACA 1380

TTACCTACTT TATTAGACTG TTCAATGAGC TGTTGGTAAG TGACTGATAT ATCTTCGCCT 1440

TCAGTATTAT GATATAAAAT TGCCTTTTTA TCTGGTATGT GGCTATATTT TTCGATTTCC 1500

GAAATAATGT TATATTTTTC AGGCGCGAAT AGAGCTGACT TTTGCATAAC TAACTTCCTT 1560

TCATACATCC ACTTTTCCTG TGATGAACAT TGTAATTTTA TAAATGAATT ATATACATCA 1620

TACGCCTATC TTTACAGAAT TTTCAATTAA ATAGGGTTAA ATACCAAAGT CCTCGACACT 1680

ACACTTTGAC ATGACGTAGC ATTCAAGGAC TTTCAAATGA TTGAGGGTTG ATATCTCGGG 1740

CTAGACCATA TCAGCTAATT CAATACGAAT ATTGTATGAT AATTCACGAT TAATTATTTT 1800

TACATCTGCA CCTTTCGAAG TGCCACGATG CTTGTGTGTA TGCTTGTACT CAGCTGAATT 1860

TTGCCAATGA TAAAATGCTT GCCTATTTTC CCACAGCGTA ATAATGATAT AGTGTCTACC 1920

AGCTGTTCTA GGTCTTAAAA ACCTTAATGC TTTAAATCCA TCAACGTTTT TTAAATGCTT 1980
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10 






ATTTAACACA CATAATGAAT CATTTGATAA ATCATTTATC GCTTCTAGCA CATCGTAATA 2100 

TGCAGTGTCA TTATTTTTTT GTATTGTGAG ACAATCATCC AGTTCTTCTA TTACATAACT 2160

TCTATATTCA TCATAAATTT TCATAATAAA TGCCTTCATT TCATTTATAT TTTTGGTCAT 2220 

ATTACTkTAT ATCTATTACT AAtJcCATTCC CGTATXTATT AATTACAATC ATAGTTTGGC 2280 

TyCTTTTTAA AAGATAAGAC TTTGTAAAAA GTATTAATAT TTCATGCAAA TGGGGGACAG 2340

GAGTCGCCCA CTATTTTTGT GTCTTCAATT TCATGATCAT TATTTAACAT TAGTCATGAA 2400 

AATAGCCGAC 2410 
(2) INFORMATION FOR SEQ ID NO: 464: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 590 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 464:

TTTATTAATT GTAAAAAATT GAGTAAATTA TCTTTACATT CTAAATTAGT CTTAACTACA 60

ACTAGTATCC TAATAATTAT AGGAGCTATT ACATTCTTTT TATTAGAACA GTTTAATACT 120 

ATGCAACATA TGGGACTAGT TGAAAAAATC GGAAATTCTT TTTTCCAATC AGTAACAACA 180 

CGAACAGCGG GTTTTAACAG TATAGATATA GCAAGCATTA rCAAATCTAC CGCATtAaTG 240

TTAATGCTAC TTATGTTTAT TGGTGGTGCC CCTCTCAGTG CAGCTGGAGG AATTAAAATA 300 

ACTACTTTTG CAGTTGCGTT TATTTTTGTA CTAAATTATA kACGTAAAGA AAATAATGTT 360 

TCAGTATTCA ATAAAGAAAT ATCTGACAAA CATATAAAAC TATCTATTGT TACCATTAAT 420 

ATCTCATTTC TATTTATCAG CATCATTACT TTTATATTAT CGATAATTAA TCCGAACATA 480 

TCATTAATCA AGTTATTATT CGAAGTGGTT TCTGCATTCG GAACAGTAGG GTTAAGTATG 540 

AACCTTACCA CAGAATATCA TGGTATTACT AAAATAATTA TTATATTCGT 590 
(2) INFORMATION FOR SEQ ID NO: 465: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 905 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 465: 
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TATTCGATTT 


GAnTCTTTTA AAATTATGTT TAAAAACATG TCTAATGATT CCGCATCATT 120

TTTTGCACCA TCAATAAGCG TTTCAGcAAA CCCCTTAATT GAAGTAATAG GTGTTTTTAA 180

TTCATGTGAA ACATTTGCTA CAAATTCACG TCTTAGATTT TCAAGTTGTT TCAGATTTGT 240

TATATCATGC ATCACaACTA AAATCCCtTG CAAACTTTTT TGAGACCTAG TTAAAATCGG 300

AACGCATGAA ATATCAAAGT ACTTGGCATG GACTTGGTTT ATTGCAACTT CCAATTGTTC 360

ATAAATAGGT TTTTCAACTT TAAAACTTTC TAAAATTAAT TGCTCAATTT CAGTATTAAC 420

ATAGCCGTGA TAGCCTACTT GTTCAATATT ATGCGAGATG TTGAACTGTT CATAATACGC 480

TTTATTTGCA ACAACGATTT TTCCATTTCG ATCTATCATT AAAATAGCAC TTGGAATATT 540

TTCAATCGTT GTTTTTAAAC GGTTGGATTG AATTTTTTGC TCATTATTAA GCTTTTGAAG 600

AAGCGCTTTA GTTTCTACAA CATTACTTTC 660

AGGTACACGT ATGTGATAAT AACCATTTGC CAACAATTGT GTTGCATAAG TAACTTCTTG 720

AATGGGACGG ATTAATGTAC GCTTAAAACT ACGGCTTGCA AAATACAGAC AAATGAGTAC 780

AACTAAACAT GTCAAAATAA GATATTTCCA CAACGTCCAA TGCATTTCTG TAATATCGTT 840

ATTGTAACCT TTAATCCATA CATGATAACC GTTAACCTTC TTATtAAAAA TAAAAACGTC 900

CCTTT 905
(2) INFORMATION FOR SEQ ID NO: 466: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1016 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 466:



TTTGGTTAGC CCATAAAAGA AAAAAACAAG TAGTCATTTT TAAACAACAT ATCAAGTCTA 60

CCCAAGAAAT ACGTTTTGAC AAAGCGAAAG TGCTTGAACA CAAAGATGAA ATAGCAAATT 120

TTATTTCTTT CGAACCACAA AGTTTTGAAT TTTATTATTT TACAGAATCG GAATTTTCAG 180

AAGAACAATT AAATGAAGTT TCGCCAATTA GAATTAAATT CAATGTTATA AGACACACAA 240

AAGATTTGaT AAAGCATATG CCGAATATAT TTTTGGcTAG ACTTATTTCA GAAGATAATG 300

ATAAAAAGAC ATATATGTTC TATAAACGCA AAGTATTAAC CGATAACTTT TTAGATAAAT 360

ATATGCAGAA ATTTTCACCG GCAACATACA CAATAATATT TGTAAATGTC TTAATATGGT 420

TATGTATGAT TTTATATTTA aataattttt CGGATGTAAA ATTATTAGAT GTTGGCGGGT 480
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10 






ATTTTAGTTT TGAACATATA CTTATGAATA TGCTTTCATT ATTTATTTTT GGTAAAATAG 600 

TCGAAGCAAT TATTGGTTCA TGGCGGATGT TAACTGTATA CTTTATTGCA GGGTTGTTTG 660 

GAAACTTTGT ATCACTATCA TTTAATACGA CTACAATTTC AGTTGGGGCT AGTGGTGCTA 720 

TATTTGGTCT GATTGGATCA ATTTTTGCGA TGATGTATGT TTCAAAAACA TTTAACAAAA 780 

AAATGTTAGG ACAGTTATTA ATTGCATTAG TGATATTAGT TGGTGTTTCT CTGTTTATGT 840

CAAATATAAA TATTGTGGCG CATATTGGAG GATTCATTGG TGGTTTATTA ATAACTTTAA 900 

TTGGCTATTA CTATAAAGTG AATCGtAATA TTTTTTGGaT TTaCtAATTG GTATGCtTGt 960 

tATATTTAwT GCACyTCltlAA TTAgAtTTTT ACmATTAAAG AAGATAATAw TTATAA 1016 
(2) INFORMATION FOR SEQ ID NO: 467: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 467:

AACTTTAAAT TTAGACATCT TTAAAACCTC TCTTAAACCA TGCCTATATC TCAAGATGAT 60 

ATTTCAAATG AACAATACTA TTGCTTGAGA CCATTAATGA ATGATCATAA ATATTTCTTT 120 

CTATAAAATT AGCTTTCCAA TAACTGTGTT GTTGCATAAT ATCATTCACA AGTACACCAT 180

TTTCGGAAGT ATGATTATCT TTATCTATAC TTAAAACAAT TTGTTTAGTT TTAGCATGGC 240 

TAAATTGTTG AAGACCCTTA CACGATAAAC GTATAGCGTC TGAATTCTCA TTTAACAATG 300 

CGGCTGGgCA AACAAATGAC ACATTGTACT TCATGTTTGA ACtTCGTTAC AATCaTCGTG 360 

kCatTTTGat AAATAACAAT CCCTCGTAAT kGATTAAGTA TATTAT 406
(2) INFORMATION FOR SEQ ID NO: 468: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1378 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 468: 
AGATAATACA CTTGAAGTTG GAATGGTTTG TGACGGTTAT TTAATGCGAA TTGAAAACTT 60

AACACCATCA AATTTCTTCA ACTCAGCAAG TGAAGATACG ATTACTAAAA TTAAATTAAA 120 



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5 



AGGTACAGCG TTAAAACTAA GAGAAGCCAT CAATTATGAT GAAATGGTTA TTGTAGATAG 240

TATGACGTAG TTCCTAATTA TGCkAAAAGG GATTGATGAA AAACTGAAGG GCTTTTCATC 300

AATCCCTTTT ATTTTAGGGG AATTGAATAG ATAGTTTTAA ACTATACGAA TTATTAATAT 360

TTGAGATTTA ATTGAAATAA GTTTTAAAAA TTGGAGGAGA TAGATTAAGC GAAGTCATTT 420

AAAGGTGAAG TTAAGTGTAT TCACAAAAAn TAGCCACACT CATATGACAT CGGATGAGTG 480

TGGCTTAAGG ATCTATGGGG GGAGGAAnCC ATAGATGTTT ACTTTGATAG GCCAGATTAA 540

ATATCAAAGT ATGCGATTAT TTATAGCTTG ATGCAAAAGT GGTATGCCTA TTAAAAGTTA 600

CTGCACATAG CTTTTAATAT TCCGTTCAAA GGAAAGGGGC ATACAATTGA ACAATCTGTA 660

ATAGTACTTT TAACCAGCTA TGCTAAAAGT CTAGTAGGGA GAACAGTTGT CCAATCACAT 720

AAGAACCTCT AACTTCGTTA GTACGATTAA GAAAAGCTTT TTAGTTAGTA TGTAATACAA 780

TTTATTGACG CGCGTGAATC TCTTTTATAA GAGTGTGTAG GGAATGGCGT TGTATAAATT 840

GTATTAGAAG AACTTCTAAC GCATCTCTGT GGTTAAAAGA GATGAAGGGA ACGACAGTTT 900

aATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTCTCGT TCAAAGAGAA GCAGcTGTTC 960

GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGATAA 1020

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAgAG 1080

AAGCAGCTGT TCGCAgTTTA ATCAAAACCA CATAAAGCTT TTAACTTTAC TCTTTGATTT 1140

AAAGAGTGAC AAATGTTTAC AGTTTAATTA AAACTGCATA AGAACTTCTA GCTTTTCTCT 1200

TTCGTTCAAA GAGAAGTTCT AATACCACCA TATCGTGCGA TCGGGAACGG TATATATATT 1260

AATAGGAGGG TAATATATAT TTAACGCACG ATATGGGACT ATTAGCCTTC GACTTTGTTA 1320

TGTTGATGTG TGGCCTAAAA TATTGGAGAT ACCAATATTT TAGGTTGCAT CAACATCA 1378



(2) INFORMATION FOR SEQ ID NO: 469: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 4171 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 469:
TCCCAACCAA TAATCGTGGC AAAAATACGG ATATTGGTAT GGCTTAACAA ATTGCAAATA 60

TCGTTTAATC ATACATCCCC CCTAATCTAT TGCCCTATCC TATTCaTAAG CATAAAAATG 120

AATAGAGGTT GGATACATAA TTTGTAGATG TAAATTCTTC TTACAATTTA CATTTTTAAA 180
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TTATATTCTA TTCAATTTAA TCTATGGATA CTGTGTCCCC ACACGACAGC AAAAGTTATC 300 

ATACTTCTTT ACATCACTAA GTCAATATAA ATGATTTAAT CAGTATTTAC ACTTTATTTG 360

CTTAATACTG TCTAATTTTT TTGTAACGTT CTTTCCAAAC TTTGATAAAA TCTGGCGCGA 420

ATGGGCCCTT CTfCTGTTCT ATCCATTGTT GAAGAATGTC CACGTTGCGT CTTAAAATAA 480

TATCAATATC ATGCGGATAA TTCATTTGAT TCATATGTTG CTCATATTCA TCTTCATCTA 540

ATAAATGATA CTTTCCGTTT GGATATACTT TAATATCTAA ATCATAGTCT ATATATTTTA 600 

ATGCCTCTTC ATCACAAACA AATGGTGATG ACAAATTGCA ATAGTAATAA ATTCCATCTT 660 

CTCTAAACAT GCAGATAACA TTaAACCAAT ATTCTGAGTG AAAGTaAACA ATTGCCGGTT 720 

CACGTGTTAT CCAAGTTCTT CCGTCACTTT CAGTCACTAA CGTATGATCA TTTCCACCAA 780

TGACAACATG ATCAGTACCC TTTAATATTG TTGTTTCAGA CCAAACGCGA TGAATCTTAC 840

CATCATGTTT ATAACTCTGA ATTTTAATGT TTTCCCCTTC TTTAGGTATG GATTCTCTGA 900

CCATACTCCA CACCACCTTC TGTTAATTTA ACCATTATAA ATTATAGCAT ATTTCAGAAA 960

TAGTATTATA TAAATACATA TTTTTACGAA ATAAGATTTT ACTACTTAAT AATTAAACTC 1020 

GGTAATATTG CTAAGTACTA CAACAGAGAT TTACATGTCC CATTTAAAGT ATATAAAATC 1080

ATCACTTTTA TATATCAACA CTTTAACTTT TTGACATTGT TATTCTATGA GATTTAAAGA 1140

TATCATTTAT ACTTTTTAAA ATTAATGTCA CTATGTTTTC CGATAATATT ACCAATCATC 1200 

GAATGTTACC CATTTATAAA TTGATAAATs TTTGACATAG GTACAGGGAA TGTATATTGA 1260

TCTCGATCAC TTAAATCAAA CCAAATCATG TCATCTGGTA ATGTTTCAAT GTTAATTGCT 1320 

CCTGAAACGG CGTATACTTT AATCTTCCAT GTTAAATGAG TAAATTGATG CTTCAACTCA 1380 

AAAATAGGTG TTTCTACTGG TTGAATGTCA TGACCGATTT TTTCAGTCAT TTTACGTCTA 1440 

GCATGCTCAC TTTCAAACAT AGGAAATTGC CACATACCAT GCAATAATTT TTCGCTACGC 1500 

TTTTGCAACA GATATTGACC TTGATTATTT CTAATTAAAA AGACGGATTG CTCAATTACT 1560 


TTTTTACTTA CATTTTTAGA TTTAACAGGT AACTTTTCAA ATGTACCTTT ATCAAATGCC 1620 

TCACAGTTTT CTTGAACTGG ACAAAATAAG CATAATGGAT TTTTTGGTGT ACAAATTAAC 1680 

GCCCCTAATT CCATCATAGC TTGATTAAAC GTTCCAGCTT CTGTAGTAAC ATACGGTAAT 1740 


AATTCTTGTT CGTACGATTT CCTCGTCGAT TGTAATTTAA TATCTCGATA GTCATCATTC 1800 

AATCTAGACC ATACTCGAAA AACATTTCCG TCTACAGTTG CTAGTGGTAC ATTATATGCA 1860

ATGCTCATTA CTGCAGCTTG TGTGTATGGG CCAACACCTT TTAACGCTTT AAATTGATCA 1920

GGATCTTTGG GAACTAAGCC TTCATATTTA TCATGAACTT CTTTAATCGC CGTATGAAAA 1980
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GCTTGaCTCA AAACTTCCAC AGTTGGAAAT CGTTCAACAA AACGATGATA ATAGTCAATA 2100 

ACTGTTTTAA CTTGTGTCTG TTGTAACATG ACCTCACTTA ACCAAATATA GTACGGATTG 2160 

GTCGTTTGTC GCCATGGCAT TTCTCTTTGA TTTTCATCAA ACCAGTGTAT CAAATTTTCT 2220

TTAAAACTAG ACTGCTGATA CATTTATAAA ACCCTTTCCT CACCAAAATT AATTGTCTTT 2280 

ACTCATAATG TTTTTATTGT ACATTAAAAT CATGGTTAGT ATGTAAGTTA ATTTAGTTAT 2340

TTGCGAAATT GGATTATAAT AGTATATATA ATATTATGAA ATGAGTGAAC TGATATGGAC 2400 

ACTGCAACAC ATATCGCAAT TGGGGTGGGC CTTACAGCAC TTGCAACTCA AGATCCAGCA 2460 

ATGGCTTCTA CGTTTGGTGC AACAGCTACA ACCCTTATCG TTGGTTCATT AATTCCTGAT 2520 

GGGGATACTG TTCTTAAATT AAAGGACAAT GCAACATATA TTTCGCATCA TAGAGGTATC 2580 

ACGCATTCCA TCCCTTTCAC AATACTATGG CCAATTTTAA TTACATTTTT AATATTCACG 2640 

TTCTTTAGTG GAACCAACCC ATTTCATGTA TGGATGTGGG CTCAGCTCGC AGTATTTTTA 2700

CATGTCTTTG TAGATATATT CAATTCTTAT GGTACACAAG CGCTTAGACC TATCACAAAC 2760 

AAATGGATTC AATTAAGTGT GATTAACACA TTTGACCCTA TTATTTTCAC AGTTCTTTGT 2820 

ATTGGTATTG TATTATGGGT TATAGGCTTG CATCCATTTG CAGTCTTCTT TCCTATAATC 2880

GCTTTACTAA TCATTTATTA CATGATTCGT TTTAAAATGA GAGCCGTAAT TAAGCAACAA 2940

GCTTTAAAAG CAATTCAACA AGAGCATCAC CCTGTTAAAG TATTTGTTGC GCCAACAATA 3000

AAATTTATGG AATGGCGTGT CGCGATACAA ACTGATGCAC ATGACTATGT TGGAAAAGCA 3060

TATGGTAGGA ATGTGGTGTT TAGTGATAAA GTGGAACGTC AAACATTATC AACAGACTCC 3120 

ATTTTATGGA AAGTCAAAGG TAATAAAGAT ATACGTACTT TTTTAAACTT TTCATCAATC 3180 

TATCGTTGGC AAACAACAAC GTTAGCAGAT GGTTCTACTG AAATTCGTTT GATTGATTTG 3240

CGTTATTTAA AAAATGATCA TTATTCATTT GTGGCAATTG CACATGTAAC AAACGATAAT 3300 

GTCATAGACC ACTCTTATAT TGGCTGGGTA TTTACAGAAG ATAAGTTACA ACGTAAACTG 3360 


TATGCTAAAT AATTTCAAGT TATTATTCAC TAAAGTTAAT CTATAAAAAA TGAACAACCG 3420 

GGCAGAATGA AAATCAAAAC GATTTTTACT CTGTCCGGTT TTTTAATGTA AAACTATGAA 3480 

TGCTTTTACA AAATCTAAAA TTTATATTGT TGCTAACAAA CTACCTTTAA TGACTCGAAA 3540

TATCAAAATC AGTATAGGAA AACAATATCT AGATGATATT CTAATTGTTT CTGATTCTCA 3600

CAGATTAATT TACACAACAG GTCAGCTAAA CATCATGAAG AAGTATCCGC CTCGTCTGTA 3660

CTATCATTTG AAACATCCTG TTGATTATCa GTTTGTGTCA CTTTAGATGT TTTATTaTAA 3720

ATTGCGTGTG TCGTATACCT TGCTAATACG AAATTTACTG CAGCTATTAA ACATAAGAAT 3780 
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TGTGCTATAC CATTAACAAT GTAATACATT GGATTTAGCA TTAGGATGTG ATTGATAAAT 
ACATGATTTG GATTTGGTAT GAAAATAATT GGTAACAATA AGAAACACAA TACACAAACC 
CCATAAAATA TGATATTTAT TTTTTCAGwT AACAGTCGAA TAAGACCAAA AGTAACGGAT 
ATTAATCCTA CAAAAATAGT TGCCATCACA ATAAAATAGA ATAGCGCTAT ATATGATGTT
TCGAAGTTTA CTGGTTTAAC CAATGCACTA ATCATCGTCA AAATGACTAG CATAATAAAA 
CTTAAAATAG ACATAATAAC TACTGGCGTC G 
(2) INFORMATION FOR SEQ ID NO: 470: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 9621 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 470:
TGGTTGAAGT AGCAGTTAAT TCAAAATCTG CAACAGTTTC AGCAGAATaG GGGCTTTCAA 
AATAAtCAAA GGAGAATAAT TTATGACTAA AACTTTAAAG GTTTATAAAG GAGACGACGT
CGTAGCTTCT GAACAAGGTG AAGGCAAAGT GTCAGTAACT TTATCTAATT TAGAAGCGGA 
TACAACTTAT CCAAAAGGTA CTTACCAAGT GGCATGGGAA GAAAATGGTA AAGAATCTAG 
TAAAGTTGAT GTACCTCAAT TCAAAACCAA TCCAATTCTA GTCTCAGGCG TATCATTTAC 
ACCAGAAACT AAATCAATTA TGGTAAATAC CGATGACAAT GTTGAGCCAA ACATTGCACC 
AAGCACAGCA ACGAATAAAA TATTGAAATA TACAAGTGAA CATCCAGAAT TTGTTACTGT 
AGATGAAAAT ACAGGAGCAA TTCACGGTGT AGCTGAAGGT ACTTCAGTAA TCACTGCTAC 
GTCTACTGAT GGAAGCGATA AGTCAGGACA AATTTCAGTG ACAGTAACAA ACGGATAGGG 
ATTTAAGGCG CAGTATATCT GCGTCTTTTT TATTTGAATA AAAGGAGCTA ATACAATGAT 
TAAATTTGAA ATTAAAGATC GTAAAACAGG AAAAACAGAG AGCTATACAA AAGAAGATGT 
AACAATGGGC GAACAGAAAA ATGCTATGAG TATTTAGAAT TAGTAAATCA AGAGAATAAA 
AAAGAAGCAC CTAACGCAAC AAAAATGAGA CAAAAAGAGC GACAGTTATT AGTAGATTTA 
TTTAAAGATG AAGGATTGAC TGAAGAAGAT GTTCTGAACA AGATGAGTAC TAAAACTTAT 
ACAAAAGCCT TACAAGATAT ATTTCGAGAA ATCAATGGTG AAGATGAAGA AGATTCAGAA 
ACTGAACCAG AAGAGATGGG AAAGACAGAA GAACAATCTC AATAAAAGAC ATTTTATCGA 
ACATTAAGAA AATACAACGT TTCTGTATGG AGCAGTATGG GTGGACATTA ACTGAAGTCA 
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AAGAAAAACA AAGTGAACAA AAAGTCATTA CAGGTACGGA TTTAAGAAAA CTTTTTGGAA 1140

GCTAGAAAGG AGGTTAATAT GAATGAAAAA GTAGAAGGCA TGACCTTGGA GCTGAAATTA 1200

GACCATTTAG GTGTCCAAGA AGGCATGAAA GGTTTAAAGC GACAATTAGG TGTTGTTAAT 1260

AGTGAAATGA AAGCTAATCT GTCAGCATTT GATAAGTCTG AAAAATCAAT GGAAAAATAT 1320

CAGGCGAGAA TTAAGGGGTT AAATGATAGG CTTAAAGTTC AAAAAAAGAT GTATTCTCAA 1380

GTAGAAGATG AGCTTAAACA AGTTAACGCT AATTACCAAA AAGCTAAATC CAGTGTAAAA 1440

GATGTTGAGA AAGCATATTT AAAGTTAGTA GAAGCCAATA AAAAAGAAAA ATTAGCTCTT 1500

GATAAATCTA AAGAAGCCTT AAAATCATCG AATACAGAAC TTAAAAAAGC TGAAAATCAA 1560

TATAAACGTA CAAATCAACG TAAACAAGAT GCGTATCAAA AACTTAAACA GTTGAGAGAT 1620

GCAGAACAAA AGCTTaAGAA TAGTAACCAA GCTACTACTG CACAACTAAA AAGAGCAAGT 1680

GACGCrtACA GAAGCAGTCC GCTAAGCATA AAGCACTTGT TGAACAATAT AAACAAGAAG 1740

GCAATCAAGT TCAAAAACTA AAAGTGCAAA ATGACAATCT TTCAAAATCA AATGATAAAA 1800

TTGAAAGTTC TTACGCTAAA ACTAATACTA AATTAAAGCA AACAGAAAAA GAATTTAATG 1860

ATTTAAACAA TACTATTAAG AATCATAGCG CTAATGTCGO AAW^GGTGAA ACAGCTGTTA 1920

ATAAAGAAAA AGCTGCTTTA AATAATTTGG AGCGTTCAAT AGATAAAGCT TCATCCGAAA 1980

TGAAGACTTT TAACAAAGAA CAAATGATAG CTCAAAGTCA TTTCGGTAAA CTTGCAAGTC 2040

AAGCGGATGT CATGTCAAAG AAATTTAGTT CTATTGGAGA CAAAATGACT TCCCTGGGAC 2100

GTACAATGAC GATGGGCGTA TCTACACCGA TTACTTTAGG TTTAGGTGCA GCATTAAAAA 2160

CGAGTGCAGA CTTTGAAGGG CAAATGTCTC GAGTTGGAGC GATTGCACAA GCAAGCAGTA 2220

AAGACTTAAA AAGCATGTCT AATCAAGCGG TTGACTTAGG AGCTAAAACA AGTAAAAGTG 2280

CTAACGaAGT TGCTAAAGGT ATGGAAGAAT TGGCAGCTTT AGGCTTTAAT GCCAAACAAA 2340

CAATGGAGGC TATGGCAGGT GTTATCAGCG CAgcAGaAGC AAGTGGTGCA GAAATGGCTA 2400

CAACTGCAAC TGTAATGGCT TCAGCGATTA ACTCTTTCGG TTTAAAAGCA TCTGATGCAA 2460

ATCATGTTGC TGATTTACTT GCGAGATCAG CAAATGATAG TGCTGCAGAT ATTCAATATA 2520

TGGGAGATGC ATTAAAATAT GCAGGTACTC CAGCAAAAGC ATTAGGAGTT TCAATAGAGG 2580

ACACTTCTGC AGCAATTGAA GTTTTATCTA ACTCAGGTTT AGAGGGGTCT CAAGCAGGTA 2640

CTGCATTAAG AGCTTCGTTT ATTAGGCTAG CTAATCCAAG TAAAAGTACA GCTAAGGAAA 2700

TGAAAAAATT AGGTATTCAT TTGTCTGATG CTAAAGGTGA GTTTGTTGGA ATGGGCGAAT 2760

TGATTAGACA GTTCCAAGAT AACATGAAAG GCATGACGAG AGAACAAAAA TTAGCAACAG 2820
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CAGATAAAAT TAATAGCTAT AGCAAATCAT TGAAGAACTC TAATGGTGAA AGTAAAAAAG 2940

CAGCTGATTT GATGAAAGAT AACCTCAAAG GTGCTCTGGA ACAATTAGGT GGCGCTTTTG 3000

AATCGTTAGC AATTGAAGTT GGTAAAGATT TAACGCCTAT GATTAGAGCA GGTGCGGAAG 3060

GATTAACAAA ATTAGTTGAT GGATTTACAC ATCTTCCTGG TTGGGTTAGA AAGGCTTCGG 3120

TAGGCTTAGC AATTTTTGGT GCATCTATTG GTCCTGCTGT TCTTGCTGGT GGCTTATTAA 3180

TACGTGCAGT TGGGAGCGCG GCTAAAGGCT ATGCATCATT AAATAGACGC ATTGCTGAAA 3240

ATACAATTCT TTCTAATACC AATTCAAAAG CAATGAAATC TTTAGGTCTT CAAACATTAT 3300

TTCTTGGTTC TACAACAGGA AAAACGTCAA AAGGCTTTAA AGGATTAGCC GGAGCTATGT 3360

TGTTTAATTT AAAACCTATA AATGTTTTGA AAAATTCTGC AAAGCTAGCA ATTTTACCGT 3420

TCAAACTTTT GAAAAACGGT TTAGGATTAG CCGCAAAATC CTTATTTGCA GTAAGTGGAG 3480

GCGCAAGATT TGCTGGTGTA GCCTTAAAGT TTTTAACAGG ACCTATAGGT GCTACAATAA 3540

CTGCTATTAC AATTGCATAT AAAGTTTTTA AAACCGCATA TGATCGTGTG GAATGGTTCA 3600

GAAACGGTAT TAACGGTTTA GGAGAAACTA TAAAGTTTTT TGGTGGCAAA ATTATTGGCG 3660

GTGCTGTTAG GAAGCTAGGA GAGTTTAAAA ATTATCTTGG AAGTATAGGC AAAAGCTTCA 3720

AAGAAAAGTT TTCAAAGGAT ATGAAAGATG GTTATAAATC TTTGAGTGAC GATGACCTTC 3780

TGAAAGTAGG AGTCAACAAG TTTAAAGGAT TTATGCAAAC CATGGGCACA GCTTCTAAAA 3840

AAGCATCTGA TACTGTAAAA GTGTTGGGGA AAGGTGTTTC AAAAGAAACA GAAAAAGCTT 3900

TAGAAAAATA CGTACACTAT TCTGAAGAGA ACAACAGAAT CATGGAAAAA GTACGTTTAA 3960

ACTCGGGTCA AATAACAGAA GACAAAGCAA AAAAACTTTT GAAAATTGAA GCGGATTTAT 4020

CTAATAACCT TATAGCTGAA ATAGAAAAAA GAAATAAAAA GGAACTCGAA AAAACTCAAG 4080

AACTTATTGA TAAGTATAGT GCATTCGATG AACAAGAAAA GCAAAACATT TTAACTAGAA 4140

CTAAAGAAAA AAATGACTTG CGAATTAAAA AAGAGCAAGA ACTCAATCAG AAAATCAAAG 4200

AATTGAAAGA AAAAGCTTTA AGTGATGGTC AGATTTCAGA AAATGAAAGA AAAGAAATTG 4260

AAAAGCTTGA AAATGAAAGA CGTGACATCA CTGTTAAAGA ATTGAGTAAG ACTGAAAAAG 4320

AGCAAGAGCG TATTTTAGTA AGAATGCAAA GAAACAGAAA TGCTTATTCA ATAGACGAAG 4380

CGAGCAAAGC AATTAAAGAA GCAGAAAAAG CAAGAAAAGC AAGAAAAAAA GAAGTGGATA 4440

AGCAGTATGA AGATGATGTC ATTGCTATAA AAAATAACGT CAACCTTTCT AAGTCTGAAA 4500

AAGATAAATT GTTAGCTATT GCTGATCAAA GACATAAGGA TGAAGTAAGA AAGGCAAAAT 4560

CTAAAAAAGA TGCTGTAGTA GACGTTGTTA AAAAGCAAAA TAAAGATATT GATAAAGAAA 4620
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GTTGGTGGTC TAACTTTAGA GAAGACCAAA AGAAGAAAAG TGATAAATAC GCTAAAGAAC 4740

AAGAAGAAAC AGCTCGTAGA AACAGAGAAA ATATAAAGAA ATGGTTTGGA AATGCTTGGG 4800

ACGGCGTAAA AACTAAAACT GGTGAAGCCT TTAGTAAAAT GGGCAGAAAT GCTAATCATT 4860

TTGGCGGCGA AATGAAAAAA ATGTGGAGTG GAATCAAAGG AATTCCAAGC AAATTAAGTT 4920

CAAGTTGGAG CTCAGCCAAA AGTTCTGTAG GATATCACAC TAAGGCTATA GCTAATAGTA 4980

CTGGTAAATG GTTTGGAAAA GCTTGGCAAT CTGTTAAATC GACTACAGGA AGTATTTACA 5040

ATCAAACTAA GCAAAAGTAT TCAGATGCCT CAGATAAAGC TTGGGCGCAT TCAAAATCTA 5100

TTTGGAGAGG CACATCAAAA TGGTTTAGCA ACGCATATAA AAGTGCAAAG GGTTGGCTAA 5160

TAGATATGGC TAATAAATCG CGCTCGAAAT GGGATAATAT TTCTAGTACA GCATGGTCGA 5220

ATGCAAAATC CGTTTGGAAA GGAACATCGA AATGGTTTAG TAACTCATAC AAATCTTTAA 5280

AAGGTTGGAC TGGGGATATG TATTCAAGAG CCCACGATCG TTTTGATGCA ATTTCAAGTT 5340

OGGCATGGTC TAACGCTAAA TCAGTATTTA ATGGTTTTAG AAAATGGCTA TCAAAAACAT 5400

ATGATTGGAT TAGAGATATT GGTAAAGACA TGGGAAGAGC TGCGGCTGAT TTAGGTAAAA 5460

ATGTTGCTAA TAAAGCTATT GGCGGTTTGA ATAGCATGAT TGGCGGTATT AATAAAATAT 5520

CTAAAGCCAT TACTGATAAA AATCTCATCA AGCCAATACC TACATTGTCT ACTGGTACTT 5580

TAGCAGGAAA GGGTGTAGCT ACCGATAATT CAGGAGCATT AACGCAACCG ACATTTGCTG 5640

TATTAAATGA TAGAGGTTCT GGAAACGCCC CAGGTGGTGG AGTTCAAGAA ATAATTCACA 5700

GGGCTGACGG AACATTCCAT GCACCCCAAG GACGAGATGT GGTTGTTCCA CTAGGAGTTG 5760

GAGATAGTGT AATAAATGCC AATGACACTC TGAAGTTACA GCGGATGGGT GTTTTGCCAA 5820

AATTCCATGG TGGTACGAAA AAGAAAAAAT GGATGGAACA AGTTACTGAA AATCTTGGTA 5880

AAAAAGCAGG GGACTTCGGT TCTAAAGCTA AAAACACAGC TCATAATATC AAAAAAGGTG 5940

CAGAAGAAAT GGTTGAAGCG GCAGGCGATA AAATCAAAGA TGGTGCATCT TGGTTAGGCG 6000

ATAAAATCGG CGATGTGTGG GATTATGTAC AACATCCAGG GAAACTAGTA AATAAAGTAA 6060

TGTCAGGTTT AAATATTAAT TTTGGAGGCG GACTAACGCT ACAGTAAAAA TTGCTAAAGG 6120

CGCGTACTCA TTGCTCAAAA AGAAATTAGT AGACAAAGTA AAATCGTGGT TTGAAGATTT 6180

TGGTGGCGGA GGCGATGGAA GCTATCTATT TGACCATCCA ATTTGGCAAA GGTTTGGGAG 6240

TTACACAGGT GGACTTAACT TTAATGGCGG TCGTCACTAT GGTATCGACT TTGGTATGCC 6300

TACAGGAACG AACATTTATG CTGTTAAAGG CGGTATAGCT GATAAAGTAT GGACTGATTA 6360

CGGTGGCGGT AATTCTATAC AAATTAAGAC CGGTGCTAAC GAATGGAACT GGTATATGCA 6420
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ATCAGGTGCT ACAGGTAATT TCGTTAGAGG AGCACACTTA CATTTCCAAT TGATGCAAGG 6540

GTCGCATCCA GGGAATGATA CAGCTAAAGA TCCAGAAAAA TGGTTGAAGT CACTTAAAGG 6600

TAGTGGCGTT CGAAGTGGTT CAGGTGTTAA TAAGGCTGCA TCTGCTTGGG CAGGCGATAT 6660

ACGTCGTGCA GCAAAACGAA TGGGTGTTAA TGTTACTTCG GCTGACGTAG GAAATATCAT 6720

TAGCTTGATT CAACACGAAT CAGGAGGAAA TGCAGGTATA ACTCAATCTA GTTCGCTTAG 6780

AGACATCAAC GTTTTACAGG GCAATCCAGC AAAAGGATTG CTTCAATATA TCCCACAAAC 6840

ATTTAGAGAT TATGCTGTTA GAGGTCACAA CAATATATAT AGTGGTTACG ATCAGTTATT 6900

AGCGTTCTTT AACAACAGAT ATTGGCGCTC ACAGTTTAAC CCAAGAGGTG GTTGGTCTCC 6960

AAGTGGTCCA AGAAGATATG CGAATGGTGG TTTGATTACA AAGCATCAAC TTGCTGAAGT 7020

GGGTGAAGGA GATAAACAGG AGATGGTTAT CCCTTTAACT AGACGTAAAC GAGCAATTCA 7080

ATTAACTGAA CAGGTTATGC GCATCATCGG TATGGATGGC AAGCCAAATA ACATCACTGT 7140

AAATAATGAT ACTTCTACAG TTGAAAAATT GTTGAAACAA ATTGTTATGT TAAGTGATAA 7200

AGGAAATAAA TTAACAGATG CGTTGATTCA AACTGTTTCT TCTCAGGATA ATAACTTAGG 7260

TTCTAATGAT GCAATTAGAG GTTTAGAAAA AATATTGTCA AAACAAAGTG GGCATAGAGC 7320

AAATGCAAAT AATTATATGG GAGGTTTGAC TAATTAATGC AATCTTTTGT AAAAATCATA 7380

GATGGTTACA AGGAAGAAGT AATAACAGAT TTTAATCAGC TTATATTTTT AGATGCAAGG 7440

GCTGAAAGTC CAAACACCAA TGATAACAGT GTAACTATTA ACGGAGTAGA TGGTATTTTA 7500

CCGGGCGCAA TTAGTTTTGC GCCTTTTTCA TTAGTATTAA GGTTTGGCTA TGATGGTATA 7560

GATGTTATAG ATTTAAATTT ATTTGAGCAT TGGTTTAGAT CTGTGTTTAA TCGCAGACAT 7620

CCTTATTATG TTATTACTTC TCAAATGCCT GGTGTTAAAT ATGCAGTGAA TACAGCTAAT 7680

GTTACATCTA ATTTAAAAGA TGGTTCTTCA ACTGAAATTG AAGTAAGTTT AAATGTTTAT 7740

AAAGGGTATT CTGAATCAGT TAATTGGACC GATAGCGAGT TCTTATTCGA CTCTAATTGG 7800

ATGTTTGAAA ATGGAATTCC TCTTGATTTC ACACCTAAAT ATACTCATAC ATCAAATCAA 7860

TTTACTATTT GGAACGGTTC TACTGATACG ATAAATCCAC GATTCAAGCA CGATTTGAAA 7920

ATATTAATTA ATTTAAATGC GAGTGGAGGA TTTGAACTGG TTAATTATAC AACAGGTGAT 7980

ATTTTTAAGT ACAACAAAAG TATAGATAAA AACACTGATT TTGTTTTAGA TGGTGTGTAT 8040

GCATATCGAG ATATAAACAG AGTGGGAATT GATACAAATA GAGGCATTAT AACATTAGCG 8100

CCAGGTAAAA ATGAATTTAA GATTaAAGGA GACGTCAGTG ATATTAAAAC TACATTTAAG 8160

TTTCCTTTTA TTTATAGGTA GGTGATTTAA TGGATTATCA TGATCATTTA TCAGTAATGG 8220






EP 0 786 519 A2 

ATTATGAACT GAATGAAGCT AGGTACATCA CCTTTACAGT TTATAGAACT ACTCATAATA 8340

GTTTTGTTTT TGATTTATTG ATTTGTGAAA ACTTCATAAT TTATCATGGT GAAAAATACA 8400

CAATTAAGCA GACAGCGCCA AAGGTTGAAG GTGATAAAGT TTTTATTGAA GTTACGGCAT 8460

ATCACATAAT GTATGAATTT CAAAATCACT CAGTGGAATC AAATAAGCTT GATGACGACA 8520

GTAGCGAAAC TGGTAAAACG CCAGAATACT CTTTAGATGA GTACTTAAGA TATGGATTTG 8580

CAAATCAAAA AACGTCAGTC AAGATGACCT ATAAAATAAT TGGAGATTTT AAAAGAAAAA 8640

TACCAATTGA TGAATTAGGT AATAAAAATG GCTTAGAATA TTGTAAAGAA GCAGTAGATT 8700

TGTTTGGTTG TATTATTTAT CCAAATGATA CGGAGATATG TTTTTATTCT CCTGAAACAT 8760

TCTATCAAAG AAGCGAAAAA GTAATAAGGT ATCAATATAA TACTGATACT GTGTCTGCTA 8820

CTGTCAGTAC GTTGGAATTA AGAACAGCTA TAAAAGTTTT TGGGAAAAAG TACACAGCCG 8880

AGGAAAAGAA AAATTATAAT CCTATTAGAA CAACTGACAT TAAATATTCA AATGGTTTTA 8940

TAAAAGAAGG TACTTATCGT ACCGCAACAA TTGGGTCTAA AGCTACTATT AACTTTGATT 9000

GCAAGTATGG TAATGAAACA GTTAGATTTA CAATAAAaAA GGGCTCTCaA GGTGGAATAT 9060

ATAAGTTGAT TTTAGACGGC AAGCaAATTA AGCaAATTTC TTGTTTTGCT AAGTCGGTTC 9120

AGTCTGAmAC AATAGATTTA ATAaAAAATA TTGATAAAGG CAAGCACGTT TTAGAAATGA 9180

TATTTTTrGG AGArGrCCCC AAAAATAGAA TTGATATATC TTCAAATAAA AAAGCTAAGC 9240

CTTGTATGTA TGTTGGAACT GAAAAATCAA CAGTCTTAAA TTTAATTGCT GACAACTCAG 9300

GTCGCAATCA ATACAAAGCA ATTGTTGaCT ACGTCGCAGA TAGTGCAAAG CAGTTTGGGA 9360

TTCGATATGC TAATACGCAA ACAAATGAAG ATATCGAAAC ACAGGATAAG CTGTTAGAAT 9420

TTGCAAAAAA GCAAATAAAT GATACTCCTA AGACTGAATT AGATGTTAAT TATATAGGTT 9480

ATGAAAAAAT AGAGCCAAGA GATAGCGTAT TCTTTGTTCA TGAATTAATG GGATATAACA 9540

CTGAATTAAA GGTTGTTAAA CTTGATAGGT CACATCCATT TGTAAACGCA ATAGATGAAG 9600

TGTCTTTCAG CAATGAAATA AAGGATATGG TACAAATTCA ACAAGCGCTT AACAGACGAG 9660

TTATTGCACA AGATAATAGA TATAACTATC AAGCAAATCG TATAAATCAT TTATACACTA 9720

GTACTTTGAA TTCTCCTTTC GAGACAATGG ATATAGGGAG TGTATTAATA TAATGGCAAC 9780

AGAAGAAGTT AAAATCAAAG CGCTACTTGA AAACGATAAA C 9821
(2) INFORMATION FOR SEQ ID NO: 471:






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1017 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 471:

TTATTGTTTT CCAAATGGTC TTCTCGGTTA TTTAACCATC GGTTTCAATC CTACGTGCTT 60

CACTTTCTAT CtATTAATTC AaTTTCAGCT TGACCACCCG CTGTATAAAG GGTCAAAGTT 120

GCTAATCGAT AGCGTCTCAT TATAGGACCA ACATCGATAT CAATATTTTG AATACGAAAA 180

TATGGTATTA CCTTTTCATC CAAAAATAAA ATGCCGTTTC GTACACGCAA ATGGTGTTTT 240

TCAAATGCAT ATCTGCAGTG CTTATATCGA TAGACTGGCG CTATAACAAG CGTGAAAACA 300

GCAACAAGTA ATATTATAAT CACACTACTA ACAATGGATA AATGGTTATC TAAAATCTCC 360

CAAAATAGCC AGTTCAAAAT ATTAAATGCG ATTAAAAGTA CAAGCGCTAT GGGTATCCAA 420

AACAGCACAG CACTTAACCT CATCACTTTT TTAGCGTGTG GTGACATAAA ATTATAATCC 480

CTCATCATTT TCACCTCTTA AATACCATGA TTTCAATTTA TTTGCATCTT CACTTCTTGC 540

GTATTTTAAG TTAATCGATT GGGCTCCCAC ACCTTTAGCA ACAATAAAGC TAAAATTATT 600

TAAATTGTTG CGTTTAAGTA ATGTATGTTG CCAAGTGTCA AATCCTATAA TGTGATGCGC 660

TTTAAAATAA TAAATATTTC GTTTCAATAG CTCGAAATTC TGGATAGTAA TTTGTTCTTC 720

TGTCATTTTA AAACCCGCAT GTTTGACATA AAGATATCCT TTGATCACAA ATAAACCAAT 780

AATGACTATT GTTATAATCG TAAATAACAA TAATAATTGA TTCCAAAAAT AACAGCCTAT 840

ACCTGCCATA GCTATGACAA TAATACTAGG TATTAAAAAG TGTCTGTGGA AACCTGACAA 900

AGGCATACCT TCATTAACTT GTTGATAAGA TAAATCTGGT ACTAAATTCT GGATAATTTG 960

ATATGCTTTG TCTCGTTTAA TAAACGGCaA TATCGGCACA CTACCTGAAT CATTGTC 1017

(2) INFORMATION FOR SEQ ID NO: 472:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6806 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 472:

TAGGATTGAT TAATCCTTTC TTCAAAATGA TGAATGTGTA ATGTTAAATA TATAATTTCA 60 

GACTCACTGA TATTAACATC AAATTGTTTT TGTATCATAG TTAAAATTTT ATATGCTGTG 120 

TTATAGCAAA TCGGATAGTG ATTTTTAATC ATAGACACAA AATCATCTTG TGCATGTATA 180

TATTCTTTTC TTCTTAATCG GCGAATTAAA AATTGTACGT GCCTTATAAA ACGTTGGTAT 240 
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TTAATAACAT TATTnGATCA AGGTCATCTC ATGCATAGAT AAATCTTCTG TATTaGATGC 360

AATATGTAAT GCAATAAATC CTATCTCATC TTCAGGaAAA TGTaCATCTA ATGCTGCATT 420

TAACTGATCA ATCACCTGTT TAGCAATATG ATATGCATCA CTATATAACT GCATAGTTTC 480

CATAACAAAT GGATTGCTAA TAACTTGATT TTGTTTTAAG CGTTTATAAG CAAATATAAT 540

ATGATCCGTT AATGAAACTA CAAGTTGTTT TGAATCAACA TTCATCGCAG TATTAGAAAT 600

AAAATTCAAC GAATCAATAA TTACTTGTAA TACATTATCA TCAGCGATTT CAACTAAACT 660

TTTATAATGT GCTTTTTGTT GCTCACTTTC TAATTTATAA ATTTTCTCTA TTGTAATAGT 720

TTGGTCGTTT AACGCCATTC CCTCTTTTTT GTTAAAACCA ATACCTTTAC CGATTAAAAT 780

AACTTCTTGA TCATTATTAG TACATACTAC GACATTGTTG TTCAATGTTT TAGTAACAAT 840

ATATTCTCCC ATTATCATCA CCTATTTTTC TATTATTAAG ATTATATATC GGAAATGTCT 900

AACTTGAAAG TATAATAATT TAAATACTTG TGTTCTATTA CACGCCTATC CTATATGATA 960

TATCTTAATT TAATTTCGAT GTCTCTCAAA GTGGAATAAC TATAATAAAA AAATCTGACT 1020

CCCATATTTA CAAATAATTC TATTTATTTA TATATTATCT GAATTAATAC TCAATACAAA 1080

CTAAAACGTA CTATTAAATT GTGCAAAGCT AAAACAAATT TATATTCATC TATCCAACAA 1140

TATGTCTTAT CAATGGTATA GTCTTTGCAC ACCAATGGAG GAAATAAATC TCAACCTTAC 1200

TATATTAATA TATAATCAAA TCTTAGATTA ACTAGTGTAA TGATACAGAT GATAATTGAG 1260

TACAAATTTA AAACCCTGAG ATTTTCGCTT TAATTTGAAA ACCTCAGGGT TTATTTGATT 1320

TTTATATAAT GAATCGTTAC ATTAAAAATA TTTATTTATC AGAGTTCTTA TATTTGTTAG 1380

CGCCCCAAGC ACTAATTCCA AATAAGTTAA TTTCTAAGTT TTCAGGTTTA AAGACAGGGT 1440

TCTTGCCTTC TTTTTTCTGC TTTTGATAAT CTTTCATCAA TGCAAAAGCT ACATTGGACA 1500

GTCCTATAAT GGAAATAATG TTTACAATTG CCATTAAGCC CATAAATAAG TCTGCCGTAT 1560

TCCATACTGT TTCTGTTTTT ACAACTGCAC CGACAAAGAC AAGTACTACA ACAAGACATC 1620

TAAAGATAAA TAATATTACA CGGTTTGTTG ATAAAAATTC AATATTAGAT TGACCGTAAT 1680

AGTAATTACC TACAACAGAT GAAAATGCAA ACAGTGTAAC tGCTaTTGTT AAGAAAATAC 1740

CTCCAGCAGA ACCTAAATGC TCATTAAGTG CTGATTGAGT AACTGCAACA CCTTGAGGTG 1800

CGTTATCACC AAATTTCAGT CCTGAATATA GTAAAATCAT GATTGCAGTT GCTGTACAAA 1860

CCAACATTGT ATCAAAGAAC ACACCTAATG ATTGGATTAA ACCTTGCTTA ACAGGGTGTG 1920

GTACGGCAGC AGTTGCCGCT GCATTCGGCG CAGAACCCAT ACCAGCTTCG TTAGAGAATA 1980

AACCACGTTT GATACCTTGA AGAACCGCAG CACCTACAGC GCCACCAGTT ACTTGTTCGA 2040
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GCAATATTAC TAAAACCATA CCAATGTAAA TGATAGCCAT AATCGGTACA ATTAACGAAG 2160

ATAACGTAGC AATACTACGT ACACCACCAA ATATAATAAT AGCTGTTACG ATTGCTAAAA 2220

TAATACCTGT GATTACTGGA CTAATATTAT ATTGCGTATT TAACGACTCC GCAATTGTAT 2280

TAGATTGCAC TGTGTTAAAT ACAAATGCAA ATGTAATTGT AATTAAAATC GCAAATACGA 2340

TACCTAGCCA TTTTTGATTT AAACCTTTAG TAATATAGTA AGCTGGACCA CCACGGAATC 2400

CACCATCTTT ATCATGTACT TTATAAACCT GAGCCAAAGT CGCTTCTATA AATGCACTCG 2460

CTGCACCTAT AAATGCAATA ACCCACATCC AAAATACTGC ACCTGGACCG CCTAAAACAA 2520

TCGCAGTCGC AACACCAGCA ATATTACCAG TACCAACTCT CGAACCAGCA CTAATCGCAA 2580

ATGCTTGGAA TGGCGAAATA CCCTTCTTAC CATCTTCTAA AGTTTCTGGA CGTTCTACTA 2640

AAGCTCTAAA CATTTCAGGT AACATTCGTA ATTGAACGAA TTTAGAACTA ATCGTAAAGA 2700

AGAATCCAGC TGTCAATAAT AGACCAATTA AATATTGAGA CCATATTAAA TCGGTACCAA 2760

CATGGACAAA TTCTTTAAAC CATCCAGGTA TTAAACTATC GAAATCTTTC AAAATAAACC 2820

CCTCGCATCC TCTACATGAA TCATGTACCT TCTATAAAAT TAGACCGAAT TGAACCTTCA 2880

GTAAATATAG AGATACATCA TCATTTCTTA TACAATACAA GAGATTTATA TTAGTTTGGT 2940

CAAAGTATAT CGCTAATTTA ACGATAAGTA CTTGGTCAGC ATTTAATATA AATCCCTTGA 3000

ATTTAGTCAA AATTTAACAT TACTGTATTT TATCATTTAA TTTCGTGATT GCATATAGTT 3060

TTTAGCTAAT ATACATGTCT ATTACTTCAC CAAAATCATC TGTATCTACA ATGAATGAGC 3120

CATTTGTATA TTGTTCAGAT TTATGAATAT CATTAATTAA ACCATGTTCT TCATTTGATT 3180

TTGAATATAA TGTATATTGA CTATGTTCAC CTGTCACTAC ATGTGCAGCT ACAATACGAT 3240

GTGGATTTTT CTTTAATTCT TTTAATAAAG TTATTCCaCG TTGTGCTCTT TTAGCAACTT 3300

GTAAGATTTT AAAACTAATA CGTTTTAACG AGCCGCGTTG TGTGGCCATC AATATAGTAT 3360

CATTTTCAGA AACACCTTCT GTCATAACAA CGAAATCTTC AGCTTTAAGA TTTATTGATT 3420

TAACACCAGC TGCCCTTAAT CCGGTATCTG ATAGTTCACT TGTATTATAC GITAATGACA 3480

TACCTTTATT AGTAATGACG GTAATTAATT GATCTTTTTC AAAGCGCATA ACACTAATCA 3540

AATCATCATT TTCTTTAACT TTAGTAGCAA TTAAAGGTTT ATTAAAACGC GTTGTTTTAA 3600

ATAGAGGCAC TGTACTTTTC TTAATCATGC CATTTTGAGT CGCAAAAACA TAAAATGCAT 3660

CTGTATTAAA GTCCTTTTCA TTAAAGACAT TAATAACCAC TTCATCTTCT TCGATAGGAA 3720

CTATTTGTGA TACATGTTGT CCCAATTCTT TCCAACGAAT ATCTGCTAAT TTATGAACCG 3780

GTATAAATAG ATAACGACCT TTATTTGTAA ATACTAGTAC GGTATCTTGC GTATTTACTT 3840
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TAAAGCTACG AATAGAAGTA CGTTTAATAT ATCCATGACG TGTCATACTT AAAATAACTy 3960

CTTCACTAGG CACCATAACT TCTTTGTCAA TTTTAATTTC TTCAATTTCT GCTTCAATTA 4020

AAGACAGTCG TTCAGATTTG AATTTCTTTT TAATTTCATT CAATTCTTCT TTTATGACAT 4080

TCAATAATGC ATCATGGTTA TCAAGAATAT GACGTAATTG TTTGATTAAT GCTTCAAGTT 4140

CTTTATGTTC ACCTTCAAGC GCAACTATGT CAGTATTTGT TAAACGATAT AACTGTAACA 4200

TTACAATTGC TTCAGCCTGT TCTTCTGTGA ACTCGTATAC TTCGATAAGG TTTTCTTTAG 4260

CGTCACGCTT GTTTTTAGAG CwACGAATCA ATTCGATTAC TTTATCTAAA ATTGACAACG 4320

CTTTAATCAA ACCTTCAACG ATATGCATAC GTTTTTCTGC ATTATCTAAT TCAAACTTCG 4380

TTCTATTTGC AACAACCTCA ATTTGGTGAT TCAAATAACT ATCTATAATT TGACGAATAC 4440

CCATCAATTT TGGACGACCA TCACTAATAG CGACCATGTT GAAATTATAT GAAATCTGTA 4500

AATCAGAGTT TTTATAAAGA TAATTTTTGA TTGATTCACT GTTCACATCT TTTTTCAATT 4560

CAATTGCTAT TCGTAAACCA GTTCTATCAG TTTCATCACG TACTTCAACG ATACCATCGA 4620

CTTTTTTGTC AGCACGTAAT TCATCGATAC GTTTTACTAA GCTACTTTTG TTCACTTCAT 4680

ATGGAATTTC AGTAATAATT AACTGTTTAC GTCCATTGCG TAAAGTTTCT TCTTCAACTT 4740

TAGAACGAAC TATAATTCTA CCTTTACCTG ATTCATAAGC TTTTTTAATA CCATCAATAC 4800

CTTGAATAAT ACCACCAGTT GGAAAATCAG GACCTTTAAT ATATTTCATT AATTGATTGA 4860

CTGTAATATC CGGATTATCA ATATATTTAA GTGTTGCTTG AATCACTTCA GCTAAATTAT 4920

GTGGTGGTAT ATCTGTCGCG TAACCTGCAG ATATACCTGT AGAACCATTC ACTAGTAAGT 4980

TAGGAAATCT TGATGGCAAT ACCATTGGTT CGAGTGTCGT ATCATCATAG TTTGGAATGA 5040

AAGAAACTGT CTCTTTATTA ATATCACGTA ATAACTCTTC AGCTAGTAAG CTTAACTTAG 5100

CTTCAGTGTA ACGCATTGCC GCTGGCGGAT CATTATCGAT ACTACCATTA TTACCATGCA 5160

TTTCTATTAA GACATGTCGT AACTTCCAGT CTTGACTTAA ACGGACCATT GCTTCGTACA 5220

CTGAGGAGTC TCCATGTGGA TGATATTGAC CAATAACATC ACCGACTGTT TTCGCACTTT 5280

TACGGAAATT TTTATCGTGT GTATTACCAC TTGAATACAT TGCATATAAA ATACGACGTT 5340

GTACTGGTTT TAAACCATCA CGAACATCTG GCAATGCACG CTCTTGAATA ATATATTTAC 5400

TATATCTTCC AAAGCGATCA CCTAAAACAT CTTCAAGTGA TAAATCTTGA ATTATTTCAC 5460

TCACTAGATT TCCTCCTCAT CAAATTGATC ATTTTCAAGC ACTTGTACTT CAGAATTATC 5520

TAAAATACTT TGGTCCTCTT GCATACCAAA CTCAACATGC TTTTCAATCC ATTCACGTCT 5580

AGGTTGTACT TTGTCACCCA TTAATGTTGT TACACGTTTA GATGAACGCA CTTCATCTTC 5640
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AGGGTTCATT TCACCCAAAC CTTTGTAACG TTGTAACGTG AaGCCTTTAC CAAGTTCTTT 5760

tTGCAATTTA TTAAGCTCTT CGTCTGTCCA AGCGTATTCA ACTCGCTTTG TTTTGCCTTT 5820

ACCTTTTTCC AATTTATAAA GTGGAGGTAA AGCAATAAAT ACACGACCTG CTTGAACAAG 5880

CGGTTTCATA TATTTGAAGA AGAATGTTAA CAATAGCACT TGAATATGCG CACCATCAGT 5940

ATCAGCATCA GTCATAATAA TTACACGATT ATAATTACTA TCTTCAATTT TAAAGTCAGT 6000

ACCAACGCCT GCCCCGATTG TGTGGATAAT TGTATTAATT TCTTCATTTT TAAAAATATC 6060

TTCTAGACGT GCTTTCTCTG TATTAATTAC CTTACCACGT AATGGTAATA TCGCTTGGAA 6120

TTTGCGGTCT CGTcCAAGTT TTGCTGAACC TCCCGCAGAA TCACCTTCGA CTAAATACAA 6180

TTCATTTTTT TCAGTGTTTT TACTTTGTGC AGGTGTTAAT TTACCAGATA GCAAAGTGTC 6240

TTTACGCTTG TTTTTCTTAC CTGAACGAGC ATCTTCACGA GCTTTACGTG CAGCTTCCCT 6300

TGCTTGTTGT GCTTTAATCG CTTTTTTCAC AAGTGATTTA GACAATTGTC CTTTTTCTTC 6360

TAAATAGAAT GGCAATTTGT CTGCAACAAC TGAATCAACA GCACTTCTAG CTTCAGAAGT 6420

ACCCAATTTA GATTTCGTTT GTCCTTCAAA TTGCAATAAT TCTTCTGGAA TACGAACAGA 6480

CACAACAGCT GTTAAACCTT CACGAATATC ATTACCATCT AAGTTTTTAT 6540

AAGTTCATTA ATACGACGTG CATAATCATT AAATACACGT GTCATTGCTG TTTTAAAACC 6600

AACTTCATGT GTACCACCAT CTTTAGTACG TACATTATTT ACAAAACTTA AAATACTTTC 6660

TGAATATTGA TCATTATATT GGAAAGCTAC GTCTACCTCT ATACCATTTG CTTCACCTGA 6720

AAATGTAGCC ACGTCATGCA AAACTTCTTT TCCTTCATTG ACATAACTAA CAAACTCTTT 6780

GATTCCTTCT TATAATGGTA TGTCTT 6806



(2) INFORMATION FOR SEQ ID NO: 473: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1716 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 473: 

GGGGCAnAAA TTCCAATACA CTCATTACCA AATATATACA CCACTTCCTG CTACAAGTrui 60 

TTTTACTTGA TCTTGGTCTT TTCCGCAGAA AGAGCATTTC CAAATTTTCT TCATCTTCCA 120 

TTGAATTTAA ACATTCTTTT TACACCCCTA TTCGTTAAAG ACTATACTAG ATTGGATGTT 180 

ACAATGCAAC ATATTAACAT ACAAACTTTT TGCTTAAAGA ATAGTAGCAG ATACATAAGC 240 
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CCtTCAACGA ACTTTGCGTT ATCTCTTAAT AAATCGATAA CTTTTTGGAT ACGAACATCA 360

TTTTTAATGA TATCAGTATT ACCTAAAGTA TTTTTGATAT CTTCAACTGA GATATTAAAT 420

TGTTTACTCA TTTTTTCTAA TTCTTTATCG ATATCTTCAT CAGTAGCTTC GATTTTTTCA 480

GCTTCAGCGA TCGCAGTTAA AGTTAAGTTA GTTTTAACAC GTTGTTCTGC ATCGTCTTTC 540

ATTTGCTCTC TTAATTGAGT TTCATCTTGA CCTGAGATTT GGAAGTACGT TTGTAAATCT 600

AAACCTTGTT GTTGAATTCT TTGTGCAAAT TCAGACACCA TACGATCTAA TTCAGTATTA 660

ACCATTGCTT CAGGAATATC GATTGTTGTA TTATCAGTAG CTTTTGTAAT CGCTTCTTCT 720

TTTTCAACAT TTTCAGCATC TGTAGCTTTT TGTTCAGCTA AACGTTTACG TAAGTTTTCT 780

TTGTACTCGT CTACTGTATT TGCTTCTGCA TCTAATTCAT TAGCAATTTC ATCTGTTAAT 840

TCTGGGACTT CTTTAAATTT AATTTCGTTA ACTTTTGTTT TGAAAGTTGC TTCTTTACCG 900

GCTAATTCTT CAGCATGGTA TTCTTCTGGG AATGTTACGA CAACATCTTT TTCTTCGTCA 960

ACTTTCATAC CTTCTAATTG CTCTTCGAAA CCAGGTATGA ATGAACCTGA ACCGATTTCT 1020

AAATCGTAAC CTTCAGCTTG TCCACCTTCG AATTCTTCtC CGTCAACTGA ACCACTAAAG 1080

TCGATGTTAA CTGTGTCGCC ATTTTCAACA ACACCATCTT CTTTAACGAC CATTTCAGCT 1140

AAATGTCCTA AGCTGTGGTC AATCGCTTCT TGTAACTCAT CATCAGATAA TTCAGTTTCT 1200

TGTTTTTCAA TTTCAAGACC TTTATAGTCT CCTAATTTAA CTTCTGGCTC AACTGTAACT 1260

GTTGCTTCAA AAATGAAATC TTTACCTTTT TCAATTTGAG TAACACTTAC TTCtGGTTGT 1320

GCAACTGGTT TAATATCAGT TTCGTCAATT GCTTCACCAT AAGCATCTGG TAATAAAATG 1380

TCGATAGCAT CTTGATATAA TGCTTCTACA CCAAAGCGTT GTTCAAAAAT TGGACGTGGC 1440

ACTTTACCTT TACGGAATCC AGGTACGTTA ATTTGTTTAA CCACTTTTTT GAATGCTTGA 1500

TCTAACGCTT TGTTTACTTT TTCTGCAGGA ACAGTAACAG TTAATAAACC TTCGTTACCT 1560

TCCTTTTTTT CCCAAGTTGC TGTCATGTAT ATATACCTCC ATGATTAACT AATTTATTTT 1620

TTCAACTTCC CTATTATATC ATACGTCTAT TCCCTATACA AACATTGAAA TCACAACGTT 1680

TATATATTTG TAAATCAACT TTTTTCGTCA AAACTA 1716
(2) INFORMATION FOR SEQ ID NO: 474:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 795 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear
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TGACCAAGTA CGTTTCGAAG TTGCCATTAA AGCATTAAAC CCATCATTGA AAGCATTCGC 60

ACCTGTACGT GAGTGGGCAT GGAGTCGTGA AGAAGAAATC GATTATGCAA TTAAACATAA 120

TATCCCTGTA TCAATCAACC ATGATTCACC TTATTCTATC GATCAAAATC TATGGGGCAG 180

AGCGAATGAA TGTGGTATTT TAGAAGATCC TTATGCTGCG CCACCAGAGG ATGCGTTCGA 240

TCTAACAAAT GCTTTAGAAG AAACACCAGA TACTGCTGAT GrAATCATTT TAACGTTTGA 300

TAAAGGCATC CCAGTTCAAA TTGATGGCAA AACATATGAA TTAGACGATT TAATTTTAAC 360

GTTGAATGCA TTAGCTGGTA AGCATGGTAT CGGAAGAATT GACCATGTAG AAAATAGACT 420

TGTAGGTATC AAATCAAGAG AAATTTATGA GGCACCTGCT GCAGAAGTTA TTTTAAAAGC 480

GCATAAAGCA TTAGAAACGA TTACGTTAAC GAAAGATGTC GCACACTTTA AACCAATCAT 540

TGAGAAGCAA TTTGCTGAAC AACTATACAA TGGACTTTGG TTCTCACCTT TAACTGATAG 600

CTTGAAATTA TTTATTGATA GTACTCAGCA ATACGTAAGT GGTGATGTCA GAATTAAATT 660

ATTCAAAGGT AATGCCATCG TGAATGGTAG AAAATCACCT TACACATTAT ATGATGAAAA 720

ATTAGCAACT TATACAAAAG AAGATGCATT TnATCAAGAC GCTGCTGTTG GCTTTATCGA 780

TATCTATGGT TTACC 795

(2) INFORMATION FOR SEQ ID NO: 475:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 887 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 475:

CGATTGAAAG AAGACGGTTC AGTTGAAAAG TTTCCAAAGC CAGTAATTAG CCAACAACCA 60

GAAGGATATA CGAGTCATTT TAGAGATCCT AAAGTTTTTA AATATGATGr GaAATATTAT 120

GCAATCaTTG GTGCmCaAAA TAATGATCaG CaAGGTCGAT TATTACTTTA TAATACTGAA 180

GATATAATTA ATTGGCATTA TTTAGGTGAA ATAAATACAG AGTTGGATGA TTTTGGATAT 240

ATGTGGGAAT GCCCAGATTA CTTTAATGTA GATAATCAAG ATGTCATACT TATTTGTCCA 300

CAAGGTATTG AACCAAAAGG CGATCAGTTC AAAAATATTT ATCAAAGTGG TTATATACTT 360

GGAAAGTTTG ATATTGAAAA GTTAACATAT GAACATGAAA ATTTTGTCGA GCTTGATAAT 420

GGTTTTGATT TCTATGCACC TCAAACATTT TTAGATGAAA AAGGCCGACG AGTACTAATT 480

GGATGGATGG GGTTACCGGA AATCGAATAT CCTACTGATA ATGAAGGATG GGCCCATTGC 540
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GCGTTGGAAA AATTACGTCA CAATAAAGAG ACAGCATTtA GGctACGCAA ATAAATTTAC 660

TCGAAAATTA CATCCGTATG AAGGTAAACA GTATGAATTA ATCATAGATA TTTTGGATAA 720

TGATGCTACC GAAGTGTACT TTGAATTACG TACATCTAAG ACTTCTTCAA CATTAATTGC 780

TTATAACAAG CGTGAAAATA AAATAACATT AGATCGCAGC GACAGTGGTT TATTGCCGAC 840

AAATGTTGAA gGTACGACGC GTAGTACGAT ATTAGACACG CCATTAA 887



(2) INFORMATION FOR SEQ ID NO: 476:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1183 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 476:

TTGGAAnCAA AAAACCATTG GTAAACCGTG TTnAACCGGA TTTCCGATGG ACCTTTTAAA 60

ACnACCAAAT AGAAAGCTTT GATAAAAGGT AATTATGGTA CTGATAACAA ACAAGTTCAA 120

AAACATCATG ATTTAGTACG TATGCTTTTG ATGGATCAAG ATGGTTTTTT AACTGAAAAT 180

AATAAAGTTG ATCATTTCAT TGATGGAAAT GATTTATATG ATCAAGTTTT AAAAGATATT 240

AAAAATGCAA AAGAtATATC CATTTAGAGT ACwATACTTT CGCTTwAGAT GGTTwAGGTA 300

AAAGAATTTT ACATGCTTTA GAAGAAAAAT TGAAACAAGG TCTAGAAGTA AAAATATTAT 360

ATGATGATGT TGGATCTAAA AATGTTAAGA TGGCAAATTT TGATCATTTT AAATCGTTAG 420

GTGGAGAAGT TGAAGCATTT TTTGCTTCAA AATTACCGTT ATTGAATTTC AGAATGAATA 480

ATAGAAATCA TAGAAAAATC ATCGTAATCG ATGGTCAACT AGGTTATGTC GGAGGATTTA 540

ACATTGGTGA TGAATATCTm GGATTAGGAA AATTAGGATA TTGGAGAGAT ACGCATTTAC 600

GTATACAAGG GGATGCGGTT GATGCACTGC AGTTGCGATT TATTTTAGAC TGGAATTCGC 660

AAGCGCACCG TCCACAATTT GAATATGATG TTAAGTATTT CCCTAAAAAG AACGGACCAT 720

TGGGCAATTC ACCAATTCAA ATAGCTGCAA GTGGCCCGGC TAGTGACTGG CATCAAATTG 780

AATACGGTTA TACAAAAATG ATTATGAGTG CAAAGAAATC TGTATATTTA CAATCACCAT 840

ATTTCATTCC GGATAATTCA TATATaAATG CCATTAAAAT TGCTGCTAAA TCAGGTGTAG 900

ATGTACATTT AATGATTCCA TGTAAGCCAG ATCATCCATT AGTATATTGG GCGACATTTT 960

CAAATGCCTC TGACTTATTA TCAAGTGGTG TTAAAATTTA TACGTATGAA AATGGATTTA 1020

TACATTCTAA AATGTGCTTA ATTGATGATG AAATCGTATC AGTGGGCACA GCAAATATGG 1080
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CTAAAGATTT AAGGGTGGCT TATGAACATG ATATTACAAA ATC 1183 
(2) INFORMATION FOR SEQ ID NO: 477: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2332 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS : double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 477:

GGTTATATGT TTGGTATTAC TCATTATATT GATTGGAGGT TGTGTCATTA TGACAAAAAC 60

AAATGGTCGA AACGCTCAAA TTAAAGAAAA TTTCAACAAA ACATTAAGTG TATATCTAAC 120

CAAGAATCTC GATGATTTTT ACGATAAGGA AGGTTTTCGA GATCAAGAAT TTGATAAAAG 180

AGATAAAGGG ACTTGGATTA TTTATTCTGA AATGGTTATC GAACCAAAAG GGAAnAATAT 240

GGAATCGAGA GGAATGGTGC TCTATATCAa TCGCAATACT AGAACmACGA AGGGTAATTT 300

TATTGTCACC GAAATAACTG AAGATAGTAA AGGATATTCA CGTAGTAAAG AAAAAAAATA 360

TCCTGTCAAG ATGGAAAATA ATCGAATTAT TCCAACAAAG CCTATACCGG ATGACAAGTT 420

AAAAAAAGAG ATTGAAAACT TTAAGTTCTT TGTACAATAT GGAAATTTTA AAGATTTTAA 480

AGATTATAAA AATGGTGATA TTTCATATAA TCCTAATGTG CCAAGTTATT CTGCAAAGTA 540

TCAATTGAAT AATGATGACT ATAATGTTCA ACAGTTAAGA AAACGATATC ATATTCCAAC 600

CAAACAAGCG CCCGAATTAA AATTGAAAGG ATCCGGCAAT TTAAAAGGCT CATCCGTAGG 660

ATCTAAGGAT CTAGAATTTA CGTTTGTAGA AAATCAAGAA GAGAATATCT ATTTTTCAGA 720

TTCGGTCGAA TTTACACCTA GCGAGGATGA TAAATCATGA GTCAAACGGA ATATCAAATA 780

AAATCTGGCA ATATAAAAGG TAACTCTGAA GAAACAAGTA CAGTATCTAA TATAAGTTAT 840

GAAATAGAAA ACGCAAATAA CAGTGGTTTA AAACAAAATA AAATTGATAA ACAAATTAAA 900

AAGTTACAAG AAAAAAATAA ATTCCCTAAA AATCTTTCAT ATCTTAAAAG TTATACGGAC 960

CCCAAAACAG GCACGACTAC AAGCGCCTTT TTAAATAAAG ACACTGGCAA AGTTACTTTA 1020

GGTATGACAG GTACTAATGT ACACAAAGAC GCAATATTAA AACAAACATT TGGTGTTCCT 1080

TCTTATCAAG GATATATAGA CGTGAGTGAA ACgCTaAAAG ATATTGGGGC CGATGTCAAT 1140

ATTGGCCTTC ATTCCGTCAC AGATAAAGAT CCACATTATA AAAATACCCA AGACTTTATC 1200

AAAAATATCA AAAAAGACTA TGATATTGAT ATTATTACCG GACATTCGCT GGGCGGTAGA 1260

GATGCGATGA TTTTAGGTAT GAGTAATGAT ATTAAACATA TCGTTGTGTA TAATCCAGCT 1320
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ATTGAAAAGT ACGATGGTCA CATTGTAAGA TTTGTGTCTG ATGAAGACGA ATTAGATGCA 1440

GGTGTCCGCA ATCATTTATA TGAAACTGCT GGAGAAAAAA TAGTACTTAA AAATGGAGAA 1500

GGCCATGCAA TGAGTGGTAT TTTAATGAGC AGAACACAGG CTATAATCTT AGCTGAATTA 1560

AACAAAGTTA AAGGCTACCA AGACGAAAAT AATAAAGCAT TAAAATCCGT TCGTAAACAA 1620

ACGAGGCATA GATTACATAA AGTAGAGACG TTAAGAGCGA ATTGGATTCA AACAACGGGT 1680

GGATCACTCT CTTCCTCCCw ACAACAATTA TTAGAAGCTT TAACAGCACT AACCATTGCC 1740

GAAGGCTTAA ATCAATTAGT GAATGAAGAA AGCCAACATT TGAAAAAATG TATCACGCGA 1800

TGGCACATAA ATTTGGAGAC AACTGGAAAA AAGCGCAAGA AGTTGGAAAT GAAATTGGTG 1860

AAAAATTAAC CTCTGAAGAG GTTATAGATG rATTAAGAAA AGGTGGCGCG TATGAAAGTa 1920

AACTTGAAAC AGATCCCAAA AGAAAAATTG ATGATAAGAT AAAGAAATTA AATGATGTTT 1980

ATAAAAATTG TAATGGCTAT ATCGCAAAAA TTAAACAGAG TATCGAAGCA ATTGTTTCTA 2040

ATGACCAAAT GTTAGCGAGC CAGATTGATG GGATGATGTA ATGTTTACTA CGTATAAnAA 2100

TATTAATGAA CTTGAAAATG CCTATGATGA AGAAAGAAAA CAATTGAATG ATGCATTCAA 2160

TCAAATTGAT GAATTAAGAC ATCAAACACG CAAGAltiATGT GAACAAATGT ATGATCATTT 2220

CTTATATCTC AAACATAAAA TGAATTmyms TGAAGACGCT ATGATCAGGA TGACACGTAT 2280

TATAGAATCT TTCGATAGAG AAACGAATCA ACGTATCCGA CATCACGAAA TG 2332



(2) INFORMATION FOR SEQ ID NO: 478:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 478:

TTTACTACCC AGTATCTCTT TTTATAAATT ATATAGCCAC CACATATGGT GGAAAGTCTT 60 
TTTAATTAGA ATTTTGTTTT TTCAGTTAAG AAAGCTTCTA ACTCTGAGAT TGGCATACGA 120 

ACTTGTTCCA TTGAATCTCT GTCACGTACT GTAACTTGAT TATCTTCTAA TGAATCAAAG 180 

TCGAATGTtA CACAATAAGG TGTaCCGATT TCATCTTGAC GACGGTATCT TTTACCGATA 240 

GATTGTGATT CATCGAAATC GATTGAGAAT TTAGAACTTA ATTGCTCAAA AATCTTAATC 300 

GtTCGCCAGA TAATTTCTTA CTTAAAGGTA AAATCGCTGC TTTATATGGT GCTAATGCAG 360

GATGGAAGTG TAAAACTGTA CGTGCATCTT TACTACCTTC AACGCCTTCT TCATCATATG 420 


GAATATATTT TTCGTTCGTT TCTGGATCAT GGTATCTGAA ATCTTCACCA GAGTGTTCAG 540

CATGTTTACG TAAGTCGAAG TCTGTACGAC TTGCGATACC CCATAACTCA CCCCAACCAA 600

ATGGGAATTT ATATTCAATA TCAGTTGTTG CATTTGAGTA ATGAGATAAT TCATCTTCAT 660

CATGATCACG TAAACGCATA TTTTCACTGC TCATATTTAA GCTTGTTAAC CAGTCACTTG 720
CAAAAGTTTT CCAATAATTT TGCCATTCGA TTTCTTCTCC AGGTTTACAG AAGAATTCAA 780

GTTCCATTTG TTcAAATTCT CTTGTTCTGa AAATGaAGTT ACCTGGAGTG aTTTCaTTAC 840

GGaATGaTTT ACCAATTTGG ACCGG 865
(2) INFORMATION FOR SEQ ID NO: 479: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1444 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 479:


TAATGAGTAA ATAAGTACCA GATAAAATCA TGAATATCAT CCAACATGCG GTTAACTCTA 60

CTAAATAATT AATAATAGTA TTTTCAGTAA ACAAAGAACT ATGTATACTT CGCATCACAT 120

TAGAATACGT ATGTTTCGCA TTTTGATCTG CAACAAATTG ATTGTTATGA TCTAGAAAGA 180

CGTAACGTTG ATTTCCTGCC ATATCACTCA GTGTAATTCG TTTGTTATAT GGTTCATCAA 240

GTATGCTAAC TTTACTTACA AAGAATCCTT CATATTGTTG TTCAACTTGA TGTACTGCAT 300

CATTTAATGT TTGATGCGTT TTTACATCAC TGTCACCAAA AAACTCATTC TTATAAATAT 360

TATTTTCAAC TTCTGGAAAG AACAAGTAAC CAATGCCCGA AATGGTTAAA GTGATTAACA 420

GTGGAGCAAT AAATATTGCT GCATAGAAAT GTAATCTTTG TAATGGATTA AATGTATTTT 480

TCATATTTCC CTCCCAATTG GCTATTATAC GGTGTCAATT CTGTGATGTG TGTGAACAAA 540

CTGTGACAAT ATTTATTTTC TAGAAAAATT TAACGATGAT TTGTGATTTT TAGAAAAATG 600

AACTTTTAAG TTGGAATGTT TGAAGAAAAT TGATTATTCG TATGTTTTAT CAAGCAGCTA 660

TGATAAAATT TAAACATAAT ACAATGCGAG CCATTTAACG ATCTATGTTT AAATGGACAT 720

CGATATTGTA TGAATTCGTT GTAACAAGCA AGCATTTCTA TGTGAACGAA CCAAAGGGGA 780

AAGTAACATG ATTAATAAAG AACAATTAGA TCTTTTATAT AAATTAAAAA AAGAAGTTGA 840

AAAGTCGCGA AATGAAGCAC TTTTACATAC aattaaccaa GTAATTAAGA AAGTATATTT 900

GCAGCAATAT ACATGTTCGT TCGTTGGACA TTTTTCTGCA GGTAAATCGA CACTGATAAA 960
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TATTGTGTCA 


GTTTCAGACA ATCACGATAT TATTGCTAAT TTGCCGAATC AAACGTATGC 1080

CAAATTATCT AATTATGATG AAGTAAGGGA AATGAATCGC CAAAATGTCG ACGTTGAATC 1140

TGTAGAAATT AATTTTCAAT CAGCTAAATT TGAAAATGGG TTTACGTTGC AAGATACACC 1200

AGGTGTTGAT TCAAATGTTG CATCACATCA GTCAATAACA GAACAATATA TGTATACAAG 1260

TAATATGATA TTTTATACGG TTGACTATAA CCACGTTCAA TCTGAACTTA ACTTTAAGTT 1320

TATGAAGCAT ATAAATGATG TTGGaATACC TGTTGTGTTT ATCATTAATC AAATTGACAG 1380

CATCCAAGAC GATGGAATTG TCATTCTCTA CGTnTTAAAT CTCGAGTTGG AAAAATCAAT 1440

TGGC 1444


(2) INFORMATION FOR SEQ ID NO: 480:
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 6309 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 480:








GCAGCAGCTT ATCGTGAGTT ATCATTATTA TTACGTAGAC CTCCAGGTCG TGAAkCATAC 60

CCAGGTGACG TATTCTACTT ACATAGTAGA tTATTAGAAA GAGCAGCAAA ATTAAACGAT 120

GACTTAGGTG GCGGTTCAAT TACTGCATTA CCAATTATCG AAACACAAGC TGGTGATATT 180

TCAGCTTATG TACCAACAAA CGTTATTTCA ATTACAGATG GACAAATCTT CTTACAATCT 240

GATTTATTCT TCTCAGGTGT AAGACCAGCG ATTAATGCCG GACAATCTGT ATCTCGTGTT 300

GGTGGATCTG CACAAATTAA AGCAATGAAG AAAGTTGCTG GTACGTTACG TCTTGACTTA 360

GCGTCATACA GAGAACTTGA ATCATTTGCA CAATTCGGTT CAGACCTTGA TGAATTTACT 420

GCAAGTAAAT TAGAACGTGG TAAACGTACT GTTGAAGTCT TAAAACAAGA TCAAAACAAA 480

CCATTACCAG TCGAACACCA AGTGTTGATT ATTTATGCAT TAACAAAAGG ATATTTAGAT 540

GATATTCCAG TTGTAGATAT CACACGTTTT GAAGACGAGT TAAACCACTG GGCAGAATCA 600

AATGCTACTG AACTGTTAAA TGAAATCAGA GAAACTGGTG GCTTACCAGA TGCTGAGAAG 660

TTTGACACAG CAATTAACGA ATTCAAAAAA AGCTTTAGCA AATCTGAATA ATAAACAAGT 720

TTAGTATAAG GTGGTGAGAT AGTGGCTTCT CTTAAAGAAA TAGATACTCG AATAAAATCA 780

ACCAAAAAAA TGAAGCAGAT TACGAAAGCG ATGAACATGG TATCAAGTTC AAAACTTCGT 840

AGAGCTGAAA AAAATACAAA ACAATTCACA CCATATATGG ATAAAATGCA AGATGCAATT 900
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ACTAGAAGTG GATATTTAGT TATCACGAGT GATAAAGGTT TAGCAGGTGC ATATAGTGCA 1020 

AACGTGCTTA AAAAATTGAT TACTGATATT GAAGCGAAAC ATCAAGATAG TAGCGAATAC 1080

AGTATTGTAG TTTTAGGGCA ACAAGGTGTT GATTTCCTAA AAAATAGAGG TTATGACATT 1140

GAGTATTCTC AAGTAGACGT ACCTGATCAA CCTTCTTTCA AATCTGTTCA AGCACTAGCT 1200

AACCATGCTA TAGACTTATA CAGTGAAGAA GAAATTGATG AATTAAATAT ATACTATAGT 1260

CATTATGTCA GCGTTCTTGA AAACAAGCCT ACATCTAGAC AAGTATTACC ATTATCTCAA 1320

GAGGATTCTA GTAAGGGGCA TGGTCATTTG TCTTCTTATG AATTTGAGCC AGATAAAGAA 1380

TCTATCTTAA GTGTAATCTT GCCTCAATAT GTTGAGAGTT TGATTTACGG AACAATATTA 1440

GACGCAAAAG CAAGTGAGCA TGCAACACGT ATGACTGCGA TGAAAAATGC CACTGATAAT 1500

GCAACTGAAC TTATTGATGA CTTATCATTA GAATATAACA GAGCGAGACA AGCAGAAATT 1560

ACGCAACAAA TTACTGAAAT TGTTGGTGGT TCCGCAGCGC TTGAATAATA TTTAAAGGAG 1620

GAAAATAGCA TGGGAATTGG CCGTGTAACT CAAGTTATGG GTCCTGTAAT TGATGTTCGA 1680

TTTGAACATA ACGAAgTTCC TAAAATTAAT AACGCCTTGG TTATTGATGT GCCTAAAGAA 1740

GAAGGTACAA TACAACTAAC ATTAGAAGTT GCGCTGCAAT TAGGTGACGA CGTTGTTCGT 1800

ACAATTGCGA TGGATTCAAC TGATGGTGTC CAAAGAGGCA TGGATGTAAA AGATACAGGC 1860

AAAGAAATTA GTGTACCTGT TGGTGACGAA ACATTAGGTC GTGTATTTAA TGTACTAGGT 1920

GAAACAATTG ACCTTAAAGA AGAAATTAGT GATTCTGTTC GCCGCGATCC TATCCATCGT 1980

CAAGCACCAG CATTCGATGA ACTTTCAACA GAAGTTCAAA TTTTAGAAAC AGGTATTAAA 2040 

GTAGTAGATT TACTAGCACC TTATATTAAA GGTGGTAAAA TCGGATTGTT CGGTGGTGCC 2100 

GGTGTAGGTA AAACAGTATT AATCCAAGAA TTAATTAACA ACATCGCTCA AGAGCACGGT 2160

GGTATTTCTG TATTCGCCGG TGTAGGTGAA CGTACTCGTG AAGGTAACGA TTTATACTTC 2220

GAAATGAGTG ACAGTGGTGT AATTAAGAAA ACAGCCATGG TATTCGGGCA AATGAATGAG 2280

CCACCTGGTG CACGTATGCG TGTTGCATTA TCTGGTTTAA CAATGGCTGA ATATTTCCGT 2340

GACGAACAAG GTCAAGACGT ATTATTATTC ATCGATAACA TTTTCAGATT TACACAAGCT 2400

GGTTCTGAGG TATCTGCATT ATTAGGTCGT ATGCCTTCTG CAGTAGGTTA CCAACCAACA 2460

CTTGCTACTG AAATGGGACA ATTACAAGAA CGTATTACGT CTACAACAAA AGGATCAGTT 2520

ACTTCTATTC AAGCGGTATT CGTACCTGCC GATGACTATA CTGACCCAGC GCCTGCGACA 2580

GCGTTTGCCC ATTTAGATGC AACTACAAAC TTAGAACGTA AATTAACTGA AATGGGTATT 2640

TATCCAGCCG TGGATCCATT AGCGTCTACA TCAAGAGCAT TGGAACCATC AATTGTAGGT 2700 
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CAAGATATCA TTGCTATCTT AGGTATGGAC GAATTATCTG ATGAAGATAA ACAAACAGTT 2820 

GAACGCGCAC GTAGAATTCA ATTCTTCTTA TCTCAAAACT TCCACGTAGC GGAACAATTT 2880 

ACTGGTCAAA AAGGTTCTTA TGTACCTGTT AAGACAACAG TTGCAAACTT TAAAGATATC 2940

TTAGATGGTA AATATGACCA TATTCCAGAA GATGCATTCC GTTTAGTTGG TAGCATGGAT 3000 

GATGTTATTG CAAAAGCTAA AGATATGGGT GTTGAAGTAT AACAATTAGG AGGAATGGAT 3060 

AATGAATACA TTAAACCTAG ATATTGTCAC TCCTAATGGT TCTGTTTACA ATCGTGATAA 3120 

TGTTGAACTC GTTGTTATGC AAACAACAGC TGGTGAGATA GGTGTCATGA GTGGACATAT 3180 

TCCAACTGTA GCTGCTTTAA AAACAGGCTT TGTAAAAGTG AAATTTCACG ATGGAACTGA 3240 

ATATATTGCT GTAAGCGATG GCTTTGTTGA AGTTAGAAAA GATAAAGTTT CAATCATTGT 3300 

TCAGACTGCA GAAACTGCAA GAGAAATTGA TGTTGAAAGA GCTAAATTAG CCAAAGCAAG 3360 

AGCAGAGTCT CAGTTGGAAA ATGATGACGA CAATACTGAT ATTCATAGAG CCGAAAGAGC 3420 

TTTAGAGAGA GCAAATAACC GTTTGCGTGT GGCTGAATTA AAATAGTAAA TAAAGGGTCG 3480 

AAGATGTGAT TTCATATCTT CGACCCTTTT TTGAATTATA TTGATTTAAA GATACAAAAC 3540 

ATGAGAGGGG GGAAGGAATT GATAAAGAAC CATTAAAGAT TTATGATGTA GTGGTTCTTT 3600

ATCATTAAAC ACAGCTAATG TGTATTTAAA AATAGGaayA CATgAGTAAA ACTCATGTAT 3660

AAGAAATACT AATTTCTAAA GAAAAAGTAT TTCTTTATGT TGGGGCCCCG TCAACTACTG 3720

CCAAATACAA CACTATAGAG TCTAGACATT GATTTATGTC CGACTCCCAA GAATAGTTTT 3780

ACTTTTTTAC AATCACTAAT AGATTGCTAA AATCAAAATT TCCTTCACCA CTATCTACAG 3840

TCGACATTTC ATTTTTTGAA ATTATCTACA TTTTTTCATA CCAAGATATT TTATAGTTAT 3900

GATATTTATG TAAAAAGAAT TATATAGTAA GTTAGCTTAA ACTTTACTAA AAACGGGTAT 3960

TAAACTTTGT ATCATTATTT AAATTTTTCA TGTACAATGT AATACAGTAA TCTTATGAGG 4020

TGATAAAATG GATTATATCG GACAATATGC AGTTATCCAT TTAGTGTTAC ATGTTGTATG 4080

TATTTGTATT GCCTATTGGG CTTTACAATC AATTAGATTA GATCAATTTT TTAAAAAAGG 4140

ATACGCCACT CAATTACAAG TGTGTATGAT ATTTGTTGCT ATTTTATTAG GCACTGCAGT 4200

AAGCAATTTT ATTGTAGATT TGTTACAATA CTCGACGCAG GTAAAATATT TAATAAAATA 4260

AGTCTAACTC TATGATTTGT AATCAAAACT AGATATAATT AAATAATGAC TTAAAATAAT 4320

TTTAAAATAG GGAAATGTAA AGTAATAGGA GTTCTAAGTG GAGGATTTAC GATGGATAAA 4380

ATAGTAATCA AAGGTGGAAA TAAATTAACG GGTGAAGTTA AAGTAGAAGG TGCTAAAAAT 4440

GCAGTATTAC CAATATTGAC AGCATCTTTA TTAGCTTCTG ATAAACCGAG CAAATTAGTT 4500 
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GACGTTACAT ACAAAAAGGA CGAAAATGCT GTTGTCGTTG ATGCAACAAA GACTCTAAAT 4 620 

GAAGAGGCAC CATATGAATA TGTTAGTAAA ATGCGTGCAA GTATTTTAGT TATGGGmCCT 4680 

CTTTTAGCAA GACTAGGACA TGCTATTGTT GCATTGCCTG GTGGTTGTGC AATTGGAAGT 4740

AGACCGATTG AGCAACACAT TAAAGGTTTT GAAGCTTTAG GCGCAGAAAT TCATCTTGAA 4800

AATGGTAATA TTTATGCTAA TGCTAAAGAT GGATTAAAAG GTACATCAAT TCATTTAGAT 4860

TTTCCAAGTG TAGGAGCAAC ACAAAATATT ATTATGGCAG CATCATTAGC TAAGGGTAAG 4920

ACTTTAATTG AAAATGCAGC TAAAGAACCT GAAATTGTCG ATTTAGCAAA CTACATTAAT 4980

GAAATGGGTG GTAGAATTAC TGGTGCTGGT ACAGACACAA TTACAATCAA TGGTGTAGAA 5040 

TCATTACATG GTGTAGAACA TGCTATCATT CCAGATAGAA TTGAAGCAGG CACATTACTA 5100 

ATCGCTGGTG CTATAACGCG TGGTGATATT TTTGTACGTG GTGCAATCAA AGAACATATC 5160 

GCGAGTTTAG TCTATAAACT AGAAGAAATG GGCGTTGAAT TGGACTATCA AGAAGATGGT 5220

ATTCGTGTAC GTGCTGAAGG GGAATTACAA CCTGTAGACA TCAAAACTCT ACCACATCCT 5280

GGATTCCCGA CTGATATGCA ATCACAAATG ATGGCATTGT TATTAACGGC AAATGGTCAT 5340

AAAGTCGTAA CCGAAACTGT TTTTGAAAAC CGTTTTATGC ATGTTGCAGA GTTCAAACGT 5400

ATGAATGCTA ATATCAATGT AGAAGGTCGT aGTGCTAAAC TTGAAGGTAA AAGTCAATTG 5460

CAAGGTGCAC AAGTTAAAGC GACTGATTTA AGaGCAGCAG CCGCCTTAAT TTTAGCTGGA 5520

TTAGTTGCTG ATGGTAAnAC AAGCGTTACT GAATTAACGC ACCTAGATAG AGGCTATGTT 5580

GACTTACACG GTAAATTGAA GCAATTAGGT GCAGACATTG AACGTATTAA CGATTAATTC 5640

AGTAAATTAA TATAATGGAG GATTTCAACC ATGGAAACAA TTTTTGATTA TAACCAAATT 5700 

AAACAAATTA TACCTCACAG ACAGCCATTT TTATTAATTG ATAAAGTAGT TGAATATGAA 5760 

GAAGGTCAAC GTTGTGTGGC TATTAAACAA GTATCAGGAA ACGAACCATT CTTTCAAGGG 5820 

CATTTTCCTG AGTATGcGGT AATGCCAGGC GTATTAATTA CTGAAGCGTT AcTCAAACAG 5880 

GTGCGGTAGC TATTTTAAAT AGTGAAGAAA ATAAAGGTAA AATCGCTTTA TTTGCTGGTA 5940 

TTGATAAATG TCGTTTTAAA CGTCAAGTAG TACCTGGTGA TACTTTAACG TTGGAAGTAG 6000 

AAATCACTAA AATTAAAGGA CCAATAGGTA AAGGTAATGC TAAAGCTACT GTCGATGGTC 6060

AACTTGCTTG TAGTTGTGAA CTTACATTTG CAATTCAAGA TGTAAAATAA AACAAAAAAA 6120 

ACATTCAAAG ATTTAATGTG TTGGCATAAT CTTTGAATGT TTTTTATTTT ACTCTTCTAA 6180 

TTTTTCATCC TTTAACTTTG GTTTAGACTG CaTCATTCGA TTAAATGATT TTTTTAATTC 6240

TTCACCAGAT AATCCATCAT CAATAAGTTG GTTCTAATAA ACTTTCAGCA TACTGTTGGA 6300 
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(2) INFORMATION FOR SEQ ID NO: 481: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1717 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 481:
AGTTGCTACA CCAGACATGA TGGGTGAAGT TGGTAAATTA GGTCGTGTAT TAGGACCAAA 60 

AGGTTTAATG CCAAACCCTA AAACTGGAAC TGTAACAATG GATGTTAAAA AAGCTGTTGA 120 

AGAAATCAAA GCTGGTAAAG TAGAATATCG TGCTGAAAAA GCTGGTATCG TACATGCATC 180

AATTGGTAAA GTTTCATTTA CTGATGAACA ATTAATtGAA AACyTCaATA CyTTACAAGA 240

TGTATTAGCT AAAGCTAAAC CATCATCTGC TAAAGGTACA TACTTCAAAT CTGTTGCTGT 300

AACTACAACA ATGGGTCCTG GAGTTAAAAT TGATACTGCA AGTTTCAAAT AATAAATGAT 360

ATAAACAATT ACAGGCTGAA AGAAATATCT TTCAGTCTGT AAAAATATAT TGACAATAAG 420

TAATTTCCAA GTTATATTAC TTATTGTGAT TATTTTACCT AAGACAGTAG GAGTTATTTA 480

TAACTTAAAA TTTATCCTGC CGAGGCTAAA ATTGACTTGA ACGTGATGAT CTATGATCTT 540

TCAAGCACTT TTTGCCGTGG GTAGAAAGTG CTTTTTTTAT TAATTTTAAA AAAAGCACCA 600

AAAATTTAAA TGGAGGTGTC TGAATGTCTG CTATCATTGA AGCTAAAAAA CAACTAGTTG 660

ATGAAATTGC TGAGGTACTA TCAAATTCAG TTTCAACAGT AATCGTTGAC TACCGTGGAT 720

TAACAGTAGC TGAAGTTACT GACTTACGTT CACAATTACG TGAAGCTGGT GTTGAGTATA 780

AAGTATACAA AAACACTATG GTACGTCGTG CAGCTGAAAA AGCTGGTATC GAAGGCTTAG 840

ATGAATTCTT AACAGGTCCT ACTGCTATTG CAACTTCAAG TGAAGATGCT GTAGCTGCAG 900

CGAAAGTAAT TTCTGGATTT GCTAAAGATC ATGAAGCATT AGAAATTAAA TCAGGCGTTA 960

TGGAAGGCAA TGTTATTACA GCAGAAGAAG TTAAAACTGT TGGTTCATTA CCTTCACACG 1020

ATGGTCTTGT ATCTATGCTT TTATCAGTAT TACAAGCTCC TGTACGCAAC TTCGCTTATG 1080

CGGTTAAAGC TATTGGAGAA CAAAAAGAAG AAAACGCTGA ATAATTTTTA GCGTAAAAAA 1140

ATTAAAAATA ATGGAGGAAT TATAAAATGG CTAATCATGA ACAAATCATT GAAGCGATTA 1200

AAGAAATGTC AGTATTAGAA TTAAACGACT TAGTAAAAGC AATTGAAGAA GAATTTGGTG 1260

TAcTGcAGCT GCTCCAGTAG CAGTAGCAGG TGCAGCTGGT GGCGCTGACG CTGCAGCAGA 1320

AAAAACTGAA TTTGACGTTG AGTTAACTTC AGCTGGTTCA TCTAAAATCA AAGTTGTTAA 1380 
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TCCTAAAGTA ATCAAAGAAG CTTTACCTAA AGAAGAAGCT GAAAAACTTA AAGAACAATT 1500 

AGAAGAAGTT GGAGCTACTG TAGAATTAAA ATAATTCAAG TATCTTAAAC TTAATAATCA 1560 

AAGTTTTATA GCAAGTATTG CTATAATATA ATGATTCTTT GAGAAGTTAA AACCCCGTTA 1620 

TTTTGATAAC GGGgTtTTAT TCaTTTAAAG ACTGAGTGAA ATGTTATAAT TATAATGACG 1680 

AGTTACAAAG TGAAGATGAG GTGGGAATAA TGAGTCA 1717 




(2) INFORMATION FOR SEQ ID NO: 482:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1279 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 482:

GTAAATCTGT TACTCGAAAT GTAACGATTA AAGAAAAGGG CTCATCTCAA ACATATATTT 60 

TGTTAGGCTA TCCAACAAAA GCACAGAAGA ATAGTCATAG CAAATATAGT GGAGTCTTTA 120 

TATATAAAGA CTTGAAATCA ATCGAAGATA CAAATAATGC TATTACGATT ATCACCATAA 180

TTACGGCTGT TATTTTCTTA ACAATTACAA CAGTCTTTGC GTTTTTCTTA TCGTCAAGAA 240

TTACAAAACC TTTAAGACGT TTAAGAGACC AAGCTACACG TGTATCTGAA GGGGATTACT 300

CTTATAAACC TTCTGTCACA ACGAAAGATG AAATTGGTCA ATTATCGCAG GCATTTAATC 360

AGATGAGTAC AGAAATCGAA GAGCATGTCG ACGCATTATC CACATCTAAA AATATTAGAG 420

ACAGCTTAAT TAACTCTATG GTAGAAGGTG TCCTAGGTAT TAATGAGAGT CGACaAATTA 480

TCTTATCTAA TAAGATGGCG AATGATATTA TGGACAATAT TGATGAAGAT GCTAAAGCTT 540

TCTTATTAAG ACAAATAGAA GATACTTTTA AATCAAAACA AACTGAAATG CGCGATTTAG 600

AAaTGAATGC ACGATTCTTT GTTGTGACCA CAAGCTATAT CGACAAGATT GAACAGGGAG 660

GTAaAAGTGG TGTTGTTGTG ACAGTTCGTG ATATGACTAA TGAGCACAAT CTAGATCAAA 720

TGAAGAAAGA TTTCATTGCT AATGTATCAC ATGAATTACG TACACCGATA TCATTACTTC 780

AAGGTTATAC TGAATCAATT GTAGATGGTA TTGTTACAGA ACCGGATGAA ATAAAAGAAT 840

CGCTTGCCAT TGTCCTTGAT GAATCGAAAC GTTTAAATCG TTTAGTTAAT GAATTGTTAA 900

ATGTCGCACG CATGGATGCT GAAGGGTTAT CCGTAAATAA AGAAGTTCAG CCTATTGCAG 960

CGTTACTAGA TAAGATGAAA ATTAAGTATC GCCAACAAGC TGATGATTTA GGTCTAAATA 1020

TGACTTTTAA TTAyTGTAAG AAGCGTGTTT GGAGTTATGA TAwGGATCGC ATGGACCAAG 1080 
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TTACTTGTGA 


TGAAAATGAA AGCGAAGATA TTTTATACAT TAAAGATACA GGTACAGGCA 1200

TTGCACCAGA ACATTTACmA CAAGTATTTg ATCGTTTTTA TAAAGTTGAT GCAGCGnAnA 1260

ACCCCGnGGT AACCAnGTA 1279




(2) INFORMATION FOR SEQ ID NO: 483: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1144 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 483:






GAGCTGTTGT TACTTTGATG CCTGCAGCTT TATTACGGCT GACTTGGTAA TGATAAGTTT 60

CAGCATATTG CTCAATATAT GCTATATCAT ATTGAATGGT ACGAGGTGAT ACACCAAGTT 120

GATTAGCAAT GGTATTGATT GGAATAAACG TTTGCTCATG AATTAAAAGA TACAAAATTT 180

CGATTTGTCT ATAACTTAAC AACGTAATAT CCTCCTATTT GTAATTGTAA GCGATTTCTT 240

AAAAACGTAG ATATGCAATC TCTTTCATAT TTTAATCCGA AAAATTGCAT ATCAAAATGT 300

TTATGGCGCA AGATTTTATA GGAACTTTTA AAATAAATTA rATATTCATG TTGACAATTT 360

AAAAATGTCG CAGTATATTT AGTTAGACAT CTAACGAAAT GGTGGTGCAA TAAATGGAAT 420

TCACTTATTC GTATTTATTT AGAATGATTA GTCATGAGAT GAAACAAAAG GCTGATCAAA 480

AGTTAGAGCA ATTTGATATT ACAAATGAGC AAGGTCATAC GTTAGGTTAT CTTTATGCAC 540

ATCAACAAGA TGGACTGACA CAAAATGATa TTGcTAAAGC ATTACAACGA ACAGGTCCAA 600

CTGTCAGTAA TTTATTAAGG AACCTTGAAC GTAAAAAGCT GATCTATCGC TATGTCGATG 660

CACAAGATAC GAGAAGAAAG AATATAGGGC TGACTACCTC TGGGATTAAA CTCGTAGAAG 720

CATTCACTTC GATATTTGAT GAAATGGAAC AAACACTCGT ATCGCAGTTA TCTGAAGAAG 780

AAAATGAACA AATGAAAGCA AACTTAACTA AAATGTTATC TAGTTTACAA TAAATGATAA 840

GTGTGACTGG TAGAAATCAG TCACTTTGTC TTTAATATTA TAGTTAGATA TCTAATTGTT 900

AGTAAGCTAA TTATTGGAAA AGACAAGGAG TATTGAACAA TGAAAGACGA ACAATTATAT 960

TATTTTGAGA AATCGCCAGT ATTTAAAGCG ATGATGCATT TCTCATTGCC AATGATGATA 1020

GGGACTTTAT TAAGCGTTAT TTATGGCATA TTAAATATTT ACTTTATAGG ATTtTyAGAm 1080

GAyAGCCACA TGATTTCTGC tAatCTCTCT AACACTGCCA GTATTTGCTA TCTTAATGGG 1140

GTTA 1144
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 484:

ACAACCAATT TTACTAAACA TGGTrrTCAT gCATTTTCAA TATTTAGATG TCGACCGCAT 60 

AATCGAAGAA TCGCCGACAA TAGTACTTAT CGATGAGTTA GCACATACGA ATATTTCTAG 120 

AGATCGTCAT GAGAAACGAT ATATGGATAT TGAAGAAATT TTAAATCATG GTATCGATGT 180 

TCATACCACT TTGaACATTC aaCATATTGA AAGTTTAAGT AGTCAAATTG AACTGATGAC 240

CGGTGTACAT GTTAAAGAAC GTGTACCCGA CTATTTCATA ATGAGCGCCG ATGTATTAGA 300

AGTCGTAGAT ATCTCACCTG AACAATTAAT TAAACGCTTA AAAGCTGGCA AGGTATATaA 360

AAAGGATAGG CTAGATGTAG CATTTAGTAA TTTCTTTACG TATGCCCACC TAAGCGAAtG 420

CGTACATTGA CGTTAAGAAC AGTTGCCGAC TTGATGAGTG ATAAAGAAAA AGTCCGACAC 480

AACCATAAAA CGTCACTCAA ACCTCATATT GCTGTGGCAA TTAGTGGGAG CATTTATAAT 540

GAAGCAGTAA TTAAAGAGGC ATTCCATATT GCTCAAAAAG AACATGCGAa GTTCACTGCT 600

ATTTATATAG ATGTATTCGA AAAAAACAGG CAATATAAAG ATAGTCAAAA GCAAGTGCAT 660

CAACATCTCA TGCTTGCAAA ATCATTAGGA GCAAAAGTAA AAGTAGTTTA TAGCCAAACC 720

GTTGCATTAG GATTAGACGA ATGGTGTAAA AATCAAGATG TAACCAAATT AATTATCGGA 780

CAACATATTA GAAATAAGTG GCGAGACTTT TTCAATACAC CTTTAATTGA CCATTTAATG 840

TCCTTTGAAC ATAGCTATAA AATCGAAATC GTTCCAATCA AACAAATACC TGTTGAATTG 900

AAAATGAACA AATCACCCTA TCGTCCTAAA GGCAAACGTT TCGCCATAGA TATGTTAAAA 960

ATGATTTTGA TTCAAATAAT TTGTGTAATG ATGGGACTGT GGATTTATCA ACTTGATAAG 1020

CATGAGTCTA GTACGATTAT TTTAATGATT TTTCTCATCG GCATCATTTT ATTATCCATT 1080

TGGACGCGGT CCTTCATCAT TGGCTTTTaG CAGCAATTAt TAACGTATTT GTgTkTAATT 1140

ATkTTTTtAC GGAACCTA 1158

(2) INFORMATION FOR SEQ ID NO: 485: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2224 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 485: 

ATCATATGGT CGATTTAACA GATCCAACGT ACTGCtAAAT AATTACATGA CGTTTAAACA 60 

TAGCATTGAT TATAACTATT TCTAAGTCTT CGCATTATTT GCGATGATGT GGGAATAGTT 120 

ATTTTTATTT AAAAATATAA AAAAATAGAT GCAGCAAAAT TTTAAAGCAT TTTATTTTGA 180 

ACATATTAAA AGGGAGCGTA TCATAATGGA ATGTAATGTT TATATCGTAT GCATTACGGA 240 

TAAATAATAT ATAAATCATT CTTGAGGAGT GAAAGAATAA TGAGAGACTA CACAAAGCAA 300 

TACATTAATG GCGAATGGGT AGAAAGTAAT AGTAATGAAA CGATAGAAGT TATAAATCCA 360 

GCAACCGAAG AAGTAATCGG GAAAGTTGCT AAAGGTAATA AAGCTGATGT TGATAAAGCC 420 

GTCGAGGCGG CAGACGATGT TTATTTAGAG TTCCGTCATA CATCTGTGAA AGAAAGACAA 480

GCGTTATTAG ATAAAATTGT AAAAGAmTAT GAAAACAGAA AAGACGATAT TGTACAAGCT 540 

ATTACGGATG AATTAGGTGC TCCTTTATCA TTATCTGAGC GTGTCCATTA TCAAATGGGA 600 

CTAAACCATT TTGTTGCAGC GAGAGACGCA TTAGATAACT ACGAATTTGA AGAACGCCGC 660

GGAGATGATT TAGTTGTTAA AGAAGCAATC GGTGTATCTG GATTAATTAC ACCGTGGAAC 720 

TTCCCTACAA ACCAAACATC ATTAAAATTA GCAGCAGCAT TTGCGGCTGG TAGTCCAGTT 780 

GTACTTAAAC CATCTGAAGA AACACCATTT GCAGCTGTTA TTTTAGCTGA GATTTTTGAT 840 

AAAGTCGGTG TTCCTAAAGG TGTATTTAAC CTTGTTAATG GTGATGGTGC TGGTGTTGGG 900 

AATCCTTTAT CTGAACATCC TAAAGTACGC ATGATGTCAT TTACAGGATC AGGCCCTACT 960

GGTTCTAAAA TTATGGAAAA AGCCGCTAAA GATTTTAAAA AGGTATCATT AGAGCTTGGT 1020 

GGCAAATCAC CATATATCGT CCTAGATGAC GTAGATATTA AAGAAGCGGC TAAAGCAACa 1080 

aCAGGCAAAG TTGTTAATAA TACTGGTCAA GTATGTACAG CTGGTACACG TGTTTTAGTG 1140

CCTAACAAAA TTAAAGATGC ATTCTTAGCT GAATTAAAAG AACAATTTAG CCAAGTGCGT 1200 

GTCGGTAATC CAAGAGAAGA TGGTACACAA GTAGGCCCTA TCATTAGTAA AAAACAATTT 1260 

GATCAAGTAC AAAATTATAT TAATAAAGGT ATTGAAGAAG GTGCTGAATT ATTTTATGGT 1320

GGTCCTGGTA AACCAGAAGG ACTTGAAAAA GGATACTTTG CACGTCCGAC AATTTTTATT 1380 

AATGTAGATA ATCAAATGAC GATAGCACAA GAwGAAATTT TTGGGCCAGT AATGTCAGTT 1440

ATCACTTATA ACGATTTAGA TGAAGCGATT CAAATTGCAA ATGATACAAA ATATGGTTTG 1500 

GCAGGATATG TTATTGGTAA GGACAAAGAA ACATTGCATA AAGTAGCTCG TTCTATTGAA 1560 

GCAGGTACAG TAGAAATAAA CGAAGCAGGT AGAAAGCCAG ATTTACCATT TGGTGGCTAT 1620

AAACAATCTG GTTTAGGTCG TGAATGGGGC GATTATGGTA TTGAAGAGTT CTTAGAAGTG 1680 
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AGTGCACATG 


ACTAATTAAG TTTTGTGTAC TGTTTTAATT TTGCAATTTT TATAAATAGA 1800

TTTTGTAATT AAAATAAAAA TTTGCTATAG TTATTCATGT ATTTAAAAGG TTGGGGATTA 1860

GCATAATGGG ATTGTGCTAG CACAGTTATT TATGCATTGT CATGCCTATC TATTACTTAC 1920

TAACTAAAAA ATAATGAAAT GGGTGTAAAC TATATGCCTG AAAGAGAACG TACATCTCCT 1980

CAGTATGAAT CATTCCACGA ATTGTACAAG AACTATACTA CCAAGGAACT CACTCAnAAA 2040

GCTAAAACTC TTAAGTTGAC GAACTATAGT AAATTAAATn AAAAAGAACT TGTTCTAGCT 2100

ATTATGGAAG CACAAATGGA nAAAGATGGT AACTATTATA TGGAAGGTAT CTTAGATGAT 2160

ATACAACCAG ATGGTTATGG TTTTTTAAGA ACAGTGAACT ATTCTAAAGG GGAAAAAGAT 2220

ATTT 2224




(2) INFORMATION FOR SEQ ID NO: 486: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1690 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 486:








ACATTACnTT GAATCGAAAG TTTCATAAAC GACTTGAATG CCAGTTTCTT TTTCAAATTT 60

CTTAATTAAC TCTGGATCAA TATATTCGCC CCAATTGTAT ACGTAAATTT TTTGATTTGT 120

ATGCACTTGT TCTTTAGATT TAAACCAATG ACTTAAAGTA AGACAAAGCA TACCCACAAC 180

TAATGCACCT ATAATGAGTT GTAAAAATCG TTTCATTATT TTACACCTCG CTTGATTAGT 240

TTTTTCTTAT TTATCACGTA TTGAATCAAA TAATATCCTA GTATTCCTAA TACAATAACA 300

GCAAACAATA ATGTTGAAAT CGCATTAATT TCCATACTAA TTCCTTTTCT CGCCATAGCA 360

TAAACTTCAA CTGATAACAC ACTAAAGCCA TTACCAGTAA CGAAGAAACT TACTGTGAAA 420

TCGTCTAGTG AATAAGTTAA AGCCATAAAG AATCCTCCTA TAATAGAAGG TAAAATATTA 480

GGAATAATAA TGTTGCTTAA TAATTGTGGT TCAGTCGCTC CTAAATCTCT TGCAGCATTT 540

AACATATTAT TATTCATyTC ATACAGTTGT GGTAAGACGA TAATCACAAC TATAGGTATG 600

CAAAATGCAA TATGAGATAT TAGAACTGTC CaAAAkCCTA AACCAAGACC AGTAAAATGG 660

CCAATCGTTG TAAACATAAT TAAGAATGAT GCACCTATGA CAACGTCGGA TGATACCATC 720

AAGACATTAT TCAATGTTAG TAAAGTTACT TTAAACTTTT TATTTCTTAA ATAATAAATA 780

GCAATGGCAC CAAATGTACC AATAACTGTA GAAATTGAGG CTGCTAAAAG TGCTACAGCT 840
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AATGTAAAAT 


GTTCAAAGTG 


AATCATATTA 


CCAGCCGAAT 


TGAATGAATA 


GAACATTAAA 


AAGAATATTG 


GGATGTATAA 


AATCGCTAAA 


AGTATCCCGA 


TATACAGCTT 


TCCATACCAT 


TTCATATGAT 


TCACCCTCTC 


CCATTAGATG 


ATTTTGTAAT 


GATTAAAATG 


AATGCCATAA 


ATACAATTAA 


GAATATAGCT 


ATAGTTGATC 


CCATACCATA 


ATTTTGAATT 


GTTAAAAATT 


GTTCCTCTAT


TGCCGTACCT 


ATATTTATGA 


CTTTATTACC 


TGCAATTAAT 


CTTGTAATCA 


TAAATAATGA 


AAGTGATGGA 


ATAAAGGTTA 


CTTGAATCCC 


AGTCATAACA 




TTAACGGCAT 


GATTACTTTT 


CTAAAAGTAT 


AGAAAGGACT 


GGCACCTAAA 


TCACTTGAGG 


CCTGCAATAA 


ATTATTAGGA 


ATTGCTTTCA 


TGCTATTAAA 


TATAGGTAAA ATCATAAATG 


GTATATAAAT 


GTAACTTGCC


ACTACTAAAA 


ACGCACCAGT 


TGTAAATAAC 


AAATTGAATG 


ATGGTAAATT 


AAATAAGTGG 


AAAATTGATT 


AATCACGCCA 


TCATGACTTA ATAAACCTAT 


AAAAGCATAT 


GTCTTTAACA 


ATAAATTTAT 


CCATGTTGGA ATAATCATTA TCATTAATAA 


GATATTTTGA 


AATTTCGAAC 


GAGTAATATA 


ATAGGCAGnT 


GGATAACTGA 


TAGTCAAGGT 


AATAATTGTT 


ATTGAAGCGG 


CATATAAAAT 


TGAATATGCA AACATTTTCA 


AATATTTTGT 



AGTAAAAATT 

(2) INFORMATION FOR SEQ ID NO: 487: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2112 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 487: 
ACGAAAAGAA ATATTATGAT GAACAAAAAG AAAGAATAAC GATTTATATG AAGTACAATG 
TGAAAGGTTA TAAAAATATA AGCTTCGCTA ATTTTAAAGA AAACCCAATG GATGGTTATT 
CTATTAGTGG TTATATAAAT AAtGaTAAAA AGTTATCATT TACAGCTGGT ATAAGATCTG 
TTGATGATTT TCAATTTGAT ACCGATATTT CTTATACAGA TGAATTGGGT AGAAAATTTA 
ATAAAAATCC TAAGTCAGTT TCTGAAATAA AAAAAGAGCA AAATACGTCC AATAAATAAT 
TGTTCATATT GTGATGAAAC AAAAATATAA GTCATTAGAT GAGTTTAACT ATGTTATAAA 
TATTTGTAGT ATCTATAAAA ATCTCGACAC TATTAAAATG ATAAAGTGCC GAGGTTTTCT 
TACTTATTTA GTTAATTCAA AGTTTATGCC AGATTCATAA GAATTTGTGA CACTTTTAAT 
AGTGTAcCAT TGATTATTAC AATTTATCAA ATGGTCCTTT AGAAGGTATA AATAACAAAA
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TATTATTATG TTCAAAACTT TACGCTCCAA AAAGTAAAAA GGAAGTTAAG CAATGTTTAG 660 

TTGCTTAaCT TCGGaTATTG AACGCATCAG TCCAATTTGA CATAGAGCCT TTTTTAGTTC 720 

TTGATGTTTC TCTTTAAAAC CTTGCATATT TTACAAAAAG AAAAATTAGC AGTATAATTA 780 

AGACAACGAA AATAAGTATT TACTTATACA CCAATCCCCT CACTATTTGC GGTAGTGAGG 840

GGATTTTTAT TGGTGCGGCT ATATGTCACC TATTTTGTAT TGCGTCTACT TAGCCAATAA 900 

GAAAAAAACG CAATGGCACA GCCACTGATG ACTGGTGCTA TGATGTGAAC GaAAATAAGC 960 

ATCACCTTAT ACACCTCCTC TCTGCGTCTA AATTGACGsC TGAGaGrTAG GcGACTCTAC 1020 

TATTATATCA TCGGCAAATA TACAAGCACA GTCACTTGCT TCTGATAAGT TATATGATTC 1080 

TAGCTGATAG ATTGAATCGT CTACACTTAA TTGGACAAAT TCTATGAGAA TAGATATTGT 1140 

TAATTTAAGA AAGTAGGCGA TTTTATTATG ACAAGAGAAA GAAGATCATT TAGTTCAGAG 1200 

TTTAAGTTAC AAATGGTTAG ATTATATAAA AATGGTAAGC CTAGGAATGA AATTATACGC 1260 

GAGTATGATT TCACACCTTC GACGTTTGTA AATGGCGGTT ATAAAATGTA GGAAAATGGA 1320

TAAAGCAACA TCAAAACACG GGTACATTCA ATCACCAAGA TAACTTATCG GATGAAGAAA 1380 

AAGAGCTGAT TAAATTACGC AAAGAAGTTC AACATTTAAA AATGGAGAAC GATATTTTAA 1440

AGCAAGTAGC GCTGATTATG GGGCAAAAAT AGAAGTCATT CAAAAGAATG CACATCAATA 1500 

TTCAGTATCA GCAATGTGTA AAGTCCTGAT AATACTAAGA AGTACCTATT ATGATTCTAT 1560 

AAAAAGAAAA GATAATAAAA TCACTAAAGA TGATTCAAAC ATAGAACATG CCGTCATAAA 1620

TATTTTTAAT TCTAATAGAA AAGTCTTTGG TACAAGACGA ATTAAAAATC ATTTAAATGA 1680 

CAAGGGTCTC ACTGTATCTG GACAAAAGAT AGGTCGATCA TGAAAAAATC TAGTTTCTGT 1740

TTATACGAAA TCTAAATACA AAAATCATCT AAAAGAAACT AATGAAAAAC GAATTAAAAA 1800

TCTTTATTAT TAGCTGCTGG TGTATTATTT GTTAGTCCAA TTTCATTATC TTATAATTCA 1860 

GATGTAGCTC ATGCTGAAGA TAAGTTAGAC CATTCTCAAG CAAAGGTAAT ATATTTGAGT 1920 

AACCAAAATT TATTTGATGA ACTTGAGAAA AAAGGTTATA AACTGGAAGA TATATTTACA 1980 

AAAGAAGAAA TAAAAAAATA TAAAGCTGAA GACCAATTGA GAGCGGGTAA AACTCAATAT 2040

GTAGAAACAG GTAAAGATAC TGCAACATTA TATCTTTCTT CTGCATATAC AAAAACAATA 2100 

GCTGCTTTAG GT 2112 
(2) INFORMATION FOR SEQ ID NO: 488:
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(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 488: 

GTAGGCACAC ATCTGCCATA TAAACATTCT TTTATACTAG TGTTCTCATA TAGTGTAGAG 60

TTATAGTCTC CTTCTTGAAT CTCGAATAAT TCAATCAACC TATCAACCTT AGTCTCTTCC 120

GTTACTTCTT TTTCAATATC AACTATGAAG GGGATATCAA TTGGAATAAA ACTTGACGTC 180

GAACACTTAT TTGTATTTGG ATGAAAACGA ACGAATCCAT CACTAAATCC TGTTGAAAAA 240

AATATTTTTC CTTGTGATAG ATCCGGATTT TCTCGCGCCC ATTTAATTAA TTCATCTAAT 300

CTCATTTCTT TTTTAACTTT GATTTTCATT GTTATATCTC CTCTTGAACA GTAAATTTAT 360

CGTTAAtTGA TACGTATCCA GTCACATTAC ATAAGATGCT ATCAACATCA AAAGTCACAC 420

AACAGTTGCG TTCAACATCA TTTGAATAGA ATCT 454 



(2) INFORMATION FOR SEQ ID NO: 489: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1372 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 489:



TTGTCAGAAT TAGAATGCTT TTGAGTTACT TCATAATACT CATCAGTTTT TTGTGTATCC 60

TTTTGACTTT TATTTATTTC TTTCCACTTA CCAGTATGAC TACAGTTATT 120

TTAAAAAATG ATAACCGATG TTCTTTTTGG TATCTGTGGA CTCTAAAAAG 180

ACTGAATTGT TTTCCTGATT ATCAGAATTG GTTGTTTGTT TATCGTCTGT ATATAACGTA 240

TACCCATTCG CTTTGCGATT TTCAATCGTT GTAACTGGAT TCTTTGATGA ATCAGTACTT 300

TTAATGCCAG TTCCTAAGAA GACAATTTTA TCGTTTAATA TGAAATATGA TTTTTTGGCA 360

GTTAAAGTTT TGTCCTGATT TTCAAAATCC ATTCCGATAC TAGCATGTTG GTCATCAACT 420

TTTGTTCCGC CAACAAAAGT TTTACTCGAC TTTTTATCAT CCGTATCTTT TAATATTTCA 480

TTGTCTAAAG TTGTTGTACC TGATAAACGT TTCATATCGG CTGTCACCCA GAAGTTATCA 540

TGATAGTGTT TGACATCGCT GTTATATAAA TAAGACATTC CAGCACCAGT GTGCCAACCT 600

TTTAAATTCT CTCCGTTGAT ACTTTCATAG CGTGCTACGT TTTTCGACGT CATACTTAAA 660

CCAAATGCAA AGTCTAAGTC TTTGTTATGA TAGGTGACAC GATCCATGTC ATTATATATT 720

TTAAGTTGTT GTGTTAATCC GTTTTTAGAA ATACTGTTAT CTGTCATTAA AGACTTCATT 780
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ACTGAAGATT TGACAATCTT TTTATACTTA GCTTTTGTTG AAT CATC CAT GGCATCACTT 900 

AATCTCAACA ATGATTTCAT TACTGTTGCA GATGCTGAGT GACTGGTTTC ATTTTCACGA 960 

CTGATAGCTC TACCTCGTGA TAAATCCATC ATTTCACCTT TATAAATGAG TGGCATAAAT 1020

CCGTCGTCAA TCCATGACTT TAAGGTTGTA TCATTTTGGG TTTTATCATT AAAAGGTGTT 1080 

TCTTTTATCA TCGGCATCAT TTGAGAAATC CCCTCTAAGA GTACAACGCC ATAAGCACCA 1140 

GTGTATGGAA CGTCTTGATG ATCAATGTAA GAGCCATCTT TATAAAATCC ATTACGTTCT 1200

TTACCAGTGG CAGAATCTTG AACGTAAGTG AAGACTTTAT TAAATGAATC TATAGACTTT 1260 

TTCATCATAT CTTTATCTTC TTCGATAATA CATTCTAAAA GTTTCACCTT AGAAATGTCT 1320 

75 

ACTAnATTTC CGCCTTTAGC AAGTTCAGnT TTTCCTACAC AAGATAATAT TT 1372 

(2) INFORMATION FOR SEQ ID NO: 490: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 564 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 490:



ACAACAATAT AGTAAAGCGT CGTTAATTAA ACAATGGGAC CAATTTGTTC GTCTTATATA 60

AATGTACTTT aCCTTTATTT TTACAAAAAT AGCATTTTCC TATGTCATTT AACTAAACAT 120

GTAAGTTCGT ATGAACGAGG TTTGTTAAAT AGATGATTCT AGGAAAATGC TTTTTTCTTT 180

TGACTTAGTT TAAAATATTT TGCCACTTTG TACTGATAGT AGTTGCATTG TACTGTTGTG 240

CAGATTCTAT GCTATTAATT GAAAACTGTT GCAATTTTTG AGTATTATTT AGTAATTGGT 300

CGACCTTTTC AACCATTTGA TTGATGTCAC CTTGAGGTAC TAAATAGCCA TTAAATCCAT 360

CTTGaATCAG TTCTGaTGGA CCATAATCTA CATCATAACT GATCACTGGT GTACCTACTG 420

AAAgCGACTC TAAAATTGCT AAGCCAAAAC CTTCCATTTT ACTTGTCGAT AACATCAGTT 480

CTGCTTTAGC AATCTCTTCA TTAATATGCG TCTTAAAACC ATGAAATTTA ACATGTTCCA 540

GATATnATGA TAATCTTCTA CAAG 564

(2) INFORMATION FOR SEQ ID NO: 491: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1277 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 491:

TATCCACCCC ACGAnAGCCC CGGAAACTTA TTGTGTTACA AGATATATAA GCAGAAACGA 60

ACAACAGTTA ACAAAATAAA TGAAATTAAA CGTTTTAAAA ATGAAACAAA TGAAATCATC 120

TATTAGGTTA TGAAACTGTT TATAGCTTGA ATAGAAGCAT TTATTTTTTA GGAGGACAAT 180

TATTATGCGT CAAACATTTA TGGCAAATGA ATCAAACATT GAGCGCAAAT GGTATGTTAT 240

CGATGCTGAA GGCCAAACAT TAGGTCGTTT ATCATCAGAA GTAGCATCTA TCTTACGCGG 300

TAAAAATAAA GTAACTTACA CACCACACGT TGATACTGGT GATTATGTAA TCGTTATTAA 360

TGCATCAAAA ATCGAATTTA CTGGTAACAA AGAAACTGAC AAAGTTTACT ACCGTCACTC 420

AAATCACCCA GGTGGTATCA AATCAATCAC TGCTGGTGAA TTAAGAAGAA CTAACCCAGA 480

ACGTTTAATT GAAAACTCAA TTAAAGGTAT GTTACCAAGC ACTCGTTTAG GCGAAAAACA 540

AGGTAAAAAA TTATTTGTAT ATGGTGGCGC TGAACATCCA CACGCTGCAC AACAACCAGA 600

AAACTACGAA TTACGTGGTT AATTAGAAGG AGGAAATGAC TTTGGCACAA GTTGAATATA 660

GAGGCACAGG CCGTCGTAAA AACTCwGtAG CACGTGTACG TTTaGTACCa GGTGAAGGTA 720

ACATCACAGT TAATAACCGT GACGTACGCG AATACTTACC ATTCGAATCA TTAATTTTAG 780

ACTTAAACCA ACCATTTGAT GTAACTGAAa CTAAAGGTAa CTATGATGTT TTAGTTAACG 840

TTCATGGTGG TGnTTCACTG GACAAGCTCA AGCTATCCGT CACGGAATCG CTCGTGCATT 900

ATTAGAAGCA GATCCTGAAT ACAGAGGTTC TTTAAAACGC GCTGGATTAC TTACTCGTGA 960

                                 TCTTAAAGCA GCTCGTCGTT CACCTCAATT 1020

CTCAAAACGT TAATTGTCGG ACGATATATA CAAAACACCT CGATATTATG TCGAGGTGTT 1080

TTTTTGGOGT TTTTGCGGCG AATATGGAAT GTGTAGAATA TAAATGAATT TTTACCTTCC 1140

CACCATAAAA GATGAAGAAC CATGAATGTG GAGAACAATA AATAGTTGGA TATTCTGTTA 1200

TTTTTTTGGA AGTGGAAGTG GATTTGGAAT ACTTTACTCn AAACGATTAA AAGGTTTAAA 1260

AAAACAACAA AnAGAAA 1277



(2) INFORMATION FOR SEQ ID NO: 492: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 492:

TTATTGACAT TOTTTTTATC CAAAATTCAT TGTTAAGACA TTTTCTTTAT GAAATAATAA 120

TnATATTGAA GTATATTTTT ATTATTATTA AAAATAAATA AGGGGATACT TATGAGCACA 180

AATCAAACAT TTTTAATATT TGTTATAGCA ATTATTCTAC TTACATCTGT AATAGGAATT 240

GTTGGACGAT ACATGAGTCG TCAACGTCTA TTAAAATCTA TGGAAACATT ATGGCAAACG 300

ATTTCTCCAT TAGAAGCTTT TATCAGACCG AACTCACATT TCGACTATGA GTATAAGCTC 360

TACAAGGAAA AATTTGAATC ACATTCATTA GTTGATGATA AAACTTGGTC CGACTTAAAT 420

ATGAATGCAA TCTTTCATAA GATGAATTAT AATTTAACAG CTATTGGTGA AATGAAGCTA 480

TATGCCTGTT TACGTGGAAT GCTTTCAATT ACGAACAAAT CATTACTTAG TTTATTTAAT 540

GATAATGCTG AATTTAGAAA AAACGTAACA TATCATTTAG CTTTGATTGG tAAAACTGTT 600

aTCCAAGATT TCCAGACCAA ATCACACCGG kAAACGTCCA AATATATTGn TCTATGCCCG 660

GTTTACCAGT ATC 673

(2) INFORMATION FOR SEQ ID NO: 493: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1240 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 493:

AAAAAAAGTT AATGCTGACG GTGTATTAAC TTTTGATATT CTAGAAAATA AATATACTTA 60 

TGAAGTTATT AACGCTATAG GGAAAAGATG GATTGTTAGT CATGTCGAAG GTGAAAACGA 120 

CAAGAAAGAA TATGTAATAA CTGTCATTGA TAGGAAATCA GAAGGCGACA GACAACTGGT 180

TGAATGTACT GCTAGAGAGA TTCCCATAGA CAAGTTAATG ATTGATAGAA TTTATGTTAA 240 

TGTAACAGGA TCTTTTACAG TAGAAAGATA TTTTAACATT GTGTTTCAAG GTACTGGAAT 300 

GCTTTTTGAA GTCGAGGGCA AAGTTAAATC TTCAAAGTTT GAAAATGGTG GTGAAGGCGA 360 

TACAAGGTTA GAAATGTTTA AAAAGGGATT AGAACATTTC GGTTTAGAAT ATAAAATAAC 420 

GTATGACAAA AAGAAAGACA GATATAAGTT TGTATTGACG CCTTTTGCAA ATCAAAAAGC 480 

GTCTTATTTT ATTTCTGACG AAtCAACGCC AACGCTATAA AACTCGAGGA AGATGCAAGT 540 

GATTTCGCCA CCTTCATTAG AGGATATGGT AATTATTCAG GAGAaGAAAC ATTCGAACAC 600 

GCTGGGCTCG TAATGGAAGC TAGAAGTGCA TTAGCTGAAA TATACGGCGA CATCCACGCA 660 

GAACCATTTA AAGATGGTAA AGTGACTGAC CAAGAAACTA TGGATAAAGA ATTACAATCG 720

TATCCAGAAG CAGACCCACA ACCCGGAGAC ATAGTACAAA TAAAATCTAC CAAACTAGGT 840

TTGAATGATT TAGTCCGTAT AGTACAAGTT AAAACGATTA GGGGTATAAA CAATGTAATT 900 

GTTAAGCAAG ATGTAACGCT TGGTGAGTTT AATCGAGAAC AACGATATAT GAAAAAAGTT 960 

AATACTGCAG CTAACTATGT TTCTGGATTA AATGATGTTA ACCTTTCTAA TCCTAGTAAA 1020 

GCGGCAGAAA ACTTGAAGTC TAAAGTAGCG TCAATAGCTA AATCAACACT CGATTTGATG 1080 

AGTAGAACTG ATTTGATTGA AGATAAACAA CAGAAGGTAA GCTCTAAAAC TGTGACTACA 1140 

TCTGACGGCA CTATCGTTCA TGATTTTATa GATaAATCtnA ACATTAAaGA TGTAAAAmCGy 1200 

aTTGGAACGa TTGGCGATtC TGTAGCTAGA GGATCACATG 1240 




(2) INFORMATION FOR SEQ ID NO: 494: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 494:



ACGGTGGATT TAGCCTAGAA GATTTAACGC ATCAAGGTAa ATTaTCAGCA TTTAGCTTTA 60

ATGATCAAaC AGGTCAAGCA ACATTGATTA CTAATGAAGA TGAAAACTTC GTAAAAGATG 120

AGCAACGTGC TGGCGTAGAT GCAAATTATT ACGCTAAACA AACATATGAT TATTACAAAG 180

ACACATTTGG TCGTGAATCA TATGaCAACC AAGGTAGTCC AATTGTTTCA TTAACGCATG 240

TTAATAACTA CGGTGGTCAA GATAACAGAA ATAATGCCGC ATGGATCGGT GACAAAATGA 300

TCTATGGTGA TGGTGATGGT CGCACATTCA CAAGTTTATC GGGTGCAAAT GACGTAGTAG 360

CACACGAATT aACACACGGT GTGACACAAG AGACAGCGAA CTTAGAATAT AAGGACCAGT 420

CAGGCGCTCT AAATGAAAGC TTTTCAGATG TTTTTGGATA CTTTGTAGAT GACGAGGATT 480

TCTTAATGGG TGAAGATGTC TACACACCTG GAAAAGAGGG AGACGCTTTA CGCAGCATGT 540

CAAACCCAGA ACAATTTGGT CAACCAGCTC ATATGAAAGA CTATGTATTC ACTGAAAAAG 600

ATAATGGTGG CGTACATACG AATTCTGGAA TTCCAAATAA AGCAGCTTAT AACGTGATTC 660

AAGCAATAGG GAAATCTAAA TCAGAACAAA TTTACTACCG AGCATTAACG GAATACTTAA 720

CAAGTAATTC AAACTTCAAA GATTGTAAAG ATGCATTATA CCAAGCGGCT AAAGATTTAT 780

ATGACgAGCA AACAGCTGAA CAGGTGTATG AAGCATGGAA TGAAGTAGGC GTGGAGTAAA 840

AATATATAAA CaAGAAGAAG TAATGTTAAA CACTTATAAA TAATTAAATT TTAAATACAG 900

ATTAGATGAG AGGAGTGTGA GGGTTGTCTG CCGAAAGACT ACTCGGCAGT CTAAAATCAT 1020

TACAAGTAGT AGATATGTGA TAATTAAATG CTGACTTAGA ATACAAAATT CATTTTAAAA 1080 

GTTGTCACAA AAAATTTACA TGTATTTTTA TTATCTTTTG CAAAACAAAG TGTTAAATTA 1140 

TAAATGAaAC ATGCATGAAT TTATTTTTTA ATACAAGAAA CGTAACTACC AAAGGAGTTT 1200 

ACAATATGAA GAAAAGTAAA CGATTAGAAA TTGTTTCTAC AATAGTTAAA AAGCATAAGA 1260 

TTTATAAAAA AGAACAAATc ATTTCATATA TTGAAGAATA TTTTGGTGTA A 1311 



(2) INFORMATION FOR SEQ ID NO: 495:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1761 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 495:

TGCACTTTCT AAAAATACTT GCTTTACTTG TTCCAATTCC TTGTCAGACA TTGATTCGGC 60

TAATTTATTC ATTATTGCCT CAAATACGTT AATTATGTCA TCCATTTCAG TACTATAAAA 120

ATCAAACCAT TTTGCCGTAT CTTTTTCTCT ATTAAGCTTA TGATCACTTT GAGAGCGCTT 180

AGCTAATTCT GCATAAATAT ATGGACAAGG TGCCATTGCA GCAATTGTAT AAATAGCATT 240

TTCACGACTA TGCGCTTGGA AATACATATG TTTTATGTAA TGGTCGCCAC TTGGAGGCCA 300

AACTTTTGTT TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG 360

CGCAAGTACT TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT 420

CATGCTATTC ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC 480

AGCATCAGCT TTTAAGTAAT GGCGTAATGC GTCAGcTCCT ATATCTCCGG ATAACATCTT 540

CTGAATAAAG TCATCCTCAT AAATATCATT AATGATTGGC TTTGCAGCTT GGTACAATTT 600

TTGTGAAAAT TCCATTGTAA AAAATCCTCC CTAAATAAAA AAACTACTTC CAACATGAAA 660

GTAGTTTGAT GGCAATGTTG CTATACTAGC CCCATCACTT CAATAACTAC TTTCCTACGT 720

TGGTACTAAC CAAATCAGGT CATAAGGGTC TGAACAATTC ATCTCAGCCA TATCATTAGG 780

CTCCCCTAGT AGTTCCTTAG TATTCAATTG CAAATTAATC TTAGCAAACG GTTTCAACAT 840

TTTCAATTAT TGTTGCTCAG TTGTATTATT ATCTTTAAAT AATAATTCTA TAATGACATA 900

TATTTGCGAA ATAAAAAAAC CGGAACATAT CGAGAATTCC CCGATATATT CCAATCTAAA 960

AGTTACTTAT ATAACTATTA ATTAGCTATG CATAAATGGC TTATGCAGTA ACCCAATGTC 1020

TTGCTGGTGA TACACCTTTA TATTTAGCAG GTGCTACTGA ATCCCAAGTT GATTGTAAGA 1140

ATTGATACTT ACCAGCTGCA CCTGgATGTT GGrTTTACAG CATGAATATT GCCACCTGaT 1200

TCACGTTGrG CAATTTGTTT TAGATGAGCa TTCACATTTA CTGATGAACC TTCTGATGAT 1260

TTTGATyCAG TTGGTGTTGC AGTAACTTGT GAATTGTTTG ATGTTGATGC TTGTGGTTGT 1320

TGAGTTTGAG CATTTTGTGG TGCTTCAACT TCTTGTGATT GTACTTGATT AGCTTGAACA 1380

GCTGATGGTG CAACATTATT AGTTGCAGGT GCTTGTGCAC TCATGTCTGC TCCATTAGTA 1440

CCTGTTGCAT GGTAATTCCA AGCAAAGTGT GTACCATCTG ATTCAAAGTG ATAAGTAAAC 1500

CCTTCATAGT CAAATGTATA ATTATAAGCC CCAGCTTCAA TTGGTTTTTG ATTTAATGTT 1560

TGATCATTTG ATTGCGCCAT TTGCCTGAAA GATGCTTTAT TTAAGTCCGC TTCACnTGCA 1620

TGGGCTTCGT GGACCTGCAT TTCCTGGCTA CGATTCCTAA ACCTACTGGC nAAnGATGAT 1680

GCGAGTAATG TTTTCTTCAT AATCTTAAAA TCCTCCTACA AGTGAATTTG TGTCTCTAAA 1740

AGTTTTACAG TGGACGACTG T 1761



(2) INFORMATION FOR SEQ ID NO: 496: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 794 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 496:



TCATTTATGA AAAATGTCGG AnAGAnCCaa GaAAmaCAAT TGAgCGTGAA GAAAAAGCAA 60

GACTTAAAGA AGAACAAAAG GCACGTCAAA ATGAACAGCC ACAAATAAAA GATGTGAGTG 120

ATTTTACGGA AGTGCCTCAA GAAAGAGATA TTCCAATTTA TGGGCATACT GAAAATGAAA 180

GTAAAAGCCA GAGTCAACCA AGTCGAAAAA AACGAGTGTT TGATGCAGAG AATAGTTCGA 240

ATAACATCGT AAATCATCAT CAAGCAGATC AGCAAGAACA ATTAACAGAA CAAACTCATA 300

ACAGTGTTGA AAGTGAAAAC ACTATTGAAG AAGCTGGTGA AGTTACGAAT GTATCGTATG 360

TTGTTCCACC GTTAACTTTA CTTAATCAAC CTGCAAAACA AAAAGCAACA TCTAAAGCTG 420

AAGTGCAACG TAAAGGACAA GTACTAGAGA ATACATTAAA AGATTTTGGG GTAAATGCAA 480

AAGTGACACA AATTAAAATT GGTCCTGCAG TAACTCAATA TGAAATTCAA CCAGCTCAAG 540

GGGTTAAAGT GAGTAAAATT GTAAACTTGC ATAATGATAT TGCATTAGCT TTAGCAGCAA 600

AAGATGTTAG AATCGAAGCG CCAATACCTG GTCGTTCTGC AGTAGGTATT GAAGTGCCAA 660

ATAAACTAGA AGTTGGATTA GGaAGAGATA TATCAGGTGA TCCAATTACT GTTCCACTAA 780

ATGAAATGCC ACAC 794

(2) INFORMATION FOR SEQ ID NO: 497: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 497:

AGCCAGTTTT GcATTTCGTC AAATCGCAAT GAATATATTG ATTGCTTGTC AAAATTTGGA 60

AGAAATTTAT TCTTCTTTTT CTTTTCAGCA GTTATCATTC CTAATTGTCC •iTixriTm-A 120

TCTTAGTGAT AAAGAATCCA TCTGAATTAA AGTCTTGCGG CATGATTTGT AACGTTTTGA 180

CCAACTCTCC AGTTATCGGA TGTTGAAACG GTTCAAATTC GAAGTTTTTA TTATTTTTCA 240

AAAACGTATA AATCACGTTT TCATTTTCTA GTTGCTCAAT TGTACATGTT GAATAGATGA 300

TTTCTCCACC TATTTTTACA TTGTTTTTTA CATTTTCCAA TATTTCAAGC TGTAATTCAA 360

CTAGTGACTC AATATGTTGT TTGCTTTGAG TATACTTAAT CTCCGGCTTA TGTCTCATTA 420

CACCTAATCC GCTACATGGT GCATCAACAA GTATCTTATC GTATG1TTTA TCATAAGGTT 480

TTGTCGCATC ATGTTGAAAA GCTTTAATAT TTGTTAATCG TAATTnTTTT ATATTAAAAT 540

TAATTAAGTC TATTTTGTGA TCATGTATAT CTGAAGCGTC AACTTGCCCT TCTGGCATTA 600

AAACTTCAGC AATGTGACAA GCTTTACCGC CAGGTGCACT ACATGCATCT AATACGTGAT 660

CATGTCGGTC TACATTCATA ATGTGTGCAa CAAACATTGA GCTTTTATCT TGAATTGAAA 720

CGAATCCATC TTTAAATGAA CGAGAATGAA TAATTGGTTG TCCTCCTATA TGGAGACAAT 780

AAGGTAAGTC ATGATCTTTT TCAACGTCAT AACCTTCGTC TTGCAACTTT TCAATAATAT 840

CATCTAATGA TGCTCGCGTC AGGTTGGCAC GCACAGTTGT TGATGTCGTT TCTAAAAATG 900

ACTGTAAAAT TTTTTCAGTT TCTTCGAGAC CATAATGTGT TGCCCAATGA TCTATAATCC 960

ACTTCGGCAT ACTATACTCG ATTGCCATTC xrrrrrrAGG ATCTGCAATT TCATTAAAAT 1020

CAGGTAAGTC ACTACGCATC ATTGTACGTA AAATACCATT TACGACATTA CCATTATGAT 1080

AGCCACCGCG TTCTTTTGCT ATTTCAACTG CTTCATTAAT AATGGCATGA TTTGGAACTT 1140

TATCTAnATA nACATATTGA T 1161

(2) INFORMATION FOR SEQ ID NO: 498:

(A) LENGTH: 1504 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 498:

AGCTCACGTC ATCTTCGGCG GCGCTAAATT AAAATAATCA ATTTCTGAGT TAAACTTTTA 60

TTTACAACAT ACTATTACTA TACATTACAA ATTTTTAAAA TATACATTAC ACTCATTACT 120

CAATGGmAAG CGTATGATTT CmCAGCCCCC CTAGCTTGTA GAAATCATAC TTTCCTTTTT 180

TCAATATATA TACAACTATT AAATCCCATA AGATTGCAGA GCACATAAGT AAATTTTTTT 240

AGAGCTTGAG GTTTGTTTAG CTTAAGCAAC CCATGAGCTC AAACACTTCC TGTTCACATA 300

ACACTACAAA TCGCATTATG TTGCTTAATC TTATGTTTAT ATAAATTACA CACAATAAAT 360

AGAAAGAATG TGAACATCAT GAATAAATTA TTGCTACTCG TTACATTTAT CATTCGTGTG 420

GGTTCAGGTA TTGTTATGTT AATGCAAGGC TACGAAAAAT TAACGGGCGG ATTTACGCTG 480

AAAGGTTTAG TACCAGTCAT CGCTAACAAT ACTGATTCAC CAGAGTGGTA TAAGTGGTTT 540

TTCGCAAATA TAGTTGCACA TACGACGTCA TTATTTGATA TTGTTGTCCC ACTCGGAGAG 600

ATTGCAATTG GATTAGGTTT AATTTTTGGA GTTTTTGCAT ATGCTGCTAG TTTCTTTGGA 660

GCCTTTGTTA TGATAAATTA TATCTTAGCA GATATGATAT TTACGTATCC TCTTCAATTA 720

ACTTTCTTTA TCCTTTTACT AATGAGTCAC TCATTGTTAA AACAGATTTC ACTTAAAGAA 780

ATCATTAATT ACTTTAGAGG TCGTAAGAAC AGAGGTGAAA AAATAGATGA CCCACTTACT 840

GATCGTGGAT GATGAACAAG ACATTGTAGA CATTTGTCAA ACCTATTTTG AATATGAAGG 900

TTACAAAGTA ACAACGACAA CTAGCGGTAA AGAAGCAATT TCTTTACTAT CAAATGATAT 960

TGATATCATG GTACTTGATA TCATGATGCC AGAAGTTAAT GGTTACGACA TTGTCAAAGA 1020

AATGAAAAGG CAAAAATTAG ATATCCCCTT TATCTATTTA ACTGCCAAAA CACAAGAACA 1080

TGATACCATT TACGCCTTAA CTTTAGGTGC AGATGACTAT GTCAAAAAAC CATTTAGTCC 1140

AAGGGAACTC GTTTTACGTA TTAATAATTT ACTTACAAGA ATGAAGAAAT ACCATCATCA 1200

ACCAGTTGAA CAACTGTCGT TTGATGAATT AACACTTATT AACTTAAGTA AAGTtGTGaC 1260

tGTAAaTGGT CACGAaGTCC CTATGCGTAT TAAGGAATTT GAGTTATTGT GGTATTTAGC 1320

TTCTAGAGAA AATGAAGTTA TTTCTAAATC AGAATTACTT GAAAAAGTTT GGGGATATGA 1380

CTATTACGAA GATGCTAATA CCGTGAATGT CCATATACAC CGTATTAGAG AnAAATTAGA 1440

AAAAGAGAGC TTTACAACAT ATACCATCAC AACTGTATGG GGATTAGGAT ATAAATnTGA 1500

(2) INFORMATION FOR SEQ ID NO: 499:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1623 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 499:



ATTGAAAGCG ATAATTCGTA nTAATTGAGT TTGTTGAAAA ATTTAGGGTA ATGTAAAGAT 60

ATAAAAGATA CATAGAyTGG AGAGATATAA AGATGTTGAA TGAGATACAA ATATTAaATA 120

aTGGATACCC GATGCCTTCA GTTGGGTTAG GTGTTTATAA AATCTCTGAC GAAGATATGA 180

CTAAAGTTGT AAATGCTGCA ATTGACGCAG GCTATAGAGC GTTTGATACA GCATACTTTT 240

ATGATAATGA GGCTTCACTA GGACGAGCAT TAAAGGATAA TGGCGTCGAT AGAGAAGATT 300

TGTTTATAAC AACGAAGTTA TGGAATGACT ATCAAGGTTA TGAGAAAACA TTCGAATATT 360

TCAACAAATC GATTGAAAAT TTACAAACTG ATTATCTTGA TTTATTTCTA ATACATTGGC 420

CTTGTGAAGC AGATGGTCTA TTTTTAGAAA CATATAAAGC TATGGAAGAA CTTTACGAGC 480

AAGGTAAGGT AAAAGCAATA GGTGTATGTA ATTTTAATGT TCATCATCTA GAAAAATTAA 540

TGGCTCAATC AAGTATCAAA CCAATGGTGA ATCAAATTGA GGTACATCCA TATTTTAACC 600

AACAAGAATT ACAAGAATTT TGTGATCGTC ACGATATTAA AGTGACTGCA TGGATGCCTT 660

TGATGAGAAA TAGAGGACTA CTAGACGACC CTGTCATTGT TAAAATTGCT GAAAAATATC 720

ATAAAACACC AGCACAAGTT GTATTACGTT GGCATTTAGC ACACAATAGA ATTATTATTC 780

CAAAATCTCA GACACCTAAA CGCATTCAAG AAAATATAGA TATTTTAGAT TTTAATTTAG 840

AATTAACAGA AGTAGCTGAA ATTGATGCTT TAAATAGAAA TGCAAGACAA GGTAAAAATC 900

CAGATGATGT GAAAATTGGG GATTTAAAAT AACTGGATGT TAAATTTTAC GTTTATGAAT 960

GCCTTTTAAT GTGTACATTA AAATAAATGA GTTGGTTTTT ACTATTTGAT AAAACAATAC 1020

TCAGGTACAT TCAAAATCTT TTAAATAAAA AGGATGGACA TAGATGAAAA TTAGAGTCGT 1080

CATTCCTTGT TTTAATGAAG GGGAAGTCAT TACACAAACA CATCAACAAT TAACTGAAAT 1140

ACTTTCACAA GATAGTAGTG TGAAAGGCTA TGATTATAAT ATGCTTTTCA TAGATGATGG 1200

TAGTACGGAT ACCACTATAG ATGAAATGCA ACATCTTGCC ACAATAGATA GGCATGTCAG 1260

CTTTATTTCT TTTAGTAGAA ATTTTGGAAA AGAAGCAGCT ATGATTGCAG GTTACCAGCA 1320

TAGTACTGAA TTTGATGCAG TCATCATGAT AGATTGTGAT TTGCAACATC CACCTGAATA 1380

TAGAAGTGGT GAAAATTTTA GTCGCAAAAC ATTAAGCCAT TTGTATTATA AGTTAGTTAA 1500

TTGCTTTGTA GAAGAAGTAC AATTTGATGA TGGTGTTGGT GATTTTAGAC TTTTAAGCCA 1560 

AAGAGCTGTT AAATCCATTG CATCACTTGA AGAATATAAT CGnTTTTCAA AAnGGnTATT 1620 

TGA 1623 
(2) INFORMATION FOR SEQ ID NO: 500: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 500: 

AAAGTnGGTG AAnCTATATA CTTAATCTAT ATTTATATAT TAACCATTAG GGTTAAAAAT 60

TACTCTAGCA TTTATGAATA GATGGGAGTT TATTTTATTA TTATATAGGA GAGATGTTGA 120

ATGACACATC GCGCACTATT AGTTGTTGAC TATTCATATG ACTTTATCGC AGACGACGGC 180

TTACTAACAT GCGGTAAACC TGGACAAAAT ATTGAAGATT TTATTGTTTC TCGTATCAAT 240

GACTTTAATT ATTATCAAGA CCATATATTC TTTTTrrTGG ATTTACATTA TTTACATGAC 300

ATTCATCATC CTGAAAGTAA ATTATTCCCA CCACACAATA TCGTAGATAC AAGTGGTAGA 360

GAATTATACG GTAAAGTAGG TAAATTATAC GAAACAATTA AAGCGCAACC TAATGTACAT 420

TTCATTGATA AAACGCGCTA TGATTCGTTC TTTGGTACCC CGCTTGATAG TTTATTGAGa 480

GAAAGAAGTA TTAATCAAGT CGAAATCGTT GGTGTATGTA CCGATATTTG CGTGTTACAT 540

ACAGCAATTT CTGCATACAA CTTAGGtTAT AAAATTTCAG TACCTGCTGA GGGAGTGGCT 600

CATTT 605

(2) INFORMATION FOR SEQ ID NO: 501:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1739 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 501: 
TAGGTTnAAA GCATAGnTTT nTCAAAAAGA CAAATCATTC ATATATTGGA GGATATTTTG 60

GTGTAAGATA TAGTGCAACC ACAATTGCTA AAGACTTGAA GGAACTAAAT ATATATCGTG 120

TGAGAGAAAA GTTTAGACAC TATTGTGAAC ATGAAGTTCT AAGTTCAATC ATCAATGGTT 240

CATACATTAT CGTCAAAACC TCACCTGGTT TCGCCCAAGG CATAAACTAT TTTATCGATC 300

AGCTAAATAT AGAAGAGATA TTAGGTACGG TGAGTGGAAA TGAGACTACA TTAATCTTAA 360

CTGCCTCAAA TGATATGGCA GAATACGTAT ATGCAAAATT ATTTAAATAG ACATGTATCA 420

AATGAATAAT AAAAATTTGT TTCGTATCAC GTGTACTCAA GTTAGTTACC AAATATTAAC 480

TTGTGTACGC GTTTTTTTAT GGAAAGAAAG AATTCATAGT CATTCAATTG ACTGTATAAA 540

AAACTTTATA CAACATGTTT TTATGGGTAT TTTTGAATAA AAAATGTATA TTTTGACCCA 600

AAATACCTTT ATTTATGTAT AAAAATCCAT TATTATGTAT TGTATAACAA AAAGATATGA 660

AATTTTCGAC TTTCTTTATG TGAATATAAT CACATGTAAG CGTTTGAAGA TTGTCTATAC 720

TCTAAATGAA TTCAAAGATA AAAGGAGGAA ATAGACATGA CAGATGGTCC AATTAAAGTA 780

AATAGCGAAA TTGGAGCTTT AAAAACTGTG TTACTTAAGC GTCCTGGaAA AGAATTAGAA 840

AATTTAGTAC CTGATTATTT AGATGGATTA CTATTTGATG ATATTCCATA TTTAGAAGTA 900

GCTCAAAAAG AGCATGACCA TTTTGCGCAG GTGCTAAGAG AAGAGGGTGT TGAAGTACTT 960

TACCTTGAGA AGTTAGCAGC TGAAAGTATT GAAAATCCTC AAGTAAGAAG TGAATTTATT 1020

GATGATGTAT TAGCAGAGTC TAAAAAAACA ATATTAGGTC ATGAAGAAGA AATTAAGGCA 1080

TTATTTGCGA CACTTTCTAA TCAAGAACTT GTAGATAAAA TAATGTCAGG GGTACGTAAG 1140

GAAGAAATTA ATCCGAAATG TACACATCTA GTAGAGTATA TGGATGATAA GTATCCATTC 1200

TATTTAGATC CAATGCCAAA CCTTTATTTT ACTAGAGATC CACAAGCCTC AATAGGACAC 1260

GGTATAACAA TCAATCGGAT GTTCTGGAGA GCACGACGAC GAGAATCAAT ATTTATTCAA 1320

TATATTGTAA AGCATCATCC TAGATTTAAA GATGCGAATA TTCCAATCTG GTTAGATCGA 1380

GATTGCCCAT TCAATATTGA AGGCGGCGAT GAACTTGTTT TATCTAAAGA TGTCTTGGCT 1440

ATAGGCGTTT CAGAACGTAC ATCTGCACAA GCTATTGAAA AGTTAGCGCG ACGTATTTTT 1500

GAAAATCCGC AGGCGACGTT TAAAAAAGTA GTAGCAATTG AAATTCCAAC TAGTCGAACT 1560

TTTATGCACT TAGATACAGT ATTTACAATG ATAGATTATG ACAAATTTAC AATGCATTCA 1620

GCCATTTTAA AGGCAGAAGG CaATATGAAT ATATTTATTA TTGAATATGA TGACGTAAAT 1680

AAAGATATTG CCATCAAACA ATCTAGTCnT TTAAAAGATA CTTTAGAAGA CGTACTAGG 1739

(2) INFORMATION FOR SEQ ID NO: 502:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1745 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 502:

CTGTACATAC AGCAATATCG TTAACAAATG AAAACAGTAT TTTAGGATTG TAAACCATGA 60

TAAACCTAAA ATACTGTTAT TTTTATTACT TAAATTTCTT CTTCAATGCC TTTTCAACAT 120

AAGGTGGAAC GAATTCAGAA ATATCTGCTC GATAAGCTGC AACTTCTTTA ACAATACTTG 180

AACTTATAAA TGAATAATTA GTACTAGACA TCATATATAA CGTTTCAATT TCATTGTTCA 240

ACTTTTTATT CATTGAAGTT AAGCGTAATT CATATTCAAA ATCACTGACT GCTCTTAAAC 300

CACGTATGAT TGTTTTAGCT CCTACTTGTT CACAATAATC GACTAGTAAA CCACTAAATT 360

GATGAACCTT GACATTAGGT AAATGTTTAA CAGATTGTTC AATTAAATCC ATACGCTCTT 420

CTAAACTAAA CGTACCTTCT TTTTTACTAT TTTTAAGAAC ACAGACATGA ATTTCATCAA 480

ATCTATCTGT ACTTCTCTCA ATAATGTCTA AATGACCATA AGTAATGGGG TCAAAACTAC 540

CCGGAATGAC CGCTATTGTA TGTTCCATGC TATTCTCCCT TTTCTAATAA CAATGTGTCT 600

GTCAACCCAT AATGGTAACG TTTAATCATA TTAAACGGTT GATAATCtAT TTCTTCATGA 660

TTGCTAAATT CACAAACGAT GATACCATTT TCTTTCAATA AATTAAACTC TGAAATTAGT 720

TTTAAAGCTT TATCAATGAG ACCTTTATTA TAAGGTGGAT CTAaGAAAAT GACATCAAAT 780

TGAATATCAC GTTTTGACAA TGCTTTTAAA GCTCTATCTG CATTATTTTT ATAAACTTCA 840

GATTGTGCCT CTAAATCCAA ATTCGCAAGA TTTGaTTTAA TAACTTTTAC AGCTTTAAAA 900

TTTTGaTCAA CAAAGATTAC CTTATCCATA CCTCGAGAGA GTGcTTCTAT TCCAAGCGCC 960

CCGCTTCCTG CAAATAAATC TAAACCTATA CCTGACACAT CATATAAACT ATTAAAGATA 1020

CCTTCTTTAA CTTTATCCAT AGTTGGTCTC GTATTACGGC CTTCCATACT TTCTAAAGCT 1080

TTACTTTTAT GTTTACCTGC AATGACGCGC ATGTTGTTCA CACTTCCAAT TCATTTAGTT 1140

ATTTAATATA ATTTATTGAG AAAAAGGAGA ATGATAAACC AATGAAACAA ACATTTATTA 1200

CACTTGGTGA AGGTCTAACA GATTTGTTCG AATTCATGAC GATGATTGAA TATAACCATC 1260

AACGTATTGA TAAAATTATC TATTTTCATT CACCACAAGC TGAAAATAAA AAGTCATCTG 1320

TAGCAATCAT TATGAACCCT ACAACTGGCA ATCATTTCCA AGCATTTTAT ATCATGATAA 1380

ACGCTATTAA ATATCCATAT CCAGATTCAA ATAAAAAGTT TCAAATGATA AATGATTGTG 1440

CTGAAAAATT CGACATACCA ATTTTAGGTA TCGATGTACA GCCCCCTCAA GCATTTCATG 1500

ATTTATCGTT ATATTATAAT TATTTAATTA GTGTGTTAAG GCTCCAAAAA TGGATACCAG 1560

aACTTCAATA ATATTAATTA TATATTTCGT GTTTCTCTTT TTCGTAAGTT TTCTTTAAGT 1620

TTTGATCAAC ATCTTGCTCA TTCACATACA TAATTACAAA TTTACGATCT CTATTTGAAT 1740

GAACG 1745

(2) INFORMATION FOR SEQ ID NO: 503: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1035 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 503:



TCGTCTTTAA TCTTGCTGAC TTTAGAAGGC TTACGAGTGC GGTTACCATT TTTTGCATCT 60

TTAACTGATT GAACTAAAGC TtGACGTGTA GATTTATCAG CTAAACTAAT TGCACCACCA 120

ATTACGGCAC CAATTAAAAT ACCAGGAACA AATTTATTTT CCATAAAAAA CTACCCCTCT 180

TTCAAATTTG CATCTTTTAC GATGTAGTCT ATTAAATTAT CACAAGATGA TAATACCATG 240

TCGTATACAC CTTCAAAATT ATTCGTGTAG TATGGATCTG GTACATCACT CTCTTCCATA 300

TTACTAAATT CTAACAGTTT GAACAATTGT CCCTTAAGAT TAGGATTGAT AGATTTAATA 360

TTATCAACGT TACTTTGATC CATAGCCACA ATGTAATCAA AATCATCTGT CGCTTCGAAT 420

AATTCACTAA TCATGCCATC AAATGGAATA TTGTGTTTGT TGAGAATTTT TTGTGTACCT 480

TCATGAGGTG GCTCTCCTAA ATTCCAGCTA CCAGTACCTC TTGAATGTAC TTTAATATCA 540

TGAATATTTC TGTCTTTAAG TCTTTGTCGC ATGATTGCTT CTGCCATTGG AGAACGACAT 600

ATATTGCCAA GACAGACAAA TGCTACATCT ACCATTTTGA TTCCTCCAAA CTATGTAGTT 660

ATATCCCCAT TTTATAGCGA CTTTAAACAA TAAGAAAGCA GATTATATAA AATTCTATTA 720

AAGTTTATTA AATTGTGATA CTTTGATAAC ATAACTATTA TTAGAGGTGA ACATTGTGGC 780

TATGACAAAT GAAGAGAAAG TnTTAGCTAT TAGAGAGAAG TTAAATATtG TTAATCAAGG 840

ATTATTAGAT CCTGAAAAAT ATAAAAATGC AAATGaAGAA GAATTAACAG ATATATATGA 900

TTTTGkTCaA yCAAGAGaAA GATTGTCGCC AAGTGaAGTG mCAGCTATTG CTGaCGCTTT 960

AGGACAATTG CGACACGAAT AGGAGTGGGA ATTTTGACTA ATTACAAAGA AAAGTTACAA 1020

CAATACGCTG AACTA 1035











(2) INFORMATION FOR SEQ ID NO: 504: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 504:



AnCCTGACAA GATTCTTATA TTCATAGATA TGAGAGCTAA AAATGAAATC AACAAACAnA 60

CAATAAAGTA AACGATAATA GCCCATATAC CATTTTGTAA CCATATTACA AATTGTGTTG 120

TATTATAGCC ATTTCCAGCT AATAATTGCT GGATAAATGC ATTATTGTTT AATGTATTTT 180

CAAGATTAGC AAtCGATGTG TTATTACTGA ATGAAACAAG TGCTATAAAC ATCGTAATGA 240

CAGTAAGCAC TAATAACATC ACCCAACATA ACCAACCTAG AACTTTTTCA GTTAATCTAC 300

TTACTGGACG TTTAATTTGA GTAAATTGTT CTCCAGTCAT TCGTTACAAC TCCTTATAGT 360

ACTTATCCCG TTATTATAAC TAAATATACA GTAAATAACT ACTATTTATG ATTTTATTTT 420

AATGACATTT TGAAATTCAA AAAGTTTTCA TTGTATTCAC TTAAAACTTC AGGTCCTAAA 480

TCTTTATAAA CTTCAAGGCG TTCTTGCTCT TTCTTAGTCG GATAAAAACG ATGGTCGTCT 540

TTAATCTCTT TAGGCAACAA TTGTCGAGCA GCCTTGTTTG GCGTTGCATA GCCTACGAAT 600

TCTGTATTTT GCTTGTTATT TTTAGCATCT AATAAAAAAT TCATAAATTT ATATGCACCC 660

TCTTTATTTT GTGCCGTTTT TGGAATTACC ATATTGTCGA ACCATAAATT CGATCCTTCT 720

TTAGGAATAA CATAATTATA TTTATCCCCT TCTTGCACTA GAGGTGCTGC AACACCACTC 780

CAAACAACCG CTATGTTACC TTCATTTTGT TGAAGCATCA TGGTAATTTC ATCACCTACG 840

ACACCTCTTA CTTGTGGTGC TAGTTTGGTT AAATCTCGCT CTGCTTCTTT TAAATGGTGC 900

GAATTACGGT CATTAAGATT ATACCCAAGT TTATTCAAAC TCATGCCTAT AATCTCTCTA 960

GCACCGTCAA CTAGTAAAAT TTGGTTTTTA AATTTAGGAT TATACAATGA CTTCCAACTA 1020

TCAAATGATT CATTTGGATA CTTTTCTTTA TTATATAAAA TACCTACAGT TCCAAAGAAA 1080

TAAGGTAAAG AATATTTATT GCCTCTATCA AATGACATAT TCATATAATC TGAATCTAAA 1140

TTTTTAATAT TAGGTACCTT ATTATGATCT ATTGGTAACA ATAAATGATC TCTTTTCAAT 1200

TTTTGAACTG nATATTCACT AGGAnAAGCA ACATCATAAT GTGTACCGCC AGTGCGAATT 1260

TTGGnGTCCA nCGCTTCATT TGAA 1284









(2) INFORMATION FOR SEQ ID NO: 505: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5763 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS :- double 

(D) TOPOLOGY: linear 
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ATAATTAACA TCTTTTGCTA TATACCACCA GTTTGATACA TAAAATATCG CAGCAATAAT 60 

ATCATGTTTA ACCCTAATGA TATTATCTGA TTTTAATAAT AAGGTTGCTG TCCCTACAAC 120 

5 ' 

CATTAATAAA ACTATGACTG CTGGTAATAA ACGTTTTAAA CGACGTATCC AAAAGCTTTT 180 

CAATTTAATG ATACCTGTGT CATCATACTC TTTGAGTAAT AAGCTTGTAA TTAAATAACC 240 

AGAGATCACA AAAAATGTAT CCACACCTAA AAAGCCACCT GtCAACCATT GCTTATTTAA 300 

10 - 

GTGGTAAATA ATAATTCCTA GAACAGCGaT TGCCCTCAAA CCATCGAGCC CTGGAATATA 360 

TCTCATTTTC TTATACTTTG TAAAACCCTT TGTTTTGTTC ATTTTTGCAT TCTTCCCTTT 420 

75 TAAAACTGTT CTTCTTAGAT GCTTAATTAA ATTTAGTTAT GCTGTTTAAA AGAATATTGA 4 80 

AATGCATATG TATATTATTG AATTACGACA TCATCAAAAT CATATTGACT AAAATACTGT 540 

TAAATTAAAA AAATTACCAA TGATGATTCT TACTTCGAAA TCCAATTTGT AATGCAACTC 600 

20 GGCAACTTAA AAACTATGAA GTATTATGTA TTGTAATATA ACTGTAATAT AAATTCAATT 660 

TATTATAAAA ATTTTCAAGA AAATATTCAA CTAGAAAATG AATTGTGCAC TCTTGGAAGT 720 

GCAAGTCACT GTCTTAATTC ATATTTTTTG AAACAAGTTA GATATAAATT TTCAAAATAA 780 

25 

AATCAGAAAC TAGAACATAA ATAAGGCTCC CTTCAAAATT TTCATTTTTC AATGTCTACT 840 

TTGAAGGGAG CTTATTCACA ATGAATTATA CTCTACAATG TTATATTGAC TGCGGGCCCA 900 

AACACAGAGA ATTTCGAAAA GAAATTCTAC AGGCAATGCA AGTTTATGTT AGCTCACACC 960 

30 

AAGTGCAATC TTAGCGTAAC GTGACATCAT ATCTTTTGTC CAAGGTGGAC TCCATACGAT 1020 

ATTCACTTCA GTATCCTGAA TTTCAGGAAT CTCTGCTAAT ACTGTTTTAA CTTGaTCAAT 1080 

35 AATTTGAGGT CCCATTGGAC ATCCCATTGA TGTTAAAGTC ATATCAACTG TACATACGCC 1140 

TTCATCATCA ACATTCACTT TGTATACTAA ACCCAAATTA ACGATATCAA TTCCTAATTC 1200 

AGGGTCAATT ACCATTTCTA ATGCACCTAA GATACTATCT TTCAATGCCT CTTCCATCCA 1260 

40 TATCACCTCT TTAATGTCAT ATTATTCATA ATATATCAAA TATCCGACAA AACGCCAATA 1320 

AAATGCTATG ATGTATCTAT ATGAACTAAG CAACTTATGA GGAGAGAGAT ATGCAACCAC 1380 

ATTTAATATG TCTAGACTTA GACGGAACAT TATTAAACGA TAACAAAGAA ATTTCATCAT 1440 

45 ATACTAAACA AGTATTAAAT GAATTACAAC AACGTGGaCA CCAAATTATG ATTGCGACTG 1500 

GCAGACCTTA TCGTGCAAGT CAAATGTATT ATCATGAATT AAATTTAACG ACACCAATTG 1560 

TTAATTTTAA TGGCGCTTAC GTACATCACC CTAAAGATAA AAACTTCAAA ACTTGCCATG 1620 

SO 

AAATTTTAGA TTTAGGCATC GCACAAAACA TTATTCAAGG ATTACAACAA TATCAAGTAT 1680 

CGAATATTAT AGCAGAAGTG AAAGATTATG TTTTCATTAA CAATCATGAT CCAAGATTAT 1740 

55 ' 
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AAGAATCCCC TACCTCAATT TTAATTGAAG CCGAAGAAAG TAAAATACCT GAAATCAAAA 1860 

ATATGCTTAC TCATTTTTAT GCCGATCATA TTGAGCATCG " ACGCTGGGGC GCACCATTCC 1920 

CTGTCATTGA AATTGTAAAA CTTGGTATTA ATAAAGCAAG AGGCATTGAG CAAGTTAGAC 1980 

AATTTTTAAA TATTGACCGA AATAATATTA TTGCATTCGG TGATGAAGAT AATGATATTG 2040 

AAATGATTGA GTACGCGCGT CACGGTGTTG CTATGGAAAA TGGTTTGCAA GAACTTAAAG 2100 

ATGTAGCGAA CAATATTACA TTCAACAATA ATGAAGATGG CATTGGTCGA TATTTGAATG 2160 

ATTTCTTTAA TTTAAATATT AGATATTACT GTTAATTTAT AACTAATCAT TTTATAATAT 2220 

75 TTTAAAACAA TAGGAGGTAA GTTACGATGC CCAAAATAGT CGTAGTCGGA GGAGTCGCTG 2280 

GCGGTGCAAC ATGTGCCAGC CAAATTCGAC GTTTAGATAA AGAAAGTGAC ATTATTATTT 234 0 

TTGAAAAAGA TCGTGATATG AGCTTTGCTA ATTGTGCATT GCCTTATGTC ATTGGCGAaG 24 00 

20 TTGTTGAAGA TAGAAGATAT GCTTTAGCGT ATACACCTGA AAAATTTTAT GATAGAAAGC 2460 

AAATTACAGT AAAAACTTAT CATGAAGTTA TTGCAATCAA TGATGAAAGA CAAACTGTAT 2520 

CTGTATTAAA TAGAAAGACA AACGAACAAT TTGAAGAATC TTACGATAAA CTCATTTTAA 2580 

GCCCTGGTGC AAGTGCAAAT AGCCTTGGCT TTGAAAGTGA TATTACATTT ACACTTAGAA 264 0 

ATTTAGAAGA CACTGATGCT. ATCGATCAAT TCATCAAAGC AAATCAAGTT GATAAAGTAT 2700. 

TGGTTGTAGG TGCAGGTTAT, GTTTCATTAG AAGTTCTTGA AAATCTTTAT GAACGTGGTT 276 0 

TACACCCTAC TTTAATTCAT CGATCTGATA AGATAAATAA ATTAATGGAT GCCGACATGA 2 820 

ATCAACCTAT ACTTGATGAA TTAGATAAGC GGGAGATTCC ATACCGTTTA AATGAGGAAA 2 880 

TTAATGCTAT CAATGGAAAT GAAATTACAT TTAAATCAGG AAAAGTTGAA CATTACGATA 2940 

TGATTATTGA AGGTGTCGGT ACTCACCCCA ATTCAAAATT TATCGAAAGT TCAAATATCA 3000

AACTTGATCG AAAAGGTTTC ATACCGGTAA ACGATAAATT TGAAACAAAT GTTCCAAACA 3060

TTTATGCAAT AGGCGATATT GCAACATCAC ATTATCGACA TGTCGATCTA CCGGCTAGTG 3120

TTCCTTTAGC TTGGGGCGCT CACCGTGCAG CAAGTATTGT TGCCGAACAA ATTGCTGGAA 3180 

ATGACACTAT TGAATTCAAA GGCTTCTTAG GCAACAATAT TGTGAAGTTC TTTGATTATA 3240

CATTTGCGAG TGTCGGCGTT AAACCAAACG AACTAAAGCA ATTTGACTAT AAAATGGTAG 3300

AAGTCACTCA AGGTGCACAG GCGAATTATT ACCCAGGAAA TTCCCCTTTA CACTTAAGAG 3360

TATATTATGA CACTTCAAAC CGTCAGATTT TAAGAGCAGC TGCAGTAGGA AAAGAAGGTG 3420

CAGATAAACG TATTGATGTA CTATCGATGG CAATGATGAA CCAGCTAACT GTAGATGAGT 3480

TAACTGAGTT TGAAGTGGCT TATGCACCAC CATATAGCCA CCCTAAAGAT TTAATCAATA 3540 
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GTTAGAATTA TGTTGGACTG GTACTACTAT CCAGTCCATT TTTTATGTTT AACATTTTTA 3660 

GAATCAAAAA AGACATAAGG TCTTGGACTA ATAATTGTCC ATGCCTTATG TCATATACTA 3720 


TATGTCTTAT CAATTAGCCA ATACCGAATA ATTTTGATAT AGGsCCTAAC GGTAGAATGA 3780 

CACCTAATAC CATTGTGATG ATAATTAATG CAATTGTTAT CCAAAACATT GTGTGACTTT 3840

GTTCATGTCT CTTTCTTTTA GCAATCGACA CTTCCATCAA TCCAACTACT GCAACACCAC 3900 


ACAGCATTTT CAATGTAAGC AACATATGAT TTGCCCCGCC ATTCATAAAT GACTGAATTA 3960 

ATATCCAAAA TCCTGAAATT AACGTCAACA GCATAAATAA GCGTAAAATC ATGTGCAACG 4020 

GTTTGAAAAA TGGTGATCTG CCTTGATTTT TTGAAATGTT TAAGTATGTA GCGATAAATA 4080

AAATAATCGC TAATACCCAA CTTAATATAT GTAAATGTAA CATACTGATT CCCCCCACTT 4140

TAATTATTTA TATTATTAAA TTAAAGCTTC TTGGGATTAA TACCCACTTG CTTGTAATTT 4200

AATCATGATT TGATTATACA CGAATATATA TTCTACCACA CTTCTATATT TGAGAGGAAG 4260

AACATGACAT TTTATTCCTT ATTAGAATAT TGTGAATCTG CTGTAAAATA ATCAACTACT 4320 

TCTTAATATC AATATTTCAC TTTCATCTCA AAATGGTAAC ATTATAAATA ATTTATCTTT 4380 

AACACCTTTT TAGAAAAGCA AGAAAAAACT AACCAATCTA TATAAAGACT GGTTAGCTTT 4440

TTAAATGATA ATTATTTAGC GATATAAGTT GTCAGCGTTC CAATATTATC AATAGTCACT 4500

TTAACTTCAT CACCTGGTTG TAAAAATTTA GGTGGTTGCA TACCTGCACC AACGCCTGCT 4560

GGTGTACCAG TTGCAATAAT ATCTCCCGGA TGTAGTGCAA CATATTTTGA AATTTCTTCT 4620

ATTAATTCAT CAATTTTAAG AATCATTTCG CCAGTGTTAC CATCTTGTCT AATTTCATTG 4680

TTAACTTTTG TAACAATATT TACATTTTCA GGTAATGGTA GTTCGTCTTT AGTAACGATA 4740

TAAGGACCCA TTGGGCAACC GCCAGTTAAA CTTTTTGATA AAAATGCTTG ATCTTGTTCA 4800

CTTTGTGCTT TGCGATCAGT GATATCGTTA ATAATTGTAT AGCCGTAAAC ATAATCTAAA 4860 

GCTAATGCTT TTGGAATCTT TTCACCAGAC TTACCAATAA CAATACCTAA TTCACCTTCA 4920

TAATCTAATT GATCAGTAAT ATCTTTATGA TTTGGAATTG TTGCATTATC TCCTGTTAAA 4980

GATGACGCTG CTTTTGTAAA TACATATAAT TTTTCCACTT CATGATTTAA TTCGTTCGCA 5040

TGATCTTTGT AATTTCTACC AAAAGCAATC ACATTATTCG GAGGTGTTAC TGGTGGTAAA 5100

AATTCAATGT CATTAAATGA AATTTTATAG TCTTCAGCTT TGCCGCTATC TTCTGCTGCT 5160

ACAACTGCTT TACGTACTTG TTCTTGAAAA TCTAAAGTAT GATTTTGTTG TAAACCAGCT 5220

AACAATGTTT TAGGATGGAA ATCTCCTTCT GCAAAGTCAG CAAATACTTG TGTTAAATCC 5280

CATACAGCAT CTTCGCGTTT TACTTTAACG CCATATGAAG TTTTGTCATT ATACTTGAAT 5340
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TTCGTTATCA AATAACAAAT AAATAAGTAA GACAATTTTG AAAATGAGTT GTGTTCATTC 
TGCTACAAGG ACTTTGCACT TAATCGAAAT TATTTTTTAT TCTTTTGAAA ATCAAAATAC 
TATAGTTGCA ATGTACCAAA TTTGAAGAAG TATAAATAAC CTTTAACTTC TTTATTAAGA 
ATCGTTTGAA GCGTATTTTG ATAATATTTC ATCTGTATCT TATATTTATT TTTTAATTGT 
GTACCAATTT CTTCATCTGT CATCCCACGG CGACGATTAA ATGCATCGGT TTTATAGTCT 
ACAAAATAAT GCACACCATC TTTAACAAAG ATTAAGTCAA TCATACCTTG AATAATTGAG 
AGG 

(2) INFORMATION FOR SEQ ID NO: 506: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 422 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 506: 

CCATATGATT TTGTGCAATA ACTCTTTTTC TTTCTTCTTT TCGTAAAAAG TTGTACATCG 


CTTTTGTGTT TAAGAGACTA TTGTTTTCTT TAGGTTTTTG AACTTCACTC AGTGTATTTC 
TAGTTGTTAA CACTAAAATT CCAACTGTTG TATCTTTGTA TCTAGCCATA ACTTTATTCA 
GATGTTTGTC ATTTGTAATA ACTACGACAT AATTAAACAC TTCATAATAA TCATTAATTT 
GATTATCTAA TCTATCCAGC TTATCTAATT CTGTTTTAAT CTCATAGACA ACGCCTTTGC 

CGTTTAACAA TATAAAATCA GCaATACTTT TCCCTATGGG CATCTCAGAA AGTGCAGTAG 


TTGTATTAAT AGAATGTCGT CCTAGAAGGA GTkTATTAAG TATnGTGTTT TTGTAAAAAT 
AT 

(2) INFORMATION FOR SEQ ID NO: 507: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1188 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 507:
GCTTATGTAT TCTCAAAATA TTTATGTGAT ACGCAAAGGA GACATGGTTA TTCGACCAGC 
ATTTGATGAT GACGATCAAA GAAACGGTAG TGAAATAATT CGGTTTGACA AAACGCGTAT 
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TTATCTTGGC AAGAAAGCAG AGACAAACCG CATTACTGGC 
TTTACTAACA CCATTATTTC CAACTTATTT TTTCCCAACA 
AAATATTTGG TTAAATATGC ATTATATCGA AAGTATTAAA

TAAAGTGACA TTTATTAATA ATGAATCAAT CATTCTTCAT 
GCATCAATAT AACAATTCCA TTTTTTACTA TTACATGGTA 


ATCAAAAAAT CCCGACCAAC CAATAGATTA TAATAAAGCC 

ATTGACACGC TATTCTTTAT TTGAAGATAA ATAAATTGTT 
TGTTTTATAA ATATAGTGTA AATGTTCTGC ATATTTTTTT

ACTTAATCTT GTTATAATAA TATTTGTGCT TGaTATTCAA 

AGTAGCTCAG TGGTAGAGCT ATCGGCTGTT AACCGATCGG 
TGTGGAGCCA TTGGAAACGT ACTCAAGTTG GCTGAAGAGG

TCGCGAAAgG CGCGAGGGTT CGAATCCCTC CGTTTCCGTT 

CCATTTTAnC TTTTTTGTTT ACTTATATAT AATGAATGAG 
TCAATTTTAA AATTCTAAAA ACCTTTCCTA GATAATCTTC

TTGTTGAACT TATAATTAAT TTATTACATA GCAATATTTA 

TATCACTACA TGAAATACGA TAATTCGGaT CTCTTAACTT 


TTGgTTTCAT CGTATGATTC ATGTATAATA GCATTTkTTA 

(2) INFORMATION FOR SEQ ID NO: 508: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 840 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 508:
CCCAACTTCG GTTATAAGAT CCCTCAAAGA TGATGAGGTT AATAGGTTCG AGGTGGAAGC 60 
ATGGTGACAT GTGGAGCTGA CGAATACTAA TCGATCGAAG ACTTAATCAA AATAAATGTT 120
TTGCGAACAA AAtCACTTTT ACTTACTATC TAGTTTTGAA TGTATAAATT ACATTCATAT 180
GTCTGGTGAC TATAGCAAGG AGGTCACACC TGTTCCCATG CCGAACACAG AAGTTAAGCT 240
CCTTAGCGTC GATGGTAGTC GAACTTACGT TCCGCTAGAG TAGAACGTTG CCAGGCATAA 300
TATTAATCCA CAGTAGCTCA GTGGTAGAGC TATCGGCTGT TAACCGATCG GTCGTAGGTT 360



ATTTCTAGTA AACCACCTAT 24 0 

CATTCTGACA GACAAAATGA 300 

GAATTAAAAA ATCGTAAATG 360 

GTTTCATACC ACAGTTTATG 420 

GATAAACAAT CTCGCATGAT 480

ACATTGAATG TGTTTGAAGC 540

TATTTTTAAA ATATGCGGAA 600

AAGGTATCTA TTGCAAATTA 660

ACACATACAA ATTAATCCAC 720

TCGTAGGTTC GAGTCCTACC 780

CGCCCCTGCT AAGGtGTAGG 840

ACTTGCTAAA ATGGTATATA 900

AATTTCACTG TTCTTTTATA 960

TCTAAGAAAG GTTTTTATAC 1020

CCTGTTTTTA ACTATAAAAT 1080

CTCTGCaATT AATGtACTCA 1140

AATAATTC 1188
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GGCGGTAACA CGGGTTCGAG TCCCGTAGgA GTCATACAAG CAGAAGTGAA ATATCGCTTC 460 

TGTTTTTTTA TTACATATTT ATTGTTGAGG AAGGTTGTCC GAGCTGGCCG AAGaGCACGC 540

CTGGaAAGTG TGTAGGCGTC ACAAGCGTCT CAAGGGTTCG AATCCCTTAC CTTCCGTAAA 600 

GGCGCTTAAA TTGGTTTTAC CCATTTTAAG CGCTATTTTT ATTTTGGACT CAATCCCTTG 660 

ATATATCTGC ATTTGAGCTA TTATCCTCAT TTTTACACTT CTTATTTATT TATATCCATT 720 


TAAAATTTTT TAGCCACAAT GTGACTAATT TTTGaTGAAT AATCCTAATT TTAGtCACAA 780 

GATTTTGAAG TTTAGTCACA AAACAAATCA TTCAGATTTT TTTCyATAAA TTTAGTTTCA 840 
(2) INFORMATION FOR SEQ ID NO: 509:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1373 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 509: 



TTCTATGAAA CTATCAGCTG TTTCTTTTTC CTTGGTAAAA ATCGTTTTAA CGCCATGATA 60

CATGTATTCT CTTTCAaCAC CCGATTGGTC ATAAGCATCG AAAACACAAA TAATTTCATC 120

TGAAATAACT GCATTATAAT TTGCAATTGC ATCTATTAAT TGCATTCTAG CTTCTTCTAA 180

ATTCTCTTTT GCAATGGCGC TTAGCGTTGG TGATTGTCCT ATCATATTGT ATCCATCAAT 240

GATTAAGTAA CGTTCTTTCA TTATATTTCT CCAACATCAT GTCTTTTTCG AAATACTTCG 300

TACATCATTA AACTTGCTGC AACCGAAGCA TTCAAACTGT TTACATGTCC AACCATTGGA 360

ATCTTAATAT AAAAATCGCA TTTATCACTT ACTAGGCGAC TCATACCCTG TCCTTCGCTA 420

CCAATTACAA TAGCCAATGA CATGTCCGCT TCTAGATTTC TATAATCTGT TGCATTATTA 480

GCTTCAGTGC CAGCTACCCA AAAGCCATTA TCTTTTAGTT CATCGATAGT TTTAGCTAAA 540

TTTGTCACTC GAATAACTGG TACATGTTCA ATTGCACCTG TTGAGGCTTT TGCAACTGTT 600

TGCGTTAGTG TAACTGAACG ACGTTTAGGA ATAATAACAC CATCAACTCC CGTTGCATCG 660

GCTGTTCTTA AAATTGATCC CAAGTTATGT GGGTCTTCTA AGCCGTCTAA TATAAGTACT 720

GTCAATAAAC CTTCTTTTTC TTTTTGCTGT TTTAAAAATT GATCGAAGTC AGCATATTCA 780

TATGGTGCAA TAAGCGCTGC AACACCCTGA TGTGGTGCAT TTGCTAAAAA ATCTAATTTA 840

GATTTTGGTA CAGTTTGAAC AATGATTTTT TGATCTTTTG CATTTTTTAA AATTTCATTA 900

ATTTGTTGCT TTTTAATACC TTCTTGAATC AATATCTTAT TTAyCGGATG CCCAGTAATA 960
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TTCGTTTACT ATTGTTATTA TTTTATTTAA TAATGCCTCT AATCGTTCTT CTCTTTTTTC 1080 

TAAATAaAGA AAACCTATCA CTGCTTCTAT OcCTGAACTT TTACGATATG TTTGAACATC 1140


AGTGTTTTTA GCTTTAGTAT GACTTTTCGC GTTACGCCCT CGCTTCAAAA TATCCATTTC 1200 

TTCGTCTGTA AACCATTCTT GCTCCATTAA ATATTCTAAC GTTTGCGCCT GACTTTTGGC 1260 

AGATACATAT TTTTTAGACA TTTGATGTAG TTTATTAGGC TTACTTTTAA GCTTTAAAAC 1320 


GATATAGGTA CGTACATATT GATCTAaGAC TGCGTCnCCC ATATATGCTA AGG 1373 
(2) INFORMATION FOR SEQ ID NO: 510: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 717 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 510: 



TCAAGTGCCA ACTACACAAC TGATTTACAT TCTTTAGGTC AATATGTACA AGAAGGCCGT 60

CGCTTCTTAT TCGAAACAGT TGTAAAAGTA AATCATCCTA AATATGATAT TACTATTGAA 120

AAAGATAGTG ATGATCTAGA CGGATTAAAT TATTTAGCTG GTAAAACAAT CGACGAAgTT 180

AACACAAAAG CATTCGAAGG TACATTATTA GCGCATACTG ATGGTGGTGT TCCTAACATG 240

GTAGTGAACA TTCCACAATT AGATGAAGAA ACTTTCGGTT ACGTCGTATA CTTCTTCGAA 300

CTTGCTTGTG CAATGAGTGG ATACCAATTA GGCGTAAATC CATTTAACCA ACCTGGTGTA 360

GAAGCATATA AACAAAACAT GTTCGCATTA tTAGGTAAAC CTGGTTTTGA AGaCTTGAAA 420

AAAGrATTAG rAGAACGTTT ATAAAATACA TTACTTCAAA GATTAGTGAA GTTTGAAAAG 480

ATAGAACTAG ACGTTAACTA TTTAAAGCAT ATTTTCGAGG TTGTCATTAC AAATGTAAAA 540

ATGTAATGAC AACCTCGTTT TTATTTATAT GCAAGAACTA GGTTACTAGC TAaTGTGACA 600

AGATGTTAAG AGAAAATTAA aGATAAAATA ATATCTGCCT TACAATAATA TTGTTATACT 660

ACTAGnGACT GATTTATTAG CATGATTACA TGTTAATGTT TCTTTACTTA GTAATTA 717



(2) INFORMATION FOR SEQ ID NO: 511: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2700 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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AATCTAATTT TTCCTCCTAA TCTCGGATTG GATAGTGGTC AATTTTTCTG GCCCGCCATT 60 

TTAGCGTTTG TTCTAACTGG GATTGGTTTA CCATTATTAG GTGTGATTGT AGGTGCACTT 120 

GATAAAGAAG GATATATTGG CGGATTAAAT AAAATTTCAC CTAAATTTTC AATATTGTTC 180

TTAATCATCA TTTATTTGAC TATAGGACCA CTTTTTGCAA TACCTAGAAC TGCATCTACA 240

TCTTTTGAAA TGACAATTAC ACCAATTATA CATAGCAATA GTAGTATCGC TTTATTTATA 300

TTTACGATTA TCTACTTCAT AGTCGTTTTG TATATTTGTT TAAATCCATC TAAGTTAATC 360

GATCGTATTG GTTCATTATT AACACCATTA TTATTGATTA CTATTTTAGC GATGATTATT 420

AAAGGATACT TAGACTTTAG CGGTAATAGT GCTGGAAAGG GCAATGAAGC ACTATATCAT 480

TCTAATTTTT CAAGTTTTGC TGAAGGCTTT ACACAAGGCT ATTTAACAAT GGATGCCATT 540

GCAGCAATTG CTTTTTCAAT GATTGTTGTT AATGCAGTAA AACTAACAGG CATTACTAAA 600

ACAAATCAAA TATTCAAACA AACTTTGACT GCTGGTTTAA TTGCAGCCGT AGCTTTAATT 660

TTCATATATA TTTCATTAGG TTATATTGGT AATCATATGC CAGTAAGTGA CATGACGTTA 720

GATCAATTGA AATCCAAAGA TCGAAACATT GGGACATATT TATTAACGAC AATGGCTTCA 780

ACAGGATTTG GTTCATTCGG AAAATATTTA TTGGGCATCA TTGTGGCGCT GGCATGTCTA 840

ACTACAGCAT GCGGGCTTAT TGTTGCAGTT TCTGAATATT TCCATAGAAT CGTACCTAAA 900

GTATCATACA AAGCATTTGT ATTAGTTTTC ATTTTAATGA GTTTTATTAT TGCTAACCAA 960

GGTTTAAATG CTGTTATCTC AATGTCAATT CCGGTATTAA GCATTGTATA CCCAGTAGCA 1020

ATAACTGTTG TATTATTAAT TTTAATTGCC AAATTCATAC CGACAAAACG CATTTCACAA 1080

CAAATTCCAG TTATTATCGT ATTTATATTG TCGATTTTCA GTGTTATTAG TAAGTTAGGT 1140

TGGcTGAAAA TTAACTTTAT AGAATGATTG CCTCTAAGAG CGTATTCTTT AGAGTGGkTC 1200

CCAGTAGCAA TTATTGCAAC GATATTAGGC TATCTAGTCG GCATATTTGT AAAACAAGAT 1260

CCAATTAAAT ATCAACAGGA ATAACGAATA ATATAAAAGA GGTTGGGACA TAAATCCCTA 1320

AAAAAACAGC AGTAAGATAA TTTTCAATTA GAAAATATCT TACTGCTGTT CTCTATTTAT 1380

ACAATACTTC GTATTGAATG GCTTCGCTTT CCTAGGGTGC CGTCTCAGCC TCGGTCTTCG 1440

ACTGGCACTG CTCCCTCAGG AGTCTCGCCA TTAATACTAC GTATTAACGT GTAATTTTAC 1500

TTTGAAATAC TTTAAAAAAA TAAGACACTT TGCCCAACTT GCACATAAAT GTAAAATTCA 1560

ATAAAATAAA TTTTCTGTGT TGGATCCCTT CGTATAATTT AATAAATACT ACTAAACTAA 1620

ATTAACGAGG TGCCTTATGT ATAAAAATTA TAACATGACC CAACTTACAC TACCAATAGA 1680

AACTTCTGTT AGAATTCCTC AAAATGATAA TACGCGATAT GTTAATGAAA TTGTTGAAAC 1740
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5 



AAGATATGCA TACCGTAATG ATAGATATAG TTTTAAACGT GACTTCAAGC TATATGAATG 1860

TGATGACTGT TCATCATGTT CTTTGAGACA TCAATGCATG AAGCCAAATT CGAAATCCAA 1920

TAAGAAAATT ATGAAGAATT ATAATTGGGA ATACTTTAAA GyCCAAATTA ATCAAAAGCT 1980

TTCTGAACCA GAAACGAAAA AAATCTATAG TCAAAGAAAA ATTGATGTAG AGCCTGTTTT 2040

TGGATTTATG AAGGCTATTT TGGGTTTCAC TCGAATGTCA GTTCGAGGAA TAAATAAAGT 2100

TAAACGAGAG CTAGGTTTTG TATTAATGGC ACTTAATATA AGGAAAATAG CAGCTCAACG 2160

AGCTGTACAT TATAAAATAC ATATCAAAAA AGCTGATTTC TATCAAATAA TTAATAGAAA 2220

TCAGCTTTTT TACATTGCCT AAGAATTTAA TGTCCCAAGC CCTTTTTATC GAATAACTTA 2280

TTGTAAACCT TGTCTTTCTT GGTTATTGTT TTCGTTATTT TTTTCGTGTT TTTGTTTCCA 2340

CTCTTTTTGA GTCATTACAT CGTCAACTTG CATGTTAACT TCAACTACAT CTAAACCAGT 2400


nninlnl 1 xr\ 


R PIIYJT'T 1 /^ 1 MM 1 


& PTL. Il TTP 
inAUiAAJi XL. 




Uju AAu A ITT 


TTGGTGCTGA 


2460 


TTCACCATAT TCTAAGATTA CTTTTAAGTC TACAGCAGCT TGTTTTTCAC CAACTTCAAC 2520

AGATACACCT TGAGTAACAT TGTTGCCACT TGAGAATGCA TTAGTGAATG TATCAGTTAA 2580

GCCACCTTTC ATATCTAAGA TACCTTTAAC TTCACGTGCA GCGATACCAG CAATTTTTTC 2640

AACAACTTCA TCAGAGAATG TTAATTTGnT TTTGAATTGA GGCTCCTGAT TTnGTTCnTG 2700



(2) INFORMATION FOR SEQ ID NO: 512: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1135 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 512:



ATCAAATACA ATAATTAAAA TAaTGGCTAA TACaCCTAaA ATaATAATcG AGTTaGAGTA 60

GCCTAAGCGA yCACCcTAAC aGTArACATT CTAGGCATAT ATTTaTCTTT AGACATTGAA 120

GCCGCTAACA TCGGAAAAGC TGTAAATCCA GTGTTGGCTG CTAATACCAA AATCATCACG 180

GTCGTTGCCT GTACAAAATA GAAGGCAGCA TTATCACCAA ATATTTGCAT AGCTAATTGT 240

GATAAAACTG TCGTTTCCGT TTGTGGcAAA ATTCCATAAA CATATGCTAA ACCAACGATA 300

CCAACTAATA AAAACGCTAA AATTGAACCC ATAGCAATTA ACGTTTTCAC AGCATTTTTA 360

GCACTTGGCT CTCTAAAATT GGTTACCGCA TTTGAAATAG CTTCAACACC CGTTAACGAT 420

GAAGCCCCTG ATGAAAATGC TTTTAATAsC AAGAATAATG TTACTCCAGG AACCGCAGTT 480
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AATATTAATA TCACTAACCC TATAATGAAA AGATATACTG GATAGGATAA TACGGTGGCA 600 

GATTCAGTTA AACCACGTAA ATTTAATATT AAAATAAAAA GTACAAGTAw ACATGCAATC 660 

AGTrCkTTAT GCCCATATAA ACTTGGGAAT GCAGCAACAA ATGCATCAGC ACCAGATGAT 720

ATmCTAACAG CGACAGTCAG TATGTAATCG ACTAATAATG AGCCTCCTGC AAGCAATCCC 780

CATTTTTCTC CTAAATTGGT CTTGGACACC ATATACGCGC CGCCACCTTT AGGATATGCA 840

TAAATAATTT GCCTATAAGA CATAATTAAA GCAGCTAATA AGATCAGAAC AGCACCTGCA 900

ATCGGTAAAG TATACCAAGT TGCAACTGCA CCCACTACTG ATAGTGTAAT CAGTATTTGT 960

TCGGGACCGT AAGCCACTGA AGATAATGCA TCCGACGAAA GAATCGCTAA CCCTTTAAAC 1020

TTCGATATCT TTTCGTCTTT TAGTTCTCTG TTTTTTTTAG GTTGCCCTAT AATAAGtCTT 1080

TTAAATTGAT kGAcATAAtC TCCTATTCCT TTTTATAGTT TTnGATGGAA ATCAC 1135

(2) INFORMATION FOR SEQ ID NO: 513:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 918 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 513: 

TTATGGTAAA ACTTTAACAC AAGCATTACA TAAATCGCAA TCATTTATGT GGCAAAAACG 60

TTGAATTTAA GCCATTTCTA TTCATTTTAC GTAATTATTA GCCGTATATG TGTAATAATA 120

CACATTTTAT TCAGATTTTT TATCGCGCTC CATTAAATCT TTTACGCATT CTTTTACTGA 180

GATATTTTCA AATAATACTC TATATAATGC ATTTGTAATT GGCATATCCA CATTTTTTTC 240

TTTAGCTAAA TGATAAACTG ATTTAGTTGT ATAAATACCT TCAACAACCA TATTCATTTC 300

AGATAATGCT TGATCCATTG ATTCACCTTG TCCAAGTTTA TATCCTAATG TGAAATTCCG 360

AGAATGTGTT GATGTGCAAG TAACGATTAA GTCACCGATA CCACCTAAAC CTAGAAATGT 420

CATAGGATCG GCACCTAACT TTTCACCTAA TCTACTAATT TCCGCTAAGC CACGAGTCAT 480

TAATGCAGCT TTTGCATTAT CACCGTAGCC AATTCCAGCT ACGATACCAC TTGCTACTGC 540

GATGATATTT TTCAATGCAC CACCAAGTTC AACACCAATC AAGTCATCAT TCGTGTACAC 600

ACGCAAATAA TCATTCATAA ATAAATCTTG CGTTAATTTA CTTACACTTT TATCTTTTGA 660

TGAAGCAGCA ACTGTAGTTG GTTGCTTGAC TACAACTTCT TCCGCATGAC TTGGCCCTGA 720

CAACACGCCA ATACCTGCAT TATATTCAGG TGAAATAGAA TCTTCAATCA TTTCTGACAC 780
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CAGCTTATCA TTAATTTGAG AAGCAACTTC TCGCATTGCT TTAGTAGGTA AAGCCATTAA 900 
GTAAATATCT GCAAATTG 918 

(2) INFORMATION FOR SEQ ID NO: 514:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 587 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 514:

CAATACTCGG TATTATAGAT AGTCCTACTA CATTTATATG GGTTTGTATC ATTACATTAA 60 
TTGCACATCA ACTTGAAGGT AATATCATTA CACCAAATGT AATGGGTAAA TCTTTAAGTA 120 

TCCATCCTTT AACAATTATC GTTGTTATTT TAGCAGCAGG AGATTTAGGT GGCTTTACAT 180

TAGTTCTGAT TGCAGTGCCA TTATATGCTG TACTTAAAAC GGTTGTTAGT AATATTTTCA 240
AATACCGCCA ACGCATTATT GATAAAGCAA ACAGTAATGT TAAGGACTAA TTCTGTGGAT 300

GTCTTTTAAG AATATAAGAT ACTATCGCAT CAAAAGTTGA AACTACAGCT TTTGAGGCGA 360

TTTTTTTGTG CATAAAAAAT CAGTCmAATG AAATATCAAA TAATTTTCCA TCAGTCCGAT 420
TATTATAAAA GCAAAAAAGC TTTGCTCACA TATATAATAA CGTGAACAAA GCTGTTGAAT 480

GATATTATTT AATTGCGTGG AATCCGCTAT CTACATGAAT ATTTTCACCT GTAACGCCAC 540

TTGATAAATC ACTTAATAAG TAAGCCGCAG TTTTACCTAC TTCTACT 587 
(2) INFORMATION FOR SEQ ID NO: 515: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 812 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 515:
GGTTCATTCC AAAAAAGTAC GCGATAATTA GCGAAGAAGA ATTTGAAAAT TTAAATGTTG 60

TTAAACCAAA TAAAAATAAT GTTTTCTGGT CAGTTATAGG AAGTTCGGTT TTGTTTGGAG 120 
TTACTTTAAG GAAATACATA CATGTTTTTG ATGTTCAATT AGATAAGCTA GTTGTAATGA 180 


TATTGTGTGC TCTCGCTTTA ATTTGTGTTA TAGTTTTTTA TTTTAACTTA AATAGAAAGC 240 
TTAAGTTAAA AGTGTTTGAT ACAAATATTG AAAAAnATAA GAGAGTTATA TAAAwAcCAA 300 
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10 






TTTCATTAAT TGCCCTTATG ACAATCGAAC CTCAAAATAT AATAATATTT ATTTATTGGA 420 

TTATGATGAC AATGCTTTTC TTTTTGTTAA ATATGACTTC GATAGGTAAT GAAAAAGTTC 480

GCGTTATAAT GAAAAATAAT TGATTACATT TAAAATATTC TAAATGTTGT CGACACAATC 540 

CTTTTAAGAC GCTAGTAGAA TTTAAATGAC TTCTAATGTA TATGAAAGTG TATCAATATA 600 

AAACCAATTG AAAAGAAGTG GAGACATTGC TTTGTGAAAC TGAAAATATT AATAAGAATC 660 

CCAAATATAG AATTATCAAA TACAAAGATG AATATTTGAT GATTGATTTA GTAAGTACAT 720 

GGTTAGCACT CTTTTTCCCA ATGATTAATT GGCTGATTCC AAAAAAGTAC GTCAAAATCA 780 

GCGAAAAAGA TTTTGAAACT TTAAACATTG TG 812 
(2) INFORMATION FOR SEQ ID NO: 516: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 516:

GTTCTATTAT TTCTTCCAAT GGTAATGGAT TTTCGTAAAA TGATTGATAT AAGTTGATAA 60 

TCTCAATAAC TTTTAATTGA TCTGGAAACA TCGTTTTTTG GAACATTATA CTGCGATTTT 120 

CACTTTGTAA TAACTTGTCT TTATCAAATA TCTCACCAGA ATTAGCATTA ACATTACCAA 180 

TTAATATATC AATTAACGTT GACTTTCCAG CACCATTTTT TCCAATTAAA GCGATACATT 240

TACCTTGTTC AATATCGAAC GAAATATTTT TTAGAACACA TCTTTTATTA AATGACTTGT 300 

TGATATTAGA TATTTGAATC AATTTAATCA CCTCTATTTT TTTCTTAATT TAATATTAGT 360 

AAATTTATTA GATTTAAAAT AGAACACTTT GTCATAGATT TGAAATGACA AATGTCATTA 420 

TTAGATkTAC ATAATATATT TATCGTtATT TTAATTTTGG GCAAAATAAA AAGAGCCTCT 480

ATAATCGrGC TCCTTACAAA TAaATTATAA AAttGGCGAA CTAAAT 526 
(2) INFORMATION FOR SEQ ID NO: 517: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4544 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 517: 
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GGAACACCTA AAGAAAGAAA TGATGCATTA 
AAAGAAAATA CTAAATGGTT ATGCGATCAA 
GTAATTGATG AACTGTCTAC ATTTAAAAGT 
AAGAAATTAC CACTCATTAA TAGATTTATA 
TTACAGGATT TATGGGCTCA AGTTTATTTG 
TTCAGTCGTT ATCGAGAAAG GTACTTTAAA 
AACTGGGAGC TAAGAGACGG ATCTGAAGAA 
TTAAGCATGA AAGCGAAAGA TTATCTAGAT 
GTAGTCTTAT CTGAAAAAGA AAGAAAAGTA 
GAATCGGAAG AAGAAGGAAC AGTTGTAGCT 
CTTCAACTAT CTAACGGTGC AGTTTATACA 
AAGAAGTTAG ATAAGTTAGA GGAAATTATA 
TTTTATAACT TCaAACATGA TAAAGAAAGA 
TTaGAGGATT CAAACTATAA AGAACGTTGG 
CATCCAGCAA GTGCAGGGCA TGGATTAAAC 
TTTGGACTTA CATGGTCATT GGAATTATAC 
GGACAAAATC ATACGACTAT TATTCATCAC 
GTATATAAAG CTTTACAAAA TAAAGAACTA 
GCAAGAATAG CTAAGCATAA GTAATGGAGG 
AAGCCAGGTA CATTTAAATA TATTGAGTCA 
GAGATAAATA GATTGAGAAT GGAGATACTT 
GTGTATGGAC CGTTGCAAAA AGGTGAACCA 
TTATTGACTA ATAAGATGTT ACGAAACCTA 
TACTTAAAGT TACCTGAAGA TCATAAGAAA 
AAGAAGCTAA AGATAGAGCA AATAGGAGAT
ACTATACGAA AGAACTTTGT TAAAGCGGTA 
GCAAAGATTG TGCAAAAGGC CTACAAATCT 
TAAAGTTATC TAAAAGTTAT ACGACACAAG 
TTTGTTATGC AATCAAAGAG GTGTAAGAGA 



AACACAGAGG CTGATATCTA TGTAACCAAT 
TATAAAAAAG AATGGCCATT TGATATGGTT 
CCTAAGAGTC AAAGGTTTAA ATCTATTAAA 
GGATTAACAG GAACACCTAG CCCAAATAGT 
ATAGACAGAG GCGAAAGACT TGAGTCTTCA 
CCAACACATC AAGTTAGCGA ACATGTTTTT 
AAGATATATG AACGAATAGA AGATATATGT 
ATGCCTGACA GAGTTGATAC TAAACAAACA 
TATGCAGAAT TAGAAAAAAA CTATATTTTA 
CAGAATGGGG CATCATTAAG TCAAAAACTA 
GATGATGAAG ATGTAAGACT TATACATGAT 
GAGGAGTCTC AAGGCCAACC AATATTATTG 
ATACTTCaAA GGTTTAAGGa AGCAACCACA 
AATAGTGGAG ACATTAAGCT GCTTATAGCA 
TTACAACAAG GTGGGCACAT TATTGTTTGG 
CAACAAGCAA ATGCAAGATT ATATAGACAA 
ATTATGACCG ATAACACAAT AGATCAAAGA 
ACGCAAGAAG AATTGATGAA AGCTATTAAA 
TATAAGATGG GAAAGGCATC ATACGATATT 
GAGATATATA ACCTACAAGA GAACAAGAAA 
AACCCAACGA AAGAGCTAGA CACTAACATT 
GTTAGAACAA CTGAACTAAT GGCAACAAGG 
GAAGAAATGG TCGAAGCAGT TGAAAGTGAA 
GTAATTAGGC TAAAGTATTG GAATAGAGAT 
GCATGTCACA TGCATCGTAA TACAGTTACT 
cGwtATCATG CAGGTATCAA ATAACATTGT 
GTAGTAATAT GATAGTATCG GAAAGATGTA 
TACACGAGGC ACATCGCTAT GCGtGTGTCT 
TGACCAAGCA TAATAACATT TATAAGCATG 
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10 



AGATAGCATT AGATAGGGAT AATCATCTTT GTCAAATGTG TTTACGTGAA GACATAGTAA 1920

CAGATGCAAA CATAGTGCAT CATATTATTT ATGTTGATGA AGATTTTAAT AAAGCTTTAG 1980

ACTTAGATAA TTTGATGTCA GTTTGTTATA GCTGTCATAA CAAAATTCAT GCAAATGATA 2040

ATGACAAAAG TAATCTTAAG AAAATTAGAG TATTAAAAAT TTAAATAAAA AAATAATTTA 2100

TTTTTATAGC CCCCTACCCA TCGGCTTAAA ATGTTTTTTC GACGGGTACC GGCGGGGGCC 2160

CTTCGCTTGC AACGCGGATA AACTTTTATG AAAGGGGGTC TTTATATGAA ATTAACAAAA 2220

AAACAGCTGA AAGAATATAT AGAGGATTAT AAAAAATCTG ATGACATATT AATTAATTTG 2280

TATATAGAAA CGTATGAATT TTATTGTCGG TTAAGAGATG AACTTAAAAA TAGTGATTTG 2340

ATGATAGAGC ATACAAACAA GGCTGGTGCG AGCAATATTG TTAAGAATCC ATTAAGCATA 2400

GAACTGACAA AAACAGTTCA AACACTAAAT AACTTACTCA AGTCTATGGG TTTAACTGCA 2460

GCACAAAGAA AAaAGATAGT TCAAGAAGAA GGTGGATTCG GTGACTATTA AAGTTTTAAA 2520

TGAACCTTCA CCAAAACTAT TAACAACATG GTATGCAGAG CAAGTCACTC AAGGGAAAAT 2580

AAAAACAAGC AAATATGTTA AAAAAGAATG TGAGAGACAC CTTAGATATC TAGAAAATGG 2640

AGGTAAATGG GTATTTGATG AAGAATTAGC GCACCGTCCT ATTCGATTCA TAGAAAAGTT 2700

TTGTAAACCT TCCAAAGGAT CTAAACGTCA ACTTGTATTA CAACCATGGC AACATTTTAT 2760

TATTGGCAGT TTGTTTGGTT GGGTTCATAA AGAAACAAAA CTGCGCAGGT TTAmAGAAGC 2820

TTTGATATTT ATGGGGCGAA AAAATGGTAA AACAACTACT ATATCTGGTG TTGCTAACTA 2880

TGCTGTTTCT CAAGATGGAG AAAACGGCGC TGAAATCCAT CTTTTAGCAA ACGTAATGAA 2940

ACAAGCTAGG ATTCTATTTG ATGAATCTAA GGCGATGATT AAAGCTAGCC CAAAGCTTAG 3000

AGAAAATTTT AGACCTTTGA GAGATGAAAT TCATTACGAT GCAACTATAT CTAAAATTAT 3060

GCCACAGGCT TCAGACAGTG ATAAGTTGGA TGGTTTAAAT ACACATATGG GCATTTTTGA 3120

TGAAATTCAT GAATTTAAAG ATTATAAATT GATTTCAGTT ATAAAAAACT CAAGAGCGGC 3180

AAGGTTACAA CCCCTTCTTA TCTACATTAC GACAGCAGGG TACCAACTAG ATGGACCACT 3240

TGTTAATATG GTAGAAGCGG GAAGAGACAC CTTAGATCGA ATCATCGAAG ATGAAAGAAC 3300

TTTTTACTAT TTAGCTTCTC TCGATGATGA CGATGATATA AATGATTCGT CGAATTGGAT 3360

TAAAGCAAAT CCTAACCTAG GTGTTTCTAT CGATTTAGCT GAAATGAAAG AAGAGTGGGA 3420

AAAGGCTAAG AGAACACCAG ATGAACGTGG AGATTTTATA ACCAAAAGGT TTAACATCTT 3480

TGCTAATAAT GATGAGATGA GTTTTATTGA TTATCCAACA CTTCAAAAAA ATAATGACAT 3540

TATTTCCTTA GATGAGTTGG AAGGTAGACC ATGTACTATA GGTTATGATT TATCAGAAAC 3600
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5 



AACACATTCT TGGATTCCTA AGCATAAAGT TGAATATTCT AACGAAAAGA TACCCTATAT 3720

AGAATGGGAA GAAGACGGAT TACTAACAAT ACAAGATAAT CCTTATATAG ACTACCAAGA 3780

TGTTTTAAAT TGGATAATAA AGATGAATGA GCATTATGTT GTCGAAAAAA TCACTTATGA 3840

TAGGGCGAAT GCTTTTAAAT TAAATCAAGA GTTAAAGAAT TATGGCTTTG AAACAGAAGA 3900

AAcwAGACAA GGGGCTTTGA CCTTGAGCCC TGCaTTGAAG GATCTAAAAG AAATGTTTTT 3960

AGATGGGAAA ATAATATTTA ATAATAATCC TTTAATGAAA TGGTATATCA ATAATGTTCA 4020

GCTGAAACTA GACAGAAATG GGAACTGGCT GCCATCTAAA CAAAGCAGAT ATCGTAAAAT 4080

AGATGGTTTT GCAGCATTTT TAAACACATA TACAGATATT ATGAATAAAG TTGTTTCTGA 4140

CAAGGGTGAA GGAAACATAG mATTTATTAG TATTArAGAT ATAATGCGTT AAGGAGGTGA 4200

ATGTTATCGC AAAAGAGAAT ATTGTCACAC GCATAAAGAA AAAATTGATA GACAATTGGA 4260

TTGaTCAGTC AGCTTCTAAG CTTTATGACT TTAGCCCATG GAAAAATAAA TCTTTTTGGG 4320

GTGTAATCAA TAATACGCTT GAAACTAATG AAACGATATT TTCAGCTATT ACnAAGTTAT 4380

CTAATTCGAT GGCTAGTTTG CCCTTGAAAA TGTATGAAGA TTATAAAGTA GTTAATACAG 4440

AAGTATCTGA TTnACTTACA GTGTCACCGA ATAATTCTCT GAGCAGTTTT GATTTTATTA 4500

ATCAAATTGA AACAATCAGA AATGAAAAAG GTAATGCATA TGTG 4544



(2) INFORMATION FOR SEQ ID NO: 518:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 881 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 518:

CATGGCAGAA TATTGAAGCT GTAAAAAAAG GACATGTAAT TTCATATAAA GCAGAAGATT 60

ATTGGTTCAC AGATCCTATT ACATTAGAAC ATTTGAGAAG TAAATTAAAA AAAGAAATTT 120 
TAAATAAAAA ATAATAGAAA TAAGTTGTAA AAATTTTCTT ATGCATTGGT ACTAATGTTT 180 

TTAAGGAGTG ATTAAATGAA GCAACTGGTT GGAATTCCCG AATCAATGTT AATTCCTTTG 240

ATAGCTCGAG CAAAAGAGTA CGAAAACGAA AAACCAATAA TAAAAGACGC ACTATCAAAA 300
AAAATATTTG ATGGTTTAGA TGATATGTAC AAAAATGTTA CATGTGATGA CATGTCTCAA 360

ATTGGAATTA GTATACGTAC TGTGATAATA GATTGTGTTA CTAAGAGGCT TATCAAGGAT 420
AATAAAGATT TAATCGTGGT CAATATAGGT TGTGGCTTAG ATACAAGGTT TCAAAGATTT 480
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ACATTTTTTA AAGAAAGTAA TAGTTATAAG ATGATATCTA AATCTATGCT AGATTACAGT 600 
TGGATTGATG ATGTCAAAAA TTATAAATTT TTTAATAGTA AGTCAGATAT ATTGTTTATT 660 


ATTGAAGGTG TATTGaTGTA TTTTGATGAG AGTGTAATGA CTCAATTATT GGACACTATT 720 
ATCAAAAAGA TGGGAGATCA TAATTTGACA TTTGCGATTg AATTTTGCTC aAAAACAATT 780 
GCGAATAATA CmAAGAGACA TCAATCGGTA TCCAAGTTAT CCTCACCACC TGTTTTTAAA 840 


TATGGGTACC ATGATTTAnA AAAATTGGAT GAnATTTACC C 881 
(2) INFORMATION FOR SEQ ID NO: 519: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3122 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 519:







Inlnl innih 




AnltJiu J. \~ \-~r\ 






ACnAGTTCCA TCCAACTGTT TAAAATACCC CTACAGGTAT TTTTAGGGAG GTTATTATGA 120

AACAATACGG AGAAAaGTTT ATCGATGAAT TTAGTAAAGC AGAATTGGAA AAACTAGCCA 180

AGCAAGGGCA ATTAATTGAC GTTAGAACAG AAGAGGAGTA TGCATTAGGA CATATCAATG 240

GTTCCATACT TCATCCTGTT GATGAGATTG AGTCATTCAA TAAAGAAAAA AATAAAACCT 300

ATTATGTAAT CTGTAGAAGT GGTAACAGAA GTGCTAATGC TAGTAAATAT TTAGCTAAAC 360

AAGGTTATAA CGTTATAAAT CTTGATGGTG GTTATAAAGC TTATGAAGAA GAAAACGATA 420

GTTATGATAC ACAAGAAGAA TATAAAAGTA TAGAAATTAA AGCAGATCGT AAACAATTTA 480

ACTATCGTGG TCTTCAATGT CCAGGGCCAA TTGTAAAAAT TAGTCAAGAA ATGAAGAATA 540

TTGAAGTAGG TGACCAAATT GAAGTCAAAG TCACAGACCC TGGATTCCCT AGTGACATTA 600

AAAGTTGGGT GAAACAAACA AGGCATACTT TAGTTAAGCT TGATGAAAAT AACAATGGAA 660

TTAATGCGAT TATTCAAAAA GAAAAAGCAA AAGATTTAGA TATAAATTAT TCTGCTAAAG 720

GTACTACAAT TGTATTATTT AGTGGAGAAT TAGACAAAGC TGTAGCAGCG TTGATTATTG 780

CAAATGGTGC TAGAGCTGCT GGAAAAGATG TAACTATCTT CTTTACTTTT TGGGGGCTTA 840

ATGCATTAAA AAAAGTGCAA ACAGTTAATG TTAAAAAGCA AGGTATTGCA AAAATGTTTG 900

ATTTAATGTT GCCCAAAAAG AATATACGAA TGCCTCTTTC CAAAATGAAT ATGTTTGGTT 960

TAGGAAATAT GATGATGCGC TACGTAATGA AAAAGAAAAA TGTTGATTCA TTACCAACAC 1020
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10 






TCATGGGTAT TCAGAAAGAA GAACTTAGAG ATGAAGTTGA GTACGGTGGT GTAGGCACTT 1140 

ATATTGGTGC TACTGAAAAT GCGAATCATA ATTTATTTAT CTAATTAAAT CTATTAATAA 1200 

AAGGAGTTGT TATCATGTTT TTTAAACAGT TTTACGATAA TCATTTATCT CAAGCATCAT 1260 

ATTTAGTGGG TTGTCAACGT ACAGGAGAGG CAATAATAAT AGACCCTGTT CGTGATTTAT 1320 

CGAAATATAT AGAAGTTGCA GATTCTGAAG GTTTAACAAT TACACAAGCT ACAGAAACAC 1380 

ATATTCATGC TGATTTTGCT TCAGGAATTC GTGATGTGGC TAAACGCTTA AATGCAAATA 1440

TATATGTGTC TGGCGAAGGT GAAGATGCAT TAGGGTATAA AAATATGCCA TCAAAAACAC 1500 

AATTTGTTAA ACATGGAGAT ATCATTCAAG TAGGCAATGT TAAATTAGAA GTTCTGCATA 1560 

CTCCAGGACA CACGCCTGAA AGTATTAGCT TTTTACTCAC TGATTTAGGT GGTGGTTCAA 1620 

GTGTTCCGAT GGGATTATTT AGTGGTGACT TTATTTTTGT TGGTGATATA GGTAGACCTG 1680 

ATTTAtTAGA AAAATCTGTT CaAATAAAgG GTTCTACAGA AATTAGCGCG AAACAAATGT 1740

ATGAGTCCGT TCAAAATATT AAAAATTTAC CAGACTATGT TCAAATCTGG CCGGGTCATG 1800 

GTGCTGGAAG CCCTTGTGGT AAAGCATTAG GTGCCATACC TATATCTACA ATAGGTTATG 1860 

AGAAAATTAA TAACTGGGCA TTTAATGAAA TTGATGAGAC TAAATTTATT GAATCATTAA 1920 

CATCAAATCA ACCAGCACCA CCGCATCATT TTGCACAAAT GAAACAAGTT AATCAGTTTG 1980 

GTATGAATTT ATATCAATCA TATGATGTTT ATCCTAGTTT AGATAATAAG AGAGTAGCAT 2040 

TTGATCTTCG TAGCAAAGAG GCCTTTCACG GTGGCCACAC AAAAGGAACA ATCAATATAC 2100 

CATACAACAA AAACTTTATT AATCAAATTG GTTGGTACTT AGATTTTGAA AAAGATATAG 2160 

ATGTAATTGG AGATAAATCT ACTGTTGAGA AAGCGAAACA CACTTTACAA TTAATTGGGT 2220 

TTGATAAGGT AGCAGGCTAT CGTTTGCCAA AATCAGGCAT TTCAACCCAG TCCGTTCATA 2280 

GCGCTGATAT GACAGGTAAA GAAGAACATG TATTAGACGT ACGTAATGAT GAAGAGTGGA 2340 

ATAATGGACA CTTAGATCAA GCAGTTAATA TTCCGCATGG TAAATTATTA AATGAAAATA 2400

TTCCTTTTAA TAAAGAGGAT AAAATATATG TACATTGTCA GTCAGGTGTT AGAAGTTCAA 2460 

TTGCAGTGGG TATATTGGAA AGCAAAGGTT TTGAAAATGT GGTGAATATT AGAGAAGGCT 2520

ATCAAGATTT TCCAGAATCA TTAAAATAAT TTAAGGATGT GGAAAAAATG AATAAGCATT 2580 

ATCAAATTGT TATTATTGGT GGCGGTACAG CAGGTGTTAC CGTAGCATCA AGACTATTAA 2640 

GAAAAAATCA AAACTTAAAA GAGAAAATAG CAATTATAGA TCCAGCAGAC CATCATTACT 2700 

ATCAACCATT ATGGACGTTG GTTGGTGCAG GGGTATCTAG TTTGAAAAGT TCTCGTAAAG 2760 

ATATGGAAAG TGTTATACCT GAAGGTGCTA ACTGGATAAA ACAGGCTGTT TCAAGTTTTC 2820 



TAGTAGCTCC AGGATTACAG ATTAATTGGT CTTCAATTAA AGGACTAAAA GAAAATATAG 2940

GTAAAAATGG TGTTTGCTCT AACTATTCAC CTGACTATGT TAACgAAACT TGGAACCAAA 3000 

TTTCTAATTT TAAACAAGGA AATGCCATTT TTACGCATCC AAACACTCCT ATAAAGTGTG 3060 

GAGGTGCgCC TATGAAAATT ATGTATTTAG CTGAAGATTA TTTTAGGAAA CATAAAATCG 3120 

GT 3122 




(2) INFORMATION FOR SEQ ID NO: 520:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3982 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 520:






ATAnAGATAT AnATnAATAT ATTGAGGTCA AACGATGATA ATTAAAATTT TAACAATTCT 60

ATTACTACTT TGTATATTGA GCTATTTGGT TACAAATAGA AAGAAGCCTT TTCTGTTCTT 120

A 1 1 M.1 IjijVj X Kj 1 L»o lAl I 1AI CTTTATAGGA TATATTTCAC TGGCAATATC 180

TGCCGTAATT ATTTATGGTA TTATTCAATT TATCACAATT GATTTTGGTA GTTTTTTCTT 240

AATGGGTATT ATATTGATCT TGATTTCAAG TATATTCCAA TTATTTATAG TTAGATTACT 300

TTTTAGAAAA AAGAATGTCG ATTTGACAGA GGTTGTCGTT TTAsAsCATT TAATTCAATG 360

GTTCTTAGTT TACTTTGCGA TCTATCAAGC AGTAAATGAA AAAATGGACA TTAATGATAT 420

TAATATCGAC AATTTCCAAT CTGTCTTTTT TGACGTGTCT AATTTGAATT TAGTAATTCT 480

ACCAACGTTA ATCATTAGCT GGGTCACAAT ATTTAACTAT AGAATGAGAA GTTACAAATA 540

AAATCTATGA GATTATACCT TCAGACACCA ACATTCAAAT GGTGTCTTTT 600

GTTTTATTTT TGAAATTCGA AAAAGTAGAG GCATGAATTT TTTGACTAGT GTATAAGTGC 660

TGATGAGTCA CAAGATAGAT AGCTATATTT TGTCTATATT ATAAAGTGTT TATAGTTAAT 720

TAATAATTAG TTAATTTCAA AAGTTGTATA AATAGGATAA CTTAATAAAT GTAAGATAAT 780

AATTTGGAGG ATAATTAACA TGAAAAATAA ATTGATAGCA AAATCTTTAT TAACATTAGC 840

GGCAATAGGT ATTACTACAA CTACAATTGC GTCAACAGCA GATGCGAGCG AAGGATACGG 900

TCCAAGAGAA AAGAAACCAG TGAGTATTAA TCACAATATC GTAGAGTACA ATGATGGTAC 960

TTTTAAATAT CAATCTAGAC CAAAATTTAA CTCAACACCT AAATATATTA AATTCAAACA 1020

TGACTATAAT ATTTTAGAAT TTAACGATGG TACATTCGAA TATGGTGCAC GTCCACAATT 1080



TCAAAATCTT GTGAGAGAAT TTGAAAAAAC ACATACTGTC AGTGCACACA GAAAAGCACA 1200

AAAGGCAGTC AACTTAGTTT CGTTTGAATA CAAAGTGAAG AAAATGGTCT TACAAGAGCG 1260 

AATTGATAAT GTATTAAAAC AAGGATTAGT TAAATAAAAC TTCAATCGTT GCTGTTATCT 1320

GGAAATAATT AATTAAATGT TATGTTAATT TTTGTTAATG AAAAAAGTAA TCTATTTAAT 1380 

GACAGGTTAA TGTAATTGTC CTGAAATTGA CTATATACTC AGTAAGTATC AATTTTAAGG 1440 


AGAGCTTATA ATGAAATTTA AAAAATATAT ATTAACAGGa ACATTAGCAT TACTTTTATC 1500 

ATCAACTGGG ATAGCAACTA TAGAAGGGAA TAAAGCAGAT GCAAGTAGTC TGGACAAATA 1560 

TTTAACTGAa aGTCAGTTTC ATGATAAACG CATAGCAGAA GAATTAAGAA CTTTACTTAA 1620 

CAAATCGAAT GTATATGCAT TAGCTGCAGG AAGCTTAAAT CCATATTATA AACGTACGAT 1680 

TATGATGAAT GAATATAGAG CTAAAGCGGC ACTTAAGAAA AATGATTTCG TATCAATGGC 1740

TGATGCTAAA GTTGCATTAG AAAAAATATA CAAAGAAATT GATGAAATTA TAAATAGATA 1800

ATAAATAAAA CAGGTTGAGA CAAAAAATGG TCTTAACCTG TTTTCAATTT GCATATGTGA 1860 

TAAATTCTAT ATCAAAATGC TTATGTATAA TGAATGACAT TTAAAAGTAG GGGAGACAAA 1920 

TATAAATACA ATAGTTCCTA GGATTACTCT CAAAATAACT ATATCAATTA TTTACTTTGC 1980

TCTCCTATTT TTTAAAATAT GTACATGTTT AAACAATCAA AAGTGTACAA TATTAAATTA 2040 

TCATTTCCAG TTCTAGTGCT ATATTGGTAG TAGTTGACTA AATGAAAATA AGCTTATAAC 2100 

AAGTTTTTTC AATACTCGTG GGGCCACAAC AGAGAGAAAT AGGATCACCA ATTCCAACAG 2160

ACAATGCAAG TTGGCGGGGC CCCAACATAG AGAAATTGGA TCACCAATTT CAACAGACAA 2220 

TGCAAGTTGG GGTGGGGCCC AACACAGAAG CTGGCGAAAA GTCAGCATAC AAAAATGTGC 2280 


AAGTTGGCGG GGCCACAACA GAGAGAAATA GGATCACCAA TTCCAACAAA CAATGCAAGT 2340

TGGCGGGGGC CCCAACATAG AAGCTGGCGA AAAGTCAGCT TACAAAAATG TGCAAGTTGG 2400 

CGGGGCCCCA ACATAGAGAA ATTGGAACCC CAATTTCTAC AGACAATGCA AGTTGGGGTA 2460 


GGACATCGAT AAAGAAATAC TTTTTCTTTA GCAATTAGTA TTTCTTATGC ATGAGCTTTA 2520 

CTCATGTATT CATTTTTTAA GTACaCATTA GCTACAGCTA ATGATAAAGA ACCACTACAT 2580 

AATAAATCAT TAGTGGTTCT TTATCATTTC TATCTCACTC TTTTACTGGA AGAAAAAGTT 2640

TACGTTTGTA GAACATGCCA CAATACCAAA AATAATTAAG AAAAATAAGA CGATAAGCAT 2700 

GATGACACTT TTCAAACAAC CTCTATCAGT TTCTTTCGAT TTTCTTTGTT GAACCTTTTT 2760 

ATAATCTTCA AGTAGTTTTG CGGCTTTTTT ATTTATATGT TTATTCATGA TGTTGACTCC 2820

TTATAATATA TGTTTAATTC ATTAAAATAG TTGAAAACAT GACTTGAAAT AAAGATATAA 2880 

AATTTGATGA TGATATTTGC TTTTTATTTT CCAAATGGAA TTTACTTAAA CTGATGCATT 3000

AAAATATTAA TGAAGCACTA GAATACATAA ATGAATAGTa ATGGTGcACA GTATAGAATA 3060

ATTAAGGCTA TATTAAGTAT AAATATCGTT AACTGTAAGC TATCTTTAGT TTTAATATAA 3120

ACTATTAGGA TAATCGACGT AAGAAGAATC ATATATATTA ATGATGAAGA AGTCCATACA 3180

AAATCCGCAT CATTTGTTGT TAATAATGGG ACTATAATTA ATCCGAAATT AATCATGCAT 3240

GCTATATATA CTATAATGTT ATACACAATG TTAATTTTTG TTCACCACCT TATACTTCTA 3300

TTTTAAAAAC TTCTTTATAA TGATATATTG TTTAATGTTG AAATAATTAG ATTATCTAAT 3360

TTTCATTTGC TTTACATGTA AAAGGCTATA TATAGTATGC TCTTTATGAT TCTAAATGCT 3420

TTTTAATATT TAATGCTCAT CAACATTTGG ATTTTGAATA TTCAATTCAA AAACTTTATT 3480

AGCTACGTCA ATTGTAAAAT CAGAACCATA GTTGACATGA GCTACTTTTA ATTTTCCATC 3540

TAAATAATAG ATTGCGATTG CAACATCGTA AAATTCGTCA ATGACAAATA AACTCTTTTC 3600

GTTTGTTACA ACCTCATGCT CTCCTGAGTA TACAACGTTA ATTTCCCAAT CATTAAAAAC 3660

CATTTGTTAA CCTCCTTGAA CATTTAAATT GATTCAACTT AAGTTTAACT TATTCATACA 3720

ACTTCGTACA ATATCTAGAT GAACATTAAT TGTATTTCTA GAAATCTTTT TCAATTATAT 3780

GTACTAATTA TACTTTTAAA TTTCTTATTT CAGTATAGTT TTAAAACGAT TTTAAAATAA 3840

TTCTGCAAAT ATATTAACAC ATAATGTGTT CAAAAAGTTT TGAACAATTT CAAAACTTTT 3900

ATATAAAGGG nTTGACAACA TGGATTCAAA TnTCTTATTT TAAAAATTAC CTCATATAGT 3960

GTCATGTTAG CCAATTTTTA AG 3982

(2) INFORMATION FOR SEQ ID NO: 521: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 521:

AGCTTGGaTG ATTTAATaTG GtCCtTTCCC AaCcTTAGAT AATGAAAGAT tTGATAATCC 60 


TGAATATAAA GAAGCTATGA AAAAATATcA ACAGAGATTT ATGGCTGmAG ATGAGGCTTT 120 

GAAGAAATTT TTTAGTGAAG AGAAAAAAAT aAAAAATGGA AATACTGATA ATTTAGATTA 180 

TCTAGGATTA TCTCATGAAA GATATGAAAG TGTATTTAAT ACTTTGAAAA AACAAAGTGA 240

GGAGTTCTTA AAAGAAATTG AAGATATAAA AAAAGATAAC CCTGAATTGA AAGACTTTAA 300 

GTTAGGTAAA ACATTTTATC AAAACTATAG AGATGATGTT GAAAGTTTAT ATAGTAAGTT 420

AGATTTAATT ATGGGATATA AAGATGAAGA AAGAGCAAAT AAAAAAGCAG TTAACAAAAG 480

GATGTTAGAA AATAAAAAAG AAGACTTAGA AACCATAATT GATGAATTTT TTAGTGATAT 540

AGATAAAACA AGACCTAATA ATATTCCTGT TTTAGAAGAT GAAAAACAAG AAGAGAAAAA 600

TCATAAAAAT ATGGCTCAAT TAAAATCTGA CACTGAAGCA GCAAAAAGTG ATGAATCAAA 660

AAGAAGCAAG AGAAGTAAAA GAAGTTTAAA TACTCAAAAT CACAAACCTG CATCTCAAGA 720

AG7TTCTGAA CAACAAAAAG CTGAATATGA TAAAAGAGCA GAAGAAAGAA AAGCGAGATT 780

TTTGGATAAT CAAAAAATTA AGAAAACACC TGTAGTGTCA TTAGAATATG ATTTTGAGCA 840

TAAACAACGT ATTGACAACG AAAACGACAA GAAACTTGTG GTTTCTGCAC CAACAAAGAA 900

ACCAACATCA CCGACTACAT ATACTGAAAC AACGACACAG GTACCAATGC CTACAGTTGA 960

GCGTCAAACT CAGCAACAAA TTATTTATAA TGCACCAAAA CAATTGGCTG GATTAAATGG 1020

TGAAAGTCAT GATTTCACAA CAACGCATCA ATCACCAACA ACTTCAAATC ACACGCATAA 1080

TAATGTTGTT GAATTTGAAG AAACGTCTGC TTTACCTGGT AGAAAATCAG GATCACTGGT 1140

TGGTATAAGT CAAATTGATT CTTCTCATCT AACTGAACGT GAGAAGCGTG TAATTAAGCG 1200

TGAACACGTT AGAGAAGCTC AAAAGTTAGT TGATAATTAT AAAGATACAC ATAGTTATAA 1260

AGACCGAATA AATGCACAAC AAAAAGTAAA TACTTTAAGT GAAGGTCATC AAAAACGTTT 1320

TAATAAACAA ATCAATAAAG TATATAATGG GCA 1353



(2) INFORMATION FOR SEQ ID NO: 522: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1987 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 522:



GTCTGAGTCA GGTGCTGTTT GGTTAGATGC TGAAAAAACA AGTCCTTATG AATTTTATCA 60

ATTCTGGATT AATCAATCAG ACGAAGATGT AATTAAATTC TTnAAATACT TTACTTTCTT 120

AGGAAAAGAA GAAATTGATC GCTTAGAACA ATCTAAAAAT GAAGCACCGC ATTTACGTGA 180

AGCTCAAAAA ACATTAGCTG AAGAAGTAAC TAAATTTATT CATGGTGAAG ATGCATTAAA 240

TGATGCAATC CGTATTTCAC AAGCATTATT TAGTGGTGAT TTAAAATCAT TATCAGCGAA 300

AGAATTAAAA GATGGATTTA AAGATGTGCC TCAAGTGACA TTATCAAATG ACACAACAAA 360
TGTTAACAAT GGTGCGATTT ATATTAATGG TGAGAGACAA CAAGATGTTA ATTATGCTTT 480

AGCACCAGAA GATAAAATTG ATGGCGAATT TACGATTATT CGTCGCGGTA AGAAAAAATA 540

CTTCATGGTT AACTATCAAT AAATATAATT GCATAGCTAA ATAAATTAGA GCCTACTCAT 600 

ATTCATTCCT AAGAATGTAA TGAGTAGGCT CTTAATGTAC TTTTCTGTCT GTAAATTATC 660 

TAAAGAAATC TCTATCGCCT TGGCCAGGTG ATTGACGTTC TGATCGGCTT TGACGTTTTG 720 

GTTGTTCTTT TTGTTGTTTT AATTTCACTT TAACTTCTTT TGTTTTACCA TCACGGATAA 780 

CGGTAACAGT GACTGATTCA CCAGGTTTTT TATTTTCATA TAAATAGCTT CTTAAATCAA 840

CATCATCTTT AATTTTCTTG CCATCAATTT CTGTAATAAT ATCACCTTTT TTAAGATCAA 900 

TATCACTATC AGCTTTGGCG ACATAAATAC CGTCTTCTCT ATCAGTATGA AGTTGCTCGC 960 

GCTCTTCTTC AGGAATATCT TTCAAATTAA TTAAACCAAT ACCAATCGAA GGGCGGTCAA 1020 

TTTTACCATG TTTTACAAGT TGTTCAATTG TTACTTTAAC TTCATTACTT GGAATAGCAA 1080 

ACCCGATACC TTCAACTTGT GTCGCAGCAA TTTTCATTGA GTTAATACCA ACTAAATTAC 1140

CATTAATATC TACTAATGCG CCACCTGAGT TACCTGGGTT AATAGCAGCA TCTGTTTGAA 1200 

GAACGCTAAC TTTTGTATTG CCACCAGTTG TCTCAGCGTC AATCGTACGT TCGCTTGCTG 1260

AAATGATACC AGATGTTACA GAGTTAGCAA ATTGTAATCC TAATGGGTTA CCCATTGCGA 1320

ATACGCTATC GCCAGTTTGT ACTTTTGAAG AGTTGGCAAA TTGAATCGCT TTAATACCTT 1380 

TTGTATTTTC AATTTTAAGT ACAGCAATAT CAGTTACTGc ATCTTTACCA ACTAATTTCG 1440

CTTTAACTTG TTTTTTATTA TGTAATTGGa CTCTAATTTC ATTTGCGCCA TCAATAACAT 1500 

GATTGTTTGT AACGATATAA GCTGAATTGT TGTTTATTTG aTAGATAACA CCTGAACCTA 1560 

CTCCAGCTTC AGATGGTTTA GATGATTTGC CyTTTAATAA GTCGTCTACA CTTGATGCTT 1620 

TTtGCATGTk AATAACTCCA ACAATTGTAG GGGAGACAGA TTTTATCATT TCATGAACGG 1680 

TACCGAATTT CTTGCTTTGA CCGTCTAATT GATTGCCACC TTTATTATTT GTTGTCTGAA 1740

CAGTTGAACC ATCTTTATTT AAAATTGTAC TATTTAATAC TTTGCCTATA CCAAGTACTA 1800 

GAAGTGCACC AATAATTCCA GCAATCAATG CAACGATGAC TGTnTTAAAC CATGGAAATT 1860 

TAGGTCTTCT GTATCTAGGT GTTTGGCTAT GGTTTGTTGT AGAATGATCT GTATGATTAA 1920 

AATCTGACAT ACTTAACCTC CATTATATGA TTTATATATG CTTTAATTAT GTCTTTTnTT 1980 

TATAATT 1987

(2) INFORMATION FOR SEQ ID NO: 523:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1351 base pairs 




(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 523:

TTAGAAGTCA AATCATTAnT GGCGTCnTAT CGAGTATACT ATTAACTTCA ACTATTTTAG 60 

CAATTGCATA TATTTTAATG TGGTTTAACG GCCATATGAC aCTAACTTTG ACCTTAACGA 120 

CAATAATTAC AAGCTGTTTA ACCTTATTAA TATGTAGTAT TTTTATTAAT CCACTTATAC 180 

AAAAAATTAA GCAGTTTAAT ATAAAAACTA AGCAATTTGC TAACGGAAAT TACGCAAGCA 240

ATGATAAAAC GTTTAATTCA CCAAAAGAAA TTTATGAATT AAATCAATCT TTTAATAAAA 300

TGGCTTCTGA AATTACGCAA CAAATGAATC AAATTAAATC CGAACAACAA GAAAAAACAG 360 

AACTGATTCA AAACTTAGCC CATGATTTAA AAACACCTTT AGCAAGCATT ATTTCATATT 420 

CTGAAGGACT ACGTGATGGT ATAATCACTA AGGATCATGA GATTAAAGAG TCATACGACA 480

TATTAATTAA ACAAGCAAAC AGATTATCAA CATTATTTGA TGATATGACT CATATTATCA 540

CTTTAAATAC AGGTAAAACA TATCCCCCAG AATTAATACA ACTAGACCAA TTACTTGTAT 600 

CAATATTGCA ACCATATGAG CAACGTATCA AACATGAAAA CCGCACATTA GAAGTGAATT 660 

TCTGTAACGA AATTGATGCA TTTTATCAAT ATCGAACGCC ACTTGAGCGT ATTTTAACAA 720 

ACTTACTTGA TAATGCGCTA AAATTTTCAA ATGTTGGTAG TCGCATTGAT ATTAATATTA 780 

GTGAAAACGA AGATCAAGAT ACTATCGACA TTGCTATTAG CGATGAAGGT ATTGGCATTA 840

TACCAGAACT ACAAGAACGT ATATTCGAAC GTACATTCAG AGTAGAAAAC TCTCGTAATA 900 

CAAAAACGGG TGGTTCTGGA TTAGGCTTAT ATATAGCTAA TGAACTCGCG CAACAAAATA 960

ACGCAAAAAT CAGTGTAAGC AGTGATATAG ATGTAGGAAC TACGATGACT GTAACATTAC 1020 

ACAAATTAGA CATTACGTCA TAATCCGATT TATTTATAAA ATAAAATGCA AAGACTAAAA 1080 

AGAAGCTCCC ATTAATGAGG GCTTCTTTTT TTGTTTATTT AGAATAAACT TTATGGGTAT 1140 

CCTTCTCATC ATTTTCAAGA CTTGAAAGAT TTGTAGCTTG AATAATATAT TTAGGACGTG 1200 

CCTTAACTTC ATAATATATC CTGCCAATAT ATTCACCTAC AACACCAATT GAAATTAACT 1260 

GTATGCCGCC TAATAATAAA ATAGCTGCAA TCGTTGAAAA ATATCCCGGA ATATTAACAC 1320

CAGATATCAT AATATTGATG GAATAAATAG A 1351 
(2) INFORMATION FOR SEQ ID NO: 524: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 524:



TTGTTGTCAG TTTAACGCAA CGTGTTACCG ACTTTTCAAG TACACAAcAA TATATATGTC 60

TATTCCATTT TTAGCCCcTG CCATTTTCAT CATTGGTGGT ATTATGTTGT TTATTTCAAC 120

ATTTAATAGC TTAGATGAAA CTGCTGAAAA TAATAACAAA ATAAAGAAAC TAATGATTAA 180

AGGACTTATC ATTATTAACA TTTCATTTAT CGTTATGATG GTTTTAACAC CATATTGGTA 240

CTTGTATTTA ATCGTCTATC TTATTTTCTT GTTGTTTTTA TTGTGGCAAA AGGTTTATAA 300

ATTTTAATAC CAAAACTATT AAACACTTCT GATATTCTTA GTTCAAAATA TCAGAAGTGT 360

TTTATAGTGT TATCTAGTTC AGATAAATAT TTCCTTACTT AAAAAAACGC CCTCCTCTTA 420

TTTTGACCCC nAT 433




(2) INFORMATION FOR SEQ ID NO: 525: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1845 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 










(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 525:






CCCCCACTAT GATATGCTTG GCCTATTGCG TCAATCCCCT TATATTTCGG CAATAGAGAT 60

GGATGTATAT TCAATATTTT ACCTTCAAAT GAAGCTAATA AGTCTGGACC AATTAGACGC 120

ATGTAGCCAG CTAGAATAAT CCACTCTACC TTATCTTCAT TTAATAATGT TACTAAATGT 180

TGTTCATACG CTGCTTTTGA ATCAAATTGT TTTGGTTCAT TAATATAAAC AGGAATATCG 240

TGCTTTTTTG CTCTATCTAT ACAAAACGCA TTTTGATGAT CCGTATATAG CGCCGTAACT 300

TCAATATTTT CAAGTTTTCC TGATTCAACA TGCTCAACTA TATTTkCAAA GTTACTTCCT 360

GAACCTGATG CAAAAATCGC AATTTTAACC ATTGTTATAC CCCCAACAAT TCAATTGCAG 420

TTGACTCATT TTTCACAATA TGACCAATTT GATAGGCTTC CACATTTTGT TCTGCTAAAA 480

TCTTCAAAGC GCGTGATACA TCTTTTTCAT CAACGATAAC CGTATAGCCA ATACCCATAT 540

TAAAAATGTT ATACATTTCA TTTGTGTCTA TATTGCCTTG TTGTTGTAAC CAATCAAATA 600

TTTTTGGTGT TGGAAATGAT GTAGTATCAA TTCTAGCAGC ATATCCGGCT GGCAATGCAC 660

GTGGAATATT TTCATAAAAA CCTCCACCAG TAATATGATT CATTGCCTTA ATAGAAACTT 720

CTTTTTTCAA AGCAAGTACA GGTTTGACAT ATAATTTAGT TGGTTCTAAA AAGACATCGA 780
GCACTAAACT GTATCCATTT GAATGAATGC CACTTGACGC AAGCCCTATA ACAACTTGTC
CCTCTTTCAC TTCTGAACCA TCTACATAGT CATCCTTTTC AACTGCTCCA ACAGCAAATC 
CAGCTACATC ATATTCGCCT TCGTGATACA TTTCACCCAT TTCAGCAGTC TCTCCACCGA 
TAAGTGCAGT ATTCGTTTCA ACACATGCAT CACTAATACC TTTAACAATT TGTTCAATAA 
CTTCAGGAAC AACTTTGTTT GTAGCAATGT AATCTAAAAA ATATAATGGT TCTGCACCTG 
TCGTTAAAAT ATCATTAACA CACATTGCGA CTGCATCGAT ACCTATCGAA TCATGTTTAC 
CATAGTCGAT AGCTAGTTTT AATTTCGTAC CTACTCCGTC TGTTCCAGAA ACTAAAACTG 
GCGCTGtCAT ATTTAATTGT GATAAATCAA ATGTAGCACC GAATCCACCT AAACCACCGA 
TAACTTCTTT ACGCATCGTA CGtTTAACaw GaCtAGaCaT TCtTTCtACA GCTTCATAAC 
CAGCATGAAT ATTTACACCA GATTGTTCAT ATGCTTTAGA CATTTAAATT TCCCTCGCTA 
TCAAAAAAGT GTTTGTTTTT AGAAATATAT TGCTTTTGTC GATGACTTAA ATGCGCTTTA 
TAATTTGCTT CATAATCATA TAACCCTGCA GGATAATCTC CAGTGAAACT TTCTACACAT 
AAGCCACTAT ATGGCGCGTC ATAATCTAAA CCAATTGATT CAATTAACCC ATCTACAGAT 
AGATATGCTA ATGAATCAGC GCCAATATAA TCTTTAATTT CTTCAGGTGA TTTGCTTGCA 
GAAATTAATT CTGCCGTAGT TGAAACATCG ATTCCGTAAA AACTTGGAAA CATAAATTCC 
GGTGATGCTA TACGCACATG CACTTkATTT GCACCAGAAT CkTTTAACAT TTTCACAATG 
CGTCGAATTG TCGTACCGCG nACAATGGAA TCATCAACAA GACTG 
(2) INFORMATION FOR SEQ ID NO: 526: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1141 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 526:
TCATAAATnG TAAATGTTAA TCGTCATATA ATATTAATTT ACAACACCAT TTTGGTTATT 
TGAAGCTTGT GGCGCTTGTT GTGTGCCACC TTGATTTTGA TTTGAGTTTT GATCTGTAGC 
AGGTTGTTGT TGATTGCTGG AratCACTGTT ATTAGTTGAm TCACTGTTCT CGTTAGATGT 
CTTATCTTTA TCTGTCGTAT CATTTTGTTC TTTTCTCAAT AAACTATTAT CTAAAGGCGT 
TAATGGTATT AATGAACCAT AATGATTAAT GACACGTTGA TCTAAGAAAT CATTTTTATC 
ATTAATAGGT GATAATTCTA AGTCTTTACG AAGTAAGTTT GCATATTTTT GAATGCTTTC



AGTTTTAATT TCAACATCAT TTGTTaAGTA CATTTTTGCT AAAGCTTTAA TTTCAGAGTT 480

AGTTAAATTA TGCTTTGCAT TTTTACCTAC AATTTGAATC ACGTTATCAA GTTTATCAAT 540

AGAATCAACT TCCTGTGCTT TTTGGAATAA AATCTTAATT AATTCCATTT GACGTTGTCC 600

ACGTTTTAAG TCTGAATCAT GATGTCTAGT TCTAGCAACT GCTAAAGCCT CATCACCATT 660

TAATTTTTGG TACCCTTTTT TAATTTTAAT CTTACCAGTA TCATCTGTGT TAGGTTCATT 720

TAAGTCGTAT GGCACATCAT AGTATATGCC ACCAAGCTCA TTTACAGCCT CGACAAATGC 780

TTTCATATTG ACTCTCACAT AATAATCAAC AGGTACATTC ATGGTAGCTT CTACCGAATC 840

LKi IoLAjuua h TTYSfi Zi PP H P P Zi *Pti *m P H TP TGCATGGGTA ATCTTATCGT AATAGCCAAC 900

TTTAGGAATG TAGCTGATAG TATCACGTGG AATACTAAGC ATTCTAATTT GATGTTTTGA 960

TTGATTAAAA GTAGTTAAAA TCATAGCGTC TGATCTAGAG TGTTCAGCAT CCTGTCCTTT 1020

TTTTCTTCTT CCATCGTTAT CATCGATACC TAAGAAAAGA ATAGAGATAG GTTGTTCTTC 1080

GGGATGACTT TATTATCTCT TAAGTTGGAT TGACGTTAGC ATTTTTGCTG TCTTGAGAAG 1140

A 1141
(2) INFORMATION FOR SEQ ID NO: 527: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1565 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 527:

GCACCATTAG CATTGGGCnC ACTGATTGGA GTAnCAGTTG TTGAAAATTC GGCGCCAACA 60

AGTAAACmGG CmCaGGCaGC mATamCCCCa TATTATAcTT ATAATGGTTA TATTGGTAAT 120

AATGCTAATT TTATTCTGGA TAAGAATTTT ATTAACGCGA TTAAGTATGA TAATGTGAAA 180

TTCAATGGTA TTAAATTAGC TAAAACGAAT ACGATAAAAA AAGTAGAGAA ATATGATCAA 240

ACTTTTAAAG GTGTTTCTGC AAAAGGAAAC GAAGCAAGTC AATTGCAATT TGTAGTTAAA 300

AATAATATTT CATTAAAAGA TATCCAAAAA GCTTATGGCA AGGACTTGAA AAAAGAAAAT 360

GGTAAAACAA AGGAAGCTGA TAGCGGTATT TTTTACTATC AAAATGCTAA AAAGACATTA 420

GGCATCTGGT TTGTCGTTGA TCATAATAGA GTTGTCGAAG TAACAGTTGG ACATACACCA 480

TACAAAACAA GTAAATAAAA TAATGGCATA TTAAGGCTAG AGTGTGAGGA GTGATACCGC 540

ATTCTAGTCT TTTTTATTAA ATAATAACGA TTATTGCGTC TTACATAGTT GTTTGAAATT 600



GATTAAGTAT ATAGAGCACT ATTTTGTATT TGTTAATATT TTCACAAAAA TAAAGCCTTG 720 

ATAAATTTTA AAATATAATT AAGCTCAATT TTTAAAATTT TATTTAGCTA CAGATAACAT 780

TTTTAAAAAA GAAAAGAATC AATAAATAAA ATCAACGAAC AAAAAGTATA GAAATAAATA 840 

GAAATAATCA TTTACTTTTC TGAAAAATTA AATTAATATT TTATTTATAA GCTGTTTTTA 900 

AGATTTCAGG AGGAATGAAA TGTGaGGAAA TTTTCAAGAT ATGCATTTAC AAGTATGGCA 960

GCATTAACCT TGTTGAGCAC TTTATCACCA GCAGCATTAG CGATTGATTC AAAAAATAAA 1020 

CCAGCTAATT CTGATATTAA ATTTGAGGTG ACTCAAAAGA GTGATGCGGT CAAAGCATTA 1080 

AAAGAATTGC CTAAATCCGA AAATGTAAAA AATATTTATC AAGATTACGC TGTTACTGAT 1140 

GTAAAAACTG ATAAAAAAGG ATTTACGCAT TATACATTGC AACCGAGTGT TGATGGTGTT 1200 

CATGCACCTG aCAAAGAAGT GAAAGTACAC GCAGACAAAT CAGGAAAAGT CGTTTTAATC 1260 

AATGGGGATA CTGATGCGAA GAAAGTAAAG CCAACGAATA AAGTGACATT AAGTAAAGAT 1320

GACGCAGCCG ACAAAGCATT TAAAGCAGTT AAGATTGATA AGAATAAAGC GAAAAATCTT 1380 

aAAGATAAAG TCATTAAAGA rAACAAAGTT GAAATCGATG GTGACAGTAA TAAATACGTT 1440

TATAATGTTG AGTTAATTAC AGTGACACCA GAAATTTCAC ATTGGAAAGT TAAAATTGAT 1500

GCTCAAACTG GCGAAATTTT AGAAAAAATG AACTTAGTTA AAGAAGCTGC AGAAACTGGT 1560 

AAAGG 1565 


(2) INFORMATION FOR SEQ ID NO: 528: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2870 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 528:

GGATAGGTAA ATCCATTCAA AATTGGTTGT ACTCATTTTA AATCATATTT TTTAATATAG 60 

AAACACATAA TTAAAGGAGT GATGATATTA ATGAATAAAC TTCGAGACAC TACTTTTCTA 120

TCATATTTAT TTACTATCAT ATTGTGGGGA TCTGCATTTC CAATGATAAA GATTGCGTTA 180

AATGATTTTA GTGCAGAATC ATTGTCGGCA TTTCGTTTAA TTTTAGCAAC AATAATTCTC 240

TTGCCGTTCG TAATTATAAA GAAATTGCCT ACCCCTGAAC TAAGAGATAT CCCTGTTATT 300 

TTTATTTTAG GATTTTGTGG ATTTGTGATA TATCACACAG CTTTAAATTT TGGTGAAaCT 360 

TTGATTAGTG CAGGTATAtC TGGrATTCyA GTCTCTACAA CGCCTATTTT TTCTAGTGCT 420 

GCATTTATAG GAATATCCAT TATTTCAATA AGTAAAGATG ATTACACAAC TATTAATGTA 540

TTAGGTGTTT TTATTATTTT ACTTGCATCT TTTAGTGAAA GTTTGTATTT TACTTTCCAG 600 


AAAAAATACA TAGAAAAATA CGGCTTCATC GCTTTCACAC TATaTACAAT AATGGCAAGC 660 

TCACCATTTA TGCTTATTTT TATTCCTGAA ATCATCAACG ATATACACGG CGCCACTTTT 720 

ACATCAATAG TATCGGTACT TTATTTAGCT ATATTCCCTA CTATAATTCC ATACGTTTTG 780 


CTTGCTTATA TTGTGAAGTC AGTTGGTGTC TCTGATGCAA CAATGTCTCT TTATTTAACA 840 

CCTATCGTTT CTTTATTATT ATCTTATCTG TTATTAGACG AGCTACCAAC AACCCTTGCT 900 

ATTATAGGCG GAATTATCAC TCTACTAGGC GTTAGTTTAA GTAACTTCTT TCAAAATACA 960

TAATTATTCC AAGTCCCGCA CCTCAGAATC CAAAAACATT CGAGTGATAA AATTTTAAAA 1020 

ATCAAAAATA TAAAAATGAT CTAATTTCGT CAAATTTACC AATATAAATA CTAATATTTG 1080 

CAATTCACAA AGGGGTATAG TCTGAGTGTA TTCTAATACG AAAGGACTTG GTGGATATGT 1140

ATTACAGTTA TGGAAATTAT GAAGCATTTG CGCGCCCTAA AAAACCTGAA AATGTAGAAA 1200 

ACAAATCCGC TTACTTAATC GGATCTGGTC TAGCTTCACT TGCTGCAGCT TGTTTTTTAA 1260 

TAAGGGATGG TCAAAfGGAA GGTTCGAAGA TTCATATTTT AGAAGAGTTA CCTAAAGCAG 1320 

GTGGTAGTCT TGATGGTGAA AATATGCCTT TAAAAGGCTA TGTTGTCCGC GGTGGTCGTG 1380 

AAATGGAGAA CCACTTTGAA TGTTTGTGGG ACTTATTCAG ATCTATCCCT TCATTAGAAA 1440


TCGATAACGC GTCTGTATTA GATGAGTTCT ATTGGCTAAA CAAAGAAGAC CCTAACTATT 1500 
CTCGCTGTCG TGTTATTGAG AAACAGGGTC AACGTTTAGT CACAGACGGA GACTTCACTT 1560

TGACTAAAAC GGCGATTAAA GAAATTTTAG ATTTATGCTT AACGAATGAA GAAGATTTAG 1620

ATGATGTCAA AATAACAGAT GTATTTTCCG ATGACTTCTT TAATTCAAAC TTTTGGATTT 1680 
ACTGGAAAAC GATGTTTGCA TTTGAACCGT GGCATTCTGC AATGGAAATG CGTCGCTATC 1740

TAATGCGATT CGTTCATCAT ATTAGTGGTC TCGCAGACTT TTCAGCTTTA AAATTCACTA 1800

AATATAATCA ATATGAATCT TTAGTATTAC CTATGGTTGA ATATTTAAAA TCGCATGGGG 1860 

TTCAATTTGA ATACGATGTA AAAGTCGAAG ATATTAAAAT AGATGTTACG ACAAGTCAAA 1920 


AAATTGCCCG AGAAATATTA ATTGACCGTA ATGGTAATGC AGAATCTATT AAACTGACTA 1980 

TAAACGATCT TGTCTTTGTG ACAAACGGTA GTATTACAGA AAGCTCTACT TATGGTGATA 2040

ATGATACACC AGCGCCACCA ACTGACGAAT TAGGTGGTAG TTGGACACTA TGGAAAAATT 2100 


TAGCGCGACA AAGTCCTGAA TTTGGTAATC CTGATAAGTT TTGCCAAAAT ATTCCTAAAA 2160 

AAAGTTGGTT TGTTTCAGCA ACTTCTACAA CAAACAATAA AGAGATTATC GATACAATAG 2220 

TCAATGATTC TGCATGGCAA ATGAGTTTTA CAATCAATCG TCAGCAACAG TTTAAAGACC 2340

AACCTGAAAA TGAAATATCT ACATGGATTT ATGCCTTATA TTCAGATGTA AACGGCGATT 2400 

ATATTAAAAA GCCAATTACA GAATGTAGTG GTAATGAAAT ATGCCAAGAA TGGCTGTATC 2460 

ACTTAGGTGT ATCAACTGAC AAAATTGAAG ACTTAGCAAA ACATGCATCT AATACGATTC 2520 

CTGTTTATAT GCCATATATC ACATCTTATT TCATGACGCG TGCTATCGGC GACAGACCTT 2580 

TAGTCGTCCC GCATCAATCT CAGAACTTAG CATTTATTGG TAACTTTGCA GAAACAGAGC 2640

GAGACACTGT ATTTACAACA GAATATTCGG TTCGTACTGC CATGGAAGCT GTTTATCAAT 2700 

TACTAAATAT AGATCGTGGT ATTCCAGAAG TCATCAATAG TCCATTTGAT CTTCGCGTCT 2760 

TAATGGATGC CATATACGAA CTGAATGACC ACCAAGATTT GCGTGAGATT ACTAAAGATT 2820

CGAAAATGCA AAAACTCGCA TTAGCAGGAT TCCTTAAAAA GATAAAAGGT 2870 
(2) INFORMATION FOR SEQ ID NO: 529:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3105 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 529:

CnGTTTAGCT GCAAAAAGTA AATAACGACA CTGTATTTGG AATTTTGCAA TTAGAAACAC 60 

TTTTGGGTGA CATTAACTCA ATTTTCAGCG AGATTGAAAG CGAATACAAA ATGTCTAGAG 120 

AAGAAATTTT AATTTTACTA ACTTTATGGC AAAAAGGTTC TATGACGCTT AAAGAAATGG 180 

ACAGATTTGT TGAAGTTAAA CCGTATAAGC GTACGAGAAC GTATAATAAT TTAGTTGAAT 240 

TAGAATGGAT TTACAAAGAG CGTCCTGTTG ACGATGAAAG AACAGTTATT ATTCATTTCA 300

ATGAAAAGTT ACAACAAGAG AAAGTAGAGT TGTTGAATTT CATCAGTGAT GCGATTGCAA 360 

GTAGAGCAAC AGCAATGCAA AATAGTTTAA ACGCAATTAT TGCTGTGTAA GTTTAATAGC 420 

ATAAAAAGAG GTTTTCATTA AGTTGAAAAC CTCTTTTTGT TGTTGGCATT AATTTTTCAA 480

ATGTTGACTA CTCAATCCTA AATTATAAAT AGTATAGCGC aCAAaTGCTT AAGAAATTTT 540

TTCTATGGCA CAAATGAATG GAGCATGATT ACGTTGGTTT AAAAATTGAT ATTGCAAAAC 600 

TTGCGCATGC TTTTGATCCA AAGTACTCAA GTAATCAAGC AATGCATGCT TCTCAATTTG 660 


TCCTTCGCTA TGACCATGAT ATATAACAAG TACAATAATA CCTTCAATTG ACATTAATGA 720 

TAGCAATGAA TTAATAGCTT GGATTGTCGT GTCAGGCTTT GTCACGATAG ATTTATCACC 780

TATATGATGT TCAATATTTT CATGTCCATC TTTTATTAAA GAAACATGAT TGAAATCCTT 900

AACTTTATCA CGTGTATTTT CCAAAGCTAA ATCTTGAATG TCGAAACCAT AAACATGTCC 960 

TTCTGGTACT TGTTCGGCTA AAAATAAAGT GTCATTGCCG TTACCGCAAG TTGCGTCTAC 1020 

AACAATACTT TCTGGTGTTA TATGTTGTTT AATAAGTGTT TTTGAAAAAG GGAGTATACG 1080

TTCTAATTTC ATTGCTTCAC CTTAGACTTG TAACGGAAGC CTTGATAAGA ATTTCTACGT 1140 

GCTAATTCAG CATCGATGCC ATTTAATACT TCCCATTTAT TAACACTCCA CATTGGACCT 1200 

ACGATGATAT CTATTGGACC ATCACCGGTA ATTCGGTGAA CGATCATTTC AGGGGGAATC 1260 

ACTTCTAATT GGTCCACAAC TAGGTTTGTG TACTCTTCTT GAGTCATAAA AGTTAATAAA 1320 

CCTTTATCGT ATTGTTTTAC CATCGGTGTA CCTTTTAACA AATGAAGTAA ATGAATTTTA 1380 

ATACCTTGTA CATCCATTTG TGCAcTTCTT TGGCAGTAGC CATCATCATG TCATAGTCTT 1440

CGCCAGGTAA GCCATTAATG ATGTGTGTAC ATACATTGAT ATTATGCTTA CGTAATTTTG 1500

CCACACCATC ATAATAAGTT TTCATATCAT GGGCACGATT GATTAAATCA GATGTTGACT 1560 

GATGGATTGT TTGTAGTCCT AATTCAACCC ATAAGTATGT TCGTTGATTC AAATCTGCTA 1620 

AATATTCGAC AACATCGTCT GGTAGACAGT CAGGACGCGT ACCAATAGAT AATCCCACAA 1680

CACCCGGTTC TTTAAGTACA GGTTCGAATT TTTCTTTTAA TACTTCAACC GGTGCATGTG 1740 

TATTTGTAAA TGCCTGAAAA TAAGCAATAT ATTTTCCTTC GTGCCATTTC TCATGCATCT 1800 

TTTCTTTAAT TTCTTTAAAT TGTACTGCGA TTGAATCTGC ACGATTACCT GCAAAGTCTC 1860

CGCTACCTGC AGCAGAACAA AATGTACATC CACCATGTGC TACAGTGCCA TCGCGGTTAG 1920 

GACAGTCAAA CCCGCCATCC AATGCAACTT TAAATATTTT TTGTCCAAAT TTATTTTTTA 1980 

AATGGTAATT CCATGTGTGA TAACGTTTGT TTTCAAAAGC GTATTGGAAA TGATTGCCCA 2040

TATGTCATTT TCCTTTCTAT AAAAAAAGAG TTCTAAGTAC AGATTTTAAC ATATTTTAAT 2100 

GTTATAGTGT TTATTATAGT TTGACAAAAA AGAGAGAGGA ACTATGAAAT ATGAATATAC 2160 

CTAAATCAGT CTGGTGGCTA GTAATTGGCA TGGCGTTAAA TATTACTGGT TCCAGTTTTT 2220 

TGTGGCCTTT AAATACAATT TATATGAAAC AAGAACTTGG AAAAAGTTTA ACTGTTGCTG 2280 

GTTTAGTGCT AATGATAAAT TCATTTGGCA TGGTTATTGG AAACTTATTA GGTGGTTCAC 2340

TATTTGATAA ATTAGGTGGA TACAAGACGA TTTTAATTGG AACTTTCACT TGTCTTTGTA 2400 

GTACAACGCT ACTTAATTTC TTTCACGGGT GGCCTTGGTA TGCTGTATGG CTTGTAATGT 2460


TAGGGTTTGG TGGCGGAATG ATTATTCCTG CGATATACGC TATGGCTGGA GCAGTGTGGC 2520 

CAAATGGCGG AAGACAAACG TTTAATGCGA TATACTTAGC GCAAAATATT GGTGTGGCTG 2580 
ATCTTATTAT GTATGTTGTG TTTGCGCTTG TCGCGGTAAC GCAATTTAAT ATTGAAATTA 2700

ATGCGAAAGT TAAATATCCA ACTCATTTAG ATATTACTGG TAAAAAGAAT AAAGCAAGAT 2760 

TTATTTCATT AGTACTAATT TGTGCAATGT TTGCAATTTG TTGGGTTGCA TATATTCAAT 2820

GGGAGTCTAC AATCGCTTCA TTTACACAAT CTATTAATAT TTCAATGGCA CAATATAGTG 2880 

TTTTATGGAC AATTAACGGA ATAATGATTT TAGTAGCACA ACCATTAATT AAACCGATTC 2940 


TCTATCTGTT AAAAGGAAAC TTAAAGAAGC AAATGTTTGT CGGCATCATC ATTTTTATGT 3000 

TGTCGTTCTT TGTCACGAGT TTTGCCGAAA t ACTTTACAAT ATTTGyTGTC GGTATGATTA 3060

TTTTAACTTT TGGAGaATGT TTGTATGGCC AGCAGTTCCA ACTAT 3105 

(2) INFORMATION FOR SEQ ID NO: 530: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5532 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 530:






TAATGATTAA ACCAGATGCA GTACAAAGAA ATCTAATTGG TGAAGTAATT TCAAGAATTG 60

AAAGAAAAGG ACTAAAACTT GTCGGTGGTA AATTAATGCA AGTACCAATG GAACTTGCTG 120

AAACACATTA TGGTGAACAC CAAGGTAAAC CATTTTATAA TGATTTAATT TCATTTATTA 180

CATCAGCACC AGTGTTCGCA ATGGTAGTTG AAGGTGAAGA TGCAGTTAAT GTATCTAGAC 240

ATATTATTGG CAGCACCAAT CCTTCAGAAG CTTCACCAGG ATCAATTAGA GGTGATTTAG 300

GTTTAACTGT TGGTAGAAAT ATCATTCACG GTTCAGATTC ATTAGAGTCT GCTGAACGTG 360

AAATTAATCT ATGGTTTAAT GAAAATGAAA TTACTAGCTA TGCTTCACCA CGTGATGCAT 420

GGTTATATGA ATAAAATATA AACTGTAAAC CTTTACGATT TATTTATAAA GGTAGAAAGG 480

GTTTTGTTAT GTGGTTAGTC ATTATGaTTA TACATAACAA GGCCCGTTTT TTATGTTGTA 540

GTAAATTACT TGAAAAATTT TATAGTTTTT kGGTAAcACG TAtTaAAAaG AGAGGAATAT 600

TCTTTATCAA ATGAAACTAA ACAGAGAGAA GGGGTTGTTA AAATGAAgAA TATTATTTCG 660

ATTATTTkGG GGATTTTAAT GTTCTTAAAA TTAATGGAAT TACTATATGG TGCTATATTT 720

TTAGATAAAC CACTTAATCC TATAACAAAA ATTATTTTTA TACTGACTCT CATTTATATT 780

TTTTATGTAT TAGTAAAAGA ATTGATTATA TTTTTGaAGT CAAAGTATAA CAAAAGCGCT 840

TAACATATGT ATATTTTAAT ATCATAATTT TTTTAAACGG ACTGATTAAC TTTATTAATA 900
GATACGATTA TATTAAAACG GCTAATCATT TTTAATTAAT GATTATATGA TGCAACTGTT 1020

TAGAAATTCA TGATACTTTT CTACAGACGA ATATATTATA ATTAATTTTA GTTCGTTTAA 1080

TATTAAGATA ATTCTGACAT TTAAAATGAG ATGTCATCCA TTTTCTTAAT TGAGCTTGAA 1140

AACAAACATT TATGAATGCA CAATGAATAT GATAAGATTA ACAACATATT ATAATGTTAT 1200

CGTGGAAGTA TGAAAGGAGC GAGTGTGTAT GAGATACCTA ACATCAGGAG AATCACATGG 1260

ACCTCAATTA ACAGTTATTG TTGAAGGTGT ACCTGCAAAT ATAGAAATTA AGGTTGAGGA 1320

TATTAATAAA GAAATGTTTA AGCGTCAAGG CGGTTACGGA CGTGGACGTC GTATGCAAAT 1380

TGAGAAAGAT ACAGTAGAAA TAGTATCAGG CGTTAGAAAT GGTTATACAT TAGGTAGTCC 1440

AATTACTATG GTTGTAACCA ATGATGACTT TACGCATTGG AGAAAAATTA TGGGAGCAGC 1500

TCCAATAAGT GAAGAAGAAC GTGAAAATAT GAAACGTACT ATTACAAAAC CAAGACCTGG 1560

TCATGCAGAT TTGGTTGGAG GTATGAAATA TAATCATCGT GATTTACGAA ATGTGCTAGA 1620

GCGATCATCT GCTAGAGAAA CAGCAGCTCG AGTTGCAGTC GGTGCCTTAT GTAAAGTGTT 1680

ATTACAACAG TTAGATATCG ATATATACAG TCGTGTTGTT GAAATAGGTG GAATTAAAGA 1740

TAAAGATTTT TATGATTCAG AAACATTTAA AGCAAATCTT GATCGTAATG ATGTTCGTGT 1800

AATTGATGAC AGTATCGCAC AAGCAATGCG AGATAAAATT GACGAaGCTA AAAATGAAGG 1860

AGATTCAATT GGCGGTGTCG TTCAAGTTGT AGTTGAAAAT ATGGCTGTTG GTGTAGGTAG 1920

TTATGTGCAT TATGATCGTA AGTTAGATGG TAAGATTGCA CAAGGTGTTG TCAGCATAAA 1980

TGCTTTTAAA GGTGTAAGCT TTGGTGAAGG ATTTAAAGCA GCTGAAAAGC CAGGTAGTGA 2040

GATTCAAGAT GAAATTCTAT ATAATAGTGA AATTGGTTAT TATCGTGGAT CTAATCACTT 2100

AGGTGGTTTA GAAGGCGGTA TGTCAAATGG AATGCCAATT ATCGTTAATG GTGTAATGAA 2160

ACCAATTCCA ACGTTATATA AACCATTAAA TTCAGTAGAC ATTAATACTA AAGAAGACTT 2220

TAAAGCAACA ATTGAACGTT CTGATAGTTG TGCTGTTCCT GCAGCAAGTA TCGTCTGCGA 2280

ACATGTCGTA GCATTTGAAA TAGCAAAAGC ATTATTGGAA GAATTCCAAT CAAATCATAT 2340

TGAGCAACTT AAACAACAAA TTATTGAGCG CAGACAATTA AATATTGAGT TTTAACAACA 2400

AGAACAATTG AGGTGTAATC ATGAAATTAC AAACAACATA CCCTTCAAAT AATTATCCAA 2460

TATATGTTGA ACACGGTGCA ATTGACCATA TTAGCACGTA TATTGATCAG TTTGATCAAA 2520

GTTTTATATT AATTGACGAG CATGTAAATC AATATTTTGC TGATAAATTT GATGATATTT 2580

TATCATATGA AAATGTACAT AAAGTTATTA TTCCAGCTGG TGAAAAGACG AAAACATTTG 2640

AGCAATATCA AGAAACATTA GAGTATATTT TATCCCATCA TGTAACTCGT AATACAGCAA 2700
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ACGAGGCGTG CACTTTATAC AAGTGCCAAC GACTATACTA GCGCATGATT CTAGTGTTGG 2820 

CGGTAAAGTG GGTATTAACT CAAAGCAAGG TAAAAACCTT ATCGGTGCAT TTTATCGTCC 2880 

AACTGCTGTG ATTTATGATT TAGTCTTTTT AAAGACGTTA CCATTTGAGC AAATATTAAG 2940

TGGCTATGCA GAAGTTTATA AGCATGCGTT ATTGAATGGT GAATCAGCGA CGCAAGATAT 3000

CGAACAGCAC TTTAAAGATA GAGAGATATT ACAGTCATTA AATGGTATGG ATAAATATAT 3060

TGCTAAAGGT ATTGAAACGA AGCTGGATAT TGTTATTGCA GATGAAAAAG AACAAGGTGT 3120

ACGTAAATTT TTAAATTTAG GTCATACATT TGGTCATGCT GTTGAATACT ATCATAAAAT 3180

ACCTCATGGT CATGCAGTGA TGGTTGGCAT TATCTATCAA TTTATAGTTG CGAATGCTTT 3240

GTTTGATTCT AAGCATGATA TTAATCATTA TATTCAATAT TTAATACAAC TCGGCTATCC 3300

TTTAGACATG ATAACTGACT TGGATTTTGA AACGTTATAC CAATATATGC TAAGTGATAA 3360

AAAGAATGAT AAGCAAGGTG TACAAATGGT CTTGATTAGA CAATTTGGAG ATATCGTTGT 3420

ACAACATGTT GATCAACTAA CATTACAACA TGCATGTGAA CAATTAAAAA CATATTTTAA 3480

GTAGGTGAAT GAAATGGTAA ATGAACAAAT CATTGATATT TCAGGTCCGT TAAAGGGCGA 3540

AATAGAAGTG CCGGGCGATA AGTCAATGAC ACACCGTGCA ATCATGTTGG CGTCGCTAGC 3600

TGAAGGTGTA TCTACTATAT ATAAGCCACT ACTTGGCGAA GATTGTCGTC GTACGATGGA 3660

CATTTTCCGA CTGTTAGGTG TAGAAATCAA AGAAGATGAT GAAAAATTAG TTGTGACTTC 3720

CCCAGGATAT CAATCTTTTA ACACGCCACA TCAAGTATTG TATACAGGTA ATTCTGGTAC 3780 

GACAACACGA TTATTGGCAG GTTTGTTAAG TGGTTTAGGT ATTGAAAGTG TTTTGTCTGG 3840

CGATGTTTCA ATTGGTAAAA GGCCAATGGA TCGTGTCTTG AGACCATTGA AACTTATGGA 3900

TGCGAATATT GAAGGTATTG AAGATAATTA TACACCATTA ATTATTAAGC CATCTGTCAT 3960

AAAAGGTATA AATTATCAAA TGGAAGTTGC AAGTGCACAA GTAAAAAGTG CCATTTTATT 4020

TGCAAGTTTG TTTTCTAAGG AACCGACCAT CATTAAAGAA TTAGATGTAA GTCGAAATCA 4080

TACTGAGACG ATGTTCAAAC ATTTTAATAT TCCAATTGAA GCAGAAGGGT TATCAATTAA 4140

TACAACCCCT GAAGCAATTC GATACATTAA ACCTGCAGAT TTTCATGTTC CTGGCGATAT 4200

TTCATCTGCA GCGTTCTTTA TTGTTGCAGC ACTTATCACA CCAGGAAGTG ATGTAACAAT 4260

TCATAATGTT GGAATCAATC CAACACGTTC AGGTATTATT GATATTGTTG AAAAAATGGG 4320

CGGTAATATC CAACTTTTCA ATCAAACAAC TGGTGCTGAA CCTACTGCTT CTATTCGTAT 4380

TCAATACACA CCAATGCTTC AACCAATAAC AATCGAAGGA GAATTAGTTC CAAAAGCAAT 4440

TGATGAACTG CCTGTAATAG CATTACTTTG TACACAAGCA GTTGGCACGA GTACAATTAA 4500
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AAACTTGTTA GGGTTTGAAT TACAACCAAC TAATGATGGA TTGATTATTC ATCCGTCAGA 4620 

ATTTAAAACA AATGCAACAG TTGATAGTTT AACTGATCAT CGAATAGGAA TGATGCTTGC 4680 

AGTTGCTTCT CTACTTTCAA GCGAGCCTGT CAAAATCAAA CAATTTGATG CTGTAAATGT 4740 

ATCATTTCCA GGATTTTTAC CAAAACTAAA GCTTTTAGAA AATGAGGGAT AATATAAAAT 4800 

GGAAGATATC TATAAATTAA TAGACGATAT CAATCTACAA AAACTAGAAA ATTTAGACTC 4860

TCGTGTTAAT GAAGCAATAA CTACTGACAA CGATGACGCA TTATTTATTC TAGGAGAGAC 4920

ACTTTACAAT TTTGGATTAA TGCCaCAAGG TTTGGAAGTA TTCCGCGTGT TATATCACAA 4980

ATATCCAGAC GAAAGTGAAT TGCTGATTTA TTTTATTGAA GGTTTAATGT CTCAAAATCA 5040

AACTGACGAA GCGTTAGAAT ATTTAAGTTA TGTTGAACCA TCACCTGAAA AGTTGATGTT 5100

AGAAGCAGAT TTATATCAAC AAATTAATAT GATGGAAGTT GCTATTGATA AATTACAAGA 5160

AGCACTTGAA CTAGAGCCAA ATGATCCAAT AATCCATTTT GCATTGGCTG AAATGTTATA 5220

TTATGATGGT CAATATTTAC GTGCTACCTC TGAATACGAA ACCGTTTTAG AAACTGGTGA 5280

ATATCAAGTT AATGGTGTAA ACTTATTCTC TCGTATGGCA GATTGTAGTT TACAAAGTGG 5340

kAACTATAGT GATkCcGATt CgCTTATACG ATGrAATTAA TGAAGATGAA ATGACTTCAG 5400

AAGATTATCT CAAAAGAGCC ATTTCTnACG ATAAAAATGA CATCACTCAA GAAGCAATTA 5460

AAATAATGAC TACATTACTT TCTAAAGATC CTGATTATAT TCAAgGCTAC TTGTATTTAC 5520

aATCaTTATA TG 5532 

(2) INFORMATION FOR SEQ ID NO: 531: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 942 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 531: 

AATTGGGTTA TACTATAGGT AAATTTAAGG AGGTAAGAAA ATGGATAAAA AAGAATTAGC 60 

GAAATTTATA GGCAATAAAA TCAGATACTA TAGAACCaAA TTGaACTTAA CTCAAGATCa 120

ACTTGGAGAA AAACTCmACa CTAAAAArGC TACTATTTCA AATTATGAGA CAGGGTACAG 180

AACTCCTAAA CAAGATGATT TGTTTGAAAT TGCTCATATT TTAAATATCA GTATCGATGA 240

TTTGTTTCCT ACAAGAAATA ATAAAAAAAA CGACATCACT TCCATATACA ACAAACTCAC 300 

ACCTCCCCGC CAAGAAAACG TACTTAACTA CGCAAATGAG CAATTAGATG AACAGAATAA 360 
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AACTGGTGCT GGCATAGGAG AAGAATTATA TGATGACATA TTGCATGAAG AAGTATTTTT 
TAAAGAAGAC GAAACGCCAT CAAATGCTGA TTTTTGTATT TTAGTTAATG GTGATTCAAT 
GGAACCTATG TTAAAACAAG GAACATACGC TTTTATTAAG AAAGAAGATT CTATTAAAGA 
TGGTACAATT GCACTCGTTG TATTAGATGG AGTAAGTCTT ATCAAGCGTG TAGATATATG 
CGAAGACTAT ATTAATTTGG TATCTCTAAA TCCGAAGTAT GATGATATCA AAGTCGCTTC 
GTTTAGTAAT ATTAAAGTAA TGGGCAAAGT TGTATTGTGA TTAATAGCGC CTATATGGCA 
CTTTAATATA AAAGACGTCT ATTTCAGCAG TGTTTAAAAG GAGTTTATAA TGAAAATAAC 
TAATTGCAAA ATAAAAAAAG AAACTATAGT ATATGAAGTT TTAACTAGTG GTAATCAACC 
ATTCACTTAT GAGTTACCTA AAGATTTATC GTCACATAAT GC
(2) INFORMATION FOR SEQ ID NO: 532: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 417 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 532: 
TGGnAAATGC AAACCAAAAT ATGaTCCTCG TGcAAGTTGA AGCGGGACGT TATGAAGAAT 
GGGTAAAGAA TGGTTATTTT AAACCGTCAG AAGATAAATC AAAAGAAACA TATACAATTG 
TTATCCCGCC ACCAAATGTA ACTGGTAAAT TACATTTAGG ACATGCATGG GATACGACTT 
TACAAGATAT CATTACACGT ATGAAACGTA TGCAAGGATA CGATACGTTA TACTTACCAG 
GTATGGATCA TGCTGGTATT GCGACACAGG GAAAGGTAGA AGCTAAATTA AATGAACAAG
GAATAACTAG ATATGATCTT GGTCGTGAAA AGTTTTTAGA ACAGGCATGG GATTGGAAAG 
AAGAGTATGC GTCATTTATT CGTGCGCAAT GGGCTAAATT AGGTCTAGGT TTAGATT 
(2) INFORMATION FOR SEQ ID NO: 533: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 533: 
GATCCTGAAC CCGCATTTGT TTCCACTAAA ACAGTATGCC CACTTTCTAC TAAAGCGTGC 
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ATTTTCATAC 


CATCCACCTC CATAATCATC TTAACGCGAA CATTTTGAAA GCGCAATCAA 180

AAATCCACAA AATTGTAAAG GTTATTACAC TGACTTTTCC GAAAATTGTG GTAAAATATA 240

ATTAAGAAAG AACAAGGAGG CACTTACTAT GATTACTTAC AAAAATATTT TAATCGCAGT 300

TGACGGTTCA CATGAAGCGG AATGGGCATT TAACAGAGCA GTTGGTGTTG CTAAACGTAA 360

CGATGCGAAG TTAACAATTG TGAATGTAAT TGATTCAAGA ACGTATTCTT CTTATGAAGT 420

TTATGATGCT CAATTTACTG AAAAATCTAA GCATTTTGCA GAAGAATTAT TAAATGGTTA 480

TAAAGAAGTA GCTACTAACG CTGGTGTTAA AGATGTAGAA ACGCGTCTAG AGTTTGGyTC 540

TGyTAAATCT ATCATTCCTA AAAAGCTTGC ACATGAAATT AATGCAGACT TGATTATGAG 600

TGGTACATCA GGCTTAAATG CCGTGGAAAg ATTTATTGTT GGTTCTGTAT CAGAATCTAT 660

CGTTCGTCAT GCGCCATGTG ACGTGTTAGT TGTTCGTACT GAAGAGTTAC CAGCAGACTT 720

CCAACCACAA GTT 733




(2) INFORMATION FOR SEQ ID NO: 534: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 6060 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 534: 








TGATGATCCT GAAGCGCTAT TGGATAATTA CAACACTGAA GATGTTGATG CACACAATTA 60

CAATAATATA AATCATGTTA TTTTTGCCTG CGATGCGGGT ATGGGTTCTA GTGCAATGGG 120

TGCArGCATG TTACGTAATA AATTTAAAAA GGCGGGCATT AATGATATTA CAGTTACAAA 180

TACTGCGATT AATCAATTGC CAAAAGATGC TCAATTAGTT ATCACTCAGA AAAAACTAAC 240

TGATCGTGCT ATTAAACAAA CACCAAATGC CATCCATATT TCAGTGGATA ATTTCCTTAA 300

TTCACCAAGA TATGAAGAAC TTTTAAATAA TCTAAAAAAA GATGATCAAG CATAATAATT 360




AAATAAATTA AAAAATGGAG GATACCGCCA TGTTATTGAG TACACGTGAA AAAGAAATGA 420

TAGCCCtAtT GATTAAGTAC CACGGtCAAt ATATCACTAT ACACGACATT GCTCAGCAAC 480

TTGCGGTGTC CTCTCGTACT ATTCACCGTG AATTAAAAGG TGTTGAAGCA TATTTAACTT 540

CATTTTCATT AACTTTAGAA CGCGCAAACA AAAAAGGGcT ACGCATTGCT GGCACAGATT 600

CTGATTTAAA CGATTTGAAG CAATCGATTG CACAACATCA AACCATTGAC TTATCTGTTG 660

AAGAGCAGAA AGTAATTATT ATATACGCTT TGATACAAGC CAAGGAGCCA GTTAAACAAT 720
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TAGAGCTTGA TTTAAATAAG TACCAACTAT CTTTATCTCG AAAGCGTGGC GAAGGCATTT 840 

ACTTGGTAGG TACTGAATCA AAGAAACGTG AATTTTTAAG TCAATTAATG GTGAATAACT 900 

TAAATAGTAC TAGCGTTTAT TCAGTAATTG AAAATCATTT TGTCTTTCAT TCATTAAATC 960 

AAATCCACAA AGACTTTGTT GACTTAGAGC GCATTTTTAA TGTTGAAAGA CtATTaATGG 1020

ACTACCTAAG TGCCTTACCC TACCAACTTA CCGAATCAAG TTATTTAACT TTAACTGTCC 1080 

ATATCGTGCT CTCCATTTCA CGTATAAAAA ATGGAGAGtA TGTCGCATTA AACGATGATA 1140

TTTATGATTC TGTACAAAAC ACATTTGAAC ACAAAGTaGc AAGCGAACTT GCTGATAAAC 1200 

TTGGTCAAAT ATATGACGTC ACGTTTAATC AGGCAGAAAT TGCTTTCATT ACTATCCATT 1260 

TACGTGGAGC TAAACGAAAA AATCTTAATG ATACATCATT AAATAATCGT TGTGAAGAAA 1320 

ACAAAATTAA AGCGTTTGTT AACAAAGTAG AAATGATTTC CGGTATGACA TTTGCAGATT 1380 

TGGATACTTT AGTAGATGGA CTGACGCTAC ACCTTAATCC TGCAATCAAT CGTTTGCAAG 1440

CTAATATCGA GACCTATAAT CCGTTAACAG ACATGATTAA GTTCAAATAT CCAAGACTAT 1500 

TTGAAAATGT AAGATTAGCT TTAAATGATT GTTGGCCTGA TTTGATTTTT CCAGAGAGTG 1560 

AAATTGCTTT TATAGTtTTA CACTTTGGTG GCTCGATTAA AAACCAAGGT AATCGATTTT 1620

TAAACATATT AGTCGTTTGC AGCAGTGGTA TGGGAACTAG TCGTCTATTA TCAACTCGTC 1680 

TAGAGCAAGT TTTTAGTGAG ATTGAGCGTA TTACACAAGC ATCAGTCAGC GATTTGAAGT 1740 

CACTAGATTT AAGTCAATAT GATGGCATTA TTTCTACTGT GAATTTAGAC ATCGACTCCC 1800

CCTATTTAAC GGTAAACCCA TTATTACCAG ATAGTGATAT CAGTTATGTC GCACAGTTTT 1860

TAAATACAAA GTCTACGTTC CAAGAGACGC ATGATAAATC ATCAAACATG ATTGATAAGG 1920 

ATGATGTTCA TGTTGAAACG AAAGATGTTG ATGGCAACAC ATCTTTTGAA AATGAACAAA 1980 

CTTCATACTT AACTTCAGTT TTCGAAAAAC ATTTAAGTGA CGAAAAATCA GAACAATTAT 2040 

TGCATCATAT GCGTTCGGGT TTAACTTTGC TTGATTCAGT GAAAATAGTT AGTACCGAAG 2100 

TTAAACAGTG GCAAACATAT ATCGCAGATT ATCTATATCA ATGCGATGTA ATAAACGATG 2160 

CAACGTCATT CGCTGAACTA CTAGAGCAAC GATTGATTGA CAATCCAGGA TGGATATTAA 2220 

GTCCATATCC TGTTGCAATA CCACACCTAA GAGACAATAT GATTAAACAC CCTATGATTC 2280

TAATCACAGT TTTAGAAGAA CCGTTAACAT TGCCTAGTAT TCAAAATGAC AATCAAACAA 2340 

TTAAATATAT GATTTCCATG TTTATTTCTG ACAATGATTT TATGGCATCA CTGGTAAGTG 2400

ACTTGTCCGA ATTTTTAAGT TTGAAATTAG AATCTATTGA TACTTTTATG GAAAATCCAC 2460

AGGAACTTGA AACATTATTA AGAAACAAAT TTTTAGAACG AATTAAAAAA CAATTTATTT 2520 
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TAACAGCCAA AATGAAGCAA TTGAAAAAGC 
AGGTAAAGCC TTAGTTGATA GTGGTGCTGT 2640

AACAGATGCT TATATTCAAG CAATGAAAGA TCGTGAGCAA GTCGTATCAA CATTTATGGG 2700

AAATGGCTTA GCAATTCCTC ATGGCACAGA TGAAGCTAAA ACAAATGTGA TTCACTCAGG 2760

TTTAACATTA TTACAAATCC CTGAAGGCGT TGACTGGGAT GGCGAAGTAG TTAAAGTTGT 2820

CGTGGGAATT GCTGGTAAAG ATGGCGAACA TTTAGACTTG TTATCTAAAA TTGCAATTAC 2880

ATTTAGCGAA GAAGAAAATG TGGATCGTAT CGTTCAAGCA AAATCTGCAG AAGAAATTAA 2940

ACAAGTATTC GAGGAGGCAG ATGCATAATG AAAGCAGTTC ACTTTGGTGC TGGTAACATA 3000

GGTCGTGGTT TCATTGGTTA TATTCTgCAG ACAACAATGT TAAAGTAACA TTTGCAGACG 3060

TCAATGAAGA AATCATTAAT GCTTTAGCTC ATGATCATCA ATACGATGTT ATTTTAGCTG 3120

ATGAGTCTAA AACAACGACG CGCGTGAAtA ATGCTGATGC AATTAATTCA ATGCAACCTT 3180

CTGAAGCGTT GAAACAAGCA ATTCTAGAAG CTGATATTAT TACAACAGCT GTTGGTGTTA 3240

ACATACTACC TATTATTGCT AAATCTTTTG CGCCTTTCTT AAAAGAAAAA ACAAACCATG 3300

TTAATATTGT TGCTTGTGAG AATGCTATTA TGGCAACTGA TACATTGAAA AAAGCAGTAC 3360

TTGATATTAC TGGCCCTCTT GGTAACnaTA TTCATTTTGC TAACTCAGCA GTTGATAGAA 3420

TTGTACCATT ACAAAAGAAT GAAAATATAT TAGACGTTAT GGTTGAGCCA TTTTACGAAT 3480

GGGTTGTTGA AAAAGATGCA TGGTATGGTC CAGAACTAAA CCATATTAAA TATGTTGATG 3540

ATTTAACACC ATATATTGAG CGTAAATTAT TAACTGTGAA TACAGGACAT GCATATTTAG 3600

CGTATgCTGG tAAATTTGCA GGTAAAGCTA CAGTTTTAGA TGCAGTTGAA GATAGTTCAA 3660

TTGAAGCTGG CTTACGCCGT GTTTTAGCTG AAACTAGTCA ATATATTACT AATGAATTTG 3720

ATTTTACTGA AGCGGAACAA GCTGGTTATG TTGAAAAAAT AATAGATCGT TTCAACAATT 3780

CTTATTTATC TGATGAAGTA ACACGTGTCG GACGAGGTAC ATTACGTAAA ATTGGCCCTA 3840

AAGATAGAAT TATAAAACCA TTAACATATC TTTATAATAA AGATTTAGAA CGCACTGGTT 3900

TATTAAATAC AGCTGCATTG TTATTGAAGT ATGATGATAC AGCAGACCAA GAAACTGTTG 3960

AGAAAAATAA TTACATTAAA GAACACGGTT TAAAAGCGTT TTTAAGTGAA TATGCTAAAG 4020

TTGACGATGG CTTAGCCGAT GAAATAATTG AAGCGTACAA TTCACTTTCA TAATTTATTG 4080

AGCTTTGTTT GAAACAAGAA GTTTCCAACG TTATTCGTTA ACAATCAGTA ATAATGTAGT 4140

AGTTCCCTTG AATTAACAAT ATTAAATTTC TGAACATAAA AAATACTCCC TTCAACATAG 4200

ACACTTAACT TGTGTTATGT ATGAAAGGAG TATTTTTGCG TTAATAATTT GTTTTATTTT 4260

CGAGCCACAG CCACCTATTC AATGGCTATT GGTCATTACT AAAACAAATT CATATTAACT 4320
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TTGAATAAAT TTTATTCTTC AGTTTGTTGG TCTTTCTTAG TGAATCTTCT AATTAAGAAT 4440 

GCCATACCTG CACCTAGAGC TAATTCAGCA TATGGTAAAT CGTCATTATG TGACATACCA 4500 

GTATCTGGTA AAGTTTTAGC TTGTTGTTTA GCTTTATTAA CTTTTCCTTG TTGAGCTGAT 4560 

TTTGTCTTAG CTTGGTGGTC GTCAGTGTTA GTTACATTAA GCATATCTTG ATTAGCACTA 4620 

TTGCTTCCAT TTGAAACTGT AGCTGGAGAT GCATTGGCAC CGTCGTTTTG CGTAgyTTTA 4680 

TTGTTTGCAG CTGAACCAAC TGATTTTTGC GTATCATTAG TATCTGCTGT TGCCGTATCA 4740

TCTTTTTGGC TAACATTAGT TGAAGTCATT TTTTCTTTTG CTTCAGAAGA TGCAGATGTT 4800

GATGGTTTAT TCGAAACTTC AGTATCAGCT TTGCTTGGCG ATTTATCTGC TTCGTTAGAT 4860

GCAACGTTAG TTTCAGACTT AAGTTGTCCT GCATCAGTTT GATTTGTCGT ACTTTCTTCT 4920 

TTATCTTTTG ATGTATTAGA AGGTACATTT GGTTCTGTTA TGTCTGCTGA AGGCAATGTT 4980 

TCAGTTGTtG ATTCAACCAT ACTTTGATTT GTTGAATCAC TACCATCTTT TTCTGCCTTA 5040

GCTTTATTTT CAGATTTTGG TTGTGCAACC TTGTCATTAG TTGATTGAGA TTCAGCACTA 5100 

TTATTTACTT CAGCATTTTG TTTTGAATCA TTTACAGATG CATTATCTTT GCTATCAGCA 5160

GATGATGCTG CTTCTGTGCT CGCAGTTGTT GGAGCCGTTG CTGTTGATCC TGTTGGTGCA 5220

TTCTCGTTTG TTGCTGTAGT TGTACTATTG TTATTTGTTG TGCTTTCTGC TGGCGTTGCA 5280

TTATCAGTTT CTGTTACAGG TTTATCAGTT GTGCCGTTAT TAGTTGATTC TACTTCTGGT 5340

TTACTAGTTA CATCGTTATC CATTGTCGGA CTGTTTGTTG ATGCATCTAC ACTAGAATTG 5400

TTATTAGCTT GCGGTTTATC ATTTGCATCA TCAGTTGCTG ATGTTGCTGT TGTTTCACCT 5460 

GTTGCCGCAT CACTATTATT TGGTGTTGTC GGAGAAGCGT CTGCTTTGCC ATTAGCTGTC 5520 

GTCTCAGATA CGTTAGGTTG TCCAGTATTT TCTGGTGTTG CATTAGCATT TGAATTTGCT 5580 

GTTGCATCAT TATTATCTAT ACCATTATTA GTATCATTAG CATCTGGATC ATTCTGAGGC 5640 

ACAATCGCTT CAATTGCAGG TATCGTTACA TTTTGTAATT CAGCAACTTC TGCATTTGTT 5700 

TGTGTTTTAT CTAATTTATC AGCAAATCTG TCAAAATATC TACCTAAATC CGTACGTGCA 5760 

ATTTCTTTCG CCGATGCATC TGCATCTGCA TTTTTAATTA TTTCTATTTG CTTGTTAACC 5820 

ACTTCTCTGA TTGCTTCCAA AGCATTTTTC TTAACTTCAG GATTAATACG TTGTGCTTTA 5880

AGTTGTTCAA GCGCACTATT TTTGACAGTA GCGATTTCTG CATTTGTAGT TTGATCAGAA 5940 

ATATCTTCAG TTGCTTTTGA TAAAATGTCT TCTAAAGCAT TCGTAAACGC TTCTTTTTCT 6000 

TCAGTTGTAG CATCAGCGTT GACATTTACA CCTGCTTCAA TCTGGTCTAG TGCAGTTTCT 6060

(2) INFORMATION FOR SEQ ID NO: 535:

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(A) LENGTH: 977 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 535: 



AACAAAGCCT TCCAATTATC TGCGTCGGTA GAACAAGTAT TAGCAACTTT ATCACCTACG 60

CTAAACAGTC CTTACGATTT ATACGGCACG ACAAAAATGC TAGATATTAC ATTCGATTCA 120

TTTGAACATG ATGGTACAAC GTACCCTGTC GACTATGCTA CGTTTGAAAA TGATTATGAA 180

GATAATAAAG ATCCTGAGTT TAGACGTAAA AGTTTCAAAT CGTTTAGCGA TGGGATTCGA 240

AAATATCAGC ATACTACCGC GGCTACATAT AATATGCAAG TACAACAAGA AAAAATTGAA 300

GCTGATTTAC GTGGATTTGA ATCAGTCATC GATTATTTAT TACATAGTCA AGAAGTAACG 360

CGTGATATGT TTGACCGTCA AATCGATATG ATTATGCGTG ACTTGGCACC AGTTATGCAG 420

AAATATGCTA AACTTTTACA ACGTATTCAC GGATTAGATA ACATGCGTTT TGAAGACTTG 480


AAGATTTCTG TAGACCCTGA TTATGAACCA GAGATTTCAA TTGAAGACTC AAAAAATTAT 540

ATTTTCGGTG CGTTAAGTGT TTTAGGTGAT GACTATACAA ACATGTTACG TGAAGCATAC 600

GATCAGCGAT GGaTTGATTT TGCACAAAAT AAAGGTAAAG ATACAGGCGC ATTTTGTGCA 660

AGTCCATACT TTACACATTC ATATGTGTTT ATTTCTTGGA CTGGTAAAAT GGCTGAAGCA 720

TTTGTCTTAG CACATGAATT AGGTCATGCA GGTCATTTTA CATTAGCTCA AAAACATCAA 780

CCATATCTTG AATCAGAAGC ATCAATGTAC TTTGTTGAAG CCCCTTCTAC AATGAATGAA 840

ATGTTGATGG CCAATTATTT ATTTAACACA AGTGATAATC CAAGATTTAA GCGTtGGGTT 900

ATTGGCTCAA TTTTATCTAG AACATATTAT CATAATATGG tACCCmTTTA TTAGAAGCnG 960

CTTATCCACG GGGAGTG 977

(2) INFORMATION FOR SEQ ID NO: 536: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 536:

AGACAGTGAT yGaATTTCAT TTACAGTACA CAAATCATCG AAAAATTGGT AACATTCTTC 60 
TCTATTTTCT AACGTTAgwA TTgCATCAAA CAATTCATCT AACGCTGCAC CTCGTAATTT 120 

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5 






AATCTATTAA AGTGTAGCGa TTTATATTTT ATTAAATCTG AATCGTTACT TTATTTAATT 240

TTATGCTAAT CCAGCGCGTT CGAAAATAGT GTCAACTTGa TTCAAATGAT GTTTAGGATC 300

GAAACATTCA TCCAATTCTT CTTTTGTTAA AACACTTGTA ATAGACTCAT CTTGTTCGAT 360

TAATTCACGG AACGGTGTTT TCG7TTCCCA AGATATCATC GCTTTTGGTT GTACTTTGTC 420

GTATGCTTCT TCACGAACCA TACCTTTATT AATTAATGCT AATAAGACAC GTTGTGAGAA 480

AATCAGACCA AATGTTTTAT CTATGTTATT ACGCATATTA TCTTCAAATA CAGTTAAACG 540

GTCCACAATA TTTGTGAACG ATTCAATGCA TAATCTAGTG CtATTGTAAC ATCTGGTAAC 600

ATAATACGCT CAGCAGAAGA ATGAGAAATA TsTCTTTCAT GCCATAATGG CACATTCTCA 660

TAAGCTGTAG TAATATAACC ACGAATGACT CTTGAAATAC CTGTGATATT TTCAGAACCA 720

ATTGGATTTC GTTTATGAGG CATTGCAGAT GAACCTTTTT GGCCTTTTGC AAATGCTTCT 780


TCAACTTCTC TCGTTTCGGT TTTTTGAAGG TTACGTATTT CAACGGCAAA TTTTTCTAGT 840

GATGTCGCGA TTAATGCTAA TGTCGCAATA TAGTATGCAT GTCGATCGCG TTGCAATGTT 900

TGCGTTGATA CAGGCGCTGT GCCAATACCT AAATGTTTAC ACACATAACT TTCTATTTCA 960

GGAGGAATG7 TAGCAAAAGT ACCTACTGCA CCACTCATTT TCCCTACTTC AATTTCTTCT 1020

CTTACTTGTT TGAAACGTTG TAAGTTACGT TGCATTTCCG TGTACCACAA TGCCA'ITTTG 1080

ACACCAAATG TAGTTGGTTC TGCATGCACT CCATGTGTAC GTCCCATCAT CAATGTATAT 1140

TTATAATTTT TTGcj."i"i"i"i 1 C AGCTAAAACG TCGATAAATC TTTCTAAATC TTTTTCAATA 1200

ATGTCATTTG CTTGTTTAAT AACGAAACTT AAAGCTGTAT CTACAACATC AGTAGAAGTT 1260

AAACCATAAT GTACcACTTA CGTTCTTCAC CTAGCGTTTC AGAAACTTGT CTAGTAAAGG 1320

CTACAACATC ATGGCGCGTT TCTTGTTCAA TTTCTTGTGC ACGTTCGACA TTTACCTTTG 1380

CGTTTTGACG AATTTTTTGT ACGTCAGCTT TCGGTATATG TCCTAATTCA CTCCATGCTT 1440



(2) INFORMATION FOR SEQ ID NO: 537:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 784 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 537:
GATAAATCTA TnCAGTTTCC GTCCAAATAT CtGCaCCTAA AGCTTTTAaG TGTTCTACAA 60
TATCTgTATA ACCTCTATAA ATATGTTTAA CATTGTAAAT TGTAGTTACA CCCTCAGCAA 120

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10 



CATGTAATGT TGATGGTTTT ATCGTTGCTG TGCCTTCGTC AACTTCAATA TTTGCACCCA 240 

TGCGCTTTAA TTCTTCAACA TGTTTAAAAC GCTCCGGATA AATCGTATCA GTTACAAATG 300 

AAGGACCATT TGCCATAAAT AATAATGGTG TAATAGGCTG TTGCAAATCA GTAGCAAAAC 360 

CTGGATATAC TAGTGTTTTA ATATCAACAA ATTGATATGG CGCATTATTA TTGATGCGAA 420 

TTCTTTCGTC TCTTACATCA ACATTCACAC CTAATTCACT AAATTTAGCA GTTAATGTTT 480 

CTACATGTTT CGGAACAATA TTATTTAATA TAACATTTTC TCCACATGCT GCAGCGATAC 540 

ACATATATGT GCCTGCTTCA ATTCTATCAG GTATAACTTG ATACTCAGAA CCATGTAATT 600 

CTTTGACGCC ATTGATTTTA ATTGTTGATG TACCCGCTCC CTTAATATTA GCTCCCATAC 660

TTGTTAAGAA GTTAGCAACA TCAACTACTT CCGGTTCTTT AGCAGCATTT TCAATTACAG 720

TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGtnAAT TGTTGCACCT ACGCnAACCA 780

TATC 784
(2) INFORMATION FOR SEQ ID NO: 538:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 3733 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 538:

CAATCTCCAC CAATGACACG TCGTCATTCG TGACCTCATA CCArACAAAA AACAGTCTCG 60 

CAATCAAGAC TGTTTTCCAC TCAATATATT CATCCATTAG CGTAATAGAT TATTTGACTT 120 

CTGTAGCTAC AAAGaTTTTA CGTTTTTCCC AAACGCCTGT cTTTTCATTG TAATCATCAC 180 

AAGTAATTAA TGTTAATTGT TTATCTTTAC CTTTTTGTTC ATCTAGAAcT CCTACATCTG 240 

TAGGCTTAAC ATCTCTTATA CTTGTCATTT TATACTTACG TGTTTCATTA CCAACTTTAA 300

AGTACACCAT ACTACCTTTT TTGGCTGCTT TAAGATTTGT AAATTGATAG TTCGGACGGT 360

CAATGAAAGT GTGTCCTGCA ATTGAAATAT TTTGATCATC TAGTGATTCA TTTTCTTCTG 420

CAAAGCTTAC ACCTCTATTT AATTGTTCAG GTGTTGCTGG TCCTGGATAT ACTGGTTCTT 480

TAATATCAGC ATCTGGAATT TCAATATAGC CTGCCACTTT CGATTTATCT TTCGGAATTT 540

GAGGTTTAGC TTGCTGCTTT TTATCTTTAC TCGCCTGTTC TTTTACATTT TTATCATATT 600

GTTCAATCTT TTCATCTTTA TCTTTATCGT GAAGATAATT ATCGATATGT GGTTTAGCAA 660

ACAAATATGC TGCCACTAGG aTAAGTACCA CACCAGCGAT TGTCATTAAT CGATTTGTCC 720
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AGTATACCAT TAATTTCAAA ATGACTCATA 
GCAATTCATT TTATACTATA AAATTTACAT 840

GTATACTTTT ACGTTAGATT TCATTACACA TATTTGCATT CAAATAACGA AACGCTTTTA 900

ATAATTACTA AGGGGGAATT GATATGATTA GATACGCTAA AAAAGAGGAT TTAAACGCTA 960

TATTAGCGAT ATACAATGAT GCCATTATCA ATACTACAGC TGTTTATACT TATGAACCAC 1020

AAACCATAGA CGAACGTGTC GCATGGTTTG AAACGAAACA ACGTAAGCAT GAGCCTATCT 1080

TTGTATTTGA GGAAAATGGA AGTGTCTTAG GGTTTGCGAC GTTCGGTTCA TTTAGACCTT 1140

GGCCAGCATA CCTATATACA ATCGAACATT CTATTTATGT CGATGCTTCA GCTAGAGGAA 1200

AAGGTATTGC TAGTCAATTA CTACACCATT TAATTGTGGA AGCAAAAGCT AAAGGTTATC 1260

GTGCGCTAGT TGCAGGCATT GATGCTTCCA ACAAAGCGAG TATTCAGTTG CATCAAAAGT 1320

TTGCTTTTAA GCATGCCGGC ACACTGACCA ATGTAGGTTT TAAATTTAAT AGATGGTTAG 1380

ATTTAGCATT TTACGAATTA GATTTACAAG ACTAGTAATG TTTGAATCAC ATAATATAAA 1440

CAAGACAACC ATGTTAATTC CCTTAACATA ACAAGCCAAC ATATAAAATT TTAAACTTCT 1500

CAGGGGAGTG GGACAGAAAT GATAAAGAGC CACTAATGAT TTATTATGTA GTGGTTCTTA 1560

CACATTAGCC ACAGCTAATG TGTAcTTAAA AATAGGAATA CATGAGTAAA ACTCATGCAT 1620

AAGAAATACT AATTTCTATA GAAAAAGTAT TTCTTTATCG TCGTCCCACC CCAACTCGCA 1680

CATTATTGTA AGCTGACTTT TCGTCAgCTT CTGTGTTGGG GCCCAAAAAG CTTGTTACAA 1740

GCGCATTTTC GTTCAGTCAA CTACTGCCAA TATAACTTTG TAGAGCATAT TACATTGATT 1800

TACATTGTCC CTTTTATTTA TTCTTTTCAA ATACTATCCC CATAGCTTTG ATTTAACGCT 1860

TTTTCTCAAT AACAAAACGA ATATAGTAGA ACATGAAAAC GATAATCATG CTGAGCGATA 1920

AAGATTTAAA TAATAGATTG ACCCACGTTC CCTCAGTCGT ATATCCATAT GTAATCGTTG 1980

TGTTAATGAT GAATGCTATA AAGATGATTG ATAGTCTTAG CATATCATCA CTCCTTTTAA 2040

GTTATTTTAG ATATACGGGG GCGCTTTTGC AATCACTATT TTGATTAGTA TGCATTTTCC 2100

ATAAATCTTT CAACTTCTTC AGAGATAATT AAGAAGCATC TATCTGGTAC TAATGATCCA 2160

GACAGATGCT TCTTTTTTAT CAATATTTTA TTGTTATCTC ATTAATTATT TTTAACCATA 2220

TCTTCAGCTG TGCCAAAGAT TTTACGTTTA ATTGCTTCGC CAGTTGGTGT GCCTGCTAGT 2280

CCACCCAATC CAGTTTCACG TAATGATGCA GGAAGGTTAC GACCAACCTT ATCCATTGCT 2340

TCAATAACTT CATCAACAGG GATTCTACTT TCAATACCTG CTAATGCTAA ATCTGCTGAA 2400

ATTAAAGCGT TACCCGAACC AATTGCATTT CTCATAACAC AAGGAATTTC AACAAGTCCG 2460

GCTACTGGAT CACAAACTAA ACCTAATAAA TTACTTATCG CTAATGCCAT AGCGTGCCCG 2520
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GAACCAACTT CAGnTTGGCA GCCACCTGTT GCACCAGCTA CACTTGCATT GTTTGCTACG 2640 

ACACGCCCAA ACAATGCTGA AGTGAATAAG AAATCAATCA TTTGCTCTTC TGTTAAATCA 2700 

TGTGTTTTTT CTAATTTAAA AAGTGCACCG GGAATGGTAC CCGAGGAACC AGCTGTTGGC 2760

GTtGCACAAA TAATACCCAT CGCAGCATTG ACTTCATTTG TTGCAATGGC AcCtTTGcTG 2820

CGTCAATCAT TTCATATCCA GACAAAGCAT GATGTGTTTC ATTATAATCA CGTAGTTTAG 2880

CAGCATCATG ACCAGTGTAG CCCGTTACAC TTTCAACCCC ATCACCTGTC GTCCCTTTGA 2940

TTACTGCGTC TCGCATGACA TCTAAATTTT GTTTCATTTG CGCTCGCACT TCATCACGTG 3000

ATTTACCGCT TAATTCCATT TCTTCTTTAA CCATGATATC CGCAAATGAC ATATTATTTT 3060

CTACGGCATA ATCTATAGTC TCTCTAATTG AATCAAACAT GTTTATTCCC CCTCTAATTT 3120 

ATATAGGAAA CGTTTACGTC ACTGTATTTC TCTTTAATTG TATTTAATGT TGATTCTGAG 3180 

ATTGCTTTAT TTAATGGTAT TACAACCAAG CATTTATCTT CATCTATCTT AATAAATTCA 3240

TCTTTACAGT CTAATTTCAT ATCGTTGATA TCATTGATGA AATGATTTAC TTGTGCTTTA 3300

GTCATATTTC CGTCAACAAC TAAAATTGGT AATCCATGAT TTAAATCTAC TTCTAGTCCA 3360

TTTATATGAA TACCTTTAAT TTTAATTGTA CCACCACCGA TTGAAATACC GATAATTTCA 3420

ATGTAGCGAC CATCATTACG AGATGATTTG ATATAAGCAC AGTTTGGATG TTGACCAATA 3480

CTATCGCCTT CTTCTTCGAT GATATCTATT TTAATACCAT CATCAGCTGC AATTTCTAAT 3540

GAAGATTTAA TTCGGTTATC AAATGTTGAA TATCCCATTG CTCCACCCAC AATAGCGACA 3600

TCTGTACCAT GTCCTTGGTG TGTTTGAGCA AATGATTCAT AATAATGTAT TTCAATATTT 3660

TTATATCTCC CAATATTGCG CGTGCTGAAT TCCCCTTTAC TGCACCAGCC GTATGAGAAC 3720

TTGAAGGGCC CAT 3733 

(2) INFORMATION FOR SEQ ID NO: 539: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 525 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 539:
TGGCTGTCTT CTCTATGAGT GTAGTAAGTA AGTTAACGGA TTTAACGCCA AGGCAAATAC 60
GTTACTATGA AACACATGAA CTCATCAAAC CTGAAAGAAC AGAAGGTCAA AAACGTCTGT 120
TCTCACTCAA TGATTTGGAA AGATTACTAG AAATTAAATC ATTATTAGAA AAAGGATTTA 180

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AAGAGATAAG AAAAAAGATG ATTGTAGATG CCACGCAAAA GCCTATTGGA GArACTTTGC 300 

CAATAAATCG TGGTGATTTA TCCCGATTTA TTAAATAAAA TTTGGAGGAT TTTAAAATGC 360 

CAAAACGTAC TTTCACTAAA GACGACATTC GTAAATTTGC AGAaGAGGAA AaTGTAaGaT 420 

ATTTAAGATT ACAATTCACT GATATTTTAG GAACAATTAA AAATGTTGAA GTGCCTGTAA 480 

GCCAATTAGA AAAAGTACTT GATAACGAAA TGATGTTTGA CGGTA 525 



(2) INFORMATION FOR SEQ ID NO: 540:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1408 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 540:





TTGATTTGCT ACAAAGTATC TnCTCATTTC TGTATCCTGA AAAATCTTTA GTGTAATAAT 60

GTTGTTCAGT TTTAATATTT TCAGTCATAG TTGACTACCT CCGTATATTT TGATTTAATT 120

AAGTTGTATA TTTTGATGAA CACTTATTGT TACTTGTTGG CGCAAGTAGC AGTTTTTTCA 180

TTCTTCATAA AAGTATTCCT TATAGAATAT GAATGTTGCG ATACTTGCGA ATCCTGCAAT 240

TGaCCATGCT GtAGTGAAGT ATAGAAACGG CATAAGTACA ATCGCTAAGA CTGTGAAGCA 300

TAGTACTGCT ACTAGGTAGC TTTTATAAAT GTTACTCATT TTCTTTTTTC AACTCCTCCA 360

TTATTCTCTG GTCTGATAAG TCGTGATAAG GGAATTTTTT CCtAGCTAAT TGGACTGGTA 420

TTCTGCCTCG TATCGCAATG TATCCTTCAT CTTCAAGCTC TTTATTCAGT TCTCTTATTA 480

TTTGTCCTGC TTTGGATTTA GAAACAGATA AAATTACCgC AAGTTCTTTA GCTTGCAAAC 540




TATTTTTCAT CATATCTTTT CCTCCTTTAA AATAACTGTT GATTCTCTGG GTTATCTGCT 600

TCGTAATTAT CTGCAATAAT ACTTTTAGCG AAAAAGTCCA AACTGACCTT ATATAGGTTG 660

TTCATAGATT TCTTTACGTT AACCCCTTCC TCAAGTACAT AAGGCACCCT AAAATCATTT 720

ATAAACAGTC CGTTTTCGTC TAAAGTAACG GTTGGTAATT CAGGTTTGTT CCGTCTATAA 780

ACTTCTCCTA GTGTAGGTTT TTGCTTTTCA GCTTGTTTAG TGAAGTCGGA AAATGCCTTA 840

AGTAGTTTTA TTCCTGAATC AGGATCACTG TGTCGCTCAA TCGTTTCTGC TGTAGACTCT 900

TTACTAAAAT CATTTCTATT GATTACAGGC TTTCTCGTAT TTCGTTCAAT CTTCCAAACC 960

TTCCACGTCA CAACTGCCAT TGTGGTGAGG AGGGTTGTTT TGTATAGTGC GTTCATTTGT 1020

AATTCCTCCT ATTAAGTTGT TTGTTCAATT GTGTGTGTTA TTCTTCTTCG TCTAAATCAA 1080
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CGACTTCGTA AGTGTGCTCA ATCTCGCCTG cATATGTCAC AGTAAGAGTA TCTTTGTGTG 1200 

TGTATGTTTG ACTTTTGTTy TCTtTAACTG CATAAAGTGT TAATACTATA TTGTTTAGCT 1260 

TTyCTTTTTG TTCTGGTGTC ATTTACGCTC CCCCTAmATT AGCyTCATAA CCGAATTCAG 1320
TCATGATTTC ATGTATTTTC AATCTGCCTT TTTGTGTCCA TCTAGTTTGT AAAACTGTGT 1380
CTTCTCTGCC ATCAGAACGC ACAATTGT 1408

(2) INFORMATION FOR SEQ ID NO: 541: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 541:

GTTCGTCAAT GATTTTTATC CGATGCTATG AGCATTaTCA AATACAAAAT GCTCTCTTAA 60 

AAGCAGTTAT TGACTGAAAA TCTACTTCTA AGAGAGCACT TTATTTAATT ACTTAAGAAA 120 

TCTTGAAATT TCAATATACG ATGTTTATGA TAAGTCGCTT ATTTCATCTT TAGGCTTGTT 180

ATTAGTAAGT AGTTTAATAC CACTGATTAA CCATAAAGCA AATGTAATTA TGTTACCACT 240

TATTACAGCT CCAATAATCA ACAATATACC ACTCATTTTT TTGTTTTTAG ATGCTTTAAA 300

CATACCGATT GCACCTAAAA TAATTGAAAT GATTCCAAAT ATGAATAGGG ATAAGAATAA 360 

TACAGTGAAA ATTGCTGCTG CTGTTTCTGc ATCAACTGGG nCAACCTCAC CATTAACTGT 420 

TGTTGGACAC AT 432

(2) INFORMATION FOR SEQ ID NO: 542: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2426 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 542:

ATAATCATGA AGTnGCTAAA nCGCCAAATA ATGATGGTTC TGGACATGTT GTGTTAAATA 60 

AATTCCTTTC AAATGAAGAG AATCAAAGCC ATAGTAATCG ACTCACTGAT AAATTACArG 120 

GAAGCGATAA AATTAATCAT GCTATGATTG AAAAATTAGC TAAAAGTAAT GCCTCAACGC 180

AACATTACAC ATATCATAAA CTGAATACGT TACAATCTTT AGATCAACGT ATTGCAAATA 240
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GTATAAAAAG TCAACGAAAT ATTATTTTGG AAGAACTTGC ACGTACTGAT GATAAAAAGT 360 

ATGCTACACA AAGCATTTTA GAAAGTATAT TTAATAAAGA CGAGGCAGTT AAAATTCTAA 420 

AAGATATACG TGTTGATGGT AAAACAGATC AACAAATTGC AGATCAAATT ACTCGTCATA 480 

TTGATCAATT ATCTCTGACA ACGAGTGATG ATTTATTAAC GTCATTGATT GATCAATCAC 540 

AAGATAAGTC GCTATTGATT TCTCAAATTT TACAAACGAA ATTAGGAAAA GCTGAAGCAG 600 

ATAAATTGGC TAAAGATTGG ACGAATAAAG GATTATCAAA TCGCCAAATC GTTGACCAAT 660

TGAAGAAACA TTTTGCATCA ACTGGCGACA CGTCTTCAGA TGATATATTA AAAGCAATTT 720

TGAATAATGC CAAAGATAAA AAACAAGCAA TTGAAACGAT TTTAGCAACA CGTATAGAAA 780

GACAAAAGGC AAAATTACTG GCAGATTTAA TTACTAAAAT AGrAACAGAT CAAAATAAAA 840

TTTTTAATTT AGTTAAATCG GCATTGAATG GTAAAGCGGA TGATTTATTG AATTTACAAA 900

AGAGACTCAA TCAAACGAAA AAAGATATAG ATTATATTTT ATCACCAATA GTAAATCGTC 960

CAAGTTTACT AGATCGATTG AATAAAAATG GGAAAACGAC AGATTTAAAT AAGTTAGCAA 1020

ATTTAATGAA TCAAGGATCA GATTTATTAG ACAGTATTCC AGATATACCC ACACCAAAGC 1080

CCAGAAaCGt TAACACTTGG TAAAGGTAAT GGATTGTTAA GTGGATTATT AAATGCTGAT 1140

GGTAATGTAT CTTTGCCTAA AGCGGGGGAA ACGATAAAAG AACATTGGTT GCCGATATCT 1200

GTAATTGTTG GTGCAATGGG TGTACTAATG ATTTGGTTAT CACGACGCAA TAAGTTGAAA 1260

AATAAAGCAT AATTATATTG GGGGAAGAGC ATCTATATAT TTTTTTAAGT ATATAAGACG 1320 

TCTTATTTCC CCTTAATTTA TTGTGAAGTA TATGCAAAAT GCAATGAATA GATTGTCCAT 1380 

CATTTTAACG TTATAATGAA TTTAACGACT TAGAACTACA CAAGTAAAGG AGAATGAAGA 144 0 

35 

TGTCTCGAAA AACGGCGCTA TTAGTTTTGG ATATGCAAGA AGGTATAGCG AGTAGTGTAC 1500 

CTAGAATAAA AAATATTATT AAAGCGAATC AGAGAGCAAT TGAAGCAGCA AGACAACATC 1560 

40 GAATACCAGT CATTTTCATA CGTTTAGTGT TAGATAAGCA TTTTAATGAT GTCTCCTCGA 1620 

GTAATAAAGT GTTTTCAACA ATTAAAGCTC AAGGATATGC GATTACTGAA GCAGATGCAT 1680 

CTACACGAAT ACTTGAAGAT TTAGCACCAC TAGAAGATGA GCCGATTATT TCTAAGCGAC 174 0 

45 GCTTTAGCGC ATTTACAGGT AGTTACTTGG AAGTTTATTT ACGTGCAAAT GATATTAATC 1800 

ATTTAGTATT AACGGGTGTC TCTACAAGTG GAGCTGTATT GAGCACGGCA TTAGAAAGTG 1860 

TAGATAAAGA CTATTATATT ACTGTTTTAG AAGATGCTGT TGGTGATAGA TCAGATGATA 1920 


AACATGACTT TATTATTGAA CAAATTTTAT CACGCTCATG TGACATTGAA TCCGTAGAGT 1980 

CATGGAAAAG TAGTTTATAG TTAATATAAC GTCAATTAAA GCTCGGCAGT AATGTTTGAG 2040 

GAGGAACATT TGAACATAAA ATAATATATT TATATAAAAC GACCgAGGCG TTCGAACTGA 2160

ATGtCCTCGG GTTTAATTGA ATAGAAATCG GACTTATGAA CGAAATATGT TTAAGTCGAA 2220

CTCCTTGTTT ATACTTATAA ATTTTACGGG TTTAATATAA TACTTATTTA CCTGTAATAT 2280

ATGCATAATT nCTTCAGTCG GTCAGCCTGT CGTTGCATAG TTCCTATGCA GCAAATGCAT 2340

ATCCTAATCC TTTAACATTG GCATTnCTGC AAATGAACGC ATAGAATCCA TTCACTGTTA 2400

ACTTTTTnCA ACAAATGTCT nACATG 2426




(2) INFORMATION FOR SEQ ID NO: 543:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1874 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 543:








GAGTTGGGGA ATGTGCTCAA AATATGCGGA CTTTATGCAT TyCGGAATTG sCCaATTGCA 60

GCTTTAAGCT ATGGTCAAAA AAAGAGGGTC ACTATAGCAT CTGTTTTAGT CTTAAATCCG 120

GAAATAATCA TATTGGATGA ACCGACTGCT GGTCAAGATT TCTATCATTA TAATGAGATA 180

ATGTCATTTT TAATTGAACT AAACAGACAG GGGAAGACGA TTATTATGAT TACGCATGAT 240

ATGCATTTAT TGTCTGAGTA TAGTTCAAGA ACAGTTGTAT TATCAAAAGG TCAAGTCGTT 300

GCTGATACCA CGCCAgTATT GGTTTTAAAT GATAAAAAAA TCTGTGAGAT TGCATCATTG 360

AGACAAACAT CGCTATTTGA AATGGCCGAA TATATAGGGA TTAGCGAGCC ACAGAAATTA 420

GTACAATTAT TTATTAACCA TGATAGGAAG GTGAGACGcC AATGAATCAA TATAATACTA 480

TAGGTTTTCA CCCGGGAAAT AGTCGTATTC ATCAATTAAA TGCGACTGTT AAACTTTTAT 540

TCTTATTAGT TGTTTCTATT TCTGCAATGG TGACTTATGA CACAAGATAT TTAATTTTAA 600

TTAGTGCTTC ATCTATTTTA TTGGTCAAAT ATGCTCATAT TGAATGGAAA CAAGTTCGCT 660

TTGTTGTTAA ATTCATTCTG TTTTTCACAA TAtTAAATAT TATTGCCGTG TACATATTTG 720

ACCCTGAATA TGGTGTGAAG ATTTATAATC AGCGTACAGA GTTAGTCAAT GGTATTGGTC 780

GATTTACGCT AACATCACAG GAATTATTCT ATCTTTTTAA TCTAATATTA AAATATATTA 840

GTACAGTTCC TTTAGCGTTA ATATTTTTAT TCACAACGAA TCCGAGTCAT TTTGCTGCAA 900

GTTTAAATCA GCTAGGTGTG AATTATAAAA TCAGTTACGC AGTCTCACTA GCATTAAGGT 960

ATATTCCAGA TATTCAAGAA ACATATTttA ATATTTCACA AGCGCAACAA GCAAGAGGAT 1020
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TACCTTTAAT ATTTTCTAGT ATCGAAAGAA TTGACACTAT TAGTACTGCT ATGGAGTTAA 1140 

GACAATTCGG ACAGTATAAA AGGAGAACCT GGTACGTCAA AAAACAATTA AAAAAAGATG 1200 

ATTATGTTGT TTTGTGTTTG ACGTTAATAC TTCTGATGTT AGTAGTTACA TTATTCTTTT 1260 

TAAATAATAG TCGATATTTC AACCCGTGGC ATTAGTATTC ATATAAATAG TCTTTAAATA 1320 

GAAATAGGAG GGAGACATTT AATGATAAAT ACTGAAAGAT TAAATTTAAT GATTCCAAGT 1380 

TCCTCGCATT TAATTGAACT TTATAATATT TGTAGTCATC CACAAGCAAA TATATACACT 1440

CCCAAAGGTT TACATAATTC CAAATTAGAC ACACAACGGT GGATTGAAAA ATGGCGAAAC 1500 

CATTGGCAAC AATATCAATT TGGTTACTTT GTATTGGTAA AAAAAATAGA TTGTAGTGTT 1560 
ATTGGTATTT GTGGATATGA ATATCGACAA TTAAAGCAAG AaACAGTATT AAATTTATTT 1620

TATAAATTAC ATCCAAGTTT TGAAGGACAA GGGTACGCAT GTGAGGCTAT TACAGCAATC 1680 

ACAAATTTTG TGAATTATAT CGATCAAGAA ACAGTAAAAG TTATCAGGAC AAATAAGTGT 1740

AACCAACGTT CAATAAATTT AGCAGAAAGG CTTAAATTCA AGCGAGACGA TACTATGGAC 1800 
GACATTATCA ATCAAGGAGA TATTGTGTTT TAAnAATAAA ATACTATGAC ATTATCTAAA 1860

AAATAAAATT AAAA 1874 

(2) INFORMATION FOR SEQ ID NO: 544:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 5280 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 544: 

TCAACATTTC TAACACCAAT GTGAAAATGA TCTATGTGAT TTGCAATGGC TTGATTTGTA 60

ATATGTGTGC CTAAATGACC TGTAGCACCT GTTAACATAA TATTCATTCA CTTCATCTCC 120

TAATCTTTAT ATACATAACA TAATACTTAT TTGATGGTTT TCAAAACATT TGATTTTATA 180

AAAAATTCTA ATCTGTATTT ATTGTCGACG TGTATAGTAA ATACGTAAAT ATTATTAATG 240

TTGAAAATGC CGTAATGACG CGTTTTAGTT GATGTGTATC ACTAATATCA TTGAAAATTT 300

TAATCaGGTA CTACGACAAT ATGATGTCTG TTTTGTGTCT GAAAGTTTTA CAGTTTTTAA 360

AATAAAAATG GTATAAAGTG TGATTTGTAT AAAAAAGAGT CTCGACGGAT AAGAATTGAT 420

TAATAACAGT TAGCATTTTA TTAATTACCT TAACAATGAT TCAAGTTTAG TTAAATGAGG 480

TTTAATTTGA AAGGGGATAG CGCCTCAATA TAATGTAGGT AGATTGTTCA TATTACGTAA 540 
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AAGCAAGTTT AGCATTGGGA ATGTTAGCAA CAGGTGTAAT TACaTCGAAT GTACAATCAG 660

TACAAGCGAA aGCAGrAGTT AAaCAACAAa GTGAaTCAGA GTTAAAACAC TATTATAATA 720 

AmCCAATTTT AGAGCGTAAA AATGTGACTG GATTTAAATA TACTGATGAG GGTAAACACT 780 

ATTTAGAAGT CACAGTAGGG CAACAGCATT CTCGAATCAC TTTACTTGGA TCTGATAAAG 840 

ATAAATTTAA AGACGGAGAA AACTCAAATA TAGATGTGTT TATCCTTAGA GAAGGTGACA 900 

GTAGACAAGC AACAAATTAC TCAATTGGTG GCGTTACAAA ATCAAATAGT GTGCAGTATA 960 

TTGATTATAT CAATACGCCA ATTTTAGAAA TCAAGAAAGA TAATGAAGAT GTACTTAAAG 1020 

ATTTTTACTA CATTTCAAAA GAAGACATCT CATTAAAAGA ACTTGATTAT AGATTAAGAG 1080 

AACGTGCGAT TAAACAACAC GGCTTGTATT CAAATGGTCT TAAACAAGGT CAAATTACAA 1140 

TTACAATGAA TGATGGCACA ACACATACAA TCGATTTAAG TCAAAAACTT GAAAAAGAAC 1200 

GTATGGGTGA GTCAATCGAC GGCACTAAGA TTAATAAAAT TCTAGTAGAA ATGAAATAAT 1260

ACTTTCTAAC AACAAAGCGC TATGTTGAAT AGTGCTTGTT ATGGAAATAT ATGGAAGTTA 1320 

AGCGACGTAC TGTTGCTTAG CTTCTTTTTT TGAGGGGAAA AGTTACAAAA CTCACACAAA 1380 

CAGTCGCACC ACGCATTATC TTTTGCTTAA ATAGCTTAAT CATATTTTAT GAATAGTTAA 1440

AAACAGGTTA ATGTGAATAT CCGAATACAG CTCCTATAAT ATGGGTGTAT GATTCAAATT 1500 

ACGTAATAAA ACAATCTAAT TATAATAGAT TGGAGCATAC AACTATGAAA ATGAAAAATA 1560 

TTGCAAAAAT AAGTTTGTTA TTAGGAATAT TAGCAACAGG TGTAAACACT ACAACGGAAA 1620 

AACCAGTTCA TGCCGAAAAG AAACCTATTG TAATAAGTGa AAATAGCAAA AAATTAAAAG 1680 

CTTATTATAA TCAACCTAGT ATTGAATATA AAAATGTGAC AGGTTATATC AGTTTCATTC 1740

AACCAAGTAT TAAATTTATG AATATCATAG ATGGTAATTC TGTTAATAAT ATTGCTTTAA 1800 

TTGGCAAAGA TAAGCAACAT TATCATACGG GTGTACATCG TAATCTTAAT ATATTTTACG 1860 

TTAATGAGGA TAAGAGATTT GAAGGTGCAA AGTACTCTAT TGGGGGTATC ACGAGTGCAA 1920

ACGrTaAAGC TGTCGACCTA ATAGCAGAAG CAAGAGTTAT TAAAGAAGAT CATACTGGTG 1980 

AATATGATTA TGACTTTTTC CCATTTAAAA TAGATAAAGA AGCGATGTCA TTGAAAGAGA 2040 

TTGATTTTAA ATTAAGAAAA TACCTTATTG ATAATTATGG TCTTTACGGT GAAATGAGTA 2100 

CAGGAAAAAT TACAGTCAAA AAGAAATACT ATGGAAAGTA TACATTTGAA TTGGATAAAA 2160 

AGTTACAAGA AGACCGTATG TCCGATGTTA TCAATGTCAC AGATATTGAT AGAATTGAAA 2220 

TCAAAGTTAT AAAAGCATAA CACATATACT TGATGACGAA ATAAGTTGAA ATTGAAATAG 2280 

AGAGGTTAAG TGACGATCAA ACGTTGCTTA ACTTCTTTTT AATGCTTAAA AATTATTTCA 2340 
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TTAATAATAC TTCAATAATT GTTAAAAGGG GTTTAATGTG ATTATCTTAG AACGCCATCT 2460 

ATAATGATGT TGTATGATTC AAATTACGTA AAAAGACAAT CGAATATAAT ATAGATTGGA 2520 

GCATACAATT ATGAAAATGA GAACAATTGC TAAAACCAGT TTAGCACTAG GGCTTTTAAC 2580 

AACAGGCGCA ATTACAGTAA CGACGCAATC GGTCAAAGCA GAAAAAATAC AATCAACTAA 2640

AGTTGACAAA GTACCAACGC TTAAAGCAGA GCGaTTAGCA ATGATAAACA TAACAGCAGG 2700 

TGCAAATTCA GCGACAACAC AAGCAGCTAA CACAAGACAA GAACGCACGC CTAAACTCGA 2760 

AAAGGCACCA AATACTAATG AGGAAAAAAC CTCAGCTTCC AAAATAGAAA AAATATCACA 2820 

ACCTAAACAA GAAGAGCAGA AAACGCTTAA TATATCAGCA ACGCCAGCGC CTAAACAAGA 2880 

ACAATCACAA ACGACAACCG AATCCACAAC GCCGAAAACT AAAGTGACAA CACCTCCATC 2940 

AACAAACACG CCACAACCAA TGCAATCTAC TAAATCAGAC ACACCACAAT CTCCAACCAT 3000 

AAAACAAGCA CAAACAGATA TGACTCCTAA ATATGAAGAT TTAAGAGCGT ATTATACAAA 3060

ACCGAGTTTT GAATTTGAAA AGCAGTTTGG ATTTATGCTC AAACCATGGA CGACGGTTAG 3120 

GTTTATGAAT GTTATTCCAA ATAGGTTCAT CTATAAAATA GCTTTAGTTG GAAAAGATGA 3180 

GAAAAAATAT AAAGATGGAC CTTACGATAA TATCGATGTA TTTATCGTTT TAGAAGACAA 3240 

TAAATATCAA TTGAAAAAAT ATTCTGTCGG TGGCATCACG AAGACTAATA GTAAAAAAGT 3300 

TAATCACAAA GTAGAATTAA GCATTACTAA AAAAGATAAT CAAGGTATGA TTTCACGCGA 3360 

TGTTTCAGAA TACATGATTA CTAAGGAAGA GATTTCCTTG AAAGAGCTTG ATTTTAAATT 3420 

GAGAAAACAA CTTATTGAAA AACATAATCT TTACGGTAAC ATGGGTTCAG GAACAATCGT 3480

TATTAAAATG AAAAACGGTG GGAAATATAC GTTTGAATTA CACAAAAAAC TGCAAGAGCA 3540

TCGTATGGCA GACGTCATAG ATGGCACTAA TATTGATAAC ATTGAAGTGA ATATAAAATA 3600 

ATCATGACAT TCTCTAAATA GAAGCTGTCA TCGGAAAAAC AAGAAGTTAA GTGACAACGG 3660 

TTTACATGTT GCTTAGCTTC TTTTATTATG CGTAATGATG TAAAAAGACG AATATTCATT 3720 

TGTTTGTAAA AGTGGCATTT CTATGTCTTA AAAGTGACGA AACTTCAAAT GTGCCAAGTG 3780 

TTGAATCACA TCAAAATCAT TTTTATTTAA CGAACATTAT GGATTTCTTA ATTTACTTAA 3840 

CGATGATTCA AATATAGTTA AACAAGGTTT AATGTGAATG GAGCAATACG CCATCTATAA 3900 

TAAAGCTGTA TGATTCAATG AATGTAATCG AACAAATCTA ATAATTACGA ATGGAGCATA 3960

CAACTATGAA AATAACAACG ATTGCTAAAA CAAGTTTAGC ACTAGGCCTT TTAACAACAG 4020

GTGTAATCAC AACGACAACG CAAGCAGCAA ACGCGACAAC ACTATCTTCC ACTAAAGTGG 4080 

AAGCACCACA ATCAACACCG CCCTCAACTA AAATAGAAGC ACCGCAATCA AAACCAAACG 4140 
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CGCCTTCAAC TAAAGTGACA ACACCTCCAT CAACAAACAC GCCACAACCA ATGCAATCTA 4260 

CTAAATCAGA CACACCACAA TCGCCAACCA CAAAACAAGT ACCAACAGAA ATAAATCCTA 4320 

AATTTAAAGA TTTAAGAGCG TATTATACGA AACCAAGTTT AGAATTTAAA AATGAGATTG 4380 

GTATTATTTT AAAAAAATGG ACGACAATAA GATTTATGAA TGTTGTCCCA GATTATTTCA 4440 

TATATAAAAT TGCTTTAGTT GGTAAAGATG ATAAAAAATA TGGTGAAGGA GTACATAGGA 4500

ATGTCGATGT ATTTGTCGTT TTAGAAGAAA ATAATTACAA TCTGGAAAAA TATTCTGTCG 4560 

GTGGTATCAC AAAGAGTAAT AGTAAAAAAG TTGATCACAA AGCAGGAGTA AGAATTACTA 4620 

AGGAAGATAA TAAAGGTACA ATCTCTCATG ATGTTTCAGA ATTCAAGATT ACTAAAGAAC 4680 

AGATTTCCTT GAAAGAACTT GATTTTAAAT TGAGAAAACA ACTTATTGAA AAAAATAATC 4740 

TGTACGGTAA CGTTGGTTCA GGTAAAATTG TTATTAAAAT GAAAAACGGT GGAAAGTACA 4800 

CGTTTGAATT GCACAAAAAA TTACAAGAAA ATCGCATGGC AGATGTCATA GATGGCACTA 4860

ATATTGATAA CATTGAAGTG AATATAAAAT AATCATGACA TTCTCTAAAT AGAAGCTGTC 4920 

ATCGGAAAAA CAAGAAGTTA AGTGACAACG GCCTACATGT TGCTTAGCTT CTTTTGTTAT 4980

GTTCGATGAT TTGAGAACCC GAATTTTCGA TGGGTCCAAA TATGACGTGG AAGAGACCTG 5040

AATTTATCTG TAAATCCCTA TCTATCGGGT GTGAAGCACA ACGGGATCAG TTTTATTTAA 5100 

CGAACATTAT AGATTCCTTA ATTTACTTAA TAATGATTCA ATGATTATTA AACATGGTTT 5160 

AATGTGAAAG GTCAAATACG CTAACTATAA TAAAGCTGTA TGATTCAATA GACGTAAGCG 5220 

AACAAATCTA ATAATTACGA ATGGAGCATA CAACTATGAA AATGACAGCA ATTGCGAAAG 5280 
(2) INFORMATION FOR SEQ ID NO: 545:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 886 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 545:

AGTAAAATTG CCGGTATGAT GGACACAAAC GGTGACCTTG GTCAAGGTGA ATTAGCGATT 60 

AATCCACCtA AATCAGATTT gAACGAATTA CCTTGGGCTA CACGTAAAAA TAAACAGCCA 120 

GCTTCATCCG AAAAAgGTTC AAGTGGTCAT CATGGTAATG CAGCAATGCC TCAAACCAmA 180

TTAGATTATC AAATATCTAT TGATAAGGTC GTTGAACAGG CGCAAAAAGC TGGTATTAAA 240 

AAGCCGTTTT CAATCGTATA TCCAAGTGAT AAAAATGGTA CCTTTATTGT ATCTAATACT 300 
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GATCAATATA GCGGTAAAAA GCTAGGTACG ATTAAATATG ATGACTACGG TATTATTGCT 420 
AAATGGTTTA CATGGGGCAT TCCGCTTCAC GAAGGTCATT TATTCGGCAT TTTAAATAAA 480

ATCATTAATT TATTTGTATG TATCGCTTTA TTAGTAGCCA TTGGCATGGG GTTTGTCTCT 540

TGGATAAAGC GTACAAAAAA TACTGCAGTA AAAGTACCAC ATCGCGTAAA AAAACCAGCA 600 

TCTATATCAC TCATAATATG TTTAATTGTA TTAGGATTAT TAATGCCATT ATTTGGATTA 660 

TCACTTATCC TTGTATTTAT AATTGAATTA ATATTATATA TTAAAGATCG TCGTGCTAAA 720 

CAATAATGCA CTTAAAGTTT TGAACTGACG AAATTTACAA AATGgATTCT CGTCTCTCTA 780 

ATTACtTAAA ACGGGgTtCy AaTAATAAAT CgTACTGaTG GgAAAGTTTT TACTTTTTAt 840

CTGtCCGAtT TTTTnGAAwT TGAAGATAAA AAAGCATCTA AAACGC 886 
(2) INFORMATION FOR SEQ ID NO: 546: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4336 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 546:

GGCATTTGTG TCCTTATATA AGGAACTGTG tTAAATACAT TACTGTTGTT AAGTTGTTTT 60

TGTAATTCAA AGAGCAGAAC AGAGTAACAT CATCAGTTGT AGTAAACGAT AATCCGGTAA 120

AACAACTAAA TGAAATAATG AAAGTCATTT AACCTGAACA TTAAAATATA TTTGTTTTTC 180

ATTAAGAATA ATTCAAGTAT ATTTAAATCG AGGTTAATTA TCGTATGAAA CGATGCACGT 240

TATAATAAAA ATGTATGATT CAAATTACGT AATGAAAACA ATCCAATATA TTAAGATTGG 300

AGCAAATAAA TATGAAATTT ACAGCATTAG CAAAAGCGAC ATTAGCTTTA GGAATTTTAA 360

CAACAGGAAC TTTAACAACA GAAGTTCATT CAGGTCATGC AAAACAAAAT CAAAAGTCAG 420

TAAATAAACA TGACAAGGAA GCATTATACC GATACTACAC TGGAAAGACT ATGGAAATGA 480

AAAATATTAG TGCTTTGAAA CATGGTAAAA ACAACTTACG TTTTAAGTTT AGAGGTATTA 540

AGATTCAAGT TTTACTGCCT GGAAATGATA AAAGTAAATT TCAACAGCGT AGTTATGAGG 600

GGTTAGATGT TTTCTTTGTT CAAGAAAAAA GAGATAAGCA CGATATATTT TATACTGTTG 660

GTGGTGTAAT ACAGAATAAT AAAACATCTG GAGTTGTCAG TGCACCAATA TTAAATATTT 720

CAAAAGAAAA GGGTGAAGAT GCTTTTGTGA AAGGTTACCC TTATTACATT AAAAAAGAAA 780

AAATAACACT AAAAGAACTG GATTATAAGT TGAGAAAGCA TCTAATTGAA AAATACGGAC 840
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ATAACCTTGA TTTAAGATCT AAATTAAAAT TTAAATATAT GGGGGAAGTC ATAGAAAGCA 960 

AACAAATTAA AGATATTGAA GTTAACTTAA AGTAAATCAT TACGAATAAT TAAAAGTAAT 1020 

TGAAGCGGCT TAACGGTGAA ATGTAAATTG GTGCGCATAG CTTATACAAA AAGGATGCAT 1080

CAATCGATAT CGTCGTTAAG CCGTTTTGGT TTGTGTGTCA TGAATCCTAT CCCAATCTCC 1140 

ATAAAGGTAA AATTTCCACC ACCAACATCA AAATTCTCCA CATCGCAACA TAACCAAATG 1200 


TTATAATAAA TCTATTACAC AAAGAGATAA ATTACTTATT CAAAGGCGGA GGAATCACAT 1260 

GTCTATTACT GAAAAACAAC GTCAGCAACA AGCTGAATTA CATAAAAAAT TATGGTCGAT 1320 

TGCGAATGAT TTAAGAGGGA ATATGGATGC GAGTGAATTC CGTAATTACA TTTTAGGCTT 1380

GATTTTCTAT CGCTTCTTAT CTGAAAAAGC GGAACAAGAA TATGCAGATG CCTTGTCAGG 1440 

TGAAGACATC ACGTATCAAG AAGCATGGGC AGACGAAGAA TACCGTGAAG ACTTAAAAGC 1500 

AGAATTAATT GACCAAGTCG GTTACTTCAT TGAGCCAGAA GATTTATTCA GTGCGATGAT 1560

TCGTGAAATT GAAACGCAAG ATTTCGATAT CGAACACCTG GCGACGGCAA TTCGTAAAGT 1620 

TGAAACATCA ACATTAGGTG AAGAAAGTGA AAATGACTTT ATCGGTCTGT TCAGCGATAT 1680 

25 ■ 

GGATTTGAGT TCAACGCGAC TAGGTAACAA TGTCAAAGAA CGTACTGCTT TAATCTCTAA 1740 

AGTCATGGTT AATCTTGACG ACTTACCATT CGTTCACAGT GACATGGAAA TTGATATGTT 1800 

AGGTGATGCA TATGAATTCC TAATTGGGCG CTTTGCGGCG ACAgCGGGTA AAAAAGCAGG 1860

CGAGTTCTAT ACACCACAAC AAGTATCTAA GATACTGGCG AAGATTGTCA CAGACGGTAA 1920 

AGATAAATTA CGTCACGTGT ATGACCCAAC ATGTGGTTCA GGTTCACTGT TGTTACGTGT 1980 

TGGTAAAGAA ACACAAGTGT ATCGTTATTT CGGTCAAGAA CGTAACAATA CTACATACAA 2040

CTTAGCACGC ATGAATATGT TATTACATGA TGTGCGTTAT GAGAACTTCG ATATCCGTAA 2100 

TGATGACACA TTGGAAAACC CAGCCTTTTT AGGCAATACA TTTGATGCGG TTATTGCGAA 2160 


CCCACCGTAT AGTGCGAAAT GGACTGCAGA TTCAAAGTTT GAAAATGACG AACGATTCAG 2220 

TGGTTACGGC AAACTTGCGC CTAAGTCTAA AGCAGACTTT GCCTTTATTC AACACATGGT 2280 

ACATTACCTA GACGATGAAG GTACCATGGC CGTTGTACTC CCACATGGTG TATTATTCCG 2340 


AGGTGCTGCA GAAGGTGTCA TTCGTCGTTA TTTAATTGAA GAAAAGAACT ACTTAGAAGC 2400 

TGTGATTGGT TTGCCAGCGA ATATTTTCTA TGGGACAAGT ATTCCAACAT GTATTTTAGT 2460 

ATTTAAAAAA TGTCGCCAAC AAGACGACAA CGTACTATTT ATCGATGCAT CCAATGATTT 2520

TGAAAAAGGA AAAAATCAAA ATCATTTAAG CGATGCCCAA GTCGAACGTA TTATAGACAC 2580 

ATATAAGCGT AAGGAAACAA TTGATAAATA TAGCTACAGC GCGACACTAC AAGAGATTGC 2640
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GATTGATTTA GATCAAGTCC AACAAGATTT GAAAAATATC GATAAAGAAA TCGCAGAAAT 2760

TGAGCAAGAA ATCAATGCAT ACCTGAAAGA ACTTGGGGTG TTGAAAGATG AGTAATACAC 2820

AAAAGAAAAA TGTGCCAGAA TTGAGGTTCC CAGGGTTTGA AGGCGAATGG GAAGAGAAGC 2880

AGTTAGGGGA TCTTACAGAT AGAGTAATTA GGAAAAATAA AAACTTAGAA TCGAAAAAGC 2940

CTTTAACAAT ATCCGGACAG TTAGGTTTAA TTGATCAAAC AGAATATTTT AGTAAATCAG 3000

TTTCGTCGAA AAATCTAGAA AATTATACAC TAATAAAGAA TGGAGAATTC GCGTATAACA 3060

AAAGTTATTC TAATGGATAC CCATTAGGGG CTATTAAAAG ATTAACTAGA TATGATAGTG 3120

GTGTATTGTC CTCTTTGTAT ATTTGTTTTT CTATTAAAAG TGAAATGTCT AAAGACTTCA 3180

TGGAAGCATA TTTTGATTCG ACACACTGGT ATAGAGAAGT TTCTGGAATT GCAGTTGAGG 3240

GTGCAAGAAA TCACGGATTA TTAAATGTTT CTGTGAATGA TTTTTTTACT ATTCTAATTA 3300

AATATCCAAG TTTAGAAGAA CAGCAAAAAA TAGGCAAGTT CTTCAGCAAA CTCGACCGAC 3360

AAATTGAATT AGAAGAACAA AAGCTTGAAT TACTTCAACA ACAGAAAAAA GGCTATATGC 3420

AGAAAATTTT CTCACAGGAA CTGCGATTCA AAGATGAGAA TGGTGAAGAT TATCCAGATT 3480

GGGAAAATAG CAAAATAGAA AAATATTTAA AAGAGAGAAA CGAACGTTCT GACAAAGGGC 3540

AAATGCTTTC AGTAACTATA AATAGTGGCA TTATAAAATT TAGTGAATTG GATAGAAAAG 3600

ATAATTCAAG TAAAGATAAA AGTAATTATA AAGTAGTTAG GAAAAATGAT ATTGCATATA 3660

ATTCTATGAG AATGTGGCAA GGGGCTAGTG GTAAATCAAA TTATAATGGG ATTGTTAGCC 3720

CTGCATATAC TGTGCTTTAT CCAACACAAA ATACTAGCTC ATTATTTATT GGATATAAGT 3780

TTAAAACACA TAGAATGATT CATAAATTTA AAATTAATTC ACAAGGATTA ACATCAGATA 3840

CATGGAACTT AAAATATAAA CAATTAmAAA ATATAAATAT AGATATACCT GTATTGGAGG 3900

AACAAGAAAA GATAGGTGAT TTCTTTAAAA AAATGGATAT ATTGATAAGT AAACAGAAAA 3960

TGAAAATTGA AATATTAGAA AAAGAGAAAC AATCCTTTTT ACAAAAAATG TTCTTATAAC 4020

TTTGATAAAT ACATAGATTG CATAAGAATA AAATTTGTAT AATTTAACAT AAAAGTTGTA 4080

AAAGTAAAGT GAATTAAAAA CGAACATTAA ATTTAGGCAC TGTGAAAGCG CAGTGTCTTT 4140

TTTGTGTCGA AATTGTGTAC AGAATAAGTA GTTAAATAAA GATTAAGTTG AGATAAAGTG 4200

TTATTCGTAA ATAAAAGAGA GTAGATCGAT AGGAATTGAA TGATATTAGT TAACTATTTA 4260

TTAAATTACT TAATAATGAT TAATTTTTAG TTAAAGTAAG TTTAATGTGA AGCACGACCA 4320

TTGCTCATTA TAATGA 4336




(2) INFORMATION FOR SEQ ID NO: 547:
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(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 547: 

TAAGCTATCT GAGATAATTG CTGATAACAT TAAACCGGCA ATTTCAGGTT TAATTTCAAA 60

GCCACGTTCT CTAAACATTT TGTATAAAAT TGTAGCTGTA CAACCAACTG GTTCAGCACG 120 

ATAACATAAA GGACCAGCAG TTTCGAAATT TGCAATTCTG TGATGATCAA TTACATGCTT 180 

AATTGTAGCA GAGGCAATCG TATCAGAACT TTGTTGGaAT TCGTTATGAT CAACTAAGAT 240 

AACATCTTGA CCATCTAAAT CATCTGTTAA TAATTCCGGA GCAGGTACAT TAAATGTATC 300 

TAACGCGAAT TGAGTTTCTG CACTCACATC ACCTAAACGG TATGCTTTGG CTCCTGAATT 360 

ACCTCGAAGT TGTTCaAATT CTGCCaTAAT AATCGCAGAT GAAATTGCAT CAGTGkCTGG 420 

aTTCTTATGT CCGAAAATAT ATGTTTTAGC CAnTGTCAAA TATCTCCCTT GTAAATTGTA 480 

TTCTTTA 487
(2) INFORMATION FOR SEQ ID NO: 548:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 548:

TTGGTGGTGC AGCAGTTATA GCAATTGTTT TAGCATTCAT TGGTAAGTTC ACTGCATTAA 60 

TTTCTTCTAT ACCTACmCCA GTTATGGGAG GAGTATCTAT ATTACTTTTC GGTATTATTG 120 

CAGCAAGTGG CTTAAGAATG TTAGTTGAAA GCAAAGTAGA TTTTGCGAAC AATCGAAATT 180

TAGTTATAGC TTCTGTAATT TTAGTTGTAG GTATCGGTAA TTTAGTATTT AACTTAAAAG 240 

AAATTGGTAT CAACCTTCAA ATTGAGGGGA TGGCATTAGC TGCACTTTCA GGAATTATTT 300

TGAACTTAAT CTTACCTAAA GAGAAAAAAC AAAACAATTA AGATTTACAA ATTAAGGAGG 360 

GCGCTTTTAT GAATCATTTA TTATCAATGG AACATTTATC TACAGATCAA ATATACAAAC 420 

TTATCCAAAA GGCAAGTCAA TTTAAATCTG GTGAACGTCA ACTACCAAAC TTTGAAGGGA 480

AaTATGTCGC AAATTTATTC TTTGAAAATT CTACTCGrAC AAAATGTAGT TTTGAAATGG 540 

CAGAACTTAA GCTAGGGTTA AAAACGATTA GCTTTGAAAC ATCAACATCA TCTGTTTCAA 600 
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TCATTAGACA TCCGTTTAAT AACTACTATG AAAAATTAGC GAATATTAAC ATCCCAATTG 
CGAATGCTGG TGATGGTAGT GGACAACATC CAACACAAAG TTTACTTGAT TTAATGACGA 
TATATGAAGA ATATGGATAT TTTGAAGGCT TGAATGTATT GATTTGTGGA GAcATTaAAA 
ATTCACGTGT CGCACtAGTA ATTACCAnAG T 
(2) INFORMATION FOR SEQ ID NO: 549:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 549: 
TCAATCTAAT ATATACTTCA TGACTTCCCG CCAAAGGCGC AATCGTAGGA TTAGTCTGTT 
TATCGATAAG ATCTATTAAT ATTGTTTCTA CTTTAGATTC ACCTATTCCC GCAAATCTTA 
ATAGTTCAGA ATGTATAATT CGATTATGGT TTATAAAATG TGACAACAAT TCATTTTTCA 
CCATTGGTTG CATTTCTTTC GGTGGACCTG GTAATAAAAT AATTTGTTTG TTTTCAAAAT
TCACCATCAT TCCTGGAGCC ATGCCATGAT GATTTGTTAA TACAGTTGAA CCTTCAATTA 
CTAAAGCCTG TTGTcTATTA TTAGGTGTCA TTTCTtGTCC TTGTTCCTCA aAAwAGCTTT 
CAATATATTG AAAGAAGGCT CATCAATAAC TAAATCTnTA 
(2) INFORMATION FOR SEQ ID NO: 550: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1523 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 550: 
ATACCTCCAG CTAGAATACC AGCGTATTTC ATAAAATACT TCCTCCATTC AACTATATCT 
ATATTTAATT ATTTAAATTT CGTTGCATTT TCCAATTGAA AACTCATTTT AAAATCAAAA 
CTCTAAATGT CTGTGTATTA CTTAAAATTA TACATATTTT GCTTATATTT TAGCATATTT 
TGTTTAAACC TATATTACAT TATATCAGAC GTTTTCATAC ACAAATAATA ACATACAAGC 
AAACATTTCG TTTATTATTT ATATCACTTA ACTAATTAAT TTATAATTTT TTATTGTTTT 
TAAGTTATCA CTTAAAAATC GTTTGGCaAA TTCGTTGTGA CGCTTGTCCA TCTTCTAATG 
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TTTGTTTTAA TGCATCAATG AGTGCTGTTT GATTTTCAAC AATTGGACCT GGCAACTCTT 480 

TTTTATAATC CATGTAAAAA CCTCTAAGCT CATCGCCATA TTTATCTAAG TCATATGCAT 540 

AGAAAATTTG CGGACGCTTT AATACACCGA AGTCGAACAT GACAGATGAG TAGTCGGTAA 600 

CTAACGCATC GCTGATTAAG TATAAATCCG AAATGTCTTC ATAAtCTGAA ACGTCTTTCA 660 

CAAAATCATC ATGTTCATCA ATACGTGTCA CAACTAAATA ATGCATGCGT AAkAAAATAA 720

CATArTCATC ATCCAGCGCT TGACGCAAAG CTTCTATATC AAAGTTAACA TTAAATTGAT 780 

ATGAACCTTC TCGAATCGCT TCATCGTCAC GCCAAGTTGG CGCGTACATA ATCACTTTTT 840 

TATCTAATGG AATATTTAAT CTTGTCTTAA TACCATTAAT ATATTCAGTA TCATTGCGTT 900

TATGTGATAA TTTATCATTT CTTGGATAAC CTGTTTCCAA AATCTTATCT CGACTAACAT 960 

GAAATGCATT TTGAAATATC GATGTCGAAT ATGGATTAGG TGACACTAGA TAATCCCACC 1020 

GTTGGCTTTC TTTTTTAAAG CCATCTTGGT AATTTTGAGT ATTTGTTCCT AGCATTTTAA 1080

CGTTACTAAT ATCCAAACCA ATCTTTTTTA ATGGCGTGCC ATGCCATGTT TGTAAGTACG 1140

TCGTTCGCGG TGATTTATAT AACCAATCTG GTGTACGTGT GTTAATCATC CwCGCTTTCG 1200 

CTCTTGGCAT CGCTAAAAAC GATTTCATTG AAAACTTTGT AACATATGGT ACATTGTGCT 1260 

GTTGGAATAT GTGTTCATAT CCTTTTTTCA CACCCCATAT TAATTGGGCA TCGCTATGTT 1320 

CAGTTAAGTA TTCATATAAT GCTTTGGGGT TGTCGCTGTA TTGTTTACCA TGAAAGCTTT 1380 

CAAAATAAAT TAGATTCTTG TTTGGCAATT TTGATAGTAA TTTAAAAGTC GTATATATAC 1440 

TATGTTCTAT CAATTTTTTA ATTGTATTTT TAATCATGTC GTACCTCCGA CGTGTTTTTG 1500 

TAATTATATT AATATGTATG AGC 1523
(2) INFORMATION FOR SEQ ID NO: 551:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4923 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 551:

CAGTAAGAGA TTTTCTTAAT TGAAAATAAT CTTACTGCTT TTTTAAATTT AATTTCGAGA 60 

TTCAATATTA GTTTATCTCA TTGTGGCATT AATTGATAAA ATTGTTTTAA TGTTATAAAT 120 

CAAAGTCtTC TTCAACAATT TCAATGTCTG CATCAGATCT ATGATATGTG AAAAAGCTAA 180 

TTCTTATGCG GTCTAAATGC TCTAAATGGT GTCGATACTC TTCGATGGCT GCAACAATTT 240 
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ATGTGGACAA TAAATCTTTT TTCTGCGGAT TATCTAACTC ATAATCAACA TGTGTCACAT 360 

TATAACGTGC TTTTTTAGAA AGGCTAGCTA AAATTTGCTC GTGAAAAGCA GTTAATGAAT 420 

CTAAATCTAA TTTGATTTGT AATAGGAAAT TGTTATTAAG TAAATATAAG TCGTTTTGAT 480 

AACGCGACAA TTTGTTTAAT ACTTCATAAG CTTGTCTAGT CGTCTGAACT ACTTCTCTGA 540 

AAAGTATTTT CTTTCTATTC TGTTGGTGAA TATGTTTTTT TGTAATAGGA CGTTCTTCGC 600 

TATAGTAATC ATAAATTTTC TCTAACTTTT CGACACGTTG TTTTAAATTA TGACTATCTT 660 

GTTTAATATT ATTAAACTCC GTCGTATCAT TTAATACTAA TTTAAACCAC ATAAAAATAT 720 

CTGAGGATAT ATTTAATGAA TTATAGTAAA TTTTTGTTTC AAATTTAGGT GGTAGAAACA 780 

CAAAGTTAAC TAGAGATGAA CTTATGACAC CAATCATTAC AAGTACAAAC CTGTAAAAGG 840 

CGGTAATATA GAAAGAACCG GTATGTTGTC CCATAATGAT TAATGCTGTT ACACTCGCCA 900 

AAGTAGCAAC ATGTGCTAAA TTAAATTTAA ATAAAATAGC AATAAGTACT ATGACGGTAA 960

CACCCATAAT GATAAAATTA TCACTAAAAA TTGTTACCAT TGTAACAGAT AGTATGGCAC 1020 

CTATAATGTT ACCCAATGCT TGATCAGAAA CTGTTTTAAA TGAACGATAA ATACTAGGTT 1080 

GCATTGCACA ACAGCACTGA CACCAGCTAA GGCTTTCAGA CCAACATCAT CCGGTAGTAA 1140

AGAAGCGATA GACATAGCTA AAATAATGGC TATACCAGTT TTAAAAATCC GAGCTCCTAG 1200 

TCTCAAAAAT AATGACGCCC CTTTTAAGTT TATTGAATAT CTAATATTCG TATTCATTAC 1260 

TGTTATACAC TTACTAGTTA CAAAATTCAA GCTTATTTAT AGTTGTTAAA ATAAATCATA 1320 

CATAATACTG ATAGCGATGT AAAACTTTAG TCAGAGATTA AAATAGTATA AATTTGTAAA 1380 

ATAAAAACTC ACATAGTGAC ATATCAAGTT AAACGTTAAT AGTTAACGAT ATAAAATGAA 1440 

TCTACTATGT GAGCATTTGC TTTATTTTAA TTCAATTAAA AATATACTTC CTTAAAAGTT 1500 

ATTTCATTTG ACTAAAAGCA TAGTCTGCAG CTTTTAAAGT TTGTTTAATA TCTTCTTCTG 1560 

TATGTTCAGT TGTTAAGAAC CAAGCTTCAA ACTTAGAAGG TGCTAAATTG ATACCTTGAT 1620

TTAACATTAA TTTGAAAAAT TTACCGAACG CTTCGCCGTC AGAATGTTCA ACTTGATCAT 1680 

AATGTGTGAC TTTTTCATCT GTAAAGTACA ATGTTAAAGA TCCATAAATA CGATTAATTG 1740

TAGCTGTGAT ATTATGTTTT TCGATTAATT TAAGTAAACC TTCTTCTAGT TGTTGGCCTA 1800 

AGCTGTCTAA TTTTT CATAA ACACCGTCTT GTTCTAGTAC TTCGAGTAAT GCAATACCTG 1860 

CTTTCATAGA TAACGGGTTA CCAGCCATTG TACCAGCTTG ATATGCAGGT CCTAGAGGTG 1920 

CTACTTGTTC CATAATATCT TGACGTCCAC CATAGCCTCC AATTGGTAAA CCACCGCCAA 1980 

CAATTTTACC AAATGCAGTT AAATCAGGGA TAACACCTAA TAAATCTTGA GCGGCACCGT 2040

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AAATTTCATT AACCTCTTCT AAAAATCCAG GTTGAGGCAT TACCATTCCA AAGTTACCAA 2160 

CAATTGGTTC TACTAATACT GCGGCAATTT CATCACCCCA AAATTCAATT GCTTCTTTAT 2220 


AGGCGTTAAT ATCATTGAAA GGTACAGTAA TGACTTCACG TGCGACGCTT TCTGGAACAC 2280

CAGCTGAGTC TGGAGAACCG AGCTGAGATG GGCCGCTACC TGCTGCAACC AATACTAAAT 2340 

CAGAATGGCC ATGATAAGAT CCAGCAAATT TTATAATTTT ATTTCTTTTA GTATATGCAC 2400 


GTGCAACACG AATTGTTGTC ATGACTGCTT CTGTTCCAGA ATTTACAAAG CGAATTTTCT 2460 

CAAGAGATGG AATTGCATCA CGTAATTTTT TGCTGAATTC AATTTCTAAT TCAGTCGGTG 2520 

TACCAAATAA AACACCTTTA GCAGCTTGTT CTTGAATTGC TTTAGTAATA TGAGGATGTG 2580

CATGCCCCGT AATAATTGGA CCGTATGCTT GAAGGTAATC AATAAATTTA TTGCCATCGA 2640

CATCATATAA ATATGCACCG TGTCCTTCTT TCATAACAAC AGGTGCACCG CCTCCTACAG 2700 

CTTTATAAGA ACGAGAAGGG GAATTGACAC CGCCTAGAAT ATATTCGTTT GAAAGTTGTT 2760

GTAAACGTTC ACTTTCACTA AAATTCATTT ATATCAACCT CTTTTAATTT AATATTTTCA 2820 

TCTAATATCG TATCATAAAA TTATTATAAT GAAGAAAAAG GTGATTATAT GTTGCAAAAA 2880 


GGAGAACAAT TTCCAATATT TAAATTAGAA AATCAAGACG GAACTGTCAT TACAAATGAT 2940 

ACATTAAAAG GTAAAAAGGC GATTATATAT TTTTATCCTA GAGATAATAC ACCTACTTGT 3000 

ACCACAGAAG CTTGTGACTT TAGAGACAAT TTAGAAATGT TCAATGATTT AGATGTTGCA 3060 


GTATATGGTA TAAGCGGTGA TTCAAAGAAA AAACACCAAA ATTTTATTGA GAAACACGGA 3120 

TTGAATTTCG ATTTATTAGT AGATGAAGAT TTTAAATTAG CTAAAGAAAt GGCGTATATC 3180 

AGTTAAAAAA ATCATTTGGC AAAGAAAGTA TGGGCATTGT AAGAACGACT TTTATAATAG 3240

ATGAACAAGG TAAAGTATTA GATGTTATCG AGAAGGTTAA GGTTAAAACA CAAATAGAAG 3300

AACTTAAAAA CATTTTGGGG TGACATATAT GAAAGTTGTT GGGTTAAATC GTATGCGTGA 3360 

AGTTGAAACT GAATTACAAC AACGCTTTTC AGATTTAGAT TTTAAATTTT ATAAAAAAGC 3420

ATCAGAAATA CCTGAGAGCG ACTTGGCTGA TTTAGATATA TTAGTTGGTT ATGATGGCGG 3480

TATCAATGAG GCATTTTTAC GACGTTGCCC GAATTTAAAA TGGATTGCAT GGTTTGCAAC 3540

GGGTGTAAAT ACATTGCCGT TAGATTATAT TGCAGATCAC GGCATACTTT TAACTAATGG 3600 

AAAAGGTGTT CAAGCTAAAC AATTATCTGA ATACATTTTA GCTTTCATTT TAGATGATTA 3660 

TAAAAAGATG AAACTATCAT ATGATAACCA ACGACAACAT ATATATGATT CGAAAATAAC 3720

TGGTAAACGC CTATCAGGAC AAACAGTTTT ATTTTTAGGT ACAGGTGCAA TTGCTACTAG 3780 

AACTGCGAAG TTAGCAAAGG CTTTTAATAT GAATTTAATT GGTCTGAGCA AGTCAGGTCA 3840
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TGCTGACATT ATTATAAATG CTTTACCAGA AACGCAAGAA ACGATTCaTT TaCTAAAGAA 3960

AAAACATTTT GAATTAATGA AAGATGAAGC ACTTTTTATA AATATAGGAC GAGGTAGCAT 4020

AGTTAAAGAA GCGCTCTTAA TAGAAGTATT AAAAAGTAAA GTTATTCGAC ATGCATATTT 4080

AGATGTGTTT GAAAATGAAC CTTTGAAACC TAATCATGAA TTATATGAAT TGGATAATGT 4140

AACTATAACA GCGCATATAA CTGGTAATGA TTATGAAGCA AAGTATGACT TATTAGATAT 4200

TTTTAAAAAC AATCTAGTTA ATTTTCTCAA TAAGAATGGT CTAATTGAGA ATGAAGTTGa 4260

TGCTAAAAAA GGCTATTAAA TGArATCATC ATGTAAATAT TGACACGCGC GCAATACTAC 4320

AGTTATATTT aTAGTAAgTt AATaATgATT ATATAAGAaA GATGGTgATA TAGATGAGTG 4380

TTGAAATAGA ATCAATTGAA CATGAACTAG AAGAATCAAT TGCATCATTG GGACAAGcAG 4440

GCGTAAGAAT TACACCTCAA AGACAAGCAA TATTAGGTTa TTTaATTTCT TCACATACTs 4500

ATCCAACASC TGaTGaAATT TATCAAGCAC TTTCACCTGa TTTTCCAAAT ATAAGTGTTG 4560

CGACAATATA TAATAACTTA AGAGTGTTTA AAGATATTGG AATTGTAAAA GAATTAACAT 4620

ATGGAGACTC ATCAAGTCGA TTCGACTTTa ATACACATAA TCATTATCAT ATTATATGTG 4680

AACAATGTGG TAAGATTGTT GATTTTCAAT ATCCACAGTT AAATGAAATT GAAAGATTAG 4740

CTCAGCATAT GACTGACTTT GACGTAACAC ATCATCGAAT GGAAATTTAT GGAGTTTGTA 4800

AAGAATGCCA AGATAAATAA TTTAACTTTG GTAGTATGAC AAATTAAAAA AGCGTTACTw 4860

ACTTCATATA AGTAAGCGTA ATATTTAAGA nGTTAAACGA CATGaAAGTt GTTTAACTTT 4920

TTT 4923
(2) INFORMATION FOR SEQ ID NO: 552:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 917 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 552:



TCCGGCTTTA AAAACTTTTC CCAATTCCAG CTTGGGCCTT TGGCATTAAT ATTAACCTCC 60

TGGTTCGGAT TAATTGGGAA CCTAACCnTT TAGGCAATAA TTGGTTTAGG CAACTTCCAA 120

ATTGGTGGTT CAACCAACGT CTTTGGATAC CyTGcTCATT TAATTCTAAA ATGgTyrGAA 180

CGCATTTTGG TACCCAAAAt GgTGACGTTC GTTTGCACGG TCTAATAAAT TGTCTAAGTT 240

GTCGATTGGT TTCATTAATT GAaCACCATT TTGCTCTTCA GCAAGACCTT TGTCTACTTT 300
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GCGATCCATT GTTTGTTCGA AAAGAATAGC ACCCAAAATT TTATCTGGTG AGAATGAAGG 
TGAAGTTACC ACACGTGTAC GGATATCGTG AACAAGTTGG AACATTTCGT CTTCATTGCT 
ATATTGATCT TCGTTTACAC CATATTCTTT AAGTGCTTTT GGTGTACTAC CACCACTTTG
GTCTAATGCG GCAATAAAGC CTTTTCCATT TTTCATTTTT TCTAATTGCT CTTTATTCAT 
ACTTTCCACT CCTTAACTTT TCAATACACC TCCAGTATGA TAAAAATGAG AACATTTCTC 
AAGTCATAAA CCTTGAAAAG TGTATAAAAT GTGAAAAATA ATTGTCAGTT TAATTAAAAA 
TATTATTTTA TTCTAGGTAT GACTAACGCC ATTAATGACA TAAAGAAAAT ATGTGTAATA 
ATCCAACCGA TTAATTCTGT CACACTAAAT TGAAAAATTG GACGTTGCGC AATAAATACT
AAAAAGGGAT ACAATGCTAT AAATAAGAAA AATAAAGGGA TATAACATAG ATAGTAAAGC
CTTTTAGAAG TATGAAA 

(2) INFORMATION FOR SEQ ID NO: 553: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 553: 
ACTCCCATAT CCACTGGCAA GATTTATAAT TTGTAATTCT AAAGTATAAT AGCCTATATT 
TTTCAAAATC TCTTTTTTTA TGATTAACAT CCCTCTAAGT GCACCTAAAG GTTTTTATGT 
AATGTATTAA ATATTTCCAT TTATACATAT ACATCTTGTA TAAAAAGAAA GAACTCCATA 
TATACTCAAA AGGTATACTG AAGCTCTATA TATTATATAA AATTATTATA CTATTTTGAA 
AAACATAAGT AAATCACTAA ACATGATTTC TTTCACTTTT AGAAAACTTT TAATACTATA 
AAAGCACCCA CTCAGTCACT AGTTTGGGCA GTTATTGTAT GCCTATTGaA CTCAATGCgT 
ATATTACAAT ACCTTTTtCG CATATTCATA TAAGacTTTG CATCTTTAAG CTTAATTGCT 
ATCTCTTTCT CT 

(2) INFORMATION FOR SEQ ID NO: 554: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1374 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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TTTTGCTTTA TGTTAAATGG ATTATTaTTA GAATAAAAAA TCGGTGATGA GCTAAAAAAG 60 

TGTGTAGGAT GTTTTCCkAA CCCAATTTTT ACATCCGAAG ATATCGAACA ATATCTTCCT 120 

ACTTCTACAT TATTAAAATC ACTACCAAAT CCAATATAAC TGTATTCACC AATGTGAGAA 180

TTCCTGATTT TACACCATCT ATCTATATAG TTATTGCCAT CAAATTTTGA GTTTGTAATA 240 

TACGCCAAGC GATGAATCTT AACATTCGAT TCTTTAGAGG ACTGGTTTTT CAGCAAACCA 300 


ATTATCTTTT CAATCGCTAT CCTCATCGTC ATTTCCTCCA AGTATTTCGA TTGATAATAT 360 

CTTTATAGCT TTGAATAATT TTAACTACCT TTGTCGAAAC GTTAGTGTCT TTATAATCAA 420 

TAGCATCAAT CATCGGTTCG TTATTGTTTT GCATCTCTCT TGCTAGTTCA ACGGATTGGA 480 

TTAGATTGTT ATAGGTAATA CCACCTACAA TAACCGTACC TTTATCTAGT ACTTCCGGTC 540

TTTCTGTGGA AGTTCGAATA AGGACACCAG GGAACTTCAA AATAGACGAC TCTTCTGACA 600 

ATGTTCCACT ATCTGATAGC ACAACAAATG CATCTTTTTG CAATGCATTA TAATCAAAGA 660

AACCAAATGG CTTTAACTGT TTAACTAATG GATCAAATTC AAATTTACTT TCTTCAATTT 720 

TCTTCCAACT TCTTGGATGC GTTGAATAAA TCACAGGCAT TTTATACTTT TTGGCAATAT 780

CATTTATCGC ATTCATTAAT GATTTAAAAT TCTTTTCATT ATCGATATTC TCTTCTCTAT 840

GCGCAGaTAC TAAAATGTAT TGTTGCGGTT CTAATCCTAG TTTATTTAAA ACGTCACTGT 900 

GATTAATTTT ATCTCGATGC GCTTCTATCA CTTCTGTCAT CGGTGATCCT GTyACAAAGA 960 

30 

TATTCGCTTT ATTGAAGCCT TCATCTAATA AATAACGTCT GCTATGTTCC GTATAAGGTA 1020 

GATTCACATC ACTGACATGG TCAACAATTT TACGATTGAT TTCTTCAGGT ACATTCTGAT 1080 

CAAAGCATCT ATTACCCGCT TCCATGTGGA ACACAGGAAT CTTTAATCGT TTAGCAGATA 114 0 

35 

CTGCTGCTAA ACAACTATTT GTATCACCAA GAATTAAAAG TGCATCTGGT TGTTCGCGTA 1200 

ATAAAACATC ATATGTCTTC GCAATAATAT TCCCCAtCGT TTCTCCAnGt TACTTCCAAC 1260 

40 TGCCTCTAAG TAGTGGTCCG GTTGTCTTAA TTCCAAATCA TCAAAGAAAA TTTGATTCAA 1320 

TGTATAATCA TAATTTTGAC CAGTGTGTAC TAATATCTGA TTAnAAATAT TGAT 1374 
(2) INFORMATION FOR SEQ ID NO: 555: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1472 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 555:
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TTACCTCTAG GCATGTCCCT TTCACGGTTT GCTTATGATA ACGTTATCGA CATTGTTAAA 
TTGTCCTTTT TGAAAAAATA ATTCTGGTGA GATAGACGCT ACTTGGaTTT CaTCaGTTTG 
TAAtAACGCA GtATAAtTAC CaTTACTAAA tTGaGTTAAT CGTTCaTATA AAGTACTAAT 
AGGATAATAA ATGTTATCCG TTAAgCGCGC CGTATAGTTC ACTTGATACG TTTCGCCTTC 
AACAATTGCT TGTTGGACAC GTTTAATATT AGTCATCATA ACCTCAGAAG ATTCAACAAA 
TGAAAAATGA TACTTTGATA CATAAGAAGT TTGATGTTCA TATGTTGAAT TTATGCTTTC
CGCTTTTTCA AAACTATAAG CTGCTGCATA AATATCATCT TTAGCTAATG AATGTGTACA 
CATAGCATGA TTAAAATACT TTGCCGCTTC GTAACTTAAA TATAACGAGA CATATCJACC 
TTGTCGTkGt GCGCTTGTGC AAAGTGTATC ACTTCTCCCA CATCAGCCAA CTTAGTAGCA 
ACATACTTCT TTATAAATCC CTTTAATTGA ATATGGTATT GCTTATATTC ATTTTCAGTT 
AAATAGTAGC GATAATTATA TTCTATTCTC ACAGTAATCA CCTACCTTCG ATAAAAATAA 
TTCAACTTGT CGATAACCGT ATTCACTCAA AATAGATTCA GGATGATATT GCACACCAAA 
AACCGGAAAT CTAATATGCT CAAATGCCAT AATAATCGCT TCATCGTTTT TTGCTGTAAT 
CTTTAAGCAA TTTGGAAAAG TCGCTCCGTC AGCAATTAAT GAATGATAAC GCATTACATT 
GAAATTTTGA GGCAGTCCTT GAAAAATACC TTCATTGGTA TGGCGTAACT GTGTAGTATG 
TCCGTGTACA GGATGATAGC CGTGAATGAT ATTTCCACCA AAATAAGACA CGATACATTG 
AAATCCTAAA CATACACCTA GTATAGGTAC ACGCTGATAA AATTGTTCTA ACACTTCATT 
CAAGATAGGA TAATCATCCG GaTTACCCGG CCCAGGCGAA ATAACAATTG CTTTTGGCTT 
CATATTAATG ACGTCTTCTA TCAGCAGATT ATCAATACCA ACAACTTGAA CTGTTAGTTT 
CGTTTGAGTC TTAATATAGT CTATTAAATT ATATGTAAAT GaATCATTAT TATCTATGAC 
TAGAATCATT GTATACTCCG TTCTAAATGT GTTTTATTTT TATAATATGT ATTGGATGTA 
GCTAAAACTT TAAAAGCATT GTCATTATCC TGACCTTGAT TTTAACTAAT ATATGGTATA 
TTCTATTCAT CGTACATAAA TGAATATCAG AGGTTCCTAG CTGAAACCCT CTATAAAAAA 
CTAGGCCATT GAAATTTCAA ACATTCGTTG GG 
(2) INFORMATION FOR SEQ ID NO: 556: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1054 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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15 



AGAACAGCAA GGATTACTTA CTGAGGAATT AAAGAAGGAT ATTTTAAAAC AGAACAAATT 60 

ACAACGTGTT GAAGACCTAT ATAGGCCTTT TAAACAAAAG AAAAAGACAA GGGCAACTGA 120 

GGCGAAACGT AAAGGGTTAG AGCCATTAGC GATATGGATG AAGGCACGTA AACATGAAGT 180

CTCAATTGAA GAAAAAGCAC AACAATTTAT AAATGAAGAA GTGCAATCGG TTGAAGATGC 240 

TATCAAAGGT GCACAAGATA TTATTGCGGA ACAAATTTCA GATAATCCTA AATATAGAAC 300 

AAAAATTTTA AAAGATATGT ATCATCAAGG TGTGTTAACT ACATCTAAAA AGAAAAATGC 360 

TGAAGATGAA AAAGGTATTT TTGAAATGTA CTATGCATAT AGTGAGCCAA TTAAACGCAT 420

TGCTAATCAT AGAGTTTTAG CTGTTAATCG TGGTGAAAAA GAGAAAGTAT TATCTGTAAA 480

GTTTGAATTC GATACGACAT CAGTAGAGGA TTTCATTGCA CGTCAAGAAA TCAATCATAA 540 

TAATGTAAAT CGCAGTTATA TTTTAGAGGC GATTAAAGAT AGCTTGAAAC GCTTAATTGT 600 

CCCTTCGATA GAGCGTGAAA TCCATGCTGA TTTAACTGAA AAAGCTGAAA ATCATGCAAT 660

AGATGTTTTT AGTGAAAACT TAAGAAATCT ATTACTGCAA CCTCCAATGA AAGGTAAACA 720 

AATATTAGGC GTAGATCCAG CATTTAGAAC AGGTTGTAAA TTAGCAGTCA TTAACCCATT 780 

CGGTACTTTT ATAGCAAAAG GTGTGATTTA TCCGCATCCA CCAGTTTCTA AAAAAGAGGC 840

AGCAGAGAAG GATTTTGTAC AAATGGTTAA AGCGTATGAT GTGCAATTAA TTGCAATTGG 900 

CAATGGTACT GCAAGTCGTG AAACAGAACA ATTTGTTGCA GATTTAATTA AAAAGCATCA 960 

GTTGCCAGTA CAATTCATCA TTGTCAATGA AGCGGGCGCT TCAGTATACT CAGCATCAGA 1020

AATTGCTAGA GATGAATTTC CTGATTTTCA AGTG 1054 
(2) INFORMATION FOR SEQ ID NO: 557:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1057 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 557:

AAATGTCAGA ATACAAGAAA AAAATAATTG AATTAATTGA AAGTAATTTA ACAGGATATG 60 

AAATTTCTAA AAAAACTGGA GTTTCTCAAT ACGTACTTTC ACAATTAAGA CAGGGCAAAC 120 

GCGAAGTAGA TAATCTAACC CTGAATACAA CAGAAAAATT ATATGAATAT GCCAATAAAG 180 

TTTTGTAATT TAACTAATGT ATAAATTAAT CAAGCTATGT TTATTTGATT TAACTATTAA 240

TAAAAATCAT ATGGTGAATG GATATTATAA TAATTAAAAT ACAAAAATAG TAGATTCCAA 300 
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AAGGGAAAAT AAGTGTTAAG TTTTAAATGA TAAAAAAGAT TGGAATGGAT CGTCTTGAAA 420 

TGCTCCCTTC AAAGTTTTCA TTTTTTCAAT GTCGACTTCG AAGGGGGCAT TTTCATTAAA 480

TTGTTATAGC TTTTTATATT TGTATAATGA ACATATAAGT TTAAGAAGGT GCGAGTGAAG 540

GAAATAAAAA AGCTCAAATG TACCAAATTG TTAATCTTAA TAAATCTCTA CTTTATAAAG 600

ATTGAATGGA CATTCGAGCG TTAATCAGTC AGGAGGGACT TTCCCTCCTA CAATTTAATA 660

ATAATACTTG CTTCACCACT ATACAAGGAG TGAGTTGTTA TGTTCAAAGT GAATTATTCG 720

ATTTTAAGTT ATTATCCAGA ATATAATATC GCAGTAAGTT GGCAACGTTT AAGAGAAGGA 780

AAAACAATAA AAAACAAGAT TTAATACTGC TGCGTCATGA GGCGCTTGAA CATTATTTGA 840

TGAATAAGTA TAATTTCAAC TATGATTATG CACATAAAAT TGTATCAAAA AAATACGATT 900

ATTCAATTTT TATAAAAaAG AAGGTGGATT AAATGCTTAC ATTAATAAAA TTGGAAAGAA 960

GATGaACAGG tTATAATATA TGraTATATa CCTGaAGATG aTATAAGTAC CGGGTAAAGG 1020

GTCCCGTACC TTTTTAATTA AAAAAGTTCC AGGGGGT 1057
(2) INFORMATION FOR SEQ ID NO: 558: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3754 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 558:

CTGTGCTGTA TTTACtTTAA TTTGACGACA TTTGAGAAGT ATTATGATGG nTGTAGTTAT 60
TTTATGAAAG TAATGTATTA ACAATCGAAT TACTAAGTCT ATATTGGTAG GAATATCGAT 120
GTTTAGTTTA AATGGAATGC ACTATTTAAG TTTTTAATAT GGAGATGTTT GTGACTTTTC 180
GATGATTAAG ATTTTTATAG GTGTGCATCA TTTCAAATAA ACTTTGTGTT TAAAATTGAG 240
CTTAGGAAAT CGATAGGTTT AGATGAGGAT ATTGTTGAAG TTATGTGTCT TGTATCCTTA 300
GTTGTTATAA AAGCGCAAAA AATAGCACCG CTTTCTCTTT ATCTGTGTAG AAAGGATGCT 360
ATTATTGTAA AACAATAGTT TTAATTTnAT TTTCTGATaT ATCATATGTr ATTcTACCTG 420
TATCAATTTC TATCGAATTA TAACCATCAA AATTATCAAC TTTATCATCA AAGTCTATCA 480
CTTTTCCAAT TAATATTTTA TTATTAGTAA GCGTTAATTT GACTAATTTG CCTATGTATG 540
ACTGTAAATT CATATTTAAT CACTCCTTTT TAATATACGG AACTACATGA AAACCAGTTT 600
TAGAATAATG AACCTTACCT AATTTCGTTT CAATATATTT ACCATTCACA TAAGATTTAC 660
AAACTGGAAT ATCAATAAAT TCTTTTTTAG TCATCTTTTC ACTTTCATTA GAATCTATTA 780
TAGTATAACT TGGTAATAAA GATGAATTAT TTTTCTTATT ATTATATTCA TTTGTGCATA 840
AAATATGACG CTTTTGCTTA ACATTATTCA ATTTCATCGT CATTTTACCA TTACTTATCA 900
TTTCAAGCAT TTCTTTTTTA GCTTTTTGTA GAAAGGATGC TATTTTGTAA TTATTTTAAT 960
AATCTAATAC TTTTTATCTC AGTTTCATCA AACGAATAAA CAGCAAAATC AGTCTGTATA 1020
TCTACAACAA GATTTCCTGT TTCGCTCTCG AATTCATTTT CATAGTCAGT TACAAACCCT 1080
TCGTATTCTT CATTATTCAA AAGCGTGATA ATAACATCTT TACGATATGC ATCTnCAATT 1140
CTCAATACTT TTCACCTACT TATCAATATA AGGTACTATA TGaGcACCTG GTCTTCGAAT 1200
AATGGCACTT CCCTTTCTGG TTTCAATATA CACATTTTCG ATATGTATTT TTCCAATAAT 1260
TTGATTAAAA TTAATAATCT CTTTCAAATC AAATCGCTCA TCACTTAATA TTAGATTGCC 1320
TGTTGACATT TTTTCTCTTA ACAATTCATT CAATAAATCT ATAGAAAGTA TTGTATAGCT 1380
AGGCAATTTC TTATTATTTA AAATGGCTCT TTTTTTATTT TCATTATATA GGTGATGACC 1440
TAACATATAT CTATTTTATT TTCCGCAATT TATTTCTATT TTTATTTTAC CATTTTTAAT 1500
CATTTCTTTC ATCTGGTTTT TAGCTTTTtC CTGtAATTAT GCTTCTTTTA CTTCTACTTG 1560
ATATTTACCt TCACGCTCTT TAAAGAACTT GTCCCGCCAA TTGCCAACAT GTGGCACTGT 1620
GGTACTTCTA CACCAAGGAT GCATAGGTGG CGCATTCACA CCTGGTATCA TATCTTTAAC 1680
TTTAAATATT TTTCCGTTAA GTGAATGACA TAATTTAGAT GTTTTACTAT CTATTTTGGC 1740
AACATATTTA TATTCGCCAT CTTCACCAAG TTCTTTTAAA TATGTTAACT TTTGTGCTTC 1800
TGCATTTTCA GTAAATAGTT AAAAAAGCGT ATAAAAATAG CACCACTTTC TCTTTAkCTG 1860
TCTAAAAAGG ATGCTATTTA TCTTTTGAAT TTGAATTCTT TTTCGCTTTT TCTATACTTT 1920
CAAATTCTTC AACTAATTCT TTAAAAGATT CACTCAATTC TTTTGCAGTT ACATTTCCAT 1980
CTAATTGTGA ATCTAACATA ATTAAAATCA TCTCACTTTA TATTTAATCa TATTTATACT 2040
ATAAAGTTTT TTCAATAATT TTTCAATATG GCTATCATTA ATGATATTAA TATGCGTAAA 2100
ATATTTAGCA CAAAATTTAC TCACTATTTT ACCATGAAAC CTATTTGACT TGGTAATAAA 2160
TTTTACTTGT CCCTTATTAG TAACGATTGT CATTGATTTT ATTGATGGAT GCTTAAAAAA 2220
TGTAAATAAA TCATATTCTG AAAATCCTGA CTGTCCAGGA TGGTTATGTA ACATAACAAT 2280
TGAATTCGGT TTACTGTTAA ATAATAATTC GGTTGCTTGT TACCCTGGCA CAAAAGATAC 2340
ACTATCTTGA TTGACATATA CTTTTGTAAA TTTACCATCT TTTAACAAAT AAGCTACTTC 2400
ATTGCTATCA TTGTTTTCCA TGAGATATAC CTCCGTTTAT AGtCTGTCGA CTGATATTCC 2460
GAAGGTTAAA ATTTTTAAAG TAAACTTTCT TTATCCCCAT GCTACGAGTT CAGATTCAGG 2580
AAATAGCCCT TTACTAGTAT TTATGTATAT TCTGTCTATG GCATGAATAA AATAATTATC 2640
TCTTGTATTT tTTTCTAAAC TAGATTTTTC AGCATTGATA ACTTCAAGAC TATCTATATC 2700
CATTTGAATA ATACCAGGCT TAATATTTTC ATCATTATTA GGAAAATATT TATATGTAAC 2760
ACTTTTATCA TTAATTTCTT TTATTTTTAA TATTAGCAAT CATTTCCACC TCTAATTAAT 2820
TAAAATACTA TAATTATATT TTATTTCTGT AAGTTTATGT GCCTCTATAT AGTGTAAATT 2880
ATACTTATTC ATTAGATAGT GTTCAAGAGC TTCATGTTTC TACATTATTA TATCCATTTT 2940
TTTAATATTT TTCCCTTCTC TTAAACGTTG CCAACTTTGA GCCATATAAA AGTCAGGATC 3000
AAATTGTTTA AATCCACTTT CTAATAAATA CTTATTTTCA AATATATGTT CATAAACTCT 3060
TTGAATTAAA TTTTTATTTA TATTAgTATT TTTAGCAATT TTAGAAATCT CTATCTGTTT 3120
ATCTCGATTT CTAACTGAAT TATAATAAAT TTGAGCATGT CTGTTCCTTT TGATACCGTA 3180
TTCATCACTT TTATTATTAA GTGCACCTGA TTCAATAAAA CAACOTTCTA CTTGATATTT 3240
ACCTTCACGC TCTTTAAAGA ACTTGTCTCG CCAATTGCCG ACATGTGGCA CTGTGGTACT 3300
TCTACACCAA GGATGCATAG GTGGCGCATT CACACCTGGT ATCATATCTT TAACTTTAAA 3360
TATTTTTCCG TTGAGTGAAT GACATAATTT AGATGTTTTA CTATCTATTT TGGCAACATA 3420
TTTATATTCG CCATCTTCCA CCAAGTTCyT TTAAATATGT TAACTTTTGT GACTCtTCyT 3480
TTTCAACGAA TAATGAAAAA AGCATATAAA AATAGCATCG CTTTCTCTTT ATCTGTGTAG 3540
AAAGGATGCT TTAATACCaT GCTATTTTAT AATTTTcGGG AAATTcTTGC TTCTCGATAA 3600
AGTCTCTTAC TACAGAAAAA GACTTATTAC GATATAACAT AAAATATTCT TCATTTTCTA 3660
TTTCAGATAA TGAGAAATCT ATAACCTCTG CATCTTTTTT ATTAAAAGTT ACTGAACCTT 3720
TACCGTTACT TATATCATCT TCAGGTATAT ATTC 3754

(2) INFORMATION FOR SEQ ID NO: 559:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 815 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 559: 

ATTTAACTAA AACTATAaAT AATCAAATGA TATTGGAAGA TATTaGCATA GATATCGAAA 60
AAGGTAAATT GACTTCTTTA ATTGGACCTA ATGGTGCGGG TAAGAGTACT TTACTTTCAG 120
CTGATTATAA AAATAATGAC TTGTCGAAAA AAATATCTAT ATTAAAACAA ACAAACCATA 240

CTGAAATGAA TATTACGGTA GAGCAGTTGG TAAACTTTGG ACGATTCCCT TATTCTAAAG 300 

GTCGTTTGAC GAAAGAGGAT CATGATATTG TCAATGATGC GCTAGATTTG TTGCAACTAC 360 

AAGATATCAG AAATCGTAAT ATTAAGTCAT TATCTGGTGG ACAACGTCAG CGTGCATACA 420 

TTGCAATGAC AATAGCACAA GATACTGAAT ATATTTTGCT AGATGAACCA TTAAATAATT 480

TAGATATGAA GCATGCTGTT CAAATTATGC AAACGTTAAA AATGTTAGCG CATAAAATGA 540 

ATAAAGCGAT TGTCATTGTG TTACATGATA TTAACTTTGC GTCCTGTTAT TCAGATCAGA 600 

TTGTAGCATT GAAAAACGGA CAACTAGTTA AGTCAGATTT GAAAGATAAT GTCATTCAAA 660 

GTAGTGTTTT AAGTGATTTA TATGACATGA ATATTCAAAT TGAACATATA AGAAATCAAA 720 

GGATTTGTTT ATATTTTAAG GATTGATAAT TTGGAGaCAC TTTAAAGGGG TGATGCGCCA 780 

ATTAAAGAAG GGTTAAACGT AAAGCATTTA TTTAT 815

(2) INFORMATION FOR SEQ ID NO: 560: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 919 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 560:

GAAACGAATA ATAAATTTAC TGAGTTATTA GTTGAAAAAG CTAATAAACA TGATGATGTT 60
CTCGATAmGT TGATTAATAT TTTAAAATAA GCGATACACA CTACTAAAAT TGTATTATTA 120
TTATGTTnAA TTGACnCCTC CTAAATTTGC AAAGATAGCA ATTTAGGAGG CGTGTTTATT 180
TTTATTGACG TCTAACTCTA AAAGATATAA ATTAGACATT TACAAATGAT GTAAATAACG 240
CAATTTCTAT CATCGCTGAT AACAATTCAT GGTTTAATAT GCAATGAGCA TATACTTTTT 300
AAATAGTATT ATTCACTAGT TTTAACAATC AATTAATTGG TATATGATAC TTTTATTGGT 360
TATTTTTATC CCATAGTGTG ATAATTACTA TTTTTCATTC ATAATAAAGG TTTAAAGCAT 420
GTTAATAGTG TGTAAGATTA ACATGTACTG AAAAACATGT TTAACATAAT GATATAAGGA 480
GTGACGTACA TGATCCGTCT AGGTAAAATG TCAGATTTAG ATCAAATCTT AAATCTAGTA 540
GAAGAAGCAA AAGAATTAAT GAAAGAACAC GACAACGAGC AATGGGACGA TCAGTACCCA 600
CTTTTAGAAC ATTTTGAAGA AGATATTGCT AAAGATTATT TGTACGTATT AGAGGAAAAT 660
GACAAAATTT ATGGCTTTAT TGTTGTCGAC CAAGACCAAG CAGAATGGTA yGnyGACATT 720
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TATAAAGGAG CTGCTACAGA ATTATTCAAT TATGTTATTG ATGTAGTTAA AGCACGTGGT 
GCAGAAGTTA TTTTAACGGA CACCTTTGCG TTAAACAAAC CTGCACAAGG TTTATTTGCC 
AAATTTGGAT TTCATAAGG 
(2) INFORMATION FOR SEQ ID NO: 561: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 518 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 561: 
ATCATATAAA CCGGCTGCTT CTAAAAACGA AAATACTGTT ACTGGACCTA AAAATTTAAA 
CCCGTATTGT TTTAAATCTT TAGATAGTTG TGTTGCAGTA TCATCAACTG TGATACGATC 
AGAAGCATGT TCATACTGCA AATCTTTAGG CTTACCATTT ACATATGACC ATAAAAATTT 
ACTAAAACTA CCATATGCTT GTTCAATTTT TAAATACCCT TGAGCTTGAT TAACAATTGC 
TTCTAATTTT TTACGATGAT GAACGATATT TGGAAAAGTC ATTAAGCGGT CGATATCTTG 
AGCGGTCATT TGTGCTACCT TTTCTGGTTC GAAATCATAA AATGCTTCTT CATAGGCTTC 
TTTCTTTTTT AAAATAGTTA ACCAAGATAG CCCAGCATGT TGTGaTTCTA ATGCTAAAAG 
TTTAAACAAT GCCTTGCnAT CATAGAGCGG TTGTCCCCAT ACATGGATCm TGATAGTCTA 
AGTAGACTGG GATCTTTAGT ACCAAATGCG CATTCATT 
(2) INFORMATION FOR SEQ ID NO: 562: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1539 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 562: 
CTTATTTAAA AGTAAATCAA TCAATGTATT ATAATCCGAA TAGTCCGCAT AAAGCTGGTT 
TGCaGGCAAA tCAATTACTA CAACAAGCAA AAACCCAAAT TAATGCAATG rTTaATTCAA 
AAACAAATTA TGATGTTGTA TTCACTAGTG GTGCAcTGAA TCCAATAATC TTGCTTTAAA 
AGGTATTGCC TATCGTAAAT TTGATACAGC GAAGGAAATA ATTACATCCG TGTTAGAGCA 
TCCGTCCGTA TTAGAGGTTG TAAGATATTT GGAAGCACAC GAAGGATTTA AAGTTAAATA
CAAAGTCGGT TTAGTAACAT GTATGTATGT AAATAATGTA ACTGGACAAA TACAGCCTAT 420
TCCACAAATG GCTAAAGTTA TAAAAAATTA TCCTAAGGCA CATTTTCATG TAGATGCGGT 480
TCAAGCATTC GGCAAAATTT CAATGGATCT CAATAACATA GATAGTATTA GTTTAAGTGG 540
ACACAAGTTT AATGGTTTAA AAGGACAAGG CGTCTTACTT GTAAATCACA TTCAAAATGT 600
TGAACCAACT GTCCATGGTG GTGGTCAAGA ATATGGTGTT AGAAGTGGAA CAGTTAATTT 660
GCCAAATGAT ATTGCAATGG TTAAAGCGAT GAAGATAGCT AATGAAAACT TTGAAGCATT 720
GAATGCATTT GTTACTGAGT TAAATAATGA CGTCCGTCAA TTTTTAAATA AATATCATGG 780
AGTTTATATT AATTCTTCAA CTTCAGGTTC ACCATTCGTT TTAAATATTA GTTTTCCTGG 840
CGTAAAAGGT GAAGTATTAG TTAATGCTTT TTCAAAATAT GACATTATGA TATCTACGAC 900
AAGTGCTTGT TCATCTAAAC GTAATAAATT AAATGAAGTA TTGGCTGCAA TGGGATTATC 960
AGACAAATCT ATTGAAGGTA GTATAAGATT ATCATTTGGG GCTACTACAA CTAAAGAAGA 1020
TATAGCGAGG TTTAAAGAAA TATTTATCAT CATTTATGAG GAAATTAAGG AGTTGCTAAA 1080
ATAATGAAGT ATGATCACTT GCTTGTTAGA TACGGGGAGT TAACATTAAA GGGTTCAAAT 1140
AGAAAGAAAT TTGTAAATCA ATTAAGAAAT AATGTAAATA AGTCaTTAAA AGGACTTGAT 1200
GGGTTTGTCG TTAAAGGCaA ACGAGATCGT ATGTATATTG AACTTGAAGA CCATGCaGAT 1260
ATAAATGAAA TAACATATCG ATTATCAAAA ATTTTCGGTA TTAAATCTAT TAGTCCAGTA 1320
TTAAAAGTAG AAAAAACAAT AGAGGCAATA AGTGCAGCGG CAATTAAATT gCGCAGaATT 1380
TGAAGaAAAC AGCACATTTA AAATTGATGT GAAGCGTGCC CGATTAAAAT TTCCCCAATG 1440
GATACGGTAT GGAATTACAG CGTGGAATTG GGGTGGTGCC AGTATTGGAG CACTTCGCCA 1500
TATTTCCAGT GGATGTCCAA CGTCCCAGnC CCAGGAATT 1539

(2) INFORMATION FOR SEQ ID NO: 563: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 968 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 563:
ATAnCGTATA CATGTGTTCT TTAAAATTGT GATAAGGAGT TTAGGATGGT TTATTTAAAA 60
TCAATAGATG CCATTGGATT TAAGTCTTTT GCAGATCAAA CCAATGTTCA ATTCGATAAA 120
GGTGTAACTG CAATTGTTGG TCCAAATGGA AGCGGTAAAA GTAATATTAC AGATGCTATT 180
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ATCTTCTCAG GTGCAGAACA TCGCAAAGCT CAAAATTATG CTGAAGTACA GTTAAGATTA 
GATAATCATT CTAAAAAGCT CAGTGTTGAT GAAAACGAAG TTATTGTAAC AAGAAGATTG 
TATCGAAGTG GTGAAAGTGA GTACTACATA AATAATGACC GTGCAAGATT AAAAGATATT 
GCCGATTTAT TTTTAGATTC TGGATTGGGA AAAGAAGCGT ATAGCATTaT CTCGCAAGGT 
AGAGTTGATG AAATACTAAA TGCTAAACCA ATTGATAGAC GTCAAATTAT TGAAGAATCG 
GCTGGTGTAC TTAAATATAA AAAACGTAAG GCTGAATCAT TAAATAAACT TGACCAAACA 
GAAGATAATT TAACGAGAGT AGAAGACATT TTATATGATT TGGAAGGTCG CGTAGAACCT 
CTAAAAGAGG AGGCAGCTAT AGCTAAAGAA TATAAGACAC TTTCACATCA AATGAAACAT 
AGTGACATTG TAGTTACAGT sCACGATATT GATCAATATA CAAATGACAA TAGACAATTA 
GATCAACGTT TAAATGATTT ACAAGGCCAA CAAGCAAATA AAGAAGCTGA CAAGCAACGT 
TTAAGCCAAC AAATTCAACA ATATAAAGGT AAACGTCATC AACTTGATAA TGATGTTGAA 
TCgCTTAATT ATCAATTAGT AAAAGCTACG GAAGCCTTTG AAAAATATAC GGGACAATTA 
AATGTTTT 

(2) INFORMATION FOR SEQ ID NO: 564:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 436 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 564: 
TTGTGTGTAT GCATTCAATG TGCTCTGTTT GTAAATGGCT AGCTATATAA TTTAgGATTC 
GAGGATCGTC ATCGACAACA AGACATTGCA CCATAGCTAT AAACTCCCTT ATCTTTTTCA 
TTTATTATAC ATGTAAAATA TTTTTGCGTA AAAAAACAAT TGTTCATATT GAGTTCATAT 
TTCAACCTTA TACTGACGCT AAAGAAGAAA TAGGGAGAAG TGAATCGATA TGAAATTAGC 
GATAAAAGAG ATTATGTTTT ACAAATTTCG TTATATTTTA ATCACATTAA TCATTCTTTT 
ATTAAGTATT ATGGTGTTAT TTATTAGTGG TTTAGCTCAn GGGCTTGGTA GGGaGAATAT 
TTCGTVATTT GAACACTTTG GATAATGATG aaTATGtTGT TCaAAAAATG AAAGAGCCGC 
aAATTGaGAA ATCGCA 

(2) INFORMATION FOR SEQ ID NO: 565: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2554 base pairs 
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 565: 

ATGTTTTGTC AATATCAATT GTTTGAGATA AATCCGCTTG TATAAAACGA ACTTTATCAT 60
CATTAAATTT GCTAGTTAAT TCATTTATAT CAGTACGATA ATATTGCACA TAAACTTCAA 120
ATCCATCAGT TAATAATTGT TTGACTATCT CAGAACCAAT TGAACCAGAA CCACCTAATA 180
CTAATGCTTT CATTACTTTT TAATCTCCAA ACGACTATCG ACTTGCTGAT CTAAATTTAA 240
ATATAACGAT GACGTTTCGT TAATACTATC TAACGTGATA TTTTCAACAA TGTCTAACAT 300
GTCAAACACG CTAACACCTT CAAAATACAA TTTAGTATAT TGATTAGCAA TATATTCAGG 360
TGAGTTTAAA CTTGATATGA ATTCACCTAT AAATTGCTTT TTCAAAAGTT CAAATGCTTC 420
TGCATCTTGG AAAtTGCCTT TTTTATCACG CAACTCATCT AATAATAATT TTTTTAATTT 480
ATCTGGTTCT TCAGTAGCAC TTGTCACGAT TGAAAAACTA TACGTCGGCT CTAGTACAAA 540
TTGATAACCA AATGTATCAT CGATAAGTCC TTCGTTTAAT AAATTCTGAT AAAAATCTGT 600
TTCTTCCCCA AAAATTAACT CAAAGAATAA TGACATTTCT AAATCACGTT GTACATATTT 660
TTGAGGCGCT TCTTGTAATG GTTTATTTTT AAAACCAAGC ATTAGTCTTG GTGATTGAAT 720
TTTCATAGAT TCAGTAACAA ATGCTTCTTT AACATCcTCC GGTTCATCAA CAAGTCCTCG 780
TTCGATTTTG GGTTGGTTAA CTTTATTACG AGCATCCTCG TGTTGTTTTA CTATTCgACA 840
TATTGCTTCA GGATCCACAT CGCCAACAAC AAATAAAACC ATATTTGATG GATGATAAAA 900
CGTTTCATAA CATAGATACA AATCATCTTT TGTAATATCG TATATACTTT CTACACTACC 960
GGCAATATCA ACACGTATTG GATGTTGTTG ATACATTGCA CGCAAtGTAT TaAACATTAA 1020
TTTATATCCA GGTTGCTCTT GaTACATTTT TATTTCTTCT GCAATAATAC CTTTTTCTTT 1080
ATCAACAGTT TCTTTTGTAA AATAAGGCGT TTCaACCATT GTAAGTAAAC GTTTAATGTT 1140
GTTTTCAATA TtATCAGTTG CACTGAACAA GTAGCTTGTA CGATCAAAGC TTGtAAACGC 1200
ATTTGCTTGT GCGTTATCTT CAGCAAACGC AGTAAATAAG CTTCTTCTTC TTTTTCAAAT 1260
AATTTATGTT CcTAAAAAGT GAGCAACTCC ATCAGGTACA GTAACAAATT GGTCTTGTCC 1320
AAGGGGTTTG AATTGATTAT CTAATGAACC AAATTGTGTA GTGTAAGTGA CAAATGTCTT 1380
TTGAAAACCT GGTTkGGGGA TAATAAATAA TCGTAAACCA TTTTCTAATt CTTGTTCGAA 1440
tACTCTTTCG TCTATTAATT CATAATAACG CTCTTTCATT ATTTATCCCC TCCTTTTGTC 1500
AACACATAGA TtGTATCTAA AAATGCTTTT TCAGCAACAG AAACAATATC TTCGCGACTT 1560
TTATGCATAA TCTCTATAAT ACTTTTCGGA CGATCTTCAG ATTCATATCG ATGAGAAATG 1680
ATTACTTTTT TAGCTAACTC TAATTTTTCT TCAGTGAAAT CTCCTGCTTT TATTTTTTCA 1740
AATTCACTTA TAATAGTGTC TTTTGCAGTT TCGTACTTAT CACTTGAAAC CCCACTCAAA 1800
ACAAATAAAT AGCCATTTTT GCCATCAATT TGTGAATGTA TAGAGTACGC TAAACTTTGC 1860
TTTTCTCGCA CTTCATTAAA TAAAACAGAT GAAGGATCTC CTCCAAACAT CATGTTAAAT 1920
ACAACAAAGG CAGCATATCC ACTTTGTCCA TATTGTGTTG GAAAACGGTA TCCCATATTT 1980
AATTTAGCTT GATCCACGTC ATCATATTCA ACAATATAAT CAACTTCTTC ATCGTGTAAA 2040
TGATGAGTAG AATGTTGGAA TTGATGTTTA TCGAATGGTT TAAGTGCAAA TTTTTCACGT 2100
ATTTGTTTCT CAACACTTTC AGGTTCTACA TTGCCGACAA CATAAACAGA ACATTGATCA 2160
TTATTAATCA TTGATTGATA TGTATGATAT AGTGTTTCAG CAGTAATATG TGGGATTTGT 2220
TCTAGTTGTC CTGTAGATAA GTATTTATAT GCTTCATTTT CAAACATATG GTCGAGTAAT 2280
TTTAAAAACG AATATTGTGC TTTATTATCT ACCATTGCTT CTATTTTTTT GGCTAATAAT 2340
GTTTTCTCTT GGTTAACAAA ATTATCATTG AATGCTTTAT TTTCAATTAA TGGATTCCAA 2400
ATGATTTCTT GtAATAAATC TAATCCTTGa TTAAATAATG AwTCACCGkT TCyTAAATAA 2460
CGkkCaTTAA caATTyCTAA tGaAAATGtA ATgACaTGCT GaTCTTTGAA TTTTGAAATT 2520
GTACTATTCA CATACGCACC ATATAAATCG GCTA 2554

(2) INFORMATION FOR SEQ ID NO: 566: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1424 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 566:
TnTCGCTATT TTnAATTGGT TTTGTATGGT TTAAGTTATA TCAATATACA ACAnACCCTA 60
AAGCTGATAT CCCAGGTATC ATTTTTAGTA CGATTGGTTT TGGTGCTTTG TTATATGGTT 120
TCTCAGAAGC TGGCAACAAA GGTTGGGGTT CAGTAGAGAT AGAAACAATG TTTGCGATTG 180
GTATTATCTT TATTATTCTA TTCGTTATTA GAGAATTAAG AATGAAATCA CCAATGTTGA 240
ATTTAGAAGT ATTGAAATTC CCaACATTTA CATTAACAAC AATTATTAAT ATGGTTGTAA 300
TGTTAAGTTT ATATGGTGGT ATGATTTTAT TACCGATTTA TTTACAAAAT TTACGCGGAT 360
TcTCAGCATT AGATTCCGGA TTGTTATTAT TACCTGGTTC TCTAATTATG GGTCTACTAG 420
TTGCTGTAAT GACTTATGCA ACATGGGAAT TAACTAAATT AAATATGGAT ACACCATATA 540
TGACAATCAT GGGTATCTAT GTACTTCGTT CATTTGGTAT GGCATTTATA ATGATGCCAA 600
TGGTAACTGC AGCTATTAAT GCGTTACCGG GACGACTTGC CTCTCATGGT AATGCTTTCT 660
TAAATACGAT GCGTCAATTA GCAGGCTCTA TAGGTACAGC AATCTTAGTT ACTGTAATGA 720
CAACACAAAC TACACAACAC TTATCAGCTT TTGGGGAAGA GTTAGATAAA ACGAATCCTG 780
TTGTACAAGA TCATATGCGT GAATTAGCAT CACAATATGG CGGACAAGAA GGCGCAATGA 840
AAGTGTTACT ACAATTTGTA AATAAACTAG CAACGGTTGA AGGTATTAAT GATGCATTTA 900
TAGTTGCAAC GATATTTAGC ATCATCGCCT TAATTTTATG TTTATTTTTA CAAAGTAATA 960
AAAAAGCAAA AGCTACAGCT CAAAAGTTAG ATGCAGATAA TAGTATCAAT CATGAATAAA 1020
TAAAATAAAT TAATTGAAGT GTGACTAATC AAAAATTATG TTGTGGGGAC ATGATTTTTA 1080
AAGTATCGGT GCCAAATATG GTTATCGATA CTTTTTTTAT TTGTTGATTT ATAGAATGTT 1140
AGAGGAATTA TATTAAAATT TGGCATTGAC GTAGTAGGTC ATTAATAAAG AAAAAGCAGG 1200
AAGTGGGTCA ACGAAATGAA TTTTGTGAAA ATAACATTTC TGTCCCAATC CCTACTATAT 1260
AACATTATTT TAAACGAGGC ATGCGATTAC GGAAGAATAA GCTTATAACA AGTAAACCGA 1320
TGCTACAGCC AAGTAAAATG ATGCCGTTAT GAATAGCGTC ACTTgCTGTA ATCACTTGAT 1380
CTGGTGGTAC ATTTAAATAA TATTTTTTGA AAACATCTGC AATT 1424

(2) INFORMATION FOR SEQ ID NO: 567:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 676 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 567:
TCTCCCATGT TCTGCTAAAT GACGCATCAC TTTTACTTCA TGAGGCGTCA ATACACGTCC 60
TTCACCAGCA TTCAAACCGA CAACATTTAA AGGCCCATAT TCAATACGAG ACAGTTTCGT 120
CACTTGATGA CCAAAATGTT CGAACATTCT TCTGACTTGG CGATTACGAC CTTCTGTAAT 180
TGTAATTTCA ACCAATGTTG TGTTTTTATC TTTATCTTGT TTCTTAACTT TCACTTCAGC 240
CGGTTGCGTC ATACCATCTT CTAATTCAAT ACCTTTTTCT AGCGCTTTCA CTTCTTCTCT 300
CATTAAATAA CCTTTTAATT TCGCAACATA TTTTTTCTTA ATTTGATATC TTGGATGTGT 360
CATTAAATTA GTAAATTCAC CATCATTTGT GAGTAATAAC AATCCAGAAG TATCATAGTC 420
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ACGTCCTCTA TCATCAGATA CACTTGTkAT CACTTGAGTT GGkTTATGGA AkAAAATGkA 540 

AAtTTTGTCT TCTAGTTCTA TTTTAATACC TTCAACTTCA ATCGTATCTG ATGGCTTCAC 600 

TTTTGTTCCT AATTCAGTGA CAGTCGTACC ATTCACTTTC ACTTTTCCTT CAGAAATTAA 660 

AGTTTCTGCC TTACGT 676

(2) INFORMATION FOR SEQ ID NO: 568: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 568:

GAAACGGTTC TACCAAAAAA CAGTAAGGGC TAAACCCAAT CATGGTAAGA CAAAAAGTAC 60

AAATAGCCAT GCCCAAGTTG AACTCGCTGT ACGCCTATTT CTTTCTAAAA AGATAATAAT 120

AAAAGCCAAT ACTAAATTAA TGATGAATCC AATGGCTAAA ATAATAGTAA ATAACGTTCC 180

TAAATCGTTT GAAAATGTAA ATCGCATAGT CTTTTCTCCT ATAAAGAAAG GCACAAAAAA 240

ACATTTTGCA CCTTTCACGT CATATTATTT ATTCACAGAT AAAGTTAAAA TTGcATTGAA 300 

TTCTTCTTCA TTATTTGGGA ATGTTCTTTC TTCTATTTCT TTAATAGTAA TATTTACTAA 360 

TTTTAAATTT GTAGCTTCTT CAGAACTTAA AAAAGCATTA ATGTTTTTTT CTAATAACTC 420 

kAAAGTCTCA GCTGtAAAaG TTTTAAGTTT AATT 454 
(2) INFORMATION FOR SEQ ID NO: 569:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 894 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 569:

ACGATATTAC CACCCTCTGA AATAACATCA ATCATGTGAT CTGTAAAGTC CCAAGGATGA 60
CTTGTTGTGA AACGAACTCT TGGAATCGCT ATTTTAGAAA TTGCTTGTAA AAGATCTCCT 120
AAGTCATATT CTATATCCTG TAAATCTTTA CCATAAGAAT TTACATTTTG ACCTAAAAGC 180
GTTATTTCTT TGTAACCTTC ACGAGCAAGT TCACGTACTT CATCTATAAT GTCTTCAGGT 240
CTACGGCTTC GTTCTTTACC TCTTGTAAAT GGAACAATAC AATATGTACA AAACTTATCA 300
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TCAATAACGT CTCCTTCTTT AGACCATACT TCAACAACCA TTGCTTTAGA TAAGTATGCT 420 

TCTTCTAAAA TTTCTGGTAA ATGATGAATA TTATGTGTAC CAAATATCAT ATCTACATTT 480

TGATACGATT TTAAAATTTT ATTCACTACT GACTCTTCTT GTGACATACA ACCACAAACA 540 

CCGATTAAAA TATCAGGTCG TTCTTTTTTC AAATrCTTCA AATTACCTAT TTCACTAAAC 600 

ACTTTGTTCT CGGCATTTTC TCTAATCGCA CATGTATTAA TTAAAATAAC ATCTGCAGTG 660 

TTAATATCAG TCGTkGCTTG aTAGCCTAAT GCyTCmAGTA TACCAGCAAT GACCTCAGTG 720 

TCATGTGCAT TCATTTGACA TCCATATGTT TTAATTAAAA ATGTACGCTC GTTCCCCATA 780 

CCGCGATATT TTyCATCAAT TtGGgAAATC nCTATTATAA CGAACTTCTk GTtTACCnCC 840 

TTTTTTnCGC TCCTTTAAAA TTAAGGCGGC TGATAAACAG GTCCAAAATA TTAC 894
(2) INFORMATION FOR SEQ ID NO: 570:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 570: 
TATCAATCCC ACAGCACATG CTGAACAAGA TCAAACATGG GAGAAGATTA AAGAACGCGG 60 
TGAACTTAGA GTGGGTCTTT CTGCAGATTA TGCACCAATG GAATTTGAGC ATACAGTTAA 120 
TGGTAAGACT GAGTATGCAG GTGTAGATAT TGATTTAGCT AAAAAAATTG CGAAAGATAA 180 
TAATTTAAAA TTAAAAATCG TCAATATGTC ATTTGATAGT TTGTTAGGAG CTCTTAAAAC 240
TGGAAAAATT GATATTATTA TTTCCGGAAT GACTTCAACG CCTGAACGTA AGAAGCAAGT 300
TGATTTTTCA GATTCATATA TGATGACTAA AAATATCATG CTTGTAAAGA AAGATAAAGT 360
TAATGAATAT AAAGATATCm AAGACTTTAA TAATAAAanA GTnGGGGCAC AAAGGGACTG 420
AACCAGAAAA AATCGCTCAA C 441 
(2) INFORMATION FOR SEQ ID NO: 571: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1205 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 571:
AAGTCGTTCA TCTTTTTCTA CAAGGGTGTA AAAATAATCA ATCATATCGT ATAACGCTTC 120
TTTACCAATG ATGTCATATG GTGTTGTTGT CATTTAATCA CCCATTTTCA AAAATTTACT 180
GTTACGAACT TAAGTTAATA TATAACTAAT ATAACATGAT TTTAAACATT TGAAAGAAAT 240
ATGCATATTT GCCAATTTAA TTTATATTGT TTGAAAGTGT TTCTTTTTTC TTGAAAAAAC 300
GTTGAACTTT ATTTAAAGGT tGATGATGTT CGAGGTTTAG TTCGTTtAAT AAAGATtGGA 360
ACTTTTGTAA ACCTTGATTA TAGTCTTTAA CTTCGAACTC TAACTCATAA TCCGTAGTAT 420
CGAAATACTC ACTTTTATCT AAAACCAGTA AATCACCTTT ATATTTAGTT TCTTGGCGAT 480
ATGTCGTTAA TGCACCAAGT ATTGATAAAG TTGTATCTTT TACACCAAAC TGTTCAACTA 540
TAATTTGACG AATGTCATCT GGAAGATTGT CGTTTGAAAT AATCAAGTTC ATCTCTGGTT 600
TAATGTCGAC GATATAGTTG TATTCTAATA GACCAACCTT TGCTGGTGTC TTTAAAGTCA 660
TTTCATATTG ATTGTCTTTA ACTCTTATGC GTAGTGCAGA GCGATGTTCC TTTAATTTGA 720
AATCGGGTGT ATCAATATAG TAATTGACTT GCTTAmAAAG CACACTGTCT TTAAAATATT 780
TCTCTTGCAA TTTATTATAG ATTGAtGCAG TTATCATTTG TtTAAATTCT ATtTCATGAT 840
TTGTTGCCAT GATATGTATA CACCTCGTAT CAAATTCAAT TTATCTTAAC TATATTATGA 900
ATGACAAAGT TGAATTTTAA AAGTAATTTC CTTTATCTAT TATCAATGTT AATTTGACCA 960
TTAAAAATAG TGTTCGTAAG TGTTTTGTAT TATTGaATTG TGTTAAAATG TTATGGAATA 1020
AGAGGAGGAT TAAGCATGSG TTTwTATATT AATGAAATTA AAATTAAAGA TGACATACTT 1080
TATTGTTATA CAGAAGATTC TATTAAAGGA TTATCTGAAG TAGGACAAAT GCTCGTTGAT 1140
AGTGATAATT ATGCCTTTGC GTATACATTA GATGATGGTA AAGCGTATGC TTATCTCATT 1200
TTCGT 1205
(2) INFORMATION FOR SEQ ID NO: 572: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 570 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 572:

TGAAGAAATA GCAATGATGA AATGCCGTAT GTTATATGAG ACGGGGTCAT TTCTTGAATT 60

AAGAGAAGAA ACAATTGTCT TATTGAAAAC TGGCATACAA CAATATGATG CATTGATGAT 120

TTATTACGTG AAAAGTTTGA TTGGTTTGGG ACAATATTTT GAAGCGGTAG AAGTAATTCA 180
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ATTTGCTAAG TCAAAATTAA TTGAAGATGA AAAACGATTG ACTCAGTCAT TAGCTGATTT 300 

TGrTACGTTA TCAATGAGGG AACAGACGCA CTTGATTTTG AAGTTAATAG ACAATGGTCA 360 

TTTTCAATTT CAAGAAACGG TATTATATAT ATkAAaAyCT AATaCGTACa GTtATAACCT 420 

CATTAGTTTA ATGATTGAGT ATTTAAGGTT CGCAAATTGT ACACAAGAAC TGACAATTGA 480

AAAGTATGGT ATGGATGTAA CTTTTGTACC AGCTAATTTA AAAGGGCTAG AACATACAAC 540 

ACTTAAAGAA AAAGTTATAC CTAACGTTAT 570 
(2) INFORMATION FOR SEQ ID NO: 573: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 939 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 573: 
GTTGAATGGT TAGCAGCTGC AGTTGTATTA TATTTCTGTG GTGTAATTGT TGACGCTCAT 60 

GTATCATTCA TGTCCTTTAT TGCAATATTT ATCATTGCTG CATTATCAGG TTTAGTCAGC 120 

TTTATTCCTG GTGGTTTCGG CGCTTTCGAT TTAGTTGTAT TACTAGGATT TAAAACTTTA 180 

GGTGTCCCTG AGGAAAAAGT ATTATTAATG CTACTTCTAT ATCGTTTTGC GTACTATTTT 240 

GTACCGGTAA TTATTGCATT AATTTTATCA TCATTTGAAT TTGGTACATC AGCTAAGAAG 300 

TACATTGAGG GATCTAAATA CTTTATTCCT GCTAAAGATG TTACGTCATT TTTAATGTCT 360 

TATCAAAAGG ATATTATTGC TAAAATTCCA TCATTATCAT TAGCAATTTT AGTATTCTTT 420

ACAAGTATGA TCTTTTTTGT AAATAACTTA ACGATTGTkT ACGATGCTTT tATATGATGG 480

AAATCACTTA ACGTATTATA TTCTATtGGC AATTCATACT AGTGCTTGTT TATTACTTTT 540 

ACTGAATGTA GTTGGTATTT ATAAGCAAAG TAGACGTGCC ATTATCTTTG CTATGATTTC 600 

AATTTTATTA ATCACAGTGG CGACATTCTT CACTTACGCT TCATATATTT TAATAACATG 660 

GTTAGCTATT ATTTTTGTTC TGCTTATTGT AGCTTTCCGT AGAGCGAATA GGTTGAAACG 720 

CCCAGTAAGA ATGAGAAATA TAGTTGCAAT GCTTTTATTC AGTTTATTTA TTTTATATGT 780 

TAACCATATA TTTaTTGCTG GAACGTTATA TGCATTAGAT ATTTATACGA TTGAAATGCA 840 

TACATCTGTA TTGCGCTATT ACTTCTGGCT TACGATTTTA ATCATCGCTA TCATCATAGG 900

TATGATTGCA TGGTTGTTTG ATTATCAATT TAGCAAAGT 939 
(2) INFORMATION FOR SEQ ID NO: 574: 
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(A) LENGTH: 1059 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 574:

GAATTAATTA AATATTACAC ACAGCCTCAT TTTTCATTTT CAAATAAATG GCTATATCAA 60 

TATGATAATG GAAACATTTA TGTTGAACTT AnGAGATATT CATGGTCAGC ACATATATCT 120 

TTATGGGGCG CTGAAaGTyG GGGAAATATT AATCAGTTAA AAGATCGTTA CGTAGATGTG 180 

TTTGGACTAA AAGACAAAGA TACTGATCAG TTATGGTGGT CTTATAGAGA GACATTTACA 240

GGTGGCGTTA CACCAGCCGC AAAACCTTCT GATAAAACTT ATAATCTTTT TGTGCAATAC 300 

AAAGATAAAC TACAAACGAT TATTGGTGCG CATAAAATAT ACCAAGGCAA TAAACCAGTA 360 

TTAACATTGA AAGAAATCGA TTTCCGTGCA CGAGAAGCGT TAATAAAAAA TAAAATATTA 420

TATAACGAAA ATCGTAATAA AGGTAAGCTT AAGATCACCG GTGGCGGTAA TAACTACACT 480 

ATTGATTTAA GCAAAAGATT ACATTCAGAT CTAGCAAATG TTTATGTTAA AAATCCTAAT 540 

AAAATAACTG TTGACGTCCT CTTTGATTAG TATATGAAGG TGACTTATAC TTCATGCACT 600 

TTAATTCCAA ATCAGATTAT TTAAATGATA ATTTTTAAAG TGTATGATGT ATATAATAGG 660 

TAAAATTTTC TATATATTTA AATGGAATTG GGAGTAGGAA TGTGACAGAA ATAGTATTTT 720

ATAAAATTTA TTCtTGTCAC TCCCCAACTT GCACATTATT GTAAGCTGAC TTTCCGCCAG 780 

CTTCTATGTT GGGGCCCCGC CAACTTGCAT TGTCTGTAGA aTTTCTTTTT GAAATTCTCT 840 

ATGTTGGGGC CCCGCCTATA ATTGAAAAAT GCTTGTTACA TGGGCATTTT CATTCGGTCA 900 

ACTACTACCA ATATAATATT GtAGaGCCTA AGACATTGAT TTATTATGTC TTAGGCTCTA 960 

TTCCTTCATT TAATGATTAA nTTATTATAG CAATACTTTA TTGTCCCATG ATTAGTGTTC 1020 

TTTTAATGAG ACATAGTAAC TATAAAGTTT AATAATCGT 1059 
(2) INFORMATION FOR SEQ ID NO: 575: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 575: 
GTTTGCTTTA GGTCCTGTTT CATATTTATA CTTCGAAGGA TTTACCTTTT TGaAGTCTGG 60 
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TTCAACTkGC TTTTTATTCT TTTCGAAATC AGCTGGTTGA GTAGTTATGA GTTCATTATT 180 

TTTATTAGmA TAAATCTTAC CATTAACATA TTTATAATCT TTTGTTATAA AGTCACCATT 240 

TCTGAATGGA ACTACTTGAT TATGACCTTT AGAGAATAAA TCAGTACCGA ACATTAAATA 300 

GTTCTTCGTA TCTATACCAG CCAAATGTAA AATTGTTGGC ATTACATCGA CTTGACCAGC 360 

ATATTCATTA TTGATACCAC CAGATTTACC AGGGATTTTA ATCCAGAAAC CAGTTCTGTT 420 

TAAATCTGTA AATTTAGCCG GTGTGATTTT TTCACCTAAT AGTTTTTCCA TGGCATTGTT 480

ATGGTTTTCA GAGATACCAT AGTGGTCACC ATAAATCATA ATCACTGAAT TGTCATATAA 540

TCCTTTTTTC TTCAAGTCAT TAATATATTC TTCT 574
(2) INFORMATION FOR SEQ ID NO: 576: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 576:

CAATGTTTTA TAGTACAATA TATTTTnAAT AATACTCGTT AAGGAGAATG ATATGATATC 60 

AATTCACGCA ATTTCGACAG GAAAAATCCA AGATTTGCCT TATAGCTCGA AAAGACCGAT 120 

GCGCTCTGCT TTAGATAAAA CTAAGATTTC ACAAACAACA TGGTTATCTT CAACTGGTTT 180 

CACTGGTGAT GAACAGGCTT ATAAAGATCA TGGTGGACCA CATAAAGCAG TTTGTGGGTT 240 

TAGTAAGCAT AATTATGCAC TGTATCAAGA TGATTTACCT ACACTACCTA CTCATGCGAT 300

GTTTGGAGAG AATTTAACAT TTGATTATTT AGACGAATCT GATGTTTACT TTGGTAATCA 360 

ATATCGTTTA GGTGAAGCGT TAATTGAGGT TTCTGAAATT AGAGAACCAT ACTGGAAAAT 420 

TCAAGCAAAA TATAATATTC CTGATTTAGT GAAGCGCATG TCTACATCTG GTAAAACAGG 480 

TTTCTATTTC CGGGTATTAA AACAAGGCTA TGTATCTCCA AATGATCAGC TTTACTTAAT 540

ACAAGAAGCA CCAATCGAAC ATCGTTTATC TGTACAACAG CTTAATGACC TTTATTATAA 600 

TGATAGACAA AATCAAGaTA TGTTACGATA TGCACTAAAC AATCCATTTC TGTCACCAAC 660 

AAGACGCGAT AAACTTCAAA AAATGTATAA CAGAACATTG GAAATAATTA CCTTTCATTn 720 

ATAAGTGTTA AATGAACTTT TCAAAACAnA AAGGAATCAA CTTCACACAT CGTTTGTATG 780

AATAGTCTTA TCTATA 796 
(2) INFORMATION FOR SEQ ID NO: 577: 

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(A) LENGTH: 1095 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 577: 

AGAAATTATG ACAAAATATA AAGATGGAAA GTTAGTTTAT GCATCAGTCG AACCAGGATC 60

TTACGTAATA CATAAAGATG ATGCAATTAA ATATGACGAT TATTCTAAGT TAAAAAAATT 120

AAGTCAGCTA ACTAAACTTG ATCATCCAAA ACCAGTTCCA TATAGCGTaC TCAAATCAAA 180

TCTTTCGGAG TACCTTTAAC AAGCGTTTCA TTTATGACAC ATGGATCAAA GGATACTAAA 240

GATGAAGTGT TGCCGGCATT GGCCTATTTC ACTTTTTCAC CAAAAAATTA TGAAGACAAG 300 

TCTAATCCAG ATCCAAAAGT TTTAAATTTA GTACATATGG ATTTCTTAAA TGCATCTAGT 360 


GATTTTGGTA ACGCACATTT TGTTGTTTTA AGTAAATATA TTAAAGAGTA TGAATCAAAC 420 
TATGAAACAG CGTCAGATGA TTCTTTAAAA TAGTATTTAC TGTGTGAAAA ATAAATAGTG 480 
TACTACATTA AATAATCGCA ATAATAATCC CGATAAACAA TCAGCATTAC TGCTTATCAC 540 


ATAGAGTTCG TAATAACTAT AACTCTATGA TTCGCAAATA ATAAATGATT GTCATCGGGA 600 
TTTATTTTTA TCAATTTATA AAGTGACATT ACCTTGTTCA TCAGCAGGTT TGAAAACAGT 660 

AATCACTGCA CTAATAATTG CTAAAATGTG TGGGATACCT GTCCAACAGA ATATTAAGTG 720
TAGAATACCT TGCATATTCT TGCCGGCATA AAATTTATGA ATACCAAAAC TACCTAAGAA 780
CAATGCTAAT AAAATATAAA TAACTTTGTT TACTTGCATT TCTTTCCCTC CAGTTGAATT 840

GCTTATAATG ACATTAGCTT CTCTTTTTAT TATACCCACT TTTAGTTCAA ACATTCTAGT 900
TTAAGCATTC CCAATCATCT AAATTTCAGT TATTCAATCC TTACAATAAA TTTAGGATTA 960
CATTTCAGTT GCATTGTATT ATTTTACGTG TGAAATATAC GTAATGAATC ACATGACAAy 1020

CTyCAAATTG AAAAATATAC ATTCTATGAT GTAAGGTCGC ATTTTTAATA TATTTACGTn 1080
AAAATAGTTT GGATG 1095

(2) INFORMATION FOR SEQ ID NO: 578: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 489 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 578:
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ACTGCCAAAA TAATCATAGC CATGTTCTAC AGCTGCTTTC GCTACAATAT CCAAACGCAT 120 

TTCAAAACAA GCGGTACAAC GTAAGCCGCC TTCTTTTTCA TCAGCTAATT CTTTATCCTT 180 

CACCATTTTC ACAAACTTAT GTGGTTCATA AGGTGCTTCA ATATACTTCA CATTCGCACC 240 

AGTCTTGCGA TTAAAATCTT CCACAAATTG TTCTTGCACT TTAGCACGTC GTAAGTACTC 300 

ATTTTTCGGA TGAATATTTG AATTCGCGAA ATAAATTGCA ATGTCTGCAT ATTGTGTTAA 360 

AAACTCTAAT GTATATGTAC TACAAGGTGC ACAACAACTA TGCAATAAGa TTTTAGGTCT 420 

GATTGCTTCT CTTTCCCACT GsCCGATTAA TnTCTTCAAC ACCTTGTCCA TAATTAATTT 480 

GTnGATTTn 489
(2) INFORMATION FOR SEQ ID NO: 579: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1287 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 579: 

CGTACTGCAT CATTTGTTGA AAAATTTGCA AACTTCTTCA AGATGTTATG GCTTAGACTT 60 

AAAGCGATGA AGCACTACAA AGCCTTAAAT AAAGAATCTA AGAAGCAAGA ATTTGAAAAT 120 

TCATTCAAAG ATGTTCAAAA AATTATGCGT ATTGTGAATC ACAATATTAT TTTACGCTTA 180 

AAAGAAGAAC AAAATAGTAC AAATGTACTT GAGGTTAGCT TAGTCATTAA TCATTACTAT 240

GATATGAGTC GCTCATTAAA GTGGCGTGCA CAACGTCGAA AAGAACGTCA AGAAAACAGC 300

AATCAAATCA TACCGCAAGC TATGTTCCAT AACCACAAAT TGGAAGCATT GTACTTACAA 360 

CGTCATCTTT TAGATGAATT AATTCGCAAA AATAAAATCA ACAATATCGT TGCAGCTCAA 420 

ATTCGAGAAA ATATCAATTa CAACGAAATT GTCTTGTCTT TACAGTCCAA ACATTaAGCA 480

AGaCwTaCaw TmCCCCCGTA CATATGAGAC AAAGTCATTA TCATCTCATA TGTACGGGGT 540

TTTTATATTC AACATCAAAA AATCAGATTG ATGAAAAGTA AATAACCTTT CATCAATCCG 600 

ATTTGATTAT AGAATCTATT TTTTAAGTTT AAATGGAATT GTACATACGT TAACATTCTT 660 

TTGATAAATT AAATACAATT TCATACGCAA ACTAGTTTGA TTGTGTAATA AATTATGCCA 720 

ACGTTTCTTA GTAATAAATT CTGGTATCAC TACTGTAATC ATATAGTTTT GATCGTTGGC 780 

TTTACGATTA ATCTTATCGA TAAAACGTGA AATTGGTCGT ATAATACTGC GATATTCCGA 840

ATGTAAAATA ACTAATCTTA CATCTGGGAA ATGACGTTTC CATTTCTCTT GGAATGCTTT 900
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TGCGTAATAA ATAGACTTAT CAACTGCTGT TGTAATACTT GTTATCGGCA CAATTGCTAA 1020 

ATTACGATCT ACCACGTCCA CATTAAGAAC ATCAATGTCA GAACGTAATT GTTCTGCGAT 1080 

ATCTCGATAA TGkTTGkTAA TTTTCAAGAA GAAAATCACC ACGAACGGCA AGAAAATAAG 1140

TATCGGCCAT ACTTGGCTAA ATTTAGTTAT GAGTAAAATC cATAAnAACA ATAAATGTCA 1200 

CGATACCACC AAGTAAGTTC ACAGACAACT TACTTAACCA ATTCTTAGGA CGTTCATGAA 1260 


TCCATTTAAT AACCATACCG AATTGTG 1287 
(2) INFORMATION FOR SEQ ID NO: 580: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 580: 
GCCTCCTTTC CTATATCCAT GGGTGTCCTA GTTGGGAATG GCTTATTCCC TATCCCAAAC 60 

ATCTGTCAAT TCAAGACTAT CACTGTATAC TAAATCGCCT nTCTATACnT CCATnCATTT 120
CATTTTCCAG TTGGAACAAT TTGTTTAATT ATTTTAAATA TCATTATCAA TCCAGAAGTA 180 
TTTACTATTC ACTTTTACAA TAATCAATCA TTTAACTACA CATGGGTAGT CGGTGGTTTG 240 


CTTGGGGTTA GCTTTTTAAC TGGCAATTTA TTGTTATTGC CAAAATTAGG TGCAACATTA 300 
ACTGTAATTG CAACAGTTGC GGGTCAAATT ATTATGGGTG TCATTATTGA TACATTTGGA 360

TTATTTGGCG CTACAATTCA TGATTTTAAT TTAATTAAAG CAATTGGAGT ATTGTTACTC 420

ATTGTCGGCA TCGTCATAAT GAATCAATTT AACAAGAATA ATTTATTACT AACTGATCAA 480
AAGTATTTAC TGTTTTGGCT TCTATTAGGA TTTATTTTTG GTTTCTTTCC ACCTATTCAA 540

ACGACAATTA ATAGTGCTTT AGCTAGTCAT ACTCATTCAC CAGCCTTTGC ATCATTAGTA 600
TCATTTACAA TTGGGTCAAT AGCGCTATTG ATTTTAACCG CTATTTTTAA TCGTTCTTTA 660 
AAACTAAAAA CAAGTCATTT AAAATTCGGT AAATTAAAGC CTATCTATTT TACTGGCGGT 720 

ATACTTGGCA TGGCTTTTGT AACAGCTAAC ATTATCTTAA TGCCTCATAT GGGTGCAGCA 780

TTAACAACAC TTATTGGGAT GTTTGGCCAG ATTCTAATGG GCATATTGAT AGATCACTTT 840 
GGATTATTTG GTTCACCTAA AATAGCAATG ACATCCAGAA AAACTATTGG TCTATTATGT 900 


ATTTTGACAG GCATTATACT TTTAAGATTA TTTTAAATTA ACTTTTAGCT TATCATTTTA 960 
ACTTGTAATT ATTTTTAAAA GTGATAAGCT ATTTTTTTGT GGTCTAAAAA TCTTTAGAAA 1020 

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CAACTCATTC TTAAGACCTA AATTAATGTT ATnTTTTAAT AATTTACACC AAATTAATAG 1140 

CAAAAATTAT GTTATTCGTG CTAATATTTC ATAGTTGGTT ATTCAATTAA TTAAAAATAA 1200

GTCAAAATGC ACAACTTTTT ATn 1223 



(2) INFORMATION FOR SEQ ID NO: 581: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 454 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 581:
CACCTTTTTA TCATGCTTAG TTATCAATAA ATCTATATTT TGCTGTTTTA CAATTTTTTT 60

AACTTTATCA ATCTCATTAT CTTGGACTAA ATAAATATAT GATCTTGCAT CTGTTGCTAG 120
AGCTTGTTCG TGTTTTTCTG ATAAAACATA TGTGATGGAA GCGTGAATAA TAATGCCTAA 180 
TGTAACAAAA CTGATAATTA ATATACTGCT TATCAATAAC ATTAAGCGGT GGTGAAACTT 240 

CATCATTGTT CTTTAGGTCT TTCCAATTTA TAGCCTAAGC CACGCACAGT TTTAATAAGT 300
TGTGGCTTCT TAGGATTATC TTCTAATTTA TCTCTTAAAT GACTGATATG TACATCAACA 360 
ATTCTTGAGT CTCCTGCAAA TTCATAATTC CATACCGTAT TTAACATATG CTCTCTCGTA 420 


ATGACTCTGC CTTGTCTTTC TATCAAATAA AGCA 454 

(2) INFORMATION FOR SEQ ID NO: 582: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 452 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 582:

TTTATAAGAT TTTATTTACA AATAATTGGT TTTCATATGT ATAAACACTT TTGACTTTCA 60

AATCTTAACG ATAATTCTAT TACAATACAA TCCCCTATTA GAATGATTTA TGTAATAAAA 120

AAAGCGGAGT TTCCCCCAGC TTTTCTAAAC GACTACATAA AATATAAGAT TGCAATTAAA 180

TGCAATAGTG ATGCTATTAC AATAAAAATA TGCCAAATCA TATGAAAATA TGGTCTATTC 240

TTTTGTGCAT AAAACCATGC ACCAATTGTA TAAGACACAC CACCTAAGAA AATGAATAAT 300

ATGAATATCC ATGATGTGCG AATAAAAATA ATTGGTAACA AGATAATACC TACCCAGCCC 360
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AAAATCCCCC AAAGTGTCGT TCCCCATAAT AA 452 
(2) INFORMATION FOR SEQ ID NO: 583: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 583:

CGCTTTTTGC AATAAGTCAT TAGCCGCTTT TAAGCCTTCT TCTTTTCGAT CTACAACAAG 60

TAAAATAAAT GGCTTTAACG CTTCTTCTTT TTCACTTTCA AGCATATCTG GTTTTTGAAC 120

CATTTCAAAT GGAGATTTCA ATCCATTATT ATCGCTCATT TCAATAATTG CATCATACTG 180

TGCTTGTGAC ATACTTGCAA TAGCCTGTTT TGCATTTTCT TGAAGGAAAT ATAAGTTTTT 240

CAATTTAGGA TGCTTATTTA ATGTACTTAA TGTAATCGGT GTAATGTCTT TCTCATAAGA 300

CACTTCAATC ACTGTACTAT TTGTTCTACC AGGAATTGGT GGTTTTTCAT GAATATGCTT 360

TGATACTTCT CCAATTCCAA CGACAGATTG ATTTTTCGTT CGATTATAAA AAATAATATT 420

GTCGCCTTCT TCTAACTGAG TATAAAAATG ATAACCATTA CGTTTAATAC CGTTGTACGT 480

GTGCGTATAA ATCGTATATT GGTTTCCAGG TTCAAATTCT TCAGTTTCAG CTAAAAAGAA 540

ATAACGCGGT ATCTTAATTT CGCCTTTACC AAGACCACTT ATTAAATCAA ACTCTTCTGC 600

AGTGATTTGA TTGAACAATG TCTCTTTCAT ATnACTTATA CGAAATTCCA AAGCTTCACT 660

ACGCTTTAAA TAATCTGCTG TTAATGGTTT CAATTGTTCA TTAAAACGAA ACTGTACACG 720

TATTTTATTT TGTGCACCTG TTTCAACACT AATAATTTCA CCACATCCAA GTAGTCCAGT 780

ATCCGTCTGA ACTTGATAAA AGATGACTTG ATCTCCTACT TTAGCCTTTT TAAACGCTCT 840

AAATCCTTGA GATGGGTTAA AATGTGCGCC TGATTCAAAT AAAGCTGTTT GTCCTACTAA 900

CGGTTCATTA TGATTCCAAC GGTTATATCC ACAATTCAAC CAAAAATAAT TCGTTTCTGC 960

TGTCATCTTA ATACTCCTTA ACCTGAATAA ATTTTAGAAA CACTATGAAT TACATTCTTT 1020

tAGTGTTTCT TATGCAGTTG GACGCGTATG CGAACAACTG TATACCCTTT GTTCACTGCG 1080

ATTTTAATCG CATTTCCTAT AACATTGTAG CGCCCAGGAC ATTAATTTAC GTCCCAGACC 1140

CTTATCGTTT TCACTTCTAA GTAAGTCGAA CTATTTTGCT TTACAACAAG TGCGACTCTA 1200

AATACAGTTG GACACACATA CGAGCAACTG TATACCTTTT AATCAGTTTT CTATATTTTA 1260

TTTATTATAT CTGTCTTAAT GATAAAAATT GTTACAAACA GTTTAACATA TTTAGCTACC 1320
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ATGATACCAC TATGCTTGCn TATCTCTATA GCGCCATTGA TACACATTTT TAAATATCTA 1440 
TACTGCCGTT AGAATTTTAT CATGTCTnAA TT 1472 
(2) INFORMATION FOR SEQ ID NO: 584:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 787 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 584:

GGTAGTGAAT GGGGTTCAAG ACAACAGTAT TGGAAGTACG AATGAATCAC AGTTTTTAGG 60 

AAATTATATT GTAATTAAGC ACGCAGAAAA TGAGTATAGC TTAATAGCTC ATTTACACCA 120 

ATATTCAATC ATTGTGAATG AGGGGCAAAA TGTTAAATAT GGTGATATCA TTGGGAAGGT 180 

TGGGAATTCT GGCAATTCTA CTGAACCTCA TATACATTTT CAAGTAATGA ATGATAAGAA 240 

TATTGAAGCA TGTACATCTT TAAAAATTCG ATTTATAAAT AATCGAGAAC TTATCAAAGG 300 

GGATGTGGTC TGCGGATTAC AAGCTGAATG ATGGCGATAC TTATAAAATC TCGACACTAT 360

AAAAATGGTA TAGTGTCGAG ATTTTCTTGC TTATTTAGTT AATTCAAAGT GCACGCCGGA 420 

TTCATTAGAA GTCGACGTAT TTTTGTTTGT AATAGAGTAA CCGGTCATTG AAATTTTAGA 480 

TTCAATATCT GAAGCGGAAT TTGTAGATTC AGGATTATAG AAGCTACATT CATAAGTGTT 540

ATCATCTTTC TTTTTAAGTA TAAACATACC TTTGGCTTTA ACTTCGACTT TAGTGTTGTT 600 

AATGTCAAAA GTTTGAGTGC TATTACTATA ATTAACACCA GCCCAAACCG ATTCATTATC 660 

TTTCACAACG GGGAAGTCAT CTTCTTGCTT AACGACGTnA CTTCATCTTT CTCTGTCTTA 720 

AAGACATCTT TAGATAAGCC TGGATACAAC ACATATCCAT ATTTATTGTC AGAATTAGAA 780 

TGCTTTT 787
(2) INFORMATION FOR SEQ ID NO: 585: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 585: 
ATGCTAAATT AGGGGGAATA TTTACGATAA AGAGACCAGA AAAATAATAA AACACATGCT 60 
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TAAAATGAAG ACTATTTTTT ATTACAAGAA AATGTATCTA GTAAACTTAA AGTAGCAAGA 180

CCTAATAAAT TTAATGCATG TTGTGCACCT TTTTTACCTT GGCCAGCTTC GAAATGTTTG 240

TAAGCAGCTA CACTTAAAAT GCCTATCGTT GATAGTGATG CAAGGCGAGA AATGTTTTTA 300

TTGATAAAGC TAGCTGAGTA TAAAGCAGCA GTAGTTGCTT CTGCAATGCC GACGTATTTT 360

ACAAGTTCTT TTTGCAAGCC AAAAGTATGT TCAAACAGTT CAATCATACC CTTATCTTCT 420

TGCAATTTAG GTTTACTGGC TTGGTATAGC TCTTTCGCAA GTTTTAAATT CGTTGCGTAA 480

CGCAAAATCA TATTTAATTC CTCCCAATAT TTGATTTTTT GTGAAAGATG ATTACTTTAT 540

CATTTTTACC CGTTTCTATA AAAATGAATC AATTATGTAA CGTATGTGTA GTTTAGGAAT 600

GTTTGCTATG GAAATATAAT TCTGTTCACT CAAAATGTAT GAAATTAATG TGTAGTTTTG 660

TCGAGTTGCT CTTTTAATTT GGTTAGATTG TTTTTTAGAG AAGCGGTACT ATTTTTAAGT 720

GCATCAACAG ATTTACCTTC GTTTTGAGAC ATTGAGTTTA TTACAGCACG AAGTTCTGTT 780

TCTAGTATGT CaGCGTCGCt TTAGCATTAG AACTTAaTAt TTAtAcTCTT 830



(2) INFORMATION FOR SEQ ID NO: 586: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 412 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 586:

TTAGGACGTT TTTACAATCA GTACAATGAG CTCATTGTTA TTAGTCCTTT AACGGCGTCT 60

TTTAATGCTG GCGCTACATT TGGGCGATTT CATCATTTAA TTGATACTGA AACTTTAGCA 120 

AAATTAGAAC ATGAAAAAGG ACATTATTAT CAGAAGATGA TATGTGATGA CAATGTAGAA 180 

ATGATTTCTA TAAATAACAT ACCGAAATAT CCGAGAAATC ATAATGTATT AACTAATCAT 240

GACTCATACG AATATTCATT GAATTTAGGA AGTAGTAATA GTTATTCAAA GTATGAGCTT 300

ACCTTAGATG ATATTTATGT TGGTGCTACC TTTtAACAAA TTATATTTAT ATTCTAGCCm 360

ACtAAATAAA AGGGkaCtaT TTGaATCaAA CmATaTGtAT TAACCTTTTT TA 412
(2) INFORMATION FOR SEQ ID NO: 587: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4709 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 587: 
TTCAGTTTGA AATTAATCAT ATAAATTTCT TATGGGAGGG TTGATATCTT AATGATTAAC 60

ATTATTTCAG CTATAGGATC TATTGGAACA TTTATTATGG CTTTATTTTA TTTTGTATCA 120

GTTTCAGTTC AACTTTATCA AATGAAAATT AGCTTTCTGC CAGCTTTAGG TTTTAACCAA 180 

ATTTTATTAG AAAGGGAGGA GGATCAACTT AATATAATGA ATTCGGCAAC AGAAGAGCAT 240 

CATCATAAAG ATTATATTAA ACTATATAAT TTAGGTGGCG GTGCTGCTAA rAAAATTGCA 300

ATAGAGGTTT TATTGGGGAA GGATArAGTC ATTCAGAAAA AATACGTGCA TATTkTACCT 360 

AGTAAAGAAG GGTACATGTT ACCAATTAAT AAAAATGTGT ACGAAGAATT AGAAAGAACG 420 

ATTGAsAACA ATGGTCATGA AGCTGATTTG AATGTACGTA TGACTTATTA TCATAATGTA 480

AGTCGCAAAC AACAGGAAGT TATATTAAAA GGTCAAATCG ACCGTTTTAA TACTTATAAT 540

AATAAAGAAA TTTATGATTT GCAGTTTATC TAAAAATTGA TTTAAGAGGG TAGTTGTTTA 600
TTGCGAAAAA TATCATTCAA TTTTAATGAA ATAATGGCGT CATTACTATA AAATATTACT 660 
TTATGTTGTA ATGCATTTTT CTATAAGATA GAACTAAAAG GAGGGGCAAA GATGCAAATT 720 

AGACAAATAC ATCAACATGA CTTTGCTCAA GTGGACCAGT TAATTAGAAC GGCATTTGAA 780
AATAGTGAAC ATGGTTATGG TAATGAATCA GAGCTAGTAG ACCAAATTCG TCTAAGTGAT 840
ACGTATGACA ATACCTTAGA ATTAGTAGCT GTTCTTCAAA ATGAAGTTGT AGGGCACGGT 900

TTACTAAGTG AAGTTTATCT TGATAACGAG GCACAACGGG AAATTGGATT AGTGTTAGCA 960 

CCTGTATCTG TTGATATTCA TCATCAAAAT AAAGGTATTG GGAAGCGATT GATTCAAGCA 1020 

TTAGAACGAG AAGCAATATT AAAAGGATAT AATTTTATCA GTGTATTAGG ATGGCCGACG 1080 


TATTATGCCA ATCTAGGATA TCAACGCGCA AGTATGTACG ACATTTATCC ACCATATGAT 1140 

GGTATACCAG ACGAAGCGTT TTTAATTAAA GAATTAAAAG TGAACAGTTT AGCGGGAAAA 1200 

ACAGGTACCA TAAATTACAC ATCTGCTTTT GAAAAAATAT GATTTCAAGC TAGGATTACA 1260

TTAGGTAGAG TTCATATTAA TAATAAAAAA TGTTTGCAAT CAAATCGTAC GTTGTCGTTT 1320 

GTAATTCTTA AAATAGCAAT AAATAAAATG TTTGTTAGTA AAGTATTATT GTGGATAATA 1380

AAATATCGAT ACAAATTAAT TGCTATAATG CAaTTTTAGT GTATAATTCC ATTGACAGAG 1440

ATTAAATATA TCTTTAAAGG GTATATAGTT AATATAAAAT GACTTTTTAA AAAGAGGGAA 1500 

TAAAATGAAT ATGAAGAAAA AAGAAAAACA CGCAATTCGG AAAAAATCGA TTGGCGTGGC 1560 


TTCAGTGCTT GTAGGTACGT TAATCGGTTT TGGACTACTC AGCAGTAAAG AAGCAGATGC 1620 

AAGTGAAAAT AGTGTTACGC AATCTGATAG CGCAAGTAAC GAAAGCAAAA GTAATGATTC 1680 

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GTCAAACACT AATAATGGCG AAACGAGTGT GGCGCAAAAT CCAGCACAAC AGGAAACGAC 1800 

ACAATCATCA TCAACAAATG CAACTACGGA AGAAACGCCG GTAACTGGTG AAGCTACTAC 1860 


TACGACAACG AATCAAGCTA ATACACCGGC AACAACTCAA TCAAGCAATA CAAATGCGGA 1920 

GGAATTAGTG AATCAAACAA GTAATGAAAC GACTTCTAAT GATACTAATA CAGTATCATC 1980 

TGTAAATTCA CCTCAAAATT CTACAAATGC GGAAAATGTT TCAACAACGC AAGATACTTC 2040 


AACTGAAGCA ACACCTTCAA ACAATGAATC AGCTCCACAG AGTACAGATG CAAGTAATAA 2100 

AGATGTAGTT AATCAAGCGG TTAATACAAG TGCGCCTAGA ATGAGAGCAT TTAGTTTAGC 2160 

GGCAGTAGCT GCAGATGCAC CGGTAGCTGG CACAGATATT ACGAATCAGT TGACGAATGT 2220

GACAGTTGGT ATTGACTCTG GTACGACTGT GTATCCGCAC CAAGCAGGTT ATGTCAAACT 2280

GAATTATGGT TTTTCAGTGC CTAATTCTGC TGTTAAAGGT GACACATTCA AAATAACTGT 2340

ACCTAAAGAA TTAAACTTAA ATGGTGTAAC TTCAACTGCT AAAGTGCCAC CAATTATGGC 2400

TGGAGATCAA GTATTGGCAA ATGGTGTAAT CGATAGTGAT GGTAATGTTA TTTATACATT 2460

TACAGACTAT GTAAATACTA AAGATGATGT AAAAGCAACT TTGACCATGC CCGCTTATAT 2520

TGACCCTGAA AATGTTAAAA AGACAGGTAA TGTGACATTG GCTACTGGCA TAGGTAGTAC 2580 

AACAGCAAAC AAAACAGTAT TAGTAGATTA TGAAAAATAT GGTAAGTTTT ATAACTTATC 2640

TATTAAAGGT ACAATTGACC AAATCGATAA AACAAATAAT ACGTATCGTC AGACAATTTA 2700

TGTCAATCCA AGTGGAGATA ACGTTATTGC GCCGGTTTTA ACAGGTAATT TAAAACCAAA 2760

TACGGATAGT AATGCATTAA TAGATCAGCA AAATACAAGT ATTAAAGTAT ATAAAGTAGA 2820

TAATGCAGCT GATTTATCTG AAAGTTACTT TGTGAATCCA GAAAACTTTG AGGATGTCAC 2880

TAATAGTGTG AATATTACAT TCCCAAATCC AAATCAATAT AAAGTAGAGT TTAATACGCC 2940

TGATGATCAA ATTACAACAC CGTATATAGT AGTTGTTAAT GGTCATATTG ATCCGAATAG 3000

CAAAGGTGAT TTAGCTTTAC GTTCAACTTT ATATGGGTAT AACTCGAATA TAATTTGGCG 3060 

CTCTATGTCA TGGGACAACG AAGTAGCATT TAATAACGGA TCAGGTTCTG GTGACGGTAT 3120 

CGATAAACCA GTTGTTCCTG AACAACCTGA TGAGCCTGGT GAAATTGAAC CAATTCCAGA 3180 

GGATTCAGAT TCTGACCCAG GTTCAGATTC TGGCAGCGAT TCTAATTCAG ATAGCGGTTC 3240

AGATTCGGGT AGTGATTCTA CATCAGATAG TGGTTCAGAT TCAGCGAGTG ATTCAGATTC 3300

AGCAAGTGAT TCAGACTCAG CGAGTGATTC AGATTCAGCA AGCGATTCCG ACTCAGCGAG 3360

CGATTCCGAC TCAGACAATG ACTCGGATTC AGATAGCGAT TCTGACTCAG ACAGTGACTC 3420 

AGATTCCGAC AGTGACTCAG ATTCAGATAG CGATTCTGAC TCAGACAGTG ACTCGGATTC 3480 
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5 



CGATTCTGAC TCCGACAGTG ATTCCGACTC AGACAGCGAT TCAGATTCCG ACAGTGATTC 3600

CGACTCAGAT AGCGATTCCG ACTCAGATAG CGACTCAGAT TCAGACAGCG ATTCAGATTC 3660

AGACAGCGAT TCAGATTCAG ATAGCGATTC AGATTCCGAC AGTGACTCAG ATTCCGACAG 3720

TGACTCGGAT TCAGATAGCG ATTCAGATTC CGACAGTGAC TCAGATTCCG ACAGTGACTC 3780

AGACTCAGAC AGTGATTCGG ATTCAGCGAG TGATTCGGAT TCAGATAGTG ATTCCGACTC 3840

CGACAGTGAC TCGGATTCAG ATAGCGACTC AGACTCGGAT AGCGACTCGG ATTCAGATAG 3900

CGATTCGGAC TCAGATAGCG ATTCAGAATC AGACAGCGAT TCAGATTCAG ACAGCGACTC 3960

AGACAGTGAC TCAGATTCAG ATAGTGACTC GGATTCAGCG AGTGATTCAG ACTCAGGTAG 4020

TGACTCCGAT TCATCAAGTG ATTCCGACTC AGAAAGTGAT TCAAATAGCG ATTCCGAGTC 4080

AGTTTCTAAC AATAATGTAG TTCCGCCTAA TTCACCTAAA AATGGTACTA ATGCTTCTAA 4140

TAAAAATGAG GCTAAAGATA GTAAAGAACC ATTACCAGAT ACAGGTTCTG AAGATGAAGC 4200

AAATACGTCA CTAATTTGGG GATTATTAGC ATCAATAGGT TCATTACTAC TTTTCAGAAG 4260

AAAAAAAGAA AATAAAGATA AGAAATAAGT AATAATGATA TTAAATTAAT CATATGATTC 4320

ATGAAgnAAc rCCTTAAAAG GTGGCTTTTT TACTTGGATT TTCCAAATAT ATTGTTTGAA 4380

TATAATTAAT AATTAATTCA TCAACAGTTA ATTATTTTAA AAAGGTAGAT GTTATATAAT 4440

TTGGCTTGGC GAAAAAATAG GGTGTAAGGT AGGTTGTTAA TTAGGGAAAA TTAAGGAGAA 4500

AATACAGTTG AAAAATAAAT TGCTAGTTTT ATCATTGGGA GCATTATGTG TATCACAAAT 4560

TTGGGAAAGT AATCGTGCGA GTGCAGTGGT TTCTGGGGAG AAGAATCCAT ATGTATCTAG 4620

TCGTTGAAAC TGACTAATAA TAAAAATAAA TCTAGAACAG TAGAAGAGTA TAAGAAAAGA 4680

TTGGATGATT TCAATATGGT CCnTTCCCA 4709



(2) INFORMATION FOR SEQ ID NO: 588: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 588:
CTTTTTTAAt TAwCgGaAtA TTGtCaTgaT tAcAcTTCGt TAGGGTTTAc gTCGtAATTT 60

CATTTaaTAA gCgCTTCAcC ATTAAAtGTG gTAmCCTTTA ATTCGCCAGT AGAAACATCA 120 
CAGTAACTAA GCGCAATTTC AGGTTGATTC ATAACAAAAC TTAAAATATA GTTATTTTGT 180 

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CGTCTAACCA TACCTTTCGT TTGTTTCGGA TCTTCCATCT GTTCACAAAT AGCTACTTTA 300 

TATCCATTAT TAACAAGTGT ATCTATATAA CTATCTGCAG AATGATACGG AACACCACAC 360 

ATCGGAATTG GATTTTCTTT TTTAGCATCT CTTTTAGTTA AAGTAATTTC AAGTAyACGT 420

GATGCCTCCT TGGCATCTTC ATAAAACATT TCATAGAAAT CACCTAGTCT AAAAAATAAT 480

AAGCAATCTT GGTATTCTGA TTTTATTTTT AAATATTGCT GCATCATTGG TGTAACATTA 540

GACATATTAT TTCTTCACAA CCCTTGTCTC TTTTTAAAAT TTGTCTTTAC AATATATTCG 600 

TTTGTAAGyT TTTTAATTAT TAATTATTTA ACTTATACAT TTTAACATAC TTACTTTTAC 660 

AAACCTATTC ATACCATATA ATCACGAAGC ATCTTAAATG TATAAGAAAA CGCCTCAAAC 720 


CTAATAAAAT GTGTCAATAG CATGTTTAGA ATTAAATTAA AATTCTAACA TTCAAGACAT 780 

TTAATTAAGT AAGGGCGTTC AATATTAAAA TGAACAATGA CTCTGTTTGA AATCATATAT 840 

CATAAAATTA TTTTATAAAC CTTTGAAGAA TACCACGTTT TTTTAGAGTA ATTAATAAGA 900

AATAACTTAT AATAGATCCG ATAGCACTTG aGACTATGaA CGTAATCATT AACGGTTTAA 960 

TGAAGAAGTC TTGAAGCCCA AGGaAATATG CTAATGGtAT aCaAATTAAA cTTCCgATGA 1020 

CaCCAGTTCC aAGTACTTCa CCGACCGCGG CCaTAAATAT ATGTTTACGA TATnygTAAA 1080

ACATACTAGC CAATAAAACT CCAATCATAC TACCCGGAAA TGCAAAAGst GTACCAGTAC 1140

CAAAAAGAAC TCTTAAAATT GATGATATAA GCGCTTGAGC TAATCCATAC CAAGGACCTA 1200

CTATGACCGC ACTTAATACA TTTACAAAAT GCTGTACTGG TGCTGCCTTA ACTGGTCCTA 1260

GAGGAATGAT GATAATACTG CTTAATACAA CATTTATTGC AATTAAAAGT GCAGTTATAG 1320

CCAGTTTTCT TGATTTCATA TGATTGTTCT CCTTTTTGTT TGTAATTAAT CACTATGCTT 1380

GGCTTTATTA TGGTCATTTA AACGTGTTTC CATTGTTGAT ACAAACATTT TCAATAATTG 1440
ATTCGCTTCA TATTGTGAAG TTTGAAACTG TTCAACTATG GGCAATGTAT TTATTTCTGC 1500
TTCTAtACTC TGAATGGtAT GTTCCGACTG nTCCAGCGCA TTTGTTTCCC GnAA 1554

(2) INFORMATION FOR SEQ ID NO: 589: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 589: 
TTTTGACGAA AAAAGTTGAT TTACAAATAT ATAAACGTTG TGATTTCAAT GTTTGTATAG 60 

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TATATACATG ACAGCAACTT GGGAAAAAAA GGAAGGTAAC GAAGGTTTAT TAACTGTTAC 180

TGTTCCTGCA GAAAAAGTAA ACAAAGCrCT GAGAGATCCC CTCATAATTT CCCCAAAGCG 240

TAACCATGTG TGAATAAATT TTGAGCTAGT AGGGTTGCAG CCACGAGTAA GTCTTCCCTT 300

GTTATTGTGT AGCCAGAATG CCGCAAAACT TCCATGCCTA AGCGAACTGT TGAGAGTACG 360

TTTCGATTTC TGACTGTGTT AGCCTGGAAG TGCTTGTCCC AACCTTGTTT CTGAGCATGA 420

ACGsCCGCAA GCCAACATGT TAGTTGAAGC ATCAGGGCGA TTAGCAGCAT GmTATCAAAA 480

CGCTCTGAGC TGCTCGTTCG GCTATGGCGT AGGCcTAGTC CGTAGgCAGG ACTTTTCAAG 540

TCTCGGAAGG yTTCTTCAAT CTGCATTCGC TTCGAATAGA TATTAACAAG TTGTTTGGGT 600

GTTcGAATTk CAACArGTaA GTtAGtTGCT AGAnCCCA 638




(2) INFORMATION FOR SEQ ID NO: 590: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1242 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 590: 








AAAATATTCC CGTACATTTT GATGTCTGTA GGGGCTTTTT TGACTTTAGG ATTTGTCATT 60

TTTTCAATTC ATAAAGGGAG ACGAACGAAA AATGAATCAG CACGTAAAAG TAACATTTGA 120

TTTTACTAAT TATAATTACG GCACATATGA CTTAGCAGTA CCAGCATATT TACCGATAAA 180

AAACTTAATA GCTTTAGTAT TGGATAGTTT GGACATTTCA ATATTTGATG TCAATACACA 240

AATTAAAGTG ATGACGAAAG GTCAATTACT TGTTGAAAAT GATCGACTCA TTGATTATCA 300

AATCGCTGAT GGAGATATTT TGAAGTTACT ATAGGAGGGA AAATAGATGG TTAAAAATCA 360

TAACCCTAAA AATGAAATGC aAGATATGTT AACGCCTTTA GATGCTGAAG AAGCAGCTAA 420

AACAAAATTA CGCTTAGATA TGAGAGAGAT TCCTAAGTCT TCAATTAAAC CAGAACATTT 480

TCATTTAATG TACTTATTAG AACAACATTC TCCATATTTT ATAGATGCTG AATTAACTGA 540

ACTACGTGAC aGTTTCCaAA TACATtATGA CATTAATGAC AATCATACAC CTTTTGATAA 600

TATTAAATCA TTTACTAAAA ATGAAAAATT ACGTTACTTA CTCAATATCA AAAATTTAGA 660

AGAAgTAAAT CGTACACGCT ACACATTTGT GTTGGCACCA GATGAATTAT TTTTCACAAG 720

AGATGGATTA CCCATTGCTA AAACAAGAGG GTTACAAAAT GTTGTTGATC CATTACCTGT 780

GTCAGAAGCT GAATTTTTAA CAAGATATAA AGCGCTGGTT ATCTGTGCAT TCAATGAGAA 840
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AACTAAAGTT ATTGAAGCGG CAACGTTAGA TTTACTAACG GCATTTTTAG ATGAACAGTA 960 

TCAGAAACAA GAACAAGATT ATAGTCAAAA TTATGCATAT GTACGCAAAG TAGGACATAC 1020 


CGTTTTCAAA TGGGTTGCTA TCGGTATGAC AACGTTAAGT GTTTTATTAA TTGCATTCTT 1080 

AGCCTTTTTA TATTTTTCAG TAATGAAGCA TAATGAGCGC ATTGAAAAAG GATACCAAGC 1140 

ATTTGTAAAG GATGtTATAC GCAAGTACTA AATACGTATG ATGATTTAGA TGGTaAAAAt 1200 

TgaTAAAGAG GCACTTTACA TTTATGCCAA AAGTTATATC CA 1242 

(2) INFORMATION FOR SEQ ID NO: 591:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 744 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 591:

TTCCAGATAG AGCCTTAGTT GCTGCCGCTG AATTGTCTGA TCGTTACATC ACTGATCGTT 60

TTTTTACCAG ATAAAGCGAT TGATTTAGTT GACCAAGCAT GTGCAACAAT TCGTACGGAA 120

ATGGGATCAA ATCCAACTGA ATTGGATCAA GTTAATAGAC GTGTCATGCA ATTAGAAATT 180

GAAGAAAGCG CACTTAAAAA TGAATCTGAC AATGCGAGCA AACAGAGATT ACAAGAACTA 240

CAAGAAGAGC TTGCCAATGA AAAAGAGAAA CAAGCAGCAC TTCAATCTCG TGTAGAATCA 300

GAAAAAGAAA AAATAGCAAA TTTACAAGAA AAACGTGCGC AACTAGATGA AAGTAGACAA 360

GCGTTGGAAG ATGCACAAAC AAATAACAAT TTAGAAAAAG CTGCTGAACT ACAATATGGA 420

ACAATTCCTC AATTGGAAAA AGAACTTAGA GAATTAGAGG ATAATTTCCA AGATGAGCAA 480

GGTGAAGATA CAGATCGAAT GATTCGTGAA GTTGTAACAG ACGAAGAAAT TGGCGATATT 540

GTCAGCCAAT GGACAGGCAT ACCAGTTTCA AAATTAGTTG AAACAGAACG TGAAAAATTA 600

CTTCACTTAA GTGACATCTT GCATAAACGT GTTGTAGGTC AAGATAAAGC GGTTGACCTG 660

GTTTCAGATG CAGTAGTTAG AGCAAGAGCA GGTATTAAAG TnCAAACAGA CCTATTGGTA 720

GTTTCTnATT CCTAGGTCCn ACTG 744



(2) INFORMATION FOR SEQ ID NO: 592: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1449 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 
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10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 592:

TTGTTATAGT GTCTGAATCT GTGTAACTCT TGCCCATGTG TTCTGAGTAA AGCACCCACT 60 

GTTTATTTAC TTTTCGTTGT AGTCTAGCTT CGTGTAGTAG TTTGTTTAAC TTTTGTGCTG 120 

ATATACCGTA GTCTGCCGCG ATTTGAGTTG TGGCTAATGT GCCAGTTGAT TTTAAGATTT 180

CATCAACATA ATCTGCTTTG GGTTTTAGCT CTCCAATTTC TTGTTGTAAA AGTAAGTTTT 240 

GCTCTTTTTC TTTCTTATAC TCAGTCAACA CTGTAATGAT GTAGTCTGGA TCTTTTAATG 300

TTTGTTCAAT TACATTGTCT GTTGCGTATA TACCG.TGTTT GCGAATAGCT GGTaGGACAT 360 

CTGATGTTAC CCAGCGTTTG AATTTTCTAG CGGTTTCTCT AATTTTTTCG TTTTTGCTTT 420 

GTTTAGAAGC ATCGAAGATT AGACTGTATA ATCCTGATTC GTTGATAATG ATCATATTTC 480

TGTTTTGACC TGATGCACTA AATTGGTGCG TCAGTTTGTC CTCGCTATCA ACATGATTTC 540

TAATGGCATT GTCTGATCTT GCATATCCTA AAATCTCAGC AATATCTTTT CCTACAAAAT 600

AAGGTTCGTT TTCAATTTCT ACTGTTCTTA CTGGTAGCTC TTTAAAATTA AATGTTTGTA 660

ATGCTTGCAT TTGAGTATCC TCCTTTTTCC TCAACACCCA CATTCAGCAG ACGGTTATCG 720 

CAATGACTAT CGAATGTATT TAAACGCGGC TCATATCATC GCCAGcTCTC GCTCACATCT 780

GCTCAATGTG GATGTTGATA AGCGTGGTTA TATTAAGAAG TGAATGTTAC TGATTCACTT 840

TCCGCCACTC TGTTAAATCA GTAACTTTGT TATCGCTTTC AACACCGTTA AGCTTGTCTA 900 

ACGCTTTCAC TACTTTTTGG AACTCTTTGA TAGCACTtCG TAGCTTTTTA GTAATTTCAT 960 

CTTCTACCAT TTCCAAACCA GCAAATGCGT CTTCGTTATT CATGCTTAGA TGTTTGTTGA 1020 

AAAGATCTCG AGTGTATCTT ATTTCTTTAA GTGATTTATC ATAAGCTTCA ATTTGTCCTG 1080

AAAGGTTATG ATATTTTAGT TGTAGTTTTA CTAATTTTAA TGATTGGTCT TGCATTTGTT 1140 

ATGTCTCCTT TAAGATGTTT GTTTGCGTTT CGTGTACTTT GTGGGTAAAA AAATATCTCC 1200 

AATATTTTCG TCAAAAAAAT CAGCGATAAT AAACATCTCA TCATTCTTAA ATTGATGCTT 1260 

TCCTAATTCT TTTAAACGAT AACCTTCAGT TGATATATTC AAGAGGTTTG CTAAATCTTC 1320 

TTGAGTACAC TTTCTTTCTT TTCTCAACTT TATTAAATTC CATTGCATGT TGTCACCTCC 1380 

CGCTTTACAA AACCTACTAT ACACGATACG GTACTTGnGT CAACATAAAA GTTTGCTTTT 1440

CGTGTATTT 1449

(2) INFORMATION FOR SEQ ID NO: 593:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1170 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 593:

ACCAAAAACA CTTTCTAAAG AAATTGTTTC AGAACCATTT GCTAAAAACC AATTACGTGA 60

AAAAGCTCGT CTAACTAACA TTAAAGGTTT AGAGATTCCA AGAGTTTCAT ACACTTTAGA 120

CGATGATGAT TTCATTACAG ACGTAGAAAC AGCAAAAGAA TTAAAAGCAA AAGGTGATAC 180

AGTCAAGTTC ACTACTAATA AATTCAAAGT ATTTGCTGCA ATTTCAGATA CTGTAATTCA 240

TGGATCAGAT GTAGATTTAG TAAACTGGGT TGAAAACGCA CTACAATCAG GATTAGCAGC 300

TAAAGAGCGT AAAGATGCCT TAGCAGTAAG TCCTAAATCT GGATTAGAAC ACATGTCATT 360

TTATAATGGA TCTGTTAAAG AAGTTGArGG AGCAGACATG TATGATGCTA TTATTAACGC 420

TTTAGCAGAT TTACATGAAG ATTATCGTGA TAACGCAACA ATTTATATGC GATATGCAGA 480

TTATGTCAAA ATTATTAGTG TTCTTTCAAA TGGAACAACA AATTTCTTTG ACACACCAGC 540

AGAAAAAGTA TTTGGCAAAC CAGTAGTATT TACAGATGCA GCAGTTAAAC CTATTGTGGG 600 

AGATTTCAAT TATTTTGGAA TTAACTATGA TGGAACAACT TATGACACTG ATAAAGATGT 660 

TAAAAAAGGC GAATATTTGT TTGTATTAAC AGCATGGTAT GATCAGCAAC GTACATTAGA 720

CAGTGCATTC AGAATTGCAA AAGCAAAAGA AAATACAGGt CCATTACCCA GCTAAGCCCC 780 

AAAAGGTTAA TGTAACAGCT AAGGCTAAAT CAGCTGTAAT ATCAGCCGAA TAGGGGTGAT 840 


GAAATGAGTT TAGAAGAAAk TAAATTGTGG TTGAGAATTG ACTATAATTT CGAAAATGAT 900 

TTAATTGAAG GTCTCATTCA ATCGGCTAAG TCTGAATTAC TATTAAGTGG GGTnCCAGAT 960 

TATGACAAAG ATGACTTGGA ATACCCGCTT TTTTGTACAG CGATTAAATA TATCATTGCA 1020 


AGAGATTATG AAAGTCGTGG ATACTCAAAT GACCAATCTA GAAGCAAGGT GTTTAATGAA 1080 

AAAGGATTGC AAAAAATGAn TTTGAAATTA AAAAAGTGGT AGGTGATTTT TAAATGGAAT 1140 

TTAATGAATT TAAAGATCGC GCGTATnTTT 1170 

(2) INFORMATION FOR SEQ ID NO: 594:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1393 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 594:

TGAAGGGTGT TTTGTTTATG GCCAATTGCT GTGTTATTAA AnCGAATGTT TCGAATGGAA 60

ATCATTCCGG AAGAGGACAT GTCTTCAGCT TCTGATTTGA AAGCACAGTT TTATCGCGGT 180

ACTTTGCAGC GCTTTTACCA AtCriTTGCAC GCAGAAAAGC TTACACCTTA TGTTATGTCT 240

TATGACGATA TCATTTCATT TTGTAAAGAA AACAATATCT CTGAAGTAGT GACTGCGGGT 300

GATATTATGA GTTATCATCT TGAAGAATAT GATATTTTAC ATCAACGTTC TTTATTCAAT 360

GAAGCACGCA TTGCCGTTAC TTTGATACGT GGGAATCATT ACTTTAAAGC GAGTAAAACA 420

ATGAATCAAC AAGGGGAGCC ATACAATGTT TTTACTAGTT TCTATAAAAA ATGGCGACCT 480

TACTTGAGGC ATAGAGACGT ATATCACTAT GATTTAAAAT CATTCGAAAA CTTTGTCATT 540

GCATCACCTG ATGATTTAGT GTTTGATGAC ATAGCATTTG GATCCTCACA aATAATTGAA 600

CAGatAAATG GCAACATTTT TTAGATCAAG ATATACAGAA TTACGAAAGC GGAAGAGACT 660

ATTTACCTGA AGTATTAACA AGTCAGCTAA GTGTTGCTTT AGCATATGGA TTATTAGATA 720

TTATTGAAAT TTTTAATGAT TTATTGGCGC GTTATGATGA AGATGAGGCA AACTATGAAG 780

CATTTATACG TGAACTCATT TTTAGAGAAT TTTATTATGT GTTAATGACA CAGTATCCTG 840

AAACCTCATA CCAAGCTTTC AAACCTAAAT ATCGACAGAT AAAATGGTCG CAAAATGAAG 900

CGGATTTTAA TGCATGGTGC GAAGGGCAAA CAGGATTTCC AATCATTGAT GCAGCAATAA 960

TGGAATTGAC ACAAACTGGT TTTATGCATA ATCGAATGAG AATGGTTGTG TCGCAATTTT 1020

TAACCAAAGA TTTATTTATA GATTGGACAT GGGGAGAAAA ATTCTTTAGA AAGCACCTTA 1080

TTGACTATGA TGCAGCATCA AATATTCATG GATGGCAATG GTCTGCTTCT ACAGGTACGG 1140

ATGCAGTGCC GTATTTTAGA ATGTTTAATC CAATAAGACA GAGTGAACGC TTTGATGCTA 1200

AAGCTTTGTA TATCAAAACA TATCTTCCGA TTTTTAATCA AATTGATGCA AAATATTTGC 1260

ATGATACACA ACGCAATGAG TCCAACCTTT TTGAACAGGG GATTGAATTA GGTAGTCATT 1320

ATCCAAGACA AATGGTAGAT CATCAAGAAA AACGTACACA AGTTTTAGCT ACATTTAAAG 1380

CGCTAGACTA ATT 1393




(2) INFORMATION FOR SEQ ID NO: 595: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1484 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 595:

TTAACTTGAA AGATTTCCCC GACATCTATA ACTTGTATAA TGTGTATATG TCGTTTTTAG 60

AAAAATTAAA AAAAGAATAT CTTGATTGGT ATAATCAGAC CTTAGAGTTT TCTAATTTAT 180

CAAACAATGT AGTAAGwATA GATACTCCTT TTAAAGATAA TTCTTTAGAT AATTTAATAA 240 

TTTACGCTTT ATACGATCAG TCCAGAGACA TGATTACACT GACAGATGAC GGCTATACTA 300

TATtTGATTT AGAAAATAAT GGTATTTCTT TAAATAAATC AAAAAAACGT AAAAAGATTT 360

TTGAAGAGCA CCTTTCAGCT TACGGTATTA AATATAACGA TAAAACTCAC GAAATTTTTG 420

TTCAAACTAA CTTTAAAAAT TTTAATAAAT CGAAACATAA TTTATTACAG TGCCTTATAT 480

TTGTTAATGA TATGTACTTA CTTTcTAATC CTAAGTCACA GAACATATTT ACAGAAGATG 540

TTGCAAACAA ATTGGATGAA CATAACATTT ATTACGGAAG AGATTTACCT ATTATAGGAA 600

GCAGTGGTGT TGTTCATAAT TTCGACTTTT TTATTAGCGC TAAGAAAAAT CAAAAAGAAA 660

AATTTATCAA TGCTATTTCT AACCCTAATA ATTCTATGAT TATTAAGTCG AAAATAACGG 720

ATGCTATGCA AGCAAAAAAA ATAAAAAGAC ACAGGCAAAA TGAGTTTATT TTTATTTTAA 780

ATGACTCAAA AAAAGAAATA AATGAACATA ATAAAAATCT TCTTCATGAA AACTATATTA 840

GTACAATAGA TTATAGCGAA TTAGATGAAA AGATAGGTTT ATTGATTTAA TATATATAGA 900

CGTGATAATA TCAATGTTTA TATTAAATGA AACGAATTTG AAAATTTCGA AACAGCTTAA 960

GATAGCAAAT TGAATAGCCT TATTGATAAT GCAGAATCGT CTACACTTAG TTGAACAAAT 1020

TCTATGAGAA TAGATATTGT TAAACTATTT GGGTAGGCGA TTTTATTTTG ACAAGAGTCA 1080

GAAGATTATT TAGTTCATAG TTTAAGTTAC AAATGGCTAA ACTTTATGAA AATGGTATGA 1140

CTAGAATCGA AATTATACTC GAATACGATT TAACACTCTC AATCTTCTCG AACTGAATAA 1200

ATACTGGGTC ATTCAATCAT CAAGACAACT TAATAAGTGA TGATAAATGT TCATACCATT 1260

TTGCAAATAT AACCAGATAC TCTCAACAAA ATGTAGAGTT CAAAATTAAA AATGTCCCTA 1320

ATTGGCACAT TGCTGTATGA TAATAAATTC AACTTAGATT TCAACAATAT AACTAATGAT 1380

GCTTATATTT ACCGCGATAT CAAACCAAAC AGCCCATCCA CAAACAAATT AGAATATATC 1440

mTGTTAATAA TAGTAAATGA TATATCTTCC AAAGACCX3CA AACT 1484

(2) INFORMATION FOR SEQ ID NO: 596:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 596:

CTTTATCaTT AAATGCCaTT TTGaATCCTT CTTGAATTTG TTTGATTTCC TTTTCAGTAT 120

TATTAACCTT GTTCAAGACA GTGCCATTAA TTTTCTTCTT GAGTGGACCA ACCTCATCTT 180 

TATTAGTTGA TTCATGAATC GTTAATCCAA TGTGTGGCTG TATTCTATAA CCATCATTCG 240

CTATAGTTGA AACATATTGT GaTAATTGTA ATGGTGTATA GGTATCATAT TGACCAATTG 300

ATAAATCTAG ATAATTACCT GGATTATTTG TTAATGGTTC GATTTGACCT CTTGTTTCAT 360

TTGGTAAATC TATCCCTGTT TTCACACCTA AGCCTACTTG ATTTAATCCT CTTCTTAGCT 420

TTTGGGCAGG TGAACTTATG TCTGAAGGTA AAGCCATACC AGAATAATAA GGGTCTCCCG 480

CTAATTTTAA TGCTGTTTTA AACATATATA CGTTTGATGA ATGCATCAAA GCTTGCTTAT 540

CATTAATAGT TACATGCCCG TTTTTATTGA AGTATGATCG TTTTGTCAAA CCACCTTGGA 600

AATGTAATGG TTCATCGACC ATTGTTTCTC CAACTTTGAT AGCTTTATTC TGATAACCGG 660

CTAATAATGT TCCACCTTTT ACAGAAGATC CAACCGCAAA TTGAGAAGTA AACGTACCAA 720

TGTCATAATC AGTCATTTTA CCACTCTTAT TAATCTGCTT TCCGGCAAGC GCAAGAATGT 780

CTCCATTTTT AGGATTTTGT ACAACCATCA TTGcATTATC CATATCTTTG GCACCTTGAC 840

TGCGAsTTCT TAATTTGTTT ATCTAATAAT GCTTCTACTT CTTTTTGAAG ATCTATATCG 900

ATCGTTAATT TCAAATCTTG ACCGCGAGCG CCAGGrTTTA ACACTTCTGA AGATGTAACT 960 

TTACCAGATT TGTCCGTTGT GTATTTCATT TCTTTCTTCT TACCACGcAA TACATCTTCA 1020 


TATTGATATT CTAGGTAAGA TTTTCCAACA CGATCATTGC GTGAATATCC TTTGGATAAG 1080 

TAATGTTCTG TCAATTCTTT TGGaATACCT TCAGCAGGTG TCGATACATC TCCGAATATA 1140 

CCTCTTAAAG TATCGCCATA TGGATATTTT CTATCCCAAT CCATAGACGT GTTAACACCT 1200 


GGTAATTTGG AAAGTTGCTG AGAAACTGCT GCATACTCTT TTTCACTGAC ATCTTCATTT 1260 

TTTATCATTT GTGGATCTAA AACTGTTCCT GGCATTCATC TCTCGAAAAA TAGCTAAAAC 1320 

TTGGTAAATC TTTAGAAGAC AATTCATCTA ATTGTGGATT TTCCGGATTT CGGATAACAG 1380 


TTGGTnTATC CATAATGGAT CCTGGTTTAA TACCTTCCAn CnGGCGAACA TAGCC 1435 
(2) INFORMATION FOR SEQ ID NO: 597: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 516 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 597:

AGAAAAAGAT TTAGAAAAGA TTACCATTCG CGACATAACA ACACGCGCTG ATATCAATAG 120 

AGGGACATTT TACTTACATT ACGAGGATAA ATATATGTTA CTCGCAGATA TGGAAGATGA 180 

GTATATTTCA GAACTAACGA CATATACTCA ATTTGATTTG TTACGCGGTT CTTCArTTGA 240 

AGACATTGCG AATACTTTTG TAAATAATAT ACTCAAAAAT ATTTTCCAAC ATATTCATGA 300 

TAATTTAGAG TTTTATCATA CTATTTTACA ATTGGAACGC ACGAGTCAAT TAGAATTGAA 360 

AATCAACGAA CATATTAAAA ATAATATGCA ACGTTACATT AGTATCAATC ATTCTATCGG 420 

AGGCGTTCCA GAGaTGTATT TTTACAGTTA CGTTTCCGGA GCAACAATTT CaATTAnnTA 480 

AATACTGGGG TAATGGACAA ACAGCCCATT TTCAGT 516 
(2) INFORMATION FOR SEQ ID NO: 598: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1955 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 598:

AGTACTTTAC CATGAGGTAA GACTTTATTC CCTGCTTTTA TAGATTGTGC TTTTATTTGT 60 

GTGCCAGAAC CAATAGtAct GGTcTAATGA TTTTGCACTC ACATTATCAA TAGCTTTTTG 120 

TTTGTCTTGA CCTTCAACAT CTGGCACTTT ACTATACTCT GCATTAGATG TGTCATCTTT 180 

TGATTTACCT ACATTTAAAT ATTTCAAAGT ATTTTCCATT ATTGGTTTAA ACGCTTTACT 240

AACACCTAAT TCATAAGCTT CTTGGTCATT TTTTTGTGCC AAGCTCATAC CAGCGTATAC 300 

AATAACTTTA GGATTTTTCT TCGGCGCGTC ACCCATAAAA CTTACAAAAT ATGGGTTTGG 360 

ACCTTTAACG TATCCACCAC CATTAGGTGC AGCGACTTGT GCTGTACCAG TCTTACCTTC 420 

GACCTCATAA CCATCAATAC GATAGTTTGC AGCGTGACTC TTCTTACTAT TCACAACTAA 480

ATCCAATTGC TTTTCAACTT TTTCAGCAGT ATCTTTTGTT ATTGGTTTGC CTGCGATTTG 540

TTTTTGCCCT TTATAAAATT GTCTTTTACT AACAGGATTT TCAACGCTAT TCACAAACCA 600

TGGTTTTAAC ATATTACCAT CATTAAAGAA CGCTGATTGC GCTTGTAACA TTTGAACAGG 660

TGTTACTGTT GTCGATTGAC CAAATGATGA CGTTTTTTGt TGCAACTCAT TACTCGATCC 720

AATTTGaCCA GGTGCTTCTC CATCAAACAT ACCTTTAGTT GaTTTTCCaA ATCCaAATCG 780

TTCATACCAA GATTTCATTT TGTCTGCACC AACTAAATCT TGTAAATGCA TCATCAATGT 840

ATTAGATGAA TAAGTAAATC CGAGTGACAT TGGGATTTCA CCCCAACCGA CTCTATTCCA 900



AAAAGCACCT TCTTGAATAG CAGCTGCTAA CCCATATGAT TTAAATGTTG ATCCAGGCTC 1020 

GTATGTGTTT TGATAAAGGT CATTTGCCCA CTTTTTACCA AAGTCTTTAC CAGTTTCAGG 1080 

ATTAAATGTT GGTCGCTGAC TGTATGcTAA AATTTCTCCA GTTTTGGCAT CCATGACAAC 1140 

CGCAAATAAA TCTTTCGGCT GGTATCTTTC AACCATGCCA TCTAAAGCTT CTTCAACAAA 1200 

TACTTGAATA TTTGAATCGA TTGTTAAATG GACATCATCA CCACGTTTAG GCTGCTTCTC 1260 

TTTTTTAGTA TTTGGTGCGA TATATCCCCA AATATCATGA ATATATCTCA ATGATCCTTT 1320 
AGATCCACTT AAATAACTAT CAAAAATCTT TTCAACTCCA AGTGCACCTT TAAGTTCACC 1380

AGTATCCGGA TTTTTCTGAG CTCTACCAAT TAAGTGTGAT GCAAAATTGC CATTTGGATA 1440 

AAAGCGTTCT GTTTCAGGCA ATAAAGAAAT ACCAGGCAAA TTCATTTTCT CTATTTTCAA 1500 

TTTGTCCTGA TACGTTAAAT TTGTTCCTTT GCGTCCAAAT TCAATTTGGA AAGCTTTCTT 1560 

TTGACTAAGT CTCTTTTCAA TTTCCTCTGG CTTCATATTA ATGACTGTAG ATAATTTCTT 1620

TGCAGTCTCT TTTTTATCAA CTACATGCCT AGGTTTTTTA GAATTGGCAC TCGCCTTTTT 1680 

ATCTATTACT GCAACAAGTT TATATCTTTC TACATCTTCT GCTAGCACTT TACCATkACG 1740 

ATCATATATC TTTCCTCGTT CTGGTTGTTG TGCATTCTTA ACTAAATACT TTTCATTTGC 1800

CTTCATGACT AAATCTTGAC CATTAGAATG TCCAGTAATC ATGATATATG AAATTCTTAA 1860 

AACCAATATA AAAAAGAGCA GTCCGAATAA ACCAACAAGT AGGACTGCCC CTATTTTATT 1920 

TTTTTTAATT TTAATTTTTT TGGTCGCCAT TACTA 1955 
(2) INFORMATION FOR SEQ ID NO: 599: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 704 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 599:

TAGCAAGAGT AGGTGATAAC AGTTCTTTTA ACGTGTTTGC TGCTTGCTCA TTTCTTGGCT 60 

CATTGTAGAC TAAAATATAA TAAGCGTTTT CAAATAAGTG CTTTTTAGCA TTTAGTACAC 120

CAGATTTATG ACTACCAGCC ATTGGATGAC CACTGACTAA ATGAATATTA TGCTTTAATA 180 

AATTGCATTC GTGTTGCTGT ATCATTGCTT TAGTACTACC AGTATCAGAA ACAATAACAC 240 

CAGGTTTAGT TGGCATATCT ATAAGCTCGC TAAGATATTT ATTTGTGATA GCAACAGGTG 300

TTGCATAAAT AATTACATCG GCTTTTTTAA TAGCTTCACT ATAATTTAAA CATTTTTCAT 360

TATTAGTATT AGGGTTATGG TATTTTATAT TGCTAGCAAG ACTTCCACCA ATTAATCCAA
GCCCAACAAA TAAAACTGTT GTCATATAAA TCACCTTATT TCGAAATTTT CAGAATAATA 
ACATTGTAAA TGAGCTGTTG ACACAGTGCA ATAGTAAATA AAAATCGATA ATAGCATTAA 
TAGATAAACG GAGATAAATC ATCTACAATA AAGAGTATAG TAACACAATG GCAACGGAGG 
GGTAAATCAA TGGAACCAAT ATTAGAAATG ATtAAAACAT TAAC 
(2) INFORMATION FOR SEQ ID NO: 600: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1156 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 600:
CCTCTAAAGG ATCACGAACC TTTTTCATCG TGACTAAGCC TATAATACCC TTAAAcCTAT
TATTATTAAC TTTTACCTCT GTGTATTCCC TATCAATTAG GCGACGCCAG TGACGTTTAT 
CTATATACTT TACTTTCACA GTCACCAACT CCTTGTCATT ATTATATAAA ACTACATAAA 
TGATGTCATG TCATGATACA GTTTTATAGT TTTTGTTAAT CATATGAAAA TATTTATTAT 
TATTTTACTA CAACTCGCTT CAATTTACTT AAAATAGACA ATATTAATTA GATAGTACAC 
ACATTTCTTC ATAAAAGTGA TTTTTCAAAA ATATAAATAA CACACTCTTA TCGTTTTCAA 
AATCATTTAA TGCTATTTTC ATTAAAAACA GCTGAAGCAT CAAATCTATT CTGATTCAAT 
CAAGAATACA TATAAAAATG AAGTGACTCA AAGGTTTATT AACAACCTTC AAACCACTTC 
ATTGATCACT TTTATTTTAA AGCATATTTT TCGATTACTG ATTTAAGATG CGGATATTGC 
GTAATTAATT CAGATTGCTT AAACAATTCA AACTGCTTAA ACTCAAATGC CGGTTGACAC 
ATACAACCTA CTAAACTAAA AGTATTTGAT ATTTCGATTG AAGAAGCAAA AATTGTTCCT 
TTAGGCACTA CATATTGCAA TACATCTCCA TTTTGGATAT CAGTACCCAA TGTTGCAGTC 
GTATATTCCC CATCCGGATT TATCATATGA ATTGTTAGAG AATCGCCAGC ATGATAGTAC 
CATACTTCAT CAGCATCAAT TCGATGAAAA TGCGAAATAT TGTCATCTGT AAGTAAAAAA 
TAAATACTAC TAAACGGCGC TCTGCGTCCA TCTTTCAATA CTTCTCGAAT TGTCTCTCTA 
TAGAAACCAC CTTCAGGATG TGATTCAAGT TGCAATTcAT CaATCCATTG TTCTGCTGAT 
TTCATTATTT CAAATCCACA TTATGGAATA CGTTTTGTAC ATCTTCTAAA TCTTCTAATG 
CATCGATTAA TTTTTCAAAT GTTACTTGGT CCGCTTCAGA AAGTTCAATA TCTGTTTGAG
GAACGACTGC AAATTGAT

(2) INFORMATION FOR SEQ ID NO: 601: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 601: 



TGATACTTAT TTTTCAATAT TATTTGGAAT AAAnTCTTnT AATTGTGGTG TGAAATAAAC
ACCGAAATTC TTTTCAGATT CATCTTTTcC ACTACCTGCA TATAAAATAC CAATCAGTyC
ATGTTTAGAA TTCAATACAG GAGATCCTGA ATTACCTGGT TGTGCATACG CATCAAATTC
CATAAACGTT CCACTGATAT GGTTAATCGT TCCTGTCGAT TCAAACATTT TATATTTTGT
TTGTGCACCC TTTGGATAAC CAATAACAGA AATTCTATCT TTCACTTTTG CTCCGTCTGC
AAATTTTGTA TAACTAACGT TCTTATTAAA ATTCAAACCT TCTGTACTTG TTTCATGAAC
ATGAACTATC GCAAGGTCTT CTTTTCCGGG ATATTCTACA ATGTCTTTAA CGTCGTAGTT
TCCTCCGCCT TTACCTTTAC TCGAATGATG TGCTGATACT CTATTTTTAA AAATATCATT
ACTTTTAGCG ATATGTTTGT TAGTTACGAT TGTATTTTTA CCAACAACTA CACCAGTACC
ACCCACAAAT GCTACCACTG aATTGTATGG TTCCTTAGTT GCATCGGTAA TTTCTTTGAC
ATTCTTTTCT GCTTTGGCAA TTGAATGArG CTGATTAGAA ATATTTTCAG CAAAACCAAG
AGATGTTAAA ATAGTTAAAG CAGTTAAACC TTTAACCATT ACATTTTTAT TCATATTTTA
CCTCCTTCCA AATTTATTTT nATAGTTAAT GTATCTAGAT ACACATTTAC TATATACATT
TTAAATAAAA ATATTnATTA AATATTAATT AACCACTTAT nAACGTT






(2) INFORMATION FOR SEQ ID NO: 602: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 796 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 602:
nGGnACGTAA AGTATTTATG TTAAATGGCA nAAATATATG AATGATGAGT TACCTGGATT
ACCAATGTTC CAAGGTAAAT CGATAACTAT TGTTAACGAT AAAGTACGAA ACTTAGACAT
CGACGATTTT GTCGAGGTTT ACTTATATGG GGGGATGTTA TGAATAATGT ATTGTTAGAG 240

GTTAAAGATT TAGAAACATC ATTAAAAATA AATAATGAAT GGTTAGCAAC TGTTGAAAAT 300

ATTTCTTTTG AATTATCTAA AGGAGAAGTT TTGGGTATAG TAGGGGAATC TGGTTGCGGT 360

AAGTCCATAT TAAGTAAGTC AATTATTAAA TTATTACCAG AAAAGATATC TAAACTAAGT 420

AATGGAGAAG TTATATTTGA TGGTAAACGA ATCGATACGC TCAATGAGAA GCAATTGCTA 480

GATATTCGAG GAAATGATAT TGCTATGATT TTTCAAGAAC CTATGACTGC TTTAAATCCT 540 

GTATTTACCA TAAAAAATCA ACTTGTGGAA TCTATAAAAT CACATAAAAA AATTTCTAAA 600 

AAAGAAGCAA ATAAATTAGC AAAAGATTTA CTAAAAAAAG TTGGAATTGC TAGACAAGAT 660

GAAATATTAA ATAGCTATCC TCATCAATTA TCTGGTGGTA TGAGACAAAG AGTAATGATT 720 

GCAATGGCCA TTTCATGTTC TCcTAAATTA TTAATTGCTG aTGAACCTAC AACAGCATTG 780 


GATGTCACGA TTCAAG 796 
(2) INFORMATION FOR SEQ ID NO: 603: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1618 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 603:

GGATGTGTTC ATCCCAAATT GaAAcAgAaA TTTATGATCA AAtGATGTGG ATTGGTTTTA 60

AaCCATTCAa AATTaCCAAT ATTAmaCAAG AATCTGAAGA CATTAAATCA TTTtACAGTT 120

GAAACTGAAG AATATGACTT TAGTGAATTT ACACCAGGCC AATACATCAC AGTTGATGTT 180

TCTAGTGATA AACTTCCATA TAGAGCTAAA CGTCACTATT CTATCGTATC AGGTGAAAAA 240

AACCATTTAA CTTTl'GGCGT TAAACGTGAT GTCACAACAG AACATGAAGG CGAAGTTTCA 300

ACAATTTTAC ACGATGAGAT TAAAGAAGGC GATATGATTA ATTTASTGCG CCTGTAGGTG 360

GATTCGTATT AGAGAATACG ACTGAACCAC AACTTTTCTT AGGTTCAGGT ATTGGTGTTA 420

CACCTTTAGT AGCTATGTAT GAAGCTGCCT CTGCCAAAGG TTTAGATACA CAGATGGTTC 480

AAGTTGCTGA AAATGAACAA CATTTACCTT TCAAAGACAA CTTCAACAGT ATCGCAAGCC 540

ATCATGACAA CGCTAAATTA TATACACACT TAnAgATAAA CAAGGCTATA TTGGTGCTGA 600

AGAATTACAA GTATTTTTAG CAAATAAACC TGAAATTTAT ATCTGTGGTG GTACAAAATT 660

CTTACAaTCT ATGATCGAAG CACTTAAATC TTTAAATTAC GATATGGATC GCGTACACTA 720

ATATGAAACT TAATCAACGT TACGTAAAAG TATTTGCATT ATATTTCGTA AGTATTGTTA 840

CTGCAAATAT TATTGTTAAA AATAATAATT TAATTAAAAC AtTGATACAA ACCATAGCCG 900

GGTACACGGT CTTTGCAGTT GGTTTGAAGT ATTTAACTAA ACGTAAAAAT AAATGACATC 960

TATACCAAAA CAGCTATGAC TTTACTTTGT CATGGCTGTT TTTTAATTGG GAGTAGGACA 1020

GAGATGATAT TTTCGCAAAA TTTATTTGGT CGTCCCACCA CAACATGCAT TGATGTATGC 1080

TCACTGAATT TCATAAGAAA GGAGTTCACA AGATGACCGT AGATATTGGA CGGATTTATG 1140

ACAATAAAGA TAATACCGAC GCTATTCGTA TCCTAGTCGA TAGAGTCTGG CCGAGAGGTA 1200

TTTCGAAAAG AACTGCTAAC CTAGATTATT GGTTAAAAGA CATTGCCCCT TCTACTGAGT 1260

TGCGACAATG GTTCCAACAT GATCCTAAAC TTTTTGGAGC TTTTAAAGAA AAATATGAAA 1320

AAGAATTACG TGATCAGGAT GCGCAAAAAG ATGCTTTTGA AAAATTAAAG GATATTGTAA 1380

ATCAGCATAA TCATGTTCTA TTGTTATATG CAGCAAAAGA TACTAAACAT AACCAAGCTG 1440

TAGTACTACA GCAGTTGCTC AATACTTAGT TATATATGTT TATCGTGAAT CACTATAAAT 1500

GTCGCTAACT TCATTTCTAA TATTTATTTT TAAAAAGCAT AATTACTACA ATTAATTGGA 1560

ACTTTAATAA TTACTAAACT TGGAACAATA TTTTACTTTA ACAAAATAAA GTTTTAAA 1618




(2) INFORMATION FOR SEQ ID NO: 604: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2115 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 604:

ATTATAATGC AAATATTGAA GCATTTAAAA CAGTCGCTAA AGCAGTAGGC AAAGAGAAAG 60

AAGGCGAGAA GCGTCTGGAA AAGCATGATA AAATATTAGC GGAGATTAGA AAGAAAATTG 120

AACAGAGTAC GTTAAAATCT GCATTTGCAT TCGGTATCTC AAGAGCAGGT ATGTTTATTA 180

ATAATGAAGA TACATTTATG GGACAATTCT TAATTAAAAT GGGTATTCAA CCTGAaGtcA 240

tnAAAArAmAA AACTACGCAT GTTGGTGAAC GCAAGGGTGG TCCTTATATA TATTTAAATA 300

ATGAAGAACT TGCCAATATC AATCCAAAAG TTATGATTTT AGCCACTGAC GGAAAAACGG 360

ACAAAAATAG AACGAAATTC ATTGATCCTG CAGTTTGGAA ATCATTAAAA GCTGTGAAAG 420

ATAACAAAGT TTATGACGTT GACCGAAATA AGTGGTTGAA ATCAAGGGGG ATTATCGCAA 480

GTGAAAGTAT GGCAGAAGAT TTAGAAAAAA TTGCAGAAAA AGCAAAATAA AAATACAGCG 540

TGTGATGATT TACCTGAAAG TTTATTTCGA ATAAATTTAA TTACATAACC GACAAGGATT 660

GTTTTAACAG TTCTTTTAAT GAATTGGCGC ATCGTTACAT ACCTCATTTC TCTATATCTT 720 

ACGAACTATA TACCCATTCA TATATGCTTT TTAAACGTCA TTGTCACAAT TTAATTTTTA 780 

QGGAATATAA TATAACCATC TTTATCTGCT TTTTTAGTAA AAATGACAAA AATTGCATGT 840 

ATTATTGaGA TGATGGTAGG GATACCTGTC CAGAAAAATA ATAAGTGAAA AAGACCTTGT 900 

CCAAATTTAT CAGCATAAAA TTTATGaATA CCTAAACCTC CAAGAAATAA TGCAACmATa 960 

ACATAAATGG CTTTATTGAC TTTCATTTGT AAtCCTCCTT AACTATAATT CTACTTAAAT 1020 

TCGTTGTGAA AACCAATATT TCTAACTTTA GAATTTTCAA ACTTTCTAAA ATTATAAGTA 1080

TATCTTTTTA AAATAAGCTA GAATTTCTAT ATAATAAATG TTAATAACGT AAAAGGGAAT 1140

GATGACATAG TGATACGTCA AGCACGTCCA GAGGACCGAT TTGATATTGC GAAGTTAGTT 1200

TATATGGTTT GGGATGATAT GGAATTAGAA TTGGTAAAGC ATCTACCTAA AGACATGGTA 1260

TTAGATGCAA TTGAAAAAAG CTGTGTTGAT GCAACATATC GAACTTTTTA TCAGCATATT 1320

TTAGTTTATG AAGTAGAAAA TAAAGTAGCA GGTTGTATTA TTAGCTATAG TGGTGAAAAT 1380

GAATTGAAAT ACGAAAAAGC ATGGGAACTA CTTGACTTGC CAGAAAAAAT AAAACAATAT 1440

GGCACGCCAT TACCTGTAAA AGAAGCTAAA GACGATGAGT ATTATATAGA AACAATTGCG 1500

ACATTTGCAG CATATAGAGG TAGAGGCATC GCGACAAAGT TATTAACGTC ATTACTTGAA 1560

TCAAATACAC ATGTTAAATG GAGTTTGAAT TGCGATATTA ATAATGAAGC AGCATTAAAG 1620

TTATATAAAA AAGTAGGCTT TATATCTGAT GGACAGATTG AATTATACAA GCACATGTAT 1680

CATCATTTAA TTGTTAAATA AAATACTCGA CAGTTCGATG TAAGTCGATT GCCGAGTAGT 1740

ATCATTTTCT ATTAAATGCC TGCAAATAAT GCACTAATAT AAATACCTAA TGCATATAAT 1800 

AAACCGAAAA ATGTATTTGT TTTACCAGCA GCAGCCATTG CTGGCATCAT TGTAGGCGGT 1860 

GTATCATTCT TCTTGAAACG TCTGATAACT TTAACAGGCA TTGGGAATGA TAACAACGCA 1920 

AGTAAGTAAA ATAATGAGCC ACCAGGTTTA ATAATGATCG TAAGTACAAT AAAGGCATAA 1980 

GCGATAAAGT ACATGATTGC CATAAATGTT AAAGAAGCAT TTTTACCTAA TAGAATGGGT 2040

AAAGTTTTGC GACCACTTGC TTTATCTTTG ACACGGTCGC GAATATTGTT AGCCATATTA 2100 

ATTAAACCGA TAGTG 2115 
(2) INFORMATION FOR SEQ ID NO: 605: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1003 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 605: 

TATGTCTCAA ACTCAAATTA ATCAAATGTT CCAGCAGAAA AACATGTCTA CTGAATTAAA 60

ACGTCGTTAT GCACAACGTT TATTACAGTT TCCACATGTA CACAATAAAG AATACTTGAA 120 

ATCTTATGCT AAAAACCCTA AAGAAACTAA AGATAGTTAT ATTTCTGGTT TTAAAGAGAA 180 

TCAATTGATT AAAATAGAAG CGATTAAATC ATTGTTTGCA ATGGATAAAT CTCCATTAGA 240

ACATGTTAAA CCTGCTACAA AACCAGACGC TTCTTGGGAT GAGATGAAAC AAAAAGCAGT 300 

TGAAATTGGT AAAGCTGATA CTACATCGAA TAAATTTGGT ATTAGAGATC AATACTGGAA 360 

ATTAATTCAA GAAAGTAAGC GTAAAGTTAG ACGTGACTAC GAATTCAATG TTAATTCTCC 420 

AGAATTCCAA GATTTAGAAT TACTTGTAAA AACAATGCGT GCTGCTGGTG CAGATGTTCA 480

ATATGTAAGT ATTCCATCAA ACGGTGTATG GTATGACCAC ATTGGTATCG ATAAAGAACG 540

TCGTCAAGCA GTTTATAAAA AAATCCATTC TACTGTTGTA GATAATGGTG GTAAAATTTA 600

CGATATGACT GATAAAGATT ATGAAAAATA TGTTATCAGT GATGCCGTAC ACATCGGTTG 660

GAAaGGTtGG GTTTATATGG ATGAGCmAAT TGCGAAACAT ATGAAAGGTG AACCACAACC 720

TGAAGTAGAT AAACCTAAAA nTTAAAATAC AAATAGCACA TAACTCAACG ATTTTGATTG 780

AGCGTATGTG CTATTTTTAT ATTTTAAATT TCATAGAATA GAATAGTAAT ATGTGCTTGG 840

ATATGTGGCA ATAATAAAAT AATTAATCAG ATAAaTAGTA TAAAATAACT TTCCCATCAG 900

TCCAATTTGA CAGCGAAAAA AGACAGGTAA TAACTGATTA TAAATAATTC AGTATTCCTG 960

TCTTTGTTGT TATTCATAAT ATGTTCTGTT AACTTAATAT CTT 1003

(2) INFORMATION FOR SEQ ID NO: 606:




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1097 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 606:

GCTnTTATAT TTAAAAAATT TATTnGCGGA TAATTACCTT TATTAAATCC CACCCATTAG 60

GAwTaCGAGT AATAGGACTA ATACTGGAAT CACATAATGT AACATAACGT CCCTCCTTTA 120

ACTTAATTTT AATTGTAATC AAATTTGACA ATAAGTCAAA mCATTAATAC CTATGATArG 180

TATCATTTAT TAACATATGT ATCATATTTT TAATCTTGCG TAATTTTTAT CGTTAACTAT 240

CTTTTGTCAT TTTAATTATG TTAAGATAAT AGTAGTTATA GAAGTTCAAT CTATAGGAGG 360

CATAGCATGG ATATTCCAAA AATCACGACA TTTTTAATGT TTAATAACCA AGCTGAAGAA 420 


GCTGTTAAAC TATACACAAG CTTATTTGAA GATAGTGAGA TTATAACAAT GGCTAAGTAT 480 

GGTGAAAATG GACCTGGTGA TCCCGGGACT GTACAACACT CAATATTTAC ATTAAATGGA 540 

CAAGTATTCA TGGCGATTGA TGCTAATAGT GGCACAGAAT TACCAATGAA TCCTGCGATT 600

TCATTATTTG TTACAGTAAA AGATACTATT GAAATGGAAC GACTATTTAA TGGATTAAAA 660 

GATGAAGGTG CCATTTTAAT GCCAAAAACG AATATGCCAC CATACAGAGA GTTTGCTTGG 720 

GTTCAAGATA AGTTTGGAGT AAGTTTTCAA TTAGCATTAC CTGaGTAAAA GGATTCGCAC 780

AGCtTGgAAT GATAAAGAAA CACTTTTTCT TATGCATGCG TTTTACCTAT GTATTTCTAT 840

TTTAAGTACA CATTAGCATT TTAGTTTCGC TCATTTTTAA AATCACAAAG ATAATTTGTA 900 


ACTTAAATGA TGATACGTTA TTTAAAAACA CGATACTTCG TTTCAATGAA CGCATTAAAT 960 

AATAAATAAA CACCTCACCA TAAGAAAGGA CTACTTTCTT TTGTGAGGTG TTTatTTGTC 1020 

GACAACTTGA TTATGATTGT TTCATTTTTT GAATAAGTTC ATAATCAGGT GTAGCATACA 1080

AAGTTTTTTG ATTGTCA 1097

(2) INFORMATION FOR SEQ ID NO: 607:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2031 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 607: 

GAGGTCTAGC ATGTCTCGTT CAAAAAAATA CTTTTACTTA TCTAGCTTAA TGATTATTTT 60 

AAGCTTTTTC TTTAATACAA ATAACGTTTT CCTAAGTGGA CTTTTTAATT CTTTTATTAA 120

ATTAATACTT TTCTGCAGTG TTATTAACTC AATTGTACTA ATTTTGTCTA TAATTTTTGC 180

AGATCGTTCA ATTAAATCAC TAAAGCCTGA TGCAGATTGG ATTAGAATTG CGAGTAAAAG 240

TTTGCCTTGG aTTATTCTAA TTGTTATTTT AGTACATATC TTTTCAATTG TTCGTACATT 300

CGGTTTTATT TAAAAAAGTT AATATGTCAT TGTAGCCTTA ATACAAAACA ATACAATGTA 360

TCATGCTATA ATGAGTAAAA CAATTTGATA ACGTTGTTGC GTATAAAAAA TATTAGATTT 420

TCGAAATCAT AACTATGCAT CTAATCGCTA TAGTTATACA ACAAGATATA ACATATAATG 480

AGGTTTGATA ATGCATCGAC AATTTTTGTC GTCGCGTTGC CAAAACCTCT TTTTTAAATT 540



TTTAGGCGCT TTATTACGTT ACCTCATTTC TTTTCTGAAT ACTGACGGAG GTTTTCCAAT 660 

CGGAACACTG ATAGCCAATT TGACTGGTGC CTTTGTAATG GGATTGCTAA CAGCCTTAAC 720 

AATTGCATTT TTTTCAAACC ATCCGACCCT AAAAAAAGCT ATTACGACTG GTTTTCTTGG 780 

TGCTTTAACG ACTTTTTCAA CATTTCAATT AGAATTAATA CATATGTTTG ATCATCAACA 840 

ATTTATAACT TTACTACTAT ATGCTGTAAC AAGTTATGTC TTTGGTATTT TGTTATGTTA 900 

CGTCGGTATA AAACTAGGTG GTGGTTTATC ATGATATCAA TCATTTTAGT CATGATTGGC 960 

GGCGGTTTTG GCGCAATTGC TAGAAGTGCC ATTACTGATT ATTTTAATCA TAAATTTACT 1020 

TCAAAGTTAC CTATCGCAAC ATTGATAGTA AATCTAGTTG GTAGTTTTTT AATTGGATTA 1080

ACTATAGGCT TATCAATTTC AATCTCATGG TTCCCTGCGT TCTTTGTTAC CGGTTTTTTA 1140 

GGTGGCTTAA CAACTTTCTC AACGTTAGCC AAAGAACTTA CACTAATGAT GACGCCAAAA 1200 

TTTAATATTA ACCTTTTTCT CAATTATTCA CTTTTACAAT TCATCATTGG ATTTATAGCT 1260 

TGTTATATTG GGTATCATAT TTAAAAATAA AATGCTTCAT TCAGCAAATA GGTAAATTAC 1320 

GACACCTTCC TGAACGAAGC ATTTTTTAAT TTTCATGCAA ATTTTTAAGC ACCATATAAT 1380 

GCCTACCAAA TTTCAATAAT CTTTGTTGCC GTTTAAATAA TGTGAATGTC AATAAATTCT 1440 

CCAAACTAGT CGAAAATAAA GGGAGTGGGA CATAAATCCC TAAAAAAACA GCAGTAAGAT 1500 

AATTTTCAAT TAGAAAATAT CTTACTGCTG TTCTGTATTT ATACAATACT TCGTATTGAa 1560

TGGcTTCGCT TTCCTAGGGT GCCGTCTCAG CCTCGGcTTC GACTGGCACT GCTCCCTCAG 1620 

GAGTCTCGCC ATTAATAcTA CGTATTAACA TGTAATTTTA CTTTTACATA CTTTAAAAAA 1680 

TAAGACACTT TGCCCAACTT ACACTACCAA TAGAAACCTC TGTTAGAATT CCTCAAAATG 1740

ATATTTCGCG ATATGTTAAT GAAATTGTTG AAACGATACC TGATAGCGAA TTCGATGAAT 1800

TCAGACATCA TCGTGGCGCA ACATCCTATC ATCCAAAAAT GATGTTAAAA ATCATCTTAT 1860

ATGCATATAC TCAATCTGTA TTTTCTGGTC GAAGAATAGA GAAATTACTT CATGACAGTA 1920

TTCGAATGAT GTGGTTAGCT CAAAATCAAA CACCTTCTTA TAAAACTATT AATCGTTTTA 1980

GAGTGAATCC TAATACTGAT GCGTTAATTG AATCTTTATT TATTCAGTTC C 2031
(2) INFORMATION FOR SEQ ID NO: 608: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 687 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

TCCCCGGGAA TCGAACCCGC GAnCTCCTGC GTGACATGCA GGCGTGTTAA CCGCTACACT 60

ACGAGACCTA TTAnATTAAA AACTATGTAT TGCGGGAGGC GGATTTGAAC CACCGACCTT 120 

CGGGTTATGA GCCCGACGAG CTACCGAACT GCTCCATCCC GCGCTAATAT TATTTTGAAT 180 

TACCTAATTA ATATACCATA ATCAAAAACC TAAAGTsrrG AACTTTTTGA ATTTAATTTA 240 

AATGTTATCT CTTAAATAAT TACTTAAATA TCGTAGCAAC ATGTTCTCTG TTGAACACAA 300 

ATATTAGTAT ATTCATTTTT GTAGTGTACG TCAACGACAT TTTCAAAGTT TTTTGTGTAA 360

AAAAACGCTT CTTATTCCCT TTTATCATAT AAGTGTCTAA TAGTTGTCAT AAATAGTGkT 420

AAAGCATTTA AAAAGGTATA GGAGTTATAA AGTTTACAAC GCCTATACCT TCTGAAAAAG 480

AAATTATAAA AGCTTGTTAC ACCGCATATT CTTTCAGTCA GCGACTACCA ATATAACATT 540 

GTAGCCCtAA GACATTGCTT GACGCCTCAn TTACAACAAT TTTTCAAAAT CAGCAGCTAC 600 


CTACTGACAC AACATAAGAC AACCCnTACA CTAnCTATCG TGTCATGTAA TCTTGCATCC 660 

GATCTTGCAA CGCTGTAAAT GTTTCGA 687 

(2) INFORMATION FOR SEQ ID NO: 609: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 843 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 609:

GTGTTGAATA CTTTGACAAA ATGAAATCAT TTGTTCTTTC GTTTCAAATT TAACTGTTTG 60

AATTAAATCA GTTCGTTTTT CGTAGTACTT CGGCGTTGTG TTCATATTCA TTTTTTCTAA 120

AAATAAACTA GTAAACAATG CACCTTTAAG ACTCTGACTG ACAACGTGTG GTGCTAAAAA 180

GAAACCTTGA TACATTTCAA GCAATGCATT TAATGATGCA CCCGCTTC1T TACCAATACC 240

AGGTGCTGTC AATCTATAAC CACATCGTTC AATTAAATCT TTTCTACCAG CAATGTATCC 300

ACCAATCTTA GCTAAACCAC CGCCAGGGTT TTTAATTAAT GATCCTGCTA TTAAATCGGC 360

ACCACATTCT ATAGGTTCAC GTCTTTCAAC AAATTCCCCA TAACAGTTAT CCACAAATAT 420

TAAAATATTA GGATGCACGT TTTTCAACCT AGTAATTACC TTTTCAATTT CATCTAGCGG 480

AATTGAAGGT CTTTGATCAT AGCCTTTCGA ACGTTGAATC GCTATTACTT TGGTGCGCTC 540

AGAAACCCCa TCTAACACAC TTTCAATATC GATCTTACCT TCTTTAAGTG CAATATCTTT 600

ATACGATmCG CCATGCTCCA TTAAACTTTC AATACCATTT CCGTTTACGC CAATGACTTC 660

ACTTTGTAAT GCAATAGTAA TCGCATGCGT ACCTGAAATA ATTTGCGGAC GAACAATTGC 780

ATCTTCTGCT TTAAATGCCT GCGCATATAT TTCTTAATTG AACGAATTTA GAACTAATCG 840

TAA 843




(2) INFORMATION FOR SEQ ID NO: 610: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 775 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 610:

GATTATTATA AACCGAATGA AACAAAAAAG TTTTTACATA TTTTCAACAA CTTTATTTCA 60

ATAAAGCATC AAGTTGATCT AAAGTTTGAT TCATACCTTG TTCAACACCC ATGTTTATGA 120

CTTGTTGAGC GGCTTCTTTT GTTGGAAATA CCGATGTTGA TGTCACTGTC GTTTTCGTTG 180

TATTGCTCTT GGAAAATGAC AAAGTAATTT TCATGCTTGG CATTTTTGTA TCTTTTGCAC 240

CTTGAGGTGT CGCAAATGAG TCAATATATT CAATTAAATA TGGACGCTTA ACCGTTTTAT 300

ATTCTGCTAA TGTATAGCTG GTCATTGTAG GTGTTTTAAT CGCATAAAAT GCATCACCGC 360

CTGAAACAGC ATTAAAGCGA AACACTTTGG TACTAGCGTC TTTTGGAtGA AACCATTTTT 420

CAAATAATGC TTTkGTCGTA TAGGCATCAA ACACCTTTTC AATCGGTGCT TCTATCGTTC 480

TTGAAAAAAT AATTTTATTA TCTTCAACTT TAATCGTCAT CTTCCCACTC CCTATCTTTT 540

ATATTTACTC ATCTTAACAT GCATTGAGTA ACATATGTTA CCAAATCATT TGTTATAAAC 600

TATAGTAATA TTAATCATTC CCTCTGTnGA AATTGATAAC TACGAAATAA AAAACACACT 660

CTATTCAGCT ATTAACAGTT GAGTAGAGTG TGCAAACCTG TTGAATTAAT GCGCTAACAT 720

TTCTTCTTTG ATTTGGTCTT nATnTAATTT TGAAGGATAA TATGTTGGCC AGTTA 775



(2) INFORMATION FOR SEQ ID NO: 611: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 728 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 611: 
ATCAAAATTT GATACTAAAG GGCTCACGCT ATTTTATGTA TTTATTGGAT TGATTATGTT 60

TGTGGCAATG TGTTTATTTA AAGTAGAAAA ACATGTTTCT TCACCATTTT TACCTGTGGT 180

TGAATTTAAT CGTTCGATTA CTTTAGTTTT TATAACTGAC CTTCTAACAG CTATTTGTTT 240 

AATGGGATTC AATTTATATA TTCCAGTCTA CCTTCAAGAA CAACTAGGAT TATCTCCATT 300 

GCAAAGTGGA TTGGTTATTT TTCCTTTATC TGTAGCTTGG ATTACATTGA ATTTTAATTT 360 

ACaTCGAATT GAAGCAAAAC TATCAAGGAA AGTTATTTAC TTACTATCAT TTACATTGCT 420 

ACTAGTAAGT AGTATTATCA TTTCATTTGG TATTAAATTG CCGGTACTTA TAGCATTTGT 480 

GTTAATTTTG GCAGGATTAA GTTTTGGATA TATTTATACG AAAGATAGTG TGATTGTCCA 540 

AGAGGAAACT AGCCCATTAC AAATGAAGAA AATGATGTCA TTTTATGGAT TAACTAAAAA 600

TCTTGGGGCA TCAATAGGTT CAACAATAAT GGGATATCTT TATGCGATAC AATCAGGAAT 660 

CTTTGGTCCA AACTTACACA nTGTGTTAAG TGCTGTTGCT GTAATTAGCA TGGGCTTATn 720 

GTTTTATG 728 
(2) INFORMATION FOR SEQ ID NO: 612:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 913 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 612: 

TATCAGAGCA AATAAGCAAG TTAGATAAAA AGATTTCTGA CTTACAATTA ATTAGACGCT 60 

CTGTATGTGA ATTTATTAAA GGACTCTCTC TAATAGATAC CAGCATTTTA AACAAGACAC 120 

TACAGTCACA ATATGATAAA GAAGCATCTA TAAAATATGG TCATACGAAA GCATATCAnT 180 

CATTTATTAG ACGTAAAGAC AGCTTACAAT CGCAGGATAT CAGACATAAA TTGACAACTA 240 

TCTTCAATAA ATTTAATCAT ATGTCTTTGA GTCATTATCC AATCCAAGAT TGTAGTGATC 300 

TCGTATTTGA GTGGAAGGCA TTTATGAACA CTATCGCTGA TTTTGmTGAT GAAACATTAT 360 

GCTGTATTGC TAAAACATAT GAAGATGATA CGCGTTTCAA AGATTACTTT AATTCATATG 420

ATAATCAAAA TTTAGCATCA TACATTTCAG AAGCTGTTAA TTATTTTTTG AGCAATGTGA 480 

ATAAGAGCGA CAATTTTTAA TCCTCATACA GATGCTACAA CACTGAATAC ACCAACTAAA 540 

ATAATTGTTC AAAGTAGAAA ACTAATCATT TATTCTCTAA ATCAATAACT ATATTAAAAG 600 

TTATACCTTT GCAAAGCGAA TTAGTATAGG TTACCGAAAG GAGAAAGGAT TAGGTTCCAT 660 

TCGATTTATA AAAAAATATA TTTCGAAACA TACAAAAGCG CCAGAAGATA ATCATTTCCT 720

TCAATGCATA TTCAACTAGC TCAGGTTTAC TCTTTAAGCC AAGCTTTGTC ATAATATGCG 840

TCTTATGTGC TTCTACTGTT TTCACAGATA CAAATAATTT CTCTGCAATT tCTTTATTCC 900

CGTAACCTTT GGC 913
(2) INFORMATION FOR SEQ ID NO: 613: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 654 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 613:

ACAAGTGTTG ATGTCATAGT AGCGTCAAwT GTGCTTTATT TCTGGGACAC ATTTAAATTT 60
TTCCACCCCA TTACTTGGAG CTTAATCTTT ATTACAATTT TACTATTATT AAACATTTTT 120
TCTGTAAAAT CATTTGGAGA AACTGAGTTT TGGTTATCAT TGATTAAAGT GTTAACAATT 180
ATCGTATTCG TTATTTTTGG CTTTTTAATG ATTTTCGGTA TCTTAGGTGG TCATACATAT 240
GGATTTGAAA ACTATACAAA AGGCCAAGCA CCGTTTGTTG GTGGTATCTC TGGTTTCTTA 300
GGCGTATTAT TAGTCGCCGG ATTTTCGGTT GGTGGTACAG AAGTAGTAGC AGTAACTGCT 360
GGTGAATCAG ATGACCCTAA AAAGTCTATG CCTAAGGCAA TTAAACAAGT ATTTTGGCGT 420
ATTCTTTTAT TCTATGTCTT ATCAATTGCA GTAATTGGTG CAATTATTCC GTACACAGAT 480
CCATCATTAT TAAGAGCAAG TAGTTCAATA AGTCAAAGCC CATTTACAAT TGTATTCGAT 540
AGAGtAGgCA TAGCCTTTGC AGCATCAGTA ATCAACGCGG TTATTTTAAC TTCATTATTA 600
TCCGCTGCAA ATTCAGGTGT TTATACAACA GGCAGAATGT TGTATTCCTT AAGT 654



(2) INFORMATION FOR SEQ ID NO: 614: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 642 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 614:
TCGATCTTTA TTGATAATAA TTAAATTGTC GCCTTTAAAA TGTGATATTA ATCCTGCGGC 60
AGGnTGTACA ACGAGTGATG AACCTAGTAC AACAAGGGTG TCAGCATGTT CAATTTTATT 120
TAATGCCCTT ATGATGGTAG GTTGATCTAA CATTTCACCG TATAATACGA TGTCCGGTCG 180

ATAAGATTTA TGACATACAT TACAATAAAA ACGATTTAAC GTGCCATGTA ATTCATCAAC 300

ATGTTGACTT CCAGCGTCTG AGTGCAAACC ATCGATATTT TGCGTGATGA CACCTAAAGA 360 

TTGTTGATTA CGTTCTAATT TTGCAATCCA ATCATGAACG ATATTGGGCA TCGTATCGAC 420 

AAATAGTAAG CGCTTATGGC AGAAATTGAT AAAACCTTCA GGATCATCTT CTAAATAATC 480 

ACGGCTTAAC AAGTATTCTG GCGAAAGCCC ATCTTTTGaA ATTTCATCAA ATAAGCCACC 540 

CATTGaACGG AAATCTGGAA CGCCACTTGC GACAGATACA CCAGCACCTG TAAAAAATGT 600 

AATACGATTC GAACTATCTA TAATATGTTT TAGTGTCTCT AA 642 

(2) INFORMATION FOR SEQ ID NO: 615:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1210 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 615:

TTTCCCCnCC CCnCCAAATA TCCAAnGGAA CTTTAATAGT CCAATTGGCA CAGTAAAACT 60
ATGGCATTTT ATAAGTATAA TATATCTGTA ATTTATGGTC AATTAGTAAA TTGTTTTTTA 120
TTTGAAACAT ATTTCACATc AAAATCACAA AGACTTTTAG ATTTTGtTCT AAAAATCTCT 180
TAATAATTTA TTTAATGAGA AGAGTTGCTT ATATAGTAAA TTGTGAAGCC GTTAAAACAA 240
CGTTACAAAA CCTATATCTT TAATACGGAA CCATATGGTA TGAATCAAGG AATACTTAAA 300
CTAAAACTTC TCTATCAGAT TTATTTGTTG CGAAATCAAC AACTTTAATT GCTTGCCCTT 360
CATTTAATGG ATAATTTGCT TGCGTAATTT TAACTTTTAC AATTTGACCT ATGAGTGATT 420
CGTCACCTTC AAATTGTACT TTCATATAAT TATCTGCATA TCCAACTAAT GTACCTTCTG 480
TGtCACCCTG TTCCTCAGGA ATTACTTCAA GCACATCTTG ATCAAATTTA GACGCATATA 540
ACTTTCCGAG TTGATTGCTT AGCGTAATTA ACTTATGCAC CCGTTCATTT TTAATTTCTT 600
CATCAATTTG GTCATCCATT CTTGCAGCTG GCGTGCCAAT TCTAGGAGAA TAAGGGAAAA 660
CATGCAGTTC AGAGAACTTA TGCTTTACGA TAAAATCATA TGTTTCTTGG AACTCAGCTT 720
CAGTTTCACC TGGGAAACCA ACAATTACAT CACTCGTAAy TGCCAAGTCT GGTAAAGCTT 780
TATGCAATTT TGTTAATCGT TCTGAAAATC TATCCATTGT ATACTTACGT CTCATACGTT 840
TTAATACTGT ATCTGAACCA GATTGTAATG GAATATGCAA ATGACGCACA ACTTTTGTTG 900
AACGTTCTAA AACGTCAATT ACTTCATCTG TAAGTTGACT TGCTTCAATT GAAGAAATTC 960

CTTTTAAATC TTGACCATAT CCACCTGTAT GAATTCCCGT CAATACAATT TCCTTATATC 1080

CTGAATTCAC TAGTTGCGTC GCTTGTTCAA CTACTTTTTC CGGATCTCTT GAACGCATTA 1140 

AGCCACGAGC CCATGGAATA ATAcAGAATG TGCAGAAGTT GTTACAACCT TCTTGAATTT 1200 

TTAATGACGC 1210 



(2) INFORMATION FOR SEQ ID NO: 616:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 652 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 616:

TAATAAAATA CAATACTTTT CAATACAGAG AATCCCGCAA TGTCGGGATT CTTATTTATG 60
CTGATTTTGT TTTTGTCTAT GATCAGGGAC TTtCAGGGAC TCAATTAATT ATCACATTCA 120
TATTTTCTTA TGACCAAATT GATACATATG ATGCTATTTA ATGAGAAAAT TAGGCATCAC 180
TTGGTTATTG AATTTCCTTC ATTAAACTTT CCAGCTCAAT TTAATAGTTA GTCGACTATT 240
ATTCATTAAA CACTTTTTAA TCATAAAAAA GTGTTTTTGa TAATTCACTa CCaAAAACAC 300
CTTCTTACTT ATAATTCTAT TTGTTCACCA TTTCTAATCT TATCGGCTAA ATCATTCAGT 360
TTTCTTAATC GGTGATTTAC ACCTGATTTT GAAATTGGAC CAGTTGATAC CATTTCTCCA 420
AGCTCTTTCA ACGAAATTTC TTGATGTTCT ACTCGAATTC TAGCAATCTC TCTCAACCTG 480
TCTGGTAAAT TTTCAATACC AATTTCTTTA TCAATCAATT TAATGCTCTC AACTTGTTTC 540
ATCGCAGCAC TAACTGTTTT ATTTAGATTG GcCGTTTCAC AATTAACGAG TCGGTTAACA 600
GAATTACGCA TATCTCTTAC AATACGTACG TCTTCAAATT TTAATAACGC TT 652



(2) INFORMATION FOR SEQ ID NO: 617: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 617:

TnACCACTTT AAAATAGCGC TTAATAAAtG AAGGGGGCAA GTCTTATGAC GtTTTACaAT 60
TTCATCaTGG GTTTTCAAAA TGrTAACACA CCATTTGGTA TATTGGCCGa ACACGTTAGT 120

TCTAATTACA CAGATCATCA ATTAATTGAA ACTACAAATA GAGCTATTAG CTTATATATG 240

GCAAATTAAT TTGAGTAGTA CCAATTATGA TGTATTAGTG CATCCCAAAT ATCTTTTGTT 300 

TTAAAGTTTA TTTCATCATT TCTTATCGAA AATGGTGTAA TAATGTCTTT ATCTAACCAA 360 

GTGTTGATAA GTTCATTTGG TACACCATCT AACAACATTT CACTTTTACT AATTATAAAA 420 

CATTCCCAGT CAAGTGAAAC ATTTTGTGGA TTCACATAAT TACATTGATT ATGATTATCC 480 

ATAAACACTC ACTCCTTTAA AATTCTGTAC TCTTCATTGC GTTTTACCCC GTCACATTAT 540 

CTTTTAAACT AAAATCATCA TTACTTATGA AAAAAATGTA CATCAAAAGC AAAGGTTTTC 600 

GCTACCGAAA AAGTTTAAAT AATGGTttAA TATATttGGT ACTCATTTTA ATAAAAAGAG 660

AATACATTTT GAGCTATCAA TACTTTTTAT TGAAGAGGTG TTATTsyTGG CTAAAACGTT 720 

ATATTTAATG CGCCACGGAC AAACTTTGTT TAATTTTAAG GGACTAATTc AGGGATTTGG 780 

AGATTCGCCG CTAACAGA 798 

(2) INFORMATION FOR SEQ ID NO: 618: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1786 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 618:

ATACGCTCAA TTGATAAAAA TTTAATATCT GCCATTTGTA TACGCATCGC TTCGAATGTT 60 

TCCGTTGCAA TATCAAATGA TAATTGTGAC TGGAACCTTA AACATCGAAT CATACGTAAA 120 

GCATCTTCTT GGAATCGTTC CTCAGCTATA CCTACAGTTC TTATTATTCG ATTATTAATA 180 

TCTTGTTGAC CATCAAAATA ATCATACAAT TTGTATGCTG TATCCATTGC TATCGCATTC 240 

ATCGTGAAAT CTCGTCGTTG CAAATCTTCG TATAAATCAC GAACAAATGT AACACCACTT 300 

GGTCTACGGT GATCGACATA ATCTTCTTCA GCCCGGAATG TTGTCACTTC ATAATTTtCA 360 

TCATtAAAaa CTACATTtAT CGTGCCAtGT yCTTnACCTA CAGGTATCGT ATGACTAAAG 420

ATAGATTCTA TTTCATCCGG CGTTGCACTT GTTGTGATAT CTATATCATG AATATTTCTT 480 

CCCATGACAT AATCTCTTAC AGAGCCACCT ACATAATATG CTTCAAAACC ATTGTCTTGA 540

ATTTGTTCTA ATATAGGCCT TGCCTGTTCA AATAATGATT TATCCATATT ATTACTCGCC 600

TTTACTTTTG TTATGCTCAT TTAGCATTTT TTGATAATAA TACTCATATT GATCTGTAAT 660 

AAGTTCTGAT CCAAAACGTT CAGCAATATC TGCTAGCATG TTTTTCTGAA GTTTGTTGTA 720

ATCTACGACA AATCCAGTTT CACCATGTTT AATAACCTCT TTAATTCCAC CGGCATTTGA 840
ACCAATTGGA ACGACGCCTG TwTTCATAGC CTCAAGTAAA GTTAGTCCAA AGCTTTCTTT 900
TTCACTTAAT AATAATACTA AGTCAGATAA TTGGTAAAAT TCACTTACGC AATCTTGTTT 960
CCCTAAAAAT AAAACATCCT CTTCTACGTT TAACTCTTTC GTCAATTGAC GCATTGGCAC 1020
TAATTCAGGA CCATCTCCAA GTAAAATTAA TTTACTAGGT ATCTTTTCAC GTACTTTTGC 1080
AAATGTTTCT ATAATAGTAT CTATGCGTTT TACTTGTCTA AAATTCGATA CATGTATTAA 1140
CACTTTTTCA TCTGGTGCTA TACCAAATTG TGATTTTAAT GCTGTGTTAT GTTTAGTTGG 1200
AAACTCATTT TCACGTACAA AATTATAAAT CGGTATAATT TCTTTGTTAG TTTCAATAAT 1260
TTCATGTGTT TCTTGTGCTA AAGATTTACT CACACTTGTC ACAATATCAC TTTTTTCAAT 1320
GCCAAATTTA ATTGCACCTT GGAGTGAATG ATCATAGCCC AAAACAGTAA TATCAGTACC 1380
GTGTAGCGTT GTCATAATTT TTATATCTTT ACCTGACATC TCACGAGCTA AAATCCCACA 1440
AATTGCATGA GGTACAGCAT AGTGCATATG CAACAAATCA AGATCATATT CTTTAATAAC 1500
TTCAGCGATT TTAGTACTTA ACGTAATATC ATACGGTGGA TACTGAAATA CTGCATATTG 1560
ATTCACTTCA ACTTGATGAA AAATCATATT CGGTAATGGT TTTCTTATTC TAAACGGGAT 1620
ATTTGAAGTG ATAAAATGTA CTTCGTGACC TCGCTCTGCT AATTTAATTC CTAATTCTGT 1680
GGCAATAATT CCAGAACCAC CCATGGACGG GTAACATGTT ATACCTATCn TCATTCGCTT 1740
GCCCATCCTT TCTTTCTATT TCTCnCTATG ATnCTCGATG CGTAGA 1786



(2) INFORMATION FOR SEQ ID NO: 619: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 844 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 619:

ACAGGTATGG ACTTnGCTCA AATGACACGA CATTATTTAT CAAGACCTAT TGCTATAATC 60
TTTTGGATCA TTGCAGAACT AGCAATTATC GCTACAGATA TTGCTGAGGT TATTGGTAGT 120
GCTATTGCTC TTAATCTCCT ATTTAACATA CCTTTAATCG TCGGTGCACT AATAACTGTA 180
CTTGATGTAT TTTTACTACT TTTTATAATG AAATATGGTT TTAGAAAAAT TGAAGCTATT 240
GTTGGTACAT TAATTTTCAC AGTGTTATTC ATCTTTATAT TTGAAGTCTA TATTTCATCA 300
CCACAGTTGA ATGCTGTGTT AAATGGATTT ATACCACATA GTGAAATCAT TACAAATAAC 360

TTACATTCAT CAATTGTACA ATCTAGAACA TACTCAAGAC ATAACAATGA AGAAAAAGCG 480
CAAGGaTTAA ATTTGCTACG ATAGATTCGA ACATTCAGTT ATCAATCGCA TTTGTAGTCA 540
ATTGCTTATT ATTAGTGtTA GGAGCATCAC TATTTTkCAA CTCAAATGCT GACGATTTAG 600
GTGGTTTCTA TGATTTATAT CACGCCTTAA AAACTGAACC TGTACTAGGT GCAACAATGG 660
GTGCAATCAT GAGTACATTA TTTGCAGTTG CATTATTAGC GTCAGGTCAA AATTCAACGA 720
TTACTGGTAC TTTAGCAGGA CAAATTGTAA TGGAAGGATT TTTAAGATTA CACATACCAA 780
ATTGGTTAAG ACGTTTAATT ACACGTTCTC TTGCTGTCAT TCCTGTTATC GTATGCTTAA 840
TCAT 844

(2) INFORMATION FOR SEQ ID NO: 620: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 574 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 620: 
TGTATTGCAG TCATGCCATA AATTTTTTGA ATAATAAACG GAGACGCAGA AATATAAGTA 60
AATAATATTA CAAATGTCAT ACCTTGGAAT GAGCATTGGT AATACAAAAC GTGGCGTCTT 120
CAhTAATATT TTGAAGTTTT TAAACATTGT CTTTAaTCCA CTACTTGACT CACGATTTGT 180
CACTGTTAAT GATTCAGGTA CTTTTAATAA AGAACCTATG ACCATGACGA AGCCAAAGAT 240
AGTCAGAATG ACAAAGACCA TACGCCAGAC AGAATAATTT AAAATTATGC CCCCTATTGT 300
TGGCGCAACA ACTGGTGCAA TACCATTAAC AAGCATCAAT AATGCCATAA ACTTAGTTAG 360
TTCATTACCA CTATACATAT CACTTGCTAT AGCTCTTGAA ATAACTGCTG tGCGCCACCT 420
GTCACTCCTT GAAGaAATCT TAATGCAACC ATCAGCCAAA TATTATGTAC AAAAACAATA 480
CCTAAACTTG CTAATGTAAA AATAATCATA GCTATAATAA GCGGCTTTCT GCGCCCAGTT 540
GAATCTGAAA TTGGACnAGC AACCAAATTA CCAA 574
(2) INFORMATION FOR SEQ ID NO: 621: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 523 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 621:

AAGCAAGAGA GAGTACAACA TTTATATGAT ATTAAAGACT TACATCGATA CTACTCATCA 60

GAAAGTTTTG AATTCAGTAA TATTAGTGGT AAGGTTGAAA ATTATAACGG TTCTAACGTT 120

GTACGCTTTA ACCAAGAAAA TCAAAATCAC CAATTATTCT TATTAGGTAA AGATAAAGAG 180

AAATATAAAG AAGGCATTGA AGGCAAAGAT GTCTTTGTGG TAAAAGAATT AATTGATCCA 240

AACGGTAGAT TATCTACTGT TGGTGGTGTG ACTAAGAAAA ATAACAAATC TTCTGAAACT 300

AATACACATT TATTTGTTAA TAAAGTGTAT GGCGGAAATT TAGATGCATC AATTGACTCA 360

TTTTCAATTA ATAAAGAAGA AGTTTCACTG AAAGAACTTG aTTTCAAAAT TAGACAACAT 420

TTAGTTAAAA ATTATGGTTT ATATAAAGGT ACGACTAAAT ACGGTAAGaT CACTATCAAT 480

TTGAAAGATG GAGAAAAGCA AGAAATTGAT TTAGGTGATA AAT 523
(2) INFORMATION FOR SEQ ID NO: 622:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1871 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 622:

AGAAGTGTGA nAAAAATTTA AnAGAGATAT GCACATAGAT GACGCATTGC TATATCCAAG 60

CAATTGAGAA AGCTGCTGAT GCTCCAAATC aCGGAATGAG GGAACCATGG AGAGTTGTGC 120 

ATGTTCCGAA AGACAGATTA GGAGATATGA GTAAGGATAT TTCTAAATTT GCATTTCCTA 180 

ATGAATTAGA TAAGCAACAA TGTCATTATG ATGCAGTTAC GAAACTAGGT GGCATGTTAT 240

TGCTTATTTT AAAAACAGAT CCAAGACAAC GTCAAAATGA TGAAAACTAC TTTGCATTTG 300 

tGCATATGCA CAAAATCTTA TGTTGTTACT TTATGAAGCG GGAATAGGTA CATGTTGGAA 360 

aTCGCCATTA TATATCTATG ATCCTAAAGT AAGnAAACAC TTGGTATAAA GnAAGATGAA 420

GTTCTTGCTG GATTCTTATA TTTAACGGAT TTAGAAGnAG ATATGCCTAA AGCACCACGT 480

AAAAATAGAA ACTTAATTAC ATTATATTAA TATGTATAAT TATAGAAACA TTAATAAAAG 540

CTGAGTCATG AATTGATGGA CATCTATCGA GTTAGAGATT TAATCTAACT TACTAGAGTC 600 

GGTACAATAA CAGTCTCAGC TTTTTATTGT GCAGTATATA CACATTTTTA TTTTAGTATT 660 

TATTtAAAAG TTtCTGCTaA AAATGATTCA ACTTGTTCAG GTGACTTAGC ATTTGCTGAA 720 

TGAAGGTGTG CAATTTTATC GCCGTTTTTA AATACTAGCA AGCTAGGGAT ACCCATAACT 780 

TCATTTTCAA CAACTACATC TTCTAATTCA TCACGATTAA CAGTATACCA TTGGTAATCA 840

CCTGCCTCAA ATTTAACAAT TACAGGTGTA TCGCTATTAA TTACAGATTT AAATGATTCA 960

TTACTTTTGA TTGATTGCAT TGTAACAACT CCTCTAGATA GTTTAATAAT TTTTATTATA 1020 

GCTAAATTTA TATCATAATA AAAAATTTTA GCTTCAAAAT GAAAGCCTTT TGCTTTGGAA 1080 

AATGATATAT TTATTTTAAA TACATAAAGG AGGTTGCAGT CGTATGATTA AATTTTACCA 1140

ATATAAGAAT TGTACAACTT GTAAAAAGGC AGCAAAGTTT TTAGATGAAT aTGGCGTAAG 1200 

TTATGAACCA ATTGATATCG TTCAACATAC ACCTACAATA AATGAATTTA AAACAATAAT 1260 

TGCAAATACA GGCGTAGAAA TTAATAAATT GTTTAATACA CACGGCGCGA AATATCGTGA 1320 

GCTTGATTTG AAAAATAAAT TACAAACTTT ATCAGATGAT GAAAAGTTAG AGTTGTTATC 1380 

ATCTGATGGT ATGTTAGTAA AGCGTCCTCT AGCAGTAATG GGCGATAAGA TAACATTAGG 1440 

ATTTAAAGAA GATCAATATA AAGAGACTTG GTTAGCGTAA GTGaAATGTA AGCGTTTACT 1500 

AAATATCTCG ATATTTAGAT TCATTACATG TAAAATGAAA TAAGCTATAC AATTGTTAAT 1560 

TTTTATAAAT ATAGTTGAAT AGCATCTAGC CTTATGGCAT CATTAATGAT GTAAAGATTA 1620 

ATTAGGAGGG GATTCTCTTG GCAGTACCAA ATGAAyTGAA ATATTCAAAA GAGCATGAAT 1680

GGGTTAAAGT TGAAGGTAAT GTAGCAATAA TTGGAATCAC AGAATACGCA CAAAGCGAGT 1740

TAGGTGATAT TGTTTTCGTT GAATTACCAG AAACAGATGA TGAAATTAAT GAAGGGGATA 1800

CGTTTGGTAG CGTAGAATCA GTTAAAACTG TATCAGAATT ATATGCACCA ATCTCTGGTA 1860

AAGTAGTTnA A 1871 
(2) INFORMATION FOR SEQ ID NO: 623: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 623:

TTGCTGTAGA AGATAAAGAG TCCCATCAAT GGATAGGCTT TATAGGTTTG AATTATATTC 60 

CAGAAACAAG CGATTATCCA TTTAAAGAAT TACCGCTTTA TGAAATAGGT TGGCGCTTGT 120 

TGCCAGAATT TTGGGGAAAA GGATTAGCAA CTGAAGGCGC AAAGGCAACA TTGAAGTTAG 180 

CAGAAGAACA TCAAATATAC GATGTCTATA GTTTTACAGC AGAAGCAAAT AAAGCTTCAC 240 

AACGTGTAAT GGAAAAAATT GGCATGACAG TGTATGATCA TTTCGAATTA CCCAATCTAA 300 

GTAAGTATCA TTTATTaAAA AGGcAAGTGC GCTATTACAT TAATCTTCcG AAAGTGGAAA 360

ATTAgGGGGT GtTTnGTkTn ATTTTTTTAA n 451
(2) INFORMATION FOR SEQ ID NO: 624: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 665 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 624: 

CGATAGTTAT CTAAAGCCAT TTTnGATTGT GTTATGAAAT CTAATGATGC GTGATAATTT 60 

AATGCrACAT AACGrTaATA TAAAATATCA ATAGTGAACA TTTGAGCAAA TAATGAAGTT 120 

GTTGCTCCCA TATGCATCTC ATTTTCATCA GTTTTCCCAT AAGTTAAAAC AATATTTGAT 180 

GCCTGTGCTA CGGGATTATC CCTTGTACTA GTAATTGTAA TTATAGGTAT ATGGTAGTCA 240

TCAATAACTT TAACCATTGA TTGCATTTCA CTTTGCGTAC CATTGTTAGT AATAAGAATA 300 

aCACTGTCGT TCGAATTGTG AGTTGCTAAT AATGTAGCAA AAATATGTGT TTCTTGAACA 360 

AGTTGAATAT TAAGACCTAT TCTTGATAAC TTTTGGTATA AGTCGGTAGC AACTACAAAA 420 

GATGCGCCAA AACCATATAT AAAAATCGTC TCAGAACGTT TTAAACAATG ACATATTTGA 480

TCAATAGTTT TATCATTTAA TTCGTTATTT GCATGATTAA GTGCGCGTGT AGTACGTGTA 540

TGGAGTTTAG TTCTTAAAGA TTCTGTGCTT TCGTTATTCA TTAATTCAAC ATTGTAAATT 600 

GATGATGCTT TAGGAACATA TTTAGATATA TTTATTTTCA AGTCGTGAAA ACCGCCATCA 660 

GTAAT 665
(2) INFORMATION FOR SEQ ID NO: 625: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 625:

GATGCCAATT AACCGTGCAT ATAATGTTGA GAAGTTAATC GAAGCAATTC AATATTATCA 60

AGAAAAAACA AATCGTCGTG TTACTTTTGA ATATGGTCTG TTTGGTGGTG TGAATGACCA 120

ACTAGAACAT GCAAGAGAAT TAGCACATTT AwTAAAAGGC TTAAACTGCC ATGTTAACTT 180

AATTCCtGTC AACCATGTTC CAGAAAGAAA TTATGTGAAA ACGGCTAAAA ATGATATCTT 240

TTCGGATATT GACGCAGCTT GTGGTCAATT AAGAGCAAAG GAACGACAAG TAGAAACGAG 360

GTAAAGACAA ATGCTAGAGG CACAATTTTT TACTGATACT GGACAACATA GAGATAAGAA 420 

TGAAGATGCG GGTGGTATTT TTTATAATCA AACTAATCAA CAACTTTTAG TTCTGTGTGA 480 

TGGTATGGGT GGCCATAAAG CAGGAGAAGT TGCAAGTAAA TTTGTTACAG ATGAGTTGAA 540

ATCyCGTTTT GAAGCGGAAA ATCTTATAGA ACAACATCAA GCTGAAAATT GGTTGCGTAA 600

TAATATAAAA GATATAAATT TTCAGTTATA TCACTATGCA CAAGAAAATG CAGAATATAA 660

AGGTATGGGT ACAACATGTG TTTGTGCACT TGTTTTTGAA AAATCAGTTG TGATAGCAAA 720 

TGTCGGTGAT TCTAGAGCCT ATGTTATTAA TAGTCGACAA ATTGAACAAA TTACTAGTGA 780

TCACTCATTT GTTAATCATC TTGTTTTAAC GGGTCAAATT ACGCCGGAAG AAGCATTTAC 840

ACATCCACAA CGTAATATTA TTACGAAGGT GATGGGCACA GATAAACGTG TGAGTCCAGA 900

TTTGTTTATT AAGCGATTAA ATTTTTATGA TTATTTATTA TTAAATTCAG ATGGATTAAC 960

TGATTATGTT AAAGACAATG AAATTAAGCG TTTGTTAGTA AAAGAAGGTA CAATAGAAGA 1020 

TCATGGTGAT CAATTAATGC AATTGGCATT AGATAACCAT TCGAAAGATA ACGTTACTTT 1080

CATACTCGCG GCTATTGAAG GTGATAAAGT ATGATAGGTA AAATAATAAA TGAACGATAT 1140

AAAATTGTAG ATAAGCTTGG CGGCGGTGGC ATGAGTACCG TTTATCTTGC TGAAGATACG 1200

ATACTTAACA TTAAAGTTGC AATTAAGGCG ATTTTTATAC CACCTAGAGA AAAAGAAGAA 1260

ACATTAAAAC GTTTTGAACG AGAAGTACAT AACTCATCAC AGCTATCACA TCAAAATATA 1320

GTAAGTATGA TCGATGTTGA TGAAGAAGAT GACTGTTACT ACTTAGTAAT GGAATATATC 1380

GAAGGTCCGA CTTTGTCTGA GTATATTGAA AGTCATGGGC CATTAAGTGT TGACACAGCG 1440

ATTAATTTTA CGAATCAAAT ATTGGATGGC ATTAAACATG CGCATGATAT GCGTATTGTA 1500

CATAGAGATA TTAAGCCACA AAATATATTA ATTGACAGCA ATAAAACGTT GAAAATATTT 1560

GATTTTGGAA TTGCTAAAGC TTTAAGTGAG ACGTCTTTAA CTCAGACTAA TCATGTGTTA 1620

GGTACTGTGC AGTACTTTTC GCCAGAACAA GCAAAAGGTG AGGCAACGGA TGAATGTACA 1680 

GATATTTATT CTATAGGTAT kGTGTTATAT GAAaTGCTTG TTGGTGAACC ACCCTTTAAT 1740 

GGAGAAACTG CAGTTAGCAT TGCGATTAAA CATATTCAGG ATTCTGTGCC AAATGTGACA 1800 

ACAGATGTAC GTAAGGATAT TCCGCAATCT TTAAGTAATG TCATTTTACG CGCTACAGAA 1860 

AAAGACnAAG CGAATCGTTA CAAAACAATT CAAGAAATGA AAGATGATTT GAGTAGTGTT 1920 

TTACATGAAA ATCGAGCGAA TGAAGATGTC TATGAACTCG ATAAAATGAA AACGATAGCG 1980 

GTACCTTTGA AAAAAGAAGA TCTAGCAAAG CATATTAGTG AACATAAGTC GAATCAACCT 2040

AGCCAGAAGG TACGGTGTAC GAACCAAAAC CTAAAAAGAA ATCAACACGA AAGATTGTGC 2160

TCTTATCACT AATCTTTTCG TTGTTAATGA TTGCACTTGT TTCTTTTGTG GCAATGGCAA 2220 

TGTTTGGTAA TAAATACGAA GAGaCACCTG ATGTAATCGG GAAATCTGTA AAAGAAGCAG 2280 

AGCAAATATT CAATAAAAAC AACCTGAAAT TGGGTAAAAT TTCTAGAAGT TATAGTGATA 2340 

AATATCCTGA AAATGAAATT ATTAAGACAA CTCCTAATAC TGGTGAACGT GTTGAACGTG 2400

GTGACAGTGT TGATGTTGTT ATATCAAAGG GCCCTGAAAA GGTTAAAATG CCAAATGTCA 2460 

TTGGTTTACC TAAGGAGGAA GCCTTGCAGA AATTAAAATC GTTAGGTCTT AAAGATGTTA 2520 

CGATTGAAAA AGTATATAAT AATCAAGCG 2549 
(2) INFORMATION FOR SEQ ID NO: 626: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2286 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 626: 

TGCTTACTTC GCCTTCAATA CGTACTAATT CATGTCCACA ACTTGGACAA TGGGTTGGCA 60 

TATGATATGT GACAGCATCC TCAGGTCTAC GTTCTGGAAT ACTACGTACA ACTTCAGGTA 120 

TGATGTCACC TGCTTTTTTC ACTACAACAC TATCACCAAT TCGAATATCT CTGTCATGAA 180 

TTAAATCCTC ATTGTGCAAA GATGCTCTTG aTACAGTTGT ACCAGCTACT TTTACTGGTT 240 

CTAAAATAGC AGTAGGTGTG ACTACACCTG TTCGTCCAAT ACTTAATTCA ATATCTAATA 300 

ATTTAGTTAC TACTTCCTCA GCTGGAAATT TATAAGCAAT GGCCCATCTA GGAGATTTTT 360 

GTGTGAATCC CATCTCATCC TGTTGATCTA AATCATTAAC CTTAATAACA ATCCCATCAA 420 

TATCATAAGG TAATGACTCT CTTTGGCTTG TCCATTTTTC AATATACTCT AAAACACCAT 480

CGATATTATT TACACGCGCT CTATTTTTAT TCGTTGTAAA ACCTAATTTA TCTAACTCAT 540 

CTAATGCTTC ACTTTGCGAA CGCGCATTGA AATCAGTGAA ATCATTGACA CTATATATAA 600 

ATACGCTTAG CTTTCGTTTT GCCGTTAATT TAGAATCTAA CTGTCTTAAT GATCCCGCAG 660 

CAGCGTTTCT TGGATTnGCA AATAACTGCT CATCATTTTT TTCTTTTTCT TCATTTAATC 720 

GTAAAAATGA ACGTCTCGGC ATATATGCTT CACCACGAAC TTCTACATTT AATGGtTCTT 780 

TCATTTTCAA AGGTATCGCA TGAATTGTTT TTaAATTTTC GGTAATATCT TCACCTGTTG 840 

TTCCATCACC ACGTGTTAAA CCTTGaACGA AGTATCCATC AACATATTTC AATGATACTG 900

GTTGGTCGAA TTTTCTCAAA TCATCCTCAT TAAATGCATT CCCTAAACTT AACATTGGCG 1020
TGTCATGGTT GACTTTATTG AAAGAGGCTT GGGCTTCACC GCCAACTCTA ACTGTTGGAG 1080
AATCTACAGT CTTATACTCA GGATGCTCCT CTTCTATTTT AATCAGTTCA TGAAGTAATT 1140
TGTCATATTC ACTATCTGGT ACAGATGGAT TATCCTCTAC ATAGTATTCA TAACTGTATT 1200
GATTTAATAA ATCATGTAAC TCGTTCACAC GAGACGATAA ATCAGGCATC CCTTAATCCT 1260
CCTTTTTTtC AATTGGTGCA AATTGCGCTA ACAAACGTTT TGGCCCTTGT GATTTAAAGA 1320
TAATATCTAG TTCGATTGAG CCATTTTTCT CGTTTACATT ACTCACCATG CCTTCTCCCC 1380
AGGCTTTATG CATCACTTTG TCACCTACAT TCCAATCAGA TGACAATACT TGTTTTTTCG 1440
TTGACGTTGT TCGTTGACTA AATCCGCGTT TAGCAAAAGG TTTTGCCTTA GGTTGTATCG 1500
TTTGTCGTTT GCCACTTGAA TGATTTTCTA ATAGTGATTC TGGAATTTCC TTTAAAAATC 1560
TGGATGGCAT ATTTGACTGA GGGCGACCAA ATAACATTCT TGATGTCGCA TGAGTGATAT 1620
ATAACAGCTC TTCAGCCCTT GTAATTGCTA CATAACAAAT ACGACGTTCT TCTTGCATTT 1680
CATGATCATC TTCACTCTTA ATCGCTCTAA TATGTGGGAA TAAAGATTCT TCCATCCCCA 1740
TTATAAAGAC AATTGGAAAT TCAAGGCCCT TAGCCGAGTG CATCGTCATT AGTGTTACGC 1800
CATTTTCAGT ATCTGCCTCA TCAATATCAG CTACTAACGA TAAATCCGTT AAAAAGTTAA 1860
TTAATGACTG TTCTTCTAAT GGGGTATTTT CCTCATAGTC TTTTGGTACT GACATAAATT 1920
CATCGATGTT TTCTAATCTA CTTCGAGATT CTAATGTATT TTCACGTTCA AGCATTTCTC 1980
GATAGCCAGA CTTTTGTAAT ACTTCATCAA CTATTTCATG AATTTCTAAA AATTCTTGTT 2040
CTTTTATCAA GCTTTGGATT AACTCGTAAA AATTAAGACA CTCTTGTGTC ACCTTTTTTG 2100
aCAATCCGAT AAAATCAGCT TCTCCAAGTG CATCAAACAT ACTGATATTG TTTTGAAGTG 2160
CATAGTTTTG AACTTTTTCA ACAGATGAAG GACCTACACC TCTTTTTGGA ACATTaATAA 2220
TACGTTGCAA ACTAATGTCA TCATTACTAT TGGCAATTAT ACGCAAATAA CTTAATAAAT 2280
CTTTGA 2286
(2) INFORMATION FOR SEQ ID NO: 627: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 627:

TGACATTAAA AGTTGGGTGA AnAAACAAGG CATACTTTAG TTAAGCTTGA TGAAAATAAC 120

AATGGAATTA ATGCGATTAT TCAAAAAGAA AAAGCAAAAG ATTTAGATAT AAATTATTCT 180 

GCTAAAGGTA CTACCAATTG TATTATTTAG TGGAGAATTA GACAAGCTGT AGCAGCGTTG 240

ATTATTGCCA ATGGTGCTAG AGCTGCTGGA AAAGATGTAA CTACCTCCTT ACTTTTTGGG 300 

GGCTTATGCC nTTAAAAAAG TGCCACCGTT AATGTTAAAA GCAGTTGCCA AAATGTTTGA 360 

TTnATGTTGC CCCAAAAGAT TTACGATGCC CCTTCCCAAA 400

(2) INFORMATION FOR SEQ ID NO: 628:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 453 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 628:
AATAATTTGG GCGCTnTTTG CGTCGGGATA TTATACCGCT TCCTTAATTG TTCAACATTG 60
TAATCXCTGT TTTTCAATTG ATATTTTGCA GAGTAAATTG GTACTTCTGG GTTATATGAC 120
ACTTCGTCCT CTTTATAGTT TTCCAATTCT TTGAAATTCC CGTATTGTAC AAAGAAGTTA 180
AATTCTTCGA TTTCTTTTTT TACTTTTTCG TCATCGATTG GTTTTAATGG AATCATTTTA 240
TTAkTTTCCA TTTTCACAGG ATATCTTTTT GTATGATTGT GTGTCATTCC ATCGCTATCT 300
TCAACAACTT CTCTAACAAT ATAATGCCCT TTAGCCGTTC TAGTATTTCT GTTAATTTCT 360
AAAACTGCTC CTCTrGATTC CAGATTTTCT CCTTTTAATT GGATTTTCAT TTCAGATCTA 420
ATTAGcCAAG TACCTTTATC ATCTtTTTTA AAT 453
(2) INFORMATION FOR SEQ ID NO: 629: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1221 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 629: 
TCATATGCAT TTGCAAAATA AACGCCAGAA GCAAGGTTTA GAATTGGGCC GTCCGTTTTG 60 

CTCAaTTCAC TTGcATTCAA TAATTCATGC TGATCATGAT CAATTTGCTT ATCTAATTCT 120
GCaATTTTCT TCATTTGCTT ATCTGATTTG TTTTCTTTCG CCATCATTTG ATCACGACGA 180

GCTAATGGTA CTAATACTTT ATCTCCATAT GTGTCAATAA AGTTATAAAA ATAATCATCT 300
GTTTTTGATA CAAATCCAGC ACGCTCTTCA GTTTCACGAT ATAAATCTAA GAAAAGATTG 360
AACTCATCAC GTTCAAGGAA TCTGACTTTA ACACCATAGT TTATCGCTTT ATTAATATTA 420
CGTTTACGTT GACTATCAAA TGTCTTTTTC AATGTTTCGG GTGTTTTACC TTCAAGGTTT 480
AATACGCCCA TCCATCGTAC TTGGCTCGAT GTATCATACT CAGTTGTAAA GCCATGATGC 540
TCGTAACCAT GTGATTTAAA CAAGTTTACT AGGGCATCAT TTTTCTCGCG ACCTTCAAAT 600
GGCACGATAT CTTTATCATA TAGATGATAT AACCAATACG GATCTAATTT AACATATAAA 660
CATTGATGTT GCTGTAAATA TTTATCTAAC TCTTTTAAAT AATAATCAAC TAATCCTAAA 720
TCTGAAAAAT CGATTACTGG ACCACGATTC GAATAGTAAA CATAACTTCC CATAGTAGGA 780
ATTTTAGAGA AAAGGCTTGC TGCAATTACT TTGTTATTGT CGTCTTTAAT ACCTAATAAA 840
ACTACTTCAA AGCCATCATT CTCACGGGTA ACTATATTTT CTTTTACTTG GAAATAATGA 900
CTTTCCAATG ATGGATTTTG TACAAAGTTG TCAAATTCGG TAACAGTTAA CTCTGTAAAT 960
XTCATGTTTT GATAATTCCt TCCTAAAAAA TTCTGTCTTT AACTTTTTTA AGTGCGGTAT 1020
ATGCTGCGTA AACAGGTTTA TTAATTGGTT TAATAAAGTC ACCAACATAT TCmATAATTT 1080
CAGCATTGTA ACCTTTTTTG AATTTAACTA CACCAGCATC TTCAGCATCT TCTGTAAATT 1140
TrCCACTAAC ACCATAGAAA TTATAACGGT CAATGCATGA TTTAATGCAT AATTAATCAT 1200
TTCCCATTGC ACTGCATAAC T 1221



(2) INFORMATION FOR SEQ ID NO: 630: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1121 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 630:

TGGCCCAAnT AACACAAGTA ATTGCTGCTA ATGGTAACAT CACAAAAAAT GAAATCGTAA 60

CTACAAATGT TAAACCTTGG AATACACCAA CCATTTCTGG TAATCGTTTA CTATAGTATC 120

TATTGTGAAT CCAAGTAATA ATAGCTGAAA TAATAATACC ACCTAGAATA TTCGTATCCA 180

ATGTGGCAAT ACCTGCAATT GATTTTAAAC CAGGTACATT TTCAACGCCT TTTTCTAAAT 240

TAGCGCCAAA CGTATGTGGC CATTGTGTTA AAATGGCATT TATAAATGTA TTAAACATTA 300

AGTAACCCAT CAATGCTGCA AGTGCTGCAT GACCTGGTGC TTTTTTAGCT AAAGAAAGTG 360

TTACTGACCA AAATTTAAAC CAAAACGTAT GTTGATCTGC TAAACTCCCC ATGATTGTAG 480

GATTTTTAAA TAATGTCGCA AAGCCAAGCA CTATCCCAAA GAAAGCGAAC ATTAATACCG 540 

GTACAATCAT TGCACTACCG AAACGCTTTA TCGCATTCAT CTTCTATTCC CTCCATATCA 600 

TCTTTCCTAA CAATACATCT AATTAGATTC ATTTATAAAT AGATGTCTTA CTATTTAAAT 660 

ATAATATATA GTAAACGCTT ACACACCTAC AACGACATTG ACGTATTTTG AAAGTATTTT 720 

GTATAATCAG ATTATCTTTT CATATAGTGA AAATTTTTTC ACGACCTTAT ATATGACATC 780 

GTTGTATTTG TAATACATTC GTTTTAAACG CATAATCAAA CCTATATCAA TACACAAATA 840

TATATAATGA CATACAAGAT TTTAATGTAA TAACGATCTA TTACACATTT ATTTTCAAGG 900 

AGGTTGAATA TGTTTTTAGA TGAACACATT AATCGAAACT TTGATAAACT TAATGATAAT 960 

GATTTACATA TCGCTCACTT TATCAATACA CATATAGATG AATGTAAAAA TATGAAAATA 1020 

CAAGATTTAG CGCAATTCAC ACATGCCTCG AATGCAACCA TTCACAGATT TACACGCAAA 1080

TnAGGTTTTG ACGGTnATAG TGGATTTAAA TCGTACCTTA A 1121 
(2) INFORMATION FOR SEQ ID NO: 631:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 4005 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 631:

AACCTTCCAT TTTACTTGAT CGATAACATC AGTTCTGCTT TACAAATCTC TTCATTAATA 60 

TGCGTCTTAA AACCATGAAA TTTAACATGT TCCGATAAAT GATAATCTTC TACAAGTTGT 120 

CGATATTCTG ACAAACCATT TCCATGTCCA TAAATATTCA ATTGAATATT GGGATGTTTT 180 

GTTACTAATT GCTTGATTAC TTCAATTTGA TGTTTAATTT GTTTATTTTC AACGAGGCGA 240

GCAATTGATA TGATATGATT TTTCTCCTTT TGATTGATGT CAAATTGATA CTTTAAATTT 300 

GCCACGTAGC CAACCGGAAT ATTGATAACT GGTATTTTAT TTTCAATATA TTGTGAAATA 360 

TCTTGGCATT GCTTTTCTGT TGATACAACA ATCGCTTTAT AACGTGTTAA ATTATTAAAC 420 

ACTGTTTTAT AAAAACTTTT TATACCATTA CCGGCACCGG ATAAATGTGT ACTATGGAGC 480

ACAACAATAA CTGGAATACT TTGATTTAAT CCCGCTATAA CATTTCCTAA TTCATGAGGA 540 

CGATCTAATA TGATTTGATC ATTATTTTTA CATAATTGAT GGAGAAAATA TTGAACTAAT 600 

TCATCTTCTG TATCAAAAAA TTGTTGATGC TGGTCTTCAT TTAAGATAAC CTTTGTGAGC 660

TAGTAATTTT CGAGTACAAT CCGTTGTCCT TCACCTAAAA TTCGAGAACA ACTTAAAAAG 780

CCTCTTCCAT CATACAATTC GCGTTTTACT TTTCTTCTTT TATGATCAAA ATAATTCACA 840

TAATTTAATT GATGATACTG TTTATCTAAA AAATGAGCAT ACATTACAAA TTGCTCTTCA 900 

TCATATATTC TGACATCATT TGAATTTTCC ACAAATTTCA ATGTGTACCT ACATGACTTT 960 

TCCCAATACT GTATCCAGTT AACTTGCTTT GTCTTTTTAT AATTGATTGC TTTTTGAAAA 1020 

TAGTCATACA TTGTAAATAC ATCATTTTCA ATCTGATGTT GCTTCGCATA TGTGTATGAA 1080 

TAAGGATTCC ATTTAACATA TACACATTTT GAAGATATGC CGTGTTGTTT GAACAACTTC 1140

AATCTATTTA TTTGCGCTTT TTCTACACCT GTAATTTTAC TTTCTAAAAT TGTTCCTAAA 1200

ATGTAATTCA TATTATCGCC TCATATAAGT TTTATTCCGT ATCTTTATTG TTTATTTTAT 1260

ATGAAAAATA CATCTATTGC ATGTGTAATT ATAAAAAAAC CAGGCCACAA GGACCTGGGT 1320 

20 ' CATATTGTAT TATTTGTTTT GTTTTTTGCG ACGACCGAAT AACAATAATG AACCTAATGC 1380 

TGCAAATAAT CCACCAAATA ACGTTGCGTT ATTTGAGCCG TTATTTTCAC TACCTGTTTC 144 0 

TGGTAATGCT TTTGCTTTAT TGTGATGGTC TTTAGTAGTA CTCATTGGTT TAACAGGTGT 1500 

ATGTTTTCCT GCATCCGAGT CTGAATCGCT GTCTGAATCA CTGTCTGAGT CTGAGTCGCT 1560

ATCAGAGTCT GAGTCGCTGT CCGAATCTGA GTCGCTATCT GAGTCTGAGT CGCTGTCTGA 1620

ATCTGAATCA CTGTCTGAGT CTGAGTCGCT ATCTGAGTCT GAATCGCTGT CTGAATCTGA 1680

GTCGCTATCT GAGTCTGAAT CGCTGTCTGA ATCTGAGTCG CTATCTGAGT CCGAATCGCT 1740

ATCTGAATCT GAGTCGCTGT CTGAGTCTGA GTCGCTATCT GAGTCTGAAT CGCTGTCTGA 1800

GTCCGAATCG CTATCTGAAT CTGAGTCGCT GTCTGAGTCT GAATCGCTAT CTGAATCTGA 1860

GTCGCTATCT GAGTCTGAAT CGCTGTCCGA ATCTGAGTCG CTATCTGAAT CTGAGTCGCT 1920

GTCTGAATCT GAATCACTGT CTGAGTCTGA GTCGCTGTCT GAGTCTGAGT CGCTGTCTGA 1980

GTCACTATCT GAGTCTGAAT CGCTGTCTGA TGTATCTTCT TCGAAGTATC CGTTATCAAG 2040

TGTGAAATCA TCATGATCCG TAATTGTTAC GTCAACTTCG CCACCATCTG CATCTTTATC 2100 

ATCTTCAGTT GTATTTGTAA CTGTTTGTGT TAAGCCAGCA GGCTTTTCAA AAATAACTTT 2160 

GTATTTACCG CTATCTAAAT TATCAAAGCA GTATTTACCA TTTTCATCTG TTTTAGTTGT 2220 

TCCAATTACT TCGCCTTTTT CATTTAATAA AGTAACTTTA ACATCTTTGA TACCTTTTTC 2280 

AGTTGAATCT TGTTTGCCGT CTTTATTACT GTCGTACCAA ACATAATCAC CTAAACTATA 2340 

TTTTGGTGTT TTATAGAAAC CACTGTCTAA TGTCATGTTA TCTGCATCTT TAATGACACC 2400 

TGTTGTTGTT AAACCATTAG AATCTTTTTC AGTATCATTT CCAGAAGTTA CTGAAGTTGG 2460

TTGATATTTA CCATTTTCAT CTGTTGTAAC TGTTTTTAAA ACTTTGTCGT TTTCATCTTT 2580

TAACGTAACT GTTACACCTG AAATGCCCTT TTCATCTTTA TCTTGAACAC CGTTTTTATT 2640 

TGTATCTTCC CATACATAGT CACCTAAGTT GTAAGTCGGT TTGTAGAAAC CAGAGTCAAT 2700 

AGTATCGTTA TCTTTATCTT TAATGACACC TGTTGTTGAT GTACCATTTG AATCTATACC 2760 

TTCATCAGTT CCTGAACCTA CTTGTGTTGG TGTGTAACCT GATGGTGTTT CGAATTCAAC 2820

TTTATAAGTT CCATTTTCTA ATCCAGTAAA TTGATATTTA CCATCTTTAT CTGTTTTAGT 2880 

TGTTTGTAAA ACTTCACCGT TTTCATTTTT CAATGTAACT GTTACGCCTG AAATACCTTT 2940 

TTCAGTTGAA TCCTGCTTAC CATCTTTATT TGTATCTTCC CATACATAAT TACCTAAATT 3000

ATATTTTGGT GTTTTGTAGA ATCCACTATC TAATGTCATG TTATCAGCAC CATTAATAAC 3060 

ACCTGTTGTT GTTAAACCAT TAGAGTCTTT TTCAATGTCG CTACCAGATG TTACTGTAGT 3120 

CGGTGTATAG CCTTCTGGTG TAGTAAATTC AACTTTATAA TTACCATTAT CTAAATCAGT 3180

AAATTTATAT TTGCCATCAG CGTCTGTTGT AACTGTTTTT AaCAgTTACC GTTTTCATCT 3240

TTTAATGTTA CCGTTACGCC AGATATACCT TTTTCATCTT GGTCTTGGAT ACCATTTTTA 3300

TTTGTATCTT CCCAGACATA GTCACCTAAG TTGTATTTAG GTTTGTAAAT ACCTAAGTCT 3360 

GCAGATAAGT TATCTTTGCC ATTAACTGTA ATAACTGAAG ATAAGCCGTT TGAATCTAAT 3420 

TCTTCGTTAT TACCTTGTTT TGAAGGGGTT ACTTCATAAC CTTTTGGTAA GTTTGAAAAT 3480 

TCTACACGGT AATCTCCATT AGGTAAGTTT GGAATCAAGT ATGACCCATC TTCTTTAGTA 3540

ACTGCTTCTC CTACTTTTGT ATTTGTATTA TTATCAAATA CAGTTACAGT TACATTGCCA 3600

ACGCCTTTTT CTCCTAATTC TTGAACACCG TTTTTATTAG TATCTTCCCA TACGTAGTTA 3660

CCAATTTTAT ATACTTCTTG ACCAGctCCG CCACTTTGGT TATTAGTAAA TCCTAAAGCA 3720 

TTGCCAGTAG AAACGGATTT ATTACCTGTT GAAGATAAAG TAGCCATTTG AACAAGTGTT 3780 

GGGCTTTCGC TATTTGTATA TTGGAATTTT GTATTAACCA TTACAACATA AGCAGAATCT 3840

GCATTTCCAA AATCAATAAC AGCGCTATTG TtGTCGCCAT ATGTAATTTT CTGCAAGTAT 3900

TGATTTGTTA CATCTGTAAG CTCTTTAGTA TTCACATCGT ATCCTTTATT TAATGTATAA 3960

CCTTTAGGAA CTTGATATAT TTTTATATCT GTTACATCTT TATTT 4005

(2) INFORMATION FOR SEQ ID NO: 632: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 632:

TATTAGGTTA CTCTAGTTTC CAAAGCGGGA ATTTTAATGT TATTAACAGC AAGGACAGCA 60 

AAAGCAATAT CGGCGCATTG ATTGAAAATC CAGGAATATA TCCTTTTATG TCTGGATATG 120 

AAAACTTGAA GTTATTGAAT GAATCAAAAA ACACTCAAGA TATCGATAAA ATTGTCTCAC 180 

AACTTCATAT GGATGAATAC ATTCATAAAA AAGCTAAAAC GTATTCTCTT GGTATGAAAC 240 
AAAAATTAGG AATTGCTATA GCATTTTTAA ATAAACCTCA ATTCATTATC TTAGATGAAC 300

CAATGAATGG CTTAGATCCA AAAGCTGTGC GAGATGTACG TGAATTGATT GTCCAAAAAG 360

CGCAAGAAGG TGTTACTTTC TTAATTTCGA GTCATATTTT AAGTGAATTA GTTAAAATCA 420

CAAACTCTAT CCTTATTATT AACAAAGGTA AAATTGTTAC AGAAACATCG GAAGAAGAAC 480

TTAAACAATT TAAAGATAAT GATTTAGAAA ATGTATTACT AGAAATCATA GAAAGGGAGG 540

ACCAAGCATA AAATGGGAAC TTTAATTAAA CAAGAATGTT TCAAATTATT TAAAAAGAAA 600

TCAACTTTTA TCGCACCTAT TGTCTTTATT CTACTAATGG TTGCTCAAGG TTATATTGCT 660 

ACAAAATACA ATGAAATTTT TACGCCACAG GAATCTTTCA CATCTGCTTA TAATGGTTTT 720 

TCATGGTTTG CATTTTTATT AATTATTCAA GCAAGTACAA TCATTTCAAT GGAATTTCAT 780 

TACGGTACGA TTAAAAATTT ACTCTATCGT GAATATTCAA GAACAACTAT GATTGTTAGC 840

AAAATCATCA CATTATTTAT TATTTCTTTA ATTTATTTTG TTATTACAAT TATTGCTTCA 900 

ATTGTTATTG GGTCTTTATT CTTTAATGAT TTAAATATAT TTGAAAGTAG CGGTAATCAA 960 

TTATCTTTAT TGAATCAATT ATTATTAGTT AGTTTAGGCA CATTTGTTGG CGTTTGGTTA 1020

GTTTTAAGCT TAACGTTGCT ATTATCATCT GCAACAAATT CAACGGGAGT AGCCATTGCT 1080

GTAGGTATTG TTTTTTATTT TGCAAGTTCT ATTTTAGCAG TTATTCAAAC GGCACTTTTA 1140

GAAAAAATAG ACTGGCTAAA GTGGAATCCT ATTAATATGA TGAATATTAT GCTTCAAACA 1200

GTTGAAAAAG GCTTTAGTAA GTCGACAAAA TTAGAACTTC ATGAATTGTT TATTGGTAAT 1260

ATTGCTTATA TTTCTATTTT CTTAATACTT GTAGTATTTA TTTTCAAGAA GAAAAATATT 1320 

TAGTAACTTA AAGTATTAAA TGTCTAAATA CACACATATT CCATCGTAAT TCAAAATCAT 1380 

TTTCAAATCC CTTCACCCAA ATAATGGTGC GGGGATTTTT TCATCCAAAT TTTGGAAATT 1440
(2) INFORMATION FOR SEQ ID NO: 633:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1323 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 633:

GCTGACATAA TTGCATCAAA TTTCACATCC CCATAAAAAT CGCTACCACA TAACCTACGA 60

TAATACCTAC AAGAACTGGA ATTAAAGATA GGAATCCTTT AAAAAATCCT TGAACGACTA 120

TTGTTACAAG CAAGGTTATC ATTGCAACAA TTAAGAAACT GATATTGTAA CCTTTCATAT 180

CTCCAGGATT TTCATACATT GCCATATTGA CTGCAGTAGG CGCTAAGCTT AAACCAATTA 240

CCATGATGAC TGGTCCAACA ACAACTGGTG GTAATAATTT CATTAACCAT GCTGTCCCAC 300

TTAATTTGAT TAGAATCCCG ATGATGACGT ACATAACACC ACTCATGaAT AATGCTACAA 360

GCATGTCTCC TAAGCTATGC GTACTTAATC CCGTGATAAT TGGCGTGATA AATGCAAAGC 420

TAGATCCCAA GTATGCTGGT ATTTGCGCCT TCGTTATTAA GATATAAAGT AATGTACCGA 480

TTCCCGAAGC TAGTAACGCT GCTGATATTG GTAGTCCTGT TAAGAATGGT ACTAGTACTG 540

TTGCGCCAAA CATCGCAAAT AAATGTTGTA AGCTTAAAAA TGCCCATTGC GCTGGTTGTG 600

GTTTTTCATT TACATCTAGT ACGGGTTTTA CTGTTCGTTC AAACATTTCA TCATTTTGCA 660

TAATATTCAT TTCCTCCGAT AATAAAAAAA TCTCTTTACA TCAGTATATG TAAAGAGACA 720

AAAAGTGTGA CAAGTTGCTA CAAGTCATTT TCGTCCATAG AAATTGACTT ATAGTTGTCG 780

AACATGAGGG TATTATTAGA TAAACAAGCA TATGAAAACT TATTTATCAT TCAACTCCCC 840

CACCTTTTTC AGTCTCTCGT ACTGAATTAA AAGGGGtATT ATTTAATTAT AACTGCATTT 900

CTTTGATCcA TTtCTTCyAA ATAGACACTT ACCGTTTCCT CTTTAGAAGT AGGTAwATTT 960

TTACCAACAA AATCTGCTCG AATTGGTAAC TCACGATGTC CTCGATCAAC CAAAGCAGCT 1020

AAACCAATTT TAATAGGTCT AGCATTTAGC AAAATAGCAT CAAGTGAAGC ACGAACCGTT 1080

CGACCAGTAT ACAGCACATC GTCAATAATG ATGACTACTT TATCTGTAAT ATCTGTGTCG 1140

ATGTCTATTG CGTCTTTTGT CGTAAGTGAT GACATGTGcT CTATATCATC TCTAAAGTAT 1200

GTAATATCAA TTGTTCCAGT AGGTATACGT TGTTGCTCAA TTTGATGAAT TTtATCTTGT 1260

ATACGATTCG CTAAATATTC ACCTCTTGTn TTGATACCTA AAAGATTAAA TTATCAGTAC 1320

CTT 1323



(2) INFORMATION FOR SEQ ID NO: 634:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 761 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 634:

ACCCATCTCA TCGTATTTTG AATTTAATAG ACGAGATCGA TGTATATCTG AATTCATCCC 60

AACTATGGAT TAATGTTGGT ACATCATtAA cGCATAACCA ACATTTTGAG CAGTTGTTTT 120 

ATAAGTAACG TGATTTTTAT CTAATTGCCC TCTTAATGCG TCCTCTGTAA ATTCAACACT 180 

ATCAGAACCA TTAGAGGTCG CTTCATATAA GTTATTAGAT GCAATATGTG CTAAATCGCT 240 

ATTGATTTTC AATGGTTTTA ATCCTTTTAA TTTTCTCATT TCATTCGTTA CTTCATAAAG 300

AGAAATTAAT TGATTTGGAT TTTGCTCAAC TGGACGCTTA TTATGCTCTT CTGACGTAGA 360 

ATTAGAATTT AATTGATAAG GTTCAATATC TGCTAACATT TCTTTTGTTA AAAATCGTAC 420 

ACTTAGCACC TTTTTCGATT GTTGATCAGA ATACACTTGT GCATATATGT CGCCATATTT 480

AATCaGTGTT TGTGTTTTTA AATCTTCATC TGAAAGTTCA AATTCATATT TTTTACCATC 540 

AACTTTAAAG GACGGTTCTG GATTAATACT TGTATGATTA AAAATTTCTG CAGAATGTTG 600 

TCCTATTTTT AACGGACTAA CATTGACTTT CTCACCTGTA GCATACACTG AAACGATTTC 660

TTCACGTTTA GTTGAAACAA TGTAATAACT GTTTTTGTCT TTAAACACAT AATTTTTGTA 720 

rCCATCTCTA AAAGGGTAGA CrCGATCTGC TTGTCCAAAT T 761 
(2) INFORMATION FOR SEQ ID NO: 635:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 827 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 635:

nTAAATATAT TTATATATTA TAGAATAGAA AGACCTGAAG ATTGAATATC TTTCGCAAAG 60 

CCTTTAACTG TATCTACTGA TAATTCGTTA ATATCGCGAC CTAAGTTTGT ATTCACTTTT 120 

TTCACAACAT CTGCTGGGCA TGTAATAATA TCTGCACCAA TTTCATCAGC TTGAATCACA 180

TTGAATAATT CGCGGCAACT TGCCCATAAT AATTTAACGC CGTCTTTACT ATGCGTAACT 240

TTnACAGCCT CTkTCATTAA TGGTaATGGA TCTACGCCTG TAtCTGCAAT ACGTCCTGCA 300

AATACTGAAA CATATGTTGG CACACCTTCA GTTACTGCTT CAGTTATTTC TTTAACTTGT 360

TCAATTGTGT AAACAGCCGT AACGTTTAAT CTCACATTGT CAGCTGAAAG TTTTTTAATT 420

AAAGGAATCG TTGATTCACC TTTTGTATTT ACAATAGGAA TTTTAACAAA TACATTTTCG 480

CCATATTGTT TTAAAATTGC TGCTTCTTTk TCCATAGTTT CTAAATCGTC TGCAAATACT 540

TCAAATGArA TTGAAGCATC TGGAATTTCT TTCACAGCTT CTTCAGCAAA AGCTTTGTAA 600

TTTTTATAAG CTGCTTTCAT TnCTTCAATA TCTGCACCGk CCGCAAATAC TTCTACATTT 720

AGTTTAGCCA TATAAyATAG CCTCCTTGAT TCTTATTAAA ATTTTAACAA CATCTGCATG 780

kCTTTTTCTT ACAACCATTT GTAAAAAATG ATTTTTATTT CTTtGTT 827
(2) INFORMATION FOR SEQ ID NO: 636:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1478 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 636: 

TTAGTCGGTA TAACCATCGG CTAGGTGGTT TTTGTATTAA AAAAGTGGAT aCCAAAATTT 60 

ATTAATAATT ATTTTAATGT TAGAAAAAAA CTAAATAAAA ACTCGCTAAT GATATCCAAT 120

AATATGTATA CAAAACGAGA CATATATTGC ATATGATTAA CGAGATACTG AAAATATTTT 180

ATCACCCCTA AAATGATTAT TCATTTTCAG CGGTAATTCG ACCTAAAGTC AAACTTACAA 240

TAAAACCGAT GATAAATACT ACTAATGAAA CGAACCACAT CACGATATTA GTTGGTAAAC 300

CTGGAAATAC TGCAAAGAGG GAGCCAACAA CAAAACCAAT GATTAATGCA AAAGTCATTA 360 

GTTTATGATG TGTTAGGAAA TACTGGATAA TTTTGCTTGA AATAATGAAT CCAGCAAGCA 420 

CGCCAAATCC GACTGCAAGT AATATAGGAA GACCTGCAAA GTTAAGTTTA ACAACTTCAG 480

ATATTGCTAG CATGACCGTA CCATAGACGC CAAATACTAA TAACATAAAT GACCCTGAAA 540

TACCTGGGAG TAACATAGCA CTAGATGCAC ACATACCTGC AATAAAATAT TTAATAATAA 600 

GACTAGTTGA TAGAGTAAGT GTTTCTCCAG CATGTTTATC ACCATTATTC ATTAATGTAA 660 

TAACAATTAA GATAGCGATA CCAGCTATAA CCATCATGTA ATGTyTAGTT GTAAATGACG 720

TTTTATAGTT AGAAATTTTC AATAAATATG GAACGATACC AATGATTAAT CCACCAAAGA 780

AAAACATAGT TGGAATATGG TGTTGGCTTA ATAAATAATT AAAAAGATTA CTTAGTGATC 840

CCATTGCCAG TAACATTCCA ATTATAATGG GGATTAAAAA TGTAAAACTT GGCCAAAAAC 900

GTCGTGAGAA TATGCCGCTA ATTGAAGCGA TAAATTGATT GTAAATACCT AACAATAATG 960

CGATAGTCCC ACCGCTAACA CCAGGTACCA AGTCACTCGT TCCCATAGCA AAACCTTTTA 1020 

GAATATTAAT CCATTTAAAT TGTTGCATGA ATAACTCCTT TCAAACGATT GGAATAAAAT 1080 

CATAAATAGC ATCATACCAT ATTACAAATG TCCTAGTGAA ATGATAACAT ATTTTAAATT 1140

CATAAAATCC ATTGAGAAAT TATGTGCACT TATTATCATT TATATTTTTA AAGAGAGCGG 1200

AGGTATAAGT AAGTTATAAT TAACTGAACG CATTATTACA AAGTCTTTTT GACTACAAAT 1320

TAAAATTATT ATAAACTAGT TAAGAAAACT TTATATTTTA CGGAGGGAAT ATAAAATGGC 1380

ATCAACATTA GAAATyAAAG ACCTACATGT GTCTATTGAG GATAAAGAAA TCTTAAAAGG 1440 

TGTTAACTTG ACAATTAACA CTGATGAAAT ACATGCGA 1478 



(2) INFORMATION FOR SEQ ID NO: 637:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1995 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 637: 

ATTACAGCAT CTTCTCTAGG TAGATTATTA AAAGATAGAG GTCTAAATGT AACAATTCAA 60

AAATTCGATC CATACTTAAA TGTTGACCCA GGTACAATGA GTCCTTATCA ACATGGTGAA 120

GTATTCGTAA nGGATGATGG TGCAGAAACT GACCTAGACT TAGGACATTA CGAAAGATTT 180

ATTGATATTA ATTTAAACAA GTTTTCAAAT GTGACAGCCG GTAAAGTGTA TTCACACGTA 240

1TGAAAAAAG AACGTCGTGG TGATTACTTA GGCGGAACAG TTCAAGTTAT TCCGCATATT 300

ACAAATGAAA TTAAAGAACG TTTATTACTT GCAGGGGAAA GTACGAATGC AGACGTTGTT 360

ATCACTGAAA TTGGCGGTAC AACAGGTGAT ATTGAGTCAT TACCGTTTAT TGAAGCGATT 420

CGTCAAATTC GTAGCGATTT AGGTAGAGAA AATGTTATGT ATGTTCACTG TACATTACTG 480

CCTTATATTA AAGCTGCTGG AGAAATGAAA ACGAAGCCAA CACAACATAG TGTTAAAGAA 540

TTACGAGGCT TAGGTATTCA ACCAGACTTA ATCGTTGTAA GAACTGAATA TGAAATGACA 600

CAAGATTTAA AAGATAAAAT TGCATTATTC TGTGACATTA ATAAAGAAAG TGTTATTGAA 660

TGTCGTGATG CAGACTCTTT ATACGAAATT CCATTACAAT TAAGCCAACA AAATATGGAT 720

GATATCGTTA TTAAACGTTT ACAATTAAAC GCGAAATATG AAACACAGCT TGATGAATGG 780 

AAACAGTTGT TAGATATCGT TAATAATTTA GATGGTAAAA TTACAATTGG TTTAGTAGGT 840 

AAATATGTTA GCTTACAAGA TGCATATTTA TCAGTTGTTG AATCATTGAA ACATGCTGGA 900

TATCCTTTTG CCAAAGATAT TGACATTAGA TGGATTGATT CAAGTGAAGT AACAGATGAA 960

AATGCAGCCG AATACCTTGC AGATGTCGAC GGTATTTTAG TACCAGGTGG ATTTGGTTTC 1020

CGTGCAAGTG AAGGTAAAAT TAGTGCAATT AAGTATGCTA GAGAAAACAA TGTACCATTC 1080

TTTGGTATTT GTTTAGGAAT GCAACTTGCA ACAGTTGAAT TTTCAAGAAA CGTATTAGGC 1140

TTACCAGAAC AAAAAGATAT CGAAGATTTA GGTGGTACAT TACGCTTAGG CTTATATTCA 1260

TGTTCAATTA AAGAAGGCAC ATTGGCACAA GATGTTTATG GTAAAGCGGA AATTGAAGAA 1320 

AGACATCGTC ATCGTTATGA ATTTAATAAT GACTATAGAG AACAATTAGA AGCAAATGGT 1380 

ATGGTGATTT CTGGTACAAG tCCAGATGGA CGTTTAGTAG AAATGGTAGA GATTCCGACA 1440

AATGtTTCTT TATTGCTTGT CAATTCCACC CAGAATTCTT ATCTAGACCA AATCGTCCGC 1500 

ACCCGATTTT TAAATCATTT ATTGAAGCTT CATTAAAATA TCAACAAAAT AAATAAATTT 1560 

GCTAATAAAA CCGGTACTTT CATTGTTAAA CATTGAAAGT ACCGGTTTnT CGTATAATTT 1620 

TAATATTATG TTAGTGACAA GGTATGAAAT AACAATAGTG ACTTTTATAA TTCTAAGTCT 1680 

CTTGTCATTT CAATCATTTG TGTATAAATG TCATAGTATA CATAATTCAA TGCCATCGCA 1740 

TGTGGTyGGA CAATCTTATC GTAATCTTCA GTGTAGACTA TAGGTcTTGG TGTAGATAAA 1800 

TCGATAAAAT GTACGAGATG ATCAGGGAAA TCATCTGTTT TAGGTTTGTT GCTTATTAAG 1860

ACCACATCGA TATCTAAGTC GATAAGTTTT TGAATATCTA ATGCAACTTG ATyATTATAA 1920 

AATGGTGCGA ATAATAATAC ACGATCAGTT GAGTCAATTT CTTTAAwkTC TTTAATAGCG 1980 

TaAGTTTnCG GCTAG 1995

(2) INFORMATION FOR SEQ ID NO: 638:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1107 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 638: 

ATTAGTGATG AAAGTCAGAT TGAAGCTTTA TTAACAGCTG AAAAATATTC AGAAATGATT 60 

GGTGAATAAT CACCGTGTAA CTCCTTAATC TAAGATTGAG GAGTTTATTT TTAGTCTGAG 120 

AAAATAAATG ATATGAAAGA AAAATTATTA GGTACTATTA TTTGGAGTAT TGCTACATTT 180 

TATTATTCAA GAATGATGGA AATAATGAAT TTAGCTATTT TAAAAATAAA AATTGGGGGA 240 

AGTTAATATG CTAAACATTC AAGACGTTAn CATnCTTTCT AAAAAGGAGC AAAAAGCATA 300 

TAACCGTTTC GTAGAATCTG TAGAAAACGG TAATTTACCA GTACTACCAT GTATTGAAAT 360 

GGATCTAAAA GAGATGCAAG AAGAAACATT AAACCAGAGT AAGATTGGTG GAATGCCATT 420

TTTAAAATCT TTTAAAGATA TACCATTAGA TGAAAATAAT GTACCAATGG TATTGCTAGC 480

ACAGATTAAT TTGGATGATC TTCCAGAACA ACAAGAATTA TTTCCTGTAA AAGAAGGGAT 540

AAACAATATA AACTCAAGGC TTGTTTATAT AAAAGAGCCA ATTACAGATT TATCACTCGA
AAATATTCAA GCGCATTTGA AGTCATTAGA TGCTGATAAT GAGGATATCC CGTTCAGTGG
AGCATTTTCT ATAGAATTTA GATTGTCGAA ACAAACTATT ACATGTACTG ATTATAAGTA
CGATGAGGAC GTGCTTGCAT TGTGGAATAA AGTCAATCCA TCCTTCGCGC TAAAATCAAT
GTTTGGTGGT TATGATGAAT TGATGGAACC TGTGTGTAmC AywTTTACTG CTAAGgAACC
ATTTAATCAA CTTGGTGGTT ATCCATATTT TGACCAAATA GATCCAAGAA CGAACGATcA
AGAACTGAAA ATGTATGATA GAGTCTTACT GCAAATTGaT TCTACAAGAG ATGGTAATTC
TTCGATTATa TGGGGTGaTT TAGGTATTGc CAATATCtTA GTGaAATCTA CTGrACCTTG
aGGcTAtGaa GTTTTGAATG ATTACCT
(2) INFORMATION FOR SEQ ID NO: 639:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 904 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 639: 



ATTCATATTA 


TTATAAATTA 


TTTCTACACC 


ATCCCAATTG 


AGTTGTTTTT 


CATAATTTAA 


ATGTAATTCC 


ACTAACTCCC 


TACCAATTTC 


AACAAATCCA 


TATACATCCT 


TTAATATCGG 


TATTCGCGGA 


AAACCTTTAC 


TCAAATCACT 


TGAATATTTG 


TTCACATAAT ATTTATGATG 


CAAAATTGCA 


TATATATAAT 


ATACTATCTC 


TTCTGAATTA 


AGATTTATTT 


TCTTTTTAAA 


AGAATTAGgA 


AATATTATCT 


ACArGCCTCA 


AACTATCTTT 


ACCTTTGtAT 


GTAGCAAAGC 


CTTkGCCATT 


ACCAATAAAt 


TGGAAATTAG 


GTAATATGTC 


CGTGATCATA 


GCCGAGAATT 


CTTTATTCAT 


TCCCTGTCCT 


TGTATATAAA 


TCACCTGTCC 


AGTATTCTCC 


ATTATATTAT 


AATATCTACT 


TGGCATTTCC 


ATAATATTTT 


TGTCGTACAC 


TATCCATTTT 


TTTGTAAATG 


GTCTATGCAT 


AAATTTAACA 


ATTCTCTCTG 


GATTAATTGA AATATTTTTT 


CCTTTAGAAA 


ATTTTTGGGT 


AAGTCCTCGT 


GTCCAACTaA 


TaAATGTTTC 


ATCTTTGTTC 


ACTAAATTTA 


TACGTTCTCT 


TGAATCTAAG 


ATATCAATTA 


ATCTATCTAT 


TTCAGAATTA 


TAGTTATCTA 


CAAGTAATTT 


TGcATTTACT 


AATGCTTTTT 


CATTCGAAAA 


ATTTGTTACC 


CAATTATCTC 


TTGCTGaATT 


TACTCCATTA 


AATkGAkCTA 


AATATATAGA ATTTTCAATA 


TCCTTGGAAT 


CATACATTGG 


TAAATAATTC 


CCCATAGTTT 


ATGTCTCGGT 


GATTAATCCA 


ATCATTGGGG 
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AAAG 

(2) INFORMATION FOR SEQ ID NO: 640: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 640: 
CGATGTCTTT ACTATTAGAC TTAGCCATTG GTTTCACCTC TCCAAAAATT GTAAATGTGT 60

ATCATCAATA TGAAAGTTAC ATAAAACTGA CATATTTCTT TAAAATATCA ACGCCATTGA 120

TAACTTCCTG TTTTAATTGA TACGCTGTAA CAAAATACTA TAGTTAGTGC TTACATGTAT 180

ATGTTAAAGC AAGCAGTGGT AAATGTAAAT TATAATTATT CATTAACTTT GCAATATATT 240

AAATCTTTTA TTCATAGAAG ATAAATATCA AATCAATCAT AATTATTTGA CAACAAATAG 300

CTAACGATTG TTTTAATCTA CATTTGGCTT ATAGCATTTT AAACCTATAC TCTATTTTGA 360

TACAATATAA GTGTAAAATC AATCATAAAA AGGATATTCA ATATCTGCAT CCAAGAAAAA 420

CATTACAATT ACCTTT 436

(2) INFORMATION FOR SEQ ID NO: 641: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 442 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 641:

GTTATTAAAT TCAGAGTGGT AGCAAATTAA AGTTAnTCAA GAGTTAAGAT GAATTTAATT 60

CATGAACACG TCTATTATTT TTATAATTGT AGCAAATAAA GCTTTACATC AAGGAGGTAA 120

TTAAATATGT TCAAAAAATA TGACTCAAAA AATTCAATCG TATTAAAATC TATTCTATCG 180

CTAGGTATCA TCTATGGGGG AACATTTGGA ATATATCCAA AAGCAGACGC GTCAACACAA 240

AATTCCTCAA GTGTACAAGA TAAACAATTA CAAAAAGTTG AAGAAGTACC AAATAATTCA 300

GAAAAAGCTT TGGTTAAAAA ACTTTACGAT AGATACAGCA AGGATACAAT AAATGGAAAA 360

TCTAATAAAT CTAGGAATTG GGTTTATTCA GAGAGACCTT TAAATGAAAA CCAAGTTCGT 420

ATACATTTAG AAGGAACATA CA 442

(2) INFORMATION FOR SEQ ID NO: 642:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 2472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 642:

CCAATTTTGG TATGAATTAT ACAGATAATT CnGCGCCCGG AGGATCATTT GCTTATTTAA 60

ATCAATTCGG TGTGGATAAA TGGATGAATG AAGGGTATAT GGCATAAGGA GAACATTTTA 120

ACTACTGCCA ATAACGGAAG ATATATTTAT CAAGCTGGAA CTTCATTAGC CACACCTAAA 180

GTTTCGGGAG CACTAGCTTT AATCATTGAT AAATATCATC TTGAAAAACA TCCAGATAAA 240

GCGATTGAAT TGTTATATCA GCATGGGACA TCTAAGAATA ATAAACCATT TAGTAGATAT 300

GGGCATGGTG AGCTTGATGT GTATAAAGCA TTAAATGTAG CAAATCAAAA AGCAAGTTAA 360

TAAATCAAAG GAGTTTTTGA TTATGGCAAA ATTAGTTACT GAAAACATTT CGAAGCGGTT 420

TAAAAATCAA GATGTATTAA AGCATATTAA TATCACTTTA GAAAATAACG AAGTTTATGG 480

ATTACTTGGT ATTAATGGAG CCGGTAAAAC GACACTTATG AAAATTATAT GTGGCATACT 540

TCAACAAGAT TCAGGGGAAA TTAAATTAGA TAATAGACCA ATGACACGAA ATGATTTGCA 600

CAAAGTTGGT TCGCTTATTG AAACACCTGC GACATATAAT CATTTAAGTG CACAAGATAA 660

TTTGAAAATT GTGTGTTTAA ATGAAAGCGT TGATTTCaGC GAAATTAATA GTGTTTTAAG 720

CTTAGTCAAT TTAAATGTCG ATAAAAAGAA AAAGGTTAAG GACTTTTCTT TAGGTATGAA 780

ACAAAGACTT GGAATTGCAA TGGCaTTAAT TAAAAaGCCA GAAATTTTAG TATTAGACGA 840

ACCATCTAAT GGTTTAGACC CATATGGAAT CCAAGAACTT AGAGAACTTC TAAAATTATT 900

AACAGAACAA GGTACTAGTA TTATTATTTC AAGTCACATT TTATCTGAAA TCCAAGTTTT 960

AGCAGATCAT ATCGGTATTA TTCATGAGGG TGAGCTAAAA TATCAGCAAA GAAATAACAA 1020

AGATGAAAAC TTAGAAGAGA TATTCTTCAA AATAACGAAA GGTGATTACA AATGATACAT 1080

TTAAAGATaG AAGGTATCAA ATTTAAAAAt TCTTTCAGTA TGTATGTTTT ATTAATAAGT 1140

CcGcTGGTAT TTCTTTGTTT TGCTATTTTC ACAGTCTTAT TCGCCAAAAG TAATACGGGA 1200

ACAGCGAATA GTGTGTCACC ATATATAACT TTACTATTTA ATATTTGGCC AATTGCTTTC 1260

ATCCCGATTG TATTATGTAT GGCTTGTAAT TCGTTATTTA AAATTGAAAT GAGAAATAAA 1320

TCATTTAATT ATTACTTAAG TAATAATTGG TCGATTACAA AAGAAATAAG AGCAAAGATT 1380

TTCATTTTAT CAATAGCATT TTTGGTACAT TGCTTTTTAG TATTTATTAT TGCTTATATA 1440

TTGATGTATG TAGTATCTCT ACCATTGATA CCGCTCAACT TTTTATTAAC TCGATACTTT 1560

GGTGTGTTCG TATCAATATT AATAAACTTA GTATTATCAG TCATTTGTGT CTTGTTTTTA 1620

ACATTGAAGA GTTTATTTTG GGTGTTGCCG TGGGGGATAA TGCAGAGAAT CCCGCTTATT 1680

ACGCTTGGTA TACTACCTAA TGGtTTAGTT GTAAACCATA ATTCAAAATA CTTTAATGAT 1740

CTCAATGCCT TATATATTTC GATTATTGTT AGCATCATTA TTTTCGCGAT AGTAACATTT 1800

TTAAATAATA AGAAAAGTTG GCGATTAAAA TGATAATTAA CGAATTAAAG TCATGTAAGT 1860

TGAAATTTTC TAAGCAAGCG CTCACATTTG TACCCATTAT TGTAACCATA TTGTTTATAT 1920

TATTTATAAA TTGGTATTTA AACGTAAATT TATGGAATGG TCGACAAATm AGTTTGTTTA 1980

CAGCGAGTTT TAATGCAATT ACATCGCTAT TAATTTCTAT AAACGTCTAT CAAGTTATCA 2040

ATTTTGAAGA AAATATTGGT CACTTTAATC ATATTTTAGG AAAAGCTAAT AGGCTAAATT 2100

GGTTAAATGC ATCAATGATT TTTACTTATA CTATTACAGC CATATGTATT CTATTAGCAT 2160

CAATTAATTT ATTGTGGCAT TCACATGATA TGAAAATAAC ACTTATGTTT ATAGGcGTaT 2220

CATTGTTTTT CAATGTaATT ATATTACTGC TACTTTTTAT TTTTAGTATT TTCATTAAAG 2280

ATGTAATGGC TATTGTTGTC GGAGTTTTAA TGTTTATTTT TAACGTTTAT TTTGGATTAG 2340

AAGTGCTTGG AGATCATTCG TGGTTCTATT TACCAATCAC ATATGCTACA CGTTACGTTT 2400

ATATGTTTAn CGAAGGGGAG TATACCAGTT ACATTAACAT TGGGCAATCT ATATriATTAT 2460

CACGnTGCCG AT 2472
(2) INFORMATION FOR SEQ ID NO: 643: 
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 646 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 643: 
ATAATATAGG AATTATTTCG ATnACAGATT TTACGAATCG TGCTACGATG CAAAATGAAn 60

ATAAAGATCC ATATGGCGAA AAGTTAGCTT ATGGAATTGC TTTTAATGGC AGTGTGGATA 120

TGCAAGGGGA TAAACAAGTC ACAATTCCAA AATATAGTGT AGTTACAATT ACTGGCGAAA 180

ATAGTAAAAA TTATCGTGTT ACCGCCGATA ATAAGACTTA CTATGTTAGT AAAGATAAAT 240

TAGAATATTT TAACCCGGCA GGTTTATATC AAACGCATAG TTTTAAAAAA TTAGCACCAT 300

ATATGAAATC AAATTATAGT AATTACTATG CATACTTTAA TAGTCAATTA CATAAAAAGC 360

CACAACAACC GATACAATTA CTTTTCAATG ATAATAATCA GTTATACGGT TTTGTTTATC 480

CAATTGTAGA TAAAAAAGAA TTAAAAGATA AGTTTAATAT TAACAATAAC ATTTGGATTA 540

CTAAAGTTGG GAATGGATAT TGTATTGCCA ATTTGAAAGA AGACAAATGG ATTTATATTG 600

AATTGTAGGT GTAAAGATGC TAGATAATAT TATTTTATAT TTTAAA 646
(2) INFORMATION FOR SEQ ID NO: 644: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 644:

TAATATCGGA ATTTGATAAT GAAGATATCT AATTTTTTAA TATTCGTAGC TTTTATTTTT 60

CTACTTATTT TATGTTTATT TTTAATCTTA CAAATGACAA ACCATTAAAA GTAGCATCCC 120 

AACATCAAAC AAAAAAACAA TTCATCAAAT AAAAATCGCT ACAAAACCAA GTCATTAAAC 180 

ACGCAATAAT TAAAATTTTC CACTCATTAT AATTCTGAAT TCCAAATGTC GAATTCCGAA 240 

AACCAAACTC CAAATTCCAA AAACGCAACT CCAAAATTAA AAGCATTTCC CTACCATTCG 300 

GGAAATGCTT TTTACATACT GgATTACTCT GTCATTAATG ATTTTACAAC GGgAAACCAT 360 

GTCGtCATGT ATGACCaAAG TAGCGTCGCT AtCaTaAgGt GGTTCGGATC TTTATTGGAT 420 

AATnAT 426 
(2) INFORMATION FOR SEQ ID NO: 645:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3241 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 645:

nTTATTTACC CACAACATGT TGCGACATTA GGTAAATGGG TACCTTATTT ACTTGGTATT 60 

GTTATGTTAG GTATGGGATT AACAATTACA CCTAATGATT TCAAAATGGT CTTTAAAGCA 120 

CCTAGAGCAG TAATTATTGG TGTCTGTCTA CAATTCAGTA TTATGCCCAC ATTAGCATTT 180 

ATAATTGCAA AGTCTTTTCA TTTACCACCT GATATTGCTG TTGGCGTAAT ATTAGTTGGA 240

GTGGGACATC AAGTAATGTA ATGAGTTATT TAGCCAAAGC TAACGTAGCA 300

ATATATCTAT TTGCAAATGA ATGGTTGGAA GTATCTTTCG TGAGTATGTT GTGGTCAGTT 420

GTTCAAGTTG TATTAATTCC AATTGCTTTA GGTATTGTTT TGCAAATTAT TAATCGTAAA 480

ATTGCTGAAA AAGCTTCTAC AGCTTTGCCA ATTATATCAG TTGTTGCTAT TTCATTAATT 540

TTAGCAATAG TTGTAGGTGG CAGTAAGCAC CAAATCTTAA CTACAGGATT ATTAATATTT 600

TTAGTAGTTA TTTTACATAA CGTATTAGGG TATACGATTG GATATTGGTT AGCTCGTCTT 660

TTAAAATTAG ATCGACAAGA TCAAAAAGCA GTCAGTATTG AAGTTGGAAT GCAGAACTCT 720

GGTTTAGCTG TGTCATTAGC aGCATTGCAT TTTAATCCAA TTGCAGCAGT ACCAGGCGCA 780

GTGTTTAGTT TCATTCATAA TATAACAGGG CCTATTTTAG CAAAGTATTG GTCAAAAAAG 840

TTATAATTGC ACTAATAGAA TGAAGTGGTC ATCGGACTAT GTTAAGCTTT GATAAAGAGA 900

AAAAATAGAG GAGTAAATAT ATGTATAGAG CAGTTATATT TGATTTCGAT GGAACAATAA 960

TAGATACGGA ACAACATTTA TTTAATGTTA TTAATAAACA TTTAGAGATG CATAATGCCG 1020

ATCCTATAAG CATTGATTTT TATCGTTCTT CTATTGGAGG AGCAGCTACA GATTTGCATG 1080

ACCATTTAAT TAAAGCGATT GGTTCGGAAA ATAAAGATAA ACTTTATGAA GAACATCATC 1140

TTACTAGTAC AACATTACCG ATGATTGATA CGATTAnATC ATTGATGGCA TTTTTAAAGC 1200

AACGTCACAT TCCTATGGCA ATTGCCACAA GTAGTGTGAA AGCGGAAATA ATGCCCACCT 1260

TTAAAGCATT AGGTCTAGAC GATTATATAG AGGTAGTTGT TGGTAGAGAm GATGTTGAAC 1320

AAGTTAAACC TGACCCTGAA TTATATTTAT CTGCAGTACA ACAATTAAAT TATATGCCGA 1380

CACAATGTTT GGCTATTGAA GATTCTGTAA ATGGTGCAAC AGCCGCGATT GCAGCTGGAT 1440

TAGATGTTAT TGTTAATACG AATAAAATGA CAAGCGCACA GGACTTTTCT AATGTAGATT 1500

ATGTAGCAAA AGATATTGAT TACGATCAAA TTGTAGCGCG TTTCTTTACG AAATAGGAGG 1560

CGTATCATGA TGGGTTACAT TATATTGTTT TTTCTAGCTG GTCCAGTAAT TTTAGGCGTT 1620

GGAAATTTGG TGATTGGTCC TATATTTAAC AAACAGACAC CATTTCX3CGT GCAAGTAAGA 1680

TCTTTTGTTG kTGGkTCmAT GrTTTACTTA ATACTCGCAA CAATTGGCTA TTTTTTACTA 1740

TTACAAGGTA AACTTTAACG AGAAAACCAC CTTACCTCAT TAAATGGACG ACCATATGTA 1800

TGTGAAATGG TAGAACGTTC ATGTTTATGT ATGAGATAGG GTGGTTTAAA TAGTTACATA 1860

TATTTTAATA ATAACGTCAC GATGATAAGT ACAATTAAGA TAATATCTAT GCCTACCATA 1920

ATTGTAGCTC TTGTTGCA1T ACTTCCTTGT TCTTTTGCTG ATTTCATAGC ACGGTAGTTT 1980

GGCACAAAGC TAATAATTAG TAAGATTAAT ACAATTACAC CAATTAATGC TGTTGTCATG 2040

ATGAACGACC TCCTTTATTT TTTTCAATCA ATTCCCAAAT AAACGTAGCA ATCACACCGA 2100

CAATAATTAA TGCAATCGGT AAAGTCGTAC CGAGTTTAAT CTTGCGCTCT GGAGAATTAA 2220

TAATAGTAAA TACTGTAAGA CAAATGAGTA TGAAAGCAAG TGTTGCAATA ATAGTTCTTC 2280

CAACTAAATA TAGGATGTCA GGTTTTTCCA TACCGATATA ATTTATGATG AAAAATGCTA 2340

CAGCAAAGAG TACCGATATT TTTGTAGCAC GTAGCAGTAT TTGTTTTAAC ATTGATATAC 2400

TCCTTTTTAA TATTATTAAA ATTATATCAT AATTACCAAG AATAGCTGAA GTTGTATGTG 2460

ACTCAACGGT ACTTGAGCAA CTTTTTTAAT TTTTTAGAAA AATCACAAAA TAATTGTTTG 2520

CAAAGTTGCA AAAGCCTGCT ATAGTAGTTC TGTAAACGAT TGCATGGTAT GCAAATATTA 2580

ATGTACCAAA ATCGATAATT TATAGTATAA TTACGGCAAT AAGTTTTTTT ATGGATTTAT 2640

TTAGTATCAA TCAGAGATGG GGTAAGAAGT TATGGAGAAC AATGAACTAC AAAGGGGATT 2700

GAGTGCCCGT CAAATTCAAA TGATTGCACT TGGTGGTACG ATTGGCGTGG GGCTTTTCAT 2760

GGGTGCGACA AGTACAATTA AATGGACAGG CCCATCAGTT ATCCTTGCAT ATTTAATTGC 2820

GGGTATCTTT TTATTTTTAA TCATGAGAGC AATGGGGGAA ATGATTTATT TAAACCCTAC 2880

AACAGGATCA TTTGCAACAT TTGCAAGTGA TTATATACAT CCTGCAGCAG GTTATATGAC 2940

AGCATGGAGT AATATATTCC AATGGATTGT AGTTGGTATG AGTGAGGTCA TCGCAGTAGG 3000

AGAATATATG AAGTTTTGGT TCCCGGAATT GCCAACTTGG ATTCCTGGTG TTATTGCTAT 3060

TTTATTATTA ATGGCAGCGA AT1TATTCTC GGTAAAAGCG TTTGGAGAAT TTGAATTTTG 3120

GTTTGCTTTA ATTAAAGTTG TAACAATTAT TTTAATGATT ATTGCTGGTT TTGGTCTTAT 3180

nnTCTnTGGT TTTGGAAATG GTGGCCATGC GGTAGGTATT TCTAATCTAT GGACAAATGG 3240

C 3241

(2) INFORMATION FOR SEQ ID NO: 646: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1311 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 646:

AGGCGTCAAC TCAGATGGTT TAATAATTGC CGTATTACCT GCTGCAATAG CACCGATTAA 60 

AGGTtCGAaC ACTAGTtGAA AAGGATAGTT AAATGGTGCA ATGATCAAAA CTGTTCCATA 120 

AGGTTCTTTT TTGATATAGC TTTTTGTTGG AAATAAATAT AAAGGTGTGT CTACATTkTT 180

TGTTTTAGTC CAGTTTTTAA GTTCcTTACG GGCAATTTTG ATACTTyTCA AAGTTATGCC 240

AATATCGCTC TCGTATGATT TAATAGCTTT GCTTAACTTC TTTAATTGCT CTTTTCTAAA 360

ACTAATATCT TTAGTTTGTT GTGTATTGAA AAAAGCTTTA CTGTCATAAA ATTTTTGCTC 420

AATGATATTC ATAATGAAAA GAACCTCCTT ATATGATTAT TTTGGAAAAA GCGATTAATT 480

GATTTGAATG TTGTGGCCGT TAATTTTAAA TGGTCTTTCG AATTATATAT GTTGAAAGTT 540

GAAAATAGAG CGATGAATCG TGTACATAAT AATATTTATA ACTTTAATCA TAACGAAAAA 600

GGTAGGAAGA AAACAAAAAT TTATACTCAA CATCGCAAAT ATTTTAAGAA AATGTAAAGA 660 

CAAAAGGGGA ATTGTATAGA AATCACTAAT CTGTGGGTTA GGGTAGCTAA AGGAATAAAA 720 

ACTACTATTG AAAAAGGGTT GTAAATTAGT CAAACGTAAA TAAAAAACAG TTCATTGAAA 780

GTGAAATAAA TTCTACTTTA ATGAACTGTT AGTTAAATAC AACATGTCTA TAATTAGACA 840 

GTAATATAGT ATTATTTTGT TAATGCTTCA GTGATTTGAG GTACGATTTG TTTTTTTCGA 900 

GAAACGACAC CAGATAAGAA GGCCATGTCA TCTTCTAATT GAAGATTGAA TGtTCGCCAA 960 

CTTTATCTTT TTCAGCACCT ACAACTAAAA TTTTAGAATC ACTATTAATG ATGTCAGTAA 1020 

CAACAAGTAC AAATAAGTCA TATTTTTCTT GTGCACTTAC AGCTAACATT TCTTTTTCTA 1080 

rATCTTCTTT ACGATTTAAC ACTTCGTCAA GGTCAACAGC ATTAACTTGT GCAATACGAG 1140

TCACATAGTC ACCCATAGTA AATGATTTAG CATCCATGTT TAATAAGAAT TCAACTGATT 1200

TATCAGTTGT TGAAGCACCT GCTTTtAACA TATCTAAGCC GTACTTTTGA ATATCAACTT 1260

TAGCAATATC TTTnAATTCT tCAGCTGCTT TAACATCTTG TTGTGTACAT G 1311

(2) INFORMATION FOR SEQ ID NO: 647: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1498 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 647:
CATATACTTT TTCATTTCTT TACGAGATAC TTTACCAGAG GATTTAGACT TCATACGCTG 60

ATCCATATGT GCTTGCGTTT CAGaATGTCC ACAAACACAA CGATATACCG CTTCTTTCCC 120

TTTACCAAAC AACGTTAATT TCTTTTTACA GTTTGGACAT CTTGCATTTG TTTTGCGCTG 180

TACATTCTTT TTCGTCTTAC AAGATGGATC TTGGCACACA AGCATCTGAC CATTTTTAGT 240

TTTAACTTTA ATCATGAATT TACCACACGT TGGGCATTCT GTGGTTGTTA AATTATCGTG 300

TTTATATTTA CGATCACTAT TTTTAATCCC ATTTACAACA TCTTTCGTAA AATCTTTCAT 360

CCATTGTGcA GTTAAAAGTG GCGACGTTAA TTCTTCTGGT GCTAATTCTA ATATTTGTTT 480

ACCTTTTGAC GTTACTTTAA TTTTACCGTC TCTTGATTCA ATGGCATTCA TATTAAATAA 540

TTTATCGATA ATGTCGGCCC TTGTTGCAAC TGTGCCGATA CCACCTGTTT GTTTTAAAGT 600

TTGCGCATAT TTTTTATCCT TCAATTGAAT AAAGTTCTGA GGGTTCTCCA TCGCTTTTAA 660 

TAACGAACCT TCATTAAAAT ATTCTGGAGG TGTTGTTTCA TGTTCTCTAA TATTTGTTTT 720 

TGAAATCTTC ACTTCATCGC CTTCTGAAAA AGGCTGTTGC ATCTCTGTAA TAGATTCACC 780 

TTGTCTAATA GATTTAAAAC CTAAAACAGT TGTTACATTC TCTTTCAAAA CAAATGTGTG 84 0 

15 CCCTGCAACC TCTAAAGTTA CAGTTATCGC GTCATACTCG TGCGGAGGCA TTAAAGCTTC 900 

TAAAAAACGC TCGACAATCA TATCGTATAA CTTTAATTCT CTATTACTTA AGTCTGACAT 960 

GACAGGTCTC ACTTCTGTAG GAATAATTGC ATGGTGATCA GATACTTTTT GATTATTAAA 1020 

TATCGACATT TTTGATGAAA ATGTTTTAGA CATTAATGGG CGTGCTTGGT CTTTATATGT 1080

TGTTGCCATC GTCACCTGAA TACGTTCTTT CATAGTATCT ACCATATCAG TTGTTAAATA 1140 

GTTTGAATCT GTTCTTGGAT AGGTTACGAC TTTATGTCTC TCATATAAGC TTTGAAGTGT 1200 

ATTCAATGTT TCTTTAGGTC CAATTTTATA ACGTCTATAC ATATCTTGTT GTAAATCTGT 1260 

TAAATTGTAC AGTGATTGCG GATACGACTT CTTATGTTTA GTAGCAACAG ATTTAATCTT 1320 

ACCATCGACA TTTTTCAAAT TATTAACCAT CTGTTCTAAA GTTTCTTTAw TGGtATATCG 1380 

CTGATTTGaw TCTAGCTGAA AATCAAACCC TTTTACCGTT AATGATAATG TAAAGTATTG 1440

TTGTGGnTTG AACTGATTAA TCTCTTGTTG GTCGTGTAAT TTACTAAATT GAAACGGn 1498 
(2) INFORMATION FOR SEQ ID NO: 648:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1044 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 648: 


TAAAACAATT CGTTTTATTA ATACAGTTTG TAAAAGTATT CTCTTAGTAC TAATAGCTAA 60 

TGTAATGATA GAAAATACTA GTGTTATTAA AGATTTGAAT AAAATAAAAG AAACTGAGAA 120 

ATATTGGAAT GTATTAGATG ATTATTACAC GATTGAATTT GCACCTTATC ACGAAACAAA 180

ACAAAGTTTG ATTGATAATA TGGTGCGATC AGAACAATTA GTAAAGGCTA GTGAAGCAGA 240 

AAATAATGCG ATTTTATTCA AACCAAAGGG TGACTCCGTT GACAATGACA ACTTTTCGCC 300

TCAACCTGAT ATTCCGATAA AAAATCAAAA AAATAATGTC GAAGTAATTA TTCCACAAAA 420
GTTTCATGCA ATGCGTAATG AAATCAATCA AGCATATCAT TCATGGTTTG AATTTGTACA 480
AAATAAAAAT AATAAAGAGA ATAAGTTATC TATACAGTTT AU. UAACAAAA ATGATTb 1 V-vj 540
AATTTTTTCA TTTGATGCAC GAGATAGTCG CCATTTGTCA TTTATAGAGG CGCCAATCAT 600
TGTGAATGTT CAGGCATCAG ATTTATCGaA TGATTTttAT TATGCCATGa TCaGTCaAGG 660
CGGGTATTTa TTCaAAAATT ATGaCGCGCT AGTAAAAAAT ATTGGAAAAG TATCCATCTT 720
GATGGGGAAA TCCAGTGGAA TAACCAATTA TAAAGATAGC GTGATGGAAA TGTATCATGA 780
AAACAATTTG AAATTAACAG TACTCAACTT TTCACAAATC ATTATCGCAA TCATTTTAAT 840
AATTATTATT TTATTTGATG TGAAATATTA TTTTGAACAG CATCGAAAAT TACTCGTAAT 900
CAAAAAGCTA TATGGTTATT CAACATTAAG AGCCAATTAC CAATACTTAT TAATAAATAA 960
TATAGTTGTT ATTTTTATTG GAATATTGAC GAATGTAATT TTACATTCTC ACTATATAAT 1020
GATGTTATTT GCAACGATTC TTGT 1044



(2) INFORMATION FOR SEQ ID NO: 649: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 649:

GAACATATTG GGTTATGCAA GGnGGTCACT CTTCACACTT ATAAACAACA TTTTAATAAT 60
GTAAAGTTTA ACCAGCTAAC ACTTTTGTTA GCTGGTTTTT ATTTTCCTTC AATTTTTAAA 120 
TGGTTAAGTC CCCTTCTATA TCTTATAAGA CAATCATTAT AATCAATTCA ATTAATACAT 180 

TAACAACCAC AACTAATAAA TATAGTAACT TCAAAATCCA TATTTATGTC TAAAGATAAT 240

CTCAATGTTG TTCACGTCAA TAAAATTATC CCTAGGTTTT TAAAAATTGT ACATGTTTAA 300 
ACAATCAAAA GTGTACATTA TTAAATTATC ATTTCCAGTT AGATTTAGAA AACATTCACA 360 


CCACGCATGG ACCAACGTAT TCGTCTTcAT TcATTTTAG 399 

(2) INFORMATION FOR SEQ ID NO: 650: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 747 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 650:

GTACTAGGTT CTAnGATTCC CTACTGTAGT CATAGATATT TTCCAAACAG AGCCTGACAA 60
CTTTTCATTT TTAATAGTGA TCACTATATT AATCGCTATT GCAATTTATG CATGTCGATT 120
TGTTTGGGTT TATTTCTGGT ACAAAGATTT TTATTTCCCG AAAAATATAC AATCTTATCT 180
AGACGAGGAA CATGATTCAC ATGAAACACC ACCTTCTCGA GTGCGTTACG CATTTATTAT 240
GACCATGTGT GGTATTCACG GTACAATTTC ACTTTCAATG GCACTTACAT TACCATTTAT 300
CATTACAAAA GGACAAGCAT TCGAATACCG TAATGATTTA TTGTTTATTG CATCTTTCAT 360
GGTATTAATT AGTTTAATCT TAGCGCAAAT TGTTTTACCT TTAATTACAC CATCTGCCGA 420
AGATACTACT TTTAAAGGTA TGACTTATCA ATCTGCCAAA ATTTTCATTG TTCAAAAAGT 480
GATCCAGCAT TTTAAAAACG AAAGTAAAAA AGACAAAAAC GATACAAATT ATCGCCCAGT 540
ATTAAACCAA TACTATGGAG AATTGTTATT TTTATTAAAT TCAGAACCTG ATAATCAAAA 600
TACrAAAGAa CTCAAACGTT TAGAAGATAT TGCaAAAGTA ATCGAAACAT CTACACTTGA 660
GCGTTTAATT GATAAAGGTA AGGCAACATA TCAGGATATT AATAATTACC GCAATATTGT 720
CGAATTAACA GAGACACACC GTACTGC 747



(2) INFORMATION FOR SEQ ID NO: 651: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1373 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 651:

ATACAATACT CTTTTATTAT TCAATAAGCC ACTTCCTATA GCAAATGTTT AACTTTAAAT 60
ATTTTTCGAT GCTAACAAAA AATCACACTA TCATCTTTTA AAATGAAAGT GTGATTACAA 120
GCAAATCTGT AAAATTTATA AAGCAGAAAC AATTCAACTT TATCATTATG ACATTTCAAT 180
TAAACCTTCT ACATTATAGT TCCAAGCATC TTACACATGA ATGCAAGTAT TTAACGATTT 240
AATTGTGACA TAGCCTGTTG ATATTGTGTT TCATTGATAT AATTTTGTTG CTTCATTTTT 300
TCTAAGTTCG TGCTTACACG TTGCGTAAAA TTCTCTGACA TATTATTGAT ATTATATACG 360
CTAGGTGCAT TGACTTTACT AGCTAAAATA GCGCTTTGTA AAACTGTTAT GTGAGACATT 420
GTTGTACTAT TTTTATTCAC GGTTGTTCCA AAGTAATGGT TTGCTGCGCC CTCAAGCGTA 480
TATTGATTAT CCCCAAAGTA AATATTATTT AAATAAAAGC TTAAAATTTC GTTCTTATTA 540
TCATTATCAT AAAAATAATT TTTGACAACT TGTTGTGTAA TGGTACTACC ACCTTGCACA 660
TCTCTGTCGC TAATCGTTGA AAATAAAGCT CTAGTTGTAC CTTTCAAATC GAATCCATGA 720
TGATTGTAGA ATCGTTCATC TTCCATTGAA ATAAAGGCAC CTTTAACATA CTCTGGCATG 780
TTATCAGCTG ACACAAAACT ACTTTTATTT TCAATTTTTC TTAGTTCATC CACATTATCG 840
CGTGTAGATA AAAAATACAT GATACCAATA AACAATGCGA TAATGATTAG AATGGTTAAT 900
AATATTTTTA AXAGTATTCG TTTACTTTTT TTCTTTTTCG GCGGTTTGCC AACTGGTTGA 960
TAATACGTAT TATAGTGAGG TTCGTGTTTC ATATGCTCAA AATGTTCATT TGAGTTTGAG 1020
TACCTATCGC TTCTTTTCAT GCGTTTGCTC CTTCTTTTAA AACTCACTTA GtATATACCT 1080
TGaGTTTACC AGTACTATCA CAAATAGGCT ACACTTTTTG GGAAAATCAG TCCAAGGGCT 1140
TACAATCGTA TACGCCATCA TACTTACTTT TTTGTTTTTT GAAAAAATTA TAGATAAATC 1200
ATTGCAATTT TAAATATTAA TCATGTCAAA TATTGTTATA TTTTATAAAA ATAAAAGACC 1260
ATCCCTATTA AATGCCAATA GAGACGACCT TTTATTTGTT ATTCATTTAT TAAAACTAAA 1320
ATCCATATTT CATTTCAAAC GAAAATATAT AAATTTTAAC AATCGrTAAC CAC 1373




(2) INFORMATION FOR SEQ ID NO: 652: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 859 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 652:

GGCAGATAAT TTAGTCATTG TTGAATCGCC TGCAAAAGCA AnAACCATTG AAAAGTATTT 60 
AGGTAAGAAA TATAAAGTTA TAGCTTCAAT GGGACACGTC AGAGACTTAC CAAGAAGTCA 120

AATGGGTGTC GACACTGAAG ATAATTACGA ACCAAAATAT ATAACAATAC GCGGAAAAGG 180

TCCTGTTGTA AAAGAATTGA AAAAACATGC AAAAAAAGCG AAAAACGTCT TTCTCGCAAG 240 

TGACCCCGAC CGTGAAGGTG AAGCAATTGC TTGGCATTTA TCAAAAATTT TAGAGCTTGA 300 


AGATTCTAAA GAAAATCGCG TTGTTTTCAA CGAAATAACT AAAGACGCTG TTAAAGAAAG 360 

TTTTAAAAAT CCTAGAGAAA TTGAAATGAA CTTAGTCGAT GCACAACAAG CGCGTCGAAT 420 

ATTAGATAGA TTAGTTGGCT ATAACATCTC GCCAGTTCTT TGGAAAAAAG TAAAAAAAGG 480

GTTGTCAGCG GGTCGAGTTC AATCTGTTaG CmTTCGTTTA GTCATTGACC GTGAAAATGA 540 

nATTCGAAAC TTTAAACCAG AnGAATATTG GACTATTGAA GGAGAATTTA GATACAAAAA 600








aAAAGATGTT GAGAAAATTA CAGCTGCATT AGATGGAGAT CAATTCGAAA TTACAAACGT 720
GACTAAAAAA GAAAAAACGC GTAATCCAGC AAACCCATTT ACAACTTCTA CATTACAACA 780
AGAGGCGGcA CGTAAATTAA AcTTTmAAGC AAGAAAAACA ATGATGGTCG CACAACAATT 840
ATATGAAGGT ATAGATTTG 859


(2) INFORMATION FOR SEQ ID NO: 653:






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 653: 






TTCAACTTCG TTTGGAAATC ATGTTCCTCA ATTGTTGGAC TTAAACGAAT TTCTTTAACA 60
TTGATAATTT TTTGTTTCTT TTTCATTTCT TTTTCTTTTT TCTGTTGTTC GAATTTGAAT 120
TTACCGTAAT CCATAATTCT TGCAACTGGT GGTTTCGCAT TCGGTGCAAC GACCACTAAG 180
TCTAAATCTA CACGTTCAGC CATTTCTAAA GCTTCACGCT TTGATTTAAC ACCAATTTGT 240
TCACCATCTT GACCGATTAA ACGTAATTCT TTTGCACGAA TTTTGTCATT GATTTGAGTT 300
TGATCTTTTG CTATGGTTGA CACCTCCAAA ATTTTTACGA AATTTGCACC AAGCAAAAAG 360
GAAGAGCAGG TATAAAATAC CCGCTCTTCC TTATACACAG TTATGTGTAA TGTGATTAAC 420
CTGCCAACTG CTTTATGCGT CGCTACAGGT GAGAAGCGGG TGCTTCTACT TGGTTCGTTT 480
CGTATTCAAC GTTATTAATC ATATCAACAA TTCACATTTA AGTCAACACT ATAACTGTAA 540
TTATTTTTAT TTTAACCTTT TATTTCATCC ATTGACACGT CTTGACGTAA ATCTACTTGT 600
TCTAATGGAA TTTTTTTCGT TTTATATCGA AGCTTATGAT AAATAAAGAA TGCTAAAAAT 660
ACTGGAATTC CCATATACGT AATTAAGAAG CGACTAAAAT TAAAATCTCC TGTnTTAATA 720
AAGTCAACAT CTTGCCCAAn AAnTACT 747



(2) INFORMATION FOR SEQ ID NO: 654: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 501 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 654:
TTGTGCTTCT TTTTTAGCTT CTTGAACCTC TTGTGCCTCT TGTGATGTAT CACTyAAATT 120

ATTTGCACTT GCTTCTTCTT TTATCGCTGC TTGTTGTGCT TTCAATGCCA CTGCTTTTGr 180 

TTCTTyATTT GATACAGCCA CACTTTTATC CGCTTCTGCT TGTGCTTCTC TTTTAGCTTC 240

TTGAATCTCT TGTGCTTCTT GTGATGTATC ACTTAAATTA TTTGCACTTG CTTCTTCTTT 300 

TATCGCTGCT TGTTGTGCCT TTAATGCCGC TTGCTCATTT TTAGATTTGT TTAAAAATCC 360 
TTCAACACGT TCTTTTGTAT AGGCAACCGT TTCTTCAAGT TGCGTTTTTC TTTCTTCAAA 420

CTTTTGCGAC AGTTCTTGTn CTTTGACTTT nAAATCATCT GCTTTTTGAT AAACTTTATT 480

TTAAAATACC AACCTAAAGC C 501 
(2) INFORMATION FOR SEQ ID NO: 655: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 830 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 655: 

CACCTTGTCA TAATTAATTT TTTGATTTTT CATTTTACTG ATAATAGGTT CAGCATTAAT 60 

CATGATTTAA CCTCCCACAT TTAATCATTA ACTTCTATTA TATATGATTC ATATTAAATG 120 

TCAGTCAAAA AAGTTAGAAA TTCATTTTAA TGCATTTATA TTTCGAAAAT CCCTTATGTA 180

TCTAAAAGCA TTTTTTAAAC TTGAATTTTA AAACACTAAA CAACACATAC GTCTCTGTGT 240 

CATTTTCATT TTTTGTATGT CATATATATG TTTACTTCAT TTAAATCAAT TTCATCTTAT 300

AATTTATCGT GTATTTTACA AAAGATTGAC TTCAATTCAT CGTAAAAGTT ATACTTTTGC 360 

CATTTTTTAA TGTAACATGG TGTTAGTAAT AAAAATAATA CATTGAGGTG TTTTACATGA 420 

CAGCATTATT CCCTTATATC GCTTTTGAAA ATTCAAAAGA AGCCCTTGCA TATTACGAAG 480 

AAGTATTTGG TGCAACTGAC GTTAAACGTT TAGAAGTTGG CGAAGAACAA GCGTCACATT 540

TTGGTATGAC TAAGGAAGAA GCGCAAGAAG CAACTATGCA TGCTGAATTT GAAGTGCTTG 600 

GCGTAAAAGT GTTATGTTCT GATTCTTTTG GTCGCGCTGA CAAAATTaAT AATGGCATAT 660

CATTATTAAT TGATTATGAT GTTAACAATA AGGAAGATGC TGATAAAGTT GAAGCATTCT 720 

ATGAGCAAAT TAAAGATCAT TCTTCAATTG AAATAGAATT ACCGTTTGCT GACCAATTCT 780

GGGGTGGCAA AATGGGCGTC TTTACCGATA AATACGGTGT TCGTTGGATG 830 
(2) INFORMATION FOR SEQ ID NO: 656:
(A) LENGTH: 539 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 656:

GTATCCATGG GCCCGTTnCG CACAACATTT GGnACAATTA ggt a c a rcir\ n A lljU-olUw 60
TCGTTTCGGT ATAATTTTAT CGAATGaAGG CGGTGCGTTA r*A tura &Tf2 a a a pta ppaTa 120
CGAATATTAC ATTGGTGGTA AATTAGGTTC TGGTCAACAA TGGTATTCAT GGATTCATAT 180
CAATGATTTA ATTCAAGCTA TTTTATTTTT AATAAATAAC GAGTCAGCTA GTGGTCCGTT 240
TAATTTAACT GCACCTATAC CTGAACGTCA AAATTTATTT GGCTACACTT TAGCAAGAGC 300
TATGCATAAG CCTCATGAAA CTTGGGCACC AAGTCTTGCA ATGCGTCTCA TACTTGGTCA 360
AATGTCAACA GTAGTATTGG ATACTCAAAA AGTATTACCT aATAAAATTC rtrtV?V_rtA AVjnjrtj 420
aTTCCAATTT AAATATAGTA ATTTAAAAAT GGnACTTGAA GATTTAATTA AAGAATAATP 480
AATACCATTA ATGAGCATTA GAAACAACAT ATGTACTAAA TGTAATGTCT AGAGPGACT 539


(2) INFORMATION FOR SEQ ID NO: 657: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1106 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 657: 






ATCAAGTTGA ATATCATCCA TATTTAACCC AACATAAATT GArATTATAT TTGGCAGCAC 60
AACGTATCGT gATGGaATCT TGGTCACCAT TGATGAATGC ACAAATTTTA AATGATGAGA 120
CAATTAAAGA CATTGCTCAA GAATTAGGAA AGTCACCTGC CCAAGTTGTT TTAAGATGGA 180
ATGTGCAGCA TGCTGTGGTT ACAATCCCTA AATCGGTGAC ACCAAACAGA ATCTCTGAAA 240
ATTTCCAAAT ATTTGATTTC GAATTATCAG ATGAACAAAT GACGCGAATT GATGGTTTAA 300
ATCAAGATAA GAGAATTGGA CCTGATCCAA AAAAATTTGA AGGCTAGATT AAAATCGCTC 360
AACTGATGAA AAGGTTAGAT GAATTGTCAG GGCTTGGGAC ATTAAGTTCT TAGGCAATGT 420
AAAAAAGCTG ATTTCTATTA ATTATTTGAT AGAAATCAGC TTTTTTGATA TGTATTTTAT 480
AATGTACAGC TCGTTGAGCT GCTATTTTCC TTATATTAAG TGCCATTAAT ACAAAACCTA 540
GCTCTCGTTT AACTTTATTT AtTCCTCGAA CTGACGTTCG AGTTAAACCC AAAATAGCCT 600



CTGGTTCAGA AAGCTTTTGA TTAATTTGGA CTTTAAAGTA TTCCCAATTA TAATTCTTCA 720 

TGATTtTCTT ATTGGATTTT GAATTTGCTT TCATGCATTG TTGCCTCAAA GAACATGCTG 780 

AACAGTCATC GCATTCATAT AGTTTGAAGT CTCGTTTAAA ACCATATCTA TCATTACGGT 840

ATGCATATCT TTTAAAACCT ATTCTTTTGT TATTAGGACA TATAAATTCA TCATTAAGTT 900 

CGTCATATTT CCAATTTTGA GTGTCGAAAA TGTCACTTTT AAACTTTCTA GTTTTATCtT 960 

TAATAAACAT GCCATACGTA ATAAGTGGCG TTTTATTAAA ATCATCTATA ATGGCCATAT 1020 

AGTTTTGGCT CACTACCCAT AACCTGGCAT CAGCTACCAA ATGAACCGAA GGGATTTTTG 1080

GAATCCATTG GTTGAAAAAA TGGGAA 1106 
(2) INFORMATION FOR SEQ ID NO: 658: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 658: 

TTTTAACTTT ACTCTTTGAT TTAAAGAGTG ATAAATGTTT ACAGTTTAAT TAAAACTGCA 60 

TAAGAACTTC TAGCTTTTCT CTTTCGTTCA AAGAGAAGCA GCTGTTCGCA GTTTAATCAA 120

AACCACATAA AGCTTTTAAC TTTACTCTTT GATTTAAAGA GTGATAAATG TTTACAGTTT 180 

AATTAAAACT GCATAAGAAC TTCTAGCTTT TCTCTTTCGT TCAAAGAGAA GCAGCTGTTC 240

GCAGTTTAAT CAAAACCACA TAAAGCTTTT AACTTTACTC TTTGATTTAA AGAGTGACAA 300

ATGTTTACAG TTTAATTAAA ACTGCATAAG AACTTCTAGC TTTTCTCTTT CGTTCAAAGA 360 

GAAGTTCTAA TACCACCATA TCGTGCGATC GGGAACGGTA 400 
(2) INFORMATION FOR SEQ ID NO: 659:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1899 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 659:
ATAATTACAA TGGCTACTGC AATTATTGTT GGAATTATAT ATTTCAGTTT AGTCATCATT 60 
TATGCTCCCT TGATTTCAAA TTCATATCAT TAGTTTACCA TATTGAAGAT GATATAATAA 120
AACTTTAACT GAGCTTCATG GGGCACCAGG TTTTGAAGAA GAAGTAAAAA ATTATATGAC 240
TCAGCAAATG GCGCCGTACG TAGATGAATT TATTGAAAAT CGTATGGGTG GATTTTTTGG 300
TGTGAAAAAA TCTAAAAATC CAAATGCAAA ACGTGTAATG ATTGCAGCAC ATATGGaTGA 360
AATCGGATTT ATGATTACAA ATATCACTAA AAATGGAATG ATTCAATTCA CAAATTTAGG 420
TGGTGTTGCA AATGATATTT GGCAAGGACA ACGCTTAGTA ATTAAAAATA GAAATGGCGA 480
TAAAATTATC GGTGTTGTTT CTAATATACC TAAACATTTT CGTACTGGTA GTGAAGGTGC 540
ACCGGAAATT AAAGATTTAA CATTAGATAT AGGTGCTCAA AATGAAGATG AGGTGCGTGA 600
gCGCGGAATA GATATAGGAG ATACAATTGT ACCTCACACG CCATTCACAC AGTTATCTGA 660
ACATCGATAT AGTGCTAAAG CATGGGATAA TCGTTATGGT TGTGTCTTGG CAATTGAAAT 720
ACTAGAATTA TTAAAAGATA TAGAATTAGA TGTAGACTTG TATGTTGGCG CAAATGTTCA 780
AGAAGAGGTT GGATTACGAG GTGCGAAAGC ATCTGCAGAG ATGATAGACC CAGACGTTGC 840
ATTTGTAGTT GATTGTTCAC CTGCCAATGA CGTTAAAGGA AACCAACCAT TATCTGGTGA 900
ACTTGGTAAA GGGACGTTAA TTCGCATAAA AGACGGTACA ATGATTTTAA AGCCTGTATT 960
TAGAGACTAT TTATTAAAGT TAGTAGAAGC ACATGACATT GAACATCAAT ACTATATGTC 1020
ACCAGGTGGA ACAGATGGTG GAGAAATTCA TAAAGCTAAT ATTGGTATTC CGACTGCAGT 1080
TATTGGTGTA TGTGCACGAT ATATTCATAG TACAGACTCA GTATTTGATA TAAGAGACTA 1140
TTTTGCAGCT AGATCTTTAC TTTCAGAAGC CATTTGTAAT TTAGATAATA ATCAAATAGA 1200
AACATTACAA TATAAATAAT CGGGTAATAA CAACTATTAT CTCTAAATAG TTATATATAA 1260
TCATTAATTA AGGAGACATA AAAATGAAAC AACTTGAATC AGAACAACAA TTTGAATCTT 1320
TAAAACAAGG TGCTACAGTA TTTGAATTCA CTGCAGGCTG GTGTCCAGAT TGTAGAGTGA 1380
TAGAACCAGA TTTACCGGAA TTAGAAGCGA GATATCCTAT GTTTGACTTC GTATCAGTAG 1440
ACCGTGATAA ATTTATGGAT ATTTGTATTG AAAATGGTAT TATGGGTATT CCAAGTTTTC 1500
TAGTATATAA AAATGGAGAA CTGCTTGGAA GTTATATTGG AAAAGAACGA AAATCAATTG 1560
AACAGATAGA TGCATTTTTA GCTCAATACG TGTAATTTAG ACTAGAGAAA AACGGGGTAA 1620
TACTCGTTTT TCTCTGTTAC TATGTGTTGA TTTATTGTAA ACTATTATAA GGTGCGAAAT 1680
TAGGAGTGTT ACATATGAAT ACCTTTCAAA TGAGAGATAA ATTAAAGGAA CGTTTAAGCC 1740
ATTTAGACGT TGATTTTAAA TTTAATCGTG AAGAAGAAAC TTTGCGTATT TATCGAACAG 1800
ATAATAACAA AGGTATCACG ATTAAACTTA ACGCTATAGT CGCAAAATAT GAAGATAAAA 1860
AAGAAAAAAT TGTAGATGAA ATTGTTTATT ACGTTGATG 1899

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3774 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 660: 


GTATAATATC CTTTCAATCT GTTTTCATAT TTTATATATT TTTTAAATAT AAGTGCTAAA 60 

TGTTTTAACT AAAGCATAGA TTGACAAGAT GTTATACAGA ATTTCAAATT CTATCCAATA 120 

TTGTTCGAAG TGTAGTATCA CTGGATTGGT ATTAAACAAT GTAAAGGAGA GATTGCAAAT 180

GCCGTATAAT TACAAGAAAC AAAATGGAGA GTTAATGTCT GTAATGAGCC AAGGTGAAAA 240

GTTTATTCAT CAATCACCCG TTAATGATGA ACTTAGTGCA TTGATTAAGT TATTAATTTC 300

TAAAATTAAC GGTTGTCATT ATTGTGTTGA TATCCATAAA AAAGAATTAA AGGAATTGGG 360

TGTAaCACAA ATGAAAATTG ATGAAGTCTT GAGTTTTAGA CATTTAGATT TATTTACTGA 420 

TCAAGAAAAA GTGACGCTTG AATTTGCAGA AATGTTAAAT TCAATCAAAG ACTTTAAGAA 480

GTTTGAAATT ATTGACCGGC TAAAATCATT TTATGATGAA GAACAAATTA TTGATCTTGT 540 

CTTTGTTGTA AACCAAATTA ACGGTTGGAA CAGATTAAAT ATTATTAGTG ATAGACTATA 600 

ATTGTTCATA TAAATGCAGA GTTTCATCTC GAACGCTATA TCATAACAAA TCATGCCACT 660 


ATACAGGTCA AATCTTGTAT AGTGGCATTT TAATTTATCC CTTTGAATAC TGTTATTTAA 720 

CGAATATCGG TCCACCTGGT CCAACAGGGA TACCTAATAG GAACCAAATG ATGACAAATA 780

CTGTCCATAC AATACTTAGC GCGATTGAAT ACGGCATTAA ACTAGAAAGT AAGGCTCCGA 840

GTTTCATGCG TTTATCGTAT TTTTGTGCAT AAGTTAATAA TAAAGGTAAG TACGGCATCA 900 

TCGGTGTAAT TGGATTGGTA ATTGAATCGC CTACACGGTA AATGACTTGT GTGAATGCGG 960 

GATGAAAGCC GATAAGGATT AACATTGGTA CGAATATCGG TCCTAAAATA CCCCATTTAG 1020

CCGATGCGCT TCCGATTAAC ATGTTGACCA TTGCACTCAG TACAATAATA CCTAGTATCA 1080 

ATACAATACC GTTTTGATGT TCTAATAATT TGGCACCTTT AACAGCAGCG ATAATTCCTA 1140 


AATTACTCCA CTTTAAATAC GCAAGTAGCT GTGCTGCAAA AAACACAATA ACGATAAATG 1200 

TTCCCATTGA TCCTACAGCA TCGCCGAACA TTTTACCTAA GTCTTTTGTA TTTTTAATTT 1260

CTTTGCTTAA AATCCCATAA ACTAATCCAG GTACTAAAAA TACGACAAGA ATAATTAATC 1320 

CGACACCGTT AATTAATGGC GCATCGTCTA GTAAGCTGCC TGTTTTAGCA TTTCTTAAAA 1380

AGCTATGTTC AGGAATGGCT GTAATAATTA ATAAAATAAT TGTGACTATG AAACTGATAT 1440
CATCATGCAT TAAACTGTCA TCATATTTTC CTAATCTAGG AATAATGAGC TTAGTTGTAA 1560

CTAGCAATAT CGTAGGAAGT AATACAACGA CACTCGCTGC GATAAAGTAC CAGTTCATAG 1620 

CAACGTTTGT TTTAATAGAA TCTGAAACGA TACGTGTTGC CGGTTCTGTA AATGAATAGA 1680 

CCAAAGCATC TTGCATACCA ACAACTATAT TTGCTGCAAA TCCTCCAACA GCGGAAGCAT 1740 

ATGCCATCGT TAGTCCAGCG ATAGGGTGAT AGCCAATTTT AATAAAAAGC ATTGCTGCAA 1800 

GCGGCGGCAA GATAATTGTC GCAGCATCGC CGGCTGTACT ACCTAAAATA CCAATTAATA 1860

TAATAGTCGG TAAAATTAAG AAACGTGGTG CGCGATTCAC AACAGAAATC ATTAACTTAT 1920

CGAAGTATCC TGTTTTCTCT GCAACACCAA TACCAATCAT CACTGCTAGT ACTAAGCCTA 1980 
ATGCTGGGAA CTCTGAGAAA TTTTTAATCG TATCATTCAT TATCATCGTA AATCCATCAT 2040

GGCTAATTAT ATTTTTAATA TAAATGGTTT GATGCGTACC TGGATGCTTA ACAGATACAT 2100 

TAAATAATGA GATAACCCAT GTCATAATGG CTAAGCCCAC ACACATTAAA AAGAATAAGA 2160

CGCTAGGATC TGGCAATTTA TTTCCGATTT TTTCAACACT ATTCAAGAAA CGATTGACGA 2220 

TAGACCCCTT TTGTTGATGT TTTGATGTCA TCAATTATTC CCCCCTTTGT TAAATATTTA 2280

AAGTGTAACA AAAAATACTC TCAAAAGTAA CAATTTTCAG GAAATAAAAA AACTAATATT 2340

GTTAAATATT TTGAGTTATT CAATAGAAAG TGTATAGCAG AGTAGTTAAG ACTGCCTGAA 2400 

GACTTATCTA TTAGGTTTAT GAAGCATCGA ACAGTGGAAA ATAAGGACTG TAAGTTTAAG 2460 

ATATGTTGTA TAGGAGTGAC TGAATGAAAC GTTTGGAAAA TAAAGTAGCA GTCGTAACAG 2520 

GAGCAAGTAC AGGTATCGGT CAAGCTTCTG CAATCGCTTT AGCTCAATAA GGTGCGTATG 2580

TATTGGCGGT AGACATAGCT GAAGCGGTAT CGGAGACTGT CGATAAAATT AAAAGTAATG 2640 

GTGACAATGC GAAGGCGTAT AATGTGGATA ATGCAAGCGA ACAACAAGTG GTAGACTTTG 2700 

TGTCTGACAT AAAGGAACAG TTTGGAAGAA TCGATGTGTT GTTTAATAAT GCCGGTGTGG 2760 

ATAATGCGGC TGATAGAATT CATGAGTATC CAATAGATGT GTATGACAAG ATTATGAATG 2820 

TAGATATGCG TGGGACATTT TTAATGACGA AAATGATGTT ACCTTTAATG ATGAATCAAG 2880 

GTGGCTCTAT TGTTAATACG TCTTCATTTT CCGGACAAGC AGCAGACTTG TATCGCTCTG 2940

GATATAATGC TGCGAAAGGT GCAGTGATTA ATTTTACAAA ATCAATCGCA ATTGAGTATG 3000

GCCGTGATAG CATTCGAGCC AATGCGATTG CACCAGGTAC AATTGAAACC CCGTTAGTAG 3060 

ATAAACTGAC AGGTACGAGT GAGGATGATG CAGGTAAAAC ATTTAGAGAA AATCAAAAAT 3120 


GGATGACTCC GCTGGGACGT TTAGGTAAAC CAGAAGAAGT TGCTAAATTA GTAGTCTTCT 3180 

TAGCATCTGA CGACAGTTCA TTCATAACTG GAGAGACGAT TCGAATTGAT GGTGGTGTGA 3240
AATTATTTAA AATCGATATC CAGTGGAAAA GAATTTGGCA TGAAGTAGGC AATGTATGCA 3360

TATTGTCGCA ATGAAAGACA TTTAAAGCGG AGATTAATTA ATCTTTGAAA ATCACATATT 3420 


GTTCATTTGA AGTGTATTGC TTAAAGCTTG TTTTAACGTA TTGTAGGTCG TGCATCTAAT 3480 

TTATCAGAAT CTCGAGTTCA AACTTTTGGA GTATCTTTAA AATAAGCTGT ATTTTGTCTA 3540 

TTCTAATAAA TTAAGGAGAA TTTTATGTTA AAAGAAAAAG AAAGTTTTAG ATTGCTATAT 3600 


CAAGCTATAA GAGAGATTGC AGATAAAATT GGAGATAATC AGTTAGAAAC TAATTCCGTT 3660 

AGTTTATtAT tATTGGACTT TGATTTTGAA CATGAAGTAT TTGATGAATT GTATCTTGTG 3720 

ATTTtAAAAT ATTTAAATAC AGTAAGTATA GAGAACATAA GTCATAGTGA GCTT 3774

(2) INFORMATION FOR SEQ ID NO: 661:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1076 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 661:

TACTGGTTTT GGATTTTGGA GGTAATACAG TACCTAATAG TAATnATGGT GTCGTCAnAA 60
TTATATCTTT CCGTGATGTT CTTACTCATT AGAACATCnG CCTTCAGAGG AATCATGATA 120
CGAGGAATAA GAAATTTAAA TGTGAGCGAA GTCAATATAG TATTTGCGAT TATTTTTATT 180
AACTCCATTA TTATTGTTAG TTTGATTTTT CGAGGATAAC TTCAATTTTT GCATTTTGAG 240
GTTTTTTAAC ATATCTATTT GCATCAGTTG ATGGCAACCT TTTACTTAAA TCTATTGTGT 300
AGTTATTGTC TGCACCTGTT ATTTTAATTT GTCCTTTATT ATAAGAATTA TTATATAATT 360
TTTTACTTTT AATTAATGTT TGACGAATAC GAAAATCTAA TTCTTTTAAA GTTAAAACAG 420
GCTTATTGCC TTCATAAACT GGAAATCCGC CAGTAAACGT TTCTGCTTTA TCTTTATATG 480
TTACATTCAG TTTATAGTGT TTATCGTTAG ATGTTGCTGC AGGAGTAACA CCACCAGTAA 540
ACGTTTCTTG AGATAATGCA AAAGaATCAA TGGTTTCTTG GTCTTTTATG CyAAAAATAT 600
CAACGCTTTT ATTTCTTAAT TGGTTGATAT TGCCCCAACT TTCAGGTCCA TAAACTTGAA 660
TATGACTATA CCAAGaAAAC TGTAACAACG TTGCATGAAT CgTACCGTTA TCTTTTTGCC 720
ATAACGTACT GTTAGAGAAG GTTAAATATT TTTGCGAGTA ATATTTAGTT AACTCATTAA 780
CGTTAGTTTC GTTTTGATTT ATATAAtAAG cTTTCGCTTC AGATGAAGAA TTGATAkGTG 840
TATTAGGAAA TTGTGTAGAT GCTGTACCTA ATAGTAACAA TGTTGTTGAT AAAATAATTT 900
ATAAAAAGGG GTTAATTAGA TAATTGAAAT TATCCGCATT TACAAAAGGT AATAGGTTAG 1020
TTAGATTTTT CGAGTATGAC TcAATTTCTG CATTACGAGG ATTTTTAACA TAACGGTT 1078

(2) INFORMATION FOR SEQ ID NO: 662:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1398 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 662:

AACTTGCTTT TTAGTATCTA CGTTAATATT AATAGCTTTA ATTTCACTTG TATTAATTAA 60
ATCAGCTGTG TAAATACCTG CTTTCAAATC GATAACTTTC TTTGTTCCAT TTTTAAAGTA 120
AACAGTATAT TTCGCTTGCT TCGATAGTCT TAAATCTATA TCACTAATAC CTCTGTCTGA 180
TTTTAAAACT GATTTAACTC TATCCTCTAA ATCTTTATAA CTAATATTTT GATTCTTATT 240
AAATGTTAAG CTTGATAAAA TATTTTGGCT TGTACCGTTC ACAGTGaTTG CATATGGAAC 300
ATGGACTTTA GAATATCCAT GGTGTAACGA ACTTGATGAT TTATCTAATG GCTTAGCTGC 360
GGCAGACGCT TCATTATTAT TAAAGTTTGC ACCTGTTGAT GCTAAAACAC CTAATGCTAA 420
AGTTGTTGTA ATCAATGACT TAAATTTCAT AAATTATCTC TCCTTTTTTG TGTAATTCGT 480
ATTTGCAACT TAATTATAGC CAGACTTTCT CTATTTTTTG AATTAACTGA ATATTAATAA 540
TAAATTATCT TTAACAATAA TTTTTTAACA CTGTTAAAAG TTCTTTTAAT TTTGATTAAC 600
TAATTAATTT ACAATACCTA AAATGTTGTT TGGTTTTGTT TATACCAAGC TTCAAACTTA 660
AATGTCATAA CAACATTCAT TTCTTAATTC CTATTAGATT TGTCGATTAT ATTTACAGCA 720
TCTTTATACT CAAAAAACAT TTACTTAAAA ATATAAATTC GATTTAATAA TTAATTTAAA 780
TTTAGTTAAT CAATTTTGCA TCTATTTTGT TGTAAGCTAT ATAAAAGGAG TGATAATGAT 840
GGTGAAAAAA ACAAAATCCA ATTCACTAAA AAAAGTTGCA ACACTTGCAT TAGCAAATTT 900
ATTATTAGTT GGTGCACTTA CTGaCAATAG TGCCAAAGCC GAATCTAAGA AAGATGATAC 960
TGATTTGAAG TTAGTTAGTC ATAACGTTTA TATGTTATCG ACCGTTTTGT ATCCAAACTG 1020
GGGGCAATAT AAACGCGCTG ATTTAATCGG ACAATCTTCT TATATTAAAA ATAATGATGT 1080
CGTAATATTC AATGAAGCAT TTGATAATGG TGCATCAGAC AAATTATTAA GTAATGTGAA 1140
AAAAGAATAT CCTTATCAAA CACCTGTACT CGGCCGTTCT CAATCAGGGT GGGACAAAAC 1200
TGAAGGTAGC TACTCATCAA CTGTTGCAGA AGATGGTGGC GTAGCGATTG TAAGTAAATA 1260
CAACAAAGGC TTTGTTTATA CAAAAATAGA GAAAAATGGT AAGAACGTTC ACGTTATCGG 1380
TACACATACA CAATCTGA 1398

(2) INFORMATION FOR SEQ ID NO: 663:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1200 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 663:

TTGTAATTGG AACGTACGAA CTTTTCTAGG TAAGAAACGT CGAATCTCGT CCTCATTATA 60
ACCAACTTGT AGTCGTTTAT TATCTAAAAT AATTGGACGA CGTAATAAGC CAGGATTATC 120
TTGAATGATT GAATATAAGT CTTGTAATGG TAGTGAATCA ATATCAACAT TTAATTTTTG 180
GTATGTTTTA GAACGTGTAG AAATGATTTC ATCAGTACCG TCTTCAGTCA TTTTTAATAT 240
TTGCTTAATT TCATCAATTG TTAAATGTTC AGAAAAAATA TTACGCTCCG TATACGGAAT 300
GTCcATGTTC tTGkTAACCA TGCTTTCGCT TTTACGGcAA GATGTGcAaC TTGGtGaAGT 360
aAATAATGtT ACCATACATC TCACTCTCCT ATTTGAATGA ATAAAATTCA TTGCTTAAAA 420
TTTAGTTATA GATCAAGAAA AAACATTTTT TTCTAAAATT CTTAATCGTT ACTATTTATT 480
ATAACTATCT AACATTAAAA TTAAATGAGA AAAACCTAAT TTTTCAGATA AGTTTCTACA 540
CTTATAAAAA AGATTATTAA TCCCTTTGTT AGTAGTAAGT TATACGTATA TTCTAACACA 600
TCTTACATTT TTAAGAAATA CTGTTATAAT GATAATTATT AAAATATTAC TAAGAAAGTA 660
GGCATTTAAA TGGAGACATT ATTTTCAGGC ATCCAACCTA GTGGAATTCC TACTATTGGA 720
AATTATATTG GCGCACTAAA ACAATTTGTT GATGTGCAAA ATGACTATGA TTGTTATTTC 780
TGTATCGTAG ATCAACATGC AATTACAATG CCACAAGATC GTTTAAAATT ACGTAAACAG 840
ACCAGACAAT TAGCAGCGAT TTATTTAGCT TCTGGTATAG ATCCAGACAA AGCAACATTG 900
TTCATACAAT CTGAAGTCcC TGCACACGTA CAAGCAGGAT GGATGTTAAC TACGATTGCT 960
TCTGTTGGAG AATTAGAGCG TATGACGCAA TACAAAGATA AAGCTCAGAA AGCAGTTGAA 1020
GGTATACCTG CTGGTCTATT AACATATCCA CCTTTAATGG CAGCTGATAT TGTTCTTTAC 1080
AATACTAATA TCGTTCCAGT TGGAGATGAC CAAAAGCAGC ATATCGAATT GACTCGTAAC 1140
CTTGTAGATA GATTTAATAG TCGCTATAAT GATGTGCTTG TGAACCTGAA ATTCGTATGC 1200



(2) INFORMATION FOR SEQ ID NO: 664:
(A) LENGTH: 787 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 664: 



TGTACCTTTT GCTTTAATAA ATACTGTTTC TTTATCATAA TTAGGTTCAG TTAAGAAATT 60
AAATTGTAAG CTTTGAGTAA TATTTTTTTG ACTATCACTT GTTGTAGCTG TACGTGTATA 120
CATTTTAGTA TCACCATCAA [illegible] AGAAACTTGT TTGATTTCAG AATTAATCTT 180
TGCAAATGAA GTAGCTGGTA ATACAGTGAA AGTAGTCGAT AGTGCTAAsG wACAAATTGT 240
GATATTTTTA CATAGTTGTT TAATCATTAG TAATCCGCCC TTTCAATATT ATCCTTCTTT 300
ATAAGGTTTA TTGTCATCAG AATATTTATC AACGACTTTA ACTGTTTTAT TTTTCCAATC 360
AACTTCATAA GTGACAATTA ATCTTTGACC ATCTTTATTT TtcTCTAAAA TTGGAGGTGC 420
ATAATGTATy CCAGGTCTGT TTTTCAAAAT ATCTTGATTT CGTGTGTATG TTACTTCAAA 480
TTGCGTTTTC TCATTTGACT TTTCATTAGA TAAATAAGTT AAAAATTCTG GATTAAAGCC 540
ACTTCTTACT AATGCTGGGT ATCTATATTT TGAAGCAAAG CTTAGTTCaG GGTTTTCTAC 600
AGTAGCAAIT CTCGTATTTC TATAGAATAA TAATTCATCA TTTCTATTTT TCACTTCTCC 660
ACCATACTTC AAGTCATTCG CAATAACTGA CCAGTGTACA TGgCCAGTTn ATTTATTTTT 720
ACCGCTGGCA ATTGTGTCAT AATTTGCTGC ATCATCAACT AATCGTTTTG GAAGTAGCTA 780
TTTGCTG 787

(2) INFORMATION FOR SEQ ID NO: 665: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 665:

TTATCTGTCA TTAACCACTG GTACATGTGA ACCCGGCnTn AATCAATATA TATATTTAAA 60 

AGCAAAGGCG CGCCATGTGC CTTTtTTnTA tTTTGTATAT CTGTATCAAA ATCGATTTGA 120 


TTAAAATCCG CTTTATTTTA TCATCTATTC AAATGATTTT AGTGCGCTTA TTTTACTATG 180 

GCATTATTGC CTAACCTGTT TGAGATATAT TAATATTTGA TGATTCGTAT TCAAAGTTTT 240

CATTTAACAT GTATTTAGTA TCATGATAGC TGCTTCATTG ATGATATACT ACCTACTTTT 300
CAGAGGCAAT ATTGAACAAT TCTTCATCAT GACTATTTTC CATCACATAA CTATGCTTAG 420
CGAACGCTAA CATATCTTTA TCATTATTCG CATCTCCGAA GGCCATGAGC TCTGAAGGAG 480
ACATTTCCCA TTTATCTAAC AATCGTTTTA ATGCCTGnCC TTTAGTCATG TTT 533

(2) INFORMATION FOR SEQ ID NO: 666:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 666:

GACTTCTCTT GCACTAAAAA TTGCTCAAAT TGCTCGCTGT CATAACCAGG CGGTGTTTCC 60
AACCATACAT ATATGCCACC TTTAGCATGA ACAAATGGCA AATCAGCTTT TGCAAGCATG 120
GCTTCGAATC GGTCACGACG TGTTTTAAAT ACATTGCTTT GTTCTTCTAA AAAATCATCA 180
TAATGGATTC AAAGCATATA TTGCGGTCAT CTTGTAAATG CACCAAACAT CCCAGCATTT 240
GTGTGCGTTT GGTACTTTTT CAAAGCTTGA AATCATATCT TTATTACCAA CTGCCAAAAC 300
CGACTCTGAA AACCTGGACA TGTTGATGAA CCTTTnAGAC AAGGGAAGAA ATTTGCAATC 360
GCAACATCTT TGCCCATTTT nCCGAAGCAA GTGnGACTAG 400

(2) INFORMATION FOR SEQ ID NO: 667:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 412 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 667:

TCTCCAATGC TACTCAAATT AAAAAGGTTT TAAATATTGT TAATTCTGAT CCAGAGCGAA 60
AGATTGTTAT CGTTTCTGCT CCAGGTAAAA GACATGATAA TGATATTAAA ACAACTGATT 120
TGTTAATCAG ATTATATGAA AAGGTCATTA ATCATCTTGA TTATCATGAT AAAAAAAGAG 180
AAATTATTCA GCGTTATGAT GATATTGTAA AAGAATTGCA AATGGATGAA AGTATTTTAC 240
GGACGATAGA TGTGACTTTG GAACATTATA TAAATCAATT AAAAAATGAA CCAAAGAGAC 300
TATTAGATGC ATTACTTTCT TGTGGTGAAG ATTTTAATGC GCaAyTGAtA GCyTTATATA 360
ATAAtAGTCa gGTtACCAAC AAmATwTATA TCCcCGAAGG AAGCGGGTAT TT 412

(2) INFORMATION FOR SEQ ID NO: 668:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 668:

TTACTCGTTA AAGATATTGT AAAAGATGAA GTGACAGAAT ATGACATTCA TCAAATGTTA 60
CCGCATCCGA TTAATATGGT AAGGGTTAGA CTTTTTGGTG TGAAATTAAA AGAGATTATA 120
GCTAAAAGTA ATAAACAAGA nTATATGTAT GAACATGCAC AAGGTTTGGG TTTCAGAGGG 180
AATATATTTG GAGGATATAT TCTTTATAAT TTAGGGTACA TTCATTCTAC AGGGCGTTAC 240
TATCTGAATG GAGAAGAAAT CGAGGACGAC AAGGAATATG TACTAGGTAC GATAGATATG 300
TATACGTTCG GTCnTATTnC CCAACATTGA AGGATTACCA AAGAGTATTT AATGCCAGAG 360
TTTTTCAGAG GTATATTTAA nGAAAATTAT TGGCTATTTA 400
(2) INFORMATION FOR SEQ ID NO: 669: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1236 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 669:


TGTTCGATAT TTTTAAATTT ATCTTTTAAA TACAACAACT GTTTCCGTAA TGATTTAACT 60
GTTTTATGAC TAATGCCATT GAATATTTCT AGCGTTTTAT TTAACTTATC GATAATCGCA 120
TGTAAATCCT TCAAAATGTC TTTTGTTTCA AAAGTAAATA CATTATGGAA GCGATGAATA 180
TCATCATCAT AAACATCAGA ATCATTGATA ATCGTAAATA TCGTTGAGAA CAATTGCTCA 240
TTTAACTCAT GAATCTCATT CATACTAGCC TTCAAGCCAA AAATATCAAT TGGTGCAATA 300
TCTAATTTTT CCAAAATTCG CTGCTTTTCC AGTTGATCAA TTGCCTTTAA CAATTTTTCA 360
TTTTCGTTTT TACCAATCAA ACCAAGCTGA TATTTAATAT CAGCATAACT CAACTCATTT 420
GTCACTTGAT TTAAGGCATA GTCTGGTAAG CGATGTGCTT CATCCACTAT ACAATCATCA 480
AACAATTGAT ATATTGAATT TTCAACATCA GAATGaATTA AATGTGCATG ATTTGTAATA 540
CCAATTTGaA TGTTCTGTGC ATTTCGCTTA ATAAAATTAT aATAATGAAC ATCGTGACGT 600
GCCGGTACAT ATGTTTCAAT TTTCTGGTcA AAATACATCT TTTGACCACC TTTTAAATTT 660
ATATTCACtT CGTAATTACT TGTGTCATCT TTTAAAATTT GACTAATAAG CCCCAATGAA 780
ATGTAATCaC TTtTACTTTT AATCAATAGT GCATTAATTT TAAAATTCAA CGCTTCATTC 840
ATTGCTGGAA TATCTTTTTC TAACAATTGA CTTTGCAGTA ATTTAGTATT GGTAGAAATC 900
ATGACATGCT TCCCAGTTTC AATATTATAC ATCAAGGCCG CAAGTAAATA TGCTAATGAT 960
TTACCACTGC CTAGTGATGC TTCAATCATT GCTTTTTCAC TATGCATGAG CTGATCTAAT 1020
ATAGTTTCCG CTAAATATAA TTGTTGCGGT CGATATGTTA AGCCAAGTTG ATCTACAGCT 1080
TTGCTATATA AAGACTTCAA GCTGCCATTA TAATTTGTTG TCGGCTTTTT AAAATCAACT 1140
TGCTTACGAT AGATAATCTG TTCGAACTTT TCGTACGATT TATCCAATGG CTTTGCATCA 1200
TATTGCCTAA CCATCTCAAA GAAAATATCA TACAAA 1236
(2) INFORMATION FOR SEQ ID NO: 670: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1819 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 670:


ACAsAATTAT TGACCAATAT GACTCGTGGA CTGATATGTT TAAAGCACTA CTGCATGAAA 60
CATTTAAAGC ATATGGCGTT CTATTTATAG ATGCGCAGTT TGAGCCGTTA AGAAAAATGG 120
AAGCGCCTAT GTTTAAAAAG ATTTTGAAAA AACATCAGTT GCTTGATGAT GCTTTTAGAG 180
CAACACAACA ACGTACTCAA AATCAAGGCT TGAATGCGAT GATACAAACA GATACAAATG 240
TTCATTTATT CTTACATGAT GAAAATATGC GTCAATTAGT tTCGTATGAT GGTAAGCAyT 300
TTAmATTAAA TAAAACAGAT AAGACATATA TAAAGGAAGA AATTATAAAT ATTGCGGAAA 360
ATCAACCkGA ATTATTTTCT aATAATGTAG TGACAAGACC ATTAATGGnA GAATGGTTAT 420
TTGAACACGG TGGCATTTGT TGGAGGACCG AGTGAAATTA AGTaCTGGGC TGAACTAAAA 480
GATGTATTTG AACTATTTGA TGTTGAAATG CCtATCGTGA TGCCAAGGCT TAGAATTACT 540
TATTTAAATG ACCGTATAGA AAAATTACTT TCGAAATACA ATATTCCATT AGAAAAAGTG 600
TTAGTCGATG GTGTTGAAGG AGAAAGAAGT AAGTTTATTA GAGAACAAGC ATCACATCAA 660
TTTATTGAAA AGGTAGAAGG TATGATTGAA CAACAGCGTC GTCTAAACAA AGACTTATTA 720
GATGAAGTGG CGGGGAATCA AAATAATATT AACCTTGTGA ATAAAAATAA TGAAATTCAT 780
ATACAACAGT ATGATTATTT GTTAAAACGT TATCTTTTAA ACATTGAAAG AGrAAACGAC 840
GAAAGAATAT GGAATCCACT TCAAATXTTG AATGATTTTG GGACAGATGT GTTCAAGCCC 960
TCCACCTATC CACCACTTTC TTACACTTTT GATCATATTA TTATAAAACC TTAATATACC 1020
AAGGGTTTAG CCCGATTTAT CTTAATGATA AATCGGGCAT TTTTTTGTTT TTTAAAATAA 1080
ATTTCACAAA TTTTGTATAA ATAGTGGTGG ATAGTGGGGA GATGTGGTAA ATTATATATA 1140
AGGTGAGGTG ATAAAAAATG TTCATGGGAG aATACGATCA TCAATTAGAT ACAAAAGGAC 1200
GTATGATTAT ACCGTCCAAG TTTCGTTATG ACTTAAATGA GCGTTTTATT ATCACAAGAG 1260
GCCTTGATAA ATGTTTATTC GGTTACACTC TAGACGAATG GCAACAGATT GAAGAGAAAA 1320
TGAAAACCTT ACCTATGACA AAAAAAGACG CACGTAAGTT TATGCGTATG TTC?TTCTCTG 1380


GTGCTGTTGA 


AGTAGAACTT 


GATAAGCAAG 


GGCGTATTAA 


CATCCCTCAA 




JLtt \J 


AATACGCTAA 




GAATGTAPAG 


TAATCGGTGT 


TTCAAATCGT 


M.A Xt^rtVXrlX X X 




GGGATAGAGA 


AACTTGGAAT 




AAGAATCTGA AGAAAGTTTC 


GAAGATATTG 

\Jtfv\\?f\ X X XVJ 




CTGAAGATTT AATAGATTTT GATTTTTAAA ATGGAGGAAT TGAAGtGTTT CATCATATCA 1620
GCGTTATGTT AAACGAAACC ATTGATTATT TAAATGTAAA AGAAAATGGT GTGTACATTG 1680
ACTGTACGCT AGGTGGAGCG GGACAnGCCC TTTATTTACT AAATCAATTA AATGACGACG 1740
GAAGATTAAT AGCAATCGAT CAAGACCAAA CTGCAATTGA TAATGCTAAA nGGGTATTAA 1800
AGGATCATTT GCATAAAnG 1819



(2) INFORMATION FOR SEQ ID NO: 671: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 609 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 671:

ACCAATTATT GGACAGGAAT TAGCAAAAAA TGCAATGCTT GCATTAATCT ATGCATCGAT 60
AGGTATCATC ATCTATGTAT CATTACGATT TGAATGGCGC ATGGGTCTTT CATCTGTATT 120
GGCATTATTA CATGATGTAT TCATTATAGT AGCGATTTTC AGTTTATTTA GAATTGAAGT 180
AGATTTAACA TTTATCGCCG CTGTATTAAC AATTGTCGGT TATTCAATTA ATGATACAAT 240
CGTAACGTTT GACCGTGTAC GTGAAAACTT ACAAAAGGTT AAAGTGATTA CGACAACAGA 300
ACAAATTGAT GATATCGTTA ATAGATCAAT tAGACAGACA ATGACACGTT CAATTAATAC 360
AGTATTAACA GTTATTGTAG TAGTAGTTGC TATACTATTC TTCGGTGCTC CTACGATATT 420
TGCCGTTCCG CTATGGGGAA TAATGAAAAA ACGTCAGTTG AAAAAATCGC CGAAACACAA 540
ATTAGTTGTA TATAAAgAAA AGAAATCGAA CGATGAAAAG ATTTTAGTTT AAAAtGaATT 600
AAGCGGTAT 609

(2) INFORMATION FOR SEQ ID NO: 672: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 2222 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 672:

CTTAAAACAG CAATTTGAAC GTTTACAAAA TGAACAAATC TTTGTTTATG TTTGTCATGG 60
TAATCACGAT CCTTTATCAT CAAAGATTTC ATCAAACTGG CCAGATAATG TtnTTGTATT 120
TTCAAATAAA GTTGAGACGT ATGAAGCAAT TACTAAATCT GGTGAAACAA TTTATATTCA 180
CGGATTTAGT TATGAAAATA GAGCAAGTTA TGAGAACAAG ATTGATGAAT ATCCATCAAG 240
TCAAGGCCAA AAAGGCATAC ATATTGGTGT CTTGCATGGT ACGTATAGTA AATCTTCAGT 300
TAACGAAAGA TATACCGAGT TCATTTTAGA AGATTTAAAC AGTAAATTGT ATCATTATTG 360
GGCTTTAGGT CATATACATG AACGTCAACA ATTAAGTGAT ATGCCTGTAA TTAACTATTC 420
AGGTAATATT CAAGGTAGAC ATTTTAATGA GCAaGGTGAA aAAGGTTGCT TATTAATCGm 480
GGGTGACCAC TTAAAATTAA AGACTAAATT TTATCCTACA CAGTATATTA GATTTGAAGA 540
AGCAACTATT GAAACGGATA AGACATCTAA GCAAGGTTTA TACGAGGTCA TTCAAAACTT 600
TAAAGAACAA GTGAGAGAAG AAGGAAAAGC CTTTTATCGT TTAACGCTTG TTATTAATAG 660
TGAGACATTA ATTTCACCTC AAGATTTATT ACAAGTTGAA GAAATGATTA CAGATTATGA 720
AGAAAACGAA AATCAATTTG TATATATTGA TGAGTTAAAA ATACAATATG CACAAAATGA 780
TGAGTCACCT TTAGTTAATG AATTTTCAGC GGAATTATTA GTCGATCAAA CTGTTTTTGA 840
TAAAGCGATG TCAGATTTAT ATTTAAATCC AAGGGCATCT AAGTTCCTAG ACGATTATGG 900
AACATTCGAC CATACAGCAT TAGTTAATCG TGCTGAAGAA ATATTAAAAG CTGAAATGAG 960
AGGTGAACAA AATGATAATT AAATCACTTG AAATTTATGG TTACGGTCAA TTTGTTCAaC 1020
GTAAAATTGa ATTTAATAAA AACyTCaCTG AAATTTTTGG TGAAAATGAA GCGGGTAAAT 1080
CGACGATTCA AGCATtCATC CATTCGATAT TATTTGGATT TCCAACTAAA AAGTCTAAAG 1140
AGCCAAGACT AGAACCACGT CTAGGTAACC AATACGGTGG TAAATTAGTA CTTATTCTTG 1200

TATATTTACC TAATGGTGCT GTGCGTGATG ATGCTTGGTT ACAAAAGAAA CTTAATTATA 1320
TTTCTAAAAA GACATATCAA GGTATCTTTT CATTTGATGT ACTAGGGCTT CAAGACATTC 1380
ATAGAAATCT AAATGAAAAA CAATTGCAAG ATTATTTATT ACAAGCmGGG GCTTTAGGAT 1440
CAACTGAaTT CACGTCAATG CGCGAAGTGA TTAATCGTAA AAaAGATGAA TTATATAAAA 1500
aATCAGGTAA AAATCCGATC ATTAATCAAC AAATTGAGCA ATTAAAACAA CTAGAAAGTC 1560
AAATTCGTGA AGAAGAAGCA AAGCTAGAAA CATATCATCG CTTAGTAGAT GATCGAGATA 1620
AATCATCACG TCGATTAGAG AATTTAAAGC ATAATTTAAA TCAATTATCA AAAATGCATG 1680
AAGAAAAACA AAAAGAGGTT GCTTTACATG ATCATTCACA AGAATGGAAG TCTCTAGAAC 1740
AACAGTTAAA TATTGAGCCA ATCACATTCC CAGAAAAAGG TGTGGATCGT TACGAAAAAG 1800
CACGAGCGCA TAAGCAATCG TTAGwAAGAG ATATTGGTTT AAGAAATGAG CGTTTAGCTC 1860
AACTTAAAGA AGAAGCGACT CAATTAGAGC CAGTTAAACA ATCTGATATT GACGCCTTCA 1920
TTAGTTTGAA TCAACAAGAA AATGAAATTA AAAATAAAGA ATTTGAACTT ACTGCAATCG 1980
AAAAGGATAT TGCGAATAAA CAACGTGATA AAGATGAATT GCAATCAAAT ATTGGTTGGT 2040
CTGAAACGCA TCATGACGTA GATAGTTCAG AGGCAATGAA AAGTTATGTC AGTGAGCAAA 2100
TCAAGAATAA ACAAGAACAA GCTGCATACA TTAAACAATT AGAACGTAGT TTAGAAGAAA 2160
ATAAAATCGA AGATAATGCG GTTCATAGCG AACTAGATTC TGTTGAAGAA AAATAGTTCC 2220
TG 2222

(2) INFORMATION FOR SEQ ID NO: 673: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 406 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 673:

TTGCAGGTAT CATTTTTTTA ATGCCATATG GATTATGTTT TCTACCGTTT TATAAGCAAA 60
AAAAGAAAAA ACAGACATTT AAAAAATACA TGGTTTACAC TACGATTGGT TTGTCAATTT 120
GTCTAGGCTT ATCTCTAGTT TTGGTTCACA CTACGAAAAT TTATATGGAC GAAGGTGGCG 180
TAAGATACTA TTACGGTAGT TTTGTAATGA AACAAGCGGG CGGTTATGCT TATTTAGCTT 240
TAGCGGTACT TTCAACGTTG TTAATTGTTG CGAAAAAAGC TACAAATAAA AATAAAGAAA 300
TCGAAACCGT CGACAATACA AATATAACGG AAAGATAATT AAGGGAGTGC TCATTCAGGA 360

(2) INFORMATION FOR SEQ ID NO: 674:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 927 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 674:

CCACTTTGTG GATTTTCTTT ATATTCTCCA CGCTTGtACT ACAACTTCTT TCGTTTCTGT 60
TTCATCACCG ACTGCCGCAT TCGTTAGCAC ATGTAATAAC TCTTTTGCGG TTAATACATT 120
CTCATCTATA ATCTTATCTT TTTGTTCTTG TATATATTGC TTGATGTGCG GCTTTTTCAA 180
TAACCTACAC GCTGTCACAT GTGCGCTATT TGCGTTATAT CCTGyTTTTA TGGCACTTTG 240
TGTTACATTC AGTGTTCTAA TATACTCATT CACAAAACGT GCTTGCTTTG CAGTTAACTC 300
ACTCATTTTA TCACCCCCAC AATTTTATCT AATATGGTTT CATACCATAA TATTACAGAT 360
TGTTCTGAAC AATCTAAGGC ACTACTAATA TCTTGATAAC TAAGTCCTTG TATAAGGGAG 420
TCAAAAATAT AAAACTCTTT ATCGGTCGCT AATCTGTCAA CAATCATTTC TATGTGATTC 480
TTTATAATAT GATCATTGAC ATTATCGTCT GTCATCAATT CGTCAGAATC TTCATCACCT 540
ATTAAAAAGA AATCATCAGT ATTTATTTCA TCATCGCCCC GTTAACTAGC TTTGAAGTCT 600
TTAGCACACT TGCATATACC GGCTGTCGTG CTGGCkAGAT ACTAGCATTG AAGTGGTCTG 660
ATATTGATTT TGAAAACAAC ACGATTAGTA TTACTAAAAC ATATTACAAT CCAAATAATA 720
ACAAGAAGAA ATATCAGATA CTTCCCCCTA AAACTGAAAG TTCTATCGGT AAGATTTCCG 780
TTGACCCAAA TGTAATAAAG GtGtTGCGtG ATTATAAGAT AAACGTTCAA AATAATTGGA 840
AAAACGAATT ATATAATGAT AACCATTGTA ATGAAAAAAC TATCAATGTG GATACAAACC 900
ATTATGTCAC GTACCAGCAT ACTCAAA 927



(2) INFORMATION FOR SEQ ID NO: 675: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1100 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 675:

GATGTTAAnA TAAGTACATC ATTTTGACTC CCAAATATTG GCTTTAGACC TTGAAATGCT 60
CTCATAATGG CATCAGGTAC CAGGGGTTGG GaCCAGGTGT TAATAACAAC GGTTGATGAT 180
AATACATTTA TATGCCTCCT ATAAATAAGA TTTCACTATT TTAGCAAATT TTCTGAAAAT 240
TTAAAAGCCT AAAAAgtGAT AACGCTATAT TGTCGAACAA ACGTTATCAC TTAACGATTT 300
TATTTGGCCT GAATGATGGG ATAGATTTTT ATGTCCACAT TATTTCTTAC AGCATTTGAA 360
ATCATGCAAT TATTATCTGC AATTGTTATC AATTTTGGTA ATCGCTTTTC TAATTGTGCT 420
ATTTGATCAC TTGGAATTTG AATTTGAGGA TGGTGCACAA TTTTTGACAT ACTGAATTTT 480
CCGTTATTTA AACAAGCTGT TCCAATCGAT TGTTGTkCAA TTGAAATATC TGTGAACTTT 540
GCACGTTCAA GAGTAGCTGC TAATGAGATG ATATAACATG ATGAAGCGGC TGATACTAAC 600
ATTTCATCGG GATTTGTTCC TATACCAACA CCACCTAAAG AAGCAGGTAT AGAAATATTC 660
TCTGAAAGTA TGTCGCCTTG AACGTTTCCG ACATTGTTAC GACCACCTTG CCAAGAAGTT 720
TGGACTTTAA AGTCATGTTG ATGCAATTGC TTTAACCTCC AATATAATTG TGATAGTTTA 780
ATTTTAGAAT AGTTTATCAA AGTTTAACAG AAAGGTGACT TATCAATGAC TCTGAATAAA 840
CTGAAAGATG AATTACAAAT TGTTTCGCAC CGTGGATTGC CGAGTGATTT TCCTGAAAAT 900
ACAATGGTCG GTTATCGAGA GGTAATGGGG CTCAATGTTG CTATGTTAGA AATAGATGTT 960
CATTTGACCA AAGACCAACA TTTTGTTGTG ATACATGaTG AAACAATTGa TAGAACATCG 1020
GaTGGtArGG GGCGTAwTGc TGaTTACACA TTATCGCAAT TAAAATCATT TGATTTTGGT 1080
AGTTATAAAG ATGTTGCTTT 1100



(2) INFORMATION FOR SEQ ID NO: 676: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 460 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 676:

ATTAATTTCA TATGGAAATA GTTGAACTAA CTGCGCTGTA TAAGCTTCAA CAGTTTGATG 60
AATTAACGAT TGATCTTCAA TATATCCGTA GAACAAATCT TCAGTACAAA CTACTTTACC 120
TTTATCAGGT TTAATTGCAC CTGCCAACAA TTGACCTACC AACGCTTTGG AAGATTCAGG 180
TTCACCAATT ATACCTAATG CTTCTCCTTG ATAAATATGT AAACTAATAT TGTTTAAATC 240
GATATCTTCA GCATCATATC CAAAAGGTAA ATACCATTTC TTATTCTGTT TATTCCTATA 300
GTAGTGTGTT ACTTTTAGTA ACTTTAAAAC AATTGAACTT CCCATCTATT TTCATCCTTC 360
CCCCACGCaA AAATACCTTT TAATCTTnCT ACTTTAAAAT 460




(2) INFORMATION FOR SEQ ID NO: 677:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1451 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 677:

GTCAAGTTTT CGGCTAGATT TTAAACGCTC ATCACTATGT AGAATGAAAC TTTCGAAAAA 60
CTGTAAATCA TCATAACCTT TTACATAAAC ATAACCTTCG CCACCAATTG CTTGAATTAA 120
ACATTGGGCG GCCATTTGAA TTTCTAAAGA TTGTTTTTCT AGCCTATTAA AGATACCTAT 180
TAGTTGTGTG TTTAAGATTT TTGACATCTT TATCCTCCAA TCTACTTATA AAATATTGTA 240
ATTAATGACT XCATATTATG CAACGGCTTA AATTGTATAA AAATGTATAC GTTTGCATTT 300
AGTATAACTA TCGCATTTTT CAAAAAATAC ACATTTAATC TGCAGTATTT CAATGCATTG 360
ACGCTATTTT TTTGATATAA TTACTTTGAA AAATACGTGC GTAAGCACTC AAGGAGGAAC 420
TTTCATGCCT TTAGTTTCAA TGAAAGAAAT GTTAATTGAT GCAAAAGAAA ATGGTTATGC 480
GGTAGGTCAA TACAATATTA ATAACCTAGA ATTCACTCAA GCAATTTTAG AAGCGTCACA 540
AGAAGAAAAT GCACCTGTAA TTTTAGGTGT TTCTGAAGGT GCTGCTCGTT ACATGAGCGG 600
TTTCTACACA ATTGTTAAAA TGGTTGAAGG GTTAATGCAT GACTTAAACA TCACTATTCC 660
TGTAGCAATC CATTTAGACC ATGGTTCAAG CTTTGAAAAA TGTAAAGAAG CTATCGATGC 720
TGGTTTCACA TCAGTAATGA TCGATGCTTC ACACAGCCCA TTCGAAGAAA ACGTAGCAAC 780
AACTAAAAAA GTTGTTGAAT ACGCTCATGA AAAAGGTGTT TCTGTAGAAG CTGAATTAGG 840
TACTGTTGGT GGACaAGAAG ATGATGTTGT AGCAGACGGC ATCATTTATG CTGATCCTAA 900
AGAATGTCAA GAACTAGTTG AAAAAACTGG TATTGaTGCA TTAGCGCCAc ATtAGGTTCA 960
GTTCATGGTC CATACAAAGG TGAACCAAAA TTAGGATTTA AAGAAATGGA AGAAATCGGT 1020
TTATCTACAG GTTTACCATT AGTATTACAC GGTGGTACTG GTATCCCGAC TAAAGATATC 1080
CAAAAAGCAA TTCCATTTGG TACAGCTAAA ATTAACGTAA ACACTGAAAA CCAAATCGCT 1140
TCA9CAAAAG CAGTTCGTGA CGTTTTAAAT AACGACAAAG AAGTTTACGA TCCTCGTAAA 1200
TACTTAGGAC CTGCACGTGa AGCCATCAAA GAAACmGTTA AAGGtAAAAT TAAAGAGTTC 1260
GGTACTTCTA ACCGCGCTAA ATAATTAATA TTTAGTCTTT AAGTTATTAA TAACGTAGGG 1320
AATAAATAAA ACAGTTTGAT TTTAAAATGA AAGCGTAAAA ATGGTAAAAT ATATCAAAAT 1440
TGATTGTGAT A 1451
(2) INFORMATION FOR SEQ ID NO: 678:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 668 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 678:

nGTATTGAAG CGGTTAAACA AACACCTAAT GCAACTGACG AAGAAAAGCA GGCTGCTGTT 60
AATCAAATCA ATCAACTTAA AGATCAAGCA ATTAATCAAA TTAATCAAAA CCAAACAAAT 120
GATCAGGTAG ACACAACTAC AAATCAAGCG GTAAATGCTA TAGATAATGT TGAAGCTGAA 180
GTAGTAATTA AAACAAAGGC AATTGCAGAT ATTGAAAAAG CTGTTAAAGA AAAGCAACAG 240
CAAATTGATA ATAGTCTTGA TTCAACAGAT AATGAGAAAG AAGTTGCTTC ACAAGCATTA 300
GCTAAAGAAA AAGAAAAAGC ACTTGCAGCT ATTGACCAAG CTCAAACGAA TAGTCAGGTG 360
AATCAAGCAG CAACAAATGG TGTATCAGCG ATTAAAATTA TTCAACCTGA AACAAAAGTT 420
AAACCAGCTG CACGTGAAAA AATcAATCAA AAAGCGAATG AATTACGTGc TAAGATTAAT 480
CAGGATAAAG AAGCAACAGC AGAAGAAAGA CAAGTAGCAC TAGATAAAAT CAATGAATTT 540
GTAAATCAAG CCATGACAGA TATTACGAAT AATAGAACAA ATCAACAAGT TGATGATACA 600
ACAgTCAgCG CTgATAGCtT GCTTTAGTGA CGCCTGACCA TATTGTTAGA GCgCTGCTAG 660
AGATGCGT 668



(2) INFORMATION FOR SEQ ID NO: 679: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1906 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 679:

GATCCAAATT TAAAAGGAAA AATAGCCTTT AACGAATTTA CGAAACAAAT TGAATGTTTA 60
GGGAAAGTGC CATGGAATAC TAATTTTAAG ACACGTCAAT GGCAAGACGG TGATGATAGC 120
AGTTTAAGAA GTTATATCGA AAAGATTTAT GACATACACC ATTCAGGTAA AACAAAAGAT 180
ATCGTGGGAT GGACATAAAC GCCTTGAAAA GTTATTTATC AAATACTTAG GTGTTGAAGA 300
CcTGaAGTGA ATAGAACAAC TACCAAAAAA GCATTGACTG CTGGAATCGC TAGAGTAATG 360
aAGCCTGGAT GTAAATTTGA CTATATGCTT ACACTTTATG GTCCTCAAGG TGTAGGTAAA 420
TCTGCTTTGC TAAAAAAATT AGGTGGTGCA TGGTTTTCTG ACAGTTTAGT TTCTGTTACA 480
GGTAAAGAAG CTTATGAGGC CTTACAAGGC GTTTGGCTAA TGGAAATGGC AGAACTTGCA 540
GCTACAAGAA AAGCTGAAGT TGAAGCTATT AAGCATTTCA TATCTAAACA AGTTGACCGA 600
TTTCGTGTTG CTTATGGGCA TTATATTGAA GATTTTCCAA GGCAATGTAT TTTCATTGGT 660
ACAACTAATA AAGTTGATTT CTTAAGAGAT GAAACTGGTG GAAGACGTTT TTGGCCAATG 720
ACTGTAAATC CAGAGAGAGT TGAAGTGAAC TGGTCTAAAC TAACCAAAGA TGAGATTGAC 780
CAAATTTGGG CAGAAGCTAA ACACTATTAT GAACAAGGAG AAGATTTATT CCTTAACCCT 840
GAACTAGAAG AAGAAATGCG TTCAATACAA AGCAAACATA CTGAGGAATC TCCATATACA 900
GGCATTATTG ATGAATATCT TAACACACCm ATTCCtAGCa ATTGGGATGA CTTAACTATC 960
TTTGAACGAA GACGATTTTA TCaAGGTGAT GTTGATATGT TACCaACAGG AAATGTaGAT 1020
TACGTTAAAA GAAATAAAGT CTGTGCGCTT GAAGTGTTTG TTGAATGTTT TGGTAAAGAT 1080
AAGGGAGATA GTAGAGGATC TATGGAAATT AGAAAGATTT CAAACATCTT AAGACAATTA 1140
GACAATTGGT CTGTATATGA TGGTAATAAA AGTGGGAAAA TTCGATTTGG AAAAGATTAT 1200
GGTGTACAGA TAGCTTATGT AAGAGATGAA AGTTTAGAGG ATTTAATATA AGAAATATTG 1260
AATAAATATG CATTTTAGAG TGTTGTATCA GATGTTGCAT CATTTTTTGA GTGATGCAAC 1320
ACGGGAGTGT AAAAAGTAAT CGTAGGTGTT GTATCATTTT TGGTGATGCA ACATTGATGC 1380
AACAAATGAT ACAACACCTC TTTCCTTTCT AGCTGTAGGG TTCAACCCTG TTTGTTTCCA 1440
ATGTTGCATC AAATTCACTA TAAAGTTTAA AAAGTAGTGT TAGGGAGTAA AGGGGTATAG 1500
GGGTAACCCT CTAACAGCTA TTTTTAAAAG TTTGGCAAGA ATTGATACAA CATCGGAACA 1560
CAAATATAAA TTTTGTATAC AAGGTGAATA AATGAAAGAA TCGACATTAG AAAAATATTT 1620
AGTGAAAGAG ATAACAAAGC TAAACGGTTT ATGTTTAAAA TGGGTCGCAC CTGGAACAAG 1680
AGGTGTGCCA GATAGAATTA TTATTATGCC AGAAGGAAAA ACATATTTTG TAGAAATGAA 1740
GCAAGAAAAA GGAAAGTTGC ATCCTTTACA ArAATATGTG CATAGACAAT TTGAAAATAG 1800
AGaTCATAAA GTATATGTGT TATGGAATAA AGAACAAGTA AATACTTTTA TCAGAwTGGT 1860
AGTGAACATT TGGCGATTGA CTTTCAAACC ACATAGCTllT CCAAAG 1906




(2) INFORMATION FOR SEQ ID NO: 680: 









(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 948 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 680:



10 


ZX 21" [ " I '21' I "" r w I *f"121 
rtnl Aril 1 XVJtr* 




LZiTTZvZiarar 


TTGATTTGTC 


AGTTTAAGyA TTTGAGTGCA 


60 






2i'7Y^Tr , a r ri r 2 r ra 

nlviWilulA 


i. 1 A/i/il/V? 1 vji 


GAAUAATATA 


AGCCGAGAGA 


AATTGCTCAA 


120 




TTrt ZX TY2 f*21 TV 


1 AAAAGAGAA- 


AGTGATTTAT 


AATGCCATAC 


AACGATGTrA AAATAAAATA 


1B0 


IS 


r r r ^^^*^p^pr ^n^^ 
AAAUVjI lAl 1 


1 LiviAAlliAI 


WTV* T\ T\ 7\ T\ C*f*C* 

TTGAAAAGCG 


CCTTAGGACG 


TGAATTGAAT 


TATAACGTGT 


240 






iuul I IviAvJA 


TTTGTTATAA 


ATTTTATGTA 


TAGTATACTG 


GTATTATAAT 


300 


20 


VxAJtl/irvtGGX 


GAATTATTGT 


GAGAAAAAIA 


CCTTTAAATT 


GTGAAGCTTG 


TGGCAATAGA 


360 


nftl XAlAAlb 


1 i t-lJTAAGCA 


AGAAGGCTCG 


GCAACAAGAT 


TAACCTTAAA 


GAAATATTGT 


420 




CCAAAATGTA ACGCGCACAC AATTCATAAA GAATCGAAAT AAATACATTC GAAATAATAC 480
TTTGATAATA TGTTCAAAGG ATTTGGAGGT TGAGCAGATG GCTAAAAAAG AAAGTTTCTT 540
TaAAGGCGTT AAGTCTGAAA TGGAAAAAAC AAGTTGGCCG ACGAAAGAAG AGCTATTTAA 600
ATATACTGTA ATTGTAGTTT CTACTGTTAT ATTCTTCTTA GTCTTTTTCT ATGCCTTAGA 660
TTTAGGAATT ACAGCATTGA AAAATTTATT ATTTGGTTAG AGGAGTGAAG ACATGTCTGA 720
AGAAGTTGGC GCAAAGCtTG GTATGCAGTG CATACATATT CTGGATATGA AAATAAAGTT 780
AAAAAGAATT TAGAAAAAAG AGTAGAATCT ATGaATATGA CTGAACAAAT CTTTAGAGTA 840
GTCATACCGG AAGGAAGAAG GAAACCTCCA GTAAnAAGnT GGCCAAGCCT AAAACCGCCT 900
GTTAAAAAAA ACCATTCCCC TGGGnTAnGG TTTTAAGTGG GAATTTAA 948



(2) INFORMATION FOR SEQ ID NO: 681: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 863 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 681:

ACAnATAATA ACAAAGCGCT TGCTAGTACC TCTTAAAAAG ATGATGCTAG CAAGCGCTTk 60
TCTATACTAT ATATTATTTT TCTAAAATTT TAACACCCTC TTGAGTGCCT ACAATAACTT 120
GATCTGCCAT ATCTAAGAAG TATCCTGTCT CAAACACACC TGTCAGATGA ATTAAATACT 180
TATCAGTTAT AAATGCGACA TCTTCGTTTA CACGACGTTC TACTTTTATA TCAGCGTATG 300
ATTCAATTTT ACGTAATATG TGATACCAGT TAAATTTATC CACCTCTACT GGTAACTTAA 360
ACGTCTCACC TAAGTATTGA ACTATTTTCG TTTCATCGAC AACCACAACA AAACGCGATG 420
CCATTTCATC TATAACTTTC TCTCTGAACA GCGCACCACC GCCACCTTTA ATTATATTTA 480
AAGATGGATC TACTTCATCA GCACCATCAA TTGCTAAGTC GATATGATCA ACATCATTGA 540
TTTCACATAT TTTAATACCT AATTCTTTTG CTAAAAATGC AATTTTATTA GAAGTGCATA 600
CACCTGTAAT ATTGTAACCA CGTTCTTTAA TTAGTTGCGC CATTTGAGGT AAGAGTAATT 660
CCATTGTACT TCCTGTACCA ATTCCCAGCG TCATGTCACC ATTGATTkGA CTTAAAACAT 720
CATTTAATCT CATTAACTTG AGTGCTTTGA CATCTTTCAT GAAGGTAGCC TCCCATATTT 780
AtGTAATCTA TTCAATTCAT ATTTTACATG ACTCGTATAA ATTAACATAC CCTTATnGCT 840
AACCATTTGT GTTAAACATA TCG 863
(2) INFORMATION FOR SEQ ID NO: 682: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 480 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 682:

TTCATTTTTG TTGTTCAGCA ATTTTGGTCC GAATTTCAAT TCATCAGATG ATAGCTCTAT 60
TAATTCTGTA GAATATTCTG CtACAACAAT TTCATAAATA TGGCCTTTTT CTTCCATTAT 120
TATTTCATCa ATTATTTCAT AATTCAATTG TTGTAATGTT TGTCTTAAAT TTTCAGTTTG 180
GATATTACTT TGTAAAATCA ACCTTGGATG TTGACTTAAC TTATCTTGCC CATCTTTTAA 240
AATTTTAGCA ATAAGTGGTC CGCCCATACC ACAAATTGTG aTATTATCGA TTACGTCCTC 300
AGGTTGAATA ACACTTAAGC CATCCCCTAA ACGTACATCA ATTCTATCTA CTAATTGGTT 360
TGCAGCTACA TTTTTCACAG CAGCTTGAAA AGGGCCTTGA ATAACTTCTC CAGCAATAcC 420
GaTTCGCATA AATGGTTTTG AATTGCATAG ATTGGCAAAT AAGCATGATC TGAGCCAATA 480

(2) INFORMATION FOR SEQ ID NO: 683:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 689 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 683:

CTGCAAAAAA TATTGGTATA ATAAGAGGGA ACAGTGTGAA CAAGTTAATA ACTTGTGGAT 60
AACTGGAAAG TTGATAACAA TTTGGAGGAC CAAACGACAT GAAAATCACC ATTTTAGCTG 120
TAGGGAAACT AAAAGAGAAA TATTGGAAGC AAGCCATAGC AGAATATGAA AAACGTTTAG 180
GCCCATACAC CAAGATAGAC ATCATAGAAG TTCCAGACGA AAAAGCACCA GAAAATATGA 240
GTGACAAAGA AATTGAGCAA GTAAAAGAaA AAGAAGGCCA ACGAATACTA GCCAAAATCa 300
AACCACAATC CACAGTCATT ACATTAGAAA TACAAGGAAA GATGCTATCT TCCGAAGrTT 360
gGcCCAAGAA TTGAACCAAC GCATGACCCA AGGGCAAAGC GACTTTGTTT TCGTCATTGG 420
CGGATCAAAC GGCCTGCACA AGGACgTCTT ACAACGCaTA AcTACGCACT ATCATTCAGC 480
AAAATGACAT TCCCACATCA AATGATGCGG GTTGTGTTAA TTGAACAAGT GTACAGAGCA 540
TTTAAGATTA TGCGAGGAGA GGCGTATCAT AAGTAAAACT AAAAAATTCT GTATGAGGAG 600
ATAATAATTT GGaGGGTGTT AAATGGkGGa CaTTAAATCC mCGTTCATTC mATATATAAG 660
ATATATCACG GTAATTGCGC ATATAACTT 689

(2) INFORMATION FOR SEQ ID NO: 684:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 858 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 684:

TTATTAAATT GGTATGTGTT CATTATACAT ATGaCAAATA TGaATGTAAA CCGATAATTT 60
AGATTTTTTG GAATAACCTG AAAATTCAAG TTaTAGCGTT GCTTATATTT TAAAAGGTGG 120
TGATAATGAG ACTTTTTGAA AAATAAAATT CAAAATACTT ATAGCATAAT CAATATGCAC 180
ATTAAATAAA TGTACTCTTT TAATGCGTTG ATAACTGTAT TTGTAATTTA GAGAAGGGGT 240
GTTCACTATG CTGATGTGTT AAAAAATAAA ATAAAAAGGA CACCTCGATG CTATAAATAT 300
TAGCATCGAG ACGCCCAGTT AATGTCTATT AAATTGAATA TAGTCTCGGA CATGAATCAA 360
TGCCCTAGGC CCTGCAATGT TATATTGACA GTAGTTGACT GAATGAAAAT GACTTTGTAG 420
CTAGCTTTTT TCAATCCTTG TCGGTGCAAC ACATAGAGAA ATTGGATTCC TAATTTCTAC 480
AAACAATACA AGTTGCGGAA TAAGTCCCAA TATAGAAGGT GACAGTAAGC CAACTTACAA 540
TAATGTGCAA GTTGGTCGGG CCTCAATACA GAGATTTTCG AAAAGAAATT CTACATATTA 600
GCCACCTTGT TTATTTAAAT CGATAACACG GTTTGCGATT GTATTGATAA ATTCAAAGTC 720
ATATGAAGTA AAGATAATAG AACCTTTGAA TGATTTAAGT CCATCATTAA CAGCAGTAAT 780
ACTTTCTAAG TCTAAGGGTT GTGGGTCATC AAGTAAAAGA CGTTGCCCTG ATAACATCAT 840
TTACTAGCAn CACGACTT 858
(2) INFORMATION FOR SEQ ID NO: 685: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 685:

TTAGTATTTT CAGGnTGGAA ATTGATGTTT GAAATTCACT GAAATGGGCC AGAACCTgAA 60

ATGaAACTTT ACCAACAATT TGGtCTTCAT CAATGAGGCC AAACGCACGG CTATCTTTAC 120 

TTACTTCACG ATTATCTCCA AGCACTAAAT ATTTACCTTT TGGAATGACA TTTGATTTAG 180 

GATTCGCATT CGGTAAATCT TTAACTTGGA AAGTCCCAGT AATGTAATCA CCTTGTTTAT 240 

GTTTTAAATT GTAGTTTAAA TATGGTTCAT CTTGTTTTTT ACCATTGACA TATAATGTAT 300 

CATTTTTGTA TTCTACTTTA TCACCAGGAA CACCGATGAC ACGTTTAACA TAGTCATCAT 360 

TTTTGTTTGC ATGGAAGACA ACTACATTAC CTTTTTCCAA ACCACCTGTT TTATATCCAA 420 

CAATGTTTAC AGCTACTCGC TCGCCATCTT TCAAAGTTGG ATCCATTGAT TCACCTTTAA 480 

TTGTATATGG CGTAACAATA AATTTACCTA CTATAAATAA AATGACAAAA GCGACTGCAA 540

TTGAAATAAT CCATTCCAAT ATTTCTTTTT TCAATTTTGA CACCTCTTTT TAAGATTTGA 600 

ACTGAACAGT CCATTTTGAA AAAGGATAGT ATCGTAAACt AACATTACCA ATAATAtCCT 660 

TTTTATCGAT TAAACCAAAT TGTCTTGAAT CGTGCTTGTk ATTATCTTGA TCATTTAGCA 720

CAACAAAATT GTTTGGCGGA ATAATATCAC CATCTAATTC TTTAAAATTG CGCAAACTAA 780 

AATCTTTAAT TTTTCTGTTC TTGGCATAAG ATGCGTCAAC CGGTCGGTCA TCACGGTATA 840 

ATTGTCCCTG ACGAAACGCC ATTGATTGAC CAGGTTTGGC AATAATTCGA CTAGTATATA 900 

TCTCGTTACC ACGCCTATAT GTAATGATAT CACCATTATT CAATTGATTA AATGTAACTT 960 

TAATTTTATT TACAATAACA CGATCCCCTT TGTTAAGGGT TGGTGACATA TCATTATTCG 1020 

GAATGACATG ACCAACTATT ACAAAAGTTT GTACGAACAG TACAATGATA ATAGCAAGTA 1080 

TCAATGAAAT CAAATATTTT ACAACTTTTT TCACGATGTC ACTCCTTTTT CGATCCCATA 1140 
ATCCCTCTTA AAGGATGCGT TAAAATTGTA GTAATTTCCT TACCTAAATA ACCTAAAATA 1260

ATTGTTGAAA CTAACTTTGA TGATGCCAAA ACAATGAAAT AATATTTAGG TCTAATATGA 1320 

GATAGACTCG CTACAAAATT TATTAATGTA TTTGGCGTAA AAGGAAAACA AAGTAAAATA 1380

AACAATGGGA TTAATCCTTG GCGATCAATA AAACTAATCA AGCGTTGAAC AGCAGTACGT 1440 

TGTTTAATTC GCTGCATCCT CTCAGTGTTC ACCAATCGTT TACAGATCAA ATAGACTGTA 1500 


AATGTTCCAG AAATTAATCC AAGCCAACTA ATCAATATAC CTAAAATAGG TCCATAAGCT 1560 

TGAATGTTAA TTAAAATATA GAGTGCTAAA GGAAATACTG GAATTATAGC TCTAATATAT 1620 

AACAATATAA ATCCAGGTAA ATAACCAAAC TGTCGAAATA TCTCAAACCA TTCTTCTACT 1680 


TGATGAAACG ACAAATCATC AATCCCTTTC TTTGGTTGAA GATAATTATT CTTACATTAT 1740 

AAAGTTA 1747 
(2) INFORMATION FOR SEQ ID NO: 686:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 645 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 686:

CGTAAAGATT ACTATTTAGG AGGGTGACTA TGAAAAAGAA ATTAGGTATG TTACTTCTTG 60
TACCAGCCGT AACTTTATCA TTAGCCGCAT GTGGGAATGA TGATGGAAAA GATAaAGATG 120
GCAAGGTAAC AATTwAAcGa CaGTTaTCCm TTGCAATcAT TTgCAGAGCA AATTGGTGGA 180
AAACACGTGA AGGTATCATC AATCTATCCA GCAGGGACAG ATTTACATAG CTATGAACCA 240
ACACAAAAAG ATATATTAAG TGCAAGCAAG TCAGACTTGT TTATGTATAC AGGGGATAAT 300
TTAGATCCGG TTGCTAAGAA AGTTGCATCT ACTATCAAAG ATAAAGATAA AAAACTGTCT 360
TTAGAGGATA AATTAGATAA AGCAAAGCTT TTAACTGATC AACACGAGCA TGGTGAAGAG 420
CATGAACATG AGGGACATGA TCATGAGAAA GAAGAACATC ATCATCATCA TGGTGGATAT 480
GATCCACACG TATGGTTAGA TCCTAAAATT AACCAAACTT TCGCTAAAGA AATTAAAGAT 540
GAATTAGTGA AAAAAGATCC AaAACATAAA GATGACTATG AGAaAAACTA CnaAAATTAA 600
ACGACGATCT TAAGAAAATT GATAACGATA TGAAGCAAGT TACAA 645



(2) INFORMATION FOR SEQ ID NO: 687:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 956 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 687:

TTTGTTACTG CTTCTAAAAT AATATCCTTT AATTGTTTAA CATGTTGGAT TGTCATATGA 60
GGTGATGGTA CATTAAAAGG ATTTAATTCA TCTATTTGTG CATATTGATT TATGACATCT 120
TGATGCATTG AAATAGGGTT GATATCATTT GTTACTACTT TATTAGATTG GTCTTGTGAC 180
ATACTAATGG TGCCACCAGT ATGAATAACA AGTAGATGTT TCATATATTT CCTCCTATAT 240
TTAATTTACC TAATTATGAT AAAATATTAT TCATAAAACG ACAAGGAAGG GAAATGACGC 300
ATGAAAGCCA TTAATATTGC ATTAGATGGT CCAGCTGCTG CCGGAAAAAG TACAATTGCG 360
AAACGTGTAG CCAGCGAACT ATCAATGATT TATGTCGATA CAGGAGCAAT GTATCGTGCA 420
TTAACATACA AATATTtAAA ATTAAACAAA ACTGAGGACT TTGCAAAACT AGTTGACCAA 480
ACAACATTAG ATTTAACTTA TAAAGCAGAT AAAGGTCAAT GTGTCATTTT AGATAACGAA 540
GATGTAACAG ACTTTTTAAG AAATAATGAT GTGACGCAAC ATGTTTCATA CGTTGCATCT 600
AAAGAGCCAG TACGTTCATT CGCCGTTAAA AAACAAAAAG AGTTAGCTGC AGAAAAAGGT 660
ATCGTAATGG ATGGTCGCGA TATCGGAACT GTAGTGCTAC CAGATGCAGA TTTAAAAGTA 720
TATATGATTG CATCAGTTGA AGAGCGAGCA GAAAGAAGAT ATAAAGATAA TCAATTAAGA 780
GGTATCGAAT CAAATTTTGA AGATTTAAAA CGTGATATTG AAGCTCGTGA TCAATATGAC 840
ATGAACCGTG AAATATCACC ATTAAGAAAA GCAGATGATG CAGTGACATT AGATACGACm 900
GGCmAGTCGA TTGAAGAAGT TACTGACGAn ATTTTAGCGA TGGTGAGTnC AATTnA 956



(2) INFORMATION FOR SEQ ID NO: 688:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1166 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 688:
AAAAGGGAGG AAAGAGAACA GTTAAATATG AATACAAAAT AATTTTGTTT TCGGACAGCA 60 
GGGGTATTAG ACGCGATTGA CAATGTCTGT TTAATTAAAC GTAATGTTTA TTTAAGCGAT 120

GAATATTAGG TGAAAAGTTT TTGAATTTGA ATGTAATTGA GGTTTATTGA TTAGACATTT 180 
TATTGAATTG CGTGTTATTA TATAAATGTA AAAATAAGAC GACATGCGCG AACATGTCGT 240
TAATGCGGAA TGGTTTTTTT ATTTTCCGCT AATTGAAATA AAAATGACGT TTTAATATAT
TATGGGCTAG GTGGTTTGTA AGAAAGGGTT AGTTATTAAT GTTTTATGAA TTAAGGAAAT 
TTGAGTTTAA GGTTTAATCA ATTGTGATTT TGTTGATGAA GCGTTTAGTT AGAGTATTTT 
CGCCACCACT AGTTACTCCT TCTCCCACTT TACCCGAGAC TGGAGAAGAg CTATCTGAAG 
AATAAATAGA TACTTTTTGG CCATTTTGTA GTAAACCAAG ACCTTTTAAC TkCTCGGTTA 
GAGAATTCCA TGTATTTTTA GCATCTAGCT TTTTGTTAAA GTCAyCGTAG ACATTTTCCT 
TAGTTAAATC AATTTGTTTT AATCCTTTAA AGTCTATAGA TTGTGTTAGA TGGCCTCCAT 
CATCATTTCc AGGAGCAGAA ACGCTGCTAG AATATCCATT GCTTAATAAA TAAGTAACGT 
TGATTGTTTC GTACTCGTTA CTTAAAATAA TATCAGAATC ATGTAAgAAT CTTTAACTTT 
TTTCCATAAT TGACCATCTG TCATTTTTTC TTCTGCTTTA GCCGTTTTAA CAACTTTATT 
TGTATCTAAT CCTAAGTATG AAGAATGTAA GCCTGTTCCT AATGTTGTTA ATACTAAAGC
ACTTGCTACT aATGTTTTAC CTAAAAAttT TGTATTCATT TTTATTGCTC CTTtTTTTAT 
ATTGTAAACG TTTACAATGA AAATATAATA ATAATTTTTT AAAAGAACAA TTAACTAAAT 
ATCAAAAATG TATTAACTAT CTATTAACTA AAAAATAGAA TAATTTTTTA ACATAGTTTT 
GTTGTTTTGA ATTATAAAAA CTAAAG 
(2) INFORMATION FOR SEQ ID NO: 689: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1083 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 689: 
AGCACTGAAG GATGGCTAGT TGTCATAGCA TATGCGGTCA TTGGTATAGT TATAACGAGC 
GTTTCTTATA TGTTGTCTAT TAAAATTTTT AACAAACAAG AACTAkAATT GTCGATTTTG 
GTGATGATAG TATTTGAAAT AATATTTAGA GCAGGTGATA AATCTTTACG ATTGTCATCT 
GTTCTTTTkG GTGTGGAATG AAATGTGGGG GATAAGTATA GGTGACATAT CTATATTGAT 
TTATTTGTTT TGAGGTGGTT ATGTTGTGTG GGAATTATTT CCTTTTAGAT AGCGGGGATT 
AGAGGATATA TGTTATTTAT AAGTATCATT TGATGATTGT ATAGGCTAAC GATTTCCTCG 
GAAATATTTA AAAACCTCGA TCATGTAGCA TAACTGAAGT TTGTCACAAA AGTATAATGT 
GAAGTTCGAC ACTTTTGGaT TCAGTTCAAA TACTTTGACC GAGGTAAATA CTATTTATTC



TGATACTGAG ATAATCATTA CATGGTCGTG ACCTTTAAAT AAAAGGCTGA CAATATAAGA 600 

CATAACGAGT ATACCTAGTG AATATGAAAT ATACTTCGCG TTTGTCAGTT CATTATGGAA 660 

ATAAGGCGTG ATTAACCATA ATCCAATATA GAATATTAAA ACACTGATAT ACATCATATT 720 

AATTTCAAAC AAGTCATTTA GTTTATTGTT ATTACTAAAA ACAATTGCAG CATTAATCAC 780 

ACCTAAAGCG ATATTGATTA ATAGATGCGT ATACGATAAA CGGAAACCGA TAGATGTTAA 840

TTTATGATTA ATATAATTTT CAGTAATGAT CCAATATACA CCGAAAAGAC TAATTAAAAT 900 

CATAAATTGG AATATATAAA TGTAACTAAA ATGATCAATG CTAAATGATG ACGAAGCTAA 960 

ACCAACCAGT ACCTCGCCAr AGwTAtAATT GTTAGTAACG AAAAACGTCT ACTAAATGCA 1020

TCATATTAAC AGGTnTAATA CAAGTATTTC TGAAATGGAA TAAGnCTGTC GCTGCATGAT 1080 

ACG 1083 
(2) INFORMATION FOR SEQ ID NO: 690:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 690: 

TTAATCATCT GGATGTATTT AGTATTnAGA ATAATAAnAA AACGATCATG TTGTATtTGA 60 

GTAGCGATTG GTTTGCGGAA TTAGGCTTTA CTTTCTTTAA TTACCACTAT ACAGCAAAGT 120

TGATTAAATC ATCCTATAAT TTGAAATGTC TACTATTAAA ATTGACATAT CGATACCTTG 180

ATAATCAGCC TCTTAATGAC GCTGATAyTA GAAAATTACA GGATATTATT AAAATCATTG 240 

CAAAAGAAGC AAGTATGGAT AAAAAGATTG CACAAAATCA ATATCGATAT GCGTATTATG 300 

GTGATTTGCG TGATGAGCTC GAATATATTT ATCAAAATGT AAATCAACGA TTGACATTAA 360

AAAGTGTCGC TGATAAATTA TTTGTCTCAA AGTCAAATTT GTCATCACAA TTCCACTTAC 420 

TTATGGGCAT GGGTTTTAAA AAATATATTG ATACTTTGAA AATnGGTAAA TCGATTGAAA 480

TTCTACTTAC TACTGATAGT ACTATTAGCA ACATAAGTGA nCATTTAGGT TTTAGTAGTA 540

GCTCCACTTA CTCTAAAATG TTTAAAAGTT ATATGGATAT CACACCGAAT GAATATCGTA 600 

ATTTATCAAA ATATAATAAn TGTTTAC 627 

(2) INFORMATION FOR SEQ ID NO: 691:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 641 base pairs 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 691:

TTGAACGTAA TGCTAGCAAA TGACTTTGTG CCATAAAATA TTCTTCCCAT TTGATTCTTT 60 

CCAAGATGTT CACCTTCCAT ACTTAAAATT TAGTAACATT TTCTAATAAT ATAAGATTAA 120 

TCACAAAAAA TAAAATTTGC AATTAAAATA ATCCATTATG TCGTGAAATA AGATTTCAGT 180 

TTATCAAAAG TTTTACTTCC AAAACCTTTT ACTTTTTTCA AATCGTCAAT TTCTTGAAAT 240

GCACCTTGTT GGTTGCGATA TTCAACAATT GCATTAGCTT TAGCTTGCCC AACTCCAGGA 300

ACAGACATCA ATTCTGATAC AGATGCCGTA TTTAAATTTA CTTTAGTATT ATTTGTGTTC 360

CCaTTTTTTT CGTGCACACT GTTTACTTCA ATTTGTGGTT CAACATTCTT TTGTCCTTTA 420 

TGAGGTATGA AAATCATTTT TTGATCTGTT AATTTTTCAG ACAAATTAAT TTGACTTACA 480 

TCTGCATCCT CCAATAATTG TGCTTTATCA AGTAAATCAA CTACTCTATC CTTAGATGTC 540

ATTTTATAAA CATTAGGATG TTTAACAGCA CCTTTTACAT CGACATATAC AGGACCCTTA 600 

TTTTTGGAAT TATCTCCATC TTTGACCTGG ACATCTTCTA C 641 
(2) INFORMATION FOR SEQ ID NO: 692: 



(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 631 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 692: 

TATTAAAGnA CTTGATGAAC CAAATCATAA AAAGCnATAT ATGTTATTTG CAGCTGGCAT 60 

TGTGTTnGCA ACTATTTTAC TTATTTCGGC ACATTTATAC AGCAGAAAGA GAGGTAACCA 120

AGTTTGAGAA TCATAAAGTA TTTAACCATT TTAGTGATAA GCGTCGTTAT CTTAACCAGC 180 

TGTCAATCTT CCAGTTCTCA AGAATCAACT AAATCCGGCG AATTCAGAAT CGTACCAACA 240 

ACTGTTGCAT TGACAATGAC ATTGGACAAA TTGGATTTAC CAATTGTCGG CAAACCCACG 300

TCATATAAGA CATTGCCTAA TCGTTATAAA GATGTACCGG AAATTGGTCA ACCAATGGAG 360 

CCGAATGTTG AAGCTGTTAA AAAGTTAAAA CCAACACATG TTTTGAGTGT GTCAACGATT 420 

AAAGATGAAA TGCAACCATT TTACAAACAA TTAAATATGA AAGGCTACTT TTATGATTTT 480 

GATAGTTTAA AAGGGATGCA AAAGTCGATT ACACAATTAG GTGaTCAATT TAATCGTAAA 540

GCAGCTAAAC AAAAGAAACA TCCCAAAGTA T 631




(2) INFORMATION FOR SEQ ID NO: 693: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1111 base pairs

(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 693:

AATTTAACTA TGTTTTCCAC ATTGTTTCAT GTCACGAAAA GGACAACGCG CGACTATAAG 60
TATCAACTAT TTCCACAAGT TTTATTGGTG TTTTTATTAT TCATCGATAC GCTTCATTTT 120
CATCTCTCCA ACACAAAAAA GAAGCTAAGC AACTTATGTT GCCTAACTCC TCTATACTAT 180
CCATATTTTA CTATTATCCA TATTTCATTG AATTATCTAA TGTTGGcTTC TATTTTTTCA 240
ATATTTCTAC CGTCAATGAC GTCACTCATG CGATTTGTTT GTAATTTTTT ATTAAGTTCA 300
AACGTATAAT AGCCGCCATC TTTCATTATC ACTTTTATCT TACTATCTTT AGGAAACTTT 360
TTATACAGAT CAAAATTTTG AATTAAATAC TGTCTCAATT TAAAGTCGAG TTCTTTAAGT 420
GAAATCTCTT CTTTATAAAT GTAGTGTACT CTACCGTACG TAGCAATACC GTCACCTTCA 480
TCTCTCTTGA TTTGAAATCT TGGTGCGTTT ATATAATCAT AATAAGCGTC TTGATTTTTC 540
TTAGTGACAC CACCATATGA AAACACTGTG CCATTACGGT TTTCCGCTTC TTTAACAACA 600
AATATGTCTA ATCCCGGATT TTTACGTGCT TTAAATCTTT CAATATCTTT ACCAAATATC 660
TGTACTCTTG TGAATTTTCT ATTTTTATCA AAGATAAGGT AATGCTTGCC ACCTTTGCTA 720
TAACGATAAC CAGTAACATT TTTAAGTTCC TTACTTGCGC CACTATAGTA ATCTCTTAAG 780
TCAAAGATAT CTTTTGTCAC ATTTTCATAT TTTGCTTTAT GTTCACTCGC ATTTACAGTT 840
TGATGCAATG ACGTTATTGT TCCTGTTGCT AAAATACCTA ATGCTAAACT TGCTTTCGCA 900
ATTGCTGTCA TTTTCATAGT TGTATGCTCC ATTCGTAATT ATTAGATTTG TTCGCTTACG 960
TCTATTGAAT CATACAGCTT TATTATAGTT AGCGTATTTG ACCTTTCACA TTAAACCATG 1020
TTTAATAATC ATTGAATCAT TATTAAGTAA ATTAAGGAtC TATAATGTTC GTTAAATAAA 1080
CTGAnCCCGT TGTGCTTCAC ACCCGnTnGA T 1111



(2) INFORMATION FOR SEQ ID NO: 694: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 426 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 694:

TTATGGATGG ATTAAGAGGT CGTGTTGAAA AAATCAACGA TAACTCTGTT ATTGTTGACT 60 

TAACAATTAT GGAAAATTTT AATGACCTTG ATTTACCGGA AAAAACTGTT ATCAATCATA 120 

AACGATATAA GATTGTTGAA TAAGAAGGTA AGTTATAATG AATAAAATCT CGAAGGCTTT 180 

AACTTGGTTT ATTATAAGTT TCATTATATT TCATCTCATA TTATTTATTA TGTGGGGCGA 240 

ACACCAAGAA TACTGGTATT TATATACAGG TATAATGCTA ATTGCTGGTA TCAGTTATGT 300 

ATTTTATCAA AGAGATATTG AATCTAAGCG GTTGCTTACA TCAATTGGTG TTGGTATTAT 360 

TACGGCAATT ATTTTAATTA TGCtTCAACT TTTATkCTCA CTTATAACTT CTAATTTAAG 420 

TTATAG 426 
(2) INFORMATION FOR SEQ ID NO: 695:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 737 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 695: 

AAAAGGGGnG TAAGGTTTAG CTCAAGTACG AGAAGTCTTT GGTGATGAAG CAATTGATGA 60 

AAATGGTGAG ATGAATCGTC GTTATATGGG TGATCTAGTG TTTAATCATC CAGAAAAACG 120 

CTTAGAATTA AATGCTATCA TACATCCTAT CGTGCGAGAT ATTATGGAAG AAGAAAAGCA 180 

AGAATATTTA AAACAAGGAT ATAATGTAAT CATGGATATT CCATTATTAT TTGAAAATGA 240

ATTGGAAAAT GCAGTAGACG AaGTGTGGGT TGTATACACT TCTGAAAGTA TACAAATGGA 300 

TCGTTTAATG CAACGTAATA ATTTGTCATT AGAAGATGCG AAAGCACGTG TCTATAGCCA 360

AATTTCTATT GATAAAAAAA GCCGAATGGC CGATCATGTT ATCGATAATT TAGGGGATAA 420 

ACTTGAATTA AAACAAAACC TTGAGAGATT GTTAGAAGAA GAAGGTTATA TTGAAAaGCC 480 

GAATTACGGA GAAGAAGATT AATATTACAC TATAAATAAG TCATTACTTT ACGTACGCGT 540 

TGATGTATGT AAGTAATGAC TATTTTTTAT AAAAAAGATA AATAAATCAA CGGAAAACGC 600 

TTTCAAATTT CATATAATAT GCTATACTAA TTCCATAAAG TATAACACAt AAAGATCAAG 660 

GGGTGCTTTT AATGTCAACG AATATTGCAA TTAATGGTAT GGGTAGAATT GGAAGAATGG 720 

TATTACGTAT TGCATTA 737

(2) INFORMATION FOR SEQ ID NO: 696:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 432 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 696: 
AACGATAAGA ACAAAAGATA TATTACAAAG CGTTTATTTA AAACGTTATT TATTACGCGC 
GATGATGGCA GGATTTATTA TCGGGATTAT TACGGTCTTC GTATTATCAG TTAAAGCAAC 
ACACGAACCA GATTTACCGC CAGgCATTGT GAATATGGCC AGTGCCATTA CATTCAGCTT 
TGCGTTAGTA CTCATTTTAT TTACAAACTC CGAACTACTA ACCAGTAACT TCATGTACTT 
TACTGTAGGC CTGTATTmTA AAGTAATTAA ACCAACTAGA GTATTGaAAA TATTTTTATT 
ATGCTTTGCA GGAAATATTT TAGGTGCTGC TATTTyATTT AGTTTCATGC GTTTTTCAAA 
TGTAATGACG CCAGATAwGt TAAAyCAGTT ATCAGCAGTT ATAGAGCATA AAACGTTGTC 
TACTGGTTTT GT 

(2) INFORMATION FOR SEQ ID NO: 697: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 782 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 697:

CTTTTATACG AAAGTTAAnA TCAAATATAA CATTAATGTT TGATGGGGAT TTTGCGGGTA
GTGAmGCAAC ACTTAAAACA GGTCAAAATT TGTTACAGCA AGGGCTAAAT GTATTTGTTA
TACAATTGCC ATCAGGCATG GATCCGGATG AATACATTGG TAAGTATGGC AACGATGCAT
TTACTGCTTT TGTAAAAAAT GACAAAAAGT CATTTGCACA TTATAAAGTG AGTATATTAA
AAGATGAAAT TGCACATAAT GACCTTTCAT ATGAACGTTA TTTGAAAGAA CTAAGTCATG
ATATTTCGCT TATGAAATCA TCGATTTTGC AACAAAAGGC TTTAAATGAT GTTGCACCAT
TTTTCAATGT TAGTCCTGAG CAATTAGCTA ACGAAATACA ATTCAATCAA GCACCAGCCA
ATTATTATCC AGAAGATGAG TATGGCGGTT ACATTGAACC TGAGCCAATT GGTATGGCAC
AATTTGACAA TTTGAGCCGT CAAGAAAAAG CGGACnAGCA TTTTTAAAAC ATTTAATGAG
AGATAAAGAT ACATTTTTAA ATTATTATGA AAGTGTTGAT AAGGATAACT TCACAAATCA

TATCAGTGAT GCTGTGCAGT ATGTTAATTC AAATGAGTTG AGAGAAACAC TAATTAGCTT 720

AGAACAATAT AATTTGAATG ACGAACCATA TGAAAATGAA ATTGATGATT ATGTCAATGT 780 

TA 782

(2) INFORMATION FOR SEQ ID NO: 698: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 714 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 698:

AAATCATTGG CAAAATATAC GAATTGCATT TAAATCGTCA ATTTACATAT ATTTTTGCGT 60
AGTAATCAAT CGTTATCATT GTTATTTATC GTTACATTAT TTCGAGTATC AGTATGTATT 120
TCGGGCTTCG TTTGATAACG ACATTTCTTT GTGACATCGC TTCATCAGTG TAACAACAAA 180
TACAATGATT TCGTGATGTT AGTTACCCAT TTTATGTGTT GCATAAAATA TGTTGTTATA 240
AAACATTTTA AATCATTTTA TATAAACAAT CTATATATTT TTGGCATTTC CAAAATATCA 300
CTTGTTATAT TAAAAACCGA CAAGACATTT TATCTTATCG GTTGAAATTT GTTATTGTTA 360
TTTGTAATGT TTTTAGGTTT CTTTTTAATA TAATATATTT CAGTGAAAAT ACATGATTGA 420
TTGTGATTTC ACTGAAACAT GGTTAATTGC GTTGTTGATG AATAACTTTA GCATAAATAT 480
AGGAAGTTAT TTTGTACATC GCCATATATA GAAACGAAAT TATAATGACA GCTAGTACGT 540
AACTTGTTAA AAATATATGA TGGTTATTAA TACCTATCAT ATTTAGTAAC GTATATACAA 600
TGTTACTAGA AATTAATGTG TGAATCAGTG CTACTGTTAT TGGTATtGCG AACAAGAAAG 660
TCATTTGATT TCGTGTTATC TTTGCTATTC TTCCATTATC TAAACCAAGT TTTT 714



(2) INFORMATION FOR SEQ ID NO: 699:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 699:
ACTGAGAGCA ATAATTTAGT AACTTCTACT CAAGGAATTA TTAAAGAAGC ATTGCATAAA 60 
TTGGGATTTG nnTTTAATTA AAGAACCTTT AAGAATGTTA CAAGTGCGTA TCCCTGTACG 120
TGGACCAACA AAAGGGGGCG TGCGTTTCCA CCCAGATGTT GATGAAGAAG AAGTAAAAGC 240

ATTATCAATG TGGATGACTT TGAAATGTGG CATTGTGAAA CTTACCATAC GGTGGTGGTT 300 

AAGGGTGGTG ATCGTTTGTG GATCCACGTC AAAnGAGCAT TCCATGAAGT TGGAACGTTT 360 

A 361 



(2) INFORMATION FOR SEQ ID NO: 700: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 943 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 700:

GTCCTTAATT GGTATCCACC TTTAGCACCG CGTACACTTC GAATTAACCC CGCATTTCTT 60
AAAGGACCTA CAAGCTGTTC TAAATATAAA TCACTCAAAT TATTTTCTTC AGCAATTGAC 120
TTTAATGATA TACATCCTTG CCCCTCTTTT TTAGCAAGAG AAATCATCAA TGTAAGTCCA 180
TATCTCCCTT TAGTAGAAAT TTTCATTGTA TAACCTCACT TAATTCGAAT ATTGATATTC 240
CCATTTTAGC ATTTTTTGAG TTAAGATAGT ATAAGAAAGG TGTGACAAAT GTGAGTACAG 300
AACCATTAGC ATCGAGAATG CGCCCAAAAA ATATAGATGA AATCATTTCC CAACAACATT 360
TAGTTGGACC AAGAGGCATT ATCAGAAGAA TGGTTGATAC AAAAAAATTA ACTTCAATGA 420
rrmTATGG TCCACCTGGT ATAGGCAAAA CAAGTATTGC CAAAGCAATT TCGGGCAGTA 480
CGCAATATAA ATTCAGACAA TTGAATGCTG TAACTAACAC TAAAAAAGAT ATGCAACTTG 540
TTGTTGAAGA AGCTAAAATG TCTGGTCAAG TTATCTTGTT ATTAGATGAA ATACATCGAC 600
TAGATAAAGC TAAACAAGAC TTTTTATTAC CTCATTTAGA AAATGGCAAA ATCGTCTTGA 660
TCGGTGCTAC AACTTCAAAT CCTTATCATG CTATCAATCC AGCGATTCGT TCAAGAGCGC 720
AAATTTyCGA GTTATATCCT TTAAATGACG AaGATGTGCG CCAAGCGTTA ACTCGTGCAA 780
TAGAAGATGA TGAGAATGGT TTGAAAmCAT ATCaACCCAA AATTGATGAA GATGCCATGA 840
CCTACTTTTC TACACAAAGC CAAGGTGATG TTCGTAGTGC GTTAAATGCA TTGGAATTAG 900
CTGTATTAAG CGCAGATAAT GACAAAGACG GTTATCGACA TGT 943



(2) INFORMATION FOR SEQ ID NO: 701:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 445 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 701:

CATTAAGTGA AGTTGTTGAT ACACCCATGC ATCAAGTCAA TTGTTCTGTT GATTTAGATA 60

CAGAAAGCTT ATTAGGCTTT AAAACAATTA AAACAAATGC GGAAGGTCAA CAAGAAATTG 120

TCTTTGTAGA TGGTCCAGTT ATTAAAGCTA TGAAAGAGGG GCATATTTTA TATATTGATG 180 

AAATAAATAT GGCTAAACCT GAAACATTGC CTGTATTAAA TGGGGTCTTA GATTATCGTC 240

GTCAAATTAC GAATCCATAC ACTGGTGAAG TAATCAAAGC TGTACCAGGA TTTAACGTTA 300 

TAGCAGCGAT AAATGAAGGT TATGTTGGtA CTTTGCCAAT GAATGAAGCA CTAAAAAAaT 360 

CGCyTTGTTG TTaTTCACGT JcGATTATaTT GATGGGGaCA TTTAAAAAAT GTGAnTAAGG 420

AGCAAGGTTT ATTACAAGAT GTTAA 445 
(2) INFORMATION FOR SEQ ID NO: 702:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 752 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY : linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 702:

TGCAAGATAA ACAAATTTGT CATTGTTTTG GTTGTAAAAA AGGTGGCAAT GTTTTTCAAT 60
TTACTCAAGA AATTAAAGAC ATATCATTTG TTGAAGCGGT TAAAGAATTA GGTGATAGas 120
TTAATGTTGC TGTAGATATT GAGGCAACAC AATCTAACTC AAATGTTCAA ATTGCTTCkG 180
AyGATTTACA AATGATTGAA ATGCATGAGT TAATACAAGA ATTTTATTAT TACGCTTTAA 240
CAAAGACAGT CGAAGGCGAA CAAGCATTAA CGTACTTACA AGAACGTGGT TTTACAGATG 300
CGCTTATTAA AGAGCGAGGC ATTGGCTTTG CACCCGATAG CTCACATTTT TGTCATGATT 360
TTCTTCAAAA AAAGGGTTAC GATATTGAAT TAGCATATGA AGCCGGATTA TTATCACGTA 420
ACGAAGAAAA TTTCAGTTAT TACGATAGAT TTCGAAATCG TATTATGTTT CCTTTGAAAA 480
ATGCGCAAGG AAGAATTGTT GGATATTCAG GTCGAACATA TACCGGTCAA GAACCAAAAT 540
ACTTAAATAG TCCTGAAACA CCTATCTTTC AAAAAAGAAA GTTGTTATAC AACTTAGATA 600
AAGCGCGTAA ATCAATTAGA AAATTAGATG AAATCGTaTT ACTAGAAGGT TTTATGGATG 660
TTATAAAATC TGATACTGCT GGCTTGAAAA ACGTTGTTGC AACAATGGGT ACACAGTTGT 720
CAGATGAACA TATTACTTTT ATACGAAAGT TA 752



(2) INFORMATION FOR SEQ ID NO: 703:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 703:

nCGTTTTTGT nAAACnGCTT ATATGTATAA CTTTTCCCAA TTTATCATAA GTTTTGATAG 60
AAGGAGTTGG ATCGCTAAAG AAAATATCTC TAAAGATATC ATCTTCAATG ATAGGAATAT 120
TATGTCtTTC GCtATAAGTA ATTATATTTT TCTTTTGCTC ATTCGTTAAA GAACGACCTG 180
TCGGGTTATT AAACCTAGGT TCTATATAAA TCGCTTTATT TTTAAAATTA ATAAATCTAT 240
CAATGATGGT ATCAATTTCA TTAATTTGAT TATAAGGAAC ATCAATATGT CTAAAATTCA 300
ATTGCTCAAA AACATTTGTA GAGTGAATAT ATGATGGTGT ATTCGAAATT ATTATGGCAT 360
CTTGACCTAA AAACCCAATA GATAAAAGTT GAATGGCATG TAAAGCGCCT GAAGTGATCA 420
TTACATTTTC TCTACCTACA TTTATACCTT GCTTTGACAT TCGTTCAACG ATAATATCTC 480
TTAACTTGAT ATAACCATAG CCATTATTAT AACCAAAAGA TAAGTCTTCA ATATGACTGG 540
CTGTATTAGA CATGGCTTTT TTCAATTGAA TATGTGGCAT TAACGATATA CCCAATTCAC 600
CTTTACTTAT ATGTATATAC GAATCATCTG TCTCAATTTT ATTAATTAAT TGCACCGTAT 660
ACTGACTTCT TTGTTGAGAG GACCATAACA TCATTTCAGA CCACTTATTT GTAATATGTG 720
CTTCATTCAA ATAGTCATTA ACATATGkTC CACTACCTAC TTTAGTATAG ATAAATCCTT 780
CAGCTTCTAA TAACTCAATA CTTTTAATAA TCGTTACTCT ATTTACGTTG 830



(2) INFORMATION FOR SEQ ID NO: 704: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 704:

AGTGGTGTTG GAAAAGCTAT TATGAAATTA TTACGTGAAC AACAAGTTTA ATAAAAAAAG 60
AGGGGTCAAA TATGAAAGGA TTAATTATTA TTGGCAGTGC ACAAGTGAAT TCACATACAA 120
GTGCACTAGC AAGATACTTA ACTGAGCATT TTAAAACACA TGATATTGAA GCGGAAATAT 180
TCGATTTAGC AGAAAAACCG TTAAATCAAT TAGATTTTTC AGGAACAACA CCGTCTATTG 240

TATTAGGAAC GCCAAACTAT CATGGTTCAT ATTCTGGAAT ATTGAAAAAT GCATTAGATC 360

ATCTAAATAT GGATTATTTT AAAATGAAAC CTGTAGGCTT AATAGGAAAT AGTGGTGGTA 420 

TTGTTAGTTC AGAGCCATTG TCACATTTAA GAGTAATCGT CAGAAGTTTA CTAGGCATTG 480 

CTGTACCAAC TCAAATAGCA ACACATGATT CTGATTTTGC TAAAAATGAA GATGGTTCAT 540 

ATTACTTAAA TGATAGTGAA TTCCAATTAC GAGCAAGATT ATTTGTCGAT CAAATTGTAT 600 

CTTTTGTGAA TAATAGTCCA TATGAACATT TAAAATAATA TTAAnAAATA TGTAAATnT 659 



(2) INFORMATION FOR SEQ ID NO: 705: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 705: 



nCACCATATA GTAACTGGCA CCAACTACAT TACCGTCTTT TAAAAAGATT TTTTTATAGT 60 

TATTATCAAC ACTATTAAAT ATTTCAATAC CTTTAATTTC TGCATTTTCT ACAATTTGAC 120

CAGCACTATA CAAGTCACAC CCAGAAACTT TTAATGACGT AAATGTTGTT GATCCCTTGT 180 

ATCCGTTCGT TTCTTTATTT GTTAAATGAT CAGCTAATAC TTTACCTTGT TCATATAGTG 240

GTGCAACGAG TCCATAAACT TTGCCGTTAT GTCTGCACAT TCACCAACTG CATATACATT 300

GCTATCACTT GTTTGCATCA CATCATTGAC AACAATACCA CGATTACATC TAGACCTGAT 360 

CTTGGCACTC CTGGGAAGGC GGAACCACTG CATACAACTA 400 



(2) INFORMATION FOR SEQ ID NO: 706:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 380 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 706: 

TTAAAAATAC AGCTACAGGT AATTTTAATG ATTTTTCATC AATATCAAAT TTGGGATTAT 60 
GGTGTGGCGC TGTAATACCT TTAACCAGTC AGAAAGAATG CACCTGGTCG TACTTTCAAA 120 
TAATGTGAAA AATCTTCTCC AATCATCATT AAATCTGATT CATTAAAGCG TACATGTAAG 180
TCATTTGTTG CTTCTTTAAT AACTTGGATA TGCTTTCTCG TTTATTATGG ACAGGCAAAT 240 

GCTTAATCCA TTTTGTCCAT ACATGATTCT GTATATCTGA AATCGAAAGT TCTnACTGTA 360
CCTTTACAAA ATGCCTTGnn 380


(2) INFORMATION FOR SEQ ID NO: 707:


(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 707:

CAnAAAGGAT CAAAGTGATC GGTATAGTGA TAATTAGCTC CAAAGAAAGA ATATTCTAAA 60
TCAAATCCAT ACCAAGCAGA AAGTATTAGC GAATATCAAT TTAAATGGTA CCGATTCAAA 120
TAAAGAAACA CGACATATAG AATTTTTACT TGATGATTTT AGTGAATCAT ATGAACCAGG 180
AGATTGTATA GTAGCATTAC CGCAAAACGA CCCTGAATTG GTTGAAAAAC TAATATCCAT 240
GTTAGGTTGG GATCCGCAAT CTCCGGTGCC AATTAATGAT CATGGTGATA CAGTTCCTAT 300
TGTTGAAGCA CTAACATCAC ATTTTGAATT TACTAAATTA ACATTGCCAT TATTGAAAAA 360
TGCAGATATC TATTTTGACA ATGAAGAATT ATCTGAnCGT 400


(2) INFORMATION FOR SEQ ID NO: 708: 




(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 447 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 708:

TAATGCTGGT ACGGGTCATG CAGCATTATG TGAGTTGAAC TACACAGTTT TACAACCTGA 60
TGGTTCTATC GACATCGAAA AAGCGAAAGT GATTAACGAA GAGTTTGAGA TTTCAAAACA 120
ATTCTGGGGT CACTTAGTGA AAAGCGGTAG CATCGAGAAC CCAAGAGAAT TTATCAATCC 180
ATTACCACAC ATCAGTTATG TTAGAGGTAA AAACAATGTT AAATTCTTAA AAGATCGTTA 240
CGAAGCGATG AAAGCTTTCC CTATGTTCGA TAATATCGAA TATACTGAAG ACATCGAAGT 300
AATGAAAAAA TGGATTCCAT TGATGATGAA AGGCCGTGAA GATAACCCTG GTATCATGGc 360
GGCAAGTAAA ATTGACGArG GTmCAGATGt AAmCTyCGGT GAATTAACAC GTAAAATGGC 420
TAAAAGCATT GAAGCACATC CAAATGC 447



(2) INFORMATION FOR SEQ ID NO: 709:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1448 base pairs 

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 709: 

GTGTCTACCT GTTTTTGTTG TTCAATTGTT AACTTTTCTT TTTGAATAGT AGTATTCGAT 60
AATTCTTTAT CGCTTATAAA ATCATCTAGT GGATGGCTTT CTTTGAAACT TTTATTTTCA 120
GTCATCAATC ATCTCTCCAA TAGTTGCTAT TTTACATCAA TCTTGATGAT GTTTTTGATG 180
TAACTTATTT AATTCCATTT CAATATCTAA ACGTTCATAA TCATCTTCGT TGAGACGCTT 240
TAAATCAGCG ATTAATGTTC GTTTGACCTC ATCCAAAGTA ATTCGTGTTT GTTCTAACTT 300
TTGCTGTTCA TTAATTGATT TTTTGGGCAT TTTTGCTAGA CGTGTATATG CATCAACCAA 360
ATTTAAAGCA TTATCAATAT GAGAATAAAA AAAGCCTTCA ACTTTATAAA ATGATGCAGG 420
TCTCTGTCTA ACTGTCGTAT AAATAGAACG TGAAATTTGG TATATATCAT TAATCTGCCT 480
AAAATCTTTA ATTGATCTTA TATTGACATA CGTTTTTAAA ATACCTCTAA GTTTTTGGTG 540
TGTATGATTT AACTGATTTC GAATATAGCG ATAGTCTTTT CTAGTCAAAC CAATTTCGTT 600
TAAATATTTG CGTGAAGTGA GTTTTTGTAT CGGTAGGTAT GTCATTAAAA AGCCAACAAT 660
ACCAATAGAC ATATCAATTA AAAAAGATAC ATCAAGTGCA ATCATCCCAA ATATGCTTGT 720
TAAAAACGCT ACAGGAATTC CCACTAACAC CCCAAATATA TGAGAAATAT TATATCTCAC 780
TGTCATCTTC CTTTATTTAG CATTTTATAT TGATCGAAAA TCCGATAATT TTTGATTTAG 840
TTCTAACTCT TCAAGTTGAT GGCTTGTTAC ATTTGATGCT GGTGAGGCAC CTTCAATTAC 900
ACCTTGAATA AATCTCTCTA TATCTGCGTC ATCCCCTTGT GCATATATCT CTACATAGTC 960
ATCTACATTT TGAACAGTAC CGACAATGTT ATAGTTCATT GCAATGCGTT GTGTAAAATA 1020
TCTAAATCCG ACGCCTTGAA CGCGTCCGAA TACTTGTAAA TGTATATGtC TCaTTTTTAC 1080
CACCTCATAA TGTTATTATA CGTAGTTTTA CTTAAAAAAA CTAATAATTA CTATAGTTAC 1140
TACTTTGTTT GTTTCAAGTC GTCAAACTTG ATTTTCAGAG GATAAAGGTA TAAAAATAAG 1200
TATAGAGTTT TTGAAGTATG GAAGGGGTCT TTAATAATGT GGACAGTTAC CAAAATTAGA 1260
GCCGATTATG AGGGATGGTG GTTATTCAGT GACTGGCCAG AAAACATTGT TGAAAAATAT 1320
CAATATCAAG ATTTTGATGA CATGTTTAAG CACTATCAAC AATTGATTAA TCAATGTAAA 1380
GTTCAGTTCG ATAACTATGT CACAGGCAAA TATAATATTT ATGCATTTTA TAATAATTGT 1440



EP 0 786 519 A2 



(2) INFORMATION FOR SEQ ID NO: 710: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 710:
GCTGnAAAAT ATGTTAGAAG CAGATGCATG GGCAAAGCTG GGGTCCTTAT TTATTGTCAG 
GTAAAGATGT CTTCAATTCA ACTATTGGAA TATATGGTAT GGGAGATATT GGTAAAGCTT 
TTGCAAGAAG GTTGCAAGGG TTTAATACTA ATATTCTTTA TCATAATCGA TCAAGACATA 
AAGATGCAGA GGCGGACTTT AATGCAACAT ATGTTTCTTT TGAAACGTTG TTAGCAGAAA 
GTGATTTTAT CATCTGTACA GCGCCACTTA CAAAAGAAAC ACATCATAAA TTTAATGCTG 
AAGCATTTGA ACAAATGAAA AATGATGCAA TTTTTATTAA TATCGGTAGA GGACAAATTG
TAGATGAAAC AGCATTAATC GATGCACTAG ACAATAAAGA AATTTTAGCA TGTGGTTTAG 
ATGTATTAGC AAATGaACCG ATTGATCaTA CACATCCATT aATGGGaCGT GaTAATGtTC 
TGaTTaCACC aCACATTGGG TAGGCGCATT CAGTTAACn 
(2) INFORMATION FOR SEQ ID NO: 711: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1349 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 711: 
CTATTACAGG TGGCGAACCA ATGTTTTCTA AAAAGTCTAT TAGAAATGTT GTTAAACCTC
TATTAAAGTA TGCACATCAT CGAGGTATAT ATACACAAAT GAATTCAAAC CTAACATTGC 
CTCAAGATCG TTATTTAGAT ATTGCTGAAT ATATCGATGT TATGCATATC TCACACAACT 
GGGGAACAAC TGATGAATTC GCAAATGTTG GCTTTGGCGC AATGAAGAAG CAACCACCGT 
TAAAAGCTAA GTTAAAATTA TATGAACAAA TGATTTCGAA TGCACGTACA TTATCAGAAC 
AAGGAATGTT TGTATCTGCG GAAACAATGC TCAATCAAAG TACGCTACCA CATTTACGAA 
AAATACATCA AGAAGTCGTT CATGATATGA AATGTAGCAG ACACGAGATT CACCCTATGT 
ATCCAGCTGA CTTTGCAAGT CAATTAAATG TGTTAACTCT AGCGGAAATG AAAAAGACAA
TGTTTCCATG CTTAAAGGAT GATGAAGATC AAAAGTTACT ATCACGTTTA AGAAATGCTA 600

AAAATGTAAC GACTAGAAAT GACCCGGATG GCCGTAGTCG TTTAAATGTC AATGTATTTA 660 

CAGGTAATGT AATCGTAACT GATTTCGGAG ATGAAACAGG TACAATTTCG AATATACAAA 720 

AAGATAAATT AACAGATGTA TTTGATAAAT GGTTATCCTC TGATCTTGCT AAATCATTAA 780 

ATTGTCATTG TTCCGAGTTT AGTTGTTTAG GGCCAAATGT TCTTGTTAAA AATATGTACT 840 

ATCCGAATAT GGATTTTAAA GATAATGAGC GTCATATGCA CAAACAACCA CAAATTATAC 900 

AATTTTAAAA ACTCTTAATT ATGCGGAGAA GCACTTTATC GATAAGTAGT CTCCGCATAT 960 

TTTAATGCTA TTATAAAATA AAAAACAATT AATTGCTGGC AGTACTCTAC TTAAATAATA 1020 

AAGGGCATTT AATAGGACTA ATAGTCTATA ATAAAAGGGG TAAATTTTAA CTAAAAGCAT 1080 

AAACGTGCAT AATCAAAAAG ACAGaTTGTA GGTGGAATAT TCGAaCATAA CAGTTCAATT 1140

CATCCTTAAC AATCTGTCTT TATATTTTTA GTCTCTAATA TGTTGCACTT GAGCTAAATA 1200 

TTCAATTGTT gTTTACTTtC AaTGCGaCGt GCTTttCTtC GTtCAaCaCG TgwGGTGCTG 1260 

TATCATAAAA CCATTTTTCA ACATCATCTT CTGGATATAC ACCAGGTACA TGTTTAGGTT 1320 

GCCTTCATCA TCTAACGCAA CAAATGTAA 1349
(2) INFORMATION FOR SEQ ID NO: 712: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 377 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 712: 

GAAATTAAAA AAGCAATTGG ACAAGATGCA ACAGTGTCAT TGTTTGATGA ATTTGATAAA 60 

AAATTATACA CTTACGGCGA TAACTGGGGT CGTGGTGGAG AAGTATTATA TCAAGCATTT 120

GGTTTGAAAA TGCAACCAGA ACAACAAAAG TTAACTGCCA AAAnCAGGTn GGCCTGAATG 180 

GAACCAGGAG GAAnTTGAAA ATATGCTGGG GATTACATTG TGAGTACAAG TGAAGGTAAA 240

CCTACACCAG GATACGAATC AACAAACATG TGGAAGAATT TGAAAGCTAC TAAAGAAGGA 300

CATATTGTTA AAGTTGATGC TGGTACATAC TGGTACAACG TCCTTATACA TTAGATTTCA 360 

TGCGTAAAGA TTTAAAA 377 

(2) INFORMATION FOR SEQ ID NO: 713: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 528 base pairs 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 713:
AATGAAnCGA AAAATGACTA TAAAAAGTTT AAAGTGTTTT CACTTATTTC AACACTTGTC 60 

ATTGTCATTT TAGCAATTAT AAGATTTGTT CATAAAATGA TGTAATTAGA GTGAGACATT 120

GTTTTATGTC TCAGGATCCA GTTATTCATT ATATCTACAA TATTTACGAT TATATAAATA 180 
ACCCGAGATT TTAGTATGAT TCATTtCACT AAAATCTCGG GTTTCTATTT GATAATTTTT 240

AATGGGATAT GGCATGTATA CGTTCTTGCC TTTTATCTCA TTTCCAATGA TTAATCTGGA 300
TATTGTTCTA AAAATGCTTT CGCTTCTTTA TTAACTGTTT TAAAATCAAT ACCTTGTTGC 360 
ATCGCTGCAA AGACACATCC ACAATAACAC TGCCTAAAGA TATTATAGTC ATTACACATT 420 

TCTATGGaAC GCTCATAACC TTTACTTTTC yTAAAATCAC TTGGcAAATA GTTCACaTCG 480 
TATATTTTTT GGACATCCAT ACCAAGTTCA TTGATTAATT GTGCGTTC 528 
(2) INFORMATION FOR SEQ ID NO: 714: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 731 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 714:

TTTAATGGCA TGAAGTCACA GTATATAAAG AATCTGGTGT GACAATCAGT ATGACTAAGT 60
ATATAAAAAA AGAGCAGGTG CGAAATAATG GCGAAAGAGT CGAAATCAGC TAATGAAATT 120
TCACCTGAGC AAATTAACCA ATGGATTAAA GAACACCAAG AAAATAAGAA TACAGATGCA 180
CAGGATAAGT TAGTTAAACA TTACCAAAAA CTAATTGAGT CATTGGCATA TAAATATTCT 240
AAAGGACAAT CACATCACGA AGATTTAGTT CAAGTTGGTA TGGTTGGTTT AATAGGTGCC 300
ATAAATAGAT TCGATATGTC CTTTGAACGG AAGTTTGAAG CCTTTTTAGT ACCTACTGTA 360
ATCGGTGAAA TCAAAAGATA TCTACGAGAT AAAACTTGGA GTGTACATGT TCCGAGACGT 420
ATTAAAGAAA TTGGGCCAAG AATCAAAAAA GTGAGCGATG AACTAACCGC TGaATTaGaG 480
cGTTcACyTT CTATcAGTGA AATAGCTGAT CGATTAGAAG TCTCAGAAGA AGAAGTGTTA 540
GAAGCAATGG AAATGGGACA AAGTTATAAT GCGTTAAGTG TTGATCATTC CATTGAAGCT 600
GATAAAGATG GTTCAACTGT TACGCTATTA GATATTATGG GGCAACAAGA TGACCnTTAT 660

CGAGAAATCA T 731
(2) INFORMATION FOR SEQ ID NO: 715: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 830 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 715:



TTTTAGrGGa AAGCGaATTA GtCaTATwCg CaGCAGATwG aATGATTGaA AAmGaAATTG 60
aTTCGATTCC AATTGTAAGA AAAAAAGATA ATCAAAAGTA TGAAGTAATT GGAAGAATTT 120
CCAAAACAAC AATAGCTAAG TTATTAGTAG CATTATATAA AGAATAGGTG AGAAGTAATG 180
GAAAAAATTA AAATTATCGT AGCTTCAGAT TCTATAGGTG AAACGGCAGA GTTAGTTGCT 240
AGGGCAGGTA TTTCACAATT CAATCCTAAG CAATGTAAAA ATGAATTATT AAGATATCCA 300
TATATTGAAT CTTTTGAAGA TGTTGATGAA GTGATTCAAG TTGCAAAAGA TACAAATGCT 360
ATCATTGTTT ATACACTTAT TAAACCTGAA ATGAAGCAAT ATATGAGTGA GAAAGTAGCA 420
GAATTCCAAT TGAAGTCTGT CGATATCATG GGGCCATTAA TGGATTTATT ATCTGCTTCG 480
GTTGAAGAAA AACCTTATAA TGAGCCAGGT ATCGTTCATA GATTAGATGA TGCATATTTC 540
AAGAAAATTG ATGCGATAGA GTTTGCAGTT AAATATGATG ATGGTAAAGA TCCTAAAGGA 600
TTACCTAAAG CTGATATTGT TTTACTTGGT ATTTCGAGAA CTTCAAAGAC ACCATTATCT 660
CAGTATTTAG CGCATAAGAG TTACAAAGTT ATGAATGTAC CGATTGTACC AGAAaGTGAC 720
ACCGCCAGAT GGCTTATATg GATATTAATC CAAAGAAATG TATCGCACTT AAAATAAGTG 780
AAGAnAAATT AAATCGCATT AGAnAAGAGC GACTAAAACA ATTAnGACTA 830



(2) INFORMATION FOR SEQ ID NO: 716: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1159 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 716:
nGAAGAACAA GTTTAnCATC TAAATGCCCG TTTAAAACTA CAACTACTTG ACGATGTTAA 60

ATCAGTGTTT AAnTCTCAAA TGACGCAAAA TAGTGATTTT AATGAAGAAA AGAAAGTGTC 120 

AGAACGTATA AAAAAATACT TTAATAAGCA ACTCACTGAG CAAATTGCAC CAATCGTTCA 240
ACAATTAGCA GATTTACATG TCATTATTAA TCCTCAGTTT AACTTTGAAT CAGCTAATAT 300
AGAGCAACCA TTATTGCACA TCGATTTCAA CGATATGCTA AATGCATTGC CTAAACAATT 360
AACAAAACGT AAAATTTTGA ATCCAAATGG GCAAAGAGAT ATACATGAAT CAATTTGTCA 420
AAGTACGTTA GGATTATTAC AACCACAAAT GGGATTATTG AGGCAACAGC TTGAATTATA 480
TGTAAAGCAA ATGGCTGTAG AAGCTGAATC GCAATTTGAA AGTTTTGAAG CTAATATTCA 540
AACGCAAATA AACGATTTAT TAGCATTTGA TTTAGATACA ACACTTATCA ATCAATTGAA 600
AGATAAACAT CAACAACTGA AAACTATTTT ATATTAAGAa AGAAGGAACG TTTTAAATGC 660
CTAATAAAAT ATTACTTGTA GATGGTATGG CGCTATTATT TAGACATTTC TATGCTACAA 720
GTCTTCATAA ACAATTTATG TACAATTCAC AAGGTGAACC TACAAATGGA ATACAAGGAT 780
TTGTGCGTCA TATCTTTTCG GCAATACATG AAATACGCCC TACACATGTA GCTGTATGTT 840
GGGATATGGG ACAATCAACT TTTAGAAATG ATATGTTTGA tGGTTATAAG CAAAAkCGTT 900
CTGCACCACC AGAAGAATTG ATACCACAAT TTGATTATGT TAAAGAAATT TCAGAGCAAT 960
TTGGCTTTGT AAATATTGGC GTTAAAAACT ATGAAGCGGA TGATGTTATA GGTACATTAG 1020
CACAACAATA TTCAACTGAT AACGATGTCT ATATTATTAC GGGCGACAAA GATTTACTGC 1080
AATGTATTAA TGACAATGTT GAAGTTGGCT AATTAAAAAA GGTTTAACAT TTATAATAGA 1140
TATnCATTAC ATCGTTTTn 1159



(2) INFORMATION FOR SEQ ID NO: 717: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 574 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 717:

GATTTAATCA ATGCAGTTGC AGAGCAAGCT GATTTAACTA AAAAAGAAGC TGGTTCAGCA 60

GTAGATGCTG TATTCGAATC AATCCAAAAC TCACTTGCTA AAGGTGAAAA AGTACAATTA 120

ATTGGTTTCG GTAACTTTGA GGTACGTGAA CGTGCTGCAC GTAAAGGTCG TAACCCTCAA 180

ACTGGTAAAG AAATTGATAT CCCAGCAAGT AAAGTTCCAG CATTCAAAGC TGGTAAAGCA 240

TTAAAAGATG CTGTAAAATA ATTTTACTTA AAAAGCCCTG AATAAGGGCT TTTTATTTTG 300

CTTTTAATAC TTACAACCTG TACATAAATT GTAATGTTCT TCTAAGTTTT TAATCTTTGG 360

AACTACAAAA TATACATATG aATATTGaGa TTAATTGTTA GCGTTGaATT TACTTAAAAG 480

GTAACCATGT CTACTATAGT ATTTTACGTT ATTTAAAAAG ATGAATAATG TAAATGAAGT 540

AAAGGTTATT ATGAGAATTA CAAAAGCTAC ATAA 574

(2) INFORMATION FOR SEQ ID NO: 718: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 493 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 718:
AAGaTATTGA AAmTGmAaTG GaGCACGCCC TTATTTGATA GAAGTAAAAG ACATTTAATT 60
CTTACCGATG CAGGTCAAAT TTTTTATGAG AAAAGTAAAG AAATTGTtGC ACTGTATGAT 120
TATTTACCAT CTGAAATGGA ACGCTTGAAT GGACTGGAAA CAGGACATAT AAACATGGGC 180
ATGTCGGCAG TCATGAATAT GAAGATTCTT ATCAATATTC TTGGTGCATT CCATCAACAA 240
TATCCAAATG TTACATATAA TTTGATAGAA AATGGCGGTA AAACAATTGA ACAGCAAATT 300
ATCAATGATG AAGTAGATAT AGGCGTGACC ACTTTGCCAG TCGATCATCA TATTTTCGAT 360
TATACTACCC TAGATAAGGA AGATTTGCGA CTTATCGTGA GCAGAGAGCA TCGACTCGCA 420
AAATATGAAA CTGTTAAACT CGAAGATTTA GCAGGTGAAG ACTTCATTTT ATTTAATAAA 480
GACTTTTACT TGA 493
(2) INFORMATION FOR SEQ ID NO: 719: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1257 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 719:

GCATTGCCAA AACAATACGT TTATATTATT GAGGAACTAC TGTATAAAAG TAATGAATAT 60
CAAAATAAAA AATCATATTA CGAAACACTT GTTAACCAAG TAATTGAACT TAAACAGGCA 120
GATGATTTAA TTATTGGACT CGCTTATTCC GTACAACGCT TAGTCGTCGA TCATTTACAC 180
GTTGTCGGTG ATATTTATGA TCGTGGACCA CAACCAGATA AAATTATGGA TACACTGATT 240
AATTATCATT CCCTAGATAT TCAATGGGGT AATCATGATG TGCTTTGGGT TGGAGCCTAT 300
GATATTATCG AAGACGCTTA TGGCATTAAT TTAAGACCAC TGCTTACTTT AGCTGAAAAA 420
TACTATGACG CAGATAATCC TGCTTTTAAG CCTAAAAAAA GACCTGACAA ACACGAACGT 480
TTAACTCAAC GTGAAGAAAG TCAAATTACT AAAATTCATC AAGCTATTGC GATGATTCAA 540
TTCAAGTTAG AAATACCAAT TATTAAACGT CGTCCAAATT TCGAAATGGA AGAACGTCTT 600
GTGCTTGAAA AGGTTAATTA TGATACAAAT GAAATTACAG TTTATGGTAA TACATACCCA 660
TTGAAAGACA CATGTTTCCA AACTGTCAAT CGTGATAATC CAGCAGAATT ACTACCTGAA 720
GAAGAAGAAG TCATGAATAA ACTATTATTG TCATTCCAAC AATCTGAAAA ATTACGTCGT 780
CATATGTCTT TCTTGATGCG TAAAGGCTCT CTTTACTTAC CATATAATGG CAATTTACTC 840
ATTCATGGTT GTATTCCAGT TGATGAAAAT GGTGAGATGG AATCATTTGA AATTGATGGT 900
CATACTTACA GCGGCCAAGA ATTATTAGAT GTGTTTGAGT ATCATGTCCG TAAATCATTT 960
GATGAAAAAG AAAATACTGA TGACTTATCG ACGGATTTAG TTTGGTATTT ATGGACTGGG 1020
AAATATTCGT CACTATTTGG TAAACGTGCC ATGACTACGT TTGAGCGATA CTTTATTGCA 1080
GATAAAGCTT CTCATAAAGA AGAAAAGAAT CCGTACTATC aTCTTCGTGA AGATGTGaAT 1140
ATGGkTCGTA AAATGCTCaG TGaTTTCGGA TTAAATCCAG ATGAAGGACG CATTATTAAT 1200
GGTCACACAC CAGTGAAAGA AATCAATGGC GAGATCCTAT CAAGGCTGAT GGAAAGA 1257



(2) INFORMATION FOR SEQ ID NO: 720: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 720:
TGAGCACATG ACAGTGGCTG AAAACATTAA ATTTTTTAAA TCACTTTGTA AAAATCCAAT 60
TAACGATACA ACTATCAACG AATATTTACA GCAATTAAAC TTTGATGATA CGTCTGCCAA 120
AGTATCTACA TTGTCCGGTG GGAATAAACG TAAAATTAAT ATATTAGTAG GTTTACTAGG 180
TCAACCTCGA ATTCTCATTT TAGATGAACC GACAGTTGGT ATTGATTTAA AATCTAGACA 240
TGACATCCAC CAACTACTTA ACATCATGAA ATCTAAATGT TTAATTATAT TAACTACCCA 300
TCATTTAGAT GAAGTTGAGG CACTTGCAGA TGGTTATCAA GTTAATTGGG CCAGGTnCCC 360
TTTTnTTCAA CAGTTTTTGG GGGCCAACCA TGGGCTTATA 400
(2) INFORMATION FOR SEQ ID NO: 721: 

(A) LENGTH: 570 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 721:

CTTATTTATA TTATAATATA AATATTTTAT TTTTCAATAC TTAATTTTAA AAATCCCCTA 

TGAAAATGTT GATGAGCTAC ATCTTTAAAT CTTATTATAC ATTGTATAAA ATTATATTGC 

GAGGTAGTAA ATTGATATTA TACACTTTTG GTTGTGTATG CATCTCACCT ATCTTTTTTG 

CTTTTTTCAA ATAAGAAAGT ATTAAAAATy AATGCCTATA CAGGAACAGA CATGGCTAAA 

CTTGATTTAA ATAGTCTTGA CGACGAGCAC GTAAAATTAT TAATAAATGA ATTAAAATAT 

CCAGAAACTC ATATCGATGT AAATGAATTA AAAACAATAG TTGCTAGTCG AATAAATGAA 

AGGCAAGAAA TAATAAGTTT TAAGTTAGGA ATAAAGTACT TATTAACAAT AAAAAGAGGG 

AACATAGAAA AAGATAGGTT TTCAATTTCA ATCATTTTCA AAGATACCTA TCACACCCTA 

GTTAGAATAG ATATTAACGG TGGTACTCAC GATAATCCAG ATGGAACAAT CGCTCCGAAA 

AGTCATATTC ACATATATAA TGATAAGTAT 

(2) INFORMATION FOR SEQ ID NO: 722: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 722:

ATTTTAGTCT TAGGTGTTGA TTGCATGATG AATGCAGAAG TTATACCTGC AGCCATATTA 

GCACCTTCAT TATTGTGTAT AATTTGAATA TTATTTGCTT TTAAAGTATG TCCAATTAAG 

TTTGAAGTCG TTGTTTTACC ATTTGTTCCA CTGATAAATA CAATATCATC AACTTGCTCT 

GCTAATTTTC TTAATATATC TGTATCCACT TTTCTAGCGA TTTGTCCAGG TAAATCTGTT 

CCTCTTTTAC CTACTGCTCT ACTTGCTTTA CGCGCCAATT TCGCTAGATG GATTGCCGTC 

CACTGTCTCA TGTGTTTCCT CCtCAAATTt CCACTCGCAT CATTATAACA TGACAAGGCA 

ACTTCAAAAA AGTTTCTCAA TCACAAATnG ATACCAGTGT 

(2) INFORMATION FOR SEQ ID NO: 723: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1113 base pairs
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 723:

GATTATATGC TGTTGGCAAA ACAATATACC CAGATGAATT TTTATTTTTA GTAATAGTAT 60
AACCATGCTT TGTCTTACTA ACTTTGACAG CTTCTAATGA ATTTGAAGCG TCTTTAAGCG 120
TGGTATAATC TTCGCCGTAT ATCCCTTTTA AATTTACTCG ATACTTACCT TTAGGCAATG 180
ATAATCTAAT TCTATCTGGA GCTTTAATGC GTATCGTTAC GGGTGTTACA ACGCGTCGAT 240
ATTTATAAGT GAGTTTATTT CTTTCTTGTG TATATTCATT CACTTTAACA TCATGAGCTT 300
TATCCGGCGA AAGTAATTCT AAATCCATTT CAAAATACAA ATCTTTAAAT TGATTAGAAA 360
CTGATTTTGG CAACTGTACA GTTAGACCAC CATTATTTTG TTTAACTTGT AATAAATGTT 420
TTGTAGGAGA TTGCCAGGCT GCACTATTTA ATTTAATTGT TGAATCTGAT AGTAAATTTT 480
TATTGGCTTT AAAATGTGTA TTAACATCTT TAATATTGTT AGAAACAATC CCTTGCAACA 540
TTGCTTGTTC TTTATCTAAT GGAGATTTTA ATTCTTTATT GGAAAAGACC TTATTTGTAA 600
TATGTGCACT TGGATAATGG ATGGTATTTT TAGAATGAAT CCAACGAACT TTATTGTCTT 660
TGTGTTCAGA CTTAATTTTA AATCCATATG GtAAGTTGTC ATCATGATtC ACTCTAATTC 720
GATCATTAAC ATTCCAAAGT GATAGTaAAT TTgACGATgC CAAGTAATCT ATAAGTGCTG 780
TTTTTATCGA TTGGCATATT AATTTGGAGT GTCTTGTCAT AATATTTTAA AATGTCTCCA 840
TTAAAAATAC TAGAATATAA TGAAATGCCA TTATAATGAT ATATAAATGG TGAATTTAAT 900
GCATAGTCTG ACATATAATC AATGCGATTA AATGAGCCTG TTGCATTTTG ATTTATCTTT 960
TTTATAAGCT GGTTTACATA GTTACTATGG TAATCATGTT GTTTCAACGT TGATAATGAT 1020
TGTTGATAAG GTTTGATTGC CATGTTTTTG TTGTTATCTA AAATGACGAT TTGTTGAATC 1080
ATaACGATTA ATACTAATAT TGCAACGGTT AAT 1113



(2) INFORMATION FOR SEQ ID NO: 724: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 464 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 724: 
TGCCGTTTAG CAAAATCTAG TATCGCTTGA TGATCAGATT CTGAAATTTC AGTGTGTACT 60 

GATTGATTAA GTTTATATGC AAGTGCATGT TCTCGTCCAC CAGCACCAAT TACTAATACA 180

TTCATTTATG TTACGCCCCT TTAAAATTAG TGTTTAAAAT GTCGAGTGCC TGTGACTACC 240

ATTGCAATAC CATGTTTATT AGCCATATCA ATTGAATCTT GATCTTTAAT CGAACCACCC 300

GGTTGGATAA TTGCCTTTAT ACCATGTTGT GCTGCAAGTT CAACTGTATC TCCCATAGGG 360

AAAAATCCAT CAGATACTAA CGgCTACATG ATCATTGATT TCAATAGCTC TCTCTAACGC 420

AATTTTAGCA GCACCGACAC GATTCATTGG CCAGCAnCTA TACC 464

(2) INFORMATION FOR SEQ ID NO: 725: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1440 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 725:

GTTAAATnCA nTATTTnAAA TaTATTACCT TATTAGAAAA AGTCGTAATA TGAGGTGTAC 60
AAATGACGCA AATTTTAATA GTAGAAGATG AACAAAACTT AGCAAGATTT CTTGAATTGG 120
AACTCACACA TGAAAATTAC AATGTGGACA CAGAGTATGA TGGACAAGAC GGTTTAGATA 180
AAGCGCTTAG CCATTACTAT GATTTAATCA TATTAGATTT AATGTTGCCG TCAATTAATG 240
GCTTAGAAAT TTGTCGCAAA ATTAGACAAC AACAATCTAC ACCTATCATT ATAATTACAG 300
CGAAAAGTGA TACGTATGAC AAAGTTGCTG GGCTTGATTA CGGTGCAGAC GATTATATAG 360
TTAAGCCGTT TGATATTGAA GAACTTTTAG CAAGAATTCG TGCAATTTTA CGTCGTCAGC 420
CACAAAAGGA TATTATCGAT GTCAACGGTA TTACAATTGA TAAGAACGCT TTTAAAGTGA 480
CGGTAAATGG CGCAGAAATT GAaTTAACAA AAACAGAGTA TGATTTACTA TATCTTCTAG 540
CTGAAAATAA AAACCATGTT ATGCAACGGG AACAAATTTT AAATCATGTA TGGGGTTATA 600
ATAGTGAAGT AGAAACAAAT GTCGTAGATG TTTATATAAG rTATTTACGA AACAAGTTAA 660
AACCATACGA TCGTGACAAA ATGATTGAAA CAGTTCGTGG CGTTGGGTAT GTGATACGAT 720
GACAAAACGT AAATTGCGCA ATAACTGGAT TATTGTTACC ACGATGATTA CGTTTGTCAC 780
GATATTTTTG TTTTGTTTAA TTATTATTTT TTTCTTGAAA GATACACTGC ATAATAGTGA 840
GCTTGATGAT GCaGAACGAA GCTCaAGCGA TATTAATAAt TTATTTCaTT CTAAgCCTGT 900
TAAAgATATA TCTGCaTTAG ACTTGAATGC aTCTTTAgGT AAtTTTCaAG AGATAATTAT 960
TTATGATGAG CATAATAATA AATTATTTGA GACATCGAAT GATAACACAG TGAGAGTTGA 1020
ATATTTAATT ATTAAAGAAC CAATTACAAC GCAAGATTTC AAAGGGTATA GCTTGTTAAT
TCATTCACTA GAAAATTATG ATAACATCGT AAAATCATTG TATATCATTG CGCTGGCATT 
TGGAGTGATT GCAACAATTA TAACTGCCAC AATCAGTTAT GTATTTTCAA CACAAATTAC 
TAAACCGCTT GTCAGTTTAT CAAATAAAAT GATTGAGATT CGACGAGATG GTTTTCAAAA 
TAAATTGCAA TTAAATACAA ATTATGAAGA AATAGATAAT TTAGCAAATA CGTTTAATGA 
GATGATGAGC CAAATTGAAG AATCATTTAA TCAACAAAGA CAATTTGTTG AAGATGCGTC 
(2) INFORMATION FOR SEQ ID NO: 726:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 375 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 726: 
TGATATTCCG ACTTGCGAGG ATTATTGCGC ATTAAGGATT AACGCAGTAT GAGTCCAAAG 
ACGCGCGACA CCTGGAAAAA GGATTACCTA ATGCCTTATT TACAGTAACC TTGTATGATA 
AAGATCGGTT AATTGGTATG GGTAGAGTGA TTGGCGATGG CGGAACTGTT TTTCAAATTG 
TTGATATTGC AGTTTTGAAA AGTTACCAAG GTCAAGGTTA CGGCATCTAA TTATGGAGCA 
TATTATGCAA TATATTAAAG GTGTGGCTGT TGAGAGTACA TACGTTATCT GATTGGCAGA 
CTACCCAGCG GGnTAAATTA TATACCAAAT TTGGGTTTnA TACCTACCGA ACCAGAnTCC 
AGGCGGTGAT GTATG 

(2) INFORMATION FOR SEQ ID NO: 727: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 471 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 727: 
TAAAAAACTT AGAGATGAAA ATCATATTAT TTATCTGTTT TTGGGACAAT TTTTTACTAA 
AAATGAAGAT CCATGGCATC AAATACTTAA TGATTTAGAA GTTACAAATT CTGTTGATAA 
TTTTTTAAGG TCAATAAGTA ATAAGGCCAA AGAAACAAAA AAAAGAGCTT TTATTATTAT 
TGATGCGCTT AATGAAGGTG AAGGTAAAAG GTTATGGGGA AATTATTTTC AAAGCTTTAT
AGATGTTATA TTACCCAAAA ACGCAATACA AGATAACAAT ATTGTAGTAT TTCAGCATGA 360

AGGTTTTAGT AAGGAAGAAA ACTATAATCC AATTGTATCT TTTTGTGATT TTTATGGATT 420

AGAGCTACCT AAGTTACCTA TATTAAATCC AGAATTCAAC AATCCATTAT T 471

(2) INFORMATION FOR SEQ ID NO: 728: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 1750 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 728:

TAGGATGTTC CACCCTgGCA ATTGcTCTAA GACATCtACa GCAGTAGAAT ayCCTGTTTT 60

CGTCtTTTTA ATAACAGGTA ATTGTAATGT CTCAAACAAT ACAACACCTA ATTGCTTAGG 120

AGAATTTATA TTAAAATCTT CACCAGCTGC ATCATGGATA TTTCGAATCA AGACGTCTAA 180 

TTTTTCTTGA ATTTCTTTTT CCATTTCTTC TAAATCATGA ACATCTGTAA ATATACCAAT 240 

TTCTTCCATT TCACTTAAAA TCTTAGCTAG CGGTAGCTCT AAATCAGCCA AGAGTTCTAC 300 

CTGATTGTAT TCTTCTAATT GTTTATCCAT ATTTGGTTTC GCAAAGTAAA TTGCATCAGT 360 

AATAGAAGCA ACATATGGAT TTAAAACATC ATCTTCAGGT ACCTTAAATT TCTTACCTTT 420

TCCATATATA CTCACATCGT CTTTCACAAA ACTTTGACCG TACAATGAAA CAACTGATTG 480

AACATCACTA ATCGTACGAG ATGGATCAAT TATATAACTG GCCAACATAA TATCGAAAGA 540 

AATATTTTGA ATATCAATCC CCAATCTATG TGATGCTACA TATGTTTTTT TAGCATCATA 600

TACGACTTTT TTCGAATTCG GATTTTCTAA CCATGAAACT AGTTCGACAT AATTATTTAT 660 

GTCATCCGCA TTAATTACAA TATGTTTCTC ACCTGtAAAT aAAGAGAATT TTAAAAtATT 720 

ATTtCGCAAA TAGtTACCAC CGtCTAATTC GAAATGGATG GCCGCTTCTT TcAATGAAGT 780

AAAATCCAAT ATTATCAAAA GACGTTTCcA ATTTCAAATG TCTTtTCTAT TGCATCTTCA 840 

ACGcTtGCTG ATTGaTCAAT GTCAGCCAAC AATTGTTTGA ATTCTAACTT CTTAAACAAT 900 

TCGATTTTTT CTTGTTGTTC ATCTtGAkGA GTCATTAACG TATCTTCaAG TTTTACTTCa 960 

ATCGGACTAT CTACATTAAt CGTTGCTAAT TCTTTACTCA TTAATGCATC TTCTTTGCTA 1020 

TTTTGAAGTT TTTCTTTTAA CTTTTTACCT GAAATTTCAT CTAAATGTTC ATAGACACCT 1080 

TCTACTGTGT CAAATTGGTT TAGCAATTTT ATTGCTGTTT TCTCTCCAAC ACCTGCAACA 1140 

CCTGGTATAT TATCAGAAGT ATCTCCCATT AATCCTTTCA TATCAATAAT TTGATTAGGT 1200 

CCTTTTTTAG TGTAATAAAT GGTTACATTA TCCGTTGCAA GTTGTGTTAA ATCTCGGTCT 1320 

CCCGTAATAA TAATTGTCTG AAAGCCCGCT TTATCTGCTT CTTTACTTAA AGTTCCGATA 1380 

ATATCATCTG CCTCATAGTT ATCTAATTCA TAACGTTTAA TATGATAAGC ATCTAATAAT 1440

TGGCGAATAT AAGGAAATTG CTCACTTAGT TCAGGCGGCG TTTTCTGGCG TCCACCTTTA 1500 

TATTCACTAT ATTTTTCATG TCTGAAAGTC GTTTTACCTG CATCAAACGC TACTAAAAAA 1560 

TGATTTGGCT TTTCTTCTTT TAAAATCTTC TCTAGTAACA TTGCAAAACC ATATACTGCA 1620 

TTGGTATGAA TGCCTGCTTT GTTTGATAAC AAAGGTAATG CATAAaAAGC TCTAAAACTT 1680 

AAGCTATTAC CATCGaTTAA TACTAATTTA TTCACAATTT TAACCTCCAG AACTAATTTA 1740 

TATATnTGTG 1750 
(2) INFORMATION FOR SEQ ID NO: 729: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 439 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 729: 
AAATTAATTT TAACATCCTT TCAAAATAGT TTTAACGGAT CcTCCcAAAA CGTAAACTCA 60 
CATCAGTTAC TTGTAACATG CATTTTCTCC TTTTTTTCAT TCGATATTCT AACGGAAGAA 120 
TTATATCATA TTATCGTCAC AGTTTCGACC TCATATAAGT TGTAATGATA GAATGACTCA 180 
CACATGTTAT AATAATAAAG AATACAAGAA TCGAAGGAGA ATAAACATGG CATTAGACAA 240 
AGATATAGTA GGGTCTATAG AATTCCTTGA AGTAGTAGGG TTACCAAGGT CAACTTACCT 300

TTTAAAGGAC CAACGGTGAA ACGTAAAGTT AAACCAATCA GAATGAACGT GATGATTGAA 360 
TTAGAAGTAG GGGAGnATAT AGTTCCTCAT TTATCCAACC GGTCAGGTGG ATTATTGCAC 420 
TCAAATATGC nGnTATACG 439

(2) INFORMATION FOR SEQ ID NO: 730: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 481 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 





(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 730:
AGTTTAGGAT TGATGACAAA GCCATTACGA CCAACAAATC CTAAACCTGC ACGTTCTGCT 120

ACTGCCCTAT CTGATAATAC ACCCGTATCT ACCATAGATT TGATTTCAAC ATCTGGAACT 180

TTAGATTCAA TAAATGCAGC TAACATGTCT AATCGTTTAC GCATAATTGT ATGATAATCT 240

TGACCCCACG ATGCTCTAGC AAATAAGCCT CTGCGATCAC CTCTAACACT CTTAGGTGCA 300

CCTTTCAGTT TGTTAGGATA ACCAACTGCA ATTGCTATGA TTGACCTTGC TGTTGGTAAG 360

GATAATTTAG GCTCCGTTCG TAAAGCAATA TCAGATTCTT CAAaTCCTGA GGCATAACCA 420

TTTGCATGAT ATGCTTCTAG CTTTTGCTTC AATTCATCAA AGGGnTCGGC AGTAGTAAAT 480

c 481

(2) INFORMATION FOR SEQ ID NO: 731: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 828 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 731:



GTGATGAAAA ATCTTTTTCT ATTGATTGAG AAAATAGTTA TTTATATAGT TTTTAAAGAA 60
AAAAAGTGAT AATATTTAGC TAATCTAATG AAAATTGATT TATTGGACAA AAAATACATT 120
TTAAATGAGT AGAGGAGGCT GCAAwTGGGT TACAGAACTT TAAAAAGTAT TTTTCATGAA 180
CACAATGAAA GTAAAATGAA GGAAGAGTAT ATTAAAAGAT TTAATTCTTT mGCTTCTTTC 240
AATACTAATA TTAATATCAT ACCTATGGAA AATGGAAAAA AAGTTAATGA TTTGGAATAT 300
CCTCTATTCT TTATGGTGAC TAAAAATCTA TCAAAAAAAC AAGAATTAAT ATCAATTAAT 360
AGTAGAAAAA TTGATAGAGC ACTTAATTCT TTACCATATG CAGCTAGAGA ACAATATTTT 420
AATGATTTAT TAATCGATGA ATTACAAAGT ACTAATGAAA TTGAAAATGT ATTTAGTACT 480
AAACAAGAGA TTGCACATGC GTTAAATAAC CAAGCATCAG AATTTCTTAA GTTCAGAGGC 540
CTCGTGGATC AATATAAAGA GATAGAACTT AATAAAAAAA TTAAAGTTGA TAATGTAAGA 600
GACATTAGAG CGATTTATGA TAAATTAGTT TCAAATGAAA TTAACGAACA AGATAAGTTA 660
GATGGAGAGC TATTTCGTAA AAATTTTGTC GGTGTGCATG ATGGGTCAAC GAATAAATAT 720
ATACATGTTG GGTTACAACC TGAAACCAAA ATTGTTGAAT TTATAGGTGA AATGCTAACA 780
TTTTTAAAAT ATTTTGATGC GCCTCAGCCG TTCAAAATCA TGGCTAGT 828



(2) INFORMATION FOR SEQ ID NO: 732:

(A) LENGTH: 1622 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 732:

CGTATGTTTC GAATTTTATG nATTGCATTG GATAATAGTA CCAGTTAAAA GAACTAAGAT 60 

TGTAATTATC CCCCCTTAAA ATTCAAGAAT ATATTTTCTT AATCTATCAG CAAGAGGTTA 120 

TATTTCAAAG TGTTTATTTT TAAAACAACA TCTAAAAGCC ATTTcATAAt AAwGTTAAAT 180 

CaTTAGaATG TATAAAGATT CCaATTAATA AAAATAGATA AAATGCAATA AAGTTCCAAA 240 

TACTTATTAT TATTCATCAC AAATTACGTG ATGCCCTCTA CAACACTAAA TCAAGGATGA 300 

TATTTATTTT ATATACATTA GCATTCATTG TTTGCTTACC CACATTATCC TGTTTAGTCT 360 

TTTTTATAAC TCTAGTCTTG AATGTCATAT TTACAAAGAA AAAAACACTA AAATATTTAA 420 

AACTAACACT ATTTATTCAC TTTGTTCTTT TATAGCACCT TTTATGTTTA TTTTCTTGTA 480 

TTCAATAAAT AGTATGGAGA TGATAATTTA AAGTAGATAT ATTTGAAGTA TACTATAGAG 540

TAAATTATAT TGGGGAAATA TGTAAACGAA CCGGTTGATT TTATAGTGGA TTAACACTTC 600 

ATAGAGAATA TAATCAAGAA CAAAAACAGT CAATAGGTGT GAATTTTTTT AAAGACGGAT 660 

CTGTTGCTAC TAAAAAAAGA AGTATACAAT TCAATAGTTA AACCTTAAAA CAAGAAATAT 720 

TATTCAAAAT CAATGAATTT CCTATCTTAT TAGTTTTAAC AATATTTATT CTCATAGAGT 780 

TTGTCCAATT AAATATAGAT GATTCAAATA TTTATAAGCA TACAAAAGAG CAGTAAGACA 840 

TTTTCTAATA GAAAATAACT TTACTGCTCT TTGTACATTA CTCATCAACT ATTGTAAATT 900 

AAATAATAAA TAACTACCTA TTTTATTATT CAGCTAAAAA ATGCTCTGCT AAATATTTTG 960 

CTGCGCCATC CTCTTCATTC GTGTATGCCG TTACATCTGA AGTTAATGCT TGGATTTCAG 1020 

GGCGTGCATT TTTCATAGCA ACTGTATAAT GACCAAATTC AAACATTGCT CTATCATTGT 1080 

CGCTATCTCC AATAACTAAC GTTTCTTCTT GATGAATACC AAAATGTGCA ATCATTTCTT 1140 

TAATGCCTGT ACCTTTATCA GTTTGATAAG CCATTGTTTC CGCATTAAAT CTTGATGAAT 1200 

TTGArACACT AATCTGTAGT TGCACATGAT TTTGCTTTAA TTCATCTCTA AATGCTGTTA 1260 

TTTTTTCTAA ATTAGAACTG AATAGATAAA TTTTTGAATA TTCACCTTCA GGAAACTTAG 1320 

TTACCCAATC TATCTTACCA GCAAGCGCAT CTTGTCTTGA AGACCATTCA CTATGACTTA 1380 

CGCCATTAAT AGGATCTTGA CTACGAATCA TATCTCGCAT CCATGTTTCA TCTTCTTTTA 1440 

AAGAAACTCT ATTACCTTCA AAAGGAAATA CCTCATAATA AATTTGTTGG CGGTTAGCTA 1500

CTACTTCTCC AATTGTTCCA TTTGAACTAA TGATGCCATT AACCGOAAAA TCTTGAGGTA 1620

CA 1622


(2) INFORMATION FOR SEQ ID NO: 733: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 733:

AATGCCAnAT AAATCCAAAT TCTTTTAACG AACAAGAGTG CCCAAGTTAT TGCTGATCGT 60
TATGCCAGAT AnGCATATCA ATGATAATTA TGGTTTAGAA AGAATTTCTA AGACAAATCA 120
TGGATATAAT TATGTGTATT CCAATGATAA TTCAACTAGT AAGCAACATG TAAGTATTTC 180
AAATCAAGGC ATAATAACGA AATAATAGAT GGAACAGTGT ATTCTAATTG GATATACTGT 240
TTTTATTTTG CAATAATTTA ATTTAAAAAG GTGAATTCAA CTTATAAAAT GATGTAAATG 300
TTATGTCAAA ATCAACCAAT CCGTAATGTA TTTTAAAATG TTAATATAGT TCTGAAGAAG 360
TATAAATGAG GTGTTGAAAT GGCTAAAAAT AAGAAAACgA ACGCGATGCG TATGCTTGAT 420
CGTGCAAAAA TTAAATACGA AGTTCATAGC TTTGAGGTAC nTTAAGAACA TT 472




(2) INFORMATION FOR SEQ ID NO: 734: 







(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 519 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 734:

CATAGCCCCA AATTTTTTCT ATTATTTGTT CACGAGTAAA GATTTGCTTA GGACGTGCTG 60
CAAGCATAAA TAATAATTGA AATTCCTTGT TCGGTAACGT CATCGTTTTA TTAGATACTT 120
GGAGTTCCAA ATAGGATTGG TTTAGCGTTA AGTTGCCAAT AGTCATTTCT GAATTTGAAT 180
TGATATTATA TCGACGTAAT ACAGCACGAA TTCTAAAAAT AAGTTCCTTA ACCTCAAAGG 240
GTTTGGTTAC ATAATCGTCA GTACCGCTTA TAAACGCACG CTCTTTGTCA CTAAGTGCAT 300
CCCGCGCTGT TAACATAATA ACTGGTATAT CATAATCATT TTTTAATGTA TTACATAATT 360
GAAAGCCGTC CATACCATCC ATCATAATAT CTACCACTGc AATATCGaCA CGCTGTTTTT 420
GTTTTGTnAA ATGGGCTAGC TATAATAATT TAGGGGATT 519
(2) INFORMATION FOR SEQ ID NO: 735: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 735: 

ACTTGAAAAC GATAAACAGT ACTTTCCAGC TACACATTGG AAAGCTATAA ATGGGATACC 60

TTATGCAGGC AGTAGTGATA TTGATGGATT GCCTCAAGAC GGTATCATTT CGGTAAATGA 120 

TAAAAATAAA TTAGATAATT TAAAAATAGG CGAACAGGAA TTATTCAAAA TAGCATTGTA 180 

CAGAAATCCC CAAACGGTAA ATTGTGGAAA ATAACAGTTG ACGATAGTGG GAAACTTGGT 240 

ACAGTGCTAT TTTATTAGAA AGGAAGGTGC ATTATGGAAA ATTTGTATTT AATAAAGGAT 300

TTGGGAGCTT TAGCAGGTCG AGATTATAGA GCTAAGGAAA TACAAAACTT ACAAGGAATA 360

GGCATTGCGC TGGGCTGACC ACCAGGGTTT AnGTGnCCnC 400

(2) INFORMATION FOR SEQ ID NO: 736: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 780 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 736:

CTTTTTTATT GAAAAAGTGG TATTTGATTA TCTATTTACT ATTTATATTA GCGGCACTCC 60
TTATTACATT AACGACAATC CAACATGTAA CAGAAGATGA CAATCATnTT AATATAGGTG 120
TCGTAGATAA AGATCAATCA AGTGAAACGA AATTAATCTT AAACTCTATT GGTAAAGGGA 180
GTAACCTAGG AAAAAACGTG AGCATTAAAG CATATGATGA TAAGCAAGCA CATACTTTGT 240
TAAAAAAACA TAAACTTCAA GGCTATTTTG TTTTTGATAA AGGTATGACC AAGGCATTTT 300
ATAAACAAGG CGAACTACCA ATTTCAGTAT ATACATATGA TCAACAATCC ATGAAAAGTG 360
TCGTGCTATC TCAGCTAACA GATTCTGTTT ACCAACGTCT TATGCGATCA ATGGGTGGCA 420
TCTTAGCTTT TCAAGACTTA GCACCGAAAG CATCACATTC TGACAGTATC AATGTTATGA 480
CTGATTTGCT GATtACAGGA TTAAACCGTT CAGGTGCAtT TAACTTAGAA CCGATTCATT 540
CACTATCTTT ATTTACAGTT TTGAAAATGA ATCAAGATAC TGTATTGAAA GCGCGATTGA
AAATGTTTCA TTTTTCTAAA GAGCGTTTAT TAATCATTCG TACGTTGATT ACATGGTTTT 
ATACTATGTT ATGGGnGTAT CGnTGGTGTA GTTTGGAATG TGGTTCAGTA TTCCGAATnA 
(2) INFORMATION FOR SEQ ID NO: 737: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 946 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 737:
TATTTAATTT CTTnATTnTG GTTTTTAATT nTAAGATGAC CATAAATGAA TTTCTTCTGC
TGTCATAGAA TTAGAGCAAA ATACGAGCAT AAATTTAACT AATTTCTTGC TTGTCTTATA 
GTCTTGATTG CCTTGTAAAA ATGTTAGTAT TAATTGCTTC ATCGTTTCGA ATTGTTGTGT 
TGATTTAATA GCGATTGCTA AACCTATATT TTTTTCGAAA AGCTTTTCAA AACATCTATT
TAACAGGTTG TAATTCGTTT GGCTAATCTC AATATCGTGA ATATCGTCAA TGAAAACGAC 
AGGTTTTGGT AAGTGCTCAA AGTTAATATC GTAATATTCA TTGAAAAtAA ATTGAAATAG 
TTCATTGAAA TTACTAAATC GAACAAATAC TTTGGCACGC TGACTTCGAT CTTTCGGATT
AAAATCATCT ATATGAATAT CTGTTGTATT TGTAGCCAGT TGAGCTGTTA AATCAATTGT 
TGAAATTAAC TCAGTGAATT GTGATACGTT ATCTTGTTGG AAGTGTATTT GTGGTGGTTC 
AGTAATCTTG GAAACGAGTG AACGGAACTG TTTAGGACTA AAATGCAAGT AATTCTTAAA 
TTGATTTGCA AAATTTGTAT GACTGTTAAA ACCGGCCAAT TCAGAAACAG TTGTAATAGA 
ATGTTTTGTA GAAAGTAATA AATTGATGGC GTTAACAAGT TTAATACTTG TAAAATAATC 
TTTAAAATTC ATACTCAAAT ATCGAACAAA AAGATTAGAA CAATATGATT CAGAGATGTT 
GCAATGCATC GCTATATCTT TTAAAGACAA GTGCGCATCA ATATkGTCAT GAATATAGTT 
TACACAGTCT ATGAACACCG GATTGCTTAA AGCAATATTT GGTAAATATT CATGATCTAT 
TCTTATAAAA GCTTCTTTAA GTAAAGTGTC TATnATACTT TGACCT
(2) INFORMATION FOR SEQ ID NO: 738:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 397 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 738:

TTTATTTGAT GATGTACTAA AGCAAGATGA AAGATTTGTT ATTGTAGTTC AAGCATTAGA 60

AGAAAAGAAC GGACAATTAT TAAAGAGAAC TTTAAGGGAA TATCCCGGTT TAAACCATAA 120

ACAAATGAAT GATTTATTTA TGCACTTAAA GGAATTATTT TCCGAAGAAT CATTTGCTGA 180

AAACCAATCA GCGTTCAGTA TTACAGTTTA TACAAATTTA GATTATACTG CTGACCAAAT 240

ATATGCTCAT GTAAAACGTT TCAGAGGTAA GCATGACTGG ACACAAACAG CTAAATAAAA 300

TCAAGATGCT TGCAGTGgAT TkTAACmATT AATGtTAGaT gTCraCyGcAT TTTTaATGCC 360

ATTCGGTAAT TAATCCGGAT GGTCCAATTT AATTAAC 397

(2) INFORMATION FOR SEQ ID NO: 739: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 915 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 739:





CAAGGTATTT CATTTGACCC ATTCACACCA TTAATAGTGG CAGCAGCATT ATACTTTGTC 60
TTAACATTTG TACTTACACG CATTATGAAC ATGATTGAAG GGAGATTGAA TGCCAGTGAT 120
TAAAATAAAC AATCTTAATA AAGTTTTTGG AGATAATGAA GTTTTAAAAs ATATCAATCT 180
TGAAATCAAT CAAGGGGAAG TAGTAGCAAT AATAGGTCCA TCTGGTAGTG GTAAAAGTAC 240
TGTATGAATT TATTAGAAGT ACCTACTAAA GGTCAAGTGA TTTTTGAAGG 300
CAATGACTTA ACGGAAAAAG GGACACAAGT AGATAAACTA CGTCAAAAAA TGGGTATGGT 360
ATTTCAAAAC TTCAACCTAT TTCCACATAA AAAAGTTGTC GATAATATTA TTTTAGCTCC 420
TAAATTATTA AAGAAAGATA ATAACGATGA ATTACATAAG GAAGCATTGT CGTTATTAGA 480
TAAAGTGGGA TTAAAAGAAA AAGCAGATGT ATATCCGAAT CAATTATCAG GTGGTCAAAA 540
ACAAAGGGTA GCAATTGCAA GAGCTTTAGC AATGCATCCA GATGTTATTT TATTCGATGA 600
ACCAACTTCA GCATTAGATC CTGAGGTAGT TGGTGATGTA TTAAAAGTAA TGAAAGACCT 660
AGCCAAAGAA GGTATGACCA TGGTGGTTGT GACACATGAA ATGGGATTTG CCAAAGATGT 720
AAGTGACAAA GTCATATTTA TGGCAGATGG CGTTGTCGTA GAGTCAGGCA CnCCAGTCGA 780
AATATTTGAA CAACCGCAAC ATGAAAGAAC ACAAAATTTC TTAGCAAGAG TATTATAACA 840
ACCTAACGAG GCTTGAATAT ATGATACGCA CCACAAAGTT ATATCATATA TTCAAGCTTT 900

(2) INFORMATION FOR SEQ ID NO: 740:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 435 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 740: 
AGTAAATGCG ATGTTGCATG GAGGCTATAG CAATGAGCAA TAAAGCTTGG GGCGGTAGAT 
TTGAAGTACA ACCTGAAGAG TGGGTTGACG ACTTTAACGC ATCCATTACT TTTGATCAAA 
CGCTCATAGA TCAAGATATC GAAGGCAGCA TTGCACATGC AACTATGCTT GCGAATCAAG 
GCATTATTAG TCAACAAGAC AGCGAACAAA TTATACAAGG ACTAAAATCT ATTCAACATG 
ATTATCATCA AGATCAAATT CAATTTAGTG CATCATTAGA AGATATTCAT TTAAATATTG 
AACATGgAAT TAATTAAACG TATCGGTGgA TGCTGGTGGT AAGTTGCATA CTGGACGCgT 
AGtAAACGAT CAAGTTGcAA CAGACATGCA CTTGTACACT GAAGAAACAA GTGCAAGnnn 
TCATCGCATT GGATG 

(2) INFORMATION FOR SEQ ID NO: 741: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1198 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 741: 
AATATCATAA CCAAAGCTTG TCATTACTTA TATGTAAAGA TTAGAGGCTA ATTACATAAT 
CCTAACTTAT TTAACAGGAA GTCATGAAAA CTATATTACA ACTAAAAGTA TAACTTTCCA 
TCACCAAAGT CAGAATTTAT TGAAACACTC AACTGATAAT ATAGAAGGCT AATTATAAAA 
ATACCTGGCG ATTAATAAAC TGTTTCCGTT TTTTAGCACT TAATATATTT AAAAAGACAA 
ATAGAGCCCT TTCAGGCTCT ATTTnCTGTA TCCTTAACAA CAGAAGCTGT TAAAACTACC 
CGTCTTTTCa CAaCCAAaGc TaCAAAAaaG aTgGCTAGTT ATTCTTTCGT CACCCGCTGA 
ATCATTAGTG TTGTTATTTC CTTTAACTTG CACGTCTAAA TCAAGAaCTT TTTCCAAaGA 
TaAAACCCCC TATTTAAAAG TTGAAGTTAA GACCCcTTCA ATTGTCATAT TAATCATTAC 
CACACAATAA ATCAATGAAT TTAATAATAA TTATTTGTTA TTAATTTGTA AATCGTCATT 
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TGAATATTTT TTGTTTAACT 


CTTAAATTTC GGGTTTAACT TCCTCTATTT 


CTAACAGTTA 


' 660 


TACTCCAGGA TTAGTTTCTT TAGAATCCGT ACCGATTAAT TTAACAGTAT GGTTTTGCCA 720
GTCAACTTCA TAAGTAGATG TAAACGTTAC TGTATTTTGA TnTTTGTAGT TATTTCCAAC 780
CCAGTGTAGT CGATTCCATT GATTAGTATA TCTATCCATT TCTCTTTGGT AAGTTACTTT 840
GATTTTAGAT TtTTTTGTAT CATTTTGTTT ATGAGAAAGT ACGCTTATAA ATTCTGGGTT 900
AAAGTTACCA CGCGCCAATA AAGGCATTTG ATGTGTTGGC AAGAAATTTT GACCAGCATT 960
TGAACTACTT TGTCTACCAC CTAAAAACAG TTCATTACCA TATGTTGGGT CATAACTATC 1020
TCTACCATAT GGTCCCCAAC CATTATTCAT AATTTTGTGC GCCTCAACAC CCCAGCCAAT 1080
TGATTTATGA TTTGTTTTTC TATCAATCGT AGTTCTGTAA CTTTCTTGTT TATAATTTAT 1140
CGTTTCTGAA AATGATTTTG ATCCATTTAA TCCACCTGAT AAGCCATTAG ATATATTA 1198


(2) INFORMATION FOR SEQ ID NO: 742:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 742:
GTGGCATATn CAAACTACTA CTGAAAGATC AATTACATAC GTACAGATTT TGCAAAATGA 60
TCAAGATTAT CCAACAATTT TTAGATGAGa CAAAACGTAA ATCTGTAATT AATTcAGATG 120
TTAATGTAAC GGTAAAAGAT AGAATAATGA CTTTATcAAC GTGCGAAGAT GCATATAGTG 180
AAACGACGAA AAGAATTGtT GTTGTCGCAA AAATAATTAA GGTAAGTTAA ACAGAAAmGA 240
GGATAATTAT GAAATTTATG GCAGAAAATA GGCTGACGTT AACaAAAGGA ACAGCAAAAG 300
ATATTATAGA ACGATTTTAC ACGAGACATG GGATTGAAAC ATTAGAAGGC TTTGATGGCA 360
TGTTTGTTAC ACAAACTTTA GAACAAGAAG ATTTTGATGA AGTGAAAATT TTAACAGTTT 420
GGAAATCAAA GCAAGCTTTT ACGGATTGGT TAAAATCTGA TGTCTTTAAA GCAGCGCATA 480
AACATGTTAG AAGTAAAAAT GAAGATGAAA GTAGCCCGAT TATAAATAAC AAAGTAATTA 540
CATATGATAT AGGCTATAGT TACATGAAAT AATTGAATTT AAAGAGGTTG CAGTACTTGT 600
TATGACAATT GGTATTGTTA GCTTTTGTAA CGACCATACA TGATACCGAT GATGGTCGTT 660
TTTTTAATGa ACACAAACAT GCTAACAACA AATTGCTAAA ACATAGTTTG aTTTGAATGT 720
GgCTTTGGAA AATAATCATC CAtTATATCa TTtTGGATTn TATTTTGGAA GAAGAAAAGA 780
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(2) INFORMATION FOR SEQ ID NO: 743: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1209 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 743:
TGCTGGTAAA TCTTCCCTTA TCAAGTCTTT AATTGGTGAA TTTAATGCTA CCGGTACTAA 60
ATTGTTATAT AACAAACCTA TACAACAACA ACTGCAACAT ATTACATATA TTCCACAAAA 120
AGCACATATT GATTTAGATT TTCCTATAAG TGTGGAACAA GTGATTTTAT CAGGTTGCTA 180
CAAAGAAATT GGATGGTTTA GACGACCTAA TAAATCAGCA AGGGATAAAC TCAAACAGTT 240
ATTAAGCGAT TTAGAATTAG AATCTTTACG TCATCGACAA ATTTCAGAAT TAAGTGGTGG 300
ACAATTACAA CGTGTGCTAG TAGCAAGAGC ATTGATGTCC GnAAGTGAAG TTTATTTTCT 360
AGATGAGCCG TTTGTCGGAA TTGATTTTAG TAGCGAAAAA tTAATCATGA CAAAAATcGA 420
GAACTTAAaA CAACAAGGAA AACTTATTcT TATCATCCAC CATGATCTAT CAAAAGCAAA 480
GCAATACTTT GATCGCATTA TTCTATTAAA TCAAACATTA CGATACTTTG GTGATAGTGA 540
AGAGGCTATG AGTGTCACTC GCTTAAACGA AACATTTATG AGTAGCACTG ACTGTAGTGA 600
CCCTAGTCAA AGGAGCAATA TAACATGTTA GAGTTTGTCG AACATTTATT TACATATCAA 660
TTCTTGAATC GAGCATTGAT AACTTCAATT ATTGTAGGGA TAGTTTGTGG TACAGTTGGT 720
AGTTTAATTG TATTACGTGG TCTTTCATTA ATGGGAGATG CAATGAGTCA CGCAGTATTA 780
CCTGGTGTTG CCCTATCATT CTTATTTGGT ATTCCAATGT TTGTAGGCGC ACTTATAACT 840
GGTATGATCG CAAGTATTTT TATCGGTTAT ATCACATCTA GTAGTAAAAC GAAACCTGAC 900
GCCGCTATTG GAATTAGTTT TACCGCTTTC TTAGCTTCGG GGATTATTAT TATTAGCTTA 960
ATAAACACTA CGACAGATTT GTACCATATT CTGTTTGGTA ATTTACTGGC AATCACAAAT 1020
AGTGCATTTT TAACAACTAT TGTGATTGGT TCAATCGTCC TTATTCTAAT CATTATTTTC 1080
TATCGTCCAT TAATGATTTC TACATTCGAT CCAACGTTTA GTAGAATGAG TGGTCTAAAT 1140
ACGACGTTAT TACATTACTT TGTGATGTTG tTACTCTCAT TAGnAACAGT AGCAAGTAtT 1200
CmAACGGTA 1209




(2) INFORMATION FOR SEQ ID NO: 744:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 675 base pairs
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(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 744:
CCGCCTTTCG TAAACCTAAT ATAATACGAA ATTTTCGTAT TGTCAACATT AAATACGTTT 60
TTTTCGTAAA AAACTTTACT ATGATATGAA AATTTCGTAT AATAAGAAAA AAGGAGGTAA 120
GTAATATGAA CAAAGAmAGm AATATTATTA TAGCCAAAAm CATTAGAAAA TTTCTCAACG 180
ATTCAAATAT GTCTCAAAAG AAACTTGCTG AaCTCATTAA CATAAAACCA TCTACTTTAA 240
GCGATTATTT AAATTTACGT TCCAACCCCT CTCACGGCGT TATACAAAGG ATAGCTGATG 300
TTTTCGAGGT TGGTAAAAGC GACATAGATA CTACATACAA AGACGATAAC GACATCACTT 360
CCATATACAA CAAACTCACA CCTCCCCGCC AAGAAAACGT ACTTAACTAT GCAAATGAAC 420
AATTGGAAGA ACAGAATTCT AAAGGAGATA ACGTTGTAGA TATTAATTCA TATAAACAGG 480
AGAAAACTCC AGTTAACGTC AATGGTTGCG TCTCTGCTGG TGTAGGAGAA CGTTTACACG 540
ATGAAACGCT ATTTACTGAA ATGGTTAAAG GACCTATCCC CACACACGAT TTAGCGTTAA 600
AAGTAAATGG TGATTCTATG GrACCTATGT TTAAAGATGG CGAAATCATA TTTGTGGAGA 660
AAACTCACAA TnTGA 675



(2) INFORMATION FOR SEQ ID NO: 745:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 924 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 745:
CAGGTGTCAT AGTTGAACTT GGtGGTTTTG GACTAATCGT TAAATCATCA ATTTGTGTCA 60
TCCAAGGTTG GAATTTACTG CCTTGTGTCT CCAGTGCGCT GAAAATACCT TTATCTTGAA 120
ATAAGTCAAC TAACTCTTGG ATACCTTTAA TTAATGCTGG GTTACCACCA GAAATTGTAA 180
CGTGGTTAAA TAAATCGCCA CCAATTCGTT TTAATTCATC ATAAATTTCT TCAGCGGTCA 240
TGAGTTTTAT ATCGCCTTTA GCACTACCAT CCCAAGTAAA TGCAGAATCA CACCAGCTAC 300
AGgATaATCA CATCCAGCTG TTCTCACAAA CATCGTTTTT CTACCGATTA CTCGACCTTC 360
ACCCTGAATG GTTGGACCGA ATATTTCGAG TACAGGAATT TTAGCCATTA GTTACACCTG 420
TTCCTTTGGT CTAAATACGA CATAACTTGT TGGTGTTTCT CTTACAAATA CTTGAATACA 480
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TACGATTTCA GTTGAAGGGA TTTTGTTTTT AAAAGCAGGT AAGTTATTTA ACAGTTGATG 600 

GTCAAATTTA CCGTGTATCA TCTTTTTCAA ATGGCTAAAG TTCACTAAGA AGCCAGTGTC 660 

ATCTAGTTTA TCACCGACAA TTGTTAAaya cAAAGTAAGT ATGACCATGG ACATTTTGAC 720

AAATACCTGC TTCTTCACAT GGAATGTGAT GTGCAGCCGA AAAATTAAAA TCTTTATTTA 780

ATTCGAATTG ATATGGATGC GTTGTACTAG GATAGATTTG TTGTAACATT TTAAAGCGCT 840

CCTTTACTTT CAAGATATTG ATTTAGTCCA CGTTGACGTA AATGACAAGC TGGACATTCA 900 

CCACAGCCAT CCCCAATGAT ACCG 924 

(2) INFORMATION FOR SEQ ID NO: 746:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 971 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 746:
ATCTTTTGGG TATTTTCTAT ATTAGCTGTT GTTTTCTTAG TATTATTTAG TTTTGCTGTT 60
GGTGCATCAA ATGTACCAAT GATGATTTTA ACATTTATAT TACTCGTTGC AACCTTTGGA 120
ATTGGATTTA CTACAAAGAA AAAATATCGA GAAAACGATT GGCTATAAGT TATATCGACA 180
ATAACACGCG TTGATTTAAG ACCACACATG TTCATGATGA ACATTTAACT GGTTTTATAT 240
TGGCGCGTGT TTTTATTATT TAAGGTATAA AAGCGTAATA TGTTACCTAA TTGTTAGGAC 300
TACATATTAC GCTTTTGGTA ATTATTTTGA TTTTCTTTTA AATCGTTGTT CATGTAATTG 360
TATTAACGCA TCATGGTGGG GCTCTAAaTA ATCATtGCTA ATTcTTGATG GAGACCmCGA 420
TTtCCAACTA rGATkGAATT kGGACCmCTA ATkGTTAATG GTcctCCTAG TAAATTGGAA 480
GCTTGTCCAT TaCTTCcATA TAAAtAACCA ATCCGCCAGC AAAtCCCcAT GGTkGAAGTC 540
TTGGCGTCAT ATATGCTTCT AAaTTACCTG TAGCAACTGA AACGATTTCA AGCGCTGCAC 600
TACCATATGC CCTTGCACTT CTAGAATCAT TAACAATTTC TTTAAAGATT TCTCCTAAAA 660
TTGGTTTAGT TAACCAGTTC GGATTGATCC CAATAATGCT TTGTCTTAGA TTAGAATCAT 720
TCAATGGTTT CAAGGGTTGG CTACCACGAT ATGCACCTTC CCCTACTTTA GCATGATATA 780
AGACATCAGC CATAACATCA TATACAAAAC CTGCATAAGG TTTACCATCG ATATAAATAC 840
CAATTGAAAT TGCGAAATTT TCTTGTTGAT GAACAAAATT CAATGTACCG TCTATTGGGT 900
CAACAATCCA TACCGTACCT TTGGAAGTAT CGATGTCATG ACCATGCCCT TCTTCACCTA 960
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(2) INFORMATION FOR SEQ ID NO: 747: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 865 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 747:
TTCTCGTCAC TGTACGTTGT ATTGCGGCAT CATCCATTAT GATACGTTCA GACATATTCT 60
CACCCTTTCA AAATTTAGAA ACAAAAAACC TCATGTCTTT AATATCGACA TGAGGTTTAG 120
TATATACAAT CAGCCTATTT ACATAGAAGA ATCGTAGTAT TACTACTTAT TCTTGCATAT 180
GTGAAAGTGT TATGATCATT TTAGTATATA TCTATATTTT CATGTCTTTG ACGTCTCTCT 240
GGACGCTCAA ACTTAAACTG TTATAAGGAT AATACTAATC 300
AGGCGCTTTG TCAATCTATA TTGAAAAATT AACCATACTC AGTAAAAATG TTAACCTTAA 360
TCTATTTTAT AATGTGTTTT CATTTAAAAA AAACGAAAAC GATAAACACT TATAGTTTAC 420
ATAATAATGT TATCGTAATT ATAGCACGAC ACAATTATGC ATCTCTTTTT CGAATTGTAT 480
CTAATAAATC TTCAAAGTCT TGTGGTAATT CAGCATGTCT TTCAATATAT TCACCTGTTA 540
CTGGATGTTC GAATCCAATA AGTCCAGCAT GTAGAGCTTG ACCACCAATA TCCAATGTCT 600
TTTTCGGTCC ATACTTTGGA TCACCAACTA ATGGGAAGCC AATATATTTC ATGTGAACAC 660
GGATTTGaTG CGTACGTCCT GTTTCAAGTT GACATTCAAC AAGCGTATAA TCTTTAAAAT 720
GTTCTAGTAC GTTAAAATGT GTCACTGCTT CCTTACCATC ATCAnCAACA GCCATAGATT 780
GACGATCATT TTTGTTTCTA CCAATTGGCG CATCGATTGT ACCGTAATCA TGAGGAATAT 840
TCCCGTGGAA CTAAAnCGAT ATAGT 865



(2) INFORMATION FOR SEQ ID NO: 748:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 818 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 748:
ATAACATTGT GGAGCCTATG ACATTGATTT ATGTCTCGGT CTCAAATGTT CTTGTAAAAA 60
AACTAAGTAT AGTTGAATTT TAAGTTAAAA AAATTGTTTT TGCAAAAATA AAATAATCCA 120
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TTTTAGGTTC TTCGCCTCTA AAAATCCTTG CGaTATTAGA GCGATGTCTA ATTATCAATA 
TGATTGAAAC TAAGAAACTA ACGACTAATA AAATATAGTC TTGAATGATA AGCGAGCCAA 
TCACACAGCA AATTGCTGCA ACGATACTTG CTAAAGAAAC ATATTTAAAA ATCTTCAATA 
CAATAAAGAA GATAATTGCA AGTAfTAGTA AAAGTATCGG ATTGACTCCC AAGACGACAC 
CTGCACTAGT TGCAACAGCT TTGCCACCTT GGAATTTTAA ATAAACAGGA TAAACGTGTC 
CAAGTATAGC GAATAAGCCA ACAATTAAAC CATTTGTAAA AAAAGTACTA ATAGGGCCAT 
CTGCGTGAAC TTGTAACCAT AAAGGGAAGA AAACAGTTAT GAACCCTTTG AAAATATCTA 
GAAATGTTAC CAAGAATCCT GCAGGACGAC CTAATACTCT AAAGCTATTA GTAGCGCCAG 
TATTACCACT ACCAAATTGT CTAATATCTT TTTTGAAAAA TAATTTTCCA ATTACGAATC 
CACTTGGGAA AGCGCCGATA AGATAACTTA GTAGTAACAT GACGATTATC ATCATAAATA 
TTACACATCC TTTAATATCT TAGGACTATT TTATCATA 
(2) INFORMATION FOR SEQ ID NO: 749: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 749: 
ATATCCAATa ACTGTTCATT GTCTTGATAT TCCTTAAATG TTTCATAATG CTCATTAGAA 
ATGACAATTT CAAGCAACCA ACCCGTGCCA CTTTCCTCTT TATTAATTAT TACGCCATCT 
TCTAATTCAT ATTCTATAAT TCTGCCGTGT TCATTCACAA TTTGAAACCT TACTGCTTTA 
AAAGTTTTCA TATTCCCACA TCCCATTAGT TTAATATTTA TGATTTTTGA ATTATTTAGA 
AGATTTAAGG TCATTTCATG ACTAATTTCT TTAAATATAA CAAATAACGA GATGCTATAG 
CTATTAATTA TATAAAATTT GcCATTTTGa CCATTTTTAA GCTAGTAATT AAGATATAGG 
CATAGGAGGT GAATCACACA CTACATGCTA AATAAAATCG TAATTGTCGG GAGcTGACGA 
AAGACGCACA AATATTTGAA AAGGAGGATA GAAAAATTGC AACGTTTTGT GTTGCAACGC 
ACCGAAATTA TAAAGATGAA AATGGAGAAA TCGTCTGTGA TTACTTATTC TGTAAAGCAT 
TTGGCAAGTT AGCTTCTAAT ATAGAAAAAT ATACTAATCA AGGTACATTG GTTGGTATAA 
CTGGTCAAAT GAGATCAAGA AAGTATGATA AAGACGGACA AACACACTTT GTCACTGAAT
TATATGTTGA AACAATAAAA TTTATGTCCC CTAAATCCCA AAATAATGAA ATTCTCTCAG 
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TCCAATTATC 


CTAAACATCC 


TTAATATACA 


TTTAATCCAT 


CCTAAATTTT 


TTATAGATCT 


840




AATTAACTTG AAATATACTC ACTTAAAAAC GACTTACATG ACCTAGTACT GTTGGTAAGG 900
TCGnCTTTTG GTGTAGTTTT CCTAATAGAn AATAGACnTA GGTTGCCAAA CCGATTACAT 960
AAAAAATTGC C 971


(2) INFORMATION FOR SEQ ID NO: 750:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 832 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 750:
ATAGATTACT CTTTTGATCT GTAGATCCTA CTGTAACGAC ATTGTCCATA GATGCAGGAA 60
CATCTTTCAC TTCGCCATTA CCTTGATATT CACGCTGTAA TTTTAGTTTC TGTTTGTCAT 120
TGACATCAAT ACCATCATTA CCAGCTGCAG CAACAACGAT AGATTTTTTC TTCTTGGCGT 180
AATTGATTGC TTTCTGTAAC GCATCGTATT CTACTTTTTC ATCTTTTCTA AATGTTTGAT 240
GGTCATTTTT GTCCAAAATA ATATAACTAC CAACACTAAT ATTAATGACT TGaTTTCCAT 300
CATTTGCAGC TTGAACAATC GCTTTTGATA CCCAAAGCAG TTCTGTTTTT TTACTACCAA 360
ACACGCGATA CATTGTAAAT TTGTTATTCG GTGCAACACC TATTAACTTA CCATTAGCAC 420
TCGTTTGACC CGACACCATC GTGCCATGTC CTTTCCTATC ATTGACATCG TGAACATCAC 480
CGGTTCAGTA CCTCTAAAAC CGTTTAAAGG TACTAAATTT TTAGAATCAG 540
TCGAGAAATT ATTTTTCAAA TCGTCATGGT TTTTCATCAC ACCTGTATCT ATGATTGCTA 600
TTTTGGTGTT AGCATGTTTT GGCAAATCAT CATACGATGC ACCATTATTG GTTATTTTAT 660
TCATATCCCA TTGTCTTGAA AATAATGACT CATTTGATGT TCTGTCTATT GTTTTCTCGC 720
TAGTAATACA AGTTGAACAT GTGGCATTGA TATATTTAAt ATCATTTTTA TAGTTTGCTA 780
AAGCATTAGC ATGCATTTTC GTCATTTTAA TCTGTGCCAC ATGGAATTCC GG 832

(2) INFORMATION FOR SEQ ID NO: 751:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 475 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear
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5 



GTAATATCAA TCTCTTCATA AGCTGAATTA TTTTCATGCA CTTCTTGATG TGATGATTTG 60
TCACGAACCG CTACAACTAA CATTTTATCG TCTAAAATAA GTTGTTTATA TTTTTCTAAT 120
TCATCAGGCG CTAAGTTGTA GCGTGATAAA ACTGCATGTT CACCATCTTC TCCTGTTAAC 180
AGTTTAGTCA TTCTATCACT AAATGTTCCA CTTGTTGAGA TAAGGGAGAT TTCAGAGTCG 240
TGTAAGTCAT TTAGGTGTAA TTTACTTTTA CTAATAATTG TTCTAAATAA 300
CCTTCAGATT TCTTTTGATT GATTACGTTG TATAATTCGC CAGTGTCATT TACTACAGTA 360
ATATCTGCCA TAGTTGTCGC CCCTTTAAAA ATTTGTTTAT tTAATCTTTT ACCCTTCTTA 420
TnATAAAGTA AAACCCTTAC ATTATTAAGT nATAAGTCTT CATTCGCATT AAACG 475


(2) INFORMATION FOR SEQ ID NO: 752:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1019 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 752:
GCGACACTTG TGAGTTCTCC AGCAGCAAAC GCGTTATCTT CAAAGGCTAT GGACAATCAT 60
CCACAACAAA CGCAGTCAAG CAAACAGCAA ACACCTAAGA TTCAAAAAGG CGGTAACCTT 120
AAACCATTAG AACAACGTGA ACACGCAAAT GTTATATTAC CAAATAACGA TCGTCACCAA 180
ATCACAGATA CAACGAATGG TyATTATGCA CCCGTAACTT ATATTCAAGT TGAAGCACCT 240
ACTGGTACAT TTATTGCTTC CGGTGTAGTT GTAGGTAAAG ATACTCTTTT AACAAATAAA 300
CACGTCGTAG ATGCTACGCA CGGTGATCCT CATGCTTTAA AAGCATTCCC TTCTGCAATT 360
AACCAAGACA ATTATCCAAA TGGTGGTTTC ACTGCTGAAC AAATCACTAA ATATTCAGGC 420
GAAGGTGATT TAGCAATAGT TAAATTCTCC CCTAATGAGC AAAACAAACA TATTGGTGAA 480
GTAGTTAAAC CAGCAACAAT GAGTAATAAT GCTGAAACAC AAGTTAACCA AAATATTACT 540
GTAACAGGAT ATCCTGGTGA TAAACCTGTA GCAACAATGT GGGAAAGTAA AGGAAAAATC 600
ACTTACCTCA AAGGCGAACT ATGCAATATG ATTTAAGTAC AACTGGTGGT AATTCAGGTT 660
CACCTGTATT TAATGAAAAA AATGAAGTGA TCGGaATTCA TTGGGGCGGT GTACCAAATG 720
AATTTaATGG TGCGGTATTT ATTAATGAAA ATGTACGCAA CTTCTTAAAA CAAAATATTG 780
AAGATATCCA TTTTGCCAAC GATGACCAAC CTAATAACCC AGATAATCCT GATAACCCTA 840
ACAATCCTGA TAACCCTAAC AACCCAGATG AACCAAATAA CCCnGACAAC CCTAACAACC 900
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CAATTAAGTT CGTCTAGATA AAATAGGAAG CAAAGTCTTA GCAACGTAAA ATATTTTGA 1019 
(2) INFORMATION FOR SEQ ID NO: 753: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 533 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 753: 

ATTCTTATCC CAGGTAGGTG nTTTACTTCT TATATTAGGT GCAATAACTC ATGCCACTCA 60 

TCTTGTGGCA CAACATGAaT CGCATTTTCA AAAATTGCAT GTCCTGCTTG ATCAGTCAAA 120 

CCGGCAGCAT GATCAAAATG CATATGTGTC ATTAGCACAT AATCAATATC CTTTGGCGTT 180 

AAATTATAAT TTGCCAAATC AGCAATTATA TGACTTTCTT CATCTACTCC AAAATTACGT 240

AATTGCTTTT CAGATAATTT ACCATTACCA ATACCTGCAT CTATAATCAA ATTATATTGA 300 

GCCGTTTGAA TCAAAATTGG ATGTGTCGGT AAATTGATTT GATTTCGTTC ATTTGCATTG 360 

TATTGCTTTG ACCACAACGG CTTCGGAACA ACACCAAACA TTGCACCGCC ATCCATTTTT 420 

GTATTGCCAC CATTTAGaTA ATGAATAGaT ATATCCCCGA TTTTCATAAC ATCACCTATT 480 

CTTTCTGTAT TCGTTATTAC AATTAGCATG TGTTTATTAC ATCATATCAC TAT 533 



(2) INFORMATION FOR SEQ ID NO: 754: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1225 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 754:
ATCCCCTGCA AGTGATAAAA TTATTGAAAA AGGCGACATG ATTACATTAG ATTTTGGCGC 60
GTATTATAAC GGCTATTGTT CAGATATTAC TAGAACATTT GCTATTGGAG AACCAGATCC 120
TAAACTGAAA GAnATATATC AnATAGTACT TGAATCTCAA ATGAAAGCAA TTAATGAGAt 180
TAGACCTGGC ATGACTGGTG CAGAAGCTGA TGCCATTTCA AGAAACTATt TAGAGTCAAA 240
AGGGTATGGA AAAGAATTTG GACATTCACT AGGACATGGT ATTGGTTTAG AAATCCATGA 300
AGGGCCAATG CTGGCTCGTA CGATACaAGA TAAACTTCaA GTTAACAACT GTGTTACAGT 360
AGAACCTGGT GTTTATATAG AAGGTTTGGG CGGTATAAGA ATAGAAGATG ATATTTTAAT 420
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AGCGTGTAAA ATGAGGAGGA AACTGAATGA TTTCGGTTAA TGATTTTAAA ACAGGTTTAA 
CAATTTCTGT TGATAACGCT ATTTGGAAAG TTATAGACTT CCAACATGTA AAGCCTGGTA 
AAGGTTCAGC ATTCGTTCGT TCAAAATTAC GTAATTTAAG AACTGGTGCA ATTCAAGAGA 
AAACGTTTAG AGCTGGTGAA AAAGTTGAAC CAGCAATGaT TGaAAATCGT cGCATGCAAT 
ATTTATATGC TGACGGAGAT AATCATGTAT TTATGGATAA TGAAAGCTTT GAACAAACAG 
AACTTTCAAG TGATTACTTA AAAGAAGAAT TGAATTACTT AAAAGAAGGT ATGGAAGTAC
AAATTCAAAC ATACGAAGGT GaAACTATCG GTGTTGAATT ACCTAAAACT GTTGAATTAA 
CAGTAACTGA AACAGAACCT GGTATTAAAG GTGATACTGC AACTGGTGCC ACTaAATCGG 
CAACTGTTGA AACTGGTTAT ACATTAAATG TACCTTTATT TGTAAACGAA GGTGACGTTT 
TAATTATCAA CACTGGTGAT GGAAGCTACA TTTCAAGAGG ATAATCTCTA ATTTGTTAAC 
AAATAGCTTG TATTCACTAT ACTGATTTAA CGTAAGAnAT TCTAAATAAG TCTCATAAAG 
CTATTGCCTA AAATGATTAT AGGTTATATG CTGATATGAG GCTTTTTATT TTTAAAATAA 
TTTTTAAAAT ATAAAACGAA TCGTC 
(2) INFORMATION FOR SEQ ID NO: 755: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 755: 
ACAAACAAAT CACTTAGTTA AAATTGACAG TGCATTATAT TCCGATAAAT TATTCAACAT 
TGTAGAAAnn AGAATTGATA CACCAGATAT TGGCTATAAT ACAGTGGTTT TATCAGAAAA 
ATGAGTGTAG AAATTAAAGG GATACCTGAn GTGTTGAATA AATTAGAATC GGTATACGGT 
AAACAAGCAA TGCAGGCTAA GAGTGATAAA GCTTTAAATG AAGCATCTGA ATTTTTTATA 
AAGGCTTTAA AGAAAGAGTT CGAGAGCTTT AAAGATACGG GTGCCAGTAT AGAAGAAATG 
ACTAAATCTA AGCCTTATAC AAAAGTTGGT AGTCAAGAAA GGGCTGTTTT AATTGAATGG 
GTAGGTCCTA TGAATCGCAA AAACATTATT CACTTGAATG AACATGGTTA TACAAGAGAT 
GGAAAAAAAT ATACACCAAG AGGTTTTGGA GTTATTGCAA AAACATTAGC TGCTAGCGAA 
CGTAAGTATA GAGAAATTAT AAAAAAGGAG TTGGCCAGAT AAATGAATAT ATTAAACACC 
ATAAAAGGAA TTTTATTATC TGATGCAGAG CTCAAAACAC ATATAAATTC TAGAATATAC 
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TATGATTTGC CTTCAGACTT TATGTCTGAT AAATATCTCA GTGAAGAATA CTTAATTCAA 720 

ATAGATGTAG AATCTTCAAA TAATCAGAAA ACAATTGATA TAACAAAACG AATAAGATAC 780 

CTGTTATATC AACAAAATTT AATTCAAGCA TCTAGTCAGT TAGATGCTTA TTTTGAAGAA 840 

ACTAAACGTT ATGTGATGTC GAGACGATAT CAAGGCATAC CCAAAAATAT ATATTATAAA 900 

AATCAGCGCA TCGAATAGGT GTGCTTTTTA ATTTTTAAGG AGGAAATAAG CAATGGCAGA 960 

AGGACAAGGT TCTTATAAAG TAGGTTTTAA AAGATTATAC GTTGGAGTTT TTAACCCAGA 1020 
A6CAACAAAA GTAGTTAAAC GCATGACATG GGAAGATGAA AAAGGTGGTA CAGTTGACCT 1080

AAATATCACA GGTTTAGCAC CAGATTTAGT AGATATGTTT GCATCTAACA AACGTGTATG 1140 

GATGaAAAAA CAAGGTACTA ATGAAaGTTA AGTCcTGaCA TGaGTaTTTT CaATATTCCa 1200 

AGTGaTGaTT TAAaCACaGT TaTTGGaCGT ACTAAAGATA AAAATGGGTA CATCTTGGGT 1260 
AGGAG 1265

(2) INFORMATION FOR SEQ ID NO: 756:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1111 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 756:

CATATACCCG GGTTCCAAGA ATCCTCTATC CATATGGTCC CCCAATTTTG ATTCCACCAT 60 

ATTGTTAAAT ATCACTTTCC CAGCCTACTT TTTTATnCAG TTGGGCTCTC TAAAATTGTT 120 

TTGAAATCCA GGTTGAACAT ATTTCCAGTG TATGACCAAT CGAAACATTT GCACCAATAA 180 

GGCCGCCAAT TTTTCCTGTA TCATCACCAG TAACATTACC GTTGAATCCA TAAGTTAAAG 240 

TACTCATATA CTCTTTTGTA TCAATCGAAT TTCTTGGATA GTAATCAGAT ATTTGAGCTA 300

CTTCATTATC AGGTAGTTGC AACTGTACCT TAAAGGCTGA AGGCCAGGCT AAACCACTTT 360

TGTTAGCACC TTCTTCGCTA TAAACTCTAT ATTGACCAGC AATGGTACCT TTCGTTCTAA 420

TAACTAGCAG TTTTTTATTA TGATTTTTAT CATCGATAAA ACTATAAAAT ACTTTTTTGT 480

GCATGCCATT TTCTTTATCA TAAGTGACTA AATCACCTGT TTTTACTGTA GTATTGCTTC 540

CAATATCTGT AGTACCGGTT TTAATATTAA TATCAGAATC TGCGGCATTA GCGACAGGAT 600

TCATTAATAT GGAACCTAGC AATAGTGTTG TTGTTACTGA GCTGACTATA CGTGTTTTCA 660

TTTTCATCAT CCTTCTATTT TTTAAAACGA TTTGAGGAAA CAATAATCAA TATGTCAATT 720
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w 



GTAGCAACTG ATAAATTACT GAGTGATGAT GAGTGATTAT TTTAAGAATA TGTTTTTAAC 840 

TTTTATTTAA AATTTGAAAG GAAGCATTTC AATTTCGAGG GTTAGTCAAA GTTGAATAAA 900

TTCTTTATGA AACAAGGAAA AGACATAGCT AATTTtATTG ATTAATTTCT TTAAAACTAA 960

TGATTTGTTT GATTTAAAAA TGTAATCGAT TACAATATAA AAATACAAAT ATCTTAGAAT 1020

TAAATCAATT AATTAACTAT TAAATAAAAA TTAACTATAT ATTAACTAGT GTAAATTAAT 1080

AAATAGAAAT AGAGAAAAAG GGTATTAATT A 1111
(2) INFORMATION FOR SEQ ID NO: 757: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 466 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 757:
TATCTATTGT ATGTAACAAT AGGTACATAA TATTTTTAGG GTGGGTTATA TGAGCATAAT 60
TACAAGATTG TTTAATAACA GTGATTTTGA AAAATTAAAT CAACTATGTA AATTATATGA 120
TGATCTAGGT TATCCAACAA ATGAGAATGA TTTAAAAAAG AGACTAAAGA AAATAACGAA 180
TCATGATGAT TACTTCCTAC TGCTTTTGAT AAAAGAAAAT AAAATAATTG GTTTAAGTGG 240
TATGTGTAAA ATGATGTTTT ACGAAAwAAA TGCAGAGTAT ATGAGAATCC TTGCGTTTGT 300
TATACATTCT GAATTTAGGA AAAAAGGTTA TGGAAAGAGA TTATTAGCTG ATTCTGAAGA 360
ATTTTCTAAA CGGTTGAATT GTAAAGCmAT AaCACTAAAT AGTGGTAATA GAGATGAnAG 420
ACATCTGCAC ATAAACTATA TAGTGATAnT GGGTATGTTA GCAATA 466
(2) INFORMATION FOR SEQ ID NO: 758: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 402 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 758:

CGTACAGCAT AACCAAATAC ATATCTGCCA AAATCGCCAT ATGCCCCAGT TCCTCTTCCA 60

TATTCOGTTG TCATTGAGTT TTTAATTTTA TCGATAGAAA AATGTAAATG GCTATCTAAT 120

TTGTTTTTTA AAGATTTTAT GTAATCtCTA TATTTAAATT CGAATATTCT TTTTTATACC 180
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AACTAATATA TwCAATTGTA TGCCTAGCAT AATCTCTCAT TAAAATATGA GGATATACCA 300 

TTTCTTTATC AAATATtTCT TCATAAATAT AATTAGCAAT CTCTATATGT ATTtCACAAA 360 

TGTCGATTCG AAGTGTCGCC CCTAAAACAG rAGCATATAA TC 402 

(2) INFORMATION FOR SEQ ID NO: 759: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 338 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 759:

AAGTTACTCA ACAGCCATTT GACACGTCTC GATTAGAGCG ATTAGGTATA ACCGAGCGCC 60

AAACTAAAGA CATGTATCGT TTATTAGGAC TAGCCAAATA TGAAGATCGA TTTGTTATTC 120

CAACATCACA CAAAGAAACC TATTTAGATA CGTATCACGC ACAAGGTAGT ACAGGATACA 180

ATTACGGCGG CGAGCATTTT GGAGATAACT GTGAAGGCTG TGGCGTTGCA GTAGGTTCAG 240

GGAAAACTGG TCAAGAAATT TATAATGGAG AATTCTATGG AGGGATTTCC GTGGTTAATT 300

TCGATAATTT AAAAAATATC CAAGAAAGTT TGGGTnnn 338 

(2) INFORMATION FOR SEQ ID NO: 760:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 760:

CTCTATTATA TTTAGAGTTA TAGCTATTGC aGTTGTTTAT TTATTGAAAA TAAATGTACT 60

ATCATTAGtT TTAGCAAGTG TATTAGGCAG TTTgGTATCt AGGCTACTAT TATCTATTAT 120

TTTAAATTTA CCTGTGTGGG TAGTGTTGTT AAaCGCGATT CCAGGCGTAA TATTCACTTT 180

AATTGTAGcT ATTCCTTTAT ATCTCACATT GAGAAAAAGA ATGGCAGTAT TACTAAGATA 240

ATAAATCAAA ACACGGTCGT CACAATTACT GTTGGCGACC GTGTTTTACT AGCTATTTAT 300

TGTTTTCAGT TTCTTTTGTA TCTAACAATT TCACTTTGTG ATTTTCCCAA TCAATTTCAT 360

ATGTTGaTTT AAATGTTCTA GTTTTAAAGT TTTTATAAT 399

(2) INFORMATION FOR SEQ ID NO: 761:
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(A) LENGTH: 639 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 761: 
GCATATnCAA AACAATAGCT GCTTTAGGTG CGGGGGCTAT TAGCGTTATT TCTGCACTAA 
CAGGAGGTTT AGTTGGCGCA GGTGTTGGTG GTTTCCTTGG ATCTATAGCT GCATCGAATA 
TTGATACTAG TAAGGGAATA TATATAAAAT TAAAAACTAA AAAGTATGCA GCCGGGGAAT 
ACGTTCTGAC AGGAGAAAAA TGGGGATATC AGTAAGGGTG ATTTTATGGA TAGATTAAAA 
TATTCACTTA AAGTTGGAAT TTTAGCATTA TTATTATTTT GTACTTTAAA TTATTTAGTT 
CCAATGCAAA GCAATGCTTT TTCAATAATT ATATATTCGG CAATTTTTGC TGTGTTACTT 
ATGCTTTTAG TTTATATATT TTTAGGAATT TTAAAGAAAT GACATGAAAC AAATTAGCAT 
TGGCTATGAA GAAATCTATG GGGATAGAAT TTTTCATAGC CATTTTTTAA AAGAGCATGA
AGTAAAAAAT TGGAAAACCG TCAATCAAAT AATTGAAAAA GAACACTTGG ACAAAAATGA 
ATATGnAAGC GGAACCGCCG TTATTTTTCT TTTGCAGGAT CGTTAGATTA GATTnCGGTT 
GTATAGTATA ATGAAAAATA TTATAATAAA TTTATAAAA 
(2) INFORMATION FOR SEQ ID NO: 762: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 762: 
nTGTTCGTTG TCTGGATCCA ACGTAAACAG TCGCTTCATC TAATACAATG ATAGGCGCAT 
CTTTTAATAT CATTCTTGCA ATAGTGACTC GTTGTTTTTC ACCACCAGAC AATTTATCTC 
CCACTGTACC AACATTCGTA TCATATCCAT CTGGCAACTT TTCAATAAAT TCATGACATT 
GTGCTAACTT GGcAGCTTTT TCAACTGCCT CATCCGTAGC TTCTGGATTG CCAAGTTTAA 
TATTTTCTTT AAAAGTAAGA TTTAATAAAA AGTTATCTTG TCCAACAAAA CCAACTAAAT 
CGTTAAGTTG TTTCGATTCA ATATCTTTTA TATTTATACC GCCAATCGTA ATTTCACCTG 
AAGTCACATC CCAGTATCGT GATATAAGCT TGGCAATGGT TGACTTACCG CTACCAGaTG 
CCCCGACGAT AGCTGTGAAA TTATTTTCTG GTACTGTAAA TGATAAATGC TTAAAGACCA 
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GAGGCTTCTT AAATTTTGTT GACAATACTA ACTCTTCTAG ACTTAATATT TGATTCACTT 
CAGTTAATGC GTATTGTATA GACTTTAAAT GATTTACATA ATTAGTAAAA TTCTTAATCG 
GTGCTACTAC ACCTAAAGAT AATACGATGC ATAGGAAAAA TTCCGCATAG TTTAATTGGT 
TGATAGATAT CAAATACATG CCGACCGGTA AAATCCCTAA AAATGTTGAA GGTAATACAC 
TAGCTCCTAA ATTCATATAC CCCCATGTAT TTTTAAACCA ATTCAAAGTG TGAATCTTAT 
AATTATCTAC TG 

(2) INFORMATION FOR SEQ ID NO: 763: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 852 base pairs 

(B) TYPE : nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 763: 
CTCTTCATGC GTCATACGTT GTGTCGATTG ATCAGTTGTT TTGTCTAAAT CACTAGCTTT 
AAATTTAGAT TGATTTGATT GACGTGTCGT AAATnGTTGT TCCTTTTGTT GGCGTTTGCC 
TTTTTTTCTT GATCTTATTA AAAATAAATT GATAACCCCA ACAATAATGA GCGCTAAAAT 
AATGTAGCTA ATAATGAAGG TCGTAGTCAT TTAATGACCC CCTTAATTTT TATGGATTTT
TACTTCAGCG TTCaTTCCaG GAACAACTTG TTTAGACGGT TcTGATTCTA GAGTGATTTT 
AACAGGTATT ACTTGAGAAA CTTTAGTGTA GTTACCATCA CTATTTGATG ATGGCATTAA 
TGAAAAgtTG CAGCAGTTGC TTTTCCAATA CTATCAACTT TACCTTTAAT AGAAGCTTTT 
TGACCGTCAA TAGTCACATC AACATCTTTA CCTACTTCAA CATCTTTAAT ATCTTTTTCG 
TCAATATTTG CTGTTACATA TAAATCATCT AAATTGTATG CATAAGCGAT TGGGTTACCA 
GCTTGCACCA TTGAACCTTC CATACCATCT AATTTGGCAA TTGTACCTTT TTGAGGCATT 
TTAAGATCCA TATCTTTCGT TTCGCCATCT TGACCTTGTA CAGTAACAAT TGCTACTTTG 
TCACCTTTAT CGAGTTTGTC ACCTTGTTTA ACATTAAGTG ATTTAATTTG TCCAGATGCA 
GGACTTGCTA TTTTAATTTG ATCGCCATTT ACTTTTGCAT TATCAGTTGT TACATAGCTT 
GTTGTTTTAT TCCAAAAATA AAAGCCAGCA ATCCCAATGG CTAACnGTAC AACAACCGTA 
ATGACATTnA AT 

(2) INFORMATION FOR SEQ ID NO: 764: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 
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75 



(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 764:

ACATATTCGA CATTCATTTT AAATTGCATA TAACTACATT AACCATTCAT TGTGCAGCGC 60

CTGTTGTTTA GATGATCATT TATTGACTTT TTGCTCTTAT nCCAnGCAAT TTTTTGATTT 120

ATTGCTTATA ATACATTTTA AGACAAATGT TTGTTTCAAC ATCGCATGCG ACATCATTAA 180

yTATTCATTT AAATTTATCC AGTTTCTATT TGTCTGCAAT TGCTGAAAAT TAAAGGAGCT 240

TATTTTTAAT AATGGAACAA ATTATCACTG AATTTATTAG CCGTTTTGGG TATGCAGCCA 300

TTTTTATATT AATTTkATkA GAAAATGTAC TACCTaTCGT aCCATCGGgA AATTATTCTG 360

ACATTTGCTG GTCTTATGTC TGTTAAATCA CATTTATCAA TTTTAACTTT ATTTATTATT 420

GCAACTATCG CATCGTTTAT AGGGCTGTTA ATTTTATATT ATATCTGCCG TTTGATTTCA 480

GAAGAACGTC TATATCGTTT TATTGATCGA CACGGTAAGT GGATTAAATT GAAAAGTAAG 540

GATTTAAAGC GAGCAAATGA TTGGTTTAAA AAGTATGGCG TATGGGCTGT ATTkATCTGT 600

CGTTTCATAC CTGTATTACG TGTAtTGATT ACCATTCCAG CTGGTGkaAA CCGCATGAAT 660

GTTGTGACaT TtACCGTTAT TTCATTAATA GGTACTACAA TCTGGAATTT CGGTTTAATT 720

TTACTAGGAC GCACTTTGAG CGATAGC 747

(2) INFORMATION FOR SEQ ID NO: 765: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 820 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 765:

TTTACCTTTT AACCAATCAT GCATTAATGC AATGTGTACT GTCTTAACAA ATGTTGCATA 60

TtCATCTTCT GCACTCCACT TTTTCAATTC CTGTTGATTA ATGGCATGTT TTTCTCCATT 120

TTCCAACATA TTGACATACA ATTTATAATG TTTTTCAACT GCCTGAATAA AAGCGAGTTG 180

ATTTTTATTT AAATGTGATT TTAACTTTTC TAAATCTGAA TTAATAAAGT CTTCTATAGT 240

CGAATAAGAC ATATCTTGGT ATTCAACTAC TGCATTAATT TCATCTAATA ATTGCGATTG 300

TGATTTTAGC GTTATATAAC TCTCGTCTCC ATAAGATATT TCTGTAGTAG CTGCTGTAGT 360

AGGATTTGGA GAACTGACTA AAATATGCTC GCCTAATAAT GCATTTATCa AACTACTTTT 420
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TGTTTCGTGG ATATCTTGCT TTGTACGCTT GAACAAAGGC ACATCCGAAA TTATATCAAG 540 

CGCTTTTTGA ATATCTATAG ACATATTTGT TGTAGCTGTA TTTTGATTTA GCTGCTGATT 600 

ATCACGATTA TCCTGAGCAT TATCAGTAGC CACTTGATAT GTTGTCTCTT GTCGACCTAT 660

TAATTTATCT AGAGATTCAT CTAAATGAAT ATAGTAGTGA CGATAATTCT TAGTCGTCAA 720 

TGACTGACGC AGCTCATTTA ATTCTGTATA ACGTTGATAT TCTTTTAAAT CATCACTTTC 780 

TTCTGTTGGn AATTCATCTG CCTGCACATT TTCTATnATT 820 
(2) INFORMATION FOR SEQ ID NO: 766: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 672 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 766:

TCAACATGAC ATTAATTTAA TAGCGATGCA TACAAATTTA GATGTAAATC CGTATGGTGT 60

CAATATGATG TTGGCGAAGG CGATGGGTTT GAAGAACATT TCAATAATAA ATAATCAACA 120

AGATGTATAC TATAAAGTTC AAACATATAT ACCTAAGGAT AATGTTGGAC CATTTAAAGA 180

TAAGCTTAGT GAAAATGGAT TAGCGCAAGA AGGTAATTAT GAATATTGTT TCTTTGA^AG 240

TGAAGGAAGA GGGCAATTCA AACCAGTTGG TGAAGCTAAT CCAACAATAG GACAAATTGA 300

TAAAATTGAA TATGTAGATG AAGTTAAAAT TGAATTTATG ATAGATGCAT ATCAAAAGTC 360

AAGGGcTGAG CaATTAATTA AACAATACCA TCCATATGAA ACACCGGTAT TTGATTTTAT 420

TGAGATAAAA CAAACATCCC TTTATGGACT TGGCGTTATG GCAGAAGTGG ATAATCAAAT 480

GACATTGGAA GATTTCGCAG CTGATATTAA ATCTAAATTA AATATCCCAA GTGTCCGTTT 540

TGTTGGTGAk TCTAATCAGA AAATTAAACG TATTGcAATT ATTGGTgGTT CAGGTATTGG 600

ATATGAAkAT CAAGCtGTCC AACAAGGcGC AGATGTCTgT TACGGTGATA TTAAACATCA 660

GATGCCTAGA TG 672

(2) INFORMATION FOR SEQ ID NO: 767: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 767:

GAAAAAGTTA AACCTACTGT AACTACAACA AGcmAAGTTG AAGACmATCa CTCTACTAAA 60

GTTGTmAGTA CTGaCACAAC AAAAGATCAA ACTAAAACaC AAACTGCTCA TACAGTTAAA 120 

ACAGCACAAA CTGCTCAAGA ACAAAATAAA GTTCAAACAC CTGTTAAAGA TGTTGCAACA 180 

GCGAAATCTG AAAGCAACAA TCAAGCTGTA AGTGATAATA AATCACAACA AACTAACAAA 240 

GTTACAAAAC ATAACGAAAC GCCTAAACAA GCATCTAAAG CTAAAGAATT ACCAAAAACT 300 

GGTTTAACTT CAGTTGATAA CTTTATTAGC AcAGTtGCcT TCGCAACACT TGCCCTTTTA 360 

GGTTCATTAT CTTTATTACT TTTCAAAAGg AAGaaTCTAA ATA 403

(2) INFORMATION FOR SEQ ID NO: 768:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 483 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 768:

TTCAATGCTC GTTTTGCTTG CGTTAGTAAT TTCTAACATA TTCATTCACA TTTTGGAGCC 60 

TAAACTATCA ATCACCACTC AAATCATCAT CGTTTTGATT TTAATTGAAG CACTAATTGG 120 

ACTGCGTTTC TTGAAAGCGT ACGATGTTAA GCGTGGCAAA GATAAAGAAA ATAAGAAAAA 180 

TAGTAAGGAT TTCGTTAAAC TAAAATCAAT TTTAGTAGCA ATTTTATTTA CATCATTGGC 240 

GCTGACAGCA GGTACTGTAG CTGATATATA CGGTTTCACT GACTTAGGAA ATACTAGAAG 300

TGATTTAATC GTTTGGAGCA TAGGTGGTAT TATATTTGGC CTCGTATGTT ACACAATGGA 360

AGATAAAAGA TAACGATAAG GAGCTGGCGA TTATAAAGCT AGCTCCTTTT TTAACTTaTA 420 

TATGTAAAGA aCTaTCCTAA GGGkTTTTAA TCATATGTCA ATAATTTCTA TAATACATTA 480 

TTA 483 

(2) INFORMATION FOR SEQ ID NO: 769: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 769: 
ATTATATGTC TGATGTATTT CATGTCGTTA AAAATAAAGG TACGGAAGAT TTTAAAGAAG 60 

GGATGATGCT TGAAGGTGAA AAAATCAAAG CTTTTTATGA AGATATGCCA CCGTATCAGA 180 

CTGTCAAAAA AGGAACGATA CAAATTAAGC GTGATGGCAC ACCTATTATC CTATTAAATG 240 

TCATTATACG CTnGTAGCTA CCCGCAAATC GGTACAATCG CAGTTATCAT TTACGAATTA 300 

GCACAAAACC GCAGGATCAC GTTGAATTCC ATTATAGTAT TTACGGCTGA AGACTGTAAG 360 

nTAnACTGGT AACCATATCA TGGATGATTG ATCATAGATG 400

(2) INFORMATION FOR SEQ ID NO: 770:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 750 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 770: 
ATCTCTTCTA CTTCTTTTAC AAATTGAGAA CGTCTTAAAA TGGCTTTCAC ACGGGCAACA 60 

ACTTCTCTAG GTkAAAAAGG CTTAGTCATA TAGTCATCGG CACCTAATTC TAAACCTAAT 120 

ACCCGATCAA ATTCATCATT TTTCGCCGTT AACATTAATA TAGGGACTAA ATTTTTATTT 180

GTTCTTACAG TCTTACATAC GTCAATGCCA TCTTTTTTAG GTAGCATAAC ATCTAAAATA 240

ATTAAATCTG GCTGTTCACT TTCTACCTTT TCTAAAGCCT CATCACCATC AAATGCGACA 300

ACAACTTCaT AACCAGCTGk TTCtArGyTA TATTTAAGTA ATGgTTaCGa TTGaATGtTC 360 

GTCATCTACT wCCaACACTT TTTGCGACAT GGtATGCCTC CCTAACttAk AATTATATTT 420

CATTATAACC GAACTATTTA TAAAAATAmC ATCCTACACA TTATCTTTAC ACATTTTTTA 480

CATTACTTTA CATATAAATA AAATACTTCT TATATTTCCT TCTmTCATTT GCATGACTTA 540

CTCTGGGACA ACGAAATAAA TTTTGTAAAA ATAGCATTTC TATCCCACTA CCTATGCATG 600 

AGTTTTACTC ATTTATTCCT AAGCTTATGT ACATATTCGC TTTGTCTAAT GTGTAAGAAA 660 

CACTACATAA TCAATCATTG GTGACTCTTT ATTATTTCTA TCCTGTTGCC AaCTTCAATT 720 

CATTTAAAAA GGCGAACCTA GCAATTAAAG 750 
(2) INFORMATION FOR SEQ ID NO: 771: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 771:

ATTTAATTAT TATTTTTCCA AATCAATACG ATATAATTTT ATGTTTTTAT CTTTTGGTGC 60

TGaGGCAAAT TGATAAGtCG TTTTGCCTTT TTCAATATAT CCAAAATTAC CAGCAACTTT 120 

GCTATATTCC GTATGTGTTA CTGCTTTGCC AACATAATTT TTAACAGCTT GATATGTTGG 180

ACCATTAGCA TCATTATACA TCATTGAAAT ATGAGTTACT TTACCATTGT TTTTCTTACC 240

ATCAGTAGTT ACTAATAACA TACCTTTTTT AGTATGGAAT TCGTAATAAT GTTCAGTTCC 300

ATCTTCATTG TAAGAATACA TCGGATTTTT GTACTTAGTT AAAACATCTT TAATAGATTC 360 

GCCAATTTTC ACATTTTCTA ATGACTGATC ACCCTTAATT AATTGTTTAA CAGTATCAAT 420 

AGAGTTGCCA CTTGCTGCTT CGGCACTAGT GTTCACTAAG CCTACTCCCA TTACAACAGA 480

ACATGCAATT ATACTTGCCG TTAGTAGCTT TTTCATAAGC ACTCTCTCCT TTTATTTATA 540

TCGTCTTGTG CTCACAACCA TTATACAACA ATCATTTCTA AAACAACAGT CATATTACAA 600

TTATATTACA AATAATAATT ACTACTTTTA TATTTCACTT ATCACTAAAA ATTAAACATG 660

CTTTTCATCG TGATGTTTGT TGATAAATGC AGCTGTATCT TCGACGAATC TTTCTTGCTC 720 

TTcAACAAAT GGGAATGCAC TTGATTCTTG ATATACTTCA AAGTCCGCAT TTTGGATTAA 780 

ATCAGCTACT TCTTTAGCTT CTAATCTTGT TGTTCTTTCA CCGAATTCTC CTGCAATAAT 840 

TTTTGTCGGT ACTACGACAT TTCTATAAGT TTGAGAAATA TCAGCGTTTT TGAATACCTC 900 

TTTAACTGCT TGAATTTCTT CTTTAGTTGA TATTTCATTT GTATCTACGA CATGTTTGAG 960 

GAATCGATTC ATTTTTCTCG GACGATAATA CTTACGCTTA TTTAAAAATT TAwCyTGtTT 1020

TTCaGGATCC CaGTTtCGAA TAATATGGGC ATaTTTTCTA AATAAACGTT CTTCCGGTAA 1080

TTCACCTTCA ATAGATGTTG GATTTACCAA CGTAAGTGAT GATGTAAATT CAGGATAACG 1140

TACTGAAATA TCCGCACCAA TGATTCCACC CATTkCATGG cATACAAATG CAACTTCTTC 1200 

AATATATAAA TATTTAAGTA ATTCAACAAT GTCATCAGAA AAATCTTTTA TTTCAATGTG 1260 

ACGAGGTTTA TCAGAATAGC CATGTCCACG TAAATCAATT AACACAACTT GAAATGATTT 1320

TGCTAATTCT GCTGCTAATT TATTAAACAC AGAATAATTA TCAAGTACAG TATGAATCAA 1380

TACGATAGGA TAGCCTTCAC CTAAnGTACT GTAATGTATC GATGTTCCAT CTTTTCTAGT 1440

AAATAGATCC ATAATTTTCT CC 1462 

(2) INFORMATION FOR SEQ ID NO: 772: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 704 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 772:

ATTATTATAA GGAGTTATCT TACATGTTAA ATCTTGAAAA CAAAACATAT GTCATCATGG 60

GAATCGCTAA TAAGCGTAGT ATTGCTTTTG GTGTCGCTAA AGTTTTAGAT CAATTmGGTG 120

CTAAATTAGT ATTTACTTAC CGTAAAGAAC GTAGCCGTAA AGAGCTTGAA AAATTATTAG 180

AACAATTAAA TCAACCAGAA GCGCACTTAT ATCAAATTGA TGTTCAAAGC GATGAAGAGG 240

TTATTAATGG TTTTGAGCAA ATTGGTAAAG ATGTTGGCAA TATTGATGGT GTATATCATT 300

CAATCGCATT TGCTAATATG GAAGACTTAC GCGGACGCTT TTCTGAAACT TCACGTGAAG 360

GCTTCTTGTT AGCTCAAGAC ATTAGTTCTT ACTCATTAAC AATTGTGGCT CATGAAGCTA 420

AAAAATTAAT GCCAGAAGGT GGTAGCATTG TTGCAACAAC ATATTTAGGT GGCGAATTCG 480

CAGTTCAAAA CTATAATGTG ATGGGTGTTG CTAAAGCGAG CTTAGAAGCA AATGTTAAAT 540

ATTTAGCATT AGACTTAGGT CCAGATAATA TTCGCGTTAA TGCAATTTCA GCTAGTCCAA 600

TCCGTACATT AAGTGCAAAA GGTGTGGGTG GTTTCAATAC AATTCTTAAA AGAAATCGAA 660

GAGCGTGCAC CTTTTAAAAC CGTAATGTTG ATCCAGTAGA AGTA 704

(2) INFORMATION FOR SEQ ID NO: 773: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 427 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 773:

GACCACGACA CCACAAGCTA TGAAGAATGC CyTTGTTAAG CCGCcTTTGa ATtGCATAGA 60 

GATGAAAATA CCAATATTAA TAAAGAAGTT ACAGAAAATA CCTTTTGTAA AAATATTCAA 120 

CCATGTTGAA TCAACAGTCT TTTTCTGAAC TAAAGCTGTT AAAGCTTGTG TCATTTCAGG 180 

TGTCATGACA TGCGCAAATT TCATTAAGAA AAATAACACA AACCCACCTA AAATATTTCC 240 

TAAGAAACAA TATAATAAAA TCCAAGTCAT CTTCTTAACA GAAACGACTT TATAATACCA 300 

GCCTACTGTA AAGTACATGA AGTTACTTGT TAATAATTCA GAGTTAGTTA ATACAACTAA 360 

AATCAAACCT AAACTAAACG CAATGGCTCC CATTAAATTG ATAAGTCCAT CTACGTGAGT 420 

ACtCGCA 427

(2) INFORMATION FOR SEQ ID NO: 774: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 774:
AGGTGAGCAT TATTAAATAT TGCCAAGAAA TCAAAACTAA AAACTATGCA AAAGATGTAG 
ATATATTATT TATTCTTGGC OGTGATGGCA CGGTTAACGA ACTTATAAAC GGTGTTATGA 
CACATGACCT TCAACTCCCT ATTGGCATTT TACCAGGCGG TACTTTTAAT GATTTTACAA 
AAACGCTAAA TATAGCACCT GAATCATAAA CAAGCTAGTG AACAAATGGA TTTCTGCACA 
AGTTGGTACA TACGATGTAA TTGAAAATGA AATGAATCAA TATGCACTCA ACTTTGTGGG 
CTTAGGTCTC ATGTTCAAAA CGCGGAAAAC GTACAAGAGG TTCAAAAGAT GTATTGGGTA 
AATGGAGTTA TATTGGGTCC ACCGTCCAAA ACnCGGCnAA 
(2) INFORMATION FOR SEQ ID NO: 775: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 484 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 775:
AATTATTCCT CAATAGGAGC ATTACCCGCT CTCTCACCAA TTCCCAAAAA TGTAGTTTCT 
ATTTGCTTTG CACCATTTAA AATTGCAGCT AATGTATTTG CAGTAGCCAA CCCTAGATCG 
TTATGACAAT GAGCAGAAAa AATTATGTTA GAATATTTTT TTACAAAGTA ATTAAAAATA 
TCTCCGTATT CTAATGGTGT CGAACATCCT ACAGTGTCCG CAAATGTAAC TGTTCTAACT 
TGATATTTTG AAATAATTTC CATGTATTCT TTCAATTTTT CTCTAGAAGT CCTTGTTCCA 
TCTTCAAAAC AAATATCTAC TCCTTTTTTA TCTTTCTTTA ATATATCCAA GCAGTCTTTG
ATTTTCTGAA TATAATATTT ATTTGAAAAA TTAAGCTTTT CTTTTATATG CAAGTCAGAT 
ATTGGCAATA GTATTTtTAC CACTAAATTA TGAATTTtCA ATtTGACTAT TTTTgGTATG 
TCnT 

(2) INFORMATION FOR SEQ ID NO: 776: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 788 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 776:

CGCAAGATGA AG7TAAAGAA CTTAATCGCT TATTAGGTAA AGTCATTCAT GCATTTGATG 60

AAACAAAGGA AAAATAATTA ACTTTTGTCA TGACAATTAA AGTAATGTTT AGAATTTATT 120

AAGAATAGAA AAACAATTAG CACGCGTAAC TTGTTAGTnA AAAAACTGCT TGAAAGGTTT 180

CTTAGCCTAT CAAGCAGTTT TTTTATGCAT TATATTGAAT CATATTCCAT AAAGCACCTT 240

GATTAGCTAG TAATTGTTGA TAGTTTCCCT TTTCAACTAT TTTACCATTT ATCATCACAA 300

TGATGGTCTC AAAACGTGAC AATAAAGTTA AATCGTGTGT AGCAACAATT AATGTTTCTG 360

CATGTGCTTC AATTAAATCC ATAACTTTTA AACTATTTTG TTTATCTAAT GCAGTTGTTG 420

GTTCATCTAA AATCCATGTT GATGCAGTAT CTTTTAATAA CATCCTCGTA ATCGCTAAAC 480

GCTGAATTTC TCCGCCAGAT AATGTATGAC CATCTAAGTC AATTTGACGT tCTAGTGCCA 540

AATGTTCTAA ATCTAATTGC TTAAATATTG CTTGCACCGC TTCATCTTTT TCATCGGTAA 600

ATAAATTTTG ACGTATTGTA CCATCAAATA ATTGTTGAGA TTGTAGCAAG ACATTTAACG 660

ATTCAAACTT ATCTTTGTcA TCTATTTCAA ACATATCCAT ATTTTCGAAA CGAACAGAGC 720

CACTATCTAA TTGATATAAC CCTGCCATAA TTTGTAGTAA TGTACTTTTT CCTGAACCAG 780

AAGGACCC 788

(2) INFORMATION FOR SEQ ID NO: 777: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1145 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 777:

TTACAAAAGA AATGCAACAA AATTTTTGAA TCATTACATT TTTTTATAAA AATTTCACTT 60

TAGATTCACA ATAATTACTT ATTTTGTCAA TTTATTTAAT GTCAATATGT TGATTAATTA 120

ATAGTGTTGT CTAATGTATA TAATATTTAG GTCATCGTTA TAGTCAACAA TAATAAGGTA 180

TTTCGAGTTG AAATTTATCT TATTATTTTT CCACTTTTAC GTGCTATCCC ATTACACAAA 240

AACAAATAAG TAAAGATATT AAGTACAAAA AAGTGGAACA CCTTGTAGAT GCTCCACCTC 300

AATTATATTA AGTTATATTA TTTTGCTGCT TGGTATAATT CATCAACTTT TTTCCAGTTA 360

ACAATGTTCC AAAATGCAGT CATATAGTCT GGACGTTTAT TTTGATATTT CAGATAGTAG 420

GCATGCTCCC AAACATCAAA TAGTAAGATT GGTGTTTTGC CTTCTGTTAA TGGATTATCT 480

GATCCAAATA ATGTTGTTGC TTTATTTGCA AATTCATTTT TAAATTCATC TAAAGTGCCC 600

CACTGCGcTT TGATGTCATC TATTACGCCA CCTTTTTCTT CAGAATTAGG TGATAGTATT 660

TCCCAGAATA ATGAATGGTT AAAATGACCA CCGCCATTAT TACGGACTGA CATCCTCATC 720

GCTTCCGGTA CCTTGTCTAA GTnAGCAATC ATATCCGCTA GTGATTGATG CTCTAACTCT 780

GTTCCTTCAA CTGTTGCGTT TAATTTCGTC ACGTACGTAT TGTGATGTTT GTCGTGATGA 840 

AACTCCATTG TTCTTTGATC TATATATGGT TCCAATGCAT CATATGCATA TGGTAAATTT 900 

GGTAATTTAA ATGCCATAAG TATATTCCTC CTTTTATGAA TATACTTTTA TAATAATTAA 960 

TTTTGGGTGT GTTTTGCAAT AATTATTATT TATTTGTTAT ATCCAATTAT ATAAAATTTA 1020 

AATATTCGAA ATTGATATAA TATTTAATAT TGAATGCAAA AAGGCATTTA ACTGCTTTTG 1080 

TTTCCCGCTT TTAAAGAGAA TTAAAAAAGA CAACTTCCAT TTTTCAATAA GAAATTGCCT 1140 

TTTCT 1145 
(2) INFORMATION FOR SEQ ID NO: 778: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 902 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 778:

AAAAGAGGnC CAATATTATC AATATACGGC CGGAAGATTC CCTTTTTCTT CTCCACTTAC 60

TAAGCCCATC AATGGTGTAT GTTATCaATT GGTGCTATCC GCATTAAATA ATTATCTTCA 120 

ATTGTGGTAT CAAAGCTATT CGCTTTTTCT CCAATAATGT CTTTACCATA TTTATTGAAT 180 

GCACCTACTA AAATCAACAT AAATGGAATT AACAAGGTTA CTACAATTGC CATTCTTATA 240 

AATTGTGGTA TATAAGAAGG TCGCCAATTT AAAGCTTTAA TACCTTCTTw TCTAAATGGA 300 

GTTTCTATAA AGCGATATGA TAGCTCTGCA AATATAATTG TTAAACTTAT ATCTATAAAG 360 

TACACATATA CAGGTATCTG TCCGTCTACA TAGTAACTAT GTACGAAACT AATTACTGCA 420 

AAATGCCATA AATATAAACT ATAAGACCTT TTCCCGATAA ATACTAACAC TGGATTTGAA 480

AATATCTTCG CtATCCATGT AGATGGATGA ACGACACTAG CAATAATAAA TAACGTTAAT 540

ATGGATATTA AATAGAAACC ACCATCATAT ATCCAATTCG TCTCATCATT AATAATGAAA 600

AATAATAATA TAAGTACTAT AAATGATAAA CTACCTATGC TATCAATAAC ATATTTTACA 660

ACTTTAGGTG GATCATTTTT CAATTTAAAC GGTGGCCATA AAAAAGCTAA AATAACACCC 720 

ATACTATAGA TAAACATCAT TAACCCTAAA GAAATTATTG ATACTCCCCA AAAAATAAAT 840

CCTATTTTGT ATCGCTTTTT AATTGTTAAC AATAATGTAA CCAAAATAAC TGGGAAAAAT 900

AT 902

(2) INFORMATION FOR SEQ ID NO: 779:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 779:

GGAATTAGTT TCCTAATGAT CAAACCATAA CCATTAOCTT TTAATGATGT TACGCCTTTA 60

AAGATTAGAG GATGATGACT AATAATAGTA TTATAACCTT TTTCGATTGC TTCATTTACT 120

ACTTCCAACG TACAGTCTAA TGCTGTTAAA ACACCAGTAA CTTCAACATC TTCATCACCT 180

ATTAACAATC CTACATTATC CCAAGATTCA GCAGTACTAA ATGGCACATG ATGATCTAAC 240

AATGTCATTA AATCAGCTAT TTTCATaCTA TAACACCCTT TCAATTACaG CAATTTCGTC 300

aTTAATTTGa GCTAAAcGTT GakGAkGTTG TTCAGTAwTG AGtTTCGaTT TAAtATGATA 360

AAgTGCyTCt AACTCtCTTT GcCATTTtTT TATAAAATAT 400

(2) INFORMATION FOR SEQ ID NO: 780:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 557 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 780:

GTTCCTGTTT TAACATCAAT ATCTATCTTT TTAATATCTT GAACATAAAG CAGATTCAGT 60

TGGCTAATAT TTGAATTTAT GTTAATCACT TTTTTCGTTC CATTTTTAAA ATGTACTGTG 120

TAAGTnGCTT TTTTTGCATA TTTTAATTCA CGTTCCCCGA TACCTCTATC ATGCTTCAAT 180

ACTGaTTTAA CTTTTTTAGA GATATCTTTG TAGCCTACAC GTGGGTCACC TGTAAATTTT 240

AAATCTGaTA AAATTGGTGT TGATGTGCCA TTTACTGCAA TTGTGTATGG TACATATCTG 300

TCTGCTTTCG CTTTAGTTCC TGTTTTAACA GTAATTTCAA TTCTCTTAAC ATCTTTCGCA 360

TGAACTAAGT TTGCTGTGTA ATTTTTTGAA TTCAACTGTA AGATTCTTTT ACCACCATTT 420

CTATCATGCT TCAGTACTGA TTTTATTTGA CTAGTTAAAT TTTTGTAATT TGTCCAAGGT 540
TTATnTGAAA ATGAAAG 557 
(2) INFORMATION FOR SEQ ID NO: 781:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 781: 

TTCTGATAAC ACAACTTTCT TGGACCAAGT GACAATGGTA AAGTTGATAT TCTCAAAAAA 60 

TATATTTTAT TGAAAAGGTA ACTATATCGA CCGCTCTGTG CGACTTGCTG TGTAGAAACA 120 

TCAAGCCCAG CTTGAAACTT CATATAAACT AATAACTTTT GATAAATGCT ATTCCCAATT 180

TGTTCATACT GCTCCAAGTT ATCATTAATT CTAAATTTAA TACTTACTTC ATTTTTAACT 240 

GGAATAAAAT GTACATCACT CGCTTTCATT TCTATCGCTT TATTAATTAT TTCTTGAAAT 300 

AGAATCTTCA AAAAAACACC TCCTACATAT AATCACGTAG GAGGTGTTTT TATTACTTCA 360

ATTTAACCGT GTAAAAATGG ATTTAATTGT TCATCATCAA CCGTCGTATA TGGACCATGT 420 

CCAGGGAATA AAGGTAAATC GCCTTCTAAT TCAAATATTT TATCTTGAAT AGAAT 475 



(2) INFORMATION FOR SEQ ID NO: 782: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 782:

ATGAATATCG GTGCAGGACG TATCGTTTAT CAAAGTTTAA CTCGAATCAA TAAATCAATT 60

GAAGACGGTG ATTTCTTTGA AAATGATGTT TTAAATAATG CAATTGCACA CGTGAATTCA 120

CATGATTCAG CGTTACACAT CTTTGGTTTA TTGTCTGACG GTGGTGTACA CAGTCATTAC 180

AAACATTTAT TTGCTTTGTT AGAACTTGCT AAAAAACAAG GnGTTGAAAA AGTTTACGTA 240

CACGCATTTT TAGATGGCCG TGACGTAGAT CAAAAATCCG CTTTGAAATA CATCGAAGAG 300

ACTGAAGCTA AATCCATGAA TnAGGGCATn GGCCATTTGC ACCGGGGCCT GGCCGTAATA 360

AGCCAAnGGA CCGGGCCAAC CGTTGGGACC GGGAAGAAAA 400

(2) INFORMATION FOR SEQ ID NO: 783:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 810 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 783:

GATTAATGTA CTCGTGTACA TAACGTTTGA AGTATGATAA ATAGATATAG AAATAAAGTC 60

ATATGTGACA TCAATTAAAT GATGTTCAAA TGACAAGATA CAATAGAGGA ATGTTTGTAA 120

TAnAAACGCT TCATATAAAG GTCGAGTCAA TATATGATAC GACTTTATAT GAAGCGTTTT 180

ATTTGCTATG AGCTAGTATA TTTTATAATA ATTTTTCTAT TTCTCTTTCG ATTTGAACAG 240

GTTTTTTTTG AGGTGCAAAT CGTTTAACAA CGTTACCyTC GCGATCCACT AAAAACTTAG 300

TGAAATTCCA TTTGATTTTC TCATTAAAGA ATCCGTGTTG TGCCGCAGTC AAATATCTAA 360

ATAAAGGTAA TTGATGTTCC CCTTTTACGT CTATTTTTTG ATGCATAGGG AAGGTAACAC 420

CATAGTTTAA TTTACAGTTT TGAGCTGCTT CTTCGCCTGA ACCAGGTTCT TGGCCACCAA 480

ATTGATTACA AGGGAAACCT AGAATTACAA ACCCTTGATC TTTGTATTTC TCGTATAATG 540

ATTGCAAACC TTCAAATTGT GAAGTAAAGC CACATTCGCT AGCTGTATTA ACAATTAGCA 600

TAACGTCACC CTTATATGCA TCTAATTTGT AAGTAACACC TTTATTTGTT TCTACTACAA 660

AATCATAAAT TGTCTCCATT GTATCATCCT TTCGATTTAC TTAAaATGTA CCAcAAAATC 720

GTGTAAtAGT CTkTACTArA ACTCTATGAT AGAATACTTT GAGTAGGATT TTATTAAGGA 780

GATGTATAAC ATGGnTCAGC AACAAATTCA 810

(2) INFORMATION FOR SEQ ID NO: 784: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 788 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 784: 
AAATTCATTG CAGAAAACAT AGATAAATAA ACAAATTGAC TTTAAACGAG CGTTGCAACA 60 
TATCTCGAAT TGTAAAGGAG CTTGAAAATG AATAAAAATA TAGTCATTAA AAGCATGGCA 120
GCATTAGCCA TTCTAACCTC AGTAACTGGA ATAAATGCTG CAGTCGTTGA AGAGACACAA 180 
CAAATAGCAA ATGCAGAGAA GAATGTTACG CAAGTTAAAG ATACAAATAT TTTTCCATAT 240 

ACCAATAAAC ATGTATCAAA AGATTATAAA GTTGGCGATA GAATTACTGC CCATCCAAAC 360

GGTGACAAAG GAAATGGTGG TATATATAAA ATTAAAAGCA TTTCTGATTA TCCGGGTGAT 420 

GAAGACATCT CTGTCATGAA TATTGAAGAA CAAGCAGTCG AACGTGGACC AAAAGGCTTT 480 

AATTTTAATG AAAATGTCCA AGCATTCAAT TTTGCGAAAG ATGCTAAAGT TGATGACAAA 540

ATTAAAGTTA TTGGTTACCC ATTACCTGCT CAAAATAGTT TTAAACAGTT TGAATCTACA 600 

GGAACTATAA AAAGAATsAA AGACAATATT TTAAATTTTG GATGCCATAC ATTGGAACCC 660 

GGGGAATTCA GGGATCACCA GTTCTAAATT CTAACAATGA GGTCATAGGT GTGGTGGTAT 720

GGGCGGGTAT TGGGAAAAAT TGGGTTCTGG AATAATAATG GGTGGCCGTA TACTTTACGC 780

CTCCAnAT 788 
(2) INFORMATION FOR SEQ ID NO: 785:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1023 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 785:

CAATATTTTA GAAACAaTAA TTAACTTGCG ATGAACAAAC TATTAACAAT AATCTTGAGT 60 

ATTATATTTA TCTTAATTAA TAATATATTT ATTTAAGATT ATGTTACTTC CAACTTTCAA 120 

AGTAGAAAAA CGGTATAATT TGTTGATGGG TGTTTATTGA TAAACTGCAA AAAATACAGC 180 

CTACAACCAC ATAGATTGTA GACTATATTT AAAATAATAG GTATTTATCA TATCTCGTAA 240

ACTTAGTGCC AATTTTTATA CTCGTGGTGC TGGTAAGCTA CCCTTAAATT CAGGAACGTA 300 

GTGTGTaGGG CTATAACTTG GAACAGCATA TTGATAATTT ACATTTTTGA TATTTAATGA 360 

TGGTTTCCCA ATTTTATAAC CATTTGATTG TGAAAATGAG AAATAkTTCT TCACACCTTT 420 

AACTACTTTA TAAGAATAGA AGTATTTATA GTCATATGCT TTATTTACTT TAGCATTTTG 480 

ATGTGTTGCT GTTGTGTTAT TTTGGAAACT TGGTACATGC ATACGATGTG AATTGTGACC 540

ATATGGTGGA ATTACCTTGA AACTATTTAT TTGTGGCACA ACACAAAAGT GATTAATTTT 600

AATGCTAGCA TGCCCTGGTG TAACAAATTT ATGCGCGTGA TATCCAGGAA CTGCAAAATG 660

ATGCTTGATA ATTAAAGATT gAGATGGATG TGTATATCTA GGCGATTCTG ATGGTTTAAC 720

AATAAAGTGT TTATTAATAG AATCCTTTGC ATGATTTACA TGTTTATGTA CATGTGTTGA 780 

TTTGTATGAA GTAATAACTT TCTTGTAGTG GGTTTGCGTA GTAATGAAGT GGTGGTTTAC 840 
TTGTTGTGAA GCTGGATTGT TTGTTGCTAC ATTCACGCGA TTATTGATTT CTTTGTACTC 960

TGGCACAATG TTTCCTAACT TTGATTCTGG nACGACAAAG TTTTTATCTA CAATTTTACT 1020 

AGC 1023 

(2) INFORMATION FOR SEQ ID NO: 786: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 487 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 786:
TCAAGTTTAT TCGCTTGTTG GAAAGCTTTA ACTTGATTTT CTAAAGCTTT ATCAAATTGC 60
GTTGATTCAT TATCAACTTT ATAACCTAAA GCTGATAAAC CAATTTTAAT AGTTTTAATA 120

TTTTTATCAT CGTCTCCAAC TTTAAATGTT TTCGTATTAG GAATGACATT TAAAGATTGA 180 
TATTTAGGTG TGTCAATAGT AACGTCTGGT TTAATGCCTT TACCGTGAAT ATAATGACCA 240 
TCTGGCGTTA ACCATTTCAT TTCAGTATAT TTTAACAATG AACCATCCTT AAACTCTCTT 300 
GTAGTTTGTA CGACACCTTT GCCGAATGTT TTTGACCCAT AAACTTTAGC TTTATTATAG 360 
TCTTTTAGCG CACCAGTAAA CACTTCAGAA GCGCTAGcTG AACCTTCATT CACTAAGATG 420
GATATATCCA TGTCTTTCGC TTCTTTTAAC GCATCATTAG AAGTtGAATT GCTCAGTACT 480

TTACCTT 487
(2) INFORMATION FOR SEQ ID NO: 787:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 803 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 787: 

GnAAACnATA TTTAnAATTA AGTGTTCGAT TTGCAGTAAA TTCATACAGA AAAGCTAAAA 60
TTAAAATGAT ACaGTGGAAT AATGTCACAT TCTCCATAAA TAATTGACCG AAGAAGCATA 120
AAACATAGAA TAGTACAGTA ACACTTACCG GCTGCTTTTC TTTATATATA ATTGCATTAA 180
TACAATAATA AATAATAAAG AGTGAAATTA GTGTTGATGT CGCATAATTA TAAAATCCTG 240

CAAACCAGCC ATATGTATCT GCATAAATAG CACTTGGTAA AATTAACATT AAAGAAAATG 300 

TGCCCATACT AATGAGGCCA TATGAAAGCC AACGTAACCA GCTTACATGT ACAGCTATAA
TTTCAAAGAT ATTTCCGATA TAGCGACCAT TAAGTGATGC AAATCCTACT TTTAAAATAT 
CAGTATTATA ATTGCTAAAC CATTGTAAAT CATCGTGCAT GAGTGGTAGT AAGATACCCA 
TAAAAGTATA AAACAATAAT ATCGCAATTA ATATCAAAGT TGTCTTGTGT AATTGAATTG 
TTTTCACTTT GCTAATCCTC AAATCTAGTT AAATTTTCCT CAACTTGTAG GTCGAAAAAT 
TAATTCAATA TTTTTAATGT ATTTCTAATT TTCACCTATG CATGTTTCCT CAATCAAATT 
AGATAAACAA GGTATTTAAT ATTACTTTCA ACAATTTATC TAAATCGCCC CTCGTCTTTT 
TCTATGaCGA ATGATTACAC TTG 
(2) INFORMATION FOR SEQ ID NO: 788: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 788:
GAAAAGTTGT ATTATGAATA AAGTTAATCA AGGTGCTCAG GAAGAGGCAA TGGAAGAGTT 
ATTAGTGACT TTTCAAAAAT TGATTAAAGA CTAAAGGAGT TTAAGATGAT ACATCAAAAT 
ACGATTTACA CAGCGGGAAT TGAAACAGAA GAACAAGTAA GTCAATTGAC AGAACGCATT 
TCAAATATGA TAGGTGTTCA TCAAGTGAAT ATTAATATAA TAGATGGTCA AGTAACTGTA 
TCGTATGAGA CACCAGCAAA TTTGAATAGT ATTGAAAAAG AAATCTATGA TGAAGGATAC 
AAAATTGTAT TTTAGGGTAT AATGTAGAGT GCGCTATAGA TTTTAATTTT GAAAATAAAT 
TAAAAATTTT GTAAATGATG TAGTAAAGGT ATGTCGAATA 
(2) INFORMATION FOR SEQ ID NO: 789: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 762 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 789:
GTTTAAAGAT GGAGAGATTA TTGATTTTTC AGCTGGAAAA GGTGAAGCGG TATTGAAAGA 
TTTAATCAAT ACTGATGAAG GTTCAAGAAG ATTAGGTGAA GTAGCATTAG TACCTGATGA 
ATGTCATCTA GCGATTGGAT CTGCTTACGC CTTTAATATT CAAGGTGGAA CGGAAATGAC 240 

TGTTGAGGAA AAGATTGCAA GCGGATTAAA TGATTCAAAT GTACATGTCG ATTTCATGAT 300 

TGGTAGTAGT GATTTGACTA TTTATGGCAT ATTCGAAGAT GGTTCAAAAG AACTAGTATT 360 

TGAAAATGGA AATTGGGCAT CAACATTTTA ATAAATGTTA TTTTGAGGTG CTGAGTAGGA 420 

AATGAAACAT GTATTTAAAG GTCAAATACG TGATTAAAGT ATAGATTGGG AGATAAAATA 480 

ATGACAAATC AGGACAGACC AATGAAATCT ATGTCAGAAT CAAAATGTTA TAAAAATAGA 540

CAAGTTTTCC CTCAAGATAC GAATCACCAT CATACAATGT TTGGTGGTAC ATTGATGGCT 600 

AATATTGATG AAATTGCAGC AATCACAGCT ATGAAACATG CTGGTGCGCA AGTAGTTACC 660

GCATCTACAG ACTCAGTAGA TTTCTTAAAG CCGATTAAAA CAGGGGACAT ATTACAATAC 720 

GTAGCGATGG TTTCATACGC TGGGACTAGT TCAATGGAAG TG 762 
(2) INFORMATION FOR SEQ ID NO: 790: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 790:

TTTAATTTTA TCTTTTGGAT ATCACATAGA TGCTACTTTT CTATACTGTT CTAAATCTAT 60 

TATTTCGTCA CATTGATTTG CAATAGCCAT ATCATGTGTT GCTAAAATCA GCACTTTATT 120 

TTCATCGACC AAACCAAATA ATGATTGAAT AATCATCTGT CCTGTTTTAG GATCTAACGC 180

ACCCGTTGGT TCATCAGCTA ACATAACAAT CGGATCTTTT AACATCATTC TAATTAAAGC 240 

GACACGTTGT TGTTCACCTC CACTTAGCGT ATGAACTTTT CTTTTTAAAC TGTTTGACAG 300 

ACCAAACTGT TCtATATAAC GtATCTttAT TTGTnCTTTT TCTTTCTTAC TTATTTTTTT 360 

ATATGCTAAT CCAATATCTA AATTTTCATT TACTGnCAAA 400

(2) INFORMATION FOR SEQ ID NO: 791:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 791:



AACGTAAAGC GGAAGTTGGG ATGGTTGAAG ATGCTGAATT TCATGCGCAA CTACATCGTT 120 

ATAATGCTTT TCTAGAGCAA CATCAAGATG ATAAAGTGTT GTATTTnGAA ATTGGAATTG 180 

GTTATACTAC ACCACAATTT GTGAAGCATC CTTTTCAGCG TTGACACGTA AAAATGAAAA 240

TGCCCTTTAT ATGACGATGA ATAAAAAGGC TTCGCATTCC GAATTCAATT CAGGACGTAC 300 

CATACTTTAA CTGAGATTCT CAACTTGTTC AGCAGACTCC GGACGCAGCA CACGAAATAC 360 

ACATGGGGGC AGAGTnACTT ATGGACCGTT GAATGnGATA 400 
(2) INFORMATION FOR SEQ ID NO: 792: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 678 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 792:

GTTCCGTTTG ATACAGAGAA GCGAATGTAA GCATAATCTT TAACAGTATC GTATGATAtA 60

AtTTAATTGG CAACTTTTTG TCACCTTCAT AAACTTCAAA TTTTCTCCAA AATTGACCTG 120

ATTGTAATCC TAATTCAATT TCTGGTTTTG AATCAGTGAA AATAACTCTA GCAGGTTTAA 180

CAGAACTTGC ATAATGATAA AACTGTTGAG TTCCATCTTT CTTTTTCATT TCAAAATCAA 240

TTGGACGAGA GTTTGGTGCG CTATGATCTT TGTCTTTTAT TGCAGGGTTT TTAATCGCTT 300

CTCTAAGTTC CTGaTTCAAA ATAGGATATG TATTGTTAgT GGCyTTTGCT GCTGGTTTAA 360

CTTCTTTTGT TTCCTTAGGG GcTTTAACTT CTTTAACTTC TTTAGCTTCT TTTGTTTCAG 420

AAGTAGGGGC CTCAACTTCT TTATTAGATA CTGAGACAGC ATTAGCTACT GGTTTAGTTT 480

CTGGAGCTTT TTCAGATGTT GTTGTTGGAC TTGCAACTGC TTCAGTTTTT GGTTGTGCTT 540

CTGTATTTGT ACCACCTGTT TCTTCAGCTG CTGCTTGTGT TCGCCATTTG ACATTAATAA 600

TAAAAGTGTA CTAATTGCTA CAGATGCAAC GCCTAGTGAT GACTTTCTAA TTGAATAAAA 660

TGATTTAAAT TClTlTlXi 678

(2) INFORMATION FOR SEQ ID NO: 793: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 351 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 793:

ACTGGCGGAT CCACGATGGC ATGTGTTAGT GAAGCAATTC ATTTATTACC ATATAATGTA 60

TTCTTCGTAC CAGCCAGAGG TGGACTAGGC GAAAnTGTTG TCTTTCAGGC AAACACAATT 120 

GCAGCCAGTA TGGCACAACA AGCTGGCGGT TATTATACGA CGATGTATGT ACCTGGATAA 180

TGTCAGTGAA ACAACATATA ATACATTGTT GTTAnGAGCC ATCAGTCATT AAACACTTTA 240 

GGACAAAATT AAACCAAGCA AACGTTAATA TTACACGGGC CTTTGGTGAT GCGCTGGAAG 300 

ATnGGCGCCT CGGACGTCCA ATCACCTGGA AAAGGTCCTT GGAACCACTT C 351 
(2) INFORMATION FOR SEQ ID NO: 794: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 794: 
GGGCGAGGCG TTCGCGCACC GGCCGAGCAG CAAATAGGCC GGGATGTCGC GTCTTGCTGC 60 

AGGTCCGAAA AATATCAACT TGAAGAGGTA ACACCGnTAG ATATGTTTCC ACAAACAACA 120 
CATGTTGAGA CAGTGGCATT ATTCAATTTG AAATAGCGCA CATGATTTGA GATAAAATTT 180 

CGCCTTAATA AGTGAAGAAA GAAGTCGTAT ATTAAAATTT TATTACAGCC AATTTACTTT 240

CTTGATATGA ACTTTTTAAA TTAAATGTGA CATTGTATAC TATATTTAAA GAAGAATAAG 300 
AATGTCATGA TCGGGAGGGT TGGTAATGCA TAAAATTGAT TTAACGACAA ATAATTTTCA 360 

AATGCGAGGT TTATTATCCT GGCAACCTGG TTATTGCGTT 400

(2) INFORMATION FOR SEQ ID NO: 795: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 407 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 795: 
ATGTTCCAGG AACGTATTTT TACAACCGAC AATTAGCATA TGTCATAATG AGTTTTATAA 60 
TTGTATTTTT TATTGCATTT TTAATGAATG TTAAATTACT GAGTAATATT AAAGTGCAAA 120

AAGGTATGAT TATAACTATC GTCTCACTAT TATTACTGAC GTTAGTAATA GGTAAAGATA 180 
TTAATGGTTC TAAAAGTTGG ATAAACTTAG GATTTATGAA CTTACAGGCA TCTGAGTTAT 240 

TAAGTAAACC AAAATTAATt TTAAGTCCTA TTGTATTAGC ATTAGGTTGT ACGTTTTTAG 360
TTTTCCTACA AAAAGACGTG GGCAACATTA CTAATATTAA TllATnTT 407

(2) INFORMATION FOR SEQ ID NO: 796: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1053 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 796:

CAGTAGGTAC ACTTGCAGCA TTTGTTGGAT ACTTAGAGTT ATTGTTCGGG CCTTTACGTC 60

GTTTAGTCGC ATCATTTACA ACTTTAACGC AAAGTTTTGC TTCAATGGAC CGTGTATTCC 120

AATTAATTGA TGAAGATTAT GACATCAAAA ATGGTGTTGG TGCTCAACCT ATTGAAATTA 180

AACAAGGTCG TATTGATATT GATCATGTTA GTTTTCAATA TAACGATAAC GAAGCTCCAA 240

TTTTAAAAGA TATTAATTTG AGTATTGAAA AAGGAGAAAC AGTTGCTTTC GTAGGTATGA 300

GTGGTGGTGG TAAATCAACA TTAATTAACT TAATACCGAG ATTTTACGAT GTAACTTCTG 360

QGCAAATTTT AATAGaTGGT CACAACmTTA AAGaTTTTTT AACGGGAAGT TTAAGAAATC 420

AAATAGGATT GGTGCAaCAG GATAATATTT TATTCTCTGA TACAGTTAAG GAAAATATTT 480

TaCTTGGTCg TCCaACAGCa ACAGATGAAG AAGTAGTTGA AGCGGCgAAA ATGGCTAATG 540

CACATGACTT TATTATGAAC TTGCCACAGG GATATGACAC TGAAGTAGGT GAACGAGGTG 600

TTAAATTATC AGGTGGTCAA AAACAAAGAT TATCGATTGC TAGAATATTT TTAAATAATC 660

CGCCAATTCT TATCTTGGAT GAAGCAACAA GTGCACTTGA TTTAGAAAGT GAATCCATTA 720

TTCAAGAAGC ATTAGATGTG TTGAGTAAAG ATCGAACGAC ACTTATCGTA GCGCATCGCT 780

TGTCCACTAT TACACATGCT GACAAAATTG TCGTAATTGA AAATGGACAT ATTGTTGAAA 840

CAGGTACGCA TCGTGAATTG ATTGCAAAAC AAGGTGCTTA CGAGCATTTA TATAGCATTC 900

AAAACTTATA AAGTATTAGT TGTTTGACTT CAGTACAATC TTGAAGAGAA AATTTGTAAC 960

AGGATGGTGG GGTCATAACA TAGAAAAAGC AGTAAGAGAT TTTCTTAGTT GAAAATAATC 1020

TTnCTGCTTT TTTAAATTTA ATTTCGnGAT TCA 1053


(2) INFORMATION FOR SEQ ID NO: 797:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 544 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 797:
AAGAGTGACT CCTGAAGTGA AAGAAGGGGA CCGTGTCGTG TTCCAACAAT ATGCTGGTAC 
AGAAGTTAAA CGAGATAATG AAACATATCT GGTATTAAAT GAAGAAGATA TTTTAGCAGT 
TATTGAATAA TACAGAACTT AATTCATAAA TAAATTAAAT AGAACGAAAA TGAAACACAA 
CTAAACAAAT GGAGGTTTAT CATTTATGGT TAAACAATTG AAATTCTCTG AAGATGCACG 
TCAAGCAATG TTACGTGGTG TTGACCAACT TGCAAATGCA GTTAAAGTAA CGATTGGTCC 
TAAAGGACGT AATGTTGTAT TAGATAAAGA GTTTACAGCA CCTTTAATTA CGAATGATGG 
TGTGACGATT GCyAAAGAAA TCGAATTAGA AGATCCATAT GAAAATATGG GGGCTAAACT 
AGTTCAAGAA GTCGCAaATA AGACAAATGA AATTGCTGGT GACGGTACGA CAACTGCAAC 
AGTATTAGCT CAAGCAATGA TTCAAGAAGG CTTGAAAAAT GTTACAAGTG GTGCGAACCC 
AGTT 

(2) INFORMATION FOR SEQ ID NO: 798: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 372 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 798: 
CAATTGCATA GATATTGCTG ATAGACGTAC GGCTTTGTTT ATCAACTTCT AATAATCCAC 
GGTCAGCGAA TTTAACACCT AATTTCGTCT GTGTTTGGAC GACGACCTAC AGTTACTAAT 
ACATAATCAG CTTCGATTGT TTTCTCTTCG CCTTTAGCTT CATAAGTAAC TTTAACTCCG 
TTATCTGTTT CTTCAGCTGA TTTAGCCATA GCTTCAGTAA CGATTTCAAC ACCTTTTTCT 
TTCATACCTT TTTTAACAGG TTGTGTGCAT TTGnTTTTTC GAGGCCACCT AAGATATCTT 
TAGCACCTTC AAGGGATGGG GTACTTTCTG nAACCAAAGT TAGCAAATGC TGTACCTAAT 
nCTGGTCCAA TG 

(2) INFORMATION FOR SEQ ID NO: 799: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 799:
AAACGACATT TGTTTAATTG GAAGAATTaG CACCATTTTT TGAGGCGGGT ATAGAtTCAT 60
TTAAAATCGA TGGTATTCTA CAAACGGAAG AATATATTAA TGTGGTAACG GAACAGTATC 120
GTCAAGCGAT AGATTTGTAC AATGAAGATC CTGAAATCTA TGAAGATGAG AAATTTATGT 180
TGATGGATCC AATTGAAGAA ATTCAACCTg ATCATCGACC ATTTGACGAA GGTTTCTTAT 240
ATAAACAAAC AGTATATTAA GGAGGTTAAT CATGAAGACA ATAGAAGAGA TTAAATCAAC 300
TCCTAAAACA GTTATGAAGA AACCAGAATT ATTAGCACCT GCTGGAAACT TAGAAAAGTT 360
AAAAATAGCA GTACATTATG GCGCTGATGC CGTATTTTTA GGTGGTCAAG AATATGGATT 420
ACgTTCaAAT GCTGaTaATT TCaCGATGGA AGAAATAGCT GAAGGTGTTG AATTTGCGAA 480
CCGTTACGGT GCCAAAATTT ATGTTACGAC AAATATTATT GCACATGATG AGAATATTGA 540
AGGTCTAGAA TCATATTTGC GTAATTTGGA AAAGACTGGT GCGACAGGTA TCATTGTTGC 600
AGATCCTTTA ATTATCGAAA CATGTAAAGA AGTTGCGCCA AAACTTGAAA TTCATTTATC 660
TACTCAACAA TCACTTTCTA ATTACAAAGC TGTAGAATAT TGGAAAGAAG AAGGATTGGA 720
TCGTGTTGTA TTAGCACGTG AGACCGGCGC GATGGAAATG CGTGAAATGA AGGAAAAAGT 780
AGATATTGAA ATCGAAGCAT TTATTCATGG TGCTATGTGT ATCGCCTATT CAGGTAGATG 840
lALAi J.AAGT AATCATATGA CTGCAAGGGA TTCCAACAGA GGCGGTTGCT GTCAAAGTTG 900
CCGTTGGGAT TATGAATTAT TAGAAGTTGA TGATAATGGT GAACTTGATG TTTTTTATAA 960
TCAAgGTGAA GTTACACCGT TTGCGATGAG TCCTAAAGAT TTAAAATTAA TCGAATCAAT 1020
TCCTCAAATG ATGGATATTG GTGTGGACTC ATTAAAAATT GAAGGACGTA TGAAGTCAAT 1080
TCATTATATT GCAACAGTTG TCTCAGTATA TCGTAAAGTC ATTGATGCGT ATGCGGCAGA 1140
TCCTGACACT TTAAGATTAA TCCGGAATGG TTAATAGAGT TA 1182



(2) INFORMATION FOR SEQ ID NO: 800: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 989 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 800:

GTATTCTTCC AATAACACCT TTCATTTGGA AACCGTTTAG AAAGAATACA GTTACTTCAG 60
TTTGGTTTGC TTTAAAATTC TCTaGTGCTT TGTCTTGGAT GTTTTCGtTG CAATCATCTG 120
TTTTCTTTAT CTAACCAATG AACACTCATT TTATTCTTGA ACCATGTCAT TTGTCGTTTT 240
GCATATTGGC GTGAATGTTG CTTTAAATCA TTGACAGCAT CTTCATAAAT CATTTGTCCG 300
TTAATCACAG GTATTAATTC TTTATATCCA ATAGCTTGCA TACTTTGGCA AGATTCATAG 360
CCTTGTTCAA CAAGTTGTTG CACTTCTCTA AATAATCCGT GATCCAACAT AATATCAACA 420
CGTTTATTTA TTCTTGAATA TAATGTTTTA CGCGACATTT CAATCCCTAA TAATAATGTA 480
TCATAATTTT CAGTAAATTG TTGCACTTTC TTGCGATTAC TCAAAAGTTT TTTTGTTTTT 540
AAATAATATT CAATAGCGCG CAACACTCTT TGGcGGTTGT TAGGGTGAAT ATTTTCTGaC 600
AGAAACCGaC ATCAAATTGA GCTAAATAAT CGTGTAGTTG CTGATTATCT AAATGTTCTA 660
ATGCAGATAA CTTTTGTTTA ACTATGGATA ATTGTGCAGG TGTAcTGTTT CATCTTCTAA 720
TTCATAATTA TATATTAATG ATTGAATATA TAAGCCTGTT CCACCTGCTA TGATTGGaAC 780
TTTACCTCTA TTCGTt ATAT CAGTAATTAA ATCTTCTGCT AATCGCTTGA ATTCATATGC 840
TGAAAATGTA TCATCAGGAT TCAAGATATC AATTAAATGA TGTGGAATAC CATCCATTTC 900
TTCAGGTGTn ACTTTTGCAG TTCCAATATT CATATGTCTG TAGACTTGCA TAGAGTCACC 960
GCTTATGATT TCACCATTGA TACGCTTCG 989



(2) INFORMATION FOR SEQ ID NO: 801:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 543 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 801:
CGTTAGATAT AGAATTAAAA GATAATATTA AATTTCAAAA CGGTCAAAAA TTGACTGCAG 60
AAAAAGTGAA ATCTAGCCTT GAAAATAGCA TGAAAAAAAG CGACTTGGTC AAATATTCAT 120
TACCAATATC ATCAATTACC GCTAAAGGTC AAAAACTGAC AATTAAAACC AACTCCGCTT 180
ACCCTGaACT TGTATCTGAA TTAGCTAATC CTTTTATGGC AATTTATGAT ACAGATGCTA 240
AATCAGATGT TAATCAAACT CCTGTTGGTA CAGGCCCTTA CCAAATAAAA GATTATAAGC 300
AATCTCGAAA AATATCATTG TCGAATTTTA AGGACTATTG GCAAGGTAAA CCGAAACTTG 360
ATCATATTAC TGTGACGTAC CAAGAAGACG GCAATAATCG CGTCAGAAAT TTAGAATCTC 420
AAAAAGATGA TTTAATAACT GATGTCCCAG TTAATAAAGT TCAAGACATA GAAAATAATC 480
AAAATTTAAA AGTGTCAAAA GAATCTGGAT TTAGAACTTC TTTACTTATG TATAATCATA 540



(2) INFORMATION FOR SEQ ID NO: 802:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 1139 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 802:
GCGAATTCGT ACACACATAT ACACAAAGAT AATCATAGTT TTACATTGAA GCACATGAAA 60
GATAATTCAT TTAAAGGAAG GTATTATCAA TGACTAAAAA AATGGGATTA TTAGTTATGG 120
CTTATGGCAC ACCTTATAAA GAAAGTGACA TAGAGCCATA TTATACAGAT ATTAGACATG 180
GTAAACGTCC ATCTGAAGAA GAACTTCAAG ATTTGAAAGA TAGATATGAA TTTATAGGTG 240
GTTTATCACC ATTAGCAGGT ACAACAGATG ACCAGGCTGA TGCGCTAGTT TCAGCATTAA 300
ATAAAGCATA TGCAGATGTT GAATTTAAAC TATACTTAGG ATTAAAACAC ATTTCACCAT 360
TTATCGAAGA TGCGGTTGAA CAAATGCACA ATGATGGCAT TACTGAAGCA ATCACGGTAG 420
TACTAGCACC ACATTATTCT TCATTTTCAG TAGGATCATA TGACAAACGT GCTGATGAAG 480
AAGCTGCAAA ATATGGTATT                       TTATTATGAA CAACCTAAAT 540
TTATTGAATA TTGGACGAAT AAAGTCAACG AAACATTAGC TCAAATACCG GAAGAGGAAC 600
ATAAAGACAC GGTATTAGTT GTTTCGGCAC ATAGTTTGCC AAAAGGTTTA ATCGAAAAGA 660
ATAATGATCC ATATCCACAA GAACTAGAAC ATACTGCGCT TTTAATTAAA GAACAATCTA 720
ATATTGAACA TATCGCGATT GGTTGGCAAT CTGAAGGTAA TACAGGTACA CCTTGGTTAG 780
GGCCAGATGT ACAAGATTTA ACACGTGATT TATATGAAAA ACATCAGTAT AAAAACTTTA 840
TATATACGCC AGTAGGTTTT GTATGTGAGC ATTTAGAGGT GCTTTATGAC AATGATTATG 900
AATGTAAAGT AGTTTGCGAT GATATTGGTG CGAATTATTA TCGTCCAAAA ATGCCGAATA 960
CACATCCATT ATTTATCGGT GCAATTATTG ATGAAATCAA GTCTATATTT TAATGACGAA 1020
AGAAGCGTGA AACGTTGTGA CTAAATCAGT GGCTATTATA GGAGCGGGGA TAACAGGTTT 1080
ATCAAGTGCA TATTTTTTAA AACAGCAAGA TCCTAATATT GATGTAACCA TCTTTGAAG 1139



(2) INFORMATION FOR SEQ ID NO: 803: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 989 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 803:
TATCCriAAAT AnAAATGTGC GCATTTATCA AAACCTTACA TACAATAGAG TTTTCCCTAA 60
CAGTAAATTA GATATTATTA CACCTGTTGA TATGTCTTCT AATGCCAAAC TGCCAGTTAT 120
TTTTTGGATG CACGGTGGTG GTTATATTGC GGGTGATAAG CAGTATAAAA ACCCATTATT 180
AGCGAAAATT GCTGAACAAG GGTACATTGT TGTGAATGTA AATTATGCAT TGGcGCCACA 240
ATATAAATAT CCCACACCAT TAATTCAAAT GAATCAAGcA ACTCAATTCA TTAAAGAAAA 300
TAAAATGAAT TTACCTATTG ATTTTAATCA AGTAATTATT GGGGGTGAyT CTGCAGGTGC 360
TCAATTAGCT AGCCAATTTA CGGCAATACA GACGAATGAT CGCTTAAGAG AAGCCATGAA 420
ATTTGATCAG TCATTCAAAC CATCGCAAAT TAAAGGTGCT ATACTATTTG GTGGTTTTTA 480
TAATATGCAA ACAGTTAGAG AAACTGAGTT TCCAAGAATA CAGTTATTTA TGAAAAGTTA 540
TACTGGCGAA GAAGATTGGG AAAAGAGTTT TAAAAACATT TCACAAATGT CGACAGTAAA 600
ACAATCGACA AAAAATTATC CaCCAACATT TTTATCTGTT GGAGATAGCG ATCCATTCGA 660
AAGTCAAAAT ATAGAATTCA GTAAGAAATT ACAAGAATTG AATGTACCAG TAGATACTTT 720
GTTTTATGAT GGTACGCATC ATTTACATCA TCAGTATCAA TTTCACCTTA ATAAACCTGA 780
ATCGATAGAT AATATCAAAA AAGTGTTACT TTTCTTAAGT CGTAATACAT CCTCTAGTGG 840
TATTCAAACT GAAGAGAAAC CACAAATAGA AAATCCGAGT AATGAATTAC CGTTAAATCC 900
TTTAAACTAA TGATAAACAG TAGTAATTTA TTACTTAAGC AACATTTAAG ATTTTCAAAT 960
TAAAAACGAG AATTTAAAAC ATGTGGTGC 989
(2) INFORMATION FOR SEQ ID NO: 804:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 711 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 804:
TAAnCCCTGG TTTAATGATT TTGATTACGT GTTTTTATAA TAAAAACATA TCGAACATTG 60
ACTACGTTAT TAAGCTGCTT TTTTGTACAC TTTATAACCA ATAGCTTAAG ATTTAAAACT 120
AATCGGAAAG AACAATGATT CACCAnAAAA ATATTTATGT TGCTATTAAA AATCAGTTAA 180
TACGAATGTT AAAATACGTT TGATTTTCAT TAATAATGAT TCAAGTTTAT TTAAATGAGC 240
GTTAATGTCA GTCTGTTTTG ATGCACCTTA TAATAAAGAC AGATAGTTCA AATTACGTAA 300
AAAGCAACAT TAGCATTAGG AATATTAACT ACAGGTGTGT TTACAGCAGA AAGTCAAACT 420
GGTCACGCGA AAGTAGAACT TGATGAGACA CAACGCAAAT ATTATATCAA TATGCTACAT 480
CAATACTATT CTGAAGAAAG TTTTGAACCA ACAAACATTA GTGTTAAAAG TGAAGATTAC 540
TATGGCTCTA ACGTTTTAAA CTTTAAACAA CGAAATAAAG CTTTTAAAGT ATTTTTACTT 600
GGTGACGATA AAAATAAATA TAAAGAAAAA ACACATGGCC TTGATGTCTT TGCAGTACCT 660
GAATTAATAG ATATAAAAGG TGGCATATAT AGCGTTGGCG GTATAACAAA G 711
(2) INFORMATION FOR SEQ ID NO: 805:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 680 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 805:
ATCAGGAAAA ACAGaATAAT GTAAATCAAG CTGTTCAGCC TCAAAATAAT ACTAATGAAA 60
CATCAAAAGT ACCGGCTAAT TTTGTCAAAT TGAATGATAT TAAACCAGGT GATACTTCTA 120
TACAAGGAAC AACTTTACCA AATCAATTTA TACTATTAAC TATTGATAAA AAAGATGTGA 180
GCTCAGTTGA AGATTCTGAC AGCAGCTTTG TTATGTCTGA TAAAGATGGG AATTTTAAGT 240
ATGACTTAAA TGGTCGCAAA ATTGTTCATA ATCAAGAAAT TGAAGTGTCT TCATCAGATC 300
CCTATTTAGG TGACGATGAA GAAGATGAAG AAGTAGAAGA AACTTCAACT GAAGAAGTTG 360
GTGCTGAGGA AGAAAGTACA GAAGCTAAAG CTACATATAC AACACCGCGA TATGAAAAAG 420
CGTATGAAAT ACCGAAAGAA CAGCTAAAAG AAAAAGATGG ACATCACCAA GTTTTTATCG 480
AACCTATTAC TGAAGGTTCA GGTATTATTA AAGGCCATAC CTCTGTAAAA GGTAAAGTTG 540
CTCTATCTAT TAATaATAAA TTwATTAACT TTGAGaCAAA TGCTAtGGTG GtCCaAATaA 600
AGAAGaAGCG AAATCTGGAT CAGAAGGAAT CTGGATGCCT ATTGATGACC AAGGATACTT 660
TAATTTTGAC TTCCAAACGA 680



(2) INFORMATION FOR SEQ ID NO: 806:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 950 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

GTATTTCATT AGCCATTGGC AATTGACATC TGACAATGAG GAATGGCTTT TGCATTGGAA 60
TTCACAGTTG GAGTGCATTT ACTGGTTATT TTGGAGGTCA TTGTTGATAC ATTATTAATG 120
GAGAATAACC GAATTTGTTA TGTTATTTCC ATTTTTAATA TTTGCAATTG TATTAAATGC 180
TGCACTTGGA GATAAAATTA AAAATCCTTA TGGATCTGCC ATAATTCTTG TTCTAGTTAT 240
TATCGTATTA AGTTGGGGAG GTATTGACAA GACTTGTTCG TGGTAAAGTA CTTCAAGAAA 300
AAGAAAATGA ATACTTTTTG GCAGCAAAAT CAATTGGTAC ACCCACATAT AAAATTATTT 360
TGAAACATCT TTTGCCGAAT ATATTAAGTG TAGTTATCGT ACAAGCAACA TTGTTATTTG 420
CCGGTATGAT TGTAGTGGAA TCAGGrTTGA GCTTTTTAGG ATTCGGAATT AGTAAAGCAA 480
TACCATCTTG GGGTAATATG TTGAGTGATG CTCAAGAAGG GGATGTTATA AGTGGTAAAC 540
CGTGGATATG GATGCCACCT GCTATAATGA TTACATTAAC TATATTAAGT ATAAACTTTG 600
TAGGGGAAGG GATTAAAGAT GCTTTTAATC CTAGAGGTAG ACGTTAAATA ATAAAAGAGG 660
CACTAGTTAA TTCTAGTACC TCTTTATTTT ATCTCTTACG TCCTAAACCC ATCGCTTTTT 720
CCATTTTTTT GACAGTTTTA AATGAAACTT TGTGTGCTyT ATCTCTACCT TGATCTAAAA 780
TATCAyCAAG TTTATcTGAG TTATAGAAAC TTTCGTATTT TTCTTGGAAT TCTACTAAAA 840
ATGCTTTAAC TATTTCAGCA AGGTCACCTT TAAATTTACC ATAACCTTCG CCCTCATATT 900
TTGCCTCAAT ATCTTTAATT GGCATGTCTG TTAATCCAGC GTATATTGAA 950



(2) INFORMATION FOR SEQ ID NO: 807:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 418 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 807:
GTTACCAGTC GAAGCTTTAC GACCTGTTAC GAAACATTGT TTACCCATAT GAGTACCTCC 60
TTTAATAAAT ATAAATACAC ATAACTACTT ATATACTTAA TTAAGATAGC ATAGTTTCAT 120
TTGAAAAACA ATGAATAATT TTCACATAAA AGTCAAAAAT ACTGAGTTTT GTGATATAAT 180
TGTAGACTGT GAAGTTATGT AGTATGATAT TTTAGAGAAA TAAGTGAATG AAACATTTAA 240
AATTTTTATT TrATGATACT ACATCTATTA AAAGAAAACG CCTATAGATA ATCGTCaTGA 300
GTGTAAATTC AGTTtACTCa AcAtATAaTA GGkGaTTTAA gCgTAAGtCa TTCTGATAAG 360
TTGTACCATT GATATTTTAT GAAATAnTCn ATGAACTTAA AGCGTTTATG CTACACTA 418

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 842 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 808:
AATGGCnATT AATCTTTAAT ACGATGCTTG AGGATTTTTC CTAATAAAAC CTTGATTTCm 60
AAAAGGGTTT AAATCinAATG AAACAATAAT AAaAAAATGw CGCAATATAA TAATAAGTAC 120
AAATTTAATT AAGAAATTAA ATTGATTGTA TATGTATATT TTGGTAACGT AAAAGAGAnA 180
TATACAAAAT AATTAATTAT TTATATGAAA AGAGAATATA AATGAAGTAT AAAACAGAGA 240
GACGTGAAGC GATGGGATAT TTAAmAAGGT TTGCATTGTA CATAAGCGTT ATGAyTTTAA 300
TATTTGCGAT AGCAGGTTGT GGCAAAGGTA ATGAAACAAA AGAAGATTCA AAGGAAGAAC 360
AAATCAAAAA GAGCTTTGCG AAAACATTAG ATATGTATCC AATTAAGAAT CTCGAGGACT 420
TATACGACAA AGAAGGATAC CGAGATGGCG AATTTAAAAA GGGTGATAAA GGGATGTGGA 480
CGATATATAC AGATTTCGCC AAAAGTAATA AACAAGGTGG ATTGAGTAAT GAAGGTATGG 540
TCTTATACTT AGATAGAAAT ACACGGACTG CAAAGGGACA TTATTTTGTT AAGACATTCT 600
ATAATAAGGG CAAATTCCCA GATAGAAAAA ATTATAAAGT TGAAATGAAA AATAATAAAA 660
TTATCTTATT AGATAAAGTA GAAGATACAA ATCTAAAAAA GAGAATAGAA AACTTTAAAT 720
TTTTTGGACA ATATGCAAAC CTTAAAGAAT TGAAAAACTA CAACAATGGT GaTGTCyCAA 780
TTAATGAGAA TGTTCCAAGT TATGACGCAA AATTTAAAAT GAGCAATAAA GATGAAAATG 840
TT 842



(2) INFORMATION FOR SEQ ID NO: 809: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 451 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 809: 

TAAGACTATG CCATCTTGGC AACGCGTTGT CGCATATTCA TTATTAGTAA GTATATGCAA 60
GCATGATTTT GCCATAACtT TACCtCtTTC TATATTTAAG TACCACTTTT ATCATTCCCT 120
ATAATATTTA ACTTATTTAA TTAAACGTAA ATTTAACACA ATACGAATAC CAAATCAAAA 180
AAAAATAGAT GTAGTCAGTT TAATTAAACT ATCCAATTGA AAACACACTA CTTTTTAGTA 300
TTTTCAAAAT AATTTTAAAT GACCACATCT ACAACGTATT ACTATTATCT TTTGTTAGTT 360
ATATATTCAT TTTCAATTTA rAAATAAACA CTTAtTTAAA TCACATTCTA CTTTTGAGAT 420
GAGCTCAAAC TAACATTAAA TTGTTTATAT T 451



(2) INFORMATION FOR SEQ ID NO: 810: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1499 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 810:
ACATTTTGTG CTTGTTGTCC AATCCAATTT CTACTTTGTT GAACAGTATG AGTTGCTTGA 60
CCACGCATGT TACTCAGCAT TTGTCCAACA TGAGTATTTT GAGCAAAGTT ACGAGATTGA 120
TTCAACACAC TGTTTTTCAT CATTnCCATT TTAGGTGAAA TACGACTAAC TGCTTGTTGC 180
ATTGTCTTTT GCAATCCTGT TTTAACTGCA AATTTTTCTC CGCCTTTGAA GCCTAATTTC 240
ATTAAACTTT TCCCAGCACC TTTGAGGAAG CCCATACCTG GCAATGGAAT AAGCGAAAGT 300
CCTTCCATGA TTCGTTCTTC TTTAGATAGC TTTCTTCCAG TTACAATATT TTTTCCTGTG 360
GCTGCATTTG CTGCTGAATA AGCGGAATAA GCACCAATAG CTACTGCTCC AGCAACAGGA 420
TTAATACAAG ATAAAACTAC CATACCAGCT AGTGCTGCAA TCTCTATAGC TTCTTCTTTT 480
TTCTTTTGCT CCTCTAGcTC TTCCGCTTCA GCTACTGTCA TGTAATCACA AGAAGCGCCC 540
TGAGTCATCA TTTTTTCAAA ATCACTTTTA GATAATTTCT TATCTTCTTT TTTCATTTCT 600
TTGTAATGCT GATCAATTAC TTTTACAAAT TCTTGATCCG ATTTTAATAT TTCATCGATG 660
CTATAAACAT CGCTGTCACT TTTCTTGGCA TCTCCCTTTG CAATTTTCCC AGAAGCCCCA 720
TAGGTATCTC TACCATGTCC ACCACCATTT TTAATTAGTA TATTTTCCAA TTTAACATTC 780
GTTATACCTT TGAATGCTTT ATCTATATCT TCTATAAATG GCGTGTCAAT AATTTCTTTC 840
ATTTTCGAAT CTAATTGAGA TGCATAATCA TATAACTCaT TTTGATAATA CCCCATTAAT 900
TTAGATCCTT GTGGAcCTAC TACACTACTT GTTAATGATA TTGCGCCATC TTCGTCTAAA 960
GAACTGACAT CGCTTACCGC ATCATCCATT GTTGTATCTA CATCTAAGAA GTCTTGACCC 1020
TcTATTGCAT CTATTACACT TTGGATAGAT GACTTTTCAT TGTCATAATC AGCTGTTAGA 1080
TATTCAATAT CTTTTGTCAT GTCATGCACC TATCCCTCAA TATTATAGTA AAGCTTGGCA 1140
CCACCGAGTG TACTCAATGG ACCTTTGAGA CCACTAATGA CTGTTGCAGC ACTACTATAG 1260
CCATTTTCCA TAGAATGTGC AGCTTTTTCA CTACTGTAGT AATTTAATTG CGCTATTTTT 1320
CCAAAGTTTT GAGCTATTTC ATCATACTTT TGACTTATGC TTTCCAATTC TTTTACAACA 1380
TGTGCAATCG TTTCAGCTTT AACACTAATT TTTCCACTCA ACGTCATTAC TCCTCtGcTt 1440
TATtAATATG ATTTTcATCA GTATCGAATC CAAATAATTC TCGTGATAAA nGTATTGAA 1499
(2) INFORMATION FOR SEQ ID NO: 811: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 620 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 811:
CnCGCTGCAC CGAAGAATGC TTTTGGTTTG TGTAAAGATG CAGGATCTAA ACCACCTGAT 60
AATGTACGAC CACTTGGTGG AATAACyAAG TTATAAGCGC GTGCTAATCT CGTTATAGAA 120
TCCATTAAAA TAATGACATC TTCCCCAATT TCTACTAAAC GCTTTGCACG TTCAAGTAAT 180
AATTCAGCTA CTTTAACATG GTGTTCTGGT GGTTCGTCAA ACGTTGAATG AACGACTTCA 240
GCAGCTTCTA CTGAGCGTTC TAAATCTGTT ACCTCTTCAG GACGCTCGCC AACTAACAAA 300
ATAAATAGCT TTGCATCTGG TTTGTTCGTA CTGATTGCAT TCGCTATTTC TTTTAATAAC 360
GATGTTTTAC CTGC7TTAGG TGGCGCCACT ATTAAACCAC GTTGACCTAA ACCAATCGGT 420
GTTACTAAAT CCATGATGCG CGTTGAATAA TTTTGTATTT CTGTCTCTAA TTTAATACGC 480
TCATCTGGAT AAAGTGGTGT CAAAGCTTGG AAATGCGGAC GTTTCTTCAC TTCTTCTGCG 540
TTATGGTCAT GACAAAGTCA ACTTGTAATA AGCCATAATA TTTTTCGTTA TCTTTAGGTT 600
TTCTAACTTT CCCAGTTACT 620

(2) INFORMATION FOR SEQ ID NO: 812: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1094 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 812:
CCGGCTGAAA TTAAGTTCTT GTTATATTTC GCAAAAATGA CAAAGAAAGA TAAAAATAAA 60
TGAACTATTT GCAATTAGAG AATTGTTAGA GAAAGGTTTG ATTGGTGATT GTATTCAACC 180
TATAATTGAA CCAATTAAAT ATACAACCAC ATTTAAAAAT ACTTTGCAAT ACTGTGGTGA 240
AAAAGCATTC TCTATAAATT TAGTAGTAAA TTCAAAGTTA ACTGAAGAAG AGATTAGTAA 300
CGAAaCTGTt GCACATTTAA CTGAAATAAT AACAAAAAAC AAAAGTGTTA TTCAAAAAGC 360
TTACTTGGGT CCTTCTGATG AAGGCAATGA TAGGTTGAAA CAGCAATTTT CAAGTAATAG 420
TTTAGCTATT TTAACAAGTG TAGATGATTG GGAAATGTTT GGAGATAAAA ATAAACTTGA 480
AATGGTTTTT GTACCAGATG ATAGACACAT TAAACGTAAA TTGCGTAATA TTCCAAACAA 540
AGGCATsATT ATGGATCCTT TTAATAAACT AAGTCGTAAT GTTGATTATT TAGATAATGA 600
TGACGAGTTT TATAGCGACG ATCACCTTTA TTATAAGGAA GATGGATACG TAGCATTTTC 660
AGACTATTCT GTTATAGGTG GAGAATATGT AGACGGTGGC TTTTCGCCAT TAGCtATTGc 720
GrTACATATT GTCTATTTTG ATGAGGCTAA TGAGCTAAGA GTTAAGCATT TtGTCTCTGa 780
TTCTAATAAT GATAGATCAA ATCCAGGTAA AAnGTTTTTT GAGGCTGTAG ATAAATTAGT 840
AACATGGTCA AAAAACTTAG ATATTAAAAA TAGATCTTAT GCGCTTGGAC AATTTGAAGA 900
ATTAAATGAA AATAATAAGT ATCCAGGATT AGGTTTAATT AAAnGTTATC TATCAGCATC 960
ACCTAGAAAT TATGAATAGA TACTGGGTCT CAAAGAAATA GGAAAAGTTT ATATCGAACT 1020
GAACTGCAAG AATGCATTGA CATGGATGAA CACACGGGTA AAATGATCGC AGCTGAAATG 1080
ACTATGGTAT TAGC 1094
(2) INFORMATION FOR SEQ ID NO: 813: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 930 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 813:

CACCTATCAT CGTCTCTTTA ATTCTATCTA TCTATTTACC CCAATCGTAA ATATGATGGA 60
CGTTATAGAA TACCAAGAGT TGCAGGTATn TCTATTATTT ATCTAGCTGT AGTAGGTGTT 120
ATTACGTTAA TTGTTAATTT ATTGATACCT ATTATTGGTT CGCAAGTAGA TAGTTTAGTT 180
AAAAATTCAC CGCAATATCT AGAAAAATTA ATTAATTCTA TTGATAAAAT AGCAAATAAT 240
ACGTTTTTCT CTTCGTATTA TAGTCAAATT AATGATTGGT TAAATTCTTT ACCTAAGAAA 300
ATACCATCTA TGTTAAGTGA ATTTACAGAT GGCTTTGGGT CTAAAATTGC AACGTTTGCA 360
CTTAAAGATG GACATCACTT CAAAGAATTT TCAACGAATA TTATGCCACC GAAATTCCGA

AAAGATTTTC ATGATCTACT TGAAAAAATG AGTGTTCAAG TTGGTTCATA CATTCAAGGA 

CAAATTATCG TTTCATTCTG TATCGGTATA CTGTTGTTTA TCGGTTATTC GGTTATCGGG 

TTGAAATATA GCTTAGTATT AGCTAGTATT GCGGCAGTTA CAAGTGTTGT ACCATATTTA 

GGGCCTACTA TAGCGATTTC TcCAGCTATT GTAATnGCTG CTATAACAyC GCCGTGGATG 

CTCTTAAAAT TAGCAGTAGT ATGGACTTTA GTACAATTTG TTGAAGGGCA CTTCATTTCA 

CCAAATATCA TGGGTAAAAC ACTTAAGATT CATCCACTTA CAATCATTTT CATTTTACTG 

TGTGCAGGCA AATTGCTTGG TATTGTAGGC GTTATTTTAG GTATTCCGGG ATATGCTATT 

TTAAAAGTAT TtAGTTACTC ATTTATTCCA 

(2) INFORMATION FOR SEQ ID NO: 814: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 461 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 814: 
GTATGATTGC TGTTTTAATA CCAGATGATG GCAGTGGCAA ATCTTATGAC TATATGCTTG 
TGAACCCAAA AATTGTAAGT CATAGCGTTC AAGAAGCTTA TTTACCAACT GGTGAAGGTT 
GCCTTAGTGT CGATGATAAT GTTGCTGGTC TAGTTCACCG TCATAATAGA ATTACAATTA 
AAGCCAAAGA CATCGAAGGT AATGATATAC AATTACGACT AAmAGGATAT CCAGCAATTG 
TTTTCCAACA TGAAATTGAC CATTTAAATG GTGTAATGTT CTATGATCAC ATTGACAAAG 
ATCACCCATT ACAACCACAT ACAGATGCAG TAGAAGTTaA aACACATTTT cTAATTATCA 
AAgCTTAGGA TAAtATGATG tCCTAAGCTT TcCTTTACAA CTTTTCGGAT AACCAACAGT 
TAATATATCA CCTTCTAACC AAACTTTTAA TCCCTCATTA A 
(2) INFORMATION FOR SEQ ID NO: 815: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 573 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 815:
AACGATATAA TAGAATTGAA TAAAGTGGGT GATAATGTGA CTAAAAATGA GATTAGAAAA 120
TACATTTTAC ATAAAATGAA GAATTTTAAT AAAGCTGAAA AGCGAAAAGC AGACACATGG 180
TTAAGAAATC AATTTTTTGC AACTGAAGAA TACAAAGAAG CAAACGCAAT TGCGCTAGTT 240
CTTTCTTTTA ATCATGAAGT AGATACTTTT TCTATTATTG AACAAGCCTT AATGGATCAT 300
AAACGTMTT TTGTACCGAA AATGGATTAT TTAAATCATC AAATGACTTT TAAAGAGATA 360
TTTAATCTCA AAGATATTGA TGTCGATAAT AAGGGGATTT ACTATCCAAC TTCAAAAGGT 420
GAAACAACGA ATAACCTAGA TTTAATTGTT GTTCCTGGTG TTGGATTTCA AGACGATGGA 480
TATAGAATTG GGTATGGTGG TGGCTATTAC GACAGGTTTT TAGCTAATTA TCAGACAAAG 540
ACAATAAGCT TATTATACGA TTTTCAAATA ACA 573
(2) INFORMATION FOR SEQ ID NO: 816: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 3989 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 816:
TAGTTTAACA ATGTCTATTC TCATAGAATT TGTCCAACTA AATGTAGACG ATTCACTAAA 60
CTTCTAAAAA AATAAACCCC AAATTATAAT AGTCTCAATA TTAATATATT ACAATTTATT 120
CATAGGATTT TTATCATTTC AATTAATAGG TTGTTATCAG GGTTAAATAA ACATTTTTTG 180
TAATAGTCCT TTTTTACGTT CTTTTAATAA CTCAATTCTA TTCATTTGAT TATTCATTTT 240
ATTGTCTATA GATTTTAACA ATGCACTTAC TTTATCyTGT TCAGTGAGAC AAGGTATTTT 300
AAGATTTATA TTGCATAAGT TCGCATGAGT TAATTTAGCT GGTGCATTAC CTGTGACAAA 360
TGCTCGTAGT TCTTTAAAAT TTAAATAATA ATTCATAAAA AACAAATTAT GATCATTACT 420
TTTAACTACA TGCGCATGAT TATTTACCCA GTATTGCCCA TTAGCAATAA AGCTACTCGT 480
CTCAAACTGC CCCCATTTTG CACCATCTTC TCCTATTAGT AATCGTTCTT CATTATTGAA 540
TAAATAATCT TTTACGTAAT CAATAATTCC AGTTGCACCA TAGTAAGGGT ATAACCCCTT 600
TTCTCTTAAT GAAGAAGTAA TTGGTTTTCT TCTATTATTT TCAAAGATAA AGATGTCTTT 660
TATGAATTTG TTTTCCCACT CTGGATATTC TTCACCATTC TCATCTTTGA ATCGCAATTC 720
TTGTGTGAAG ATTTTCTGCA TATAGCCTTT TTTCTGTTGT TGAAGCAATT CAAGTTTTTG 780
TTCTTCTAAT TCAATTTGTC GGTCGAGTTT GCTGAAGAAC TTGCCTATTT TTTGCTGTTC 840

TCGACTACCT CCACTTTGTG CAAGGAAAAT TTTCCTTTTA CCTTTTCTTG ATAATAGATA 960
CTGTCCAAAA AAATTATAAT AATACTCTTT TTTCAATCTA ATAATACATA CATGTTGATT 1020
TAAATTAGCA TGCGTTTCAA CTATCGAATT AATGGCTGTT CTACCTATTG ATGCTCCTGT 1080
AATATTTAAA AGAACATCAC CATAGTACGT TCTACTATTT TTCATCTCAT CATCTATATC 1140
TTTACTAATA TAAACTAAGT CATTAAGATT TAATTTACCA TTTCTAATAT TTTGACTCCT 1200
TAAAAATGGT ATGCCTTTGT TTGTATAGTT TTCACTTCCA CCTTTGGGAG TCTTTCCACT 1260
ACCTATTTTG GTAGTAAGAT TCCCTAACTT CTTCTCTTCC CATTCGCCTT CAAATCCTGG 1320
GAATCTCAAC TCTGGCACAT TTTTCGTTTG TGTATTACTC ATCTTTCAAC ACCCCAAGTT 1380
CTTTCAGGTA TGCATTGATT TCTTGTTCAA TTTCTGCGAT TTCTTTGTCG ATATTTTTCA 1440
AATCTTGTTG GACTTGATCT AAATCAATTG GTGCTTCTTC TTCGAATGTA TCAACATATC 1500
GCGGTATGTT TAAGTTGTAA TCGTTATCGG CGATCTCTTG TAATGTCGCG CTGTAGCTAT 1560
ATTTATCAAT CGTTGCTTTA CGCTTATATG TGTCTATAAT ACGTTCGACT TGGGCATCGC 1620
TTAAATGGTT TTGATTTTTT CCTTTTTCAA AATCATTGGA TGCATCGATA AATAGTACGT 1680
TGTCGTCTTG TTGGCGACAT TTTTTAAATA CTAAAATACA TGTTGGAATA CTTGTCCCAT 1740
AGAAAATATT GGCTGGTAAC CCAATCACGG CTTCTAAGTA GTTCTTTTCT TCTATTAAAT 1800
AGCGACGAAT CACACCTTCT GCGGCACCAC GGAATAATAC ACCATGTGGG aGTACGACTG 1860
CCATGGTACC TTCATCGTCT AGGTAATGTA CCATGTGTTG AATAAAGGCA AAGTCTGCTT 1920
TGGaTTTTGG CGCAAgCTTG CCGTAACCAC TGAATCGTTC ATCATTTTCA AATTTTGAAT 1980
CTGCTGTCCA TTTCGCACTG TATGGTGGGT TCGCAATAAC CGCATCAAAT GTATTGCCTA 2040
AAAAGGCTGG ATTTTCCAAT GTGTCATCAT TACGGATCTC GAAGTTCTCA TAACGCACAT 2100
CATGTAATAA CATATTCATG CGTGCTAAGT TGTATGTAGT ATTGTTACGT TCTTGTCCGA 2160
AATAACGATA CACTTGCGTT TCTTTACCAA CACGTAACAA CAATGAACCG GAACCACATG 2220
TTGGGTCGTA CACGTGACGT AATTTATCTT TACCGTCTGT GACAATCTTC GCCAGTATCT 2280
TAGATACTTG TTGTGGTGTA TAGAACTCGC CTGCTTTTTT ACCCGCTGTC GCCGCAAAGC 2340
GCCCGATTAG GAATTCATAT GCATCACCTA ACATATCAAT TTCCATGTCA CTGTGAACGA 2400
ATGGTAAGTC GTCAAGATTA ACCATGACTT TAGAGATTAA AGCAGTACGT TCTTTGACAT 2460
TGTTACCTAG TCGCGTTGAA CTCAAATCCA TATCGCTGAA CAGCCCGATA AAGTCATTTT 2520
CACTTTCTTC ACCTAATGTA GATGTTTGAA CTTTGCGAAT CGCCGTCGCT AGATGTTCTA 2580
TATCGAAATC TTGCGTTTCA ATTTCACGAA TCATCGCACT GAATAAATCT TGTGGCTCAA 2640
CCCATGCTTC TTGATACGTG ATGTCTTCAC CTGACAAGGC ATCTGCATAT TCTTGTTCGG 2760
CTTTTTCAGA TAAGAAGCGA TAGAAAATCA AGCCTAAAAT GTAATTACGG AATTCACTCG 2820
CATCCATGTT CCCTCTTAAA TCATTCGCAA TCGACCATAA TTTTTTATGT AATTCAGCTT 2880
GTTGCTGACG TTGTTTTTCA GTAATAGACA TGTGATTCCT CCGCCTTTGC ATAAGTAATT 2940
TATCTCTTTG TGTAATAGAT TTATTATAAC ATTTGGTTAT GTTGCGATGT TGATAATTTG 3000
GATGTTGGTG GTGGAAATTT TGAGTTTTAG TGGCGCAATT GTTATTGAAA AATTTTATAG 3060
AAATGTTGTA GCTTTCAAAT GCTTTCAAAA TCATTTATAT TCTTAATGAT GTCAAAAAGT 3120
TGTTCATTCA TACATAAATA AAACCAATCA ACAATTGAGT TGGTGAAAAT CAATCGTTGA 3180
TTGGCTTTGA TGCATATTAA ATAATGCAAT ATATATTAAT AAATGTTAGT TATAGTATAT 3240
TTTGTCAGGA TTGGGTGAAT GTCTAAGTTT TAATTATTTA TCTAAATTAT CTGCAATGAA 3300
TTTCTTAATT TCAGGAGAGA AATAAACAGC AAATCCTCTT GTGCTTTCAC CTGATGGCTT 3360
ATTACCGGCA TAGATTACAC CAATAGCTTC GTGTTTACTA TTTAATATAG GTGAACCAGA 3420
GCTACCAGGC TGAATAATTG CATCCGATGA CACTATATTC CCATTCACTG ATAATACTTT 3480
ACCAGTTGAT TCATACATTT GTAGTTTATT TCCATTAGGA TTTGGATAAC CAATGACTGA 3540
TATAGGTTCA TTTTCTTTAG CTTCTGATGC TATATTAAAT TTACTAGTGA AATCTTTGAA 3600
TTTTCTACCT TTTGGTTGTG TTGATTTTTC TTCAACTTGT ACAACCGCAA TATCTTCTTT 3660
ACCAGGATAA TCTACAATCT TAGTAACTTT ATAAAGTCCA CCACCGTTAT TATAAAAACC 3720
ATTAGGATGT GCTTTGATTT CATCACCGAC TTTCATGTGA TAGGTAACAT GTTTATTGGT 3780
AATGATTGTA TGATTTCCAA CTACAAATCC TGTTCCAGCG CCCATCCATG TAACACCACT 3840
GTATGGTGCA ACATTTGTAT TTGTAATTTG TTTAACAGTA TTTTCGGCTT TGGCTGTTTG 3900
TTGAATACCT TCAACCATTG TTGTGCCGAC ACCAGTTATT GATGTTAAAA TCGTCAATGC 3960
TGCAATACTT TTGATGATTA TATTTTTAT 3989
(2) INFORMATION FOR SEQ ID NO: 817:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 904 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 817:
CGTTGAGCAC AGTTTTAGAT AATAGATAAT CTTGCtCTAG TTGTTGCAAT GTCTGCGATT 60
TATGCGTGTA ACGTTCAGTT ATCGTTTCTA AGGTATACTG AGATGTATTT AATTGATATC
CTTTAGATTT AATACTTACG ATGATATCAT CCATAAAATT GCTATTGATA ACATGGATAT 
CATTGCGAAC TGTGCGGTTT GAAACATTGA CATGTTTAGC AATTTCATTA GAACTAATGT 
GCTTTGATGG ATTTTTAATA AAAAACTGGA GTAGTTTTAA GTGTCTATCA AGCATTTAAC 
ATGTACCTCC TTTCTAAATT TTTCGTGTAA GCGTTTTTAA GGGTGTATTA ATATTATTAA 
ACATGAGAGC TTATACATAC GTCAATGACA TTAAAGCGAA CTTTTATATG ATTTTAACAG 
AGTGCGAATT ATGCAAATAA AGAACAGCAG TAAGATATTT CAAATAGAAA AATATCTCAC 
TGCTGTTTTT CTGAATTTAT GCATCTGGTA CTTGTGGACG TATCAGGCAA ATGATTAATT 
TTTAGGTGAT TGTGCTTGAG GTGTTTGTTT AGAAGGCGTA TTGGTATTAT TTGATTTATT 
AGCAGGTGGT GTGTTCTGTT GTTGATTATT TTGGTTGTTT GTTGACGATT GACTGCCACC 
ACCGTGAGTA TTATTTTTAT TTGAATTTGA GTCTTTTGGC TGAGCTGGTT GCGTATTCGA
TGGTACTGTT GAATGGTTGT TAGGAACATT TGACGGTGAA GATGGTTGAT TGTCATTAGT 
AACGTTATTG CCATTATTTT GATTtTTATC TGTTTTGTCT GCACTATCAT CTTGTTGATC 
ATTT

(2) INFORMATION FOR SEQ ID NO: 818:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1104 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 818:
TTTAACAtGa TAATAAAAAA TCTTTTGTTA TATCATTAGG AATATTTGAT GAACTTGATA 
TTAAGGTTAC ATTTTGAGAA ATGGACTTAG GGATATTTCC ACTATTATTT AGTAAGAAAT 
CTTTAGCTAA TAAAGATTTT CCTACACCAT TTTTACCAAC AATGTGATTG ATCTGACCAA 
GATAGAAATT TAAATCACAA TTGTCAACTA GTTGTTTGTT TTTAACTTTT AAAGAATAGT 
TATTTAGTTT CATGTATACA ACTCCTATGT ATAAAGGGAT TTATTACACC GATATTTAAT 
TGTATTTTTA AAAATTCtTT CACATTATGT ATAGAwGTTA TAAATTAGTA TATCaCACTA
TATTTtGkCT AAATGATAAA TATATCGTTA TATTTTTACA ATATTCTGAA ATTTATGTTC 
GCCTCTGAAT GCTATATCCA GTGTAATGTG TTTTGCATAT ATGAAAGCAA TTTCAAAATG 
TGAATATAGG TTCATTGTGG TATGACAAAC TTCATTGCTT GTCATGAGAT GGATATAATG
GACGGGACGT TTTTAGATTC AAAAAAGACA TATGATAAAC TTAGATTTGA AGCGATTTTT 660

ACTGAACTTA GAAATAGAGA TATTACATTT ATTGCTGCGA GTGGCAATCA ATATGCGAAG 720 

TTGAAGTCTA TTTTCGGGGA TAGAGATATG TATTTTATTT CTGAAAATGG TGCAGTTATT 780 

TATAATGGCA ATGAGTTATA TAATTATAAA AGCTTTAATC GTCAGGTGTT TCAACAGGTT 840 

GTCGATTACT TAAATATGAA GCAAAGTATT GATCAACTCG TCATCTGTGG TTTGAAAAGC 900 

GCGTATATTT TAAAACATAC TTCTGAAGCG TTTAAAGAAG ATACGAGATT TTATTATCAT 960 

CAGTTAAAAG AAATTGACAG TCTACAGCAA TTACCTGAGG ATGATTATGT CAAAATAGCA 1020 

TTTAATATTA ATCGTGAGAC GCATCCGAAT GTTGACGAAG AAGTAGCAAC GCAATTCAGC 1080 

AATGATATTA AACTTGTCTC AAGT 1104 
(2) INFORMATION FOR SEQ ID NO: 819: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1425 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 819:

CCCnTTTTAC GGATTAAnGG CTTTTTCCTA TTTAAAACCT ACGGCATTTT CTTTTCAACC 60 

ACGGCGGCTC CCATATGGAT GGTATnGGGA TTGGGGTTTA TAAAATGGGG ATTGGAAATG 120 

GTCCATCCTA AATACTCAGT TTAGTGCTTA TTTCCTTTAG TGCTGACGAA TAAATATGAT 180 

CTAACTCTAC GAAACCTAAT AACGTATCAA ATGCTTCATC ACCATTTTCA AATTTACTTA 240 

AACTATTTTT AAAATCATGT CGCAAATCCT CTAAATATAG TGATGGTTTT CGATCTACAT 300 

ACATTTTTAT ACTAACCTCC GATATATAAT CATCTTTATT GTACCTAACA TTTTATTAAG 360 

ATAATACTAA TACACTATCG AACTTTGGGC TGATACTGAA ACAGCACAAA GAAACTCAAA 420

CAATTCGAAT TATGTATCAA AACCTTCAAT TACAGGAAGT GATTTTAAAA TGATTTAACG 480 

CAAAAAAACA CCTGTTACCG TTATATAGGT ACAAGTGCTT AATTGATAGA GATGTTATAC 540 

GTCTTTAAAT GATTCCACAA CTTTTGGATG TGGACCATCC ATAAGCGGTT CTCTTTGTCG 600

AACGCCACCA CTTTGGTTGC CAATTGATTC GCTATCAAAG TACGATTTAT CTTGATTTGA 660 

TTGTTCTTGA ATATGTTCTT CATTATCAGT TGTTGCATAT TCACTATCCA CCTCTGTTTT 720 

TTCCATTGTT GCTGTATGGA ATTGTACGAA GTTCTCTTCT TCTAATGCTT TGATTTCTTC 780 

TTTCGATAAC GCTCGATACC AATCTTTCGC TTTTTTCGCC GCAaTAGGAa CAaCAtCTTT 840 

TACTTCAAAT TCTTTCCATT TTTCGGGGTA ACCTTTCATT GTAAAGGGCA TTCCCTTAAC
CTCCAATATG TTATAATTCA TGTTATATAC CACCATAATT ATAAAATGAA ACATATTAAA 
CACAATTTTT TAGACTCCCG TCATAATAAC TTCAGACAAA CGTTAGTCTG ACCTGATAAA 
TATTTAATTT TAAACGTTCT TGTCTCTCTC ACGAAAACCA TTGTGTTGAA TATTCATCAT 
TGCTGATTGA ATATAATTAT ACTGTGGTAA ATCTGGTAAT ATCTTTATTT CTAATTCTGT 
ATTTAATTCG AAATGCTTAG CGATGTTTTC AAATATTGCT AAATACTCGC CCATTAATTG 
TTCATTTATT GTAAGTCTAT CTTCATTAGC CATGGCTCTA TTTAACATAA AGCTAATTTC
TTCTAATGCG AATAAGCTAG GATAATAATT TTGAATCAAT GTCTTATCAC TAAATAATTC 
ACCATTAGCT GCATTATAAA CTTGCGTGCA TGTTATTTAA TTTGC 
(2) INFORMATION FOR SEQ ID NO: 820: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1165 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 820:
nCGnCCTTAT ATATGTTTTT CATGTCCTAC AAAAAACGAA ATATTCCAAT TGCCTATATA 
CAGATATTCA TTAATGACAC ACTCAATAGC AACCATnACA ATATTATGTT ACCTATTTAA 
ATGAAATAGT TCTTTTAAAG AAATACATTT TTCACATATT AATCTATAAT CAAAATCAAC 
TGACCGATAT TCTATAATTT ATGATTAAAA TAAGTTATAA TATAATAGTA AAGATAAAGA 
TAGAGGTGGC TATAATGTGT GGACTTAGAA GTATAACATT AGGTACaACA AATATAGAAC 
AGACaAAACa TTTCATGGTT GACATATTAG GATTAAATTA TGAAGAACTT CTTGAAAACT 
CAATTCGTTT CGGCGATGCA GATATAAGCC CAGGAACAAG ACTTCAATTT ATACAAGTTC 
CAAGTGAGCA ATTAGAAGAA TCTCACTTTG TGGGTATTGG ATTACGTACA CCAACTGACT 
CAGGTTTAGA GGAGTATGCG GAAATATTAT CGAATAAGGA TATTCCATTT ACAACAGTTA 
AAGAATTAAA TGGCAATAAA TATTTCAGTC TCGAAGATAA CAATGGTCAT ATTTTCTCAA 
TATATTCAAA CGAGAATAAT TATGGCGTTG GTTTAGGTAT GCCTTCTTyT GAGAGTGCGG 
TCAATCCGTT ACATCAAGTG CAAGGTTTAG GACCAGTGAT TCTTAAAGTG AATCATGTAG 
ATATTACAGG TCAAATTTTA ACAAATATAT TCGGACTTGA AGTATTTGCA GAATACCAAC 
CCTTCGACAA TGCTGACTAT CATGTCCAAG TATTCAAAGT TGGAACGGGT GGTCTAGGTG
ATCAAGTTGA GTTTGAAACG AAAGATGCAG ATTTCTTTAA TCAAGCGAAA TCGCGCTTAG 960

ATGAAGTGGA AATACCATAT CAAACGCTTG AGCAAGATGA TATTGAATCA ATTAGAaTTA 1020 

CTGAAAACAG TGGATTATCG TTTATATTCA CTTTACAAAA ATAATTTTTT TACGATAGCG 1080 

AGGACAAAAT TTATGTTACA TGAAACTTGG AAAGAACGTA CACCAATCAA GAAAGTAGAA 1140 

GTCATTAATA CAGATGCAAA GAAAT 1165 




(2) INFORMATION FOR SEQ ID NO: 821: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 821:

TCGCCCAATT ATTATGAAAT ATTCAATACA GTTTATTGAT CAAAAAACAA AAATCCATTA 60 
TGAACCTTGT CATCACGAAT ATTAACTGAC GTTAGATGGA CCTCTTTAAT GATAGAACAA 120 

TTAATAAATA ATGCACTTAA GTATGCGAGA GGTAAAGATA TATGGATTGA ATTTGATGAG 180

CAATCCAATC AATTACACGT AAAAGATAAT GGTATCGGTA TTAGTGAAGG nACTTGCCTA 240

AAATATTTGA TAAGGGCTAT TCAGGTTATA ATGGCCAGCG CCAAAGTAAC TCAAGTGGGA 300

TTGGTTTATT TATCGTAAAA CCAATTTTCA ACACACACAA ACCATCCGTT TCCGTCGTAT 360
CTAAACAAAT GAGGGTACAA CCnTTACGAn TnCCATTTCC 400

(2) INFORMATION FOR SEQ ID NO: 822:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 760 base pairs 
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 822:

TGATATAATA CTTTTGTAAA GAAAAGCATG TGTGGGAGGT ATGACCTGTA TGTCGAACGA 60

AATACTTATC GTAGATGATG AGGATAGAAT CAGAAGATTA CTTAAAATGT ATTTAGAAAG 120

AGAATCTTTT GAAATCCATG AAGCAAGTAA TGGCCAAGAG GCTTATGAAC TTGCAATGGA 180

GAATAATTAT GCTTGCATAC TACTAGATTT AATGTTGCCT GAAATGGATG GTATCCAGGT 240 

GGCAACTAAA TTGCGTGAAC ATAAACAAAC ACCGATTATT ATGTTGACTG CTAAAGGTGA 300
TTCACCAAGA GAAGTAGTCT TAAGAGTTAA AGCACTTCTA AGAAGAACGC AATCTACAAC
TGTAGAACAA AGCGAACCTC ACGCACGTGA TGTGATTGAA TTTAAACATT TAGAAATAGA 
TAATGATGCA CATCGCGTAC TTGCTGATAA TCAAGAAGTT AATTTGACTC CTAAAGAGTA 
CGAATTATTA ATATATTTAG CTAAAACACC AAATAAAGTA TTTGACCGTG AACAATTATT 
AAAAGAAGTT TGGCATTATG AATTCTATGG TGATTTAAGA ACAGTTGATA CTCATGTTAA 
ACGcTTAGAG AAAAGTTAAA TCGTGTGTCT AGTGAAGCTG CGCATATGAT TCAAACAGTC 
TGGGGCGTTG GGTATAAATT TGAGGTTAAA TCTAATGATG 
(2) INFORMATION FOR SEQ ID NO: 823: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 823: 
CAAAGGAAAT TGCACAATTA GAAGACCGAT TACGTTCACG CTTTGAATGG GGGCTAATTG 
TTGATATTAC GCCACCAGAT TATGAAACTC GAATGGCAAT TTTGCAGAAG AAAATTGAAG 
AAGAAAAATT AGATATTCCA CCAGAAGCTT TAAATTATAT AGCAAATCAA ATTCAATCTA 
ATATTCGTGA ATTAGAAGGT GCATTAACAC GTTTACTTGC ATATTCACAA TTATTAGGAA 
AACCAATTAC AACTGAATTA ACTGCTGAAG CTTTAAAAGA TATCATTCAA GCACCAAAAT 
CTAAAAAGAT TACGATCCAA GATATtCAAA AAATTGTAGG CCAGTACTAT aATGTTAGAA 
TTGAAgATTT CAGTGcAAAA mAACGTACAA AGTCAATTGC ATATCCGCGT CAAATAGCTA 
TGTcTTGTCy AGaGAGCTTA CAGATTTCTC ATTACCTAAA AATTG 
(2) INFORMATION FOR SEQ ID NO: 824: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 355 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 824: 
CACCGCGGTG GCGGACGCTC TAGAACTAGT GGATCCCCCG GGCTGCAGGA ATTCGGCACG 
AGGTAAGGAG GTCTCTGTAC CATGGCTCGT ACAAAGCAGA CTGCCCGCAA ATCGACCGGT
GGAGGGGTGA AGAAACCTCA TCGTTACAGG CCTGGTACTG TGGCGCTCCG TGAAATTAGA 240

CGTTATCAGA AGTCCACTGA ACTTCTGATT CGCAAACTTC CCTTCCArCG TCTnGTGCGA 300

GAAATTGCTC AgGACTTTaa AACAGATCTG CGCTTCCAnA GCGCACTATC GGTGG 355
(2) INFORMATION FOR SEQ ID NO: 825:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1099 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 825:






TTCACTTGGC TTGTTGAcTG ACTTGTAtAT GATGATGTGC TTTGTGAATC GGATTCGCTC 60

GTGCTTGTAC TTGTTGAGTT TGAGGCACTT TGGCTTGCTG AGTTTGAGTC TACTCCGCTT 120

TGATTCATTG AGGCACTTAG TGACAATGAT GTACTCGTTG AGTCAGACAA ACTTGTACTC 180

GTTGACGTAC TTGTACTTCC TGATGTTGAT TGAGACATAC TTATGCTCAT TGATGTTGAA 240

TCGGATTTAC TTTCACTTGA TGATGTTGAG TCGGATTCAC TTTCACTTGT AGAACCACTT 300

AATGATGTGG ATGTACTAAT GGAATCAGAT TTACTTGTAC TGATTGAATC ACTTGTCGAC 360

ATTGATGTAC TTAATGAATC AGACTTACTA TCACTTGTGG AATCACTTAA TGATGTTGAC 420

AAACTTGTAG AGTCAGACAA ACTTGTACTC GTTGACATAC TCAGTGATGT TGAAACACTC 480

TCGCTCTTAA ACGTTGACGT TGATTCACTG ATACTTGTCG ATGTTGAAAT GGACGTACTA 540

CCACTTGTTG AATTACTTAA TGATGTTGAT GTGCTACCAG ATTCTGATGT ACTGTCTGAT 600

AATGACGTAC TCTCACTTGT CGAACTACTC ACTGACTCTG ATGTTGATTC AGACGTACTT 660

TCACTTAATG ATTCACTTAA AAAGGCAGAT GCACTTTGTG ATTCTGAATC GCTAGTACTA 720

TTTGATTCAC TTAATGATAT AAACGTGCTC TCTGAAGCAG ATATTGCTTC ACTTATAGAG 780

TCGCTCGTTG ACGTTGATTC ACTTATTGAA TCAGACTCTG ATGTACTTAA GCTTGTGGAA 840

TCACTCaTAG ATGTTGaTGT ACGTTCTGAA TTACTTGTAC TTAACGATGT CGAAGTACTT 900

ACCGAACCAG ATGTGCTCGT AGAAGCACTT TgTGATATTG ATTCACTTGA TGCAGTTGAT 960

GCTGATTTGC TATCACTCGT TGAATCACTA AACGACGTTG ATATGCTCAt TGAATCGGAT 1020

TGACTTGCAC TCAnTGAACC AGACGTACTT TGTGATTCCG AAGTACGTAC TGAAGCACTT 1080

GTCGACGTTG ATGTACTTG 1099



(2) INFORMATION FOR SEQ ID NO: 826: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 338 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 826: 

TATGCGTATC CTGATAGTCA CTTTGATTTT GATATGGAAT TAGCGAAAGA GCAATCTCAA 60

GACAATCCAG TTTACTATGC TCAATATGCA CATGCGCGTA TTTGTTCAAT TTTAAAACAA 120 

GCGAAAGAGC AAGGTATTGA AGTGACTGCT GCGAATGATT TTACAACGAT TACTAATGAA 180 

AAAGCGATTG AnTTGTTGAA AAAAGTAGCT GATTCGGAnC CTACAATTGA AAGTGCTGCT 240

GAGCATAGAT CGGCACATAG AATTACTAAT TATATCCAAG ATTTAGCCTT CTCATTCCAT 300

AAATTCnATA ATGCTGAAAA GTGTACCAGT TGGTATTG 338

(2) INFORMATION FOR SEQ ID NO: 827: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 838 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 827:

GTATTTTCTT TCGCGAATGA TTTTATTACT TGAATACCAC GAATAACCTC CAATACCTTT 60

TCCACTAATT GGTTTTGTAC ATTATGATAC GcTGGCGCAT TTTGTCGACT CTTTCTTTCT 120

AATAATTGAA TCGCAAAAAA TGATAGTAAT ACGCCAATGC ATGCTAATAA TGATACTTGC 180

CACGAAACTA CAAGTAGAGA CAATATGAGT ACTGTAATTA ATATGTATCC ATTAACAACA 240

ACGTCCACCA TTTTCATAGC AAAGTTTTCT AAAAAGGTTA AATCTGTTGT TACTATTGTT 300

GTTAACTCAT TTGAATGATG CGAATTAAAA TAACCTAACC TTACATTTTT CAATTTATCC 360

CCTATATCTA AACGTTCTTT CGCACTCATT TCATAAGCGA TGCTCTCATG GCTTTTGCTT 420

TTGAAATATG CTGTAATAAA TCGTCCAATC ACTAATAAAA CCATGATAAT TACAACATTC 480

AATATATCTT TCATATAAAT AGGTTTATGA GATAGCACAT TATTAAATAT TTTTGCAGCT 540

AAAAAGATAG GTAACGCAAT AAAAATAGCA TTTAAAAATG ACATGCTAAA TCCTAAAATC 600

ATTCTTGCTT TATATGGTCT TATCCAGTTT AAAATTTTAA ATGTAATTTG AAACATATTT 660

GACCTCCTTT TCTCATTAGT TTAATTATGT CCAGTATTGA TTCCCCAATC TTTTGTATGC 720

ATGTGCGTAT CCCACATTTT CTTATAATTA CCGTTTAATT TTAGCAATAA GTGATGTGAC 780
(2) INFORMATION FOR SEQ ID NO: 828:

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 898 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 828: 



TCCnAAAGAA GAAATAaCAT TATCATaAAA TCCATTGAAT ATAAATTCAG TCATCCCCTC 60

TGAACTtCTC CATAAATATA ATGGTGAATA AGTATTTGTA ATGCAATTAG TAGTTGTTTG 120

ACTTATTAAA TAAGCTTTTA TAATTAAATT TTTAAATCCA TCAGTCTTAT AACCATTTAA 180

TCGAACTCTA TCTCTAATTA TTTCCATATT GTAATCACTA GGCAACTTAA CATGATATTG 240

CATTGCATGC ATTTAGCATA CCCCCTTTTA TAAAAAGGAT AGCAATAATA AGTAAAATCT 300

CATATTATCC AATTGTGATA TAGTTATCAT AAAAAGTGAT AGGTGATTAA ATTGAACTTT 360

AATGATTTGG AAATTTTTAT AACTGTATGT GAAGAAGCAT CTATCAATAA AGCTGCAATT 420

AAACTTAGAT ATGCACAATC TAATATATCT CAAAGAATTA GCAAGCTTGA AAATGAATTA 480

GGTGTAGTTT TGCTTTTTAG AAATCAAAAA GGTGCTAAGG CAACTAAAGC AGGCGAAGAA 540

TTCTTAGCGT ATAGCAAAAA AGTATTAAGA GATACAGAGA CTATAAAAAA TAAAATGAAA 600

AATAATACTA TGTCTATTTT ATGCTCAGAA CTGTTATTTA ATTATTTATC TGAGAGCGAA 660

GAAATTATGA TGTCGAATAA CTCAATTAAT TTTATTTCTA GTGGAAATAT TAGAAAAGCT 720

ATAGAAAAAA ATAATTATGA TAAGGTTATT TCATTCATAA AAaTTAACGA CTCAAATTAT 780

AGACTTAGTA ATGTTGATAC TATGaaAGTA ACGCTTTACA GTAATGGAAG TAATTATGAT 840

AAAGAGGCTT TACTAATAAA TAAAGATGAG TTTGGTCCTT TAAGGnAAAT AACTTTAG 898



(2) INFORMATION FOR SEQ ID NO: 829:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 798 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 829:

AGTAGGAGTC ATAAAATCGn ATTTCAAAAG CAATTAATGC AAAATACTGA GTCTACAGTG 60 
CTATTTGCGA AAGCGTCcAT ACGAAgTTGA GcAATGCTAA TAATAATGGT CTATCAAAAA 120
AATATGTTGA TAGTTACACT GATTCTCTTA CTGGAGTAAC AACTTCTGCT TTTTTAAATA 240

AAGATACAGG CAAAGTAACT CTCGGGATGA CTGGGACTAA TTTACAAGAC GAAGCCTTTA 300 

AAAAGTTAAA AGAAGGTGAA TTTTCAAGAC AAAATGTTAC CAATGCTTTG GAAACAGTTA 360 

AAGATGGATA TGCAGATCTT AAAATATTAT ATTCTCCTGC ATCTGATCAA AACTATAGAT 420 

ATGCGAATAC ACAAGAATTT ATAAATAAAA TAAAAAGTAA GTATGACATT GATTTTATTA 480

CTGGACATTC ACTAGGTGGA AGAGATGCGG TAGTTCTAGG AATGAGTAAT GGTATTCCGA 540

ACATTGTGGT TTATAATCCA GCTCCTATTT CTATAACTAG TTTGAATCCT AATTCCCCAG 600 

ATGGAAAACG TTTATTAGAA TTATATAAAA ATTATAAAGG TAATATTACT AGGTTTGTTG 660 

CAGAAAATGA TGCATTGACA GAAAATCTGA AGAAATATAA GCATTATGTT TTTTTCGGTA 720 

ATGATAAAGT CTTTAAAAAT GGTAAAGGTC ATGAAATGkA AGGCtTTCTG ACCGAAGAAG 780 

AACAAAAAGC tATAAAAn 798
(2) INFORMATION FOR SEQ ID NO: 830:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 830: 

AATATCAATC TCTTCATAAG CTGAATTATT TTCATGCACT TCTTGATGTG ATGATTTGTC 60 

ACGAACnGCT ACAACTAACA TTTTATCGTC TAAAATAAGT TGTTTATATT TTTCTAATTC 120

ATCAGGCGCT AAGTTGTAGC GTGATAAAAC TGCATGTTCA CCATCTTCTC CTGTTAACAG 180

TTTAGTCATT CTATCACTAA ATGTTCCACT TGTTGAGATA AGGGGAGATT TCAGAGTCGT 240

GTAAGTCATT AGGTGTAATT TACTTTTACT AATAATTGTT AGCCTGATCT AAATAACTTC 300

AGATTCTTTG ATTGATAnGT GnATAATCGC AGTGCATTAC ACAGAATACT GCCAAGTGCG 360 

CCCTTAAAAT TGTTATTAnC TTACCTCTAT ATAAGAACCC 400 
(2) INFORMATION FOR SEQ ID NO: 831: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 475 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 831:

AAGTCTGCAT GACTTTTATT TGCAAGTTCG ACTGCTTGAT CAAAAGCGCG ATGATCTTCA 60

GGATtAGCAC TTTGAACTGA GCTGAAATTC GGATCAGGTT TACATTGTGC TTCTACAAGA 120 

TTAAATTGAT TGAAATTCAA AGATTGTAAT AACTCAGGTA CAATCGGAAC ACTTGTACCA 180

TGCAAACTAG TGAACACAAC TTGTAAATCA GATTTAGGAA TATAGCCAAT CATATTTTGA 240 

ATGTGTTTCA TATAGTCATC AGTTACAGAT TTCGGAAAAG GCTTGaTATA AGATGTATTT 300 

TGTTTAGAAA tAGGTATATC AATCTGTAAT GGATCGCCAA CTTCTTCGAT ATAACGACTT 360 

GCAAGCTCAG ATGCATCAGT CGATAATTGC GCACCATCAG AACCATATAC TTTGATGCCG 420 

TTATAGTCTT TCGGATTATG ACTTGCtGTA ATCATAATGC CAGCAGTAGT ATTAA 475

(2) INFORMATION FOR SEQ ID NO: 832:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 724 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 832:





CTTGAATTGA AGGATACGCA AATGATGCAT AATCAGCTTC TGTATAAAGC TGTAATGTAA 60

TGTCATCAAG GTCACCTTTT CTAACAAGCA CCTTATTAAT AGAAGTATGA TTCGCTTGCC 120

AAGTACCTTG ATTATTTTGT TCTAAATGAA TGACTTCGCC TAACGATTTC AACGTAATAT 180

CTGCACGCTC GTCTTCGCTA ATAGTATATG TCTTACCATC TCGCAAATTG AGCATCTTCA 240

ATTGTTTGTT ATATTTTATA ATCAATTTAT GCATTGTCTT TGCCTCAGTC CTATACTATT 300

TTTTTCTTTC AGCTTCTTGG CGTTTTTCTT TATCTTTTTG TGCTTGTTCT TTTTGTTTCT 360

TTTCGTTCTC TTCTTGTTGC TTTAATTTCT CATCTTTCGC TTTTGCTTTC TCTTCTTCAG 420

ATTTCGCTTT TTCATCTTTA ACTTGTTTTT CTTTGTCTAA AATATCTTGC AATTTATCGT 480

TATACTTTTT CGTTTCTTCA GAACGTTTAT CATTCGATAA ATCTCCGTTA TTTTTAATCT 540

CATTTAATTT ATTAATCAAC GCTAACTTTG TAATATCGTT ATCATCTAAA TAAGTGGCAA 600

TATTAATCGC TTCATCAAGA TGTCCTTGTC CTAATTCCAT CCAATATAAT AAGTAGTCTT 660

TGTTTGAATT TGGTGTCACA TTATTAAGTA AATTTTCTTT CGTATCTGTn TCTAAACCnT 720

GTTn 724



(2) INFORMATION FOR SEQ ID NO: 833: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 994 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 833:





AGCAGTTTGG CTCGTAGAAA TCTCCCGTCT CTATTCTATG TATTAACTTT TATTATATTT 60

GTTATGATAC TATCGATATG AAAGCGTTGT CAATGGTTTT TGTAAAAATT TTGTCAAATT 120

TAATTTTTCA AGTCTTTATA AAAGTAGTTT AATTTGAAAA CTAGAAATAC CAATCCTAAC 180

TAATTTATTT ATGACGCTCT TTTGTTTAAT TGACATTCTT CAGCCATATT TTTTATGACT 240

AGTTGCATTT TTTACTAATA AACACCTCTA AAACTTTAAT GATTTTAATC GTTTTAGAGG 300

TGCTAATTAf TTTATTTGAT TATTTTTTGT TTGATACCTA CTGCATATCC CATATGAAAA 360

CGGCTTTTTT [illegible] ATGACTAAAT CTCGTGAAAA ATGAAATTTT TGCAGACTTA 420

TGATTTACCA AAGTTTATCA TAACTAGTAG TTACATATTT CGCTCCATTT TTAACAGCTT 480

CATTGACTTC [illegible] TTAATTAGGC CACCTGCAAT GACTTGTGTG TTTGTTTCTT 540

TCTGAATATG ATGAATCGCT TTACTCGCAA CACCTGGAAG TACTTCAACA AAATCAGGTT 600

CAACmTlT TATCAAATCT ATACTGCGTT TCAATGCTTG ACTATCAATA ATAAATACTC 660

TAAAAATCGT TAAAGTATTT AATGATTTAG CTTTTTTTAT TACTTTAGAT TTAGTCGATA 720

CGATACCTTT TGGCTTGTAT TGCTGAATAA TAAATTCACT TGCAAATTCA TCGTGGCTTA 780

AACCTTTTAT CAAATCTATA TGAATAAAAC ACTCTATATG ATTTTGCTTC AGCAATTCCA 840

TAATACTTTT TATATGTCCT ATATGCATAT CTAGAAGCAC ACACATTTTA TAGTCTGTnT 900

TAATCCAGTT TCTCTAAATC nTTAATGTTT CCTATAAGCA GGCAATATGT TGTTAATTCA 960

CTnGATCCAT CCTCTCTACA TCACACGCTT AAAT 994



(2) INFORMATION FOR SEQ ID NO: 834: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 783 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 834:

TCnACAAAGT CGGTATTAGG CTATGGGCCA TTTnACTATT ATAACTAATC GGAAATATAC 60 
CACATAACAT CATTATTGAG TTGATTCTAT CATTTGGCTT ATTAGGGTTT TTTATCATAA 120 
TGATTTGCAT TTTGCTACTA GTTTATAAAA TGATTAGGAA CTATGATCCA AACACTATAG 180
ATTTAGTTGT AAGTGAATTT TGGTTTGTGT TGTTCTATTT TATTACAAAA GGACGGCGTC 300

ATCATGGCTA AGAAAGTTTT TATTATGGAT AGCGTAAAGA CAATAATTGG TACGTTGCTT 360

ATAGCTTTAG GATTACAATT TTTAGCTTAT CCAATTATTA ATCAACGAGT AGGTAATGAA 420

GCGTTCGGTT CTATTTTAAC GATTTATACA ATAATAACAA TCACGAGTGT TGTATTAGGC 480

AATACGCTTA ACAATATACG mTTGATTAAT ATGAATCTAT ACAAATCCAA TCATTACTAC 540

TGGAAATTTG CATCGATACT TTTAATCTCA ATTCTGATTG AGAGTATAGC TTTAATTATT 600

GTATTTCTTT ACTTTTTTAA TTTGAACATC ATCGATATTA TCTTTTTAAT TCTACTTAAT 660

ATTTTAATGT GTTTAAGGAT TTATCTGAAT GTATTTTTTA GGATGACTTT AAAATATAAT 720

CAGATTTTGT ATATTGCTCT TATTCAATTT TTAGGTTTGC TGATAGGACT ATTTCTATAT 780

nAT 783

(2) INFORMATION FOR SEQ ID NO: 835:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 794 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 835:

TTACCTAATT TTTCAAtCaT AAGATTCCCC CTATTGTTTA AACaTAAAAA TATTATaCGA 60

TTAAGCACGA GmCACTTCAA TATATTTTTC AGAATATTCT TATAAATATT AATATGATCA 120

TTTCACTATT TAAACACGTT TAATATTAAA ATAAGTTATT CATATAAACT GGTGCTGTTT 180

GATCCAATTG CAGACTTACG AGTCATTGAA AAGTCTCACA AACTATTGmA AGTAAAtATC 240

TTAAAATAGA AAGTGAATGG TTAATTTAAG TATATTTmAA AAATATTAAC CTTTTTAAGC 300

ACTGCTATTT AGGATATACT AAATAATAAC TAAGTTTAGA AAAATAGGAG GAACATCGTT 360

TATGTTAAAC AAGGTTTGGT TCCGAACTGG CATCGCTCTG ATTATGCTGT TCATTCTCAT 420

CAAACTATTT ATGGAAGTGC ACGAAGTATT TACTCCAATA GCTACAATCA TCGGTTCTGT 480

ATTTCTTCCA TTTTTAATTA GTGGTTTTTT ATTTTATATC TGTCTACCTT TTCAAAACTT 540

ACTTGAAAAA GTCGGTTTTC CAAGATGGGC TAGTATAACA ACAATCATGT TAGCTTTGTT 600

TGCTATTATC GGCTTAATTG TTGCGTTTGT TGCACCAATC ATCATTTCAA ATATCAACAA 660

TTTAATTAGT CAGACACCTG gCCCTACAAA AAGGAAGCAG AGCAAATTAT TAAATTCGCA 720

CTGGCTCAAA TGGnTAAATT ACCTGGAGGA TGTAACCAnG TGGATTACCA ATATGGTAAA 780
(2) INFORMATION FOR SEQ ID NO: 836:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 836:
TGTTTATACT GTGCCTGAAT TAGAAGAGGT TTTAACACCT ATGAGACAAG ATGGAACTCG 60
TGATATTTAT GTTAATTTAG AAAATGTGAG TtATATGGAT TCGACAGGTT TAGGTTTATT 120

CGTAGGTACA TTAAAAGCAT TAAACCAAAA TGATAAAGAA CTATACATTT TAGGTGTGTC 180
AGATCGTATC GGTAGACTAT TTGAAATTAC TGGTCTTAAG GATTTAATGC ATGTTAATGA 240
AGGAACGGAG GTCGAATAAC ATGCAATCTA AAGAAGATTT TATCGAAATG CGCGTGCCaG 300

CATCGGCAGA GTATGTAAGT TtAATTCsTT TAACACTTTC tGGCGTTTTT TCGAGACTGG 360
TGCCACATAT GATGATATTG AAGATGCCAA GATTGCAGnT 400
(2) INFORMATION FOR SEQ ID NO: 837: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 433 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 837:

ATTGATCCAT TTTGCGGTTC GGGTACAATA GCTATAGAAG CTTGTTTAAT TGCTCAAAAT 60

ATCGCACCTG GTTTTAATCG CGAGTTCGTA TCAGAGCAAT GGAACATCAT GCCAGCAAAT 120

ATTTATGATG ATTACCGTGA TGAAGCGGAT AAGATGGCTG ATTATGATAA AGAAATCGAA 180

GTATATGCTT CTGATATCGA TCCAGAAATG GTAGAGATTG CTAAGCGTAA CGCTGAAGAA 240 

GTTGGGTTGT CTGATATTAT TAAATTTAGT GTAAAAGATG TCAATACATT AACAATTGAT 300 

ACAGAAGAAC CGGTGGCGTT AATTGGAAAT CCTCCATATG GTGAACGTAT TGGTGaTCGT 360 

GAAGAaGTcG AAGAAaTGTA CCGTTATATT GGTAAACTAA TGAAACAACA TCCATTTTTA 420 

TCTACATACA TTT 433 
(2) INFORMATION FOR SEQ ID NO: 838: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 926 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 838:








ATTGGGCCAA CTAATCCTGT ATATCCAAAT CCAGCAGACA ATGGTGTTCC TTTAACCTGA 60

AGAACGTAAG CAATGATTCC AGTAATTATT CCATTTATAG TCAATGGTAT CGAAATAATT 120

AAATTTTTCA AGTAAACTGG GATCATCATT TTTGCAGCTC CTATGAGTAA AACTGCATTT 180

ACACCAATAG AATTGACACG CAATGAGCCA AATAAGAAAG TAACACAGGC AGCCACTATA 240

CCTAGGTTTG CTGCTCCACT TCCTAGTCCG TTTAAACTAA TCGCAGTTGC AATCGCTACT 300

AACGATATTG GTGTTACCAT TAATAATGAA AATGCCACAC TAATAAGTAT AGACATTAAC 360

AACGGATTTA AGTCTGTAAA AGAATGAATT ACATTTCCAA TTGCTTGAGT AATTTTTCGA 420

ATGTAAGGTA ATGTGATTAG ACCGATACCC CCACTAACGA TAGGTACTAA AACTGGTAAT 480

ATAATTAATT CAAAAGATCC AAGTTTGTTT TGTAATACCA TATATATAAG ACATGCAATA 540

ATAACAACCA AACTCGTATT TATAATGtCa CCTATACCTT TTAACATAAA ACTATTATTG 600

CTATATACAA CAGCACCTGA ACCAATCATA GCTGATGTAC CTACTATAGC AGCACCTGCA 660

CCATTAAATT TAAATTGATG AGCAGCTAAA ACCCCAATAA TAAATGCCAT AAATGATTGA 720

ATTAGTATCA CTAACTGATA CGTTAATTCT AAAATTTCAT TACCACTTTT AAATATTTTt 780

AATACTTCAC CTAATAAAGC ATTCGGAACA AGTGCAATAA CAACACCAGC ACCAATAGAA 840

TTTAAAATCT TACTGAAAAA CTGTTTATTA TCAGCATTAT TTGCGTTACT CATAAACGAC 900

CTCCAATTTG AACATTAnAC TCATCA 926




(2) INFORMATION FOR SEQ ID NO: 839: 








(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 839:








CTTCGAAACG TACTTGGTCA TTCCCTTTAC CAGTACAACC ATGTGCAATA CCTACTGAAT 60

TTGTTTTCTC AGCAATCTCT ACTAATTTTT TAGCGATTAA TGGTCTTGAT AAAGCTGAAA 120

CTAATGGATA TGCATTTTCA TACATTAAAT TTCCTTTGAT TGCATAACTT ACATACTCAT 180

CACTAAATTC TTTTGTTGCA TCAATAATAT GACATTCAAC TGCTCCCATA TCTAAAGCTT 240
CGTATCCtTT GTCGATAAGC CATTGAACGG CCACACTTGT aTCTAGtCCt CCkGAATATG 360

CTAAAACAAT TTTCTCTTTC ATAAAATTCA CCTCATTGTA 400 
(2) INFORMATION FOR SEQ ID NO: 840:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1043 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 840:

GGATCGCGGT GTTGTGCTTG ATATACTTTG TAGCGATAAC GTTTACCTAC ACAATCATAA 60 

CGACAATGAA AATCGTCATC GACTGTAACT ACATTGTTGA CATAAATATC ATCAGGTAAC 120 

GTTCGGTTCA TTGCATATTG CCATTGTGAC ATAGGTATAT TCAGCTCTGT GTCGAAATGA 180

AAGTATTGCT GTATCGCATG TACACCTCTA TCAGTCCTAC TTGAAGGATG GATTCTCACA 240

TGTCTTTTAT GCATGCGTTG TAATAGCTTT TCAAATTGTT GCTGTACCGT TCGACCATTT 300 

TGTTGAATTT GAAAACCTAG AAAATTATTT CCTTGATACG CAATTTCTAC TAATATACGC 360 

ATGAATTTAC ACTCCTGAAT ATTTTAATAC GAATAAAATA ATTGCAATTG GGATAATCAT 420 

GGTTAAAGAT ATCGTATCTC TCAATTGCCA TTTAAGCTGT CTGTAGCTCG TTCTCTTAAC 480

ATTGGCATCA TAACCCCTAA CTTCCATTGC GACCGCTAAT TCTTCGGCGC GTTGGAAAGC 540

TGAGATGAAT AGTGGCACTA GTAATGGAAT AAATGATTTA ATACGTGTTG CAATGTTCCC 600

TGAACTTATT TCAGAACCAC GCGACTTTTG CGCCAAAATG ATTTTATCTA ACTCATCCAT 660

TAACGTCGGG ATGAATCGTA ACGCAATGGA CATTATCATA CTTAATTGAT GAACTGGTAA 720 

TTTAAACATC tTTAGTGGTG CAAGTAATCT TTCAAACGCA TCTGTTAAAT CAATTGGACT 780 

TGTAGATaGT GTCaTAATTG TTGCAATCAT TAcAATCCCA ATTAAACGCA GTGrTATATA 840

TAGCCCTTCT AAAATACCAT TAGTTTCAAT CGTGATGCCA TGCCATTCAA CTAATACATA 900 

TCCACCTTTA GTTAAAAATA TATGCATCAT TAATGTGAAG ATTAAAAAGA AAAATATTGG 960 

TGTTAAACCT TTGATTAGGA ACCATAATTG AATTTTTGCT AATCTCATAA TGAAnAnGAT 1020 

AAGTGCAAAC ACCCAAAGAT ATG 1043 
(2) INFORMATION FOR SEQ ID NO: 841:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 343 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 841:

TCGAAATCAA ATTATAATAG ACAATTTTAG GAGGTGGACT TTCGATGACC AAAGGAATCT 60 

TAGGAAGAAA AATTGGGATG ACACAAGTAT TCGGAGAAAA CGGTGAATTA ATCCCTGTAA 120 

CAGTAGTAGA AGCTAAAGAA AATGTTGTAT TACAAAAGAA AACTGTAGAA GTTGATGGAT 180 

ACAACGCAAT CCAAGTTGGA TTTGAAGACA AAAAAGCATA CAAAAAAGAT GCAAAATCTA 240 

ATAAATATGC TAATAAACCA GCTGAAGGTC ACGCTAAAAA AGCTGACGCn GCACTGAGAG 300 

nTCCCCTCAT AATTTCCCCA AnCGTAACCA TGTGTGAATA AAT 343 




(2) INFORMATION FOR SEQ ID NO: 842: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 430 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 842:

ATGAGTATTT TAATGATGCG CTACGAGCGT ATGGTCTTAC TGTGAnAACA GGTGAATTTG 60 

GAACACACAT GAATGTTAGC ATAAATAATG wtGGwCagTC TCTGGTAGTT TAGAATTAGA 120 

AGAATTAAAG CAATTTAGAC AATGGGGTTC TAAAACACCA GGTCATCCTG AATACAGACA 180

TACAGATGGT GTAGAAGTTA CTACCGGACC ACTTGGACAA GGTTTTGCTA TGTCmGTAGG 240

ATTmGCTTTA GCAGrAGATC ACCTAGCAGG GAAATTTAAT AAAGAAGGAT ATAATGTTGT 300

AGATCATTAC ACATATGTAT TAGCTTCTGA CGGTGATTTA ATGGAAGGTA TATCGCATGA 360

AGCAGCTTCA TTTGCTGGAC ATAATAAATT AAGTAAATTA GTTGTTTTAT ACGATTCAAA 420

TGGATATTTC 430

(2) INFORMATION FOR SEQ ID NO: 843:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 843:
GGACTGCCCG ATGATTTGAC AAATGAATTG CTGATTTGAT TTATATATTA TCTGnAATTA 60
TCATCTATCC AACAATATGT CTTATCAATG GTATAGTCTT TGCACACCAA TGGAGGnAAA
TAAATCTCAA CCTTACTATA TTAATATATA ATCAAATCTT AGATTAACTA GTGTAATGAT 
ACAGATGGAT AATTGAGTAC AAATTTAAAA CCCTGAGATT TTCGCTTTAA TTTGAAAACC 
TCAGGGnTTA TTTGATTTTT ATATAATGA 
(2) INFORMATION FOR SEQ ID NO: 844: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 844:
AGATGAAGAA AGTGTTTCAT CGATTATCTT AACAGTTGTT GCTTTATATA CTACAnTTCT 
TCCTAGATCT TGTACCGTTG AAAAGAGACC TGGTTGTAAA ATCTTAATTG ACATTTTCAA 
TCACCACCCA GTCATCAACA TTAAAGTTGC CATCTGATAT ATCTCTTTCG ATTTGTATAA 
ATTTCTGTTC ATCTATTGCA TAAAATTGTA TCCATTCTCC TGCTTCGTAC ATTGACATTG 
GTTCACGCTC GCTGCTAAAT ACTTTTAACG GTGTGCGTCC AATAATTTGC CATCCGCCAG 
GGAAATCTGA TGGGATATAG TCCTGTTTGA TTATTCGCAA TACCTACAGA ACCTGGCATG 
AATTTTTTAA CCTTGGGCnG ATTACGTCTA nGGnGTATGT 
(2) INFORMATION FOR SEQ ID NO: 845: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 845:
CATAAGTCCA GTAACCTTCA ACTGTGTCCT CATCTTTAGT TCCAAATATA TCAACGTATT 
TATTTCTTAA CTGATTAATG TTTCCCCAAC TCTCGGATCC AAACACTTGA ATATGACTAT 
ACCAAACCCA CGTTTGCAAT GTTGCATGAA TGCTACCATT GGGCTTTTGC CATAACCATT 
TTCCAGATAA TGAAAAATGC GGCTGAGTGT AATATTTGAT TAGTTCATTG ATATTAGTCT 
CGTTTTCACT GATATTATAG GCTTTTGCTT CAGATGAAAA ACTGATTGGT GTTTTAGGAA 
GTTGTGTTGA TGTAGTTTCT AAAAGTAACA ATGCTGTTGA TAAAACTAAT TTATTCATGA 
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ACCGATTCGG AATTAAATAA AAGCTAAAAC TATGTTAAAT AAACTTAAAC AGTTAGTAGT 
GTTATTTAAG CAAAACTTAT CATTTTTAAG TTGGACAGAA CAGATCAATA AGAGC 
(2) INFORMATION FOR SEQ ID NO: 846:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 509 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 846:
TCAATCGATC AAATGGTGAT TGACCTTTTA AATTTGTATG ATCAAAATAA TGATTGTTGG 
CCATATCTTT ACTATGCTTA CGTGCAGTTT CAGAATTCTG TTTAGAATAC TTCAATGTAG 
ATAATTGATG TTGTTTTCTT TCAGCATTAA CTAAATCAAA ATTTTGTAGT TCAAAACTAT 
CTGCAAGCGA TTTCGATGGT GCTCCATATT GTTCTTTTAA TCTATTTTCC ATAGCATCAC 
TTACTTGTAA AACAGCTGTT ACATTATTAC GTCGATGCTT ATCATAAAAT ACCGTCGTAT 
AAATGTGATT TTTATGGAAA ACATCATATT CTTTATTATT TTGTTCGTAA CGCACTCTAC 
CTTTAACAAT CTCTGTTTCT GGTTCGCCTA ATCTTTGCCT TACAACCGAT TTAGGTGTAT
TGTATTTAAT TTTTGATTTT GAAGTGATTA TATTTTGATT TGTATATAAC GCATTAACTT 
TATCTTTCAT GTAACTTATC ATTATAAAA 
(2) INFORMATION FOR SEQ ID NO: 847: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 847:
CGTATTGATC TATAAATAGT GTTTAGATGC TATAGTCGGA TGCTTAAGTA ATTTAAAGAA 
AGTATCTTTA ACATCGATGT GTGTATAATC ATTTTTAGAA GTATTATAAT CTTTTTCTTC
TCCCTTCTAA AATATATACA GGTGCTTCAT CAGCTAGTGG TTCAACTGGA ATGTCAGCAT 
AAACTTCGCA TCATATGTTA AACAAAAACG ATTGTATCTG TACTTCACCT ATAACAGCAC 
TATCCAATTC GTGCTTATCA AATAAACTAA GATTTTTGTC AGTACCTTTT CACAACTAGT 
ACATACGTCT TGAGTTCTGA AGCATCATTT CATAAGGAGA ATACTGGCCA CGTGTGGACT 
(2) INFORMATION FOR SEQ ID NO: 848:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 848:

TACCACTTGA ATACATTGCA TATAAAATAC GACGTTGTAC TGGTTTTAAA CCATCACGAA 60
CATCTGGCAA TGCACGCTCT TGAATAATAT ATTTACTATA TCTTCCAAAG CGATCACCTA 120
AAACATCTTC AAGTGATAAA TCTTGAATTA TTTCACTCAC TAGATTTCCT CCTCATCAAA 180
TTGATCATTT TCAAGCACTT GTACTTCAGA ATTATCTAAA ATACTTTGGT CCTCTTGCAT 240
ACCAAACTCA ACATGCTTTT CAATCCATTC ACGTCTAGGT TGTACTTTGT GCACCCATTG 300
AATGTTGTTA CACGTTTAGA TGGAACGCAC TTGCATCTTG CAACTTGTGA CACGnATTGA 360
AAGTTnCGTG TTTGCGGGGG TTnCAGCGTC GTTTGCCCAT 400



(2) INFORMATION FOR SEQ ID NO: 849:



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs
(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 849:

TTTATATGAT TTATTTGAGG AGTTAAGAGA TTTATTTAAA GAAGAAGATT TAGAACCATG 60
GACATCATGC GAATTTGATT TTACAAGAGA AGGTGAATTA AAAGTTTCAT TTGATTATAT 120
TGATTGGATA AATTCAGAAT TTGGTCAAAT AGGTCGACAA AATTACTATA AGTATAGAAA 180
ATTTGGAATT TTACCAGAAA CGGAATATGA AATTAATAAA GTTAAAGAAA TCGAGCAATA 240
TATTAAAGAG CTAGAAGAAT AAACTATCTT AATGTAAGAC TAAACAATAA AGCTTTGTTT 300
AGTCTTTTTA GCGTTTAAGT AAAAAGCaAT AGATACCGTA AAGTTGATGC TCATCAAATA 360
ATAATATAaA GATAATTTTA GGTTTTTAAA CTTTTAATCG 400



(2) INFORMATION FOR SEQ ID NO: 850: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 923 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 850: 

CAGGCCATAC TTACAATTTT GGCCATCGTC CGGATGTATT nGGAATCGTT GGATAATAAG 60 

CATCAAGGTC GTGACTATTT CGTAAAATTC CAACTGTCCC AGAATTTACA TCATTATGTC 120 

CAATTACTGG GCAACCaGAT TTTGCCAACT ATCTATATtT CaTATATTCC aAATGTTAAA 180 

ATGGTTGAAT CAAAATCTTT GAAATTATAC TTATTTAGTT TCAGAAATCa CGGTGATTTT 240 

CACGAAGATT GTATGAATAT TATTATGAAT GATTTGATAG AGCTTATGGA CCCACATTAT 300 

ATTGAAGTCT GGGGCAAGTT CACACACGTG GTGGAATTTC TATTGATCCT TATACAAACT 360 

ATGGACGTCC AAATTCTAAA TATGAAAAAA TGGCTGAGCA TCGTTTGATG AATCATGATT 420

TATATCCCGA AAAAATAGAT AATCGTTAAA TGTATCATTT AATAAACACA CCAATAAGTT 480 

GATTTTCCTA ACTTATTGGT GTGTTTTTCA TTTAGCATAC ATAATAGGTT ACATTAAAAT 540

AACATTTTAT ACCAAAGTAC ACCAAAAGAA TATTAGTACA CGAATTAAAC AACATTTTTA 600 

TAGAAACCTA TTGCACTTTA ACGTCAATAA GTATATTTTT ATATTATCTC TAATTAATTG 660 

TGCGCGCTTA ATAACAGAAT ATTCTCAATA TTTTTATTTT TTTGTGATTT GTTGGAATAT 720 

TTAGTTGATA AGGCACAATC AAATTTACTT AAACTATTGT ATTAGGGGAA GAAAGGATGG 780 

GATGTATACA TGACACAACA AAACTCCCAT GGAAATCAAA TTCAAGACAT ACCTCAAACA 840

GGATTTTTCG GGcATCCTCG AGGACTAGGC GTACTCTTCT TTGtAGAGTT CTGGGrAAGG 900 

kTTAGTTATT ATGGGCATGC GTG 923 
(2) INFORMATION FOR SEQ ID NO: 851: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1004 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 851:

TTAnTTGCAT CTATTTCAGT TGGTGTAAAG AAAGCGGAAT TTGATTTTAT TGAAAAGTTA 60 

GCtCAAGAAA AATTAATCCC CgAATATATT ACAATAGATA TTGCGCATGG TcACTcAGAT 120 

TCaGTGATAA ACATGATTAA ACATATAAAA ACCCATATAC CTGATAGTTT TGTTATTGCT 180 

GGTAATGTTG GTACGCCAGA AGGTGTTAGA GAATTAGAAA ATGCTGGTGC TGATGCTACC 240 

AAAGTCGGTA TAGGTCCTGG TAGAGTTTGT ATTACAAAGA TTAAAACAGG TTTTGGTACT 300

GCCGATGGTG GTATAAGAAC GCATGGCGAC ATTGCTAAAT CAATTAGATT TGGTGCATCA
ATGGTCATGA TTGGTTCATt wTTTGCGGCA CACGAAGAAT CACCTGGTGA AACTGTAGAA
CTTGATGGTA AACAGTATAA AGAATATTTT GGTAGTGCAT CTGAATTTCA AAAAGGCGAA
CATAAAAATG TAGAAGGTAA AAAAATGTTT GTAGAACATA AGGGTTCATT AATGGATACC
TTAAAAGAAA TGCAACAAGA TTTACAAAGC TCAATTTCAT ATGCCGGTGG AAAAGACTTG
AAATCATTAC GTACTGTAGA TTATGTTATT GTTAGAAACT CTATTTTCAA CGGTGATAGA
GATTAATATT TATAGTAGGT GAkGTAAATT AAAAAATTCA TAGTAACTGT TGTTGCGTTT
TTATCAATTA TTATCATTGC GCCAnTAACA GAATTTAAAC CATTCATTCA TTTACAAAAT
GAAGTAAGAC AATATATTGA CATTCACATC AATAAAGAAA CAATTTCTGC GGAAAATAAA
TTGGATACAC CGAAGAAACA ACAATTTGCC TTTAATAATA TACAAATGAA CATGTCGAAA
TCAGATGTTG AGAAAACATT AAATAAACCA AAAAGAGTGA CATn



(2) INFORMATION FOR SEQ ID NO: 852: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 852:
GTTCGCTCCA ATGTACGCAG TTGTATATGC ATAGACACCA AAATCGAACC ATTCCATTGC 
ATTACCGATA CCGGTTGCAA CAACGGTTTT TTTAGCTTTC TTTGCATCCA CCATGTTAAT 
ATTCTCTTTA TTAAAATCCA TGAATTCATA CACTCCCTCT TGTATGTTTC ATAATTATAC 
GGAGTTAATT AGTTTTGTCA AATTTATAAG AATTTAAACT TTTATATAAA TACCAAAGTA 
TGTTTGTTCA AAAATTTCGC TTTATCGnAT TAAATTCATG GTTTTATACC TCATACATTG 
ATTAGACATA ATAATCTAAA CACGACGTTA CTTTGGATAT ACTnAATGGT nnCACACTCA 
GTAGGAAGCA CGCATTCGCT CGATAATGTA ACCATACTGA 
(2) INFORMATION FOR SEQ ID NO: 853: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 361 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



GGTGAAATTG TAACTATATA ATCGATATTA CCTTCAGTGT TCAAAACTGG AAAATAATAA 60 

TTACCATCAC TCTTTTTGTT GAATTTATAA ATTTTAAAAG GTTCACCAAG TGTATACGAG 120

GCTTTCTCTT TATTATAGAT TTTATCAAGT GAAGTAACAT AAGAAAGATA GTCTTTTTGC 180

GCTAAATTAC GTACGTCAGT AGGAAcACTC TTGTCCTCGA CATTAACTTG CACATGCTTC 240 

TTATCTTTTG CTTTGATATT TGAATTGCTC TCAGCGTTTG CAATAGGGGT AACACTTAAA 300 

GAaAATATTA AACTTAAAGC TATTAATTTT GGAAAGTTTC TTTTCATATA AAAACTCCTT 360 

T 361 
(2) INFORMATION FOR SEQ ID NO: 854: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 605 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 854:

ACnTTATACT AGAnAAAGnT ATTAAAGTAT ATCTGCTTTA CACCATTAAA AAGCGGCAAA 60 

TGCTATAGAT AATCTAAAGC ATCTGCCGTA TGAGnATTTA TTTATTTTTG ATTGTCATAA 120 

TCGTGTGGTT TTAAATTAAT TGTTTCTAGC TTTACAAATT TTGTTTTGTG AATGATTTTA 180 

TGAATAAAGT AAATCAACGC TAGAATGATT AAAGGTAAAA AGTTTTTAAA AGCATTTAAC 240 

CATTGATCTT TTAAAATATA TTCAACTGAG CCACCAAATA GCAAGAATAA TAGTGTAGTG 300 

ATGACAATGA TTGGTCCTAA TGGATAAAAA GGTGCTTTAT ATGGTAGGAC CTTATTAGGG 360 
TCTTGACCTT GTTTTTTAAT AGCTTGTCGC AATCGTATTT GTGACCAAAT GCTTGATCCC 420

CAAACAACTA TAATCATTGA ACCAATAATT TCAAGTAAAT TAAAAACGGC ATTTGAATTA 480

AAGTTTGCAT AAATAATAAC AATAACAACG ACTGCATAAG TAGTTAATAA TGCTCTTAAA 540

GGTAACTTAG TTGTCTTGTT TAATTTACTT AAAAATTGGG GTGCTTTTTT GTCTGAACTT 600 

AAGGA 605 
(2) INFORMATION FOR SEQ ID NO: 855:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 515 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






TTGAACAAGC ATTTTTAAAT AGTCAACAAC ATCAmGCAAT TAGTGAAGAG GCACAACTTT 60 

TAAATAATCC AGATGAATTA ATGGCATTTT TAAAGAATAA GCGTAAAAAT ATTTTAGAAA 120 

ATGCATACAT TATGCCGCAT AATATGAGAG AAATGTTACG AAGTTATTTG GAAAGTATGT 180 

CTCAAGACTT TAATGTTGGC GGATTTTTTA ATAAAAAGAA GAAAAAACTA CAAATTCAAC 240 

AACAGCGATT ATTAACAGCG ACAGATGCGT TACAAGAACA TGTTAATCAA CAAATTCGTC 300 


AACCAATGCG AGAAGATATG TCATTTGTTA CGCGTTTTAT CAATAAAAAA GmAGCTTCAG 360 

aTAltlAGTATT AAATCAGCAT TATGACGTTA AGCCAGAAAT GATTGAAGGT TTATATCAAC 420 

CACAAACATC AATCAGCAAT ACTTATGTAC TTACATTTTC t AGACGAAGTG GTTAAAGCCA 480

TTAAGAAATA TGTTGAACAA CAATCAACAC CAATT 515 
(2) INFORMATION FOR SEQ ID NO: 856: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1607 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 856:

GTCATTTTTA AAATTTCGCA TTCGCGTTTA ATTTTTTCTC TTTTTTCTTT TTCTTCTAGT 60
GACATACTTT CTTTAGGTGT TTCAACCAAT TCAGATGTAT CTACATCATC AATTTTAGTG 120
ATTTTGTCTA CATCTTTCTT TAAATCTTCT GGGACGTTCT CGAAACGCTT ATATTGGTCT 180
TTAGAGATAC TAGCAGCTAT TTCATTAGCT CCTAAAATTT CATCTATCAA GCCGAAAGAC 240
AAGGCTTCTT CTGCAGTAAG CCAAGTTTCT GCATCTAACA TCTGTTTTAA GTGTTCTTGA 300
TCTAAATCTT TTGCTTTATC TAAATAAGCT GAATTACTAA CAGCATCTGT TTTTTCAAGT 360
AAATCCGCTG TCTTTCTTAA TTCTTCTGCA TTACCTACAG TCATAACCCA TGAATTATGA 420
ATCATTAAAA AACTATTTTT GTGCATAAAA ATAGTGTCAC CACTCATAGC GATAACACTA 480
GCAATTGATG CCGCTAAGGC ATCGACATAG ATATTAATTT TTGCAGGATG CATTTTTAGC 540
ATATTGTATA TTGCATGTCC TTCAAATACA CTGCCTCCAG ATGAATTTAT ATGAACATCT 600
ATTTCACTGA TGTCTCCTAG TTCATCTAGT TTATTTTTGA AATCTGTAGC AGTTACATCA 660
CTTTCAAACC ATTTATCACT TACAATATCA CCATAAATGa ATATTTCACC TTTACTTTTT 720
GATTTTCTTT TCATTTGAAA ATACTTAGCT TTCATTGACA TTTTTATCAC CACCTTTCaA 780
AGATTTTCTT AATTCAAGTG GCGTGTCAAT TGGGTATAAA TCACCGCTTA ATTAGCGGCT 840
AACCACTACG AACTGCTTTA AAGTACACTT CTGCTTGTGT TGCACTATCA GCCCTTAAAT
AAGATTTAAC GTTAAATTTA AAATACCTAT TTTTTTCTCT GTCTGTTTTA GTAAGTAGTT 
TCCGATTAAn TTCTTCTTCA TACTGTTTGA CGATTGGCAA TAAGGTATGC TGCAAGTAAA 
ATCTGTTTAA CTCTTCATTT TTCGCGAAAT TTGTATTTGA TCTTGCATTT AAGAATATTG 
AGGGCAATTG AAAAACGTTA GCTACTCTTT CTCTTGTTAA ATTCTCGCTT GCCACTATAT 
CTTCAGAGAC ATATTTTTTA GGTAAAGGTT CGATTTCAAC ACCAGGCTCT TGGAATAATA 
TTCCACCGTT TTCTTCATAG TACTGTTTGA AATCTTCTAA CACTTGTTGC CTTTTTTCTG 
TACTTACATT GGAACCATAT TTAAGCATAA AAGAATCTGG TTTTTGCATT TCTGTAAGAT 
TAAAGGTTCT TACTGCATTG TCAAAATCAG TTGTATTCTT CAACACATCA ATCGGACTAA 
TACCTTGAAC CATATTAGAT GCCACGATGT GTTTAAAATG CAACATGTCC aTATTATGAA 
GAATCAATTT ATTTCCAGTT GCAGCATGAA TGGAATAATA AAGTTCACcG TGATTGGTTT 
TCAATTAACA TTTCAACAAC ATCTGGATTT AATAAGAAAA GCTTTGA 
(2) INFORMATION FOR SEQ ID NO: 857: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 346 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 857:
ATGCGCATTT AACCGCATTT AGAGATGCTG AAGATCAGTA TAAAGCTTTG TTAGAAATTA
CAACATTACC AGAAGGTAGA ATTTATGTTG CTCGCCAAGA TCAACTCATT GTGGGTTATG
TCACTTTCCA CTATCCTGAT GAAATTGAGC GCTGGTCTAC AGGTAAGCTT CCATATTTAA
TCGAATTGGG GGCAATTGAA GTCAGCATCA ATTTTAGGCA ATTACAACTT GCAGAAAAGC
TGATACAACT TAGCCTTCTA CACCAGAATT CGAGAATTAT ATCGGnATAA CTACAGGATA 
TTACCGGCAT TGGGGGTTTA AAAAATTCCC AGTnAGAGGT ATTnCC 
(2) INFORMATION FOR SEQ ID NO: 858:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1176 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
ATTACTATAA TGAATCTTGC GAGATATTTA GAAGTAATGA TACAAnATAA TGGACTTTAG 60
AATTTmAATT TACAATAAAT AGTTCTATGA TTAAATGTCA GTTTTATGAC ATTTATTTAT 120
TGAAAATACG AACGAATGaG CGATaTGATA ATATAGATAA GAATGATTTT AATTTAGGAG 180
GCCTTTATGG TGCATAATAA AAACAATACA ATTTTAAAAA TGATCAAAGG TGAAGAAACA 240
TCACATACAC CTGTTTGGTT TATGCGACAA GCTGGCCGTT CGCAACCAGA ATATCGAAAA 300
TTGAAAGAAA AATATTCACT ATTCGATATT ACACATCAGC CGGAGTTGTG CGCTTATGTA 360
ACACATTTAC CAGTTGATAA TTATCATACA GATGCAGCAA TTTTATACAA AGATATTATG 420
ACACCATTAA AGCCAATTGG TGTCGATGTA GAAATTAAAT CGGGTATTGG TCCAGTGATT 480
CATAATCCAA TCAAAACAAT TCAAGATGTT GAGAAACTTT CTCAAATAGA CCCCGAACGA 540
GATGTACCAT ATGTATTAGA TACAATTAAA CTTTTAACAG AAGAAAAGTT AAATGTGCCG 600
CTAATAGGAT TTACTGGGGC ACCATTTACA TTAGCGTCAT ATATGATTGA aGGCGGACCA 660
TCGAAAAATT ACAATTTTAC AAAAGCGATG ATGTATAGAG ATGAAGCAAC ATGGTTTGCT 720
TTAATGAATC ATTTAGTTGA TGTATCTGTT AAATATGTAA CAGCTCAAGT CGAAGCAGGT 780
GCCGAATTGA TTCAAATTTT CGATTCATGG GTAGGTGCAT TAAATGTCGA GGATTACAGA 840
CGTTACATTA AACCACATAT GATTCGATTA ATCAGTGAGG TTAAAGAAAA ACATGATGTG 900
CCGGTAATTT TATTCGGTGT AGGTGCCAGT CATTTAATCA ATGAATGGAA TGATTTACCG 960
ATTGATGTAT TAGGCCTAGA TTGGAGAACG TCTATTAATC AGGCTCAACA ATTAGGCGTT 1020
ACTAAAACAT TACAAGGGAA TTTAGATCCA TCAATTTTAT TAGCACCATG GAATGTCATT 1080
GAAGAGAGAT TGAAACCAAT ATTAGATcAA GGTATGGAGA ACGGTAAACA CATCTTTAAT 1140
TTAGGACACG GTGTATTCCC AGAAGTGCAC CAGAGA 1176



(2) INFORMATION FOR SEQ ID NO: 859: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 859:
ATTTGGTGCA TTGGTAnAAT TGACCTGAAA GnAAAAAGTG GTTTAGTTGC ACATTGAGTG 60

nAAGTGCGCA GATGAATTAT GTTGAAAACG TGAGGAAGAG CACCTTTGCT GTTGGTGATG 120
AAGTAGACGT AAAAGTATTA TCTATTGCTG ATGATGGAAA AATTAGTCTT TcAATTAAGA 180
AAAAAGCCGA AGATTTTGAA AAGAAATTAA GCAATTTCTT AAAAGATAGT GAAGATAAAT

TAACTTCAAT CAAACGTCAA ACAGAATCTA GACGCGGTGG CAAAGGTTCA AGACGTTAAT 

TAAAATAAAT AAAGACTGTT TCGATAAGGA ATATATTTAG AATGATGCGT ATCGAATAAT 

CGATTGCAGC GTTAGACAAT CTAAGACTGT TTCTTAAATA AGGAGCAGTC TCTTTTATTT 

GTAATGATAT AACTAAGACT TATACCATTT TTGAAAATTG TAAAAGTGAG GTGATGTTAT 

GCAGTTAAAT AGTAATGGTT GGCATGTTGA TGACCATATT GTTGTCGCTG TTTCTACAGG 
TATTGATAGT ATGTGTTTnn T 
(2) INFORMATION FOR SEQ ID NO: 860: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 544 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 860:
TATATTAGGT ACTTGTGTTG GTATTGTGCC AGCTTTGCTG TCTACTATAA TTTCTAAAAG 
GTTTGAACAT ATAAAAGGGA AAGTGCTAGG TGTATTTAAT TTTGTGAGAT ATATTGGAAT 
GACTGTCGGT GCATTATTAA TTGGTATCAT TTCTCAGCCG TTGGTAGCCT TTTACTTCAC 
AACTATAACT ATCATGTTAA TAGTAATATT TCTTTATATA AAGATAGTTG ACTTTCAGCT 
AAAGTATGCC AAATAAACTT AAAAAAGCAG TGAATCATTT CAATAGATGG AATGATTCAC 
TGCTTTTTGT TTATAGAAAT TTAAAATTCA TGCCTTTATG CTAACAACAA TATTATTTTA 
CTCTCTTACG TTTGCCAATG TAACcGTATA GCAAAGTAAA TGACTGCGAT AATGATGACA 
ACATACATAt ACGTGAATAG GAAGGAAGAC CAGTCATCAA CATACCAAAG CTATCGCTCA 
AAGTGCGnCT GGTAAATTAA ACCGAATTCC AnGATTGTAG GAACCCnATT AATGGAAATA 
ACGG 

(2) INFORMATION FOR SEQ ID NO: 861: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 651 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 861: 
AATTTAAAAT ACATTTATGC TTTACCCATT CGAAAATACC TTAATCATTT TCATTTATAG
CATTGTTTGA TTGAAGGATA AAAAGTTGTT GTTTACAATA AAAATAATGA GTATCTGAAA 
TGAGGGATTC AnTATGACAC ATGTGGAAGT AGTAGCGACT ATCGCGCCAC AATTATCTAT 
CGAAGAAACT TTAATTCAnA AnATTAATCA TCGTATTGAT GCAATAGACG TATTAGAATT 
ACGAATTGAT CAAATTGAAA ATGTCACAGT TGaTCAAGTG GCAGAAmTGA TTACAAAGCT 
GaAGGTTATG CAAGATTCAT TCAAATTATT AGTTACGTAT CGTACAAAGT TACAAGGTGG 
CTATGGGCAA TTTACAAATG ACTCGTATCT TAATTTAATA TCAGACTTAG CAAATATCAA 
TGGCATAGAT ATGATTGATA TAGAATGGCA AGCAGATATT GACATTGAAA AACATCAACG 
AATCATTACA CATTTGCAAC AGTATAATAA AGAGGTGGTT ATATCACATC ATAATtTCGA 
AAGTACGCCT CCATTAGATG AATTGCaATT TATATTTTTn nAAATGCAAA A 
(2) INFORMATION FOR SEQ ID NO: 862: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 862:
TGTGAGTGGT AATGACACAC AAGCAGATTT AATAGGTTAT AGTTTTAAGT TTGACGGTGC
GATTTCAAGG CAAGAGGCTT CTAAAGATGT ACATGCAGTT ATATTATCGA ATAAAACACT 
ATATTTATTA GATGTATTAC AAAAATTACC GATAGATAAA ATGAATTCAT TGAATATCCA 
TCAAGAAATT ATTGATGAAA TGTCAGATAT CATTTTAATG TTATATCGTG AATATGCAGG 
TATGTTTTTT AAAAGTCAGA AACTAATCAA CCAATTAAAA AGATTGGAmC AATAACaTAA 
AATAATAAAa GGTATTCAAG TAGCCACATA GATGTGTTTA TTTGAATACC TTTTkGAATA 
GaaAAGaGAT ACTGGcAATT TTACTAACCA GtATCTCTTT T 
(2) INFORMATION FOR SEQ ID NO: 863: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 308 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 863:






ACAAAGTAAT CAATATGTAA TATAAAATAC ACTGGTACTC AATATATAAT GATGATAAAA 120 

TTAATTTTAA TTAGATAGAG TTGCTTTGTG TTTTTAACGC nGATGCTACT ACTTATCTTA 180 

ACAGTTGATT AAGTAAATCA TTTAACAGCG AGATTATnCA ACCAGGGGGA TGACTTAATG 240 

AATTTATTCA GACACCAAAA TTTAGTATCA GAAATTTAAT GTCGGTATTT TTTCAGCTTT 300 

AATGCCAC 308 




(2) INFORMATION FOR SEQ ID NO: 864: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1827 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 864:

TTCTACATCT TAAACGATGA CCTAACTATG ACGTTAATCT GGAAAGATGG GGAGTTGCTA 60
GTATGATGTT GAAATTTAAA GCTTGGGATA AAGATAAAAA AGTTATGAGT ATTATTGACG 120
AAATCGATTT TAATAGTGGG TACATTTTGA TTTCAACAGG TTATAAAAGT TTCAATGAAG 180
TAAAACTATT ACAATACACA GGATTTAAAG ATGTGCACGG TGTGGAGATT TATGAAGGGG 240
ATATTGTTCA AGATTGTTAT TCGAGAGAAG TAAGTTTTAT CGAGTTTAAA GAAGGAGCCT 300
TTTATATAAC TTTTAGCAAT GTAACTGAAT TACTAAGTGA AAATGACGAT ATTATTGAAA 360
TTGTTGGAAA TATTTTTGAA AATGAGATGC TATTGGAGGT TATGAGATGA CGTTCACCTT 420
ATCAGATGAA CAATATAAAA ATCTTTGTAC TAACTCTAAC AAGTTATTAG ATAAACTTCA 480
CAAAGCATTA AAAGATCGTG AAGAGTACAA GAAGCAACGA GATGAGCTTA TTGGGGATAT 540
AGCGAAGTTA CGAGATTGTA ACAAAGAACT GGAGAAGAAA GCAAGCGCAT GGGATAGGTA 600
TTGCAAGAGC GTTGAAAAAG ATTTAATAAA CGAATTCGGT AACGATGATG AAAGAGTTAA 660
ATTCGGAATG GAATTAAACA ATAAAATTTT TATGGAGGAT GACACAAATG AATAATCGCG 720
AAAAAATCGA ACAGTCCGTT ATTAGTGCTA GTGCGTATAA CGGTAATGAC ACAGAGGGGT 780
TGCTAAAAGA GATTGAGGAC GTGTATAAGA AAGCGCAAgC GTTTGATGAA ATACTTGAGG 840
GAATGACAAA TGCTATTCAA CATTCAGTTA AAGAAGGTAT TGAACTTGAT GAAGCAGTAG 900
GGATTATGGC AGGTCAAGTT GTCTATAAAT ATGAGGAGGA ATAGGAAAAT GACTAACACA 960
TTACAAGTAA AACTATTATC aAAAAATGCT AGAATGCCCG AACGAAATCA TAAGACGGAT 1020
GCAGGTTATG ACATATTCTC AGCTGAAACT GTCGTACTCG AACCACAAGA AAAAGCAGTG 1080
AGTGGTGTAA GTAGTAAAAC GTATTTAGTG ATTGAAACAG GCAAGATAGA CGCGGGATAT 1200
CATGGCAATT TAGGGATTAA TATCAAGAAT GATGAAGAAC GTGATGGAAT ACCCTTTTTA 1260 


TATGATGATA TAGACGCTGA ATTAGAAGAT GGATTAATAA GCATTTTAGA TATAAAAGGT 1320 
AACTATGTAC AAGATGGAAG AGGCATAAGA AGAGTTTACC AAATCAACAA AGGCGATAAA 1380

CTAGCTCAAT TGGTTATCGT GCCTATATGG ACACCGGAAC TAAAGCAAGT GGAGGAATTC 1440

GAAAGTGTTT CAGAACGTGG AGCAAAAGGC TTCGGAAGTA GCGGAGTGTA AAGACATCTT 1500 
AGATCGAGTt AAGGAGGTTT TGGGGAAGTG ACGCAATACT TAGTCACAAC ATTCAAAGAT 1560 

TCAACAGGAC GACCACATGA ACATATTACT GTGGCTAGAG ATAATCAGAC GTTTACAGTT 1620

ATTGAGGCAG AGAGTAAAGA AGAAGCTGAG CGCAAATACG AGGCACAAGT TAAGATAAGG 1680 
AGAGATGGAG ATGCCAAAGA AAACGGTAAC GATTGATGTA GATGAAAACT TATTAGTAGT 1740 


AGCTAGTAAT GAAATATCAG AACTATTATA TGAaTATGAC AGTGAGTtAA TGTCAGCTGG 1800 
ATGGAAGATG GCGATAATAG GGATATC 1827 
(2) INFORMATION FOR SEQ ID NO: 865: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1379 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 865:
TCAATTGCAT CATCATATGA AATTCTaGGG AATGGTGTCG CAACTTTTTC AAGTTTTGAT 60

GTATCACGCT CTAAAATTTT CAACTCTAGT TTACAATTTT CTAAAACTGA TTTTACAACA 120 

TGTGTTACAT ATTGTTCTTG AATTTCTAAA CTTTCAGCAT GATTTGTGAA AGCCATTTCT 180 


CCTTCAATCA TCCAGAACTC GATCAAGTGT CTACGTGTTT yTGATTTTTC AGCTCTGAAA 240 

GTTGGACCAA ATGAAAATAC TTTTCCGTGT GCCATTGCTG CAGCTTCTAA GTATAACTGA 300 

CCACTTTGAG ATAAAAACGC ATCTTGATCA AAGTATTTAG TATGGAATAA TTCACTTGTA 360 


CCTTCTGGCG CACTTGCTGT CAAAATTGGT GGATCAACCT TTGTAAATCC ATCTTTGTTG 420 

AAAAATTCAT ACGTTGCACG AATAACTTCA TTTCTAATTT TCATTACAGC ATGTTGTTTT 480 

TTAGAACGTA ACCATAAATG ACGGTGATCC ATTAAGAATT CTGTACCATG ATTTTTAGGT 540

GTAATCGGAT AGTCATGCGC TTCTGAAATA ACTTCAATTG ATTTCACTTG CATTTCGTAT 600 

CCTAAGTCAG AACGATTATC TTCTGTAATT GTGCCTGTAA CGTATAGAGA TGATTCTTGA 660 




TGCATAAAGC CTGTTCCATC ACGTAATTGT AAAAAGGCGA TTTTACCACT TGAAcGTTTA 780 

TTTGTTAACC AAGCACCAAT TGTAACGTCT TGGTTTAAAT GATCTTTCGC TTGTTTAATC 840 

GTTGTTTTCA TAACCATTCT CCTATTTATT TTTTCGkTAT ACAATACTCA TTCATTTTAA 900 

CAAAATCCGC TTTCAAGTTC TAGAACTAGA CTAAAAGATA ACGtGTAAAT GGTAATGATT 960 

TACGCACAGA TTTGAACATT AATTTATATT AAGACAATAC AATCATCAGT TTCAGACAAA 1020 

ATATATAATA AAGCCTCAAT ATTATTAGTA TGAAGGTCGT TTTTTGTnCT AACATTCAAG 1080 

TTTAATTATA AAACTCAAAT TTTGACGACC AAACAAAAAT TCTTGTAACA GAACATAGCA 1140

CATGTCATGT CACAAGAATT TTACTTTGTA TTTCACTTTT TCTTTTTTTG AATTTGTCTT 1200 

AATAATTTTC CAAACTGTTG AATGTCGCCT TTTTTCTGAC GATAATTTTC AAGTGTTTGT 1260 

TCAAAAAAGT TTTTATAATT ACTGTTTATG AGTCGATCAT CAAATGAAAC TATTATGCCG 1320 

CGATCATTTT CATTTCTAAT TAATCTTCCA AGTCCTTnGT CTAnAACGTG TAACTGCAT 1379

(2) INFORMATION FOR SEQ ID NO: 866: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 450 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 866:

TTACAACGAA TAGTTCACCG TTCGAGTCTA ATCTATGTAA TGCTTGCTCG AATATACGAT 60

GCACGGTTTC TTTCCCTGCT CTTATTGGTG GATTGGTTAA AATAAAATCA AAACTTTTGT 120

CTTCCACAGC AGACAAAGCA TCACTTTCCT TTACGATCAC ATTATCAATA CCATTTAATT 180 

TTTTGTTTTT TTCAACTAAG GCTAGCGCTC TGTGATTAAC ATCTAGCATT GTAATTGAAT 240 

GATGTGGTGA TACTTTAGCA ATCATCAAAC CAATTGGTCC GTAACCACAA CCAACATCGG 300 

CAATTCGCTT ACTTGGACCA GGTGGATGCG CTTTTAAAAA AGTTTGAACA AGAACATCTG 360 

aACCATAATC TACTTTAtCt TTCGAAAACA CTCCtTATCA GTTATTAAAT CAATTTTATG 420 

AnGGTTnTAA TTGGATATTG AGTACGTTGT 450
(2) INFORMATION FOR SEQ ID NO: 867:






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 867:

AATGTTAACA ATATAGCACC AATTAACCCC GAdATGATAA TAACGTGTAA TGTTTTATTT 60
CCTATTAATT GTCTCGCAAT ATGAGGTGCA ATTAATCCTA AAAAGCTAAT ACCACCGACA 120
ACTGAAATTG CGGATCCTGC TAATATTACT GCTAAAATTA ACAATAGCAT TTTAATAGTT 180
TTAACTTTTA AACCGAGTGC GGTTGCAACA GCATCACCTA GATTCAATAC ATCTAATTGA 240
TAACTCCATA AAATGATGAT GGATCGTTAT TAAAACCAGG GAATATAGAT AATATCCGCA 300
TATCAGGGCA TATAGCTACC TGTCACCAAC AGGTTTnTGC TCCAnGGTTC TnTAATAAGA 360
CGCACACGCG ACAATGCGCT TGCTACCATT AGGCAGCTTG 400




(2) INFORMATION FOR SEQ ID NO: 868: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1132 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 868:

AATCCnTnAA GTGCCATAAC ATCATCTCCT AACATCTTTA TTATACATCA ACATTTTATA 60
AAAATAACnT CTTATGATAA AAATGAAAAT ACTAATTTTA ATGAGAGCGT CTTAAGCCGC 120
AATTGATAAA ACATATGCTA CAATATTTTT AAACACTATA CAGGAGGTAC TCTATGACAA 180
AATATACATT TAAACCTAAA GATTTCAAAG CGTTCAACGT AGAAGGCTTA GACGCACGAA 240
TGGAAGCTTT AAACGAATAC ATACGaCCAC AACTCCGTGA ATTAGGAGAA TATTTTAGTG 300
ACTTCTTTAC AAGTCAAACT GGTGAAACAT TTTATCCTCA CGTAGCAAAG CATGCTAGAA 360
GAAGTGTGAA TCCTCCTAAA GATACATGGG TTGCTTTTGC AACAAACAAA AGAGGCTATA 420
AAATGTTACC TCATTTCCAA ATTGGTATGT TTGAAGATCA ACTGTTTGTT ATGTTTGGAA 480
TCATGCATGA AGCAAAAGAT AAAGCAACaC GTGCAAAAGT TTTTGAAAGA AAATTTAAAG 540
CTATTCAACA ATTACCTGAT GATTATCGTG TTTGCTTAGA TCATATGAAA CCTGATAAAC 600
CATTTATTAA AGATTTAACG GATGATGATT TAATAGAAGC GATACAAAGA GCCATCAATG 660
TGAAAAAAGG TGAATTCTTT ATAGCGCGTG CAATCACACC ACAAGATAAA AGATTAAAAA 720
GTGACAAAGC ATTTATTGCA TTTTTAGAAG AAACCTTCGA TCAGTTCTTA CCATTTTATT 780
CTGCATAAAT AACTTTGTTT AAATAATAGA GCACTTAATC ACATCCaTGA TTTCGTGCtC 840
TTtTTTCTTA ATATTAAATC GAACGTtCAA CATAaTAATT CATACTTTTA AAAAAATTAA 900
GGTGAGTCAA AATGAATAAT AAACGACATT CAACAAATGA ACAATTAAGT TTAGACGAAA 1020

TAAACAATAC AATTAAATTC GATCATCGCA GTTCAAATAA ACAGAAATTT TTATCATTTC 1080 

TTGGACCTGG GTTATTAGTC GCTGTTGGTT ACATGGATCC CGGAAACTGG AT 1132

(2) INFORMATION FOR SEQ ID NO: 869:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 418 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 869:



CTCAATGTAT TGACTGTGAT TTGGAGTGTA TAATGCTACT GTTTTGTCTA TAGCCATCGC 60 

ATCAAATATT ATATTTGAGT AATCAGTGAT AACGACATCA GACATTAGAA TTAAGTCTTG 120

AGCACTAAGA TACTTTGGTG CAACAAGCGC CTCTTCTGGT AAGATCGCTT CATCAACACC 180 

CTGAACCACA ACATGATAGG CTTTGAATAA AGCATCTGAT AGTGGTAATT GTTGCGCACT 240 


CACTAATCCA ATAGGTGCGT ATAATAAAAC TGGTTTTTGA TCATTTATTT TAAACGATTT 300 

CTTATACTGT TGTTGAGTAG TACTTTCGTT ACTCTTTTGT AGTAAATATT GGTGTTTAGG 360 

ATTACCATAA GCCAATACAT TGCTTGGTGA CTAGGAAAAG nCGTTTGATA AAAGGGCT 418 



(2) INFORMATION FOR SEQ ID NO: 870: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 870: 



GAGTnTGTTA CAGTAATTGA TTTTATTGGT AATTnTAAGA CAAATTATTT AATTCCGATT 60
GCGCTTTCTG GGGATCAATC GCAAnATAAA GATAATTATA AGAAGTTTTT AACGAATAAC 120
GATTCGATTA ATGGAGTATC TACAATTAAT TTTGAAGAAG TTGCTAAAAA ACAGATTTAC 180
AATTCATTGG ATGCAGTATC ATTAAATCAA AATAAATTAA TATTAAAAGC TTATGAAGAA 240
GTTGAAAATA GATTGGGACA CATGCCGTTA CTAATGGATT TCATACAACA ACATTCTATA 300
GATCCAAGCG TTATATTTTC TAAATTTAGT AATTATTACG AGTTCTTAGT GAGATATAAA 360
AAAATAGATA CATTATTGAC GGAAAATGAA TCTAAAAATC TGGTTTTCTT TTCAAGGCAA 420
TTAACATATG ATGAATTAAA AAATAAAATG TTGAACGAAG TTAAGGATAT AACAGAAGAT 540
GATATAGATA CTTCATTAAG AATTTTAGAT TTTTCATTTT ACAACGC 587




(2) INFORMATION FOR SEQ ID NO: 871: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 971 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 871:

CTACTAAACC AGGTAATCGG TAGAATCCAA GCAGGAATAA ATAAATTAAT GCnACACCAA 60
TAAACGATGC AAACACAGTn TTATCTAATG CATCTTGACC AAATTGGGCA CCTACTGAGT 120
TTGAATAAAT TTCTTTCAAG TCAACTGGTA AAGAACCTGC ATTTAACAAT kCGGCGATTT 180
GTTTTGCTTT TTTAACGCCT TCTTGTCCTT TAAATCCACC CGAGATTTCT ACGCTATCAG 240
AATTGATTGG TTGATCAACA CTTGCTGCAG AAATAAATTT AGGGTTTTTC TTTTGTGCTT 300
CTTTTTTATA GCTGTCACCT TTTTTGAAAT CTAACCAAAC AACCATGACA TTATCACGTT 360
TCTTAGAGAT TTCTTCCGTT ACTTTTTTAA ATTTGriTlT GTCTTTTACT TTAAAAGTAA 420
CTGTAGGcTG GTTTGTTTCC tGtTTAAATT CTTGtTTGGC AGATCCCTGT TTaATATCAG 480
AACCGCTTAA TTTtACTTTA TCTTCTGCAT CGCGAATTGT TAAATTAGCT TGAGAAGATA 540
AAATTTTACG TGCTTCATTC TGGTCTGTTA CACCAGCAAG TTGTACTCTA ATTCTATTAG 600
GTTCTTCAAC TTGAATTTTA GGTTCCGAAA CACCTAAAAC GTTAACACGA TTTTCTAATG 660
TTTGCGCTGT TGATTGTAAG GCTTTTTTAT CTATTTTGTC GCCTTTATTT AAAGGATCGA 720
CTTGATAAAG CACCTCAAAT CCACCTTGCA AATCAAGTCC TAAATTGmCA TTCTTTATAA 780
CACTTTTATA AGTTGcAGCC ATTCCGGCAA ACAACAATAC GACTAAAAGC AAGAACGCAA 840
TTATTCTACT ACTTTTCTTC ACATGAACAC CTCATTATTT ACGTATGTAT TTAGAATACT 900
TGAATACTAT TTTATAACGC AAGTGAAATC TTTCTTACAA AATTTATTAG CCTTATACAT 960
ATTAACATAC T 971



(2) INFORMATION FOR SEQ ID NO: 872: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 636 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 872:

CTGGTTAACA ACATCTGGTT CGGCATCTCC TTAGCTTCCA CTAAATCTTG GATGTCTTGG 60
nnATCTTGCG CAAGTnGTGC TTTGGCTTGT TCAATTtCTy CTTTAGTCAT CGCATTGTTA 120
ATGCCGTTAT GACCTTGTTG AAGTATTTGA TTAATACGAT CTTTAAGTGC TTGTTTTTCC 180
TTATCTGTTA GATTTGGATT TCGATCAATT TCGTCAATTA ATGCTTGaAC TTGcTTATCA 240
ACGTCTTTAT TGGCATCAAT TTTTGCTTTT GGTATTTCAT TGGCATGCAC TTGTTCAATC 300
GCGTGGTTGC CTGCTGTTTG AACTTGAGAT ACAGCCTGAT TACTTGTTGC TTTATTAATG 360
TTGTTGATGA TGCTGTTTGC CAATTCTTCT GCTTTATTTT TCGCAATAAG CTTGTCTTGA 420
TCCGTCGCAT TTGAAGCTTC GATTTCTTTT AGCTTATTAG CTAAAGCTTG ATTAATAGAT 480
TGAATTGCCT TGTCTTTAGC ATCTTGTAGT CGTTGATCAC CATTAAGATT ATGGATTGCA 540
TCATTGACtG CTTGGATTGC GCCATTGATA TCATTCACAT tTGTGTaTCA CTATTTAGCA 600
ATGTATTTGC TAGACGTTgG CATCATCGAA GTTTGT 636



(2) INFORMATION FOR SEQ ID NO: 873: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 659 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 873:

CTCTGAATGC ATTCATTAAA CGCTGGGTCT CTTTATATGA AACGTATTTG TCATTTTTAG 60
AACTCAATCC GTAAAAATTG TCAACTTTCT TTTTAATATT ATCGTAATCA ATGGTTACAT 120
TACTTAAATC AATATCTAAA TCTATATTTT CTGCATCTTC TTTAAAGCCC GCTATACTGA 180
AAAAGCCTTC AATCGGCTGA TCAATCATTT CAATATATTT TAAAGCTGTG ATTGAACCTA 240
AACCATGTGT TACAAAATAT GTATCCTTTT TGCGTACATT AATTTGTTTC GTCATAGCTT 300
CAATCCACTG ATCCACTGTC TTCGCTtCAG GGGATTCAAA ATTAAATAAT GTTACGTCAT 360
ATCCTTCTAA AGTTAAGTTA TGCTCCAACC ACTGATACCA ATGATTTCTA CTATTTCCAT 420
GCATAGAATG TACAATAtTA CATCTGTCAT CTCATTCTCT CCTTTCAACT TACTACTTCT 480
TTTCTATTTT TAAAAAAATG ACTGATTACC TATAATTGTA AAATAAAAAC ACCTTAATTA 540
GAAATGTTAT ATCGCAAAGT GACATTTCTA ATTAAAGTGT ATTGTCATCA TTTCAATATC 600
ATTCAAAAAC AGCTAAACCT TTGTCTCTGC TTCAATTTCA CAAAAATAAT TCCCGCTGA 659



55 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 874: 

AAAATAAAAT CCATTTTAGT ATTCGTAGAA AATGAGAGCC CCCTTTACCA CAATATGTAA 60 

ATGTATATTG TGAAAAGGGG GCTTTTATTT ATGACATAGT TGTTCAACTC ATACATTCAG 120 

TTGATAATGC AGTGCGTTAT GCAGCTATCC ACTTCATATG TATGGCGCAT ACTTGATATA 180 

TGAATATCAT CAAGATTTTA AAAGTATCGC TAATTTTAAA GTCTTAAAAC GCAGATTAAT 240 

GGTCATGATG TGTTAnTGCG TTTTGGTTTT TAGAGCCACG TAATTCCCAn GGACATATAG 300 

GAGGAACCAT TCCAC 315 



(2) INFORMATION FOR SEQ ID NO: 875:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 482 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 875:

TACACGACCT TGTAACTCAG CTGCTTTATA AGCTAGGGCG ATGTTATCAA AGTTGGATGT 60
ACTTTGAGCT TGGCATTTCG TTAATACTTT TAACGAAGGA TTTAATATAT GTCTGATACG 120
TATATATTGA TTATATTCAA TTCTTaATTt GGATAAGATT GCTGATAATT TGAAGCAATC 180
GGAGCTAAAT GGATCGAAAA ATGAATAAAT TTCGATTTTA CTTACAGGTG ATAGaTTGAT 240
ATCTTCACGA CTCTTATTTT CCATTAwTCG TAATTCTCCA GCCATGTTTA TTCACCTACA 300
ATTAATTTTA GGAATTCACC ATATGaTTAG CAGTTAATCT TaAGCGCTCA AATAAATAAT 360
CTCCAACACC TTGTGGAAAC caGCgCGaTT AATTGCTGTC TGCATATTTT CTAGCCATGC 420
ATCTCTTTCA AATTCAGTGA TTGTAAAATC TATATGTCTT TTTCTTA6CA TAGGATGTCC 480
GT 482



(2) INFORMATION FOR SEQ ID NO: 876:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 670 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 876: 



ATGCAATTGA ATTCTTAAAA GCAGAAGGCT TTGATGATAT TGAATGGGGC GAAGATTTTG 60
GTGCGCCACA TGAAACAGCC ATTGCTAATC ATTATGATTT ACCGGTGTTT ATTACTAATT 120
ATCCAACTAA AATTAAGCCT TTCTATATGC AACCAAATCC TGAAAATGAA GAAACTGTAT 180
TATGTGCAGA CTTAATTGCA CCTGAAGGAT ACGGTGAAAT TATTGGTGGA TCTGAACGTG 240
TGGATGACTT AGAATTGTTA GAACAACGCG TTAAAGAACA TGGATTAGAC GAAGAAGCAT 300
ATAGTTACTA CTTAGACTTA CGTCGTTATG GTAGTGTGCC ACACTGTGGA TTTGGTTTAG 360
GTTTAGAGCG TACAGTAGCA TGGATTTCTG GTGTTGAACA CGTTCGTGAA ACAGCGCCAT 420
TCCCAAGATT ATTAAACCGT TTATATCCAT AAGTTTTAGG GTCGTTACTT AAAATATATA 480
TTATGATAAT AAAAAGAATC GTCCAGCCTA TATTAATAGC TGGGCGATTT ATTCTTGGAA 540
AATGAAGATT GATAACGGCT TATATTAATC ATGAACAAAC TTTTGTTTAG TAACTATATT 600
TGGAAAATAA ATAGTTCATC TGATTATCCA TAAAGTTCAT TGTTAAAATC TAGTGTTAAA 660
AAATACTGTT 670



(2) INFORMATION FOR SEQ ID NO: 877:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1258 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 877:

TGCAAAGTAA ACTGGATGGC TnTCTTGCCG CCAAGGATCT GATGGCGCAG GGGATCAAGA 60
TCTGATCAAG AGACAGATCC TAACCTCTCA TTTCCGATAA GTTTAGTATG TATTATAGGT 120
GATGATTnTC TTGTTTTCAA CCTTATACAC AATATTTTTG AnATTTATAT TATATTCATA 180
TAAGGAAGGC GATTAATTAC GATTCATCAT TATATCTCTA CGTGTTAATT TATTAACCtA 240
TACTATATTT ATCaACaCAA CAGTTTTTCC TTTCATTTTC AAACATAACA GAAAAAGCCT 300
GAGACATACT TAGTCCCAGA CTGTTCTATA ATTAATTAAA GATATAACAA GGTTAATCAT 360
TCAAAGCTAA CATCACATTT ATTCACTGTA ATTTTTCATA AAGAATAATA ATGACTGTAA 420
TTCTATACCT AAGTCAATTT GATGTACTTG CACATCTGAA GGCGTATTAA TTCTACCAGG 480
AGTGAAGTTT AAAATACCTT TCACACCAGC TTGGACGAGT TCATCTGCAA CTTTCTGTGC 540
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10 



TAATTCATCG TTATCTTTAA CAATAACGTT CCCTATTTTC TGGCCAATAA CATCTTCTTT 660 

TACGTCAAAC GCTTCTGTAA TCGTCATATC GTCATGTATT GAAAAGTTAT ATGTGAGCAA 720 

AGCTTTCCCT AGGTTCCCAA CTCCGACAAT TGCGATTTTG ATCATGTCAC TCTCGCTTAG 780 

TTCAGATTTA AAGAAATCCA ATAAACTATC TATATTATAT CCGTaCCCTT TTTtACCTAA 840

TTCGCCAAAA TATGAAAAGT CACGACGAAT TGTtGCCGAG TCAATTTGTa ACGCATCGCT 900

AATCGCTTTT GAATTTACAC GATCTATACC TTTAGATTTT AATGAACTGA CAAATCTATA 960

ATATAACGGC AAACGTTTTA AAGTTGCTCG AGGAATTTTA ACTTGGTCAC TCATTCGCTA 1020

TTTCCTCCTT CGTGTTTGAA TGAATTATAT CTATGTATTC AAACGAGTTA GAGCGTAATT 1080

GTTAGATTAT TAAAArACAA ATAATTGaTT ACAATGAATA CCTAAATATT ATACATTAwT 1140

GAATTTTAAA AATAAATATC TTAGTTTCAA GACGGCGTTT TATACTGTAA AATGGTTATT 1200

AATGTAATAA TTACGCTTGT TTCATAAGTT AATAAAATTT GAGAAGACGA AGGTGAAG 1258

(2) INFORMATION FOR SEQ ID NO: 878:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 546 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 878:

AAACAATCCC CnCCAAAGTA ATTCCACCAC TAATTGTAAA TAACTTTGGA AAGGATGTTG 60
TTTGTnATTC CCATGGACAC ATGCCATCTn ATTTAAGTTT TATACTCTAT TTTTCTGATG 120
TCTTGATTAA TTGCAAGCGG TCTTGCGAGC CTTGTTCAAG CTCATATTCT TTATCTACTT 180
GCTTACCGTT TTCryTTATT AATCTTACGG TTGCGGTAAT CTTATCACCA TGTTTTTCAG 240
CATCTATAAT TTGTGGTGAA CTAATCATCA TTAAACTTCC TTTAGATACA CGCTTCTTTA 300
CATCATCATA AAAAGATGAT CCTTTTTTTA TATATGATGA TACAAAATCA AAATCCGACT 360
GATTAAACGC AGCATTATTC GCTAAAGAAT ATCCAGCAAA GAATTCTATC AATTTGTTCT 420
TCAAGCTGTT TTCTTCTTTT TCTTTCTTTT CAACATAGTC TTCGATATCT TCACTGTCAA 480
AATTCAAAGT TATCTCTGTA TTGTATTTTA AATCGCTAAG CTTTAATCGT CTTCGTCTGT 540
GATGTA 546

(2) INFORMATION FOR SEQ ID NO: 879:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 870 base pairs
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 879:

TTTATGAATT TTTTATGTAA CATTAAATAT TCTTCTGGCA ATCTTCCtAC TATkTCTkTT 60
ATATATCCTT CTTCTCCTTC TTTAAATACT AAACCTGTGG GAGTTAGTGT GTTATACCTA 120
ATTTCTATCT CATTCAAATG ATCATTCAAA AGGTCAAAAA ATTCTCTATT AAAATAACCC 180
CATTTCCCAT TTTGTAATAA ATATTTCTTT CCATCTTCCA TTTCAACTTC TGCATGTAGT 240
ATCGTTTTTA GTaGCATATT ATTTGAATGC CCCAAATTAT CkATCACkTc AATTCTGACA 300
TCATTTATAG AATTAACGTC ATATTTTAAT AAGTAATCTC CTATTTCAGT GATGTAATCA 360
ACTTCAGGAT CAGTCGCATC AAATGTCTCT AATGTATTAT TTTTAAAAGA ATTTATATAT 420
ATATTGACGT CGAGCATATC ATCTAACAAG AGTATCATAT TACTTAGTTC TAAAAATCTT 480
GATATATCTA TAGATATATT TTCTGTAGTT GAAGAATTCG ATAGTTTTTT TAGCAATAAA 540
GTATCTAATA CTTCTATTTT ATTCAAGTCT TTTAAAGTTA CTATTCTAGG AAATTCACTA 600
ATTTTTTGAG GAAGATTAAT AATAGCGTTT ATTTCTTTGA TTATCACACT AATTTTATCT 660
ATGaATTGCT GcTTTCTATT CGGtACACGC AATGaAATAC TTGTACCACA AGTCCATTGt 720
TTTTCCaAAA ATTTGAGGaT TCTGTGGaTG tCCTTGGaCT GGaTATATAA GATTCTGAAG 780
GTCTAACGTA ATCTACACTA TTCCTTCTAT AATTAACAAT CTCTTTAAGC CTGTTTTGTT 840
GAAAAAAATT AACATTTTTA nTAACTAnGG 870



(2) INFORMATION FOR SEQ ID NO: 880:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 880:

GGTTTACATC TTTATCAACT GTAATTTCAT TGACATCTTC ATTCATATTT AAAACACCAT 60
TAAATGTCCC TGAATATTCA CTTGTTTGTT TAACTGTGGC AGTGACTTGT CGTTACCATA 120
TGTCATCATA TATTGTGCAA ATGTTAAAGT CCCCATTGAG TTGACCGACA AAGTTGAATT 180
TATCGAATTG TATTCAGATT GTAAACTTAG CAGACATTTT TAACCACGAG CATTCTTATC 240
CAAATAGnCC TTGTCTGTTA TTTTCAAGTC AATTTTCACA ATAGGATCAC TGCATCTTTT 300
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ACCCTCTTTT TCTGCTnCTC ACATAACTTT CAGATGGACT 400 

(2) INFORMATION FOR SEQ ID NO: 881:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 333 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 881:

AAGATATTGC GAAAAGAAGT GACAGTAACA GAGGAATTTC AAAATTTATT CAACCATATA 60
TTGAAAATTG ATGAAATGGT GCATGCAAAT GAAGCGCGTA TTGCGTATGA AGCAGACATG 120
CGAAAACCTT TATATAGTAA ACGCATTTAT CAAAATTTAA CATTAGACTC TATTGTTTTT 180
AGAAATACAT TGAGATATAC AGCGATTATG ATGATAGCGG ATATTTATTG CGTTAATGTT 240
TGATTTTGAA AAAGCATCCT GGATCCCGTT ATCTGCACAT CCAnTATTCC TAGGGCCATC 300
AnCCTATACC ATGCCATCCG AGAGAGGTAT GGn 333



(2) INFORMATION FOR SEQ ID NO: 882:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 617 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 882:

ATTGAAAGAA AAACTGACGT ACTTATCCGA TGATAAAATG AAAGAAGTAG ATAATGCACT 60
AATGATTAGT TTAGGGCTGA ATGCAGTAGC TCACCAGAAA AATTAGGCGT CTATTATATG 120
TATTTTTCAG AGATAAATAA AATATTGATA TAAAAGACAA TAACTTTATA ATAATTATAA 180
CTATTTCTAA ATTCTGTACG AAGAATTTTC TTATAAACAA AGATTTTAGC AAATACCAGT 240
TATGATATTC ATATTTTTTA TTATAAAAGG ATGTCTTAAG TTTTTTAGGC TTTAGGTATT 300
CCATCCTAAA GTTTTTTTAG CTTAAAAGTA TCATCTACAG CAAAATTGCA AACGACAAAA 360
TTGATAAGTG CAATTAAATA AATGTTAGTA AGTGAATCAT AATTATCCTT GCTTAAGCAT 420
TTGCTTTGTA AGGGAAGTGA GGAGGCAACT AATCGTGGAA GAATTTAAGC AACATTATAA 480
GGGTTTAATT GATGAAAGTT TAACGTGCCA AGATAAAGTA GAATTGATAA AAAAGTGTGA 540
GAAATACACT GACGAAGTGA TTCGTAAGGA CGTCTTGCCT GAAGACATTG TCGATATTCA 600
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(2) INFORMATION FOR SEQ ID NO: 883: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 583 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 883:
GTTGTATTAA AATTTTTCAA TTTAACAACG AAACAATACG AAGAAAAACA TATTACTGAG 
CCACTGGAAT TGACGAGCTT ATTAGGTAnT ATTTCTCGTC TAGACAATGG TCATTTTGCA 
CATTTGCATG CTACTTTCGG TACTCAATCA TATGAAACTT TTAGCGGACA TCTCGCTAAA 
GCAATTGTTT CTGCAACTGC TGAAATCATT CTTACTGTAA CTGATTTGGA CATTCAACGT 
TCATTCAAAG ATGCAGTCGG TTTAAATTTA CTTGATCCTC AATAGCACTT TACTTTTTAG 
TTGAATCAAA TTTTATTACA AGTTATCATA AGTGCTTAAT CATAAATGCA TTCTCGTTTA 
TTAATATACT TCAACGAGTT AAATTAAATT TTTAATTAGG AATTGAATGC GTTTATAACT 
ATCACATTTG aATGaATAAC TTTTTCATTA GAAAGTAGTG TkTATTATTG aTGCAATCTC 
GaTACATCGC TAGAATTTAC TTTtaATTTT ATTTATCGTC TCCCTTnTTG AGACTAAAAT 
TTTTCAATTT ATGACTTTAA ATTTGTTTTT AGCATATATA CCT 
(2) INFORMATION FOR SEQ ID NO: 884: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 524 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 884:
GCAGAGGGCT TACGATAAAA AGAAAAATCT CGCTCGCCCG CTTCAGTCAA ACTCACAAAA
GcTAACGCAt GwyTGCTyCA TTCGTGCGAT ACACTTTTGA TACATCTACA CCAATACTTG
ATATCGTTTC AATAATGCTA TCTCCAAATG CATCATTACC TAATTGTGTA ATCATAtACG 
CTTGTTGTCC tAACTTTTGA ACTGTACAAG CTACATTGCA TGGCGCGCCA CCAATTTGCT 
TCGTAAATGT TTGAACATCT TTTAAATTCG CATTTGTAAC ATTTGGAATA AAATCAATTA 
ACGCTTCTCC TATTGAAAAT AGACGTCTCA TTTATGTTCA TCCTTTAAAT CATATTTAGT 
AAATTGTAAA TATACTTGCC CTGATTCTGT TGATGTTTTA ATACCTAGTG CGTCTTCTGT 
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TGTATCAACA AAAATTTGTA ATTGCTTTAA TGGCGTGTCT AATA 
(2) INFORMATION FOR SEQ ID NO: 885:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 885:

TTAATGATTT CTTCGTATGA TTCACCAACA ATTTGCGCTA AAATATCATG CGCAAGTACT 60
TCACCTTCAA CCATAAATTC TATTTGCTCT ACTAAAAATT TTACGTCATT CATGCTATTC 120
ATTTTTGGAA TTAACAAAGC ATATAAGTTT GTAAATTCTT TTAAATACGC AGCATCAGCT 180
TTTAAGTAAT GGCGTAATGC GTCACTCCTA TATCTCCGGA TAACATCTTC TGAATAAAGT 240
CATCCTCATA AATATCATAA TGATGGCTTG CAGCTGGGTA CAATTTTGTG AAATTCCATG 300
TAAAAATCCT CCCTAAATAA AAAAOTACTT CCAACATGAA GAGTTGAGGC ATGTGCAAAC 360
TAGCCCATCA CTCATAACAC TTCCACGTGG GACTACCAAC 400



(2) INFORMATION FOR SEQ ID NO: 886:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 857 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 886:

AAGAATTTAA ACAAAATGAT ATTTGGAAAC ATTTTAAAGC TGTGAAAAAT AATCATGTTT 60
ATGACTTAGA GGAAGTGCCA TTCGGTATTA CAGCAAATGT TGATGCTGAT AAGGCAATGA 120
CTCAATTATA TGATTTATTT TATAAGGATA AAAAATAGTG AGTTGATATG ATGATAAAAA 180
ATAAAAAGAA ACTACTATTT TTATGTTTGT TAGTCATTTT AATCGCAACT GCTTATATTT 240
CGTTTGTAAC CGGTACAATT AAATTGTCAT TTAATGACCT ATTTACAAAA TTTACAACTG 300
GTAGCAATGA AGCAGTGGAT TCAATCATTG ATTTGCGATT GCCACGTATA TTAATTGCAT 360
TGATGGTTGG CGCAATGTTA GCAGTTTCTG GAGCATTA'JTT ACAAGCAGCA CTACAAAATC 420
CTTTGGCAGG GCGAATATCA TTGGCGTTTC CTCAGGTGCA CTTATAATGA GAGCcTTTGt 480
ATGTTGTTtA TTCCACAATT GTACTTTTAC TTACCATTAT TAAGTTTTAT TGGAGGTTTA 540
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ATATTAGTAG GTGTTGCGTT ATTCGTATTA TTAAATGGTG TTTTAGAAAT TTTAACTCAA 
AACCCTTTAA TGAAAATTCC TCAAGGCTTA ACAATGAAAA TATGGAGCGA CGTATACATA 
TTAGCAGTAT CAGCATTATT GGGATTAATA TTAACATTAC TATTGTCCCC TAAATTGAAT 
TTACTAAATT TAGACGACAT ACAAGCGCGA AtATCGGTTT TAATATTGAT CGTTACAGAT 
GGTTAACAGG GTTATTA 

(2) INFORMATION FOR SEQ ID NO: 887:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 675 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 887:

GATTTATTTT AATATTATTG TTAGAAGGAA TTTTTACAAA TTCAGCGAGT GCAATCGAAT
ATTCAGACTT ACATCATAAA AGTAAGTTTG ATTCAAAGCG TCTAAGTAAT GCTAAGATGT
CATTCATCAA TCCAACTCAG CTTGAAAATA AAAACACAAA CGATAGACTG TTGAAGCATG
ATTTGTTATT TCATGACATG TTCGTAAATG ATGATTGGAA AAAGGATTTT AAAGTTGAAT
TTGAAAATGA GGCACTTTCA AAGAAATTTA TAAATAAGGA TATCGATATA TTTGCTGGAA
ATTATGGATA CGGATGTCAT GGGGGAGCAA CCAATAAAAC GCAATGTAGT TATGGTGGTG
TTACTTTAAG TGACAATAAT AAATACGATG ATTATAAGAA TATACCTTGT AATTTATGGA
TTGACGGACA TCAAACAGAA ATAGAACTAA CTGCAGTAAA AACGAAAAAG AAAATTGTTA
CTATTCAAGA ATTAGAGGTT CAATTAAGAA ATTATTTGAA TGAGAAGTAT AAGTTGTACG
AACAAGGTGG CGACATTGTT AAAGGGTATG TTAAATATTA TAATGATGAT GAACAAAATG
TAGAATATGA TTTTTATAAT TTAAATGGTG AGTATGGTCG TGAGGTATTA AAAATGTATG
CnGATAATnA AACnA











(2) INFORMATION FOR SEQ ID NO: 888: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 851 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 888: 
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AATTCCTAAT TAACAATGCC TTCATTTAAA AACCTTCTTC CATCAATTTG ATGTTTTCAT 
GATATTTAGG TTTTCTCTTT TCAAGATAAA CCATTAAGTy CTCAATAGAA GGTGTAGTGA 
TATTCACTTT AkGACCTAAA ATTTCTTTAA ATACTTGCGC ATGTTCTGTT AATCCTATAA
ATCCCGTACG CTTATGTTCT TCATAAATAA GTAGAGATGC TAATTCATCA TCAAGGTCCT 
CAATAGCACC ACTCACAACT TGATATCTTT GTAACAGTTG TTCTTTTGAA CCATTTAAGA 
TAACTTCACC GTCACTTAAG TGAATGATAT AATCAGCTAT TTTTTCTAAA TCTGAGATAA 
TGTGTGTCGA CATGAATATT GTTTTATTTT CATCAATTAA TTCTTGCTGA ATTATCTCCA 
ATAGCTCATT TCTAGCTAAG GGATCGAGAC CTGAAGTAGG CTCATCAAAA ATATACAATT 
CAGCATGATG TGAAAATGCA ACTGCTAGTG ATAATTTCAT TTTCATCCCA GTCGAAAATG 
TTTTTATTGA TTTATTGATT GGTAAATCGA ACTTTTCAAG ATAAAATTCA AATACTTGAT
GGTCCCATTT ACGATAAAAA GGGGcAATCA TTTTTTCAAG TTGCTTCGTA GTCCATCTCT 
CATTAAAATA ATTTTCAGAG TAAACnAACC CAATTCTATT CTTTAACTCT ATCGGATTGA 
GAGCCATATC CTCTTCTAAT ACTCTAATAA CACCAGTTTG TGGGTGATAC AAATCCATAA 
TTAACCTAAT n 

(2) INFORMATION FOR SEQ ID NO: 889:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 362 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 889:
AATCACCTTC ACGCCAATAT TGATTTTCAT GCGTAAATAC TTGTGCCGTT TCATGATACT 
TTGTCAATCG TGCGTGTTGC TGGGGGGAAT ATTTTTCAGT AGCCCAATTG GCTGCATGAC 
CTTCAATGGC TAGTTCAATT GCAGGATTAA TTAAATCTTC CAATGACAAT TTAGCATAAC 
GCTTGTGAAT ATAATCAAAC AGCTTTGGAA TTGCTGGCAC ACGACAGTTT TACCATGTGT 
AGTCATATCA AAAAATGATT TATATTCGCC TGAATCAnCT AGATAAAATT GGTTGGCTAC 
ATGGTCAGGG TGCTGGCTCA CGGTGCATCA AACGCnGnTA ATACTGCCCA GTACCTTTGC 
TC 

(2) INFORMATION FOR SEQ ID NO: 890:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 747 base pairs
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 890:
TTGGTTTATT CCCAAAAAGT ACGCGAAAAT TAGTAGAGAA GAATTTGAAA GTTTAAATAT 
TGTCAAACCC GCTAAAAATA ATACTTTCTG GCCTGTTGCA GGATTTGCAG TGTTATTAAC 
AACCTTAACA AGAAAATATA TCTATTTGCT TAACATCCAT TTAGAAAAAG AAATAGTTAT 
ATTAACATGC TGTATGATAC TTCTAGGTGT TTTCGCATTG TTTATATATA TAAATACAAA 
ATTGAAGTTA CATATTTTTG ATAAAAATAA AAGTAATAAC GAAAAGATCA TATTAATACC 
TACATTTAAA AATATTTGTT TATCCTTATT TGCTTATATA TTATTTGGTG GATTGTCAAC 
AATGGCTCTG AGTATGTTAG TAACTTCATC CCCTCAAAAT ATAATAGAAT TTCTTGCTTT 
AATTGGCATG ACTGCATGCT TCTTTCTACT GAATATGTCA TCGGTTCTAG ATAAAAAAAT 
TCATGTTATT TTAAAAACAA ATAAGTAGTA AAATTGATTA ACTTAGGTAG TATCGGATAC 
TTAAATGTTG GTTCATAAAA AGCAATGATT TTAAATCGAG GAGCTATCTT AGAAcAGGGA 
AATAAAACAG CCAAAGTTAT AAAAAGTGAA TTAATAACTA ATTATATTAT GTTAGCCACG 
CTTCAAATAA AAnATAATTA GAATAAGGTG GGATTGATAA TCAAnGCTAn GCGAATCTAA 
AATCATCAAT AAAAACCCCA AATATAG 
(2) INFORMATION FOR SEQ ID NO: 891:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 526 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 891:
ATTTTGAAAT TTTAGCAGAT ACTGATAATG GTCTCGATGC AATGAAACTT ATTGAAGAAT 
ATAATCCTAA CGTTGTTATT TTAGATATAG AAATGCCAGG CATGACTGGA CTTGAAGTTT 
TAGCGGAAAT TAGAAAAAAG CATTTGAATA TTAAAGTGAT TATTGTAACA ACTTTTAAAA 
GACCGGGATA CTTTGAAAAA GCAGTTGTGA ATGAtGTGGA TGCATATGTT TTAAAAGAAC 
GTTCTATAGA AGAATTGGTG GAAACCATTA ATAAAGTAAA TAACGGAGAG AAAGAATATA 
GCGCCACATT GATGACTTCA TTTTTTGTAG ATAAAAACCC ATTAACGCCC AAAGAACAAA 
TTGTATTAAG GGAAATTGGC AaTGGTTTAA GTAGTAAAGA AATAAGTGAA AAATTATTTT 
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ATCGTTTTGA TGCTTGGGAA AAGGCAAATG AAAAAGGCTG GACCTA 526 
(2) INFORMATION FOR SEQ ID NO: 892:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 751 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 892:

TATTTGATGT AAAAGCGAAn GAACCATACA ATGTAACAAT TACTAGTGAT AAATACATCC 60
CTAATACTGA TTTGAAACGT GGGCAAGCTG ATTTATTTGT AGCGGAAGGT TCTATCAAAG 120
ATTTAGTGAA ACATAAGAAG CATGGTAAGG CAATTATAGG AACGAAAAAA CATCATGTTA 180
ATATTAAGTT ACGTAAAGAT ATTAATAAAA TCTATTTTAT GACAGATGTT GATTTAGGTG 240
GACCAACGTT TGTCTTAAAT GACAAAGACT ATCAAGAAAT AAGAAAGTAT ACAAAGGCAA 300
AGCATATCGT CTCTCAATTT GGATTCGATT TGAAACATAA AAAAGATGCT TTAGCATTAG 360
AAAAAGCGAA AAATAAAGTT GATAAATCTA TTGAAACAAG AAGTGAAGCG ATAAGCTCAA 420
TATCAAGTTT AACCGGAATA TTATTATTTG TAACATCATT TTTAGGTATT ACATTCTTGA 480
TTGCTGTATG TTGCATTATA TACATAAAGC AAATAGATGA AACCGAAGAT GAGTTAGAGA 540
ATTATAGTAT TTTGAGAAAG CTTGGATTTA CACAAAAAGA TATGGCAAGG GGACTAAAGT 600
TTAAAATTAT GTTTAATTTT GGGTTACCTT TAGTTATTGC ACTATCACAT GCATATTTTA 660
CATCATTAGC ATATATGAAA TTAATGGGTA CAACGAATCA AATACCGGTT TTCATAGTAA 720
TGGGATTATA CATTTGTATG TATGCTGTTT T 751



(2) INFORMATION FOR SEQ ID NO: 893:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 357 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 893:

GGATGTTGTA CATTATTCAG ATTACTTTGA AGGGGCACAA AAATATTTGA GCTATTTAAA 60
ATCAACAGTA GATGTTAACT TTGAAGGTTT GAAAATTGCT TTAGATGGTG CAAATGGTTC 120
AACATCATCA CTAGCGCCAT TCTTATTTGG TGACTTAGAA GCAGATACTG AAACAATTGG 180
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TAGCTGAAAA AGTAGTTGAA ACTGAAATGA TTTTGGGGTT AGCATTTGGA CGGCGATGGG 
AGACAGAnCC ATAGCAGTAA GATGGAGAnG GnCCAATCCG TTGACGGTGG ACCCAAT 
(2) INFORMATION FOR SEQ ID NO: 894:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 464 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 894:
GCTAACCTGC AAAATTCGAT AAGCGATTCA ATaCTGAcGC CTGcATTTCG CAAATTTTTC 
ACTAAATAAA TCCAATTTAA ATCACTATCA TTATATATTC TATATCCATT TTCATCTCGA 
TTAACTGGTG GAATCACACC AACCTTTTCA TAATATCTTA AAGTGTCTTG AGATATATTC 
ATGAGCGCTA CGACTTCTTT AGTTTTCATT GCGACTATCC TTTCAGTTAT GTTTGGTCGT 
CTAAAGTAAT GTTGCTTTAT ATATTGTCAT CTTCGTTTGA ATACTTCTTA TTTTATTACT
CAAATTTAAA TTTGTCTCTT TTTTAACATT TTACATTTCA TCGTTTTTAA TTACTTTAAA 
AATTGTATAA CTkAAATATT TAAAATGATA TAAwCACTAA GATTGaTAAT AtTTAATTnT 
TTGGAAAATT ATnTTAAGTT GCCAATTTTG GGGGGATGCG GAAA 
(2) INFORMATION FOR SEQ ID NO: 895:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 599 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 895: 
AnAATGAAAG AGTTACAGGA CTTTATTGCT CGTTTCTCAG TAACGCTTCT AAATCTAAAC 
AAGCAACAAG TCGTAAAAAA CAACTCGAGA AAATTGAATT AGATGATATT CAACCATCAT 
CAAGAAGATA TCCTTTCGTT AAATTCACAC CTGAGCGCGA AATCGGTAAT GACTTACTAA 
TCGTTCAAaA TCTATCTAAA aCGATTGACG GTGAAAAAGT ATTAGATAAT ATTTCATTCA 
CAATGAATCC AAATGATAAA GCAATTTTAA TTGGGGATAG TGAAATTGCG AAAACCACAT 
TGCTTAAAAT ATTAGCCGGT GAAATGGAAC CAGACGAAGG TTCATATAAA TGGGGTGTAA 
CAACGTCATT AAGTTACTTC CCTAAAGATA ACTCAGAATT CTTTGAGGGC GTTAATATGA 
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GCGGCTTCTT AGGCCGTATG CTATTTAGTG GAGAAGAAGT TAAGAAAAAA GCTAGTGTAC 540 
TTTCAGGTGG AGAAAAAGTA CGTTGTATGC TAAGTAAAAT GATGTTATCA AGTGCAAAC 599 
(2) INFORMATION FOR SEQ ID NO: 896:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1057 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 896:

TCATTAGTAG AAATCGTTAA AGATGCATCT TGCAATTTAA AATCCGAATC TTTGTATTGT 60
TTGTTCACAC TATCTACGTT TAACAATGTT GTCATATCCA TGCTCCTCTT TGTTTAATTT 120
TAATAAAAAC GTCTTCTCTT CAATAAATAA CTAAAGGCGT TATATATGAA AATAGCAATG 180
ATAAGTAGCA AACTAAAGCT TTGATTAATT CCAGTAGTAA TTGAATAAAG CCCAATTACT 240
ACAATGGCTA GTATCAATAT TGATAAGTTG ATCGCATATG TTTTAAATAA TGCAATTAAT 300
GTTCACCTTC ATCCAATATT TCCAATCGCT TTTCAGTGTA ATTCTTATCT 360
GCAATTTTTG GTATTCTGTC ATCAAACCTT CTATTAAACA GTGTAAATTG TGTATTGAAA 420
ATAGCACTAG CAAAAAATGG TATTAAAAAG AATAGTATTG CATTTGCTGC AGCGTGCCCT 480
ACCACAAAAA TAAGCAATAC TAAAAAAGCA ATAACTGTCT GCAGAATACT TAATATACTT 540
CCATTTAAAA CATAACGATT CGCAAGCAAC TCATATTGAT CTGTTGCATC GATATCTACC 600
TCTTTATCTA CAAGTTGCTT ATACTTCAAT GCACGTCTTT GATTCATCAA AACAATGGCT 660
TCTACCAATA TGATAATAAT CGTCGCAACA ATCGATATTA CAACGACATT GTTATATGTC 720
GCAAACGTCA AGTTCTCAAT ACCAAATCCA TCAAAATTAC CTAAAATTCC ACCTATGATA 780
CCCCCTACAA GTCCACCCAG TAAAAGATAT CCGATATATC TTAGTATTTT CATGCCTCAT 840
CCTCCTCAAT AATAAAAACA GTTyCCACCG TTTCATTGAA AATGCGAGCA ATTTTTATTG 900
CCGTTAATAC TGATGGCATA AAATTGTTTC GCTCAATTAG CGATATGGTT TGTCTTGAAA 960
CGCCCGCTTG TTTAGCAAGT TGCGTTTGGT TTAAGCCATC TCGTGCTCGT AATTCTTTCA 1020
ATCGATTACG CACATCGCAT CAACTCCnTA ATTACAC 1057



(2) INFORMATION FOR SEQ ID NO: 897:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 336 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 897:

AAATTATTGA GTTAACAAAT CATTACGGAG CACATAATTA TTTACCATTG nCAATTGTCA 60
TTTCAGAAGC CGAnGGGGTA TGGGTTAAAG ATCCTGAAGG CAATAAATAT ATGGATATGT 120
GAnCTGCATA TTCCGCTGTT AACCAAGGTC ATAGACATCC GAAAGGTATT CAAGCATTAA 180
AAGATCAAGC TGATAAAGTG ACTTTACTCT CACGTGCTTT TCATAGTGAT AACTTAGGTG 240
AATGGTACGA AAAAATTTGT AAACTGGCAG GTAAAGATAA AGCTTTACCA ATGTAATACA 300
GGTGCTGTAA GCAGTAGAAA CAGCTTTGAA AGCAGC 336



(2) INFORMATION FOR SEQ ID NO: 898:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 302 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 898:

TGnTGnTTTC TTATTGTTGA CCAATTATTT TGCATACCAA ATCCAAATCA TGTATTTATC 60
AATCACATCC TAATGCTAAA TCCAATGTAT TACATTGTTT AATGGTATAG CACAATCTAT 120
CATATTTGGT ATATCAAGTA TGGAAAACAT TCCATATCAT TTTTACTTTA TTTTATTCTT 180
ATGTTTAATA GCTGCAGTAA ATTTCGTATT AGCAGGTATA CGACACACGG CAATTTATAA 240
TAAAACATCT AAAGTGACAC AAACTGATAA TCAACAGGGA GTTTCAAAGA TAGTCAGACG 300
GG 302



(2) INFORMATION FOR SEQ ID NO: 899:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 469 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 899:

CCACTAGCTG ATACAGTAGG TATACCAAGA TTCGTCATCG TTACGACATA TCAATTTGGT 60
CAATATGCGA TGTTATTCTT AGCGCCGACT GGAcTTGTTA TGGCCACACT ACAAATGTTA 120
AACATGCGAT ATTCACATTG GTTCCGATTT GTATGGCCGG TAGTTGCTTT TGTATTGATT 180
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TATAAAAAAT ACTAATGGGG TTTTATGCAT CTCGTAGGTT TGTAGAAATA CTAAACTAAG 

CGAGGTGCAT TATTATTTTT GATTAAGAAA ATAATGACGG TAATGATAAC ACTAGTAAGT 


AATTGATACA ATGCTCTATT TAATAATGAT ATTTTTAAAA TTTGTTTTTA ATGTAATGTT 
AGATCTATGG TATATTATAT TTAACGTGGT AAATATGTAT TTGCTGTAA 
(2) INFORMATION FOR SEQ ID NO: 900: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 821 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 900:
CTTTATTGGT AACTCTCTAT aATATtTTaA CATTTTTACG CTATCGTGCA AACGCAAcCa 
CTTTGAATTy TCTGATATTT TTAGCATATA ATTTACACCC TGCACGATGA ATTGTTAATC
CAGTTGTATA AATATCGTCA ACGAGTAATA TTTCCTTACC ATTTAAATCT AATTCCGTAT 
CAATAATAAA TGGATTTTCA TCTGCCAGAC GCTCTTTCTT AGTTAAATGA GACTGTTTTG 
GTCTATTTGA CATCTTTAAA ATCTTATCaA AGCGAATCCC TTTAGCTTTT AGTACTGCTT 
CTACCGGGTT AAATGTTCTA GATAAATCAT GTGCCGGCGA AGAAGGAATT GGCACAATAT 
AGTCATAAGA TGTTTGTGGT ATTTCAATCA AATGTGCCAA TAATTCACAT AAATAATAGT 
CTTTCAAAAA TTTATACTGA TGTATCATCT CTTTCATTAA ACCGTCATAT TGAAATTGAC 
AATATAATTG TTCCATTAAA TTAAAGTGTG CCGATAGAAA CTTGCAGTCT AAACAATACG 
CTTCATCTTG ATTTAAGTGT TTTAAGCACC TTGAACATCG CCTTGCTTTA ATATCAAGTT 
TAATATTGTC CCAATTCTCT TTGCATCTGT CACATAATCT ATTAGGTTTC TTGAACAAAT
TATAAATGGT TATATTTTCA TATAACTTAG CACCACAACT CAAACAATTA TTCATCAATC 
CAACCTCTTT TTAATGCTAA TTTGTTCATC CTTTGAATCT CTTTTTTAGC TTGAATCATA 
TTCATACTTA CCCCTTnCAT GAAAAAACAA TACTTTTTCC A 
(2) INFORMATION FOR SEQ ID NO: 901: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 349 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
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TGATGCTATT AAAAAGTTAA TTGTGCGGTC TGTTTTGATA ATTTTAATAA TAACTTCAGG 60 

TAACTAAAAA TCCTAATATT GAAAAAACAA AGCCATTTAA AACATAACCT AGTATATTCC 120 

ATGTATGATT GTAACTCATT TGCCAGTTnT GTACTGACTT GCATAATTCT GTCACGTnCG 180

AACCATGTAC AAGCCTGCAA CTACTGCTGA ATGATTCCTG ATGCGTGAAC AATTCAGCAA 240

TTAAATACGT AACAAATGGT GTTACAATTG AATAATTGAA ACATATTAAT GTTTCATATC 300

CTCGACGnCA TCAATGTTAA TCCGGACCTT ACTAATGCAT ACCTATAAG 349



(2) INFORMATION FOR SEQ ID NO: 902:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 390 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 902:

CAAGTGGTAT AAATTTACCT GGTGAAATGT AGGTCGTGGT GGCGATGATA CATTATTCGC 60
TAAAATCGAC GGCGTTGTTA AATTCGAACG TAAAGGTCGC GACAAAAAAC AAGTTTCTGT 120
ATATGCAGTT AGCTGAATAA TTTTGTCTAG TTAACACCAG AAAGTGAATC TTCTGGTGTT 180
TTTTACTTTT TATAAAATGT TTTTCATTAT TATTTTCATT ATGnTATTTA AAAATGGGGA 240
TTTTAGACGT TATACTAAAT GTGCACTGTA TAGGGGCCCT AATCACTAAC TATAGGGGGA 300
CAAGGATACA GTGCAGCGTT AAGGATAACT GnCCACATTG GTCTGGGAAT ATAGGATTTA 360
AGCAAGGTTA TAAAGTACTC nTAGGCCCTA 390



(2) INFORMATION FOR SEQ ID NO: 903:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 334 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 903:

GGAATAGACG TATATGCGCG TTGAATTTAA TCCTAATAAG CTTTCGCATG ATGAAGTGCT 60
TTGGTTAAAA CAAAATATCA TCAGTTATTT GGACGATGTT AGTTTTACGA GATTAGATTT 120
GGCTTTTGAT TTTGAATTTG ATTTAAAATG ACTATTATGC ATTGTCAGAT AAGnCGGTAA 180
AAGAAACTAT ATTTTATGGC CGAATGTAAA ACCAGAACAA AATATTTTGG TGTnCGAATA 240
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GATTCACATT CTATGCGGTG GAATTGATTA AACG 334 
(2) INFORMATION FOR SEQ ID NO: 904:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 334 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 904:

CAAGTTGGGG ACACTGTTTC AAATGAAACA ACAGTGTGTA TTTTAGAGGC AATGAAACTA 60
TTTAATGAAA TTCAAGCAGA AATTTCAGGT GAAATTGTTG AAATCTTAGT AGAAGACGGA 120
CAAATGGTAG AGTATGGCCA ACCGTTATTT AAGGTGAAAT AATGAAAAAG GTTTAATTGC 180
AAACCGCGGT GAnTCGCAGT TAGGATTATT CGCGCTTGTC GTGATTTAGG CATCCAAACT 240
GTTGCATCTT TCTGAGGGGA TAAAGATCGC TACATCTCAA ATGCTGATGA GGCATATnCG 300
TGGGCCnCTT GTCTAAGGTC ATATTTAATA TCCG 334



(2) INFORMATION FOR SEQ ID NO: 905:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 905:

TTCACCCTGT AATTCTTaAC CGTCAATTGA CCTTTATGCA GATTTAATAT TCCTAAAACG 60
TATAACTCTT CTAATGATAA TTGACACATA TTTAAATAAC ACTTTAGATT CCGCAATAAT 120
TCTTGTGACG ACAATAAATA CGATACATAG TCACAACTAA AATCAAATGT ATATTGACCA 180
AAATAACGTT CAATATAGTC ACTAAGCTCA TTATTCATAT ATTCTATTTn ATCTATATGC 240
ATTTTATTAA TCGAAATAGT CAATTTACGT TGATCCCTTT GATCTCTTCC TTTCAATAGC 300
CATTGATGAT TGTnTAAATA ACTTAACATT TGAAGAAGGG CCATCTTAGA TTGGATTTCT 360
TTCATTnTTA ATAAACTATC TATTGAAGCG CTATTGGAA 399



(2) INFORMATION FOR SEQ ID NO: 906:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1478 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 906:

TACATTTTTC AATTTATGGC AAAATnCACA ATTTTCATCG AAATATTTTG TAnAnAATCA 60 

5 

TTGCTAGGTG ATGAGTTATC TCGGTTAATT ACAAAACGAT ATCTAAGTTT AAATATTAGA 120 

TCATAAAAAA ACTATCTACT GAATATCCCT CCATAGCATC ATTTTTCTAA GCAAAAGTTA 180 

AACTTAGTTA GATAAAATGC AAATGAGATT ATTGTAGATA GTCTCTTTTT AATGTTTAAA 240 

70 

AATGATTAAT GCCaTAAACA TAAATAATTC CaAAGAATGC CGCGCCACTA AGCACTGTTA 300 

ATATAAGACT TAGTAACAAT GTGCGTTTAT AATGTTTaAC AAATGCtATG AACAAAATGA 360 

15 CTACATTATA CGCAAAGAAA AGTCCGAATA ATGTCATTGT TATTTTCAAA TCAGAATAGA 420 

AAATATTTAA TAGTAATACA ATGACTGCAT AAATTAAATA CGGAATAACT ATAAATTTGC 480 

GTTCAAATCT TAATTGTTCG AAaCGtTTAT CTTCGTTTGT CATGATATGC TCCTTTATGA 540 

20 ' TTTAAAAGTA AATAAGTTTA CGGAAtATCT TGTCCGaTAA TAGCGGTGTA AAtGTCAAAC 600 

CACGACTGAT CATCtAAATT AAGTkGTAGC CCtTCGATTG CTTGATCAAT ACGCTTTAAC 660 

TGACTTGTTC CAAGTATCCG GCATGATACG ATGCGGTATT TTTACTAACC ACGCTATCAT 720 

25 

CACAGCTGTG TCACTyACAC mATATTTGTC AGCTATTGAT TGAACAACTT TCATAATACG 780 

TTGCGCTTTA ATATCTTCCT TGTCGAAAAT TTTACCGCCT GCAAAAGGAC TCCAAGCCAT 840 

GATTTGAACA TGGTTTTGAT ACATTGAATC CATTGTTCCA TCTTGTAAAC TATCAACGTG 900 

ATATGGCGAT AATTCTAATT GATTGATGCT AATATGTAGT CTTTCTTTCA TAATATATTG 960 

ATTTAACAAT TGGTATTGTG AATGATTAAA ATTCGACACC CCGAATGACT TCAACTTACC 1020 

TTGTTTAACA AGTTTAGTTA ATGCATCAGC AACTTGTTCT GGaTCCATCA ATGGTGaAgG 1080

ACGATGAATG AGTAGACTAT CTAAATAATC TACATTCAAA TTGATTAATG ACTGTTCAAC 1140

AGATTTCACG ATGTGCTTAC TACTCAAATC ATAACGATGT CCATTTGTAA AATCAAATTG 1200 

CTTAGAAGGC AAAATGATAC CACATTTCGT AACAATTTGA ATTTTATTTC TTAATTCGGG 1260

TGATAAATCC AAAGCATTAC CAAACAGTGA TTCACATTGA TAATCTCCAT AAATATCAGC 1320 

ATGATCCATC GTTGTAATTC CACGTTCAAC TAATTCATTT ATAAAATAAT TTAACTCTTT 1380 

CGCAGTCATC TTCCATTCAT TTGCACGCCA AAAACCTTGT ACAAGCCTAG AAAAATGAAC 1440 

ATAGTGATTA ATCATTATTT GTTCCATATn TCATCCAC 1478
(2) INFORMATION FOR SEQ ID NO: 907: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 543 base pairs 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 907:
TTCTATGAGA ATAGATATTG TTAAATTAAG AAAGTAGTCA ATTTTATTAT GACAAAAGAA 60 

AGAAGAACAT TTAGTTCAGA GTTTAAGTTA CAAATGGTTA GATTATATAA AAATGGTAAG 120
CTTAAGAATG AaATTATACG AAAGTATGAT TTAAAACCCT CAATTATCTC AAATTCGATA 180
AAACAACACC AAAATACTGG ACCCTTCAAT CATCAAGATA ATTTAAAAAG TGATGAAAAA 240
GAGTTAATAA AATTACGCAA AGAAGTTCAA CATTTAAAAA TGGAACATGA TGTTTTAAAG 300
CAAATTTTAA AGTTGATTTA GAAACTGAAC TTTTTAAAGA TACATTACTT GAAGATTCTA 360
ATTTATCTGA TTATATAAAG AACCAACTTC CAACGATATG GAGAGACTAA GAAATAGTTT 420
AAATTTATTT AATTTCAATA AAAACTCATA ATAATATTTA AGTAAGTTAG ATTGCTTTTA 480
GCATTAGGAA TTCGCTTATA ATTAGATGTT CAATATAGAC TTTTTTACAC ATACATGAAC 540
TAT 543

(2) INFORMATION FOR SEQ ID NO: 908: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 751 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 908:

TTTnCGGTnG AACATGATTn CGATATTACT TCTAAAACGA TTCAACCAAA ACCAGGAGAA 60
AGTGTTGGAC GTTTAGTTAA TGTCGATTGT AAAGATACAG TACTGGAGTA TTCAGAATTA 120
GATCCAGAAG TTGCAAATCA ATTTAACAAT GCTAATATCG GTATTCATGC ATTTAAACTT 180
GGTTTTATTT TAAACGCTGT AAATCGTGAG TTACCTTATC ATTTAGCTAT TAAGAATTTG 240
AAGCAATTAG ATGAGAACTT TGGTGTTATT GAACAACCAA CTTTAAAATT CGAATTGTTC 300
TATTTTGACA TATTTACTTA TGGAACTAGT TTTGTCACGT TACAAGTTCC TAGAGAAGAA 360
GAATTTTCAC CTCTTAAAAA TAAAGAAGGT AAAGATAGTG TTGCAACTGC AACAGAAGAT 420
TTACGTAGAA TGGGTTTAAT TTAATGATAG GCGGTGACTA TGATGTCCAA ATCAACTAAA 480
AAAAGTAATG ACACAATCGT AGAAACATTT AAGGATATCA TCCCTTTATC ATTTGGAGAA 540
GAAATTGGTA ATGCGGCATC TCATGGTGTT GCAGCTTTGT TAACATTATT AGTGTTACCC 600
TATGTTATTT CTATCTTTAT GATGTTTATT TCATCTACTA TTTATCATTC TATGCAAAAT 720
GAAACACCTC ATAAATATAT TTTAAGGATT A 751


(2) INFORMATION FOR SEQ ID NO: 909: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 394 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 909:

AAATTGTCAC AATTCAGTCT AATTTAAAAG GAAGTTAATT ACAAAATTAA AAAATCATGC 60
ATCGTGTCAT TTAGATATTG AAAAAGATGA GTCAAATTTG TTTCATATAT TTTCCAATAA 120
CAAATTATTA TATCTAAATA TACAATCAGA AATATATGAA TAGAAATATC AATAAGAAAA 180
ATAATATGAT TAAAAATGAT GAATGGCATA CTTATAAAGT GTCTAAATAT TGGCGGTCAA 240
TATTACTTAC AAACACGAAT GTTAAGTAAT GTAGACAATG cTGaAAAAAC AATaAGTAGa 300
AACTCTATAT AGAATGTTCG TTTTTATTTT TaATAATTGa ATAATCaTTA TTGGGTGGAT 360
TTTAATATGG nATTCCGGTC GGCACACGGA TTAC 394




(2) INFORMATION FOR SEQ ID NO: 910: 








(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1022 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 910: 




TAAAGCTAAG AAAAAAGAAA AACGTAAAGC TTATAACCAA CGAATGAAAG AACGTAGGAA 60
AAATCAACCT AGCGCAGTTA GTCAACGTCG AATGAATTTT GAAGAGCGAC GTCAAATTTA 120
CAACAATGAT ATTTCTGAAG AACGCAATTC AAGTGAAGTT AAGGACAAAA AAGAGCAAGA 180
ATAAATATTG ATGATACCCA GATTAGTAAG CAGAGGTCTT TTGCTTATTA ATCTGGGTTT 240
TTATATGAGG TTAATTATCG ATAACGTTTA ATTAAAGTGT TTAGGTGTCA TAATTTTAAA 300
TGACGATTTC CCCATTACkA TACACCTAAA TtATCATCAA TCTGaATTCA GATGTTTATk 360
ATAAAAATTA GATGAAAAAT ATGTTAATAT ACAAGkAaTT TAATGTGcGk ATATCTATAG 420
GCTCGTAGTA TACTTATTAA AATATTAAAC AAAGAAAGGA TTTTAAGATG AATAGAAAAC 480
GGCTATCCTA TTTTTCTTTA AAAAGTGGTA ATGCATCACA ACGTGAAGAA TTAGCGAAGC
AATTATCTCA GAACGGTGGC AAGGTTTCTT TAGATATGCT TCAGACAACA ATGGGTGGAT 
TAGCAATTAT TTTATTAATT TCAACACTTT ATGGTATATT TGCGACAATT TGTATTAAAG 
GACGTAGAAA ATTATCGATT ATACTTTTTG TTATCGCGAT AATTGTAAGT TTGATGGCTC 
TTAATTTAAT TGCAATTGTC TTATGGGTTA TCGTGATGAT TATGTTGATT TCTAAAAAAG 
AATCAAAAGA AACAACACAT AAGGACGATG AGTATATTTA TCATTAATAT GTTCATAGCA 
AAAGAAAAAC CATTAAAATG TTAAGTTGTT AATTATTAGA TACAACCAAC ATTTTAATGG 
TTTTATTTTT AACTTTGTAG TTCTTGGAAT GTTTGAACGA TTAAATAGAC ATTTAAAATA 
CT 

(2) INFORMATION FOR SEQ ID NO: 911: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 911: 
GGTACTTTAT TTTTATTTTT GGTCTTAGTG ATTTTTACAT TATTTACATA TAAAGCGCCT 
AATGGTATGC GTGCCATGGG TAGCATTAGC TAATGCAGCA ATCGCAACAT TTTTAGTGGA 
AGCATTTAAT AAATATGTnG GTGGCGAGTA TTCGGTATTA AATTTTTAGA AGAGCTAGGA 
GACGCTGCGG AGGTCTAGGT GGTGTCGCTG CCGCTGGATT AACAGCATTA GCTATCGGTG 
TGTCACCAGT ATATGCATTA GTTATAGCAG CCGC 
(2) INFORMATION FOR SEQ ID NO: 912: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 679 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 912:
CTTTAATTGT TCACTTTATC ATTATTATCA ATCAACTCTA TGATATGAAA TTGTGTTAGA 
GATAAATCTA GTTCCTCATT TTCCTTTTTA TACTGTCTTC TTTTATCGGC ATTTTCTCTT 
TCTATAATAA ATTGTTGTAA TTTTAAAATA ACTTCTCTCT TCATATCCAC TATTATTCCT 
TTGTTGTAAT TAACATATAT TTACTTAGTT ATGCACCATT TATTCCGCAT TTCACAAATC
CACCTATAAT ACAAACTATC ATTTTAAATA AGCAGAATTT ACCCTGTCTT CCACACAAAT 
TGTTTTCGAC ACTTTAATGT GCCTAGTATC CAAAAAAGTT ACTCATAACA AATTGCGCAT 
TTTAATAAAT GTAGCAATGC CATTTGCATG CTGCAAAAAT ACTCGTATTC ATATTTATGT 
TTTATATTAT TAAAGTTTTT TAAATCCATA TTTTTATAAC ACTTGCTATG TGATAAAATT 
AATnTTATAT ATAAAATCTT AAGATTCAGA TTATTTAATA GCAAAGGAGA TAGTGATATG 
GATGTTTTAA CAATAGAACA TTTAACAAAG AAGATAGGCA ACAAAACGAT TCTCGAAGAT 
GTATCATTTA AGCTGAAAC 
(2) INFORMATION FOR SEQ ID NO: 913: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 564 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 913:
GCCATCCTTC AGTGCTAATT TTTTCAAATT CTAATGCAGA TGCACCACTA ACCATACCAT 
ATATAAAGAA AAAGACCACG AAAGnAATAA GTAGTAGTAT AGATGCACAT ATCATAATGA 
CATTGGAGTT TTCAGCGCCA AATTTAAATG TCAATGsGAA AATGATAGAG TATGCCCCTA 
TGATACCAAA ACTCATTAAA CCTGACATAA TACCAATCAT CACACTTTGG GTCACAATCG 
TAGTCACAAC TAATCCAATC ATTAAACTTG CACCGAATAA GATTAAATAA AAGGCAAAGT 
ATGACTTAAT ATAATCACTA CGTTTAACCG GTAAAGTAGA TACATAATAC ATCCATCTTG 
AGTCTTTTTC aTGTTTAATA TTATCAGTAA TAGGTGTGAT TAACATAACC CCAGCCATkG
CCGAACTCAT CAACGGATTA AATACTGCAA AGTATCCTGC AGCTATAATA GCGACTATAA 
AATAAATATA TGTTTGCnTT CTCGTTGCAT AAAAACTACT TAGGAACATA CCTTTCATTA 
TACTTCACCT CGCATTATGA TTTT 
(2) INFORMATION FOR SEQ ID NO: 914: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 643 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 
AAAGAAGAAG CAAGTGCAAA TAATTTAAGT GATATATCAC AAGAGGCACA AGAGGTTCAA 60

GAAGCTAAAA AAGAAGCACA AGCAGAGAAA GACAGTGACA CATTAACTAA AGATGCAAGT 120 

GCAGCAAAGG TAGAAGTATC AAAACCAGAG TCACAAGCTG AAAGATTAGC AAACGCTGCA 180 

AAACAGAAGC AAGCTAAATT AACACCAGGT TCAAAAGAGA GTCAATTAAC TGAAGCGTTA 240 

TTTGCAGAAA AACCAGTTGC TAAAAATGAC TTGAAAGAAA TTCCTCAATT AGTTACTAAA 300

AAGAATGATG TATCAGAGAC AGAGACGGTT AATATAGATA ATAAAGACAC TGTTAAACAA 360 

AAAGAAGCTA AATTTGAAAA TGGTGTTATT ACACGTAAAG CTGATGAAAA AACAACTAAT 420 

AATACAGCTG TTGACAAGAA ATCAGGTAAA CAATCTAAAA AAACAACACC TTCAAATAAA 480

CGAAATGCAT CAAAAGCATC TACAAATAAA ACTTCAGGTC AGAAAAAGCA ACATAATaAG 540 

AgATCATCAC mAGGTGcAAA GaAACAAAGT AGTTcmAGta AGTCaACTCA AAAGAATAAT 600 

CAAACTAGTA ATTAAGAnTT CAAAAACAAC AAATGCTAAn TCC 643

(2) INFORMATION FOR SEQ ID NO: 915: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 915: 



AGGTTTAAGG GAAAAATCCT ACCTGGAGAG ATTCAAAAAC TAAATGGACT ATTTCCAAAA 60 

ATAAAGGGCA ATCACATGGT GGTTCTTATT GGAAGTTAAT AAATAACAAA GGAAAAAGAA 120

TAGCTTCTTT AACTAAAGAA GGAAAAATCT TAAGGGAATA AGGTGTTGTA AATTATGTGC 180

TTTGATATAA ATAATTTGGA TATAAAAAAG TTGAATTTTA GAAAGGTAAA AAACGCAATT 240

CATTTAAGTT TAAGTGGCGA TAAATACCAG TTTTTGGAGG ACGGTAAAAC TATCAATAAT 300

ACTTATTTTT TAGCTGTATA TGATAATGCA ATAAACATTT TTGAAGATTT GTTTtCGCCT 360

tCTGATTCTA TAAATTTGGn TCATGTAGTA TATGTTTATA 400



(2) INFORMATION FOR SEQ ID NO: 916:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



CCCAAAGTTC GATAGTGTAT TAGTATTATC TTAATAAAAT GTTAGGTACA ATAAAGATGA

TTATATATCG GAGGTTAGTA TAAAAATGTA TGTAGATCGA AAACCATCAC TATATTTAGA 

GGATTTGCGA CATGATTTTA AAAATAGTTT AAGTAAATTT GAAAATGGTG ATGAAGCATT 

TGATACGTTA TTAGGTTTCG TAGAGTTAGA TCATATTTAT TCGTCAGCAC TAAAGGAAAT 

AAGCACTAAA CTGAGTATTT TAGATGACAA TTTCAATCAC ATTTATAwAC ACAATCCTAT 

ACATCATATG GAGCGACGTG TGAAAGAAAT GCGTAGTTTA ATAGAAAAGC TTAATCGTAA 

AGGATTACAG ATTAGCGCAG AAACTGCCAA AGAACATATA 

(2) INFORMATION FOR SEQ ID NO: 917: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 917: 
GAAAATTACT ATGAAGATTG CATTAGGATG CGACCATATT GTTACAGATA CAAAAATGCG 
TGTATCTGAA TTTTTAAAAT CAAAAGGACA TGATGTCATT GACGTAGGAA CATACGATTT 
CACAAGAACA CATTATCCAA TTTTTGGTAA AAAAGTTGGC GAACAAGTTG TTAGCGGTAA 
TGCAGACTTA GGTGTTTGTA TTTGTGGAAC AGGTGTTGGT ATTAACAATG CTGTAAATAA 
AGTACCGGCG TTCGTTCAGC ACTAGTACGT GATATGACAT CAGCGTTATA CGTAAAAGAG 
GAATTAAATG CGAACGTTAT TGGCTTCGGT GGACGTATTA TAGGTGAGTT ATTAATGTGC 
GATATTATCG ATGCCATTAT TAATGCTGGA TTATAAACCC 
(2) INFORMATION FOR SEQ ID NO: 918: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 918: 
GTCTCCAGCC ATTwCACCAC ACATACCTGT CCATTTACCT TCTTTATGTG ACGCTTCAAT 
AACTTGTTTA ACTAAACGTA AGATTGAAGG GTTATATGGT TGGTATAGAT ATGATACACG 
CTCTGACATA CGGTCAGCAG CTAATGTGTA TTGAATTAAA TCATTTGTAC CGATACTGAA 
GATTCCTAAT TCTATATCAT CCGAAATGTC ATGACCTTCA TTTTTAAGGT TTTCTTTTTC 300
TTCTAATAAT ATAGCTTTAG CTTCTCTAAA TTCGTTAATT GTTGCAACCA TTGGGAACAT 360
GATATTTAAC TTACCATAAA CTGATGCACG TAATAATGCA CGTAGCTGTG GTCTGAAAAT 420
ATCTTGTTGC GCAAGGCATA AACGAATCGC ACGGTAACCT AAGAATGGAT TCATTTCTTC 480
AGGCAAGTTT AAGTATGATA ATTCTTTATC TCCACCTATA TCTAAAGTAC GTACAACAAC 540
ACGTTTACCG CCCATTGCTT CTAATACTTC TTTATAAGCT TCAAATTGTT CTTCTTCTGT 600
AGGCATTTGG TCACGACCCA TATATAAAAA CTCAGTTCTA TATAAGCCGA TACCTTGTGC 660
ACCATTTTCa ATAACACCTG GCAAATCATT AGGTGTACCA ATATTTGCAG CAAGCTCTGC 720
GTGAACACCA TCAACTGTAA CAGTATCAGC ATCACGTAGw TTTTGTAATT CTTTCTTGTC 780
AGCAAAATAA CGCTCACGTT TATCTTGATA AGCGATTAAC TCATCTTCAG TTGGATTAAC 840
GATTACATCA CCATTTAATC CATCTACGAT AATCATGTCG CCTTGTTTAA CTTCTTGAGT 900
AATTGATTTT GTACCAACAA TTGCTGGaAT TTCTAAAGAA CGACTCATAA TTGCAGAGTG 960
aCTTGTTCTT CCGCCAATGT TTGTAGCAAA ACCTTGTACG AATTCTTTAT TTAATTGAGC 1020
AGTATCAGAT GGCGTTAAGT CATTCCCTAC AATAACAACG CTTTCATCAA TCATACTCGG 1080
ATTCGGTAAT TCTACACCTA AAATATGTGA TAACACACGT TTAGAAACGT CGCGAATATC 1140
AGCCGCACGT TCTTTCATGT ATTCGTTATC CATAGATTCA AAAATTGTAA CAAATTGTGT 1200
TGTnACATCC GCTTAATGTG 1220



(2) INFORMATION FOR SEQ ID NO: 919: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 919: 
TTAGATTGAC TGCAAATGCC GACTCAAAAG AACAAGCACA ATCATTGATT CAACCTGTTA 60 

AACAAGAAAT TCTTGATCGT ATTGGAGAAT ATTATTATGG TTCAGATGAC ACATTAATTG 120
AGCAAGCTGT AATAAAGAAA ATTCATGAAC CTTTTGTAAT ATATGATGGT ATTACTAATG 180
GTGCTTTATA TCATCGATTG AAAGAAGTGG ATTTAAACGA TGTTCTAAAG GGTATGATTA 240

ATCACAATGA AAACTTTGTt GATATTAATa AACCTATTGA GCaGCAATTA AAAGATGCAG 300 
TGCAATTTGT TAATA 315 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 433 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 920:

ACTTTGTAAA CCTACAAATG TAATGAATAA TCCTATACCT GCTGAAACAG CCATCTTCAT 60 

TTGATAAGGA ATTGCATTAA TAATAACTTC CCTAAACCCT GTGACGGTTA ATATCGCAAA 120 

GAATATACCT GAGAATAAAA CGCCTGTTAA ACCAACTTGC CAAGGAATAC CCATGGTTAA 180

CACAACAGTA AATGCAAAGA ATGCATTTAA TCCCATACCT GGTGcTAACG CAATTGGATA 240

TTTAGCTATT AGCCCCATGA ATAGCGAGCC TACAAATGCt GCTAATGCAG TCGCTACAAA 300

AATGGCACCT TGGTCCATTT TCATATCTTC TGATACGCCT TTAACACCTG CTAAACTTAA 360

AACTTGCGGG TTAACTGCTA AAATATAGGC CATAGATAAG AAAGTTQTGA TACCGCCTAA 420 

GATTTCTCTT TTA 433 
(2) INFORMATION FOR SEQ ID NO: 921:
(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 439 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 921:

CTATTTTATT TGTATAGCGC TTGATGTAAA GTTATTTTAG CTGTCATACT ATGTGCTAAA 60 

CCAAATTCAG TTGTGTCATA TCTTCGCTTT TTTGTCACTG TTATAATAGG TATAATCGGA 120 

TATAATGAAA AGGkGAGGGA GGATTCAAAA TGCGTAGGTT ATTATATTCA TTTCTTTTTT 180 

ATATGATCAT AGGTTTATTT AGTGGCTTTT TCTATAGAGA GCTAACAAAA GCATATGATT 240 

TTACTGGTAC AACACAATTA TCACTTGTAC ACACACATAC ACTTATTTTA GGTATGTTTA 300 

TGkkTTTAAT ATTATTACCA TTGGaGAmGT TATTTAAATT ATCAAGTTAC TACTTATTTA 360 

ACTGGtCTTC TATGTATATA ACATAGGTGT TATCGTTACT ATAGGTATGA TGGTGACAAA 420 

AGGATTCTTC CAAGTCACT 439
(2) INFORMATION FOR SEQ ID NO: 922: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 474 base pairs 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 922:
GTAATATCTA TATACGTATA TTTAAATGAT ATATCTGGGT ATTTTCTTTT TAATAGCGGC 
TGTAGCCAGT CATAAATATC TtTCGaTGTT GGCGCATTAA CGCAACTTGC ACATATAACA 
TCTGCCCCAT AAACTACCAC ACTCACGTGC TCCATATATT TATCCCCCAT TGGTTTGATA 
GATTTTTATT ACACTATCTA TTATAATATA sTmATAAAGA TTATCAAATT CATCTCTCGA 
AAGGAGACTT GCCTGATGCC TACTGAAGAT ACAACGATGT TTGATCAAGT AGCAGAAGTT 
ATTGAACGTC TTCGTCCATT TTTATTACGT GATGGTGGCG ACTGmTCATT GATTGACGTG 
GAAGACGGTA TTGTAAATTA CAATTACATG GTGCATGGTG GnACATGCCC CAAGTTCTAC 
AATCACTCTT AAAAGCTGGG TATTGAGCCG TGCATTACAC GAAGAAGTGC CTGG 
(2) INFORMATION FOR SEQ ID NO: 923: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 554 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 923: 
TCAACTTGTG AAGAAAATTT ATTAACTTTA GTAACTCTTC CAACAAAACC TTGTGATGTC 
ATCACAGCCA TATTTGAAGT TATACCTGAT TTAGATCCCT TATCAATTAC AATTGTATTC 
ATCCACTGAT CCGGATTTCT TGCCAAAACC GTAGTAGAAA TAGGATCAAA TTTTGAAATA 
TCTTTTAAAT CAAGCTCTTT TTTTAATTTT TCATTTTCCG CTTCTAATTG TTGGTTCTTA 
GATTCTAACT GGCTAATCTT ATTTTTAGAT TCTTTAGAAT CTCCTTTTTT AAAAAAGTCC 
CCAATCGTAC CAGCAACAAA ATTAACTGGA TAACTCACAA CTCGTTGTCC AAAAGACACA 
GAATCACCTA TATATTGTTC AGGAGGTGAT TGAGATTGTG AACGTATGGA CAGCCCAATT 
AATGCAATAA AAACGATAAT TGCACATAAA ACAACAATTA ATTTGGTATT TTTAAAAAAC 
TTAAGCACCC AGAACACCTC TATTATGTCA nAATATTGTA TATCCTTTTC TAATTnATAT 
TACTCCCATT ATGA 

(2) INFORMATION FOR SEQ ID NO: 924:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 575 base pairs 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 924:

ATCTTTAGGC ATTAAGCAAG TTTATGTAGA AGACTTTGAA CATAAATCCT TTAGCAAAGC 60
TAAAAAAGCC TTAGAAGAAA AAGGGTTTAA AGTTGAAAGT AAGGAAGAGT ATAGTGACnA 120
lni. HjA J, OHLt TTTCTCAATC TCCTAAAGGA AAATCAGTAG ATGAGGGGTC 180
AACGATTTCA TTTGTTGTTT CTAAAGGTAA AAAAAGTGAC TCATCAGATG TCAAAACGAC 240
AACTGAATCG GTAGATGTAC CATACACTGG TAAAAATGAT AAGTCACAAA AAGTTAAAGT 300
TTATATTAAA GATAAAGATA ATGACGGTTC AACTGAAAAA GGTAGTTTCG ATATTACTAG 360
TGATCAACGT ATAGACATTC CTTTAAGAAT TGAAAAAGGA AAAACAGCAA GTTATATTGT 420
TAAAGTTGAC GGTAAAACTG TAGCTGAAAA AGAAGTCAGC TATGATGATG TATAAATATA 480
ATTGAAGTAA ATGTACCGAG GTTTCTATTT GGAAGTCTCG GTATTTTTAT GTTGGAGATT 540
GCGGTAGTTT TAAAATGCnT CnTGTCnTCA TATAC 575



(2) INFORMATION FOR SEQ ID NO: 925: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 925: 

AAAAACGCTG CTGAGTTTAA AAAGCAACAA TTAAATGAAC AAGGTATTTT CAAGAAACCA 60

GTGATTACAC CTATTAAACC ATATAAAAAT TTCTATCCAG CTGAAGACTA CCATCAAGAT 120 

TATnACAAAA AGAACCCCGG TACATTATTA CCAATATCAC CGTGGGTCAG GTAGAAAAnC 180 

GTTTATAGAT CnCATTGGGG GGATCCAAAG CTTAAAAAGA TAAAAGTGAC TACCAGATAT 240 

AGATATATTG TTACACAAGA AACGGCACTG GACCACCATT TATGAATGAT ATTGGACCAT 300

TTGCTAA 307 
(2) INFORMATION FOR SEQ ID NO: 926: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 926:

CTCTACGCAC TCATTCAGAT ACACACATTG CGCATTGCTC ATAGAATCAG GTGTACACAT 60

TAAAGAAATA CAAGAACGAT TACGGCATAA AGATATCAAT ACCACTATGA ATATCTATGC 120 

TAAAATCACA AATTCATACA AAAAAGACGC CTCCCATAAG TTTAGTCACC AAATGGAAGA 180 

CGTCTCGTAA TTAAAATATA TTTGCGTGCA TTCTAATTTA TACTTAGAAT GAATCATACT 240

CGTGCATAAT GTAATTTTCT AGTTAGTCAA AACTATAAAC AGTTTTACAT CATTCCTGGC 300

ATGCCACCCA TGTTAGGTTG GTCATTATTT TTTTCTGGAA TTGATGCTAC AACCGCTTCA 360

GTCGTTAAGG ACATTGCTGC AACACTTGCA GCAGTTGnAA 400

(2) INFORMATION FOR SEQ ID NO: 927: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 809 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 927:

TTTAGGTGGT TTTAATnGTT ATGGTCATTC TGGACACCTA CTTATAATAA AATTTCAAAT 60
CAAACTGAAC nTTTTGTACC GTTTGATTAA ACTCAAATTT ATTAGCTCCG TATTTAAATT 120
TTGGTTGGGC TATATTCGTT TCGGTACTTA TTTCAACACC GTTTTTATAA ACTCGGAAGC 180
TATCATAAAC AATTcTGTCT CCAGCTTTTA GTTTGATCCC TTCGATTTTC ATTATTTCAG 240
CATGCGTTAA ATTCCATACA AACGATTCTG TATCTTCGCC TAAAATAATT GTTATCTTTT 300
TATACATGTT GAATTGGTCG TTAGGAGCAC TACCATGATA GTAAACTGTA CCTTTGCTCA 360
AATTTTCAAA TGTATACTTT CTTTTGTCTC CGCCTGCATG CCAATCAATA TTAAAATCAA 420
ACGACCACAA TCCAACCTTT TTGTTTTCTT CTAACTCTAG GCTTGTTCCA ATACTTTCAC 480
CGTATGGTAA TTCTGTAGTT TCGAATTTTA GTTCAAAAGA AACTTTATTA CCTTTTTGTT 540
TAGGGTTTAT AACTCCGTTA AAAATAACTT TATACTGTTT ACCATTTACA TAAATTTGTT 600
GATCGTGTCT TGAATATTCG TAATCCGGGA            ATCTAATTTC ACGTAATCAT 660
CAGAAGTTGG TTGAGTAAAC CTGTAATTCA ACTCTTCTTT TCTTCTGATn TCTCGCAAAT 720
ACATAGGTTC TATGTCTGTC GTTAACGAAT ACAACATATC TCGCATATAA GCAATGTCTG 780
AACGATTTTT AACTTTACAA AAACAAGGA 809



(2) INFORMATION FOR SEQ ID NO: 928: 

(A) LENGTH: 1016 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 928:

nAAAACTATT ATCGCATCTA CaGTaATCGC TGCAGGTCTT TTAACTCAAA CTAATGATGC 60
TAAAGCTTTC TTTAGTTATG AATGGAAAGG TTTAGAAATC GCAAAAAATT TAGCAGATCA 120
AGCTAAAAAG GACGATGAAC GTATTGATAA GTTAATGAAA GAGTCTGATA AAAATCTAAC 180
TCCTTACAAA GCTGAAACTG TTAATGATCT GTACCTTATT GTTAAAAAAT TAAGcCAAGG 240
TGATGTGAAG AAAGCAGTTG TCAGAATTAA AGATGGTGGT CCTAGAGATT ACTATACTTT 300
TGACTTAACT CGTCCTTTAG AAGAGAACAG AAAAAATATT AAAGTTGTTA AAAACGGTGA 360
AATCGACTCG ATTACTTGGT ATTAAAAAAC ATACTGAATT AAATAGTTGT ACGCCAAACG 420
TTAGAAAACA ATGCTAACGT ATTGGCGTGC TTTTTTATTT TAAGTAACTT CCAATTTATT 480
TAGCATCTTT ACGACTGTTT AATAAAGCAC GTATGATTAA CACGGTTGCC ACTATATCCG 540
TTACAATTTT TATGATTGTT AATACATTCG ATTTCCTTTT CACAATAGCC ACCTCGCTTG 600
TTCAAAACAT AAACAACTAT TGCATTCACT TTTAAGTAAT TATTTATATT ATTTATCCCA 660
ATAAGCTCAC CATTCAAATA AACACAATAT TATAAAATAA TTATACCTTT GGATATAGCA 720
AAAAGCCACA CTCTATAGCA TGGCTTCTAT CAATTATTTT AAAGTATTAT ATTTTAAAAC 780
TAGATCGATT TGTCTTTGTA ATTTTTTCTT TTCATAACTG TGTTGGAAAT GAATTAAATT 840
AACAGCTCTT TGTGCTTTAC GGTGTGTTGC AACAGTTCTT GTACGTTTGA AAAAGTTTAC 900
AGCTTTTTGT GCATCCACAA CTTTTttATT TACTTGwTTT kTAAAGTTTG TTGATACTGA 960
TCATATTTTT TAGCAGCTTC ACCGTTnTTA GTTGTGCATG AGATCACCGG CAACCG 1016


(2) INFORMATION FOR SEQ ID NO: 929: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 929:

AAATGAAATC ATACCTAAAA GAGATATTAT TACAGAATCA ATGATTTGTG ACTGTATTCA 60 
AAATGCAGGT ATTGATTTAG AAGTATTTAA AGACGACTTA CAAAAAAGTA AACTAACCGA 120 

CGTTTTCTTT AGTGAAGATG TTCATGAAGA AGGTTTAAAA GTCGAAGGAT TATACCCATA 240

TCACATCTAT ACTTATATAA TTAATGAATT GATGGGTAAA CCTATCGAAA AGAATCTTCC 300 

TCCTAAATTA GAAACTTATA TACAGCAACA ACAACTTGTA ACGATGGAAG AATTACTTAC 360 

TATTTATGAA TGGCCAGAAA AACTTTTAAA CAAAGAGTTA 400 



(2) INFORMATION FOR SEQ ID NO: 930: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 930: 

TATTTCTGAC ATGTATCATG CTCCTTCGTG CTTTATTCTA ATGTAATATA CACAATTATA 60

CATTGAAATT CAAATTTGTG AACACATTGT GAACTGACAT AAATTATACA CAATTTCATA 120 

GCGCATAATG TTTTAAAAAT ACACATTATA GCAATTATAA TTAAGTTCAT TGTGTTGTCA 180 

TCAAAAAAAG AAAAAGGTGA TGTTTTAAAT GGATACAGTT GAATCAGTCG GTTTTTGACA 240

GCATGACTTA GCAGTTCATA TCATTTTGCA ACGATGGTGT GGTATGCTTA ATGTCGCAAT 300 

TCAGATTTTA GGATCGCAAA ATGACTCATA TATAGCATGC CAAGTGGCAA GTATCATACG 360 

ACATGGnGTG TCAGGCATAn GGCTCATACT GTGCAATATG 400 

(2) INFORMATION FOR SEQ ID NO: 931: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 543 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 931:

CCAATCAATG TTGCCAATGG ATCACCTAAT GACAACCAAG TAATTGCTAC CATAGTAACT 60
GCTGTTTTAA TTTTACCTAA TTGACCAGCT GCACTTACGA ATCCTTGTTC AATTTGTAGT 120
AAACGTAAAC CAGTTACGGC AAATTcTCTG GCAATaATAA TGATTGCTAC TACAGAATTT 180
GTTAGTCCTA GTTGCACAAG TACAATTAAA GCACTTGCAA CTAATAATTT ATCCGCTAAT 240
GGATCCAAAA ATTTCCCCAT ATTTGTAACT AAATTCCATT TTCTAGCTAA ATAACCATCA 300
ACAAAATCGC TAAGGGAAGC CAATATAAAA ATAAAACCAC TGATTAATAA CTCAATTCTT 360



ATAAAAACTG GTATTAACAC TACTCTAAAA ACCGTAATCT GGTTCGGAAT ATTCATTATA 480
CATCCTCATT TCTCACTAAT TTATTTCTGT TAAAAATATT AAAACTAACC ATGATCCATA 540
ACC 543



(2) INFORMATION FOR SEQ ID NO: 932: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 932: 

GATTGCGAAT GAGATTAGTG GGATGATACC GGTAGAATGG GAGCAAGTAT TTACAATAGC 60 

TTATGTAACT GATCAAGCTG GAGAAGTCAT CTTTAATTAT ACTAAACCAG ATAGTGATGA 120 

ATTAAATTAT TATTCAGmCA TACCTAAAGA TTGCAATGTC TCAAAAGATA TTTTTAAGAA 180 

TTCATGGTTT AAAGTTTATC GAATGTTTGA TGAGTTAAGA GAAACTTTTA AAGAAGAAGG 240

GCTTGAACCA TGGACATCAT GCGAATTTGA CTTTACAAGA GATGGCAAAT TGAATGTATC 300 

TTTTGATTAT ATAGATTGGA TAAATACAGA GTTTGATCAA TTGGGCCGTC AAAATTATTA 360 

TATGTACAAA AAATTTGGGG TTATACCAGA AATGGAATAT 400 
(2) INFORMATION FOR SEQ ID NO: 933: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 759 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 933: 



ATTTTTACAA AGAAATAGTA GTCTTATATA TCTTAACATT TAATAACTAA ATCAAACATA 60
TTTTGAGCCT TTTTTTGAAA ATTAACATTT TAACCTTTTT GATTTTACAA CAAATTATAG 120
CTACGTATTG AAAATTAAAG CmTTGGtTTA AGTGTTGTGT TAAAAgyTTT TATGTTTAGA 180
TTGTTATATT ACTATGTATT CTTAAATTTG TTTAAAAAAT TAATGCTTTA AATTGATGTA 240
TAATGGGAAT AAGAAATAAA TAAAAACGAC CCGCACGATT AACGTACGGG TCCACTACTA 300
AAGGGAGTCA AATTTTTACC TcGTTTGTAT CATGCAGCGT TTTACACATA CTTTTAAGAG 360
ATGTTTATTC GTTATCGAAG GTACACCTTT ATTATAACTT ATATCATTTT TATTAAAATA 420
ATAATGATTA TTAAATAGTA ACTAAATACA AAATTACATG GGGTGAATGA TAATGAAACA 540

ATACTTAATT ACTGGTGGGA CTGGTATGGT TGGATCTCAA TTAGTTAATG AAATTAAAAA 600 

ATCAGATTCA CATATCACGA TATTAACGCG ACACGACCAA ATTTCAAATG ATAAGAAAAT 660 

TTCATATGTC AACTGGGCTA AATCTGGGTG GGAACACAAA GTTCCTCAAA ATATCGATGT 720 

GGTCATCAAC TTAGCAGGTG CTACATTGAA TAAACGATG 759 



(2) INFORMATION FOR SEQ ID NO: 934: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 934:

GATCAAGCTG GGGAAGTCAT TTTTAATTAT ACTAAACCAG GTAGTGATGA ATTAAATTAT 60 
TATTCAGACA TACCTAAAGA TTGCAATGTC TCAAAAGATA TTTTTAAGAA TTCATGGTTT 120 

AAAGTTTATC GAATGTTTGA TGAGTTAAGA GAAACTTTTA AAGAAGAAGA GCTTGAACCA 180

TGGACATCAT GCGAATTTGA CTTTACAAGA GATGGCAAAT TGAATGTATC TTTTGATTAT 240 
ATTGATTGGG TGAATTCAGA ATTTGGACCA ATGGGAAGAG ACCATTATTA TATGTATAAA 300 

AAATTTGGnA TTTGGCCTGA AAAAGAATAT GCCATAAATT GGGTTGnAAA AATAAAGnTT 360 
ATGTTAAGAG CAAGnTTGAG CTGAACTATA GGGGAGATAA 400 
(2) INFORMATION FOR SEQ ID NO: 935:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 345 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 935: 



CTGAATATAA TTTTTnCAAC TACATCTCGT TTATTAGACA CCGTGCACTG ACTAAGAAAA 60
TTTCTCTTGA CTCCATTAGT CCTGACGAAT ACTAACATTT AACTACCTTG CTCATCGATA 120
GAAACAACTT GTAATGTTAA TTTCCCTTAT TTTCTTAGTT TTAATCTATC AGCGATTAAT 180
TCGATTGCAT CTTTTTCATA AGCAATTGGA TAAACTTGAC CGCGGTACAC CTAACGCTCG 240
AAATATGATT TTnTATCGTC ATAATCTAAA ATATTATnGG CAAAATCACA GCAGTTTTCA 300

(2) INFORMATION FOR SEQ ID NO: 936:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 619 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 936:

GTTATCATCA TCTTCTGAAG TATCATCTTG CCCATCGACT TGACTTGCAT CAGTTCTCTT 60
TAAATCATCA AACTCTGGAC TAGCTAATGT ATTATATAGT GTTTTAGGTA AGTAATAATA 120
TTCGTTTAGC ATATCTTCTT TTAAAATATC ATCTACTTGA TACATAATTC TTTGAGCCAG 180
ATACATATTA TCTTCAGAAT TTTGATTTTG GAAAAAATTA GGTAAATATA AGAACATATT 240
TACTAAAATA TCATCTAAAT CTGAATGTAT AGATGGTATA TCAAAGAAAT CTTGGCTTAA 300
AAATGCATGT TCTAAATACA AAAATAATAA ATCTGTATAT TGTGTTTTAG TACGATACAC 360
TTTAATTTGA GATTCCGTAT ATGATATACG TGTATCTAAG CGAAGATCAA TTAATTTAGC 420
AGTACTTGGG CGCTCAACTT TAATAGAATT TAATACGCGC ATATCTTCTA ATAATTTAAA 480
AAGTTGTTGG ATAAAATTTA GGGtGTTTAA AAGTTTTATC TTGtACTACT TCATTTACAA 540
TTTGtACATC CaTCATATGA TAACCGTAAG CAGCTAACAT AACATCTGTT TTTAAACCAG 600
CCATTTCGAT ATGGCTTGG 619



(2) INFORMATION FOR SEQ ID NO: 937:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 615 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 937: 



TAATCCCTTT CATAATGGGC ATCAATATCA TATTAATCAA TCTAAAAAAC TTACAAATGC 60

TGACGTTACT ATTGCAATAA TGAGTGGTAA CTTTGTCATG CGTGGCGAAC CAGCAATCTA 120

TAATAAGTTT ACTCGTGCAA AAATGGCATT ATCAACAGCT GATTTAGTTA TCGAACTACC 180

AGCAACTGCC AGTTTATCAT CTGGcGATCA TTTTGCCGAA CTAGCAGTTA AAGTCGCnGA 240

TTATATGAGT GTCGATACAA TTGCATTTGG TAGTGAAAAT AATGAtATCA AAACATTAAA 300

rCmATTAgCA CACAGCATTA ATGaAATTGA mCAATCTGaA TCCTTTTCAC aAAAAGTrAA 360 
AAGTCCTAAC AACATACTTG GTATTAGTTA CCTGAAAGCA ATTGCTAAAA ATGCTAAAAA
CATCAATGCA ATTTCTATCA AACGAGAAAA TGCTCAACAT CATGATTCAT TAATTCAACA 
CCATCAGTTT GCAAGTGGTA CATCTATTAG AACATCAATC ATTAGTCAAG ATGATCATTG 
GCATCATGTG GTACC 

(2) INFORMATION FOR SEQ ID NO: 938: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 938: 
AGAATTGAAG ATGGTAAAGT TTCAAAATAT CATTCGGTTA TCATAAAAGA CGCACAAGCA 
ACTTCACCAT ATTCAATTTT TATCAGAGGT GCTATTTATC GCTTTGAACC ATTAGTATAA 
ATATACGTAA GTGCTATGAG CGAGAATGCC CATATGAATA ATGACAAGCA CAATGGAAAG 
AATCGTAATA TATTATTTAA TCGTGATGCT TAATTAAAAT GAAAAAGATT GATAATATAA 
ATGTGAAAAA GTAAGTATAC CCGTAAACTA AAGTATTCAC GGTGAGAGGT GCTCATGTCA 
TAATGATGCA CGTGTCATAA TTATAATAGC TTAnATATGT CCATACAACA CCATGTAGAT
ATGCATATAC AGGGTATGAT AGACATAGnG TCTGnGAAAT 
(2) INFORMATION FOR SEQ ID NO: 939: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 939: 
GTACAGGTAT GATTGGTGAT CCCATCAGGT AAATCAGAAG AACGTGTGCT ACAAACAGAA 
GAACAAGTAG ATAAAAATAT CGAAGGTATT AGTAAGCAAA TGCACAATAT TTTTGAATTT 
GGAACAGACC ATGGTGCAGT GCTTGTTAAT AATAGAGACT GGTTAGGACA AATCTCATTA 
ATTAGTTTT1' TACGTGACTA TGGTAAACAC GTCGGCGTTA ATTACATGTT AGGTAAAGAT 
TCAATCCAAA GTCGTTTAGA ACATGGTATT TCATATACAG AATTCACATA CACGATTTTA 
CAAGCTATTG ATCTCGGTCA TTTGAATAGA GAATTGAATT GTAAGATTCA AGTAGGTGGA 



GTAGATAAGC AAATGATTAA TAGTTGAGGG GTATGTCGAT GAAGCG 
(2) INFORMATION FOR SEQ ID NO: 940: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 940:
ATGAATCACC CATTGATACC AATCATTTAC ATCACTGTCA TACGAACATA TATTTAAATA 
ATTTTAAAGA TTATAACTAC TCTTAATCAT TTTAGTGAAT TAAAAAAAGT 
GCAAAATATA CTTTATACAC TACAAATCAT TTATTTATAA TAAAGTTTCA 
TTCCAACTAA TGAAACCGCT TGTTCAGCAG TATGATTATn ACTGTCAATC 
AATGGATTTA CTTCAACTAA ATCCATTGAG GAAATTAAAT GTGATTGATG CAGTAATTCC 
AATGCnAAAT GGCTTTCTCT ATnACTAAG 
12) INFORMATION FOR SEQ ID NO: 941: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 315 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 941: 
CGATATAGTA CCAATTTATT TGAAGAAAAA TCCGAACTTC AAATTTCCGA TGAAGCAAGA 
TATACCGGTT ATTATGATTG GACCAGGTAC TGGAATTGCT CCTTTTAGAG CATATTTACA 
AGAACGTGAA GAACTTGGTA TGACTGGAAA AACATGGTTG TTCTTTGGTG ATCAACACCG 
TAGTTCTGAC TTTTTATATG AAGAAGAAAT AGAAGAATGG CnTGAAAATG GAAACTAACA 
CGCGTAGATT AGCATTTCCA AGAGnCCAGA CACAAGATAT GACAGCCnCG ATATGGAGAA 
GTAACGTTCC ATGAT 

(2) INFORMATION FOR SEQ ID NO: 942: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 699 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



GAAAAAAATT 
AGTGCAAAAA 
CCAAAAAATG 
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(xi). SEQUENCE DESCRIPTION: SEQ ID NO: 942: 

TACCAATAAT CAACTTGTTT TACATCATCG TGTGTTTCAA TCACTTTCCT TTTTATACCT 60 

TTCTTACCTT TAAAGAATGC ATCAACATCT TTTATTGAAG ATCCTTTTGA ATCTTTAGTA 120 

AATCTATAAT CTGCAAGTTC CACTTTACCC ATTAATTCTT TGTAATCTTT CCATCCTTGA 180 

ATCCCTTTAC TATTCGTAAA CGACTTTTCT GTCATTAATG GATCCAACTT ATCAAGAGAT 240

GGCGCATCTT TAATTCCTTT CATTTTCTTC TCTATTTCTT CATATTTCTC TAAATCTGGA 300

TCTTTCTCTT CTTCCGATTC TTTTTTTACA GTTTCTTGTT TTGATTCTTC TGATTTCTTT 360

GATGATAATT TATGATTCCC ACCAAAAGAA CAACCTGCTA CGACTAAAAG CATAATTAAA 420

AGTAAACATC CCAGTGTCTT TTTCATAAAC TTTCTCCTCT TAAGTTATTT TGTTTGAATC 480

AAATAAACAT ACATAAATTT AAAAAACTTA TTTATATTGa TATTTTACAT TAACCATCAA 540

TATTTTTAAA TACTTTAAwT GaTAATTTAA GAAAATTGTT TTGTCTAATA ATTTAAGTAG 600

TTAAAACATA GATAGATATA AAATAGTTTA TAGCCATTAA TTTATAAGTT TAAAATTCGC 660

ACTATTTCAA TTTGCCATTT AAAAATAGTT TGTTTAATC 699
(2) INFORMATION FOR SEQ ID NO: 943:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 943:

CGCTCCATAA ATCAACAATG TTCGAGTCTG TAATCTAGAT GATAAAACGC CAAGAATGAT 60 

CATAAATGGC ACTTCTAAAc CAGCACATAA ACTAGCTAAA TAACCGACAT GTTGTTCATT 120 

TTCTTTTAAA TAATCAGTAA CAAATAAAGG CATATTCATC GTATACATCC ATTGTCCAAT 180

GTGTAATAAA ATAAATGCAA TAAATGGTAA TAAAAGCGTT TTGTCTTTAA ACATATTAGG 240

AGCAATTTTT TCAACATGTT GTTGCGTACT AATAGGGTGT TTAATGTTTA AATCCTTATA 300

GAnAAACACT TGAAGTACTA AAGTAAATAA AATGATACTT ATTGGGCCAC CAAACAATCC 360

AGCATAGCCT TTTAATCCGA TTAATTGGGC ACCAATAAAT 400 
(2) INFORMATION FOR SEQ ID NO: 944: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 944: 

AAGATGACAA TACTTTGTTT CAAGAATTGA AAAAAGAATT AGAACAATGG GATTTTAATG 60

TTGCTGGTAT TGAAGATTTC GGCAAAGTAA TGGATACATT TGAAAGTTTT AATCCTGAAA 120

TTGTTATATT GGATGTTCAA TTACCTAmAT ATGATGGGTT TTATTGGTGC AGAAAAATGA 180

GAGAAGTTTC CAACGTACCA ATATTATTTT TATCATCTCG TGATAATCCA ATGGATCAAG 240

TGATGAGTAT GGAACTTGGC GCAGATGATT ATATGCAAAA ACCGTTCTAT ACCAATGTAT 300

TAATTGCTAA ATTACAAGCG ATTTATCGTC GTGTCTATGA GTTTACAGCT GAAGAAAAAC 360

GTACATTGAC TTGGCAAGAT GCTGTCGTTG ATCTATCAAA AGATAGTATA CAAAAAGGTG 420

ATCAGACGAT TTTCCTGTCC AAAACAGAAA TGATTATATT AGAAATTCTT ATTACCAAAA 480

AAAATCAAAT CGTTTCGAGA GATACAATTA TCACTGCATT ATGGGATGAT GAAGCATTTG 540

TTAGTGATAA TACGTTAACA GTAAATGTGA 570 
(2) INFORMATION FOR SEQ ID NO: 945: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1241 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 945: 



CCGAGnCCAC CGTTCCAAAG TCCATTTTTA TCCCCATCCC TCTCTGAATT GAACTATAAT 60

TTTTGAATTT ATTAATAATG CTATTTTTnT TATTTTATCA AAAACTGATT ACAAATACAC 120

ATAGAAAATG AAATATATTT TTCTTTGACT ATACATAACT GTTATTTCTT TGGATATTTA 180

TATTAAATGC ATTGGTAGTA GTTGTAGCGC ATTAATGTTT GACGGTGTAT AGTATTAATT 240

ATATTGAAAG TTAGTTGGAA GTTGATAGTA GGAGTGGGAG CATTGAAGCG ATTTGTGGCG 300

ACGGTATTAT TATTACTAGT CTTTATATCA GGaTGtGGTA ATGrTaAATA TGTGAAAGAA 360

ATaGATGAAG CAGTTAAAaT TCAAAATCaA AAACAAGAAC ACTTGCCCAA AAAAGGCAAC 420

GGTGATCGTG TTGATCATTT TGAACGCAAA GATGCTAATA TTTATGTCTA TGATAAGGAT 480

AAAATTATCA TTTTAGCTTA TAAACCTTTG AGTAATGATG ATGAAGTGCA TTATTATGCA 540

TATGATTTTA GTGATAAACG TGTATCATAT AAGCAAGATT TTGATTCGAG ACGATATTAT 600

CAACAACATG ATGCGGATTA TCATGAAGAA AATATGACGA ACTAGATATG AATAGGAGTT 660

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GGTAAAAAAG AATCAGCAAC GACATCTTCG AAAAACGGCA AACCATTAGT TGTCGTATAT 780 

GGCGACTATA AATGTCCTTA TTGTAAAGAA TTAGATGAAA AAGTCATGCC AAAGTTGCGT 840

AAAAATTATA TAGATAATCA CAAAGTGGAA TACCAATTTG TCAATTTAGC TTTCTTAGGT 900

AAAGACTCAA TTGTTGGTTC GCGTGCGAGT CATGCAGTAT TGATGTATGC ACCTAAATCA 960

TTTTTAGATT TTCAAAAGCA ATTATTTGCT GCCCAGCAAG ATGAAAATAA AGAATGGTTA 1020

ACAAAAGAAC TATTAGATAA ACATATTAAA CAACTGCATT TAGATAAAGA GACGGAAAAT 1080

AAAATTATAA AAGATTACAA GACAAAAGAT AGCAAGTCTT GGAAAGCTGC AGAGAAAGAT 1140

AAAAAAATAG CGAAAGATAA TCATATAAaA aCGACACCAA CTGCATTTAT TAATGGCGAG 1200

AAAGTTGAGA TCCATATGAT TATGAAAGTT ATGAGAAGTT A 1241 
(2) INFORMATION FOR SEQ ID NO: 946: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 1057 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 946: 



TAATACAATG ACACCATTTA GCATGACCGT TATCCCTGTA ATTCAGCTGA TATTATCTGT 60

TGCAATTTTA TGTGACGAAC TGTTGCACTT AATTTGATAA ATCnACAAAT ACAAAAAATC 120

TAAGTTGAAC AATTATGATA CAACCGTGCA AACGATATGT AGTATAACTT GTCAACTTAG 180

ACTTATTGAT AAATATATTA ATAGTGGTTT ACCATAGCAG GAGATTTCAC aTCaAAATTT 240

TGAAGTAGCG TATCaATCTT TGaATCaTCa ATATATACCt TATGTAAATT TTTCaTATAC 300

ATCGAATGAG AAAGTGCTTC aTAATTTAAT GAAAAAGATA TATGATCTCC AACTTGATAG 360

TGTCCTTGAC CATTTAAATC AAGCATTAAA TGATCACTCG AAGCGCCTAA AATATTGATA 420

TGCTGATCCA TAGGTGAAAT ATTATCGACT TTTGTAyCTA AATAACCAAT ATCTACAATA 480

GCTTGTAAGA ATGATTCATG CGTTTGTGTA TTAACTCGAG GTTTAATTTC TAAAATCTCA 540

GCCTCCAATG TAATCGCATC TTGATATAAC ATAGCGATCG CTTGATTTGT CGTTGTATCA 600

ACACCTCTAA ACAACGTTTC ACCTATTCGC AATTCATTTA TTTTACCCAA ATCATTATAT 660

AAAAGTTGTG GTAACATGCT CGAATTACCA CCTGAAATAA TTTTCAATCG ATATCCTATT 720

TCTCTTTCAA CAGCTGAGAC GAATCGATTA ATCATAAAGA TATCATCATC ACTTGGCGCA 780

TCAGATTTAA AACACATAAA ATTGAATGCT AAACCTACAA AATGGATATT TTTCAAGTGA 840

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CAATCTACCA TTAATAAAAT CTTATGTTTT TTTCCTAAAA CTTCTGgCTA CTTCATTTAT 960 

TTGATGTATG GGTAGATAAT TCCGGTTTGG GATACCTCCA TATCnAACCT TTTTTCCTAA 1020

TCAATATCCT GGAAAAnCCC TTTTTTTGGG CAGGGCG 1057

(2) INFORMATION FOR SEQ ID NO: 947: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 649 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 947:

GATATCCGCT CGATGAATAT ACCTTTAAAG GTGTTGGTGC AAATTCTGAA CAATTAAGTG 60

CTATGAATCA TGATTCTTTA AAAAAGGAGT ACATTTCAAA TGACTGTTAA ATATAATCAA 120

AATGGCGAAT TAACAATGGA TGGTATTAGT TTAAAAACGA TTGCACAAAG CTTTGGTACA 180

CCTACCATTG TTTATGATGA ACTACAAATT aGAGAACAGA TGCGCCGTTA CCATCGCGCA 240

TTTAAAGATA GTGGATTAAA ATACAATATT TCATACGCCT CAAAGGCATT TACTTGCATT 300

CAAATGGTCA AACTTGTAGC TGAGGAAGAT TTACAGTTAG ATGTTGTTTC TGAAGGTGAA 360

TTATATACAG CTTTAGAAGC AGGTTTTGAA CCGAGTCGCA TCCATTTCCA TGGTAACAAT 420

AAAACGAAAC ATGAAATTAG GTATGCTTTA GAArATAATA TCGGTTATTT TGTTATAGAT 480

TCATTAGAAG AAATTGAATT rATAGACCGC TATGCTrATG ATACGGTTCA AGTTGTATtA 540

CcGAGTTAAT CCcAGGTGgT TGrAAGCcAC mTrCaCaCgA aTTTATTCaA ACyGGGcaaG 600

AAgGATAGGT TAAAGTTTTG GGATTAAnCC nnTTTCCCAT TAATGGGCC 649



(2) INFORMATION FOR SEQ ID NO: 948: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 472 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 948: 
TACAAAGCTT AAAAACAGAG CTTAGTyTCG TTGAACCATT AATTAGCCGC TTAGAATTAG 60 


AAGAAGCTAA TGATAAACTA GCTAATATCA ATGATAAGTT AGATGACATG TATGATTTAA 120 
TTGAACATGA AGTTAAAGCT AAAAATGATG TCGAAGAAAC AAAAGATATC aTTACGGATA 180 
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AAAACTACTA TATAAATGAA TCTGATGCTC AGAGTGTTGG TCAATTTGaA AATGrAATTC 300 

aAAGTTTAAT TTCTGTATAT GATGATATTT TAAAAGAnAT GTCTAAATCT GCTGTGCGAT 360 

ATAGCGAGGT TCAGGATAAT TTACAATATT TAGAAGATCA TGTCACAGTT ATTAATGACC 420 

AACAAGAAAA GCTACAAAAT CATCTGATTC AATTGCGTGA AGATGnGCAG AA 472 
(2) INFORMATION FOR SEQ ID NO: 949: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 949: 

AGAGGATCCC AGGCGTTTGT TGATCCAGCA GCCGATnCAG AATACATCTT CTATGCATAT 60

AAACGTTTCA ATTTCGAATT GAAAGATCCA AACTTCACTA GTGAAGAAAA TATGTTTAAA 120

GATGTATCAG ACAAACCATT AATACCTGCT CGTAAAGCTC AAATTACAAA TGCGAACTAT 180

AAACGACATG GTATGAAGTT GATGATTCTT GGGAATGACG AACCATATAA CAATGAGTTC 240

AAGAAAGATG ATCGAAAATG CGAAGAATGA GAAATTTCAA AGGTGCGCAn GTCAAGCCAT 300

GTGTCAGGGG GGCATATGAT TTCnGGTCAT CAGTAAGGAC TGTGTCATGC CATGGCCATC 360

TGTTGTACnG ATCTCTAATA GAATCTCAGA TGCCCGGCAA 400

(2) INFORMATION FOR SEQ ID NO: 950: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 556 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 950: 

AAAATCTTCG ATGACAAAGA AATCTAATGA TGTTTCTTTA AAATGACCAA ACTGCAACGG 60
AATGATATAA CCACACTTGA GATATGGCGC TTCTTTTTTC AACTTAGTCA TCACATCATA 120

ATCCAAAGAC ATCACACGAT ATTGATGTTC AACACCATGC TTTTTCAAAA TATCAATAAC 180

ACoTTGTGTA TAATCTGCTG GTTCTTTACC ATGTGGCTTT AACTCTACTA GTAGCTTCAC 240
ATTTGATTGT TTAGCCGTTT CAATAAATTC GTCTAAGGAT ACAAATTTTG CTTCATGTCC 300

ATTTTGACGC ATTTTCAAAC CGACGATATC TTTGAAATTA GATTCAGAAA TATTTTTATT 360

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CATAATTGTA TCTAACTCAA CGTATTCGAC ATTCGCTTTT GCAGCAGCTT TCAATGACGG 480 

AATAGAATTT TCAACACCTT TATCTTCGAA ACCACGTGGC CAATAATGGA GATATTGTAA 540

TTGATAGTAT TATTGG 556

(2) INFORMATION FOR SEQ ID NO: 951: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 469 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 951:
TTGTGGTAAT GATGTTAATC AGTATATTTC TACTTTTGAC AATGACATCT AAAGGATTAA 60
GCAATCTTAG AGTAATAGAT GATGAGGCAA ATATCATTTC TTTTATTACT GAATTGAATT 120
ATATTAAGTC GCAAGCTATA GCAAATCAAG GATATATCAA TGTTAGATTT TATGAAAACA 180
GTGACACTAT TAAAGTAATA GAGAATAATA AAATACGATT TCTAAAATTA AAAGTAGGCA 240
AAATAATTAA TGTTGCAAAA GTTGATATTA TTGCCTTTGA TAAAAAAGGG AATATCAATA 300
AATTTGGTAG CATAACAATT TACAATAACA ATTCAATTTA TAGgAATAAT ATTCCATATT 360
GaAAAAGGcA AGAATTCGTT ATGAAAAGCT ATAAGTGTAA AGGTTCATTC TTAATAGATA 420
GTATGGCTGG nTTTTTGCCA ATTGGGATTG nATTACATTA CTATTGATT 469

(2) INFORMATION FOR SEQ ID NO: 952: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 730 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 952: 

ACATnTATAA TnCTTATTTG CTTGCTCCTT TCACTTTTTA AGAGTAAAAn ATAACCTGAA 60 

ACATAACTTT AACATCCAAG TTTCAGGTTA GTGTAATGTA AATGATTTAC TTTGTTnTAT 120

TATGATTAAA CATCTAATTC TCTGCTTGTA AGTGCTTTAA TCATCTCTGG ATCAGCATGT 180

GAGAAAAATT GGCTATTACT TTCTTCTAAT GTCGCAATTT GTTGTTTATC TTCTTCTGTA 240

AGTTCAAAAT CAAATATATC TAAGTTTTGA GCCATACGTT CTGGATTTAC TGATTTTGCG 300

AGTACAACAA TGTCACGTTC TACTAACCAG CGTAAAATAA CTTGTGCAAT TGATTTGTTG 360

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AATGGTGCCC AAGCTTCTAC CACAACATTT TCTTGTTGAA GTGCTGCAAC TTGTTCTTCT 480 

TGTTGATGGA ATGGATTAAT TTCTATTTGG TTCACTTGTG GTTGAATTTG ATTGTGAATA 540 

CCTAAATCTA CAATTCGATC AACGCCGAAG TTAGATACAC CGATTGCTTT AAaTTTTACC 600

ATTTTCtTTT AACTCCTCCA ATGCACGCCC ATGGAGCCAT ATACnGCCAT TGGTAAAGGT 660

TGGGTGGAAT nAAGCACCTA GGATCCAGGA TAGGTCCCAA ATTTTAATCC GGTGGGnAAT 720

GGATCGGTTC 730

(2) INFORMATION FOR SEQ ID NO: 953:

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 953:
ACAAATGACA ACGTACAATG ACGTCTTCAC AAACATGCAT TTTAGGAGAT TGATTATTAC 60
AAATATCCAG TTTTTTGTTA CATCTATTTG CAAATCTGCA CTTATTAACG TGAAATGCTT 120
GGATTGATGG CACAATTCCT TCTATCGTTT CAAGTCGTTT CTGCTCTTCT TTAAGTTTTG 180
GAATTGTTGA TAATAATTTT TGGGTATAAG GATGTTTCGG ATTGTGTAGT ATTTCTTTTA 240
TGCCTCCAAA TTCTACAATT TGACCTGCAT ACATAACTAA GACTTTATCG CAAAACTCAG 300
CAACTACACT CAAATCATGT GTAATCATCA TAATTGCCAT TTGCGTTTCC TTTTGCAATT 360
CTTTTAATAA GTCTAATATT TGCGCTTGAA TCGTGACATC 400

(2) INFORMATION FOR SEQ ID NO: 954:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 954:

TAAATCTTGA GGACGTTTGG TCTGAAAGCG ATGCTCAACT GTTGATAAAT CCAATTCGAT 60

AACATCTGTA TAATTAGGAT CTTCTTTCTC AACATCAAAn ACATATGGTT TTGTTTCAAA 120

TATTCTTTTA CTAGCGCGAT ATGTTCGTCT GATCTACCAG TTAACTTCAT ATATTTAAGA 180
ATTCTCATCA ACTGGGGAAG AATCCGCAAG TTGCTCCATA CTCTGGTGCC ATGTTTnCAA 240

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(2) INFORMATION .FOR SEQ ID NO: 955: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 955: 
AAACATTTAC ACAAGTAGTA GAAGCAATTT ATGATGAAGA AGGTAACAGC TTaGamgCGG 
CGCGCCATCC aTTACAAATC GTTCAAATTA AAGTGGATCG CCCGATATAT CCAAACAACA 
TGATGAGAAA GGAAATTGGC TAATGAAAGC TACTACAATC ATTGGCATAG CTGGTGGATC 
TGGCTCAGGA AAAACAACTG TAACTAACGA AATTATGAAA AACTTAGAAG GTCATAGTGT 
CGCTTTACTT GcTCAAGATT ACTATTATAA AGATCAAAAG CACTTGACTT TCGACGAGCG 
CCTAGAAACC AATTATGACC ATCCATTTGC ATTCGATAAT GATTTATTAA TTGAAAATCT 
TAAAGACTTG AAAAATGGTA AAGCAGTAGA AGTACCGACA TATGATTATG CtAGTCATAC 
AAGAAGTGgA CATTACCATT GGATTTTtAA ACCTAAAGAT GtTTATTTAT CGTAGAAGGG 
CCTTTTTCGC CTTTAGGAAA ATAAGGGTAT TTAC 
(2) INFORMATION FOR SEQ ID NO: 956: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 388 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 956:
ATTTTAACGA ATATCTTGCC GTACGATGCC CAAGCTGCAT CTGAAAAGGA TACTGAAATT
ACAAAAGAGA TATTATCTAA GCAAGATTTA TTAGACAAAG TTGACAAGGC AATTCGTCAA
ATTGAGCAAT TAAAACAGTT ATCGGCTTCA TCTAAAGAAC ATTATAAAGC ACAACTAAAT
GAAGCGAAAA CAGCATCGCA AATAGATGAA ATCATAAAAC GAGCTAATGA GTTGGATAGC 
AAGCCATAAA GTCTCACCTG AATGACGGTC AAGTGTATAG CATAATAGTC ATGCTAAGTT 
AATGAGTCTC AATGTGTAGG GCACAAGTGC GGGCGTCTAT GCATGAATGT TGCCACGCTC 
ACAACTGGGA AGTGTAAATG TGACTGTA 
(2) INFORMATION FOR SEQ ID NO: 957: 
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(A) LENGTH: 394 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 957: 
AACAAAGTAC ATCGCAGTTA CGAGAAGAAA TAAATGCATT GTATAATCAA GTTATAGAGG 
TTGGACAAGC GTTATCAAAG CAGCGTCGTA TAGTCGCTAG GGAGTTAAGw GACCACATTG 
TATCGGAAAT TCAAAACTTA CAAATGAAAG ACGCAAATCT TGAGATTTCA TTTAAAAAAT 
TAGAAGAACC GAATATTGAT GGAATCGAAT TTGTAGAATT TTTAATCAGT CCAAATAAAG 
GGGAACCATT AAAAAGTTTA AATAAAATTG CGTCAGGTGG AGAACTTTCT AGAATTATGT 
TAGCGTTAAA AAGTATTTTT GTTAAAtCGa GAGGTCAAAC TGCAATTtTr TTcGATGAGG 
TTGaCTCAGG TGTATCtGGr caAGCTGcAC CAnA 
(2) INFORMATION FOR SEQ ID NO: 958: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 958: 
CCGCCATTTT TTTCGATTTC TTTTTCAAGT TTTTCACGGT CTTTTGAAGA TAGACTATTG 
AAATCTTTTG CACCACTATC ATCAAAATCA ATATCTGCTT TCAATTTTGT GCTAGATTTT 
AAAATGAAAT TAATTGGTTC TTCAGcATAt TTGATGCCGA TATyTAACGT AGATTTCTGA 
GTGTTATTyT TTACGTCAGA ATCTATATTA TTTTCAAAAG TGAATTCATT TTCGTCGCTA 
TATTTATCTA ATGCGACAGT GATTTTACCT TTATCTTGAC GTTTTGTGCC ATCTACTTTT 
TCTTGGTTAT CTAATTTGAT TTTTGATTCA TCATATTCTG TCTTTTTACC AAATTCGTAT 
TTATCATTAT ATTTATtATC TTTwTCTTTA GAAGATACGC CTTTAATTGT ATATTTCGCT 
TCAGCATACG TGTATTTATC TTGATCGAAA TCAAGTGCGT AATCTAGTTT TAACTTATCG 
TCTTCTAAAG TATTAGTACC TTT 
(2) INFORMATION FOR SEQ ID NO: 959: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 307 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 959: 

TCAAACGTAC AGAGCTTGTT AAGTGTTTAG ATTATTTACG AGAGAGCGAC ACATTAGTTG 60 

TCTATCAACT TGATCGGTTA GGTAGAACGA CAAAACTATT AATTGAATTA TCACAATGAT 120 

TCGATGATAA CGGAATTGAC TTACCAnATT AGTAACCTGA ACATTTCCAC GAAAGACCCC 180 

ATGGGGCCAA AATGTTTTTA CCGATGATGA GTGCCTTTTT CCGGGTTAGA AGTTAATTTA 240 

CTATGTGAGT GTATAAAATA GACTnAGCAG CCACAAGnCG AGAGGCCGAA AAAAGCGGGC 300 

GCCCTCT 307
(2) INFORMATION FOR SEQ ID NO: 960:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 960:
ATAAGGACAT TTATAGTCGC CATATACGAC AACTAATGGT TTGCCGTTTT TCGAAGATGT 60
CGTTGCTGAT TCTTTTTTAC CGCAAGCTGT TAAAATTAAC ATGCTCACTA TAAATAATGT 120
TAGTAATTTT TAGTCCATAA TTACTCCTAA TCCTAACCAn GTCnGCnAAA TTTCCTCCAT 180
GATAATCCGC ATCATGTTGT TGATAATATC GTCTCGAATC AAAATCTTGC TTATATGATA 240
CACGTTTATC ACTAAAATCA TATGCATAAT AATGCACTTC ATCATCATTA CTCAAAGGTT 300
TATAAGCTAA AATGATAATT TTATCCTTAT CATAGACATA AATATTAGCA TCTTTGCGTT 360
CAAAATGATC AACACGATCA CCGTGCCTTT TTTGGCAAGT 400
(2) INFORMATION FOR SEQ ID NO: 961: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 961:
ACAATTAGTT ATACCCAAAA GAGTTATGTC AGTGAAGTAG ACAAGCAAAA CTCAAAATCT 60

GTTAAATGGG GTGTTAAAGC AAACGAATTT GTTACGCCTG ATGGAAAAAA ATCTGCGCAT 120

GCTCCTGATA ATCAATTGCC ACCTTTAGTT CAAAGTGGCT TTAATCCATC GTTTATCACT 
ACACTATCAC ATGAAAAAGG TyCAgTGATA CGAGTGaATT TGaAATTTCa TATGGTAAGA 
ACTTAGATAT TACCTATGCG ACTTTATTCC CTAGAACTGG TATTTACGCn GGAAGGAAGC 
ATAATGCATT TGGAAATAGG AACCTTGTAn TTAnGTATGG 
(2) INFORMATION FOR SEQ ID NO: 962: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 358 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 962: 
CTTTTACAAA GACAAATTAG CGGTAATCGG TAAnGATCGT TGTTGATCCA GTTGCACTAT 
TGAAAGAATT AGACGGATTA AATGAACGTG GCATTCCTAC AAGTAATTTA CGTATATCTA 
ATCGTGCGCA AGTGATTTTA CCATATCACT TAGCACAAGA TGAATATGAA GAACGTTTAC 
TGGTGACAAT AAGATTGGTA CACTAAAAAA GGTATCGGTC CAGCATATGT AGCCAAGTCA 
CGTATCGGTA TCGTATGCAG ATTACTTGAA AAGAACATCG AAGATATTAA ATCAACATGA 
TATAACCAGC TATTCAAGGA TGTTACGAAC ATGnCACATT GATGAAnCTT GAGATATA 
(2) INFORMATION FOR SEQ ID NO: 963: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 258 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 963: 
AGTGAGCAGG TTGAAGTAGT AGGGCCGATT CAAATGGAAA GAGATACACA TAAAGATGGT 
AAGGTAAAGT GGCAATTGCT TTATATAATG AATCAGGATG ATGATGAAAT TAAGCCACCA 
TTTTTTATTC AATGGGAGGA AAGTGATTCC ATTGCGTACT AAAAAATTGC AAAAATATTT 
TCAAAACCAT TTTCAATTGA AACTGTATTG TGAAAGTAAA ACCGTCACAA CAGTATCGAT 
TGGTTGAAnT GGTTGnAT 

(2) INFORMATION FOR SEQ ID NO: 964: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 388 base pairs 
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(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 964: 
TTATCAAACC GCCATAGCCA CATCTCGATT CAAGGATAAA TTTGTTTTCT CATATCTTAT 
CTCTCATGAC AGTGCACGAT ATATATCTTT TCCAAATATT CATCACTTTT TTCCTGTTAA 
AATCATTTCT ATTATTAATG CTTTAATGAC AACTTATTTA ATTTATTGCA TTTAATGCCC 
TGCTTCATTT TCAAATATTA ACTTTAATCG ACTAGCTCAT CACACTATTT ACCCTTTCCA 
CCTATCACTA CTGAATTCAA AACGATTTGT TTACAAATCC CCTTATGCCA ACGGTCGAAT 
TTCATCATGT TTACATTTAA TTGCAAATGC CCTCTTGAAA TATTTTAAAA TTAATATTAT 
TCTTAGCATA CTAGTCGGAA TTATATTC 
(2) INFORMATION FOR SEQ ID NO: 965: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 602 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 965:
AAAGTGAGGT ATAGTCCGAC CATACTCTAA AAACGTAGCG AGATAAATAT ATTTCAATCC
TAACTTTTAT GTTTTGAGGC ACTTGCCATT TAGGATATTG TCGTTCGTAA TACGACACTT
GTTGTATAAA TACACCTAGT CCAAATGGCA GCATCATGAG TAAGATACTT CTTAAATAAC 
TTAAACCAAT ATCATGCCAT ATGTGTCCAA TAATCAATTG AAAGACAATG ATAGATACTA 
TTAAAACGAT TATATTTATT GTCACTTGTT CAAACGCACT CCTTTTCCAA ATAATAGAAT 
TGCTGCTTGC ATGACAACCA TAAAACATAC AAACATAGCA GTTTTAAGCG TTAGACTTTC 
TAGAATGTGa TTTAGAACAT GTAAGGGCTC ATTAAAGAAA TAAACGGaAT GTAAGCGTAA 
GAAACGACCA ATATAAATTC CGAATCCATT TAAAAACATT AGcACGaCAA CrATTAATCT 
ATTaAGCCAA CGGTGaGaAG TCAATGtTAG TATTTCAAAA TAGATTnAAA TCATCACATA 
AACCGCTAAG AAGACACCAA GCAGTAAATA GGTAAAGTAT TTCCACTCAC TTAAATTTAG 
TC 

(2) INFORMATION FOR SEQ ID NO: 966: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 744 base pairs 
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(C) STRANDEDNESS : double 
<D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 966:



GTCCTAAGGT 


AGATTTATCT 

A A X1»X WX 


ACAA cmTTfl 

nwin^> x sj x X w 


X IKi l 4 X rti l w 


rtA 1 X/\X X^TJK 


wttffti X X XAX 


D u 


TAATGTTTGC 


TAGCATATAT 


TTTATnAAAA 


X X W X VJ*»V.~ *V X w 


Jl T*21 ( ~ f w [ " I ' 21 <T 

/i X rt W 1 X X ,nw> X 


21 21 G 21 & 21 CTTT 




TAGAAATTAA 


ATTTAATTCG 


AAATAGTATT 


TTCAATGAAG 


X w X X Ulftftw 


w x nrtnL. X X X v. 


i an 


ATGACATTTT 


TTTGAAACTA 


GAGAAAATGG 


^"T l O IV TV 21 Xlf""F 


TPTAftPrOAZl 
x w lnvj\, uVjrtrt 


X www IXLiiO 


o^n 

«4U 


AAAATG^ATT 


AATGTTGAAT 


iWi^l a www 


t\\3 X LA 1 nLUn 


ZIPZIZITHTJIZIT 




inn 


ATAAATTTGT ATCGGTGCAC GTTTATGTTT TAACGGAAAA AGTATAGTTT TTGGGTTGAT 360

GTAAATAGGA ACCAGTTTAT TAATATTTAG TATGTTTTTT GCAGTCTCAA TTAGCACACT 420

TTGTGATGAT AAATGCAATT TCGCATAATA TGrAACGAGT TTGTTAATTG GAACTTTAAG 480

AATGTAATCG CAATGTAAAA AGACACAATT TGTTTCGAGA TGGTTTGAAG TAGCAGTTTG 540

GATATAAAGT AGATATTTAG TAGAATTGTC TTGCAAATAT AGTACTCCTC GTTTAATAAG 600

TTGTTAATTT AATGGTTAAA ATAAATTTAT ATGAAGATGT TAGGGTGTTT CAACATCATT 660

AACACATCGA CCGAAATTTC ACAATAAACT AATTAAAATT ATGTTGGCAT TTACGCTTAA 720

CGGACATCTT AATTAATTTA ATCG 744



(2) INFORMATION FOR SEQ ID NO: 967: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 503 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 967:

TACTTTGCAG GAGCATTAGG TATTCTAGCA TTTATAATGT CAATTGTATT GATTCACGAT 60

CCGAAAAAGT CTACGACAAG TGGTTTCCAA AAGTTAGAGC CACAATTGCT AACGAAAATT 120

AACTGGAAAG TGTTTATTAC ACCAGTTATT xTTAACACTTG TATTATCGTT TGGTTTATCT 180

GCATTTGAAA CATTGTATTC ACTATACACA GCTGACAAGG TAAATTATTC ACCTAAAGAT 240

ATTTCGATTG CTATTACGGG TGGCGGTATA TTTGGGGCAC TTTTCCmAAT CTATTTCTTC 300

GATAAATTTA TGAAGTATTT CTCAGAGTTA ACaTTTATAG CyTGGJcCaTT AwTATATTCA 360

GTTGTTGTCy TAAwAwTAwT AgTTTTtGCT AATGGACTAT TGGGTCAATA ATGGTTAATC 420

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TTTTCCnAAn ATTGCnGGGG AAG 
(2) INFORMATION FOR SEQ ID NO: 968: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 465 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 968:
TACAGGTAAA AAAGGTGGTC CGTTTATCGA TTCATTAAAC CATCGTAGTT ACACGAAAGC 
ATATGTTTCA TCAGAAACCG ATATTCCATT TTATGCTGAA GCATTTGGAA TTAGAGAAGA 
AAATGTTGTA CCAACAGGTG TACCACGTAC TGaTGTACTA TTTGATGAAG CTTATGCAAC 
ACAAATTAAA CAAGAGATGG AAGATGAATT GCCAATTATA AAAGGTAAGA AAGTTATTCT 
ATTCGCACCG ACATTTAGAG GTAATGGTCA CGGTACGGCA CATTATCCAT TTTTTAAAAT 
TGATTTTGAA CGTTTAGCAA GATACTGCGA GAAGCATAAT GCAGTTGTGT TATTCmAAAT 
GCATCCGTTC GTAAAAAATa GmCTTAATAT TcaCgTGaAC CATAGACCAT ACCTTAATCC 
GATGTGGTCC AGATCCATCC GTGGAAAGTT AAACGGATAT TCCnC 
(2) INFORMATION FOR SEQ ID NO: 969: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 300 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 969: 
AATTCGGAAC AACGAGCTGG CAACAACATA AGATGACAGA GGCGAATGGT CAATCAATTC 
AAGTTAACTT AAATAATGCG CAAnGCGATT TGGGATATTT AACTGCTGGT AATTACTATG 
CAACAAGAGT GCCGGATTTA CCAGTAGCGT TGAAAGTTAT GAGGGTATTT ATCGGTATTC 
GTAAAGATGA TACAACAACn ATTTACTCAC ACCTATACTC TAAAAGATTn CACACGTCAT 
CACAACGGCG ACTTGAGCAA CATGGACAGT TCCTAATGAA CATAATCAAC GGTATTGTTC 
(2) INFORMATION FOR SEQ ID NO: 970: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 970: 
ATACGCGAGC GAACGGTGAA GCGACTTGCT GCTGCAAAAC GTCTGCGACC TTGAGCAACA 
ACATGAATGG TCTTCGGTTT CCGTGTTTCG TAAAGTCTGG AAACGCGGAA GTCAGCGCCC 
TGCACCATTA TGTTCCGGAT CTGCATCGCA GGnATGCTGC TGGATTATCA TCCGATTTTT 
AAAACGTTAT TAGGCGTTGT CGTTGCAATT CAATTTACAG TACCTGCTTT GATTGGTGCA 
TTGGTTGCCA TGAAATTTGA CCTATCGCCA TTAGCAATTG CTGTTGTGGC AAGTGCAGCA 
TATGTAGGAA GCGGTGCAGC GCAGTTTAAA AATGGTGCTT GGATGATTAC GGGTATTGGT 
GATTTAATnA ATACAATGAT TACAGCAGCG ATTGCAGTTG 
(2) INFORMATION FOR SEQ ID NO: 971: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 251 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 971: 
AAGTGTAAGA TTAGAATAAA AAATTTATAA ACGAATTGAG AAAGACCAAA GCTAGCAATT 
ATTCAGAAAT TAAATCGAAG AGTCCTACAG TTAATGACTT ACCTGAACGA GATTTATCnA 
ACAAATCTCA GTATGACCAA TATGGCAAAT GCCGTGTCCA TGTTAACTGA TGAAAAGTAC 
TTTGGTGGTA GTTTTGAAAG ATTACAAGCA TTGACGACAA AAACAACATT ACCCGTATTA 
TGCAAAGnCT T 

(2) INFORMATION FOR SEQ ID NO: 972: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 414 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 972:
TGAAGCTCTT aATAAATCTT TTCTTGAACA CACACCAACA AAATCGTTAT CTTCATTAAT 
AATAAATAAT GTACTTACAT CTTCTAAAAA AATTGTACAA ATAGCGTCAT AAACTGTTGT 
ATTCTCTCTT AGCACAACAG GTTGAGACAT ATAGTCCTTA ACCTCAAATT GTCGAAGTTT 
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TTCTAAAAAA CCTGACATTG TTAATATCGC TAAATCTGGT CTAAGCGTTG CTCTTGTTAA 300 

ATTCAACTTA TCTGCTATTT GTTCACCAGw AATGGGTCCT TTAGTTTTAA CAATTTCGAT 360 

GATTCGTTCT TGTCTTTGAC TGAGTTCTrw AGGkCTTCAC CCCTTTtTAT GCnC 414 
(2) INFORMATION FOR SEQ ID NO: 973: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 747 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 973:

GAAATCGATA CTAATATCTT CTTTATTTAC ACCTGCTAGT TCAGCTTCTA AGTAATATAA 60

TTCGTCTGTt CATAtATATt CGTtACAAaT GAAGGTGTtG AAAaGTAaTT AAaTaCTTGT 120

TTaCCTAAAT CTTTAAaTGT ATCACTTGGA TTACCGTTAA AAAAGTTTTG ATTCTCGAAT 180

TGATTGAAGT TCAtATCATC ACTTCCATTA TAGAAATTTG TATTACAAAG CCATTATATG 240

AGTAGTATTA AAAAAGTTAA AGTAAAAGGG GAGATAAAAA AGTAATAAAT GGGTAGATTT 300

GTGCCTGATA AGGAATCAGG GCAAAAGAAC TGAGCATCGA GATTTATACA AGCAATAAAG 360

ATAATGAATA AACATATCGA TGCATACTTC TGCCCATCAC CTTATGATAG GGAGTTATTT 420

TTTGTCGGAA TAGTCAAAAT TTTTAGAAAC AAAATTCCGC AAATTATTAA TTATATCTGA 480

TGCTTCTCGT TCTGAAATGT TGAATTCATT AAAGACcTTA ACAGAAATTT CAGCAAGAGG 540

GCTTTTTATT GCTTTACCTT GTTCTGTTAG AGAAATTTGT AGGTTTCTTT CATCTTTCTC 600

TTCACGTGTT CGAACAACGT AATCTTTCTT TTCTAATTTC TTTAGTAATG GJcGTCAGTGT 660

TCCAGAATCT AAGAACACAC GTTCACCTAA TTTTTTGATG TTAAGTTTTT CATCATTTTC 720

AATCGCCAAT TAAACCAnGG TAnCnGG 747



(2) INFORMATION FOR SEQ ID NO: 974: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 257 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 974: 
TGCTTCCAAG TAATGTGGAC ATTTAATACG AGCTGGGATT GAGATACACC TTGCAATGCT 60 
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TCTTGGnCAT TTGCATTGCT TCTCCCAAGT GAATGCCTTT AACAGCTTGT GTCATCATCG 180 

ATGACTTGCC ATTGAAATCG ACAACCTTCA CCTTCAAACT TAGCATCTTT TATAATGCCG 240 

nCTTCTATAT CAAATGT 257 



(2) INFORMATION FOR SEQ ID NO: 975: 

(i) SEQUENCE CHARACTERISTICS:
(A) LENGTH: 451 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 975:

CAATGGAGCA TATTTTACGT TCGGCAATGG ACATTGATCA nACAACAAAA CATTATATAA 60

mTAATAAAGT AGAAGTTATC AAAGAAATTA TTAAAAAAGT AAGAGGGGGA AATCACGTGA 120

CATCACAAAA TCCAGAAGTT AATTACGAAG CATTAGCTAA ATATGGCCGC GACTTAGTAG 180 

AAGAAGTTAG ACAAGGTAAA ATGGATCCTG TTATAGGAAG AGATGAAGAA ATTCGAAATA 240 

CGATTCGTAT TTTAAGTCGT AAAACTAAAA ACAACCCTGT GCTCATTGGT GAACCAGGTG 300 

TTGGTAAAAC TGCAATTGTT GAAGGATTAG CGCAACGTAT AGTTAAGAAA GATGTGCCAG 360 

AATCATTATT AGATAAAACT GTTTTTGAGT TAGATTTAAG CGCATTAGTA GCGGGCGCTA 420

AATATCGTGG TGAATTTGAA GAGAGATTAA A 451 
(2) INFORMATION FOR SEQ ID NO: 976: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 976:

AACTATTAAT AATATTAAGT TCACTACAGA TGTTGCTAAT GGACCATAAG TTTTAAAGAC 60

ATCTTCACTT TTATAACCAA CAATCGCATC TAAAAATTGA ACTAAGATCA TTGCAATGGA 120

TATAGTTATC AAAAATATAG CACTATGAAT GACTAAAGAA AnAATAGCTA ATAAaAATAA 180

AGGTAAGCTT CGACTAAGTG cATAATATGC ATTTATATTA TGGCTAGATG CACATGCTTG 240

AATTGAATAA CCTAAACTTA CACTGGCACT GATTATTGTA AATATTGCTA AAACAAAATA 300

CATGTTAATC CTTCTTTCTA TATTTGGATA TAAACAAGTA CTTGTCTAAA GTTATTTAAA 360

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TATAACTATG TCATAAACTG AATTTGTTGA AATTTTTCAT TATGCAAATT TATTAATAAC 460 

AAACAGCTCG AACTATAGCA TCATTTTACT AATGAATGCA TTAAAGTAAC TATGACTAAA 540 

AATGCATATT AATTATCATT ATTAAGACTA T 571 

(2) INFORMATION FOR SEQ ID NO: 977: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 977: 

CCCGAAAAAA TTTAGACATT TTCAAACAGT AACATTTGAA GATTTGTCTA AGCTGGAAAA 60 

GAGTAGTATG CCATCAGGGG TCGGTATATT AGGCTATGCT GTTGACAAAG GTGTTGCTTT 120 

AAACAAGGGG CGCATTGGTG CAAAAGAAGG ACCAGATGCG ATTAAACAAG CATTTGCAGG 180 

TTTGCCGGAT TTGAATCCAT GTGnACTTTA GTCnGATTAC CGGAATGTTT ATCCTGGTCA 240 

TGGGGGAATT AATTGGATAC CCCAAAAAGG AATT 274 

(2) INFORMATION FOR SEQ ID NO: 978: 

(i) SEQUENCE CHARACTERISTICS: 
30 (A) LENGTH: 877 base pairs 

(B) TYPE: nucleic acid 
{C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 978: 
nTnTTACTTC TT C TTTTAAT ATTTCTCGTA TAGCTTCAAA TTTTTCACTG TCAAACAGTG 60 
40 TTCTTAATAT CCCTTGCTTT TCACGACTGT TTGATATTAA AAAGCGTTTG AATTCACCTT 120 

GAGGCAAAAT AAACAATTGT CGGAATTGAT CTGCATTTAC TCCTAATAAT TCAATAATGA 180 
ATTGGGTACC TGAAATTACT TTACTTTCTC TAATTTCATA CTTGCCATCC ACCATCTCAA 240 
ATaCATCAAA TTTAGCGTTC GTTTTTGTTG TATTACCTTC TTTGATATAA GGGCCTTGTC 300 
TATGCACTTT ATAAATTCGA TGATTTAATT GAAATTCAAA TGTTACTGAC ATCGGCTGTT 3 60 

TACCATCAGC GAAATGACTT CTCAAATCAT TTTCTTCTCT TTGTTCAGTT GATGCTTTAC 420 
CAAACAAGGC ATAAGTCATT GCATCAAAAA TCATTGTTTT ACCCGATCCA GTCTTACCAC 4 80 

TTATTAAAAA CAATTCATTA TTATCAATTT TAGAAAAATC AATTTCTTCT TTTAAAAAGG 540 

55 



45 



50 



1775 



EP0 786 519 A2 



CGTTTTCTAA AATATTTTTT ATCTTTTTAG ATTGGATATC CgATAATTCT TTATCAGTTA 



660 



TATGTTTATA AAACATTTCg ATAATTGACA TGTCaTCTTT TTCACTTATT TCyATAGCAT 



720 



TATTTTCTTC ATnGTAATTA AAAGTTTCAT TCGTCAGCGC TAAAGTATTA GGATAAATTn 



780 



GTTTTAAACT CATCATTGGA TCAGTAATAT GAGACATATT TTTAAGTTTA AAATGTAAAA 



84 0 



TAATTATCGT TATnTTTCAC ATGAACTTnT TCATTAA 



877 



10 



15 



20 



25 



30 



35 



40 



45 



SO 



(2) INFORMATION FOR SEQ ID NO: 979: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 624 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 979: 

ATTGGTACTA TACCATTTGT CCATTTTAGC GAAGACGTTA AACTATCGAT TAGCTAGTGA 60 

AGAAATTAAT CTCACTATAC CTTCTATACA TAAGCAAATT AAAAATTTAG AACAGCATCT 120 

CAATGTGAAA CTATTTGAAA CTTATAAAAA TCAAATTATC TTGACTGAAG ATGGTCATAC 180 

ATTTCTTCCA ATTGCGCAAA GTTTTATTGA ACAATATGAA AGTGGCATCA AACATATCCA 240 

ACTTAAAAAA ACGATGTTTC AATCGAAATT AAATGTTGTC GTGTCTTCAT ATATCGCGAC 300 

ATTCATTATG CCCAAATTTT TAAAATCTTT CTTTAACGAG CATCCTTTTA TCGATGTTTC 360 

TCTTCATGTC AAAAATGAAA ATATTGAAAA AGATATTAAT AATCATACGT ATGACATCGG 420 

GATTAGCAGA AATCAACCTA AACTAAGAGA AGTTCATTCT GAAAAAGTAT GCGAAGGTAA 480 

AATTGTATTG ATTGCTCCCA ATAAAGAGAA TAATCATCTT CTAACTGAGG CATCTTTATT 540 

TGAAAAATAC AAAATAATAA GTGATAACCA TCCAGAATAT TGGGTCATCA CTTAAAAATA 600 

ATATTTTAAA TATTTATGAA AAAG 624 

(2) INFORMATION FOR SEQ ID NO: 980: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 457 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 980: 
TTTGTCCTGT TGCATAAACT GCAGCTAGCA TAATGTTAAT TGTTGCACCT ACGCTAACCA 60 
TkGATTCATC AATTTCAGCA CCTAAAGCTT TAAATCCTTT AATATGTTGA TCAATTGGAC 180 

GTGGCCCAAG TGGACATCCT CCCGGTAAAC CAATCACACA TTTTTTAAAT CTACCTAACA 240 
TGGCACCCAT CATATAGTAT GAAGCACGTA ATGATTCAAC TTTATTATTT GGtAATGCAG 300 

CATTTTGtAT TTCAGTTGTA TCAACTTCTA AwTCCGTACC ATTTaGTGaw GcCTTAAtAt 360 

TTAAAtCtkC tAAtAAACTC aCTAAAGTTT TAACATCAGA GATTTnGCGG TAACCCTTCT 420 

AATTTCACAT GTCCTTGAGC TAATAATGTT GCAGGAA 457 



(2) INFORMATION FOR SEQ ID NO: 981: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 997 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 981: 

ATTGCAGCAA CGCCCCCTGT AAAACTACGT ACTCTTACAT TTATGAAACC TGCTTCTTCA 60 

AACATGCGCT TCAACTCTTC TTTTCCAGGA AAATTAAACG TAGATTGCTG TAACCATTCA 120 

TATTCTTCTT TTGATTTTGC AAATAATTTT CCAAAAATAG GCATAACAAA TTTAAAGTAT 180 

AGTGCATACA TTTGTTTAAA GACTGGCAAA GTTGGTTGGC TCGTTTCAAG ACATACCACC 240 

ATACCACCTG GTTTAAGTAC TCTATTCATT TyTTTTAaCG CGACTAAATA GTCTGGCACA 300 

TTTCTTAATC CAAACCCAAT TGTTACATAA TCAAAAGAAT TGTCTTCAAA CGGCAATTCC 360 

ATTGCATCAC CATGAACAAG TTTAACATTT TCCATTGAAG CAGTTTTTTC TTTTCCTACT 420 

TCTAACATAT TCTCACTAAA GTCAATACCA GTAACTTCAC CTGTTGGTCC TACAGCTTTG 480 

CTTAATGCGA TTGTCCAATC ACCAGTACCA CAACAAACAT CTAATGCTTT CGTCCCTTTT 540 

CTAACACCCA TGTCTTTCAT GACGCGTTTT CTCCATACTT TATGCTGCTC AAAACTAATA 600 

ATATTATTTA ATCTATCATA TTTTTTTGAA ATATTTTGAA AAACGCGATG TACTTGCTCT 660 

TTATTTGCTT TATTGTCAGC CATGCTTAAT TACCTCTACT TTTTAAATAA 720 

TATCGTGTAA GTAATGCTTT ACTTCACTTT GATTATATTT CTTGAAGTAT GATGGATAGT 780 

AATCAGACAT ATCTTCAAAT AAATAATTAT ATATTTCCGA CTCATCAATA TTGATACCGA 840 

AATGAGATAA CGTAAtATAT GGgAAAAGTG tTTCAATTTT TACTATTGCT TGAGAAATTT 900 

CATAATCATT TAAAGCTTGA TGATGTAATG AAGATTTCAA TTCATTAATT TCAACAATTG 960 

CTTTACTAAT TTCATTTTGA AATGATAAAT CATTGAT 997 
(2) INFORMATION FOR SEQ ID NO: 982: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1118 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 982: 

CATTCCCTTT ATCGGnACAA TTATTTTAAA TAATTGGTTA ATAAGTTTTT CTATTAATTG 60 

CTGTTTTACT TTCTAGAGCA ACAATAAATG TTTTCAGTTT AAAATAATTA AGATTGTCAT 120 

TTACTAATTT GCTTTTTCCA TTCTTTAACC ATAAATCAAC TTGTAATTGC CACTCAATAG 180 

GATGATTTTT AATATAAATT TGCTCTGGAA AAATATAACC ATAATTGTGC ACTACTTCTT 240 

GATCAGTTAA CTGTAATTGA TACATTGCAC TTAAAGTGGG TTCTAAAACA GAATTTTGCC 300 

AGCGACAATA TTTGATATAT TGATTAATTG CGAACTTTGA TAATTTATAA ACAGCATGAT 360 

AATCACAAGG CGCCTCATTA AACAACTCAC TAATTTTGGC ATTTCTTTTT TCAGCGACAT 420 

ATTTGTGCCC GCCTAAATAT TGTATTTGTT GAAACATCAT TATTTGTTTT TTCTCTAAAT 480 

TGAACGTATG AAGTTTTCGA GTGAATGGAT TAATAAACAT ACTTTGAAAA TGATTGAACT 540 

TCACTTTATT ATGACAATAA TCAACATCAT TTAmTAACCA ACTTACTTTA TATCCCATTT 600 

TCTTkAAACC TTCCGTTCGT TGAAGAATCT GTTTATAAGG AATTGGCGAA TACTGTAGCT 660 

CAATAACATA TTTATTATTA ATCAAAATAT CCGGAATTTG TTTTATTTCT TTTAAAAATG 720 

GCTCTATCTC TACTTTACAT CCTTGTTGCT GTATCATCTG TGCTAAAATC AATTTACTTT 780 

TATAATGTTT AATCGTTTCA TTATTAAAAC ATTTTTGTTC CGCTAAATGT TTATGTGCAA 840 

AATGACTAAT TACTTTGAGC CCACGCTTTA AAATAACTTG CTTGCCACAC ACCGGACAAA 900 

AATATTGTGT CTTTCTCAAT GCAGTAGTTG CTAATACGCG TTCCTTTTCT TCATTTAAAG 960 

CTACTAACAT ATCTTCACCT CAAAATATTA CACGTAAGTT TAACTACTAT TACTTTTTCA 1020 

GCTTTGGAAA TCCATTAAAA AAAAGGGACC CCCAAAGGAG GCCTCCTTTC ATAnGTGCGG 1080 

TGGCCTAATT AAATAATCCG TTAATTCCAG GTGGnCnC 1118 
(2) INFORMATION FOR SEQ ID NO: 983: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 269 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 983: 

CATCCCCTGT TAAAAATTGn TATTTGTGTA TTGATGCTTC TCAGnCATTA TTACGAGTCC 60 

ATATTCATTG CTTCAGTGAA ATATCAGTAG CAATAACATT TAAATCCGGC TTTTCACATT 120 

TCAAGGTAAT TGCAGTACAC CACTACCCGT TTCCGATATC TACGGATTGT TGCATCATCT 180 

TCTAACTGTT GGTAAGAAAT GCAACCATTA CTGCTTCAGT TTCAGGTCTG GTATCAAACA 240 

ATTTGAGTTT ACATCAAACG GTCTACCTT 269 






(2) INFORMATION FOR SEQ ID NO: 984: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 299 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 984: 

AACCCAAGTG GAAnnGGTTT TGGTGGAAAA ATTTTAGGTC CTTTATCTGT CGTGCGAATC 60 

ATTGTATTTT TTAAACTTTC CGCATAATCT TGATGGTCGA TTAACTTTTC GCGATAACAA 120 

TATTCAATCA CTTCAGAAAT CGCTTGCTCT GATATCTCTT CTTTTTGTAA ATATTGTATA 180 

ACTTCTTTTT CAGTTCTCTT TTTATATGAT AAATATTGGA TTGCTTTATT TAAACCTATG 240 

CGATAATGAT CATACTTTTG AATCTCTGCC ATGTCAGCAG CnTCAAGTTG TnGCCCTT 298 
(2) INFORMATION FOR SEQ ID NO: 985: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 431 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 985: 

CCATGCGATT CCACCTATTA ATAGTAAGAC AATAATAATA GGAATTATAA mTTAACTT 60 

TTTAGACATT TTGCTTCCTC CCAATTATTT ATGAGTTAAT CATATCAGAA CAATACATTT 120 

TATTAAATGG TTTATTTTAA ATTTTGTTTA AATTAATAGA ATTTTAGTTA TAATAATGTT 180 

TAATAAGTTA TTGGrAATCy AATAAACTAC mAAAAATAGT TTGATTACaT AATGATTCtT 240 

GAAAAATGkT GGkTAACyTA ATAATATGCA TTTTTTkGGC GArGArGATT tATTtAACyT 300 

ATAAAAATAT kGAaGTAAGA TtGGGGAGAT tATGAATTtA TGGrATtGAA AGTCGATGAT 360 

TTTGTAAAGA ATATAAAAAG ACCATACTTG ACTGTATTGG GAGTATTTGT AGTTGCAGTT 420 



(2) INFORMATION FOR SEQ ID NO: 986: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 986: 

AAGGTTAGGn TGGAGTACCC AAAGTAGGTT TTAAAGGTTG GAAATACnAG GAGGATGAAT 60 

AACATGAATC AGTCAGTCAA ATTACTTAAA CATTTAACAG ATGTAAACGG CATTGCTGGT 120 

TATGAAATGC AAGTTAAAGA AGCAATGCGT AACTATATAG AGCCTGTCAG TGATCAAATT 180 

ATTGAAGATA ACTTGGTGGC ATTTTTGGAA AGAAAAATGC TGAGAATGGT CAATACTCAT 240 

TATGATTTCT GGTCTATGGA TGAAGTGGTT ATGGTAACAA GATGTAAACT GTTATTCATT 300 

ACGCGTGGTG TGTGATCAGC nGCACTCAA 329 
(2) INFORMATION FOR SEQ ID NO: 987: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 987: 
CATATTATTT nTATATGAGT TTACCAGTGT CAAAAAAACA ATTGCTTAAT GCCAACTATA 60 
TTACATGCAT TGTTTTAACA TTAATTGGAA CGCTTGTTAT TAGTTTATAT GCTTATGAAG 120 
CAGATGTGAT TGAACCAAAT TCAATCTATT TTTCAACTGC ATATGCATTT GTCATATCTA 180 
ACTTCTTGTC TATACCAATT GCATTTAGTC AATTTACAGA ATTGCGTAGA GTCAAAGTGC 240 
CATATGGTAT ATACGTGTTT ACTATTATCA TTTTAGTTCC ATTTTTATTT TCAATTGCAA 300 
TAGTATTGGT GAATTATTTT GTTCTAAGTC AATCATCATT CCCAGATTTA TATTCATATA 360 
TTTTAAATAT TGGTTTTCTA ATTATAAGCA TTGTTATACT TATTGTTAAT TATTTTAAAC 420 
AGCTCAATAA AATAAATACT AGAAAATTTA AAGGAGGCAG TCGATGAAAT TAGAACATAT 480 
TACAAAAAAA TACGGCTCAA ATGTCGTTTT AAATGATATT GATTTTGACT TTGGCGAnAG 540 
TAGAA 545 
(2) INFORMATION FOR SEQ ID NO: 988: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 607 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 988: 
ACCTAGCATG ACATAATCAA ATATATCAAA AACACCTTTC TGAACTGATT CAACCGTTCC 60 
AATAGTTAAT GGTCTTTCAG AATCAGATGT CTTTCCACCT TGAACAGTCA GATGACTAAT 120 
TAAAATATTG ACGGCATCTT CATCAATTTC CGGTGCTATT GTTTCAATAC AGCGCGTAAT 180 
TCCCTGTTGA TGTGTTTCAA TGGTGTCATC TTCAAAGTAG TGTTTCATTT CGCTCACAGT 240 

AGCATAAGGG AGTGTATAAA AATTAACCCC ATTTATCTCT ATTGGTGAAT TAATCGATGT 300 
AAAATCTGTT CTTATAAATA ACTGATTATG TTCAAACCAA CTCGCCCCAT AGTTTAACCT 360 
CTCTTTACCA TCGTGATTTC CACTAATAAT AATTATTGGT ATACGCAGTT CTAAATTTAA 420 
CTTTCCAATC GCTTGTTCTA ATAACATGAT TGCATCTTTA CTTGGATATG TTGTGTCATA 480 

TAAAcTCCAG CTATCACAAT GrTATCAGGT TCTTCTTCTT TCATTTTTTC TACGAACATA 540 
TCTAAAATAT ACGCTTGaTC TTCTAAAAGC TGTTTGCCAT CTAATATTTT CCCTAAGTGC 600 
CAGTCTG 607 
(2) INFORMATION FOR SEQ ID NO: 989: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 722 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 989: 

ATCGCGTTAA GCAACAATTA CAACAATCAT CATATAAGCT AACGCCACAA CGCGAACTAC 60 

TGTTAGAGTT CTAATTGAAA ATGAAAAAGA TCATCTAAGT GCTGAAGACG TATATCTGAA 120 

AGTAAAAGAT AAAGCGCCTG AAATTGGCTT GGCGACAGTA TACAGAACGT TAGAGTTGTT 180 

AGCTGAACTA AAAGTTGTCG ACAAAATTAA CTTTGGTGAT GGCGTCGCTC GTTTTGATTT 240 

AAGAAAAGAA GGCGCAAAAC ATTTCCACCA TCATTTAGTA TGTATGGAAT GTGGTCGTGT 300 

AGATGAAATC GATGAAGATT TGTTACCAGA AGTTGAAAAT CGAGTTGAAA ATGAGTTCAA 360 

TTTTAAAATT TTAGATcATC GTTTAACTTT CCATGGTGTG TGTGAAACGT GCCAAGCTAA 420 

AGGTAAAGGA TAGTAAATTG CGTAGGTTAA ATTAACCTTC GCTTTTTTTA GAGGTGTGGT 480 
TAACACAATT GGTGCTTATA GACGTGATTT GAAAAAGTAT CAAGATTATA TGACTGAACA 
TCATATCTCG CATATTGATT TTATAGATCG ACAATTAATT CAAGAGTGTn TGGGGCATTT 
AATAGACCAA GGGCAATCTG CTAAATCTAT TGCGCGATTT ATTTCAACAA TCCGTAGTTT 
TC 

(2) INFORMATION FOR SEQ ID NO: 990: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 366 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 990: 
ATCCAAAAAA TACGAAAGCG CTTTCTATAT TGGTATGCAA GTATTTCAAA AAGAATAAAT 
TTAATTTTCC TACTTTTCTA AACATTTATC TTTATGTATA ATGTTTTCAA GTAACTAAAT 
TATAAATTAA ATAAAGGGAG TGTTTATCAT GCTTACAATG GGGACAGCAT TAAGTCAACA 
AGTAGATGCC AATTGGCAAA CTTATATTAT GATTGCCGTC TACTTCTTGG GTnATGACTC 
CAACTTATTG ATAGTGTTnT TATGTTTCAG ATAATGCCCG ATGACTTTGT CATGCCAGCT 
CCACCGATTT TGAGnAACGA CAGCGACTTC CGTCCCCAGC CGTGCCAGGT GCTGCCTCCA 
GTATTC 

(2) INFORMATION FOR SEQ ID NO: 991: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 338 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 991: 
TGACTACAGT AACATTACCT GTAAATAATG ATTGAATGGC TTCATTTTCA TCATGAATAC 
CTGTAATAAA TTCTAATTCT TCATCTGAAA CTTTTACAAT ATGCGCCAAT GGCAAAAAGG 
TATGAATAGT TTGTCTTAAA TCTTCAGCGT TATCCCATAA TGGCAATCTT ACATTAGGAT 
CAAATACGAC AGTACCATTT GCATTTAATG TTTTGGTAAT CAATTGATAA TGTGCATCTC 
TCATCGGACT ATCGACCAAA TCAACAGAAC AAAAATGTAC TACATCATTT TCATTCACAT 
CaATATCATT CACAAAACTk GGkTCaAaTA aCATATCn 
(2) INFORMATION FOR SEQ ID NO: 992: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 992: 
AGTTCATCGT ATTCTTAATA CCTATATGCT ATCGTTCACA AACTTTTGTT CTTTTCTTCC 
AACATACCGC CTATAATGCT CAAAAAGAAA TCAGCAGTGG AACCACTACG TTCCCTACCG 
CTTTATCATT TTCAGGCTTA AATATTTTTA AAATTTTGTG AGGAATCCGC TTAAAAAAAC 
TCATATCTTC AAAAAAAATG ACAAAATTTA AATTTTTGAA TAACATCATT TTTTAGTAAA 
TGTTATAATT TCTTTATAAT ACAAATATCG AAAGAAGGCG ACTGACTTTG AAAAAATTAT 
GTTCATTAAT TGTAGGTAGC ATTAGTTTGT ATCCTTGCCA TTATCAGCTT GTGGnAAAGA 
ACnAACCnAA A 

(2) INFORMATION FOR SEQ ID NO: 993: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 530 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 993: 

ATAATGATTT ATCAATCGAT GTTAGATGCG GTTCAATTAT ATCCTAAGTC TATCGTGGTT 

GATTATACTG TGGATGGACA ATACaAAAAT GATTGTCACT ATTTCGGTCA ATCATCGATC 

AACATTGCCG ATTGGGCTCA AAACAATAAT TATTATCCTA ACCTGATATA TGCGATTCAA 

CAGACTCTTG ATTTAATACA TTATTACTCT GTAGAAACGA TTTTCGATTT AGCTTTATTG 

ACACTTCTGA AAGGTGATTT ATCAATCGAT GGTCATGTCG TTTTTGATTT TAAAGCCCCT 

ATAGCAACAA GTGCTTCAAT ATGGGAAACT ATTAAAACTA TTGAAGATTT TGATATGATG 

TCcAGTTTTt TTGAtAAAtG GCTTATATtG ATCATCATCC TATACCTTTT CGCAACCTTT 

TTATCGAGGA TTCAGAACAA CTTAATTCGC CAGATAATTG GTTGTATTCT ACCAAGTTTA 

TGCTACCTAA ATGGGnTATA TAAAATAAGC AAAGCAACGT GCTGATAACA 

(2) INFORMATION FOR SEQ ID NO: 994: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 551 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 994: 

TATTGGATTn TTnCCACCAA AGGCAACGGT TCCTTTTCTA ATTCTAATAA TCAAATCCCC 60 

ATTAATATCA ATCCCACTCA AATTCTTTCA CATCATTGAG TATAAAATAA TATGGTGATA 120 

TACGITGAAG TTGATTnTCT TATTTAAGTA TACTCCAACA GTATTTTATA AAATTATTTA 180 

GTCATCATCT TCAGTGTTTA ATTCAAAAAT ATAGAATCTA CTGTATTGCA TAATAAATTG 240 

AAATGCCACA AATTGTGTAC AAGCAATAAT AATGATACTT ACCATTATGA CAAACTTGGA 300 

TTCTACGACA TTGGGCACCA ATATATATTC GATAATTCTT TCAATATATA TAAATGCGCT 360 

CAACGCGGTT AATAATAAAC CGAAATGTGT TTTaGTTTTA CCACCCCAAC GTTTTGTyAC 420 

TTTAGGTAAT TTTAATAACG TGAACATTCC GCCAATTACT AATAACAAAT AACTAATTAT 480 

GATTGAAATC GTACCCATCA ACCAATAATT GAATAGGTCA TAATGATTTT GTACAGCAAC 540 

AAAATAATAA A 551 

(2) INFORMATION FOR SEQ ID NO: 995: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 729 base pairs 
(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 995: 

ATTCGCCACC ATTAGGTAGT GGAACACGAA CTTCATCATC TAAACCTTTA CCAATTAACG 60 

CTTTAGCCAT TGGTGATTCA TTTGAAATCT TACCATTAAA TGCATCTGAT TCAGCTGAAC 120 

CAACGATTTG ATAACTTTCC TCTTCATCAC CTGGTAATTC TACAAACGTT ACTGTTTTAC 180 

CAATTTTAAC AACGTTGTTA TCTCCAGTAT CTTCAATGAT TAATGCATTT CTTAACATAT 240 

GCTCAATTCT TTGAATATCT TGTTCGATGA ATCCTTGTTC ATCTTTTGCT GCATCATACT 300 

CAGAGTTCTC TGATAAGTCA CCAAATGAAC GTGCAACTTT AATTTTCTCT ACAACTTCAG 360 

GACGCTTAAC TGTTTTTAAT TCTTCAAGTT CACGCTCTAA TTTTTCAAAA CCTTCTTGAG 420 

TCATTGGATA TTGCTTTTGA TTTTCCATAT TGTCATCTTC CTTTACTGAA TTATACTATT 480 

GCTTGCTAAC TAAAGACTGA ATTTTTGTTG TCATAATATC TATTGCAACT TTATTGCTCC 540 

CACCTTCAGG AATAATTATA TCAGCATATT TCTTAGTCGG TTCAATAAAT TGGTCATGCA 600 

TTAGTATCTC GTGTTGAAAC GGCGTAATAT nCTCAAGTCT GGCATCTGTA TCAACATATA 
TnTTnAACA 

(2) INFORMATION FOR SEQ ID NO: 996: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 325 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 996: 
GACAAATATA CTGCTTCTAC TCAAATATTA GTGAACATGA AAAAGTCCTC AAGTGATTTA 
GCTTTCCCAA AATGTTCAAA GTAGTTTGCA GTCTGTTAAT ACATATACGG AnATTATCAA 
AAGTCCTAGA TTCTAGATAA AGTATCTAGA GAATTGATGG TCAGTATTCA ACAGCTGAGT 
GAAATCATTT TAAAAGTAAC CATCCAACCA ACTCTCAAAT TATTACTGTA CAGTTACACT 
GGGATAATCT GATCTGACAA ATGTATAAAT ACTAAGTTTT GCCAGCAGCT AAATAGAGGT 
GTATGTACGA CTTCCAnCTG TAGnG 
(2) INFORMATION FOR SEQ ID NO: 997: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 271 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 997: 
CTTCAGTTGT CGGCAAATCT TCTATATTGG AAATACCAmA TACATTTAAA AATAAATCAG 
TAGTAATTAA CTGTTGGCTA CGCTGTTCAT TAACCACTTT AGCCTCAACT AGTCCTTTGG 
CAATCAATGT CTTAACTGCA CCATCTGAAT TGAtACTACG AATTAATTCA ATATCACTTC 
TTGATAATGG CTGGTTATAA GCAATAATTG gATAGTACTT CCATTGCTGC TTGTGAtAAT 
TTCATTTGTG aCTTTTGTTC AATTGAATTG T 
(2) INFORMATION FOR SEQ ID NO: 998: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 498 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 998: 
TTTATGTGCT AATTTGTAAT TGAGATTCAT AATGGTATTT AAATCATTAT GGTGCGGGAG 
TTATGGTATT GCCATAATAT GAGAGTGCAT CCACTTCTAT AAGTAATGCA TATTGCGAGT 
GCAGGaATGA ATGAGTGCTT GATTAAAATC CTTATGGGTG GTTGACATAA TTAAAAGAAA 
CCACATTTAA AATTTCTTAA TCACAAGCGG TTAACTAGGT ATAGTTTAGT TTTGAGTAAA 
TCTTTTTAAA GGTGTACTTG TGCATTTTAC TTAATTAAAG ACATAAGACA TTTAATCGGC 
CTAAAATAAA TAAAAAACTA CCTGTTTAGG TAGTTTTTTT AATGTAATAG ATTAAAACAC 
TAGTTCATTT CTTATTAAAG ATGGATAGTT ATTTTATAGA TAAATTTGTC CTTTAGTGTA 
GCGGTAATTT TTAGGACTTT TTGGTGGnAA AAATGTTCTT AATAAAAGTA ATAGTCCTAC 
TTTACCGCCA AAGCCATA 

(2) INFORMATION FOR SEQ ID NO: 999: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 568 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 999: 
TGAGCTGGTT GTTTATTATC ATCAGTTTTC TTCTGATTAT CATCTGATTT ATCTTGAGAT 
GCTTTGTCAT CCTTATCTTT CGACTCATCC TTTGATGCTT TATCATCATT CTCTTCTTTA 
GCAGGACGTT GCTTTGTAGG TGCTTGTTGC TGAGGTGGCA CATATTGATA TTGATTCTGC 
TGTGGCAATT GCTGATATTG GTTTGCTTGT GAATCTAGCT CATTCTGCTC TTTTTCCTCT 
TGTTGCTTTT TCTTTCTCTC TTTTTCTGCA TTTTCTTTTT GACGCTTTTC TTTTTGTTCC 
TGCTTTTGTT CGTTCAACAT ACGTTCTTTA GCTTTATTCG AATGATCTAC ATATGCAAAA 
ATTGCAAATA CTAATCCTCC GAGTAATAGC AATACCGTCG CAATGCTAAC TACTTTCGCT 
GTTGTACTCA TTTTCTTCTT ACTTCCATCT TCATTTTTAT TGTAGTCGTA ATTTTGATTC 
GACATACTCT CCCTCACTTC AAACTAATTT AAATATAGAA ATTATCACGC AAcCATTAAC 
ATTTTCACAA CTAATTTGCG ACCnAGTA 
(2) INFORMATION FOR SEQ ID NO: 1000: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1000: 

AGTGTTTCTA ATTGCAAAAT TCCAAATACA GTGTCGTTAT TTACTTTTTT CATGCTAAAC 60 

ATCTCCCAAT TAATAATCAC ATATTTAGCT ACTATACAAG TATTAACAAA CATCCCAACA 120 

ATCCCAAAAC TTGTATGTGC TAACTTATGC ATAAAACTAC AAGTGTAAAT AAACTTGCTT 180 

TCTATTCAAT TGTCAAAGTT GAATAAAATT AAATAAGTAT AAAAACCAAT AATCAATAGA 240 

GTTAATATTT AGAAATAAGA TAATAGTACA TAAAACAAAT CACTCTATTA CATATTAATT 300 

TTATAATTTA TATACAAAGA AAATATATCA ATATTACCAT TAAATTGCAA CAATTCTACA 360 

TATATTTCTT CAAAAAAGAT ACGCATTGAA TTTTTATTAC ATATAACTCT CATGAAAATT 420 

GATTAAACTA AATTAATTAG TTAGCCTTAA ATTTAACAAA TTAAATCAAA CCGAATGAGG 480 

CTAAATTGGT TATTTTTCAA GATATGaTTT GTATAAATCA AAACATTTCT TACAAATCAA 540 

ATGTGTATAA AATGTGATAT ACATTTTCAA CCTTAAAAAA GTTCTAAAAA AAGATGAGCA 600 

TCTACTGTCT CATCTTTCAG TTCTTTTTTA CGGGTCTGTT TTCTAATTTG AGCACAATCT 660 

TCGATTTCTT TATCTAAATG ACTACCAATT AAATCTATTT CTTCTATTGT TAAATCGCTA 720 

TCTCCATCTT CTTTTATCTC TGGTATTATT TTTTCTTCAA CTAAGTCACG ATATAGTGTT 780 

TTTGAATTCT CGTTCAATTT CGATTCGTGA TTTTGAATAC TTTCTTCCGC ACCAATGTAT 840 

ATCTATTGGC ACTAGCTTCT ACTTTTGTAC CATCAATAAA AATTGAATTA TCATCAATAA 900 

GATTTTGCTT TAAACATTGA CTATAGAACT GAATAAATAA AGATTCAATT AACGCATCAG 960 

TATTAGGATT CACTCTAAAA CGATTAATAG TTTTATAAGA AAGTGTTTGA TCTTGGGCTA 1020 

ACCACATCAT TCGAATACTG TCATGAAGTA ATTTCTCTAT TCTACGACCA GAAAATACAG 1080 

ATTGAGTATA TGCATATAAG ATGATTTTTA ACATCATTTT TGGaTGATAG GATGTTGCAC 1140 

CACGATGATG TCXGAATTCA TCGAATTCGC TATCAGGTAT CGTTTCAACA ATTTCATTAA 1200 

CATATCGCGA AATATCATTT TGAGGAATTC TAACGAAGTT TCTATTGGTA GTGTAAGTTG 1260 

GGGTCATGTT ATAATTTTTA TACATAAGGC ACCTCTTTAA TTTAGTTTAG TAGTATTTAT 1320 

TAAATTATAC GaAGGGACCC AACACAGAAA ATTCATTTTA TTGAATTTTA CATTTATGTG 1380 

CAAGTTGGGG AAAAATGTCT TATTTTTTCA AAGTATTTAA AAGTAAAATT ACATGTTAAT 1440 

A 1441 



(2) INFORMATION FOR SEQ ID NO: 1001: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 587 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1001: 

TAATAATATC TGTCATATTA AGTCTCCCTC ATTAGAACTC ATTATAAATG AAGTTATTGT 60 

GTGTGTCGCC ACTGCCATAA ATTAAATATA AAGTAATAAA TATTGCCAAA TACAATAGTG 120 

TTAATAAATA TGGTTTGAAT GCTTCAACAT ATTTATTAGG TGGCTGTTTA CTTTTAGATT 180 

TCATATTGCA CCTCTTAAAG TTCTTAGTAA AAACGCCTTT ATAAAGACCG TTCAATATAA 240 

AATACGTTTT AAAATTTGTT TTTTACAATT CATTATATCG ATATTCATAA TGAAATTCAA 300 

TTTTAATTTT ATAGATTCAA CATAGTAATT GGTTGTCATC ACTCAAkTAT TGTAAATGAT 360 

ACACTTTGTT GAGrACATCA TTCATTTTAA AGGTTATTTA AACAATAAAC AATTACAGTC 420 

TATATAACAA TTTTGTTATA TACGTCAAAA TCAAATAAAC TCATCACATT AATATGACGA 480 

GTTTATAATG TTATTGAATT ATCATCAGCG CAAATATATA CATTCGCAAG TCAAGCATAA 540 

CATATTTAAC AATTGCTTTG CTTGGTTTTA CCAATGGATT AAAAACC 587 
(2) INFORMATION FOR SEQ ID NO: 1002: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 522 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1002: 

CAATTGTGTC ATAATTTTGC TGATGATAAC TAATCGTTTT GGAGTAGCTA TTTGATGAAG 60 

TTCGTCCAAT ACCTTTTGTT GAATCGAATT TACCACCTGA GCGTATATGA AAATGTACTG 120 

TCTACTTTTG CAGTTGAAAT TTTATTTTTC GGCAATTGGT CTAATATTTC AGTTTTACGA 180 

TTTCTTTTTA CTTGAAAATC TACATGGTAC TCACTTGGAT ACTTTAACCA ATTTGAATTT 240 

TTTTCTTCTT TATGAGATTC AAACTTTAAA TTTGAATGAA TTGAGCCTTG TTTTTTAACA 300 

AGTAATACAT TCTTGTCATA AGTTGGATCA TCGATAAAGT CGAATTGTAA ATTTTGGAGT 360 

ATATTtGTvT TCTCATCATA TACTGTTTCA GTTCGTTTTG TGATTTTACC GTTTTTCCCA 420 

ATATCATCCG GTGCTGTTGA ATTTTTATCT TTATTAGTAA CATTACGTTG CTTTTGTGAn 480 

CTTATCAACA TGTCTTCTAT TTGGCTGAnG CTTATGGnGA TT 522 
(2) INFORMATION FOR SEQ ID NO: 1003: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 323 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1003: 
TCTTTGTTTG AGACATAATG TCATGTAAAG TTTTCACAAT GCGTTCTTCA ACTTGTTGCC 
TTGCCTCTCC TTTGAAAGGA ACAATTCGAT CTCCGAATAA GTCCTCAGGC TTGTATAGAT 
TATCAAATAG ATAGACTGAC TCACCTTCAA ATATTCCAAA ATGCCATTCC TTCAGTCCCT 
TAAAACGTTG ATACGATTGG wTAGGTGCAA CATTTTCAAG TGTGTCACTT GCGCGTTCTT 
GCGTTGATGA TGCATATAAA TCGAAGTTTA TCCCCTTAGT TTCATAATAA CTACGTGCCT 
TTnGAGCTTG AGCAATTCCA AGT 
(2) INFORMATION FOR SEQ ID NO: 1004: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 320 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1004: 
TTCTAACTTG ACGTGACTGG GCATTTCTGT GnAAATnTTC TGACATTCTT TTTAATATCC 
GCAACGCAAT TGCTTCAGTT AAAGATTGnA ACTTGCAACA TTATTTGGTA CGAAAAAGTC 
AGTTTTAATG TCGATATAAG ATTTATTTTT TTTGTTATAT AGTTTCGCAA CTACATTTGG 
TTGTCTTACT TGATCATATT TTGCAACCGT ATCGAATGCC GTCTTTTCAA ACAGCTTTAC 
GAGATACGTA AACCATGACC CATCCATCGA AGTCCTTGGT AAAAACCCAG GGTTTCCGAA 
GCCGTAGGGT TTGAAAGATA 
(2) INFORMATION FOR SEQ ID NO: 1005: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 305 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1005: 
nATTAATGCT AGCCAAAACA GAnTTGCAAn ATGTAAAGAC ATGTGAAACC CTCCTTATAT 
CGATATTTAT AAAAAATAAT ATTTCACTTA ACCAGTTTTT AGTTATCATT GCAATGGTAA 180 

AGGATACGAT GAGTATCACA CCACAAATGA TAATACCAGG CAGGAGCCAA CATAAATCAT 240 

CTAAATCTTT ATTATATGTG ATTAAAATAT TAAAGATAAC AAAAGTGGTA GTAATAACTA 300 

TATTG 305 



(2) INFORMATION FOR SEQ ID NO: 1006: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 334 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1006: 

TTCAAGATAT CAGTATAAAT AAAAACTTAA TATGCTCATT ATAGACAGCT ATAACTATAT 60 

TTTCTCGTCC CACTCTATAG AAATCAATTT CATTTATTAC CATTTTCTCT ATTTAAAAAC 120 

ATATTTTTGG TATTAACAAT TAAAAGTGGG TATATATATT TAATGAACCA ATTTTTTAGG 180 

AAAGTAGGGC TAAACCTATG GAAAATAAAT TATACCACAT GGTGTCCTCT TTTACCATGA 240 

ACACAGTGAA TTAAAAAATA TTAAnCAAGG TATTGGGGAA GTAnCACCAG CATTAGGTCC 300 

AATTGGTAGC CACCTTCCAA TCCATTGGGG GnAA 334 



(2) INFORMATION FOR SEQ ID NO: 1007: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 565 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1007: 

AGAAAATTTG TTATATTAGA AGCGAATAAT TTTAAAACAT ATTATGAAAT GTAAGAAGAA 60 

ATATTATGAA ACTAGGAAAA GTAGATTGTT GAAAATTCCA TACAATAGGn GAnGTAAAGT 120 

GAATAAATTC TGGTATTACT AAAGCTATTG ATTGTCCTAA TTGTAATAAA AAGTTGGATA 180 

GTGTAGTTGA GGGGAGAAAA ACTAAAATGC AAAAATATAT TCTCAATGGA TCATTATAAA 240 

TACCAATTAG aTTAAAGTAT TGCGaACATT TACAATTAGa AATCAGTATT AATCTGCCAT 300 

AGATAATTGT ATTAATGTAA AAmATAAAAT AGAATGAGAA TATACGGAAA TTTTTTATAC 360 

ATTTATAACC AATTCTGTTA AACTGTGAAG AGTAGAGTTA AAAACGAACT TTTTGTTTAT 420 

AATGATTGAA CGCTATTCTA GAGAAGAAAT GTCTAATATT TGGACTGATC AAAATCGCTA 540 
TGAAGCCATG GTTAGAAGTG GAAAT 565 
(2) INFORMATION FOR SEQ ID NO: 1008: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1008: 

CAACATTAGC GTCTAAACAA ATCTTTGACT AAACGTTCAC TTGAGCGACC ATCTTGATAT 60 

TTAAAATGTT TATCTAAGAA TGGCACAACT TTTTCAACCT CATAATCTTC ATTGTCCAAA 120 

GCATCCATTA ATGCATCAAA GGACTGTACA ATTTTACCTG GAACAAATGA TTCAAATGGT 180 

TCATAGAAAT CACGCGTCGT AATGTAATCT TCTAAGTCAA ATGCATAGAA AATCATCGGC 240 

TTTTTAAATA CTGCATATTC ATATATTAAA GATGAATAAT CACTAATCAA CAAGTCTGTA 300 

ACAAAGAGAA TATCGTTAAC TTCACGATGA TCTGACACAT CGATAAAGTA TyGTsTATGT 360 

TCACGTGAAA TAyyAAGTCy ATTtTTTACG AaCGGATGCA 400 
(2) INFORMATION FOR SEQ ID NO: 1009: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 454 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1009: 

TATTGTAAAA ACTGTGGTTC TTACAATGGC GAAGAAGTAG CAGCTAAATA ATTTTAGTTA 60 

CTCAATATAA AAaGTCCCGC TTAAAATGAT TGTTTTAAGT GGGACTTTTT ATATTGCGAA 120 

AAATAATTGG cGAACGAGGT AACTGGATAC CTCATCCGCC AATTAAAATT TGTTAATTTA 180 

ATAATTAAAT ATAAAGACGA TTTATTAGTT TTTACGTTTT CTAGGTAATA CGAATGCAAC 240 

GATGCTACTT AAAGCTAATA ATGCCATTAA TGGTAATGTC ATATCTnTAT TTGATTCTTC 300 

ACCAGTTTGT GGTAATGATT TTGCTTTATT TTCTTGTGTA TTTGTATTGT TTTGGCTTTG 360 

GAGTGTGTCC ATCATTTGTG TTTTTAATGT TTGCTTnTnG TAATGGAGCA CTAATCTTTT 420 

GCCTCGCTAG AACCTGCCGG AGGTTTGGAA CCAC 454 



(2) INFORMATION FOR SEQ ID NO: 1010: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1010: 

GGTACAACAA CTTTAGACAA TGAAATATTA AAAGATACGG ATGATAAAAA GTCGAGTAAA 60 
ACTTTTGTTG GCGGAACAAA AGTTGATGAC CAACATGCTA GTATCGGAAT GGATTTTGAA 120 
AATCAGGACA AAACTTTAAC TGCCAAAAAA TCATATTTCA TATTAAACGA TAAAATTGTC 180 
TTCTTAGGAA CTGGCATTAA AAGTACTGAT TCATCAAAGG ATCCCAGTTA CCAACCGATG 240 
AAAATCCGCA AGCCGATGGG GAATACGGTA TTTTCCGGCC GTTAACCAAC CACCCATTCC 300 
GGTTATCCGG GAAACCATTC CATCCTTTTA GGAGnCnCCG GTnCCnAAAA GGACCTCGGG 360 

TATCCTTTTT TTAACCAACC CGAAATTACT GGTAAAAAAG 400 
(2) INFORMATION FOR SEQ ID NO: 1011: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1011: 

TTAGGTATTC GCAAAAATGA TCTTCAATAT ATAGCTATGG CCAAAAGATG GGCTAAAGCT 60 

TATACAATTA CTGTAGCAGT GGGAGTTGTT ACAGGTACAA TTATAGGACT TCAATTATCA 120 

TGATTTGGCC TACATTTATG GAAATGGGTG GACACGTATT GCACTTCCCT TATTATGGGA 180 

ACATTGCGGT CTCTTGGAGC TATTTCTTAA GTATATATTA TATACTGGGA TCGTTTAAAA 240 

TAATGGCACA TTCTTAATAG TATACCAGTA ATTATGGGGC CTTCTCAGCA TCTCATACTC 300 

AGGAATCATT ATGAATACGC CTGCAGGTTT GAGTGAAGAT GGAAGTGGCC ATGTCACCnT 360 

AGAGCGTGTT ACCCACGTTA AGTCGTCATT nCGAATACAC 400 




(2) INFORMATION FOR SEQ ID NO: 1012: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1012: 

CTTTATGGGA TACACTACAA ATCGAGACTA TAAGGTTTTT TATTTTATTT ATTATTACAT 

TATCAATAGT TTTATAATCG AGCTTCAAAA CTTTAGAAAA TAGTAGAAAT AGCATTCAAT 

ATAGTGCAAA AGTGCAAATT GATAACTTGA CACTTATCTC CTATAAACCG TACAATTAAT 

TTGTATGATT TATATATAAT TTCATAAAGT CATATTGAAT TTCATATAAA GAGCAAACCC 

TAGAAAAnGA GGTGTTGTGG AATTATTAAG CTCCTACTCA TTTGCGGGGT CATCTAGGGT 

GTGGAGGATG TGTGCCGAAT TGTGCTCAAA llATCCCAGCA GCAACGCGAG TTGAACAGCC 

TAAGACGCAA CAAAGGATCT GnnAAGGACC ATCAGGCACG 

(2) INFORMATION FOR SEQ ID NO: 1013: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1013: 
GTCGTTGCTG ATACCACGCC AGTATTGGTT TTAAATGATA AAAAAATCTG TGAGATTGCA 
TCATTGAGAC AAACATCGCT ATTTGAAATG GCCGAATATA TAGGGTTTAG CGAGCCACAG 
AAATTAGTAC AATTATTTAT TAACCATGAT AGGAAGGTGA GACGCCAATG AATCAATATA 
ATACTATAGG TTTTCACCCG GGAAATAGTC GTATTCATCA ATTAAATGCG ACTGTTAAAC 
TTTTATnCTT ATTAGTTGTT CTATTCTGCA ATGGTnCTAT GCACAGGTAT TnATTTAATT 
AGGCTCACAT TTATGGGCAA TTCTCAATGA TGAACAGTCG TTGTGTAATC TCG 
(2) INFORMATION FOR SEQ ID NO: 1014: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 627 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1014: 
CAAAGCCTTC AACAAATAAT CCTTTATTAG TACTGATTGG TTGCACAACA TAAGTTACAC 
CAGGTTCATC TTTAAATGTA ACTTCCTTAT AGAAGAGACC TTTTTTAGCA CTATACTCTG 
TTTTTTCTGA TTTAATTTTA TCTTTTAGAT TTTTCTCTTC CAAGTAACTG TCAATTAGTT
ATGAAATAAA TAATATAATG ATAACTTTAA TCGCGTTTTt TTTCTTCTTC ATTATGTAAC 300

TCCTTTTTGC TTATCTATTC AGTATGAACG AATCAATTTA TTTAATAAAG AGTTTTTACC 360 

AGATTGTAAC AAATTTTTTT ATTCTCAATA TTATTATAGA AATCGAATTA CATACACGAT 420

GCTTTTAAAA AATTTAATAA CAAATTTTAT TTTACTATCT ATCAAAATAA TTGACTACTT 480 

TTAACAATTA AGGATGATAC AATTACATTT TTTTCAATTT AAAAAATCAG TCATATCAGT 540

AATTACCGAT GAAAATACTA AGATATCGAT ATATTTTGCA TTATTTAACA TTTAAATTCn 600

ATTATAGACA ATTCAATAAT TGCGATG 627 



(2) INFORMATION FOR SEQ ID NO: 1015: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1015: 

TAGTGTGACA TTAACACTGG AACAAAAAGA AGCTGCAATT GCAGAAGTTA ATAAGCTTAA 60 


ACAACAAGCA ATTGATCATG TTAACAATGC ACCTGATGTT CATTCAGTTG AAGAAATTCA 120 

ACAACAAGAA CAAGCGCATA TTGAACAATT TAATCCAGAA CAATTTACGA TTGAACAAGC 180 

AAAATCAAAT GCAATTAAAT CGATTGAAGA TGCAATTCAA CATATGATTG ATGAAATCAA 240

AGCTCGTACT GATCTAACAG ATAAAGAGAA GCAAGAAGCT ATTGCTAAGT TAAATCAATT 300 

AAAAGAACAA GCAATTCAAG CGATTCAACG TGCGCAAAGC ATCGATGAAA TAAGTGAGCA 360 

ATTGGAACAA TTTAAAGCTC AAATGAAAGC AGCTAATCCA ACAGCAAAAG AACTAGCTAA 420

ACGCAAGCAA GAAGCTATTA GTAGAATTAA AGACTTTTCA AA 462 
(2) INFORMATION FOR SEQ ID NO: 1016: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 235 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1016:
TGATTGGTCC CATCGACTCG TTATAACAAA TTGAAGGTAC AAAAGTATCT TGTGATTTAA 60 


TAAATTCAAG TAAATTCGTA CCTGGTTCTA CAAGGTAATT CTTTTCCATC AAGTGTAACC 120 






ATAAATTGCG CATATCCACC nAAAATAACG ATGCCCCACA CATCTTTCAG ATAGG 
(2) INFORMATION FOR SEQ ID NO: 1017: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 227 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1017:
GTTGAAAACT ATAAGATCAn ATAAAGGTAA AACAGTGGTT TGTGTCATTA GTGGTGGTAA 
TAATGATATT AATCGAATGA AAGAAATTGA AGAACGTTCA TTACTATACG AGGAAATGAA 
GCATTACTTT ATCTTAAATT TCCCTCAACG TCCAGGTGCA TTGAGAGAAT TTGTAAATGA 
CGTATTAGGA CCTCAAGACG ATATTACTAA ATTTGAATAC TTAAAAT 
(2) INFORMATION FOR SEQ ID NO: 1018: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 284 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1018:

GTCTTACTTG CACAAGGTAT TATTTCTATC TTACTCGTTG CTTTCGCAAT CATGCTATAT 

ATCATTAATA TTTTAGATGC ATATCGTAAT GCTGAACGAT TTAATCGCAA TGAGGAAATA 

AAGGATCCGA AGCGCGTATG GTGGCACATG GGACAAGACG TTCCCATACT TACTAATCTC 

ACCAGGTACA TnCTTATTGA TATTGTAGTG TAATTCCATT ATATTATGTT GGAGTAGCAT 

TACAATACAA TTATACACGC GCCTCGAGAC nACATAGATG GGTG 

(2) INFORMATION FOR SEQ ID NO: 1019:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 779 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1019: 
CAATCATCAC ACATAAATGT TCGAATAGGA TTGTTTTTAA GTCGTTTAGA CTCAGTTGTG
CACCTCAATA TGAGTTACAT GGTTGATTCG ATATGAATAA CCATCTTCTT GATTGTAAAT 180

AAAACTATCT ACACCATTAT CGCTGTAAAG TCGTTTACCG TCTTTTGCAA ATTGGAAAAA 240 

TAAATAGGGT AATAGATCTA TCGGTATATC TAAATGATCA TGCTCATTTG ATAATCGAAT 300 

AGTAGTTGCA GAGTCATGTG GTTCGGAATG TTTGAAAAAT GGTGTCATAT TAATGACAAA 360 

TGAACCTTCT AACATGGCAC GTTTTTTATA TTTTATTTCT GAATTTAAAG TAGGCGGATT 420 

AGTTTGTCCt TCtAGGATAG CaCGATTCCa TTCaTGaTTA TCTTCAAAGT CGATTGGTTT 480

TGAACCATCa AATACACCTT TTTCTAAATC TTCGATGCTA ACTTTTCTAT CATCGAAAAT 540 

CCAAGTCGTA CTATCTAATG TTATAGGAAA CTTTACGGCT CCTTTAATTT GTATCATTTT 600 

CCCACTCCCT ATCAATGTAT ATAGCATTAT TTTAACACAA ATTGCTAACG ACACATTTTA 660 

AATCTACTTG CTTTTTAATA TTTAGTAAGA TAAACTTTTA GTAAGACTTG AGAATTTATA 720 

TAGAGGGGGA GCGTGTCATG GCGAAACAAG CAACAATGAA AAATGCAGCT TTGAAACAA 779 
(2) INFORMATION FOR SEQ ID NO: 1020: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 950 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1020: 

TTAATGGTCC ATTAAGAATA ATGGCAGAAA ACCGTTTGCA TAAATCAAAC CTTGATTCAC 60 

TAACTTAAGT AATACTCGCT TTTCnTAAAT CCTTTAGAAA GGAACTATnT ATCATGATTA 120 

CTTCATTCAG ACATTCTGAA GATATAGATA AACATATTAT AAAAACACCA TTAGATCATA 180 

CAGCGTCATG GATTAATGTA GTAGAnCCAG ACCGAGAAGA AATTGAAAAT CTTATGGAAC 240 

AATATAATAT ACCTGAAGAC TTTATACGTG ACCCTTTGGA CTCAGAAGAA AGTTCCCGTA 300 

TTGAATATGA CGAAGATACT GGTTACTCAT TAATCATTAT TGATTTACCT ATCGTCAATT 360 

CAACTAATCG TAGCGTTCTA TCTTTTGTAA CGATTCCATT AGGCATTATT ATTGGCAACG 420 

GTATTATCGT AACAGTTTGT GACGCTGAAA ATGAATTTTT AGAAAATTTA CCTAAGCGTG 480

ATATTAATTT AAAATTTCAC AGCAGATTTG CATTAGAAAT TTTAACTACT ATTGCCGACC 540 

ATTATAATCG TAACTTACGA TTACTTAATA AAAGTAGAAT TCGTATTGAG AAAGAACTAA 600 


AAAATAACAT TACTAACAAA CAACTTTTCA AATTAATGGA AGTTGAAAAA AGTTTAGTAT 660 

ACTTTTTAGC TGCCTtAAAA GGTAACGATA CAATTATTAA AAAGTTATTC CGTTTACCTG 720 


AAGCCATCGA AACAACAGAA TTACATCAAC GTATCTTAGA AAGTATCACA ACATCATACG 840

CTTCCTTATT ATCTAATGAT ATGAATACGA TTATGAAGAC ATTAACACTT TTCACGGTAC 900 

TATTAACgTT ACCAAyACcC GTATTTAGTT TCTTCGGTAT GAATGTGTCG 950 



(2) INFORMATION FOR SEQ ID NO: 1021: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 528 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1021:



AATATATAAA ATGAATACAT TGTGAGGCAA TTATCAAATG AAATTTAATA AAGTAAAACT 60 

AGTTATACAT GCGTGTGTAC TATTATTTAT CATTATTTCT ATAGCGTTAA TTTTTCATCG 120 

ATTACAGACG AAGACACATT CTATAGACCC AATACATAAG GAAACAAAAT TATCAGACAA 180 

TGAAAAATAT TTAGTGGATC GTAATAAGGA AAAGGTTGCG CCGTCTAAAC TAAAAGAGGT 240

ATATAATAGC AAGGATCCTA AATATAAGAA AATTGACAAG TATTTACAAA GTTCATTATT 300

TAACGGTTCA GTAGCTATAT ATGAAAATGG CAAATTGAAA aTGAGTAAaG GTtATGGaTA 360 

TCAAGaTTTT GAAAAAGGTa TTAAAAaCAC ACCGAaTACG aTGTTTTtAA TnGGTTCAGC 420 

TCAAAAATTT TCAACAGGGT TACTGTTAAA ACAGTTAGAA GAAGAACATA AAATAAATAT 480

CAATGATCCA GTAAGTAAAT ACCTTCCATG GTTTAAAACA TCTAAGCC 528 



(2) INFORMATION FOR SEQ ID NO: 1022: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 713 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1022: 






GCATTnCGGT TAAAgTAkGt TTCAGCTTCT CTCaTTTTAG AAGTACCGaA AAtTGGTTGG 60

TTAtCTGGAT TTAATrtCAC GATAAATATT GCTTACTTTG TTACTTTGTr AAATGTACCC 120

ATTACGAGTC TCAATGTTAT TCCAATAAAT ATCACTTGTT GGTGCATGGT TTGGATATGC 180

ACAATGATTT CTTGAAATTG TTTGAACGAT TTCTAATGGA TCACAGCCAA ATGTACTGTT 240

TAATACTTCA GAGTCTCTGA ATAATGCACA AAnTGAAATA CAAGTTGTCC AGTTTGGTAA 300



AGTATCTTGC GTATAACCAG CCTCTATACG AACCATTTTA AATTTTGTTT GAATTAAATC
TGTAAAACTC TGTCTATCCA TTCTGTTATC TACCTTTCTG TTTGGGGAAT TTTATCCGGA 
CACAAGAAAT TGCAATAATA CACATTTCTT GAAACACAGA TTACATCTTA ATATATTTTT 
AATAAAATGA AAAGAGTCAA TTTCACATTT GTATTAAATT TTGATCAAGT CAACAAGAGT 
TAAGTTTACT TTATAATGAA GCGAAAGTAA ATGAAGTAAA ATTTTATGCA TAATCACTAG 
ATTTGATAAA ACTTACACTT TTATAATGaT ATCAAATTTA CAAaGGAAAA CTG 
(2) INFORMATION FOR SEQ ID NO: 1023: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 570 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1023: 
AATAAAATAG AAATGATTAT GGGTACAATT ATCGCTATCA TATCCTTGTT ATTAATTATA 
TTACAAGCAT TTAATATTAC TTGGGGCGTT ATACCAATTA CAAATTTTGG ACATCAATTT 
TTCTTTTTCA TTGGTATTAT TTTAGTAATT GCCGGCATAT TTTwyAAGCG ACTtGAGTTT 
TCGGGaATcG GGTTATTATT TTGTcAAAAA ACCGTCGATG CAATGATTCA TAATCCACAA 
TCAGCCCAGA TTTTTTCATT AATTATATGG ATATTATTAG TAGTTCTAGT TATATATTTC 
ACAATTAGAT TATCTTCACG TACAAGATTA TAAATATGAT AAAACTATTC ACTTGaTTAA 
TTGTATTAAT TGaGATGaAT AGTTTTTTTA TTGTTGGAAT AACTTTTGGT AATTTATAAA 
TAATTTAAAA AAATTGTTTA TAAAAgGAAG CGTATATAGA ATGAAGGTTG GGTATATAGT 
TTATTGAGGG AGGTGTCACA ATGAATAAAG TCACAaTTAA TCCTCAAATC CAATTAACTT 
ATCAAATTGA AGGTAAAGGG GATCCTATAn 
(2) INFORMATION FOR SEQ ID NO: 1024: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 487 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1024: 
CTTTGTTGTG CTTGAGCATT ATTAAAAAGA GTATTTAAAT TTAGTTGTTG GTTTGTAATA
TCAGAATTAT TAACACTATA TTTCCCTTTA AATAATGGTG ATTCGrAATA ATGCTTATCT
TTATCTGCAG CTAACTGGAA TTkCCCTAAT GCAGAGTCTG CGATTGTTGG TTCAAGATTA
ATCATTGATT TCTCTTTTTT AGGATCATGT CCATATGACA tGTaATTTTC GATGCATTAA
CAACAGATTT AkGAATGCCA AGCCCTTTAA CAATTTCATC TGATGCATCT GCGCTTAATn
CTAATGAAGA TAAAAACGAt TAtCTTTCAT CkkTTCTTGG AACTTCACTT CATTTTCAAA
ACGGTCATTA AAATAATCTT TATACATTTT TGCTGTTTGT TGTTCACTTT TTAGGTATGT
ATTTTTC 

(2) INFORMATION FOR SEQ ID NO: 1025: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 364 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1025: 
ATATGTTATA AACAATTTTA CAGTTGTATC ATCTACTCGT TCTATTCTCA TATCATCTCA 
CTCCTTACAG TCGATGAATA GTATCCACAT TGTATTAAAG CATCAAACAA AATACAATTT 
ATTTGTTTGA TTTCTTTCTC TAAAmTATTT TGGTAACTTA TAATAACATA TTCGTTGTAG 
CAAAACTATT TAAATGATTT ATACAGTTAA AGCATTAAAG CACTTTCTTA ATTTAATTTT 
ACCTTATCAT AGCAAATGTT TCATCCAAAA CACTCTAATA TAGAGCTATA TTAACAATTG 
CAd^TTTGaT TTAAAAGATA CTAAGCAATT ATTGtAAAAT TAAAACCAAA AAaGCAACAC 
TATA 

(2) INFORMATION FOR SEQ ID NO: 1026: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1026: 
TGCCCCAGGG GAGTACnAAA AAAAAGnGGG GTGGGGAATT GGAAACCGGA ATTAGTGGTT 
AAAGGGGTTT ATTGGATGTT GGGTCTAAAG TATTCCCAGA AATTTATCCA TCnATGATTG 
CTTTAGCCAA GTTTTTTTCC AACCCATCTT TATGGAATTT AGTTATCTCA ATTACATTAA
CTTATATAAA AATGGCACAA TTATTTCATG TACCAACAAG AACAATATTA ATACGTCATT 300
TAACACCTAA AATTATACCG GCTATTATCG TTG 333
(2) INFORMATION FOR SEQ ID NO: 1027:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 450 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1027: 


TGTTCCATTC GTTTTAATAA TGGTGATACT GTACCAGTAT CGAGTGCTAA TTCAGTTACG 60 

ACTTTCTTGA CGTTTACAGG AGATTCATCC CATAAAATTG TTAAGACAAG AAATTGTGGG 120 

TATGTTAGAT TGTACTTCTT AAAAACTTTG TTAGAGTAGT AGCGATTAAC TTGTCTTTGA 180

GCATTGTACA AACTAAAGCA TAGCTGTTCT TTTAAATTAT GTTGATCAGA CATTAAAGTT 240

CTCCTCCAGA CATACTATCC GTTTTTnTCT CTTTTCGGAT TGGTAATCAT TAAAAAGTTG 300

ATTGTTTATT AATTCACAAC TTTCTTTGAT TCAATGCCAT GCnAAAATTA AAGTATGTTT 360

AAAGTTTAGA AGATATTTTT GATTAAATCA AGCAAAAAGA TAATTTAATA TATATGTGAT 420 

CATTTTTAAA AATAACTGTn ATAGAAAAGA 450 



(2) INFORMATION FOR SEQ ID NO: 1028: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 535 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1028:



TGCATCTATA CCTAGAACTT TAATTTCGGA TTTTATTGCG TCTGTAATAT CTTTAATACC 60

ACTTTCGATA GGGAAACTTT CGTCACCTTC AACATATTTA ATACCAAAGC CACCACCAAG 120

GTTTAATAAT TCAACTTGAA TGCCTTGctC TTTaAGCCAA CGTAAAmCAA TTTTAGCaGT 180

TTCAATAAAt GCTTCTGTAC CTTCAAtCTG TGrACCaATA TGACAATGTA CACCTTTTAA 240

TTTTAAGTGT TTAGATTGTT GGACTTTGTC AATTGCTTTT TTAGCTAAGC CATATTGaAT 300

TGaTAATCCA AACTTACTAT CTTCTTGCCC AGTTTGaATA AATTCgTGTG TATGTGCTTC 360

aACACCTGGa TTAACTCGTA AtACAACTTG aACCGTATCA TTAGCATAGC GGTCTATTAA 420

AATTTCATGT TTCGTTTTAT TGTTACCATG GAAATGGATG CGACTCGGTT CAAAA
(2) INFORMATION FOR SEQ ID NO: 1029: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 359 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1029: 
GAAGATGATT GAATTATGGA ATTTTTTTCG AGATATGTTA TTTCGGGGAT CAGATTTTGC 
GATTTTGCCT TGGATAGAGT AGTTGCGATT TGTAAATACG TTTTTCAGGT TCGATTCAAT 
TTCGTTGTTA AGATAATATG GATTAGTTTC ACTGACATGT TGATGCATGT CTTTTTTGTT 
CTGATAAAGT ATAAAATGTT TACTTAACAT TACTTAGTAT GACACCTGCT AATTCAAACA 
TTATTTGAGA CATTCTTTTC AAATTAATTA TAAATTTTTA CCtATAGACT AGTTTGATAT 
TTATCTACAT CTCAAAATTC TCATCAACAA TCTGTCACAT CCAACATTTT TACTTnAGT 
(2) INFORMATION FOR SEQ ID NO: 1030: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 341 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1030: 
TACATTGGTA AATAATTCTC ATAGTTTATG TCTCGTTGAT TAATCCAATC ATTGTTTTTA 
TCGGGTAAAA CATTCAAACT TTTCAAATTC ACAATTGATT GATGTTTCAT TAAAGTATCT 
AATTTTTGCT CTCTAGTTAA ATATTCACCA ATGTCATTAT AATAGATATC GTGATTGTAA 
CTTCCATCTT TAACTAAAAT AGATATAGCT ATCGGTGCTC TACTTCCTGA TCCAAATATT 
TTTCCACCTT CTTTTCTTGa TTTTTCACCC TGTGTTCGTT GGTCGCCTCT CAAATTAATT 
ATGtACAGaT AGTTAAATTC ATCGaTAAAA CTTTTCTAAA T
(2) INFORMATION FOR SEQ ID NO: 1031: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 326 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1031: 

GTCTAACTTT AATGTTTAAA AAnCTTAGAA ATTTTCTACT CATAAATGGA TGCTTTATGG 60 
AATATTATTA ACAGGCCCAG CTTCAATGCT AGCTATAGAA TTTGGATGGT TCTTAACAGA 120

GATGGGTAGA CAGCCTTGGA TTGTTCGTGG TTATATGCGC GTGGCAGAAG CAGCAACACA 180 

GCAGCGGAAT AACCTTCGTT ACAATTTTAT TTGGCATATG TACATCATTT AAGTATACAG 240 

TGCAACGTAT ATCGTAGTTA AAATAACCGC GTAGAGAGTA ACGTAGCAGA GCAGAGGAAT 300 

GAAAnATTAG CTTATGnAAC GGTAGG 326
(2) INFORMATION FOR SEQ ID NO: 1032:



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1032:

ACAATCAGGC TTGTGAAGAA TGAATTGCAT GACGTTTGTG TAATTGAACT GCGAGGTTTA 60

ATTTCTAAAA TGCTCCAGnC CTCCAATGTA ATCGCATCTT GATATAACAT AGCGATCGCT 120

TGATTTGTCG TTGTATCAAC ACCTCTAAAC AACGTTTCAC CTATTCGGCA ATTCATTTAT 180

TTTACCCAAA TCATTATATA AAAGTTGTGG TAACATGCTC GAAATACCAC CTGAAATAAT 240 

nTCAATCGAT ATCCTATTTC TCT 263 
(2) INFORMATION FOR SEQ ID NO: 1033: 




(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 514 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1033: 
TTTCTATAGA GGATCCTGTA GAGATGCAAA TTCCTGGTAT CGTCCAAATT AATGTGAATG 60

ATAAAGCTGG CATAAACTAT GTAAATTCGT TTAAAGCTAT TTTAAGATGT GATCCTGATG 120 

TTATTTTAAT AGGTGAAATC AGAGATAAAG ATGTTGCCAA GTGTGTTATA CAGGCTAGTT 180 


TAAGTGGTCA CCTTGTTCTG ACTACATTGC ATGCAACTGA TTGTAAAGGT GCTATTTTAA 240 

GGCTATTAGA AATGGGCATT TCTGTACAAG AATTGATACA GGCAACTAAC TTAATTATAA 300 

AGCAACAACT CCGATATTTC TTTTCCCaTA ATCATtCATT ACCATCATCA TTTAAGAACT 420

TAGAAGATAA ACTTGATGAT ATGACAAAAG CAGGTGTCAT TTGTGAAACT ACAATGCATA 480

AATACATTTA AACTACATTC TAAGAAGCGA CAAT 514

(2) INFORMATION FOR SEQ ID NO: 1034: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 598 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1034:



TTTTTCACGA ACATTTTTGT TTAAAATTTT CTTTCTTAAA CGTATACTTT CTGGTGTAAC 60 

CTCAACAAGT TCATCATCAT TAATGAATTG TAACGCTTCT TCCAATGTTA GAATACGCGG 120

TCTATTCATT GTTTGTGTTT GGTCTTTCGT TGCAGAACGT ACGTTAGTTT GATGTTTTGT 180 

TTTAGTGATG TTAACAGTTA AATCATTTTC ACGATTATGT TCACCAACAA TCATACCTTC 240 

ATAAACTTCA GTACCAGGTT CCATGAAGTT TACACCTCTA TCTTCAAGTC CCAAAATGGc 300

ATAAGTACTT GCAGAACCTT GATCCATTGA AATTAATGCA CCATTACGAC GACCGCCAAT 360 

TTGTGCTTTA ATACGTGGTC TAAATTCyTC aAATGTATGG TTAATAATAC CGTAACCTCT 420 

TGTCATTGAC ATAAATTCAG TCGTATAACC AATCATACCA CGAGCCGGTA CATTAAAGAT 480

TAAACGTGTA AGTCCATTAT CAGTTGTAGT CaTATCAACC ATTTCACCTT TaCGTGcACC 540

TAAkGATTCa ATAACAGCAC CTGcATTTtC TTGTGGCACT TCACATTGCA CACGTTCA 598 




(2) INFORMATION FOR SEQ ID NO: 1035: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 313 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1035:

AAACAGTGCT TCATTATCGG TATTGATATT GATCCACAAG CCGTTGACCT AGGGCGCACA 60 

ATCGTTAACG TCTTAGCACC AAATGAAGAT ATAACAATTA CGGATCAAAA GGTATCTGAA 120 

CTTAAAGATA TCAAAGATGT GACGCATATC ATATTCAGCT CGACAATTCC TTTAAGTACC 180

AGCATTTAGA AGGAATTATA TGAnTTAACC AATGAAAATG TCCGTAGTGC CATGCGCCTT 240
AAGTGGGCCA TGT

(2) INFORMATION FOR SEQ ID NO: 1036: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 235 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1036: 
AATAATGAGT AGGTTTTGAT TACTTACGAG ATAACATGGT GAATTATTCT GAAGATAGAC 
TAATGCGTCC ATTACATTTT GCAATCATTG ATGAGGTTGA CTCAATTTTA AATCnACGAG 
GnCGACGCCA TTAATTATTT CTGGTGAACT GAAAAGTCAA CGTCACTTTA TACACAAGCA 
AATGTTTTTG CGAAAATGTT AAAACAGGAC GAAGATTATA AATACGATGA AAAAC
(2) INFORMATION FOR SEQ ID NO: 1037: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 263 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1037: 
nCAATATACC TCCTATCATC ACACAAAATG CGTTGTTTCG TATGATTTTC TCAACATCGT 
AGACTAATGC AATCAGACAC AACACCATGC TCTATATCCA ATATTTGCTT TATTGCCTAT 
CAATGAGCGT ACTGCGTGCT TTTAAATAAT CATCATCAAT TAATGACTGT ACAGGCACCT 
CATGAAAATT ATCATCCGCC AAGTATTGCG CACGATCACT ATATGCTAAA TGCATCGCTT 
GTATCAnATG ATGCAAGTAA TCA 
(2) INFORMATION FOR SEQ ID NO: 1038: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 218 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1038: 
ATTGTACCTG GCACAATCTC ATTATGTATT GGATTATCAT CATTACGAGC GCCAAATATT
ACCGTTATGA CATGTTGATC ATTTTTAGTn AATGTTAAAG CTATAACGTG TGATATTGCG
AGTCTGTCGG CATTCAATAT CTTCnAAATG CCCTACAT 
(2) INFORMATION FOR SEQ ID NO: 1039: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 396 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1039: 
GTTTAGACGC TAATGnTGGC ATATATTACT TGCCATGCAA ATGAGTCATA AGGCATAGTT 
TTCAAGAAGG GTTCTGACAA TGAAGTGAGC AACATGTCAT GATGAGAAGC AGGACTATAC 
AATGAGAATA ACCTTTTGAT TTTTCATGCA TAAGGCGATT CAATCAAAAG CAATCACCTT 
CCAACTGAAT TGTCATTTTG TAAAATAAAA TAATCGATCC AATCGTTATC TAATTCATAA 
ATGTGTAAAC ACATACGTTA TGAATGGATA ACGATTTTTT GTTATGTTAA AGTGGTACAT 
TAATCATGTA TTTCGTATGA TAATTAACGA CAAGTGTAAT GGTTAAATGT ATTTTATGGA 
TGGAAATGCn ATAATAGGCC TGGGTTACCA TGGnGC 
(2) INFORMATION FOR SEQ ID NO: 1040:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1040:
AGCATTTTGT TTAAAAGCTG ATGAAGCAAT AATAAACTCT TCATATCCTC TGTCCATTAC 
AATTTGACCT TGATATACAA TTTCTTTAAA GTTTTCGATT TCTTTAAATT CTCTGCTATC 
ATTTAAAATA GGTGCATTCG TAATAACATT CGCTTCCTTC TTATATCCTT TAGATTGATA 
ATATTCTTTT GCTGCATGAC TTACTGTTAC GAAGGCATTA ACACGATGTT TTACTATGTG 
TTTTTCTATA CTTTCTACAA ACTTTGAAAT AAGTGGAACT TTATTAATAA AGGCATTTTT 
CGCATATATT TCATGCGCAT CATAAACAAT ATTAGCTTTT TtATAATTGC TTAAATAGAC 
CATTAATAAT ACGTCGAAAT CATTTGCATG AATCACGTCA 
(2) INFORMATION FOR SEQ ID NO: 1041:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 268 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1041:
ACTTGCGTTT CTATTTAGCT CAGACAGGAA AAATATTCGC TGGAAATATG TAGGAATTTT 
ACTTGCAATT CAGCTTATCT TTGCATTTAT CTTGCTTAAA ACTACAATGG GAATCAAGGT 
TATTGGTGGT ATTTCAAACG GTTTTAACTA CTTATTATTA AAAGCAGCAG ATGGTGTTAA 
CTTTGTATTC GGTGGTATTC AATATATCGT CCCTAAGCAC CCACCCATTC CTCCTCCGGG 
GAATAATAAC CAACnGAATT AATTCCGn 
(2) INFORMATION FOR SEQ ID NO: 1042: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 277 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1042:
GTGAAGATAA GAAATTAAAG TTAAGACAAG GTTAAAAATA AGCAAAAAAT ATTGAAATTA 
CAGATCATTG GAnTGGTATT CCAGAGGAAG ATCAAGATTC ATTTTTGATC GCTTTTATCG 
ATGGATTAAA TCTCGTTCAA GAAGTCAAGC GGTAATGGAC TCGATTATCT ATTGCTCAAA 
AAATCATCAA TTAAACGGnG ATCGATTAAA ATTAAAGTGA ATTAACAAGG AGCAACGTTA 
AATCTATTTT AATCAGACTG AGACGTCATC AAGTCAT 
(2) INFORMATION FOR SEQ ID NO: 1043: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 379 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1043: 
TTATCATCAC AAGTGATTTT GAAAATAAAA AAATCGAAGA TGATGACAAT GATTCAGGTA 
GTGTGCCGAT ATTGCCGTTT ATTAAACGAG ACAAAGCATA TCAAATTATC CAGAATTTAG 
TACCAGATTT ATCTTATCAA CAAGTTAATG AAGGTATGCC TTTGTCAGGn TTCCACAGAC



AACCATTATT ATTGTTATTA CCGATATAAC CATAGTCATA TGGTTATTGT GATCCAAGGT 300 

ACCTTATGTC AACAGCGGGT TAAAGCAGAG ACAATACATC AGATTCGGCA TGAACGAAAT 360 

TATATTTAAn GCTnCTAAG 379 
(2) INFORMATION FOR SEQ ID NO: 1044: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1044:

ATATTGAACT TGAAGACCAT GCAGATATAA ATGAAATAAC ATATCGATTA TCAAAAATTT 60 

TCGGTATTAA ATCTATTAGT CCAGTATTAA AAGTAGAAAA AACAATAGAG GCAATAAGTG 120 

CAGGGACAAT TAAATTTGCG CCATTnTGAA GAAAACAGCA CATTTAAAAT TGATGTGAAG 180 

CGTGCCGATA AAAATTTCCC AATGGATACG TATGAATTAC AGCGTGAATT GGGTGGTGCA 240 

GTATTGAAGC ACTTCGACAA TATTTCAGTG aATGTCaAAC GTCCAGATCA TGgAAATTCg 300 

AGTGGGmAGT TtAGGmTTAG ATGCCAATTT TATATGGTAT GnAAGAAGTT GGTTCCGGGG 360 

TTCCAGGTGG GATTTACCnG TTGGGTACTG GTGGGTnAA 399 
(2) INFORMATION FOR SEQ ID NO: 1045: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1045: 

AnGATAAACA GAGGAGCACA AAAATGAATn nAAATATAAT CATCAAAAGT ATTGCGGCAT 60 

TGACGATTTT AACATCAATA ACTGGTGTCG GCACAACAGT GGTTGATGGT ATTCAACAAA 120 


CAGCCAAAGC AGAAAATAGT GTGAAATTAA TTACCAACAC GAATGTTGCA CCATACAGTG 180 

GTGTTACATG GATGGGCGCT GGAACAGGAT TTGTAGTTGG GAATCATACA ATCATTACCA 240 

ATAAACATGT TACTTATCAC ATGAAAGTCG GTGATGAAAT CAAAGCACAT CCTAATGGTT 300

TTTATAATAA CGGTGGTGGA CTTTATAAAG TTACTAAGAT TGTAGATTAT CCTGGTAAAG 360 

AAGATATTGC GGTCGTACAA GTTGAAGAAA AATCAACGCA 400
(2) INFORMATION FOR SEQ ID NO: 1046:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 626 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1046: 



ATAAACTTCT ACACCATCAT CTTTACCTAC AACATTTGTA GCTTTAACTT TTAGTCCAAA
GTTATCTTTA AAGAATTGTT CACCTACTTT TTCAAATTCT TTACGATGCT TCTTCACAAA
TTCAATCGCA TCTTTTTCTG CAGGTGGTTG AAAGCCTTGT CCTACATATT TTGAAGCTTC
CATTTCTTCT GGTACTGATT TTGTTTCTGT ATTTGTTTCT GTATTTGTTT CTGTATTTGT
GTCTTTTTTT GATTCATTAT TCATCGCTGA ACATCCTGAT AACAGTAGCG TTGCTATTAA
GATTAATTTC GCCTTTTTAA ACATAGCTCA TCACCCATTT ATGTGkTTAT ATAGAATATT
AAAAAGCATT TTAAAAAAAA TTTATTCATC TTTACGATAC ACTCCATATT CTATTGGATC
ATTATCATCA TAAAAGGCTT TTTTAGTATT AATAGTAGGC TTTCCTATTT GAATTGTAAT
TGTAGATTTT TCTGGCTTAT TTTTAAAATT ATATAGTTTA TCACTCAGTT CAATTACATC
ATCAACTGTA TTGTCTTTAG TAAAATTTTC CTTTGTACTA AACAATGTTG TTACTGTATC
TGTATTAGCA GCGnAATTCA CTTCTT









(2) INFORMATION FOR SEQ ID NO: 1047: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1047: 
GACGGCGTTC AAAATGCAAC GGTCAATTTA ACAACAGAGC AAGCTAAAGT TGACTATTAT 
CCTGAAGAAA CAGATGCTGA TAAACTTGTC ACTCGCATTC AAAAATTAGG TATGACGCGT 
CTATTAAAGA TAACAATAAA GTCAACGTCA CGCAACTGAA GCGTACAACA TAAATTGATA 
GnTATCATAT CAGCAGTATA TCTTACCACT ATAATGTAAT GTTGACACTT TCATATGCAT 
ATACAGACTA TTACGATCAT GTCCATTATT TAGCnCACTG TCATTATATG GTGGCATTAT 
GAGGGTATAA ACTTAGn 

(2) INFORMATION FOR SEQ ID NO: 1048:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 223 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1048:
ATACGnATTA TGCAAAAAAT AAAGTGAGAT TGGCATATGT GTAAATCTAA AATACTGTTG 
AAAAATATTT TTAGTGAAGT AATCAGAAGT TAAAGATTTA ACTGAAGTAA AAATATAATC 
AAGATTACGA GTCATTAACA TTTAGCTTTA AAGAGGGAAA CATATCAAAG TAGGTTAGCT 
AAGnAAAACA CCGACTAAAT CGGGGATATT TCGTGACATG TTG 
(2) INFORMATION FOR SEQ ID NO: 1049: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 608 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1049: 
AAATATGAAT AATGCAATTG ACCAACGTTC GATAACGTTG CCATACCCAT ATTAAATTTT 
GTATAACCTT GTTCTTTACT CCAAAGTAAC ATATGCAAGT ATAGACCATC CATTAATGGT 
AAATCTAACT CTGGCAACCA TCTAATTAAA TCGACTGAAA TGGCATCATT AAAGTATGTT 
GGCATTAAAC TACAAAATGC AATTACTTCA TTTTCTTCAT TTCGCATTAC ACCAATTGGC 
GCTTTAGATA AGTATTCTTC ATTAAATnCA CCAACAGAGA AATGCATTTC CyGACGATTA 
TCTAGCCaTA AATcACTTAC ATGTTGAAGt TCATTTAtAA ATTCAGTTGr AAACGGtGGt 
TCAATAATTT CGAACGAAAT ATTAAGTTCA TCGAATTnAT TTAAAGTCGC TCTAAATCCC 
ACGGCGTTTT TTACCTGAAG TTGAAAATTG CGTTAAATCA ATAATTGCTT CTTCACCTAA 
TTTGAAAAAT TGGTTACCGG nATTATGATA TAAAGGCATG TGTTGATCTG TAACTTGATA 
GAATATAACA TCATAGCCTA AATACTCAGC GTAATTATAG AATGCTTCTA ACAATTCATC 
AAAGGCAT 

(2) INFORMATION FOR SEQ ID NO: 1050: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 292 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1050:





TGTTTTTCTA CGCTAAACTA TATGATAAAT TGCCTATGTA TTTACTGGTG TTTATGGCCT 60

TTACAGCTGT AATTTTGATT ATGATGTACA TACAAGAGAA AAATGAAAAT TACAAAGTTG 120

AAAAAAGATA TGTGGTTAGA TATCTCACAC TTAACGTTAT TGTGGGATAT ACTTTGCCAT 180

TGCTTTTTGT ATCTATTTAC GTTTTTGGTG TAGTCGGTTT TGGATTTGAT GTTTTCAATT 240

ATTGTCTAGG TAkTATCTTG ATGTyATwTA TTTCTTGGTT TAGGTnTATT nT 292




(2) INFORMATION FOR SEQ ID NO: 1051: 










(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 734 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 








(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1051: 








TAAATCnTCA AAGATTTCTT CTAAAACATT ATAATCTTCA ACGATACTCG AATAATAATA 60

CAGCTCATCA TTTCTGGGCT TGGTATAATA AAATATAACT TGCCCACCTC CATCGTCCAC 120

ATATGCAATT ACATATACCT TTTCCCACTC TACTGGTATC ATTTCATTGA TCTCATTTGC 180

AATTTTATTG TACATTTCAC TTAATTTTTC TTCGAAATTC ATGTTATCGC CTCTATTGTT 240

CAGCTTCTTC TTGCTCTTTA ACATATTGCT CGATTTCTTT AACTTTATTA ATTTCATATT 300

CCGTTTCTGG TAAAATTCCA AATTTTCTAT ACTTATAGTA ATTTTGTCGA CCTACTTGAC 360

CAAATTCTGA ATTTATCCAA TCAATATAAT CAAATGAAAC TTTTAATTCA CCTTCTCTTG 420

TAAAGTCAAA TTCGCATGAT GTCCATGGTT CATGTCCTTC TTCTTTAAAT AAATTCCTTA 480

AATTTTGAAA TTGTTTATAC AACTCATACA CTGAaTCCAT AAATTCCGAT TCCGATATAT 540

CATATTTATT TAATACACTA GTATAGTAGT ATAATTCATT ACTTCCAGGT TCTGTGTAAT 600

AGTAGAACAC TTCTCCTCCT CCGTCATCTA TATATGCCAT TGCATATACC TTTTCCCACT 660

CTACCGGTAT CATGCTGCTA ATCTTATTCG CAATCTCGCT ATACATTTCA CTTAATTTTT 720

CTTCGAAAGT CATG 734



(2) INFORMATION FOR SEQ ID NO: 1052: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 775 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1052:

ATTTTTAACC AAAATTGCTT TTTTATCTGA AAAGAAAGGC AATGTTAATG TTTCTTCAAC 60 

AATTGGTGCA ATCTCTGTTT CGTATAAATT ATATTTCACA AAGTTAAAGT CATCTCTATC 120 

ACTTTTCAAA AATTGTGATA TGATTTCTGC ACTTTGTTTT TCAACCAATT CAGGCACATC 180 

TCCATAAATa GCTACAATAT TGTCGCTCAT TCATTACACC CTTTCATTCG TTTGACACAT 240 


CAATAGATTA TATCATGTTT CTGTATCTAT AAACCACTTG CATTTCCATA AGAGCTTGAA 300 

TCAACTTTTA AATTATCATC TAAGTCAATT GTAACTTGAC CGTTTtGTTG ACTATTGTAA 360 

ATGCGtACTG CGAATCCTTT GCAATCGTTt aACAACTTCT ATATTAGGaA GwTGATACAT 420

ATTGTTCTTC CCAGAAGAAA TCAAACTTAT TTTAGGCTTA ATCATCTCTA TAAATTCTTT 480

AGAACTACTT GTCTTGCTCC CATGATGTCC TACTTTTAAA ATATCAATCT CCGGCAAGTT 540

ATATTTTTTT AGTAGTAAAG ATTCATTATT TTTACTAGCA TCGCCCATTA ATAAAACTTT 600

TTTATTTTGA TATGTAATCA TAGTAATAAT CGAATACTCA TTTTTATCTC GGCTATTTGG 660 

AATAAAACTA TCAAAAAAAT AGAAAACTAC TATCTCCAAG TTTAAAACTA CTAACTTGTC 720 

TTACATCCAT AAGTTTAATG GTTGTACTTA TGGGCTTAAT TTCCGATAAT AACCA 775 

(2) INFORMATION FOR SEQ ID NO: 1053: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 234 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1053: 



GAGACCAATC ATCATAATGA ACGACAATCA TGTATTTAAT ATTTAATAAA ATACTGAATA 60 

ATGCTATAAC TGCCATTAAT TGAAGACCAA TCGCATCTAA TGCGACAACA CGATCGGCAA 120

GTGATGGGCC TAGCACAACG CGAATGAGCA TAGCTAACAT AGGAnGTGAC AACTATGATT 180 

AATGCAATAA CGATGAATAA CCTTGATGAn TCATTATAAT TCGCCCACCT CTCT 234 




(2) INFORMATION FOR SEQ ID NO: 1054: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1054:

CTACTGGTAT CATCCACTGA TTTCATTTGC AATTTCATTG TACATTTGAC TTAGCTTTTC 60
TTCGAAAGTC ATGTTATCGC CCCTATAGTT CAGCTTCATC TTGCTCTTTA ATATATTGCT 120
CGATTTCTTT AACTTCTTCC ATTTCGTATT CCATTTCTGG TAAAACACCA AACTTTTTAT 180

ACATATAATA GTTTTCACGG CCTAATTGAT CAAACTCTGT ATTTATCCAA TCAATATAAT 240 
CAAATGAAAC TTTTAATTTA CCTTCGCTTG TAAAGTCAAA TTCACTTGAT GTCCATGGTT 300 


CAAGCCCTTC TTCTTTAgAA AGTTTCTCTT AACTTCTTAA ACAATCTATA TAAATCCGTC 360 
CACAAATCAT AAAATACTTT TTCAGAGACA TTATACTCTC 400 
(2) INFORMATION FOR SEQ ID NO: 1055: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 333 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1055: 

GnGGCATATA TAGCGTTGGC GGTATAACAA AGAAAAATGT GAGATCAGTG TTTGGATTTG 60

TAAGTAATCC AAGTCTACAA GTTAAAAAAG TTGATGCTAA AAATGGCTTT TCGATAAACG 120 

AGTTGTTTTT TATTCAAAAG GAAGAAGTAT CATTGAAGGA ACTGnACTTT AAAATAAGAA 180 

AACTCTTAAT CGAAAAATAT AGATTGTATA AAGGAACGTC TGATAAAGGT AGAATTGTTA 240

TCAATATGAA AGACGAAAAG AAGCATGAAA TTGATTTAAG TGAAAAATTA AGTTTTGAAC 300 

GTATGTTTGa TGTAATGGaT AGTAAGCAAA TTA 333 


(2) INFORMATION FOR SEQ ID NO: 1056: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 347 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1056:

TGGGCAGTTA CATTTATGAT TATCAGCACA TGTTTAGCTT ATAAATTTTT ACGAAAGTTT 60 
AGCCGTTTAT AAAGCACATG CATAATGAAA CGAGTATXTG CCACTTGATT AGTACTTCAT 120 


TATTATGTCG AAAATAAAAA TAAGTGGTAT TTTTAATATA TTAAGAAGCA CTCATAATCG 180 
GCTGTTAATT AATAATATTT TTCATAAGTA TTGATTCATC ATTTTCTTTA TGTTAAATAT 240 

TAATACAGAT ATCAATATGG CnAAAGTGTT nnATGAATCA GAGGTAA 347
(2) INFORMATION FOR SEQ ID NO: 1057: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 312 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1057:

TCCAAAACGG AGAGCCAAGA GTTAACTCTA CTTATATAGG ATACGCGCCA ATTGATGATC 60 

CAAAATTAGC GTTTTCAATT GTATATACAA ATCAGCCTGT ACCACCACCA TGGTTAACAG 120 

GTGGAGACTT AGGTAGAGAT GTAATTAACT ACTACTTTAA GCAGTTAGGT AAAGATGATA 180 

AAAATAAAGA CAAAGACAAA TAAAATTTAA CCTGACGATT GTGTAGCGCA TGGTTGTAAA 240 

ATTTTAACTT TGCmAATATT ATAGATGTTG GTATAATAAT AAAGTCGTAT TtAGaAATGA 300 

TwAGGaGGGA AT 312 
(2) INFORMATION FOR SEQ ID NO: 1058:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 404 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1058: 

GCCATTTTGC GATGCTAAAT CCATTGAAAA TGCAATACCT GAAAGTCCAC CAAACAAAGA 60 

ATTGTTATAT TGACCACTCT GTAAATATGG TGCTAATTTC ATGATGTACT TATGCACAAT 120 

TTGCTCAGTA TTAATATCGA AAACTTTTtG AtAAGCATCT AAAAACAAAA TGATACCTGG 180 

TATACCATGT GATAAAGTTG AAGGTTCAAA ATAATCGGTT TCAGTAGATG CTTTTGAAAT 240 

GAAATCATCT ACTTCTGAAA TCTCTTGAAA CTTCTTCTTC AATATTAGAT TCATATCAAT 300 

CATTTTtGTC ACCACACCAA TGCTTTTGAG TTTTAACAAT TTCTTTTACA ATAGATAAAA 360

CAAATGTTTC TTGATCCTTA TCAATGCCAA ATATTCGATT GnAG 404 
(2) INFORMATION FOR SEQ ID NO: 1059: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 233 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
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(Xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1059: 
ATGTACATAG GnATGACATA AGTGTGCCTT CTTTTTCAAG ACCCTCCATA GAAATAATAC 
ATGGATGTGC ACCACGnTAC TACCTCTTTA ACGTTACCAG AATTGATAAA TTAACTTTCT 
CTTGTGTTGC TAAACCTACA ACTGGTGTAC CTTCTTCGAT TAAGGCAATT GTACCATGTT 
TAAGTTCTCC ACCAGCAAAA CCTCGCTGAA TGTAGAAATT CTTAAGTTTA AGT
(2) INFORMATION FOR SEQ ID NO: 1060: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 295 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1060: 
GCACCACTAC TGTTAATAAC ACACCTAAAA AAGTCGAAAT ATAAGGGTTA ATCGCTGTTT 
TAAATGTATA TCCTAGTGCG ATTGCCATTA ATTATCCGAT AGCTTTTTCT GCTGCTTCCA 
TAGTCATCTC ACCTAATGTT GGATGTGCAT GGATTGTTAA TGCGATATCT TCAGCATTCA 
TACCAGCTTC AATTGCTAAA CCTAATTCAG AGATAATATC TGATGCACCA GTACCAACTA 
CTTGAGCACC GATTAAAGTA TCATCTTCTT TAGTGTAATA AGTTTAACAA ATCCG 
(2) INFORMATION FOR SEQ ID NO: 1061: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 413 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1061: 
CAATGCCTCT CGTGTTGCAG CTTCTTCCGG TGAAAAACCA ATATTTTCAA GTGCAATAAT
TTGATCTAAG TCCTGCATGA TTACTTTTCT TAATACCATA AAAACACTTC CTTAGCCCTA
TATATCAAAA GTAATACCTT CATCTTTATA TTCATTATTT TCAAACTACC TTCACTATAA 
AAAACGAGTC TTTTTATTGA AAGCCATTCG CCTTAATCAA TTAGACAAGT TGTATkATAC 
GAATTAGTAA ATAATCATGA TTATAATTCA TTTTAAGCAT ACTCAAAAAG ACTGGTACAT 
GTATACCAGT CTAATTCGAA AAAATATATT TAGTTAAAAC CATTTCAAAA ACGACTTCAC 
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(2) INFORMATION FOR SEQ ID NO: 1062: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1062: 
AAGCTGCATA CTCTCCAAAC TTAGAACCAC CGAAGCGATT CACAAAATAT TTATTAGCTA 60 
AAAAGTCGGC ACATAAAATT AATATTGTTA GCAATGCCAT AGATACATAA AACACCCATG 120 


ATAAATGCTG ATTATGAAAG CCAAATTGAT AGATTAAAAA GCCAACCCAT AATACTAAAA 180

CAGAAGGAAT AATCGGCTTA ATCAACCCAA CAAATGCTAA CATGAAGGCA GCGATGATAA 240 
GTAGCCATAA AATAATTGTC ATGTTGATAT CACATCCTCT TTTGTATTTT TTGATTTTTA 300

GTAAAGAATA TAAGTATCAT ACCTAACATG GTTGATGCTG CTGGATACCA AAAnATACAT 360 

TTTCCCAAAC CTTACCCAGT GGACTCCTAG GCACCACCTT 400

(2) INFORMATION FOR SEQ ID NO: 1063:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1063: 

TGGATCAGAA GGAATCTGGA TGCCTATTGA TGACAAAGGA TACTTTAATT TTGACTTCAA 60 

AACGAAACGT TTCGATGATT TAGAGTTAAA GAAAAATGAT GAGATCTCAT TAACATTTGC 120 

ACCTGATGAC GAAGATGAGG CATTGAAGTC ATTAATTTTC AAAACTAAGG TAACGAGTTT 180 

AGAAGATATT GATAAAGCAG AAACTAAATA TGACCATACT AAAGTGGAAA AAGTAAAAGT 240 

ATTGAAAGAT GTTAAAGAAG ATTTACATGT AGATGAAATT TACGGAAGCT TATATCATAC 300 

AGAAAAAGGT AAAGGTATTC TTGATAAAGA AGGTACTAAA GTaATTaAAG GTAAGACTAA 360

ATTCGCAAAT GCAGTTGTGA AGGTAGACTC TGAACTAGGT 400 
(2) INFORMATION FOR SEQ ID NO: 1064: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 434 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1064: 

GTGGTCGTGA AGCAACTGAT TTCAAAGAAG AAGATATCGA GCTTTTCAAA GACATTGCCG 60

ACAAAGTAAA ACAAACAAAT AGTTATGATC TAGCGTTTGA TGAATTAGAA AAAGAAAAAG 120

ACTTCCTGCA AGTCATTGTC AAAAACGATG ACAAAAACTT ACCTACTAAT CAAAATGTCG 180

CTCAACTAGT AGAAGATTTA CGCCTAGAAA TCCAGAAAAT GCGCGAAGAA CGTCACCTAC 240

TTGGTCAAAT GATGAATCAA GTACATCAGC AACAACAAGA ATTAAAAGAA CTTCAAAATC 300 

AACTTACATC TAAAATCGAT TCAAATAGCG AATCCTTAAA AGCCATCCAA ACATCACAAG 360 

AGGCTATCCA AGAAGCGCAA GCCTCTCAAG CAAAAGCATT AGCTGAATCC ACCAATAAAG 420 

TTGGAAAGAA TGCT 434 



(2) INFORMATION FOR SEQ ID NO: 1065: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 239 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1065: 

TAAAACATAT GGCTTGATAT TGCAGAAGAA TTGAAGTTAT CGTACCAGAA ACGGTACATA 60

ATCATGATTA TATGGAAACG AAAAAAATAA AAATGGGTCA TTTAATATAG GAGGATTTAA 120 

CATTGAATTT TGAAGGTAAn TAATTGGAAA AGATTTGAAG TTGCATCGTT ATTATCGTTT 180 

AATGATTTTA TCACTGGAAG ATTACTTGAA GGTGCAAAAG ATACnTTGAT TCGACATGA 239



(2) INFORMATION FOR SEQ ID NO: 1066: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 202 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1066:
AAATATCTAT AGACCTTTAT TCGGAAATGG TTCGTCTCCA GAATGGAAGG ATGAAGTACC 60
GAGTATTGAA AGGTAGAAGG TCTCCAAACT TTACCTGnAC GTGCACGAnA TCTAACTGAA 120

GATGATTTAG CAATTGAATT GAAACAGCCA ATTGTCGGTT GTAATAACTT AGGAGAGTTA 180 

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(2) INFORMATION FOR SEQ ID NO: 1067: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 749 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1067: 
CTACAATTCG ATCAACGCCG AAGTTAGATA CACCGATTGC TTTAATTTTA CCATTTTCTT 

TTAACTCTTC TAATGCACGC CATGAGCCAT ATACGTCATT GTAAGGTTGG TGAATAAGCA 

CTAGATCAAG ATAGTCCAAA TTTAATCGTT GTAATGATCG TTCGATTGAT TTGATAGTAT 

CTTCATAGTT AACATTTTCA ATCCATACTT TAGTCGTAAT GAATAGTTCA CTTCTATCAA 

TACCTGAAGC TTCAATACCT TGACCTACTT CAGTTTCATT TAAATAACTT TGTGCTGTAT 

CAATATGTCT ATAACCCGCT TTAATTGCTT CTTTAACCGC TTCTGCnGTT TGTTCTTGAG 

GAATTTGAAA TACTCCGAAA CCTAATACTG GTATCTTTAC ATCTTTACTT ATTTCAATAT 

GATTCATAAA ATGAACCTCC TTTATCTtGT ATGTCCACTA TAAACCATGG AGCGAACTCT 

ATAGCAAGCC ATAGGTTTAC TTTCTCTCCC AAAGACGTTC AACATTGTCA TCATAATTAT 

CACTAGCATT GATTTTAGCA ATATGATTAT CATAATTATC AATTTTATAT TGTAGTAAAT 

CTCTCGCATC ATGAATTGTC TTTAAGTTTT CATTTAATTC TTCGAGTTGC TTATTTAAAA 

TTTGCTTTTG CTGTGCTTGA ATATTTTCAT TTTTAGGCAA nTGCGCTAAC CTGCAAAATT 
CGATAAGCGA TTCAATACTG ACGCCTGCA 
(2) INFORMATION FOR SEQ ID NO: 1068: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 436 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1068:
TCCTCGAGAT AGACAAGAAT GGTGGTACAA TGTTTTTTAT GACGATGGCA AATATnATAA 
GAAnATAA'lT AAGAATGACA TGTCAAAAAT TAGGCGTAAT TTTTAGATTT ATCTATATAT 
TTATGCCACC GCTCAAACTT TTATnAAAAG TAAAAAGACA ATATTTATAT TGAACTTGTG 
ACGTGAATTT CTAAAACACA ACCAACAATC CACAAGGAAA CTAGTAAATA AGTTTAAATA 
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TACTGACTAA AAAGATTTTG AAAGTGATTA AGTATTTAAA ACACGCCAAT CTTAAACCGC 360 

TATTGACAAA TATGAAGCGG AACCACCGTT GTnTTTCTTT TnCAGGTCCA ATTGAATAAA 420 

TGTTGATGAA TAGAAT 436

(2) INFORMATION FOR SEQ ID NO: 1069: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 551 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1069: 



TTCTCAAGTA ACAGGTATTA TTGTTACTGG TACAATCAAT AGAGAAAAAT ATGGCATTAA 60 

CTTTAACCAA GCACTTGAAA CTGGTGGCGT AATGCTAGGC AAAGATGTTA AATTCGAAGC 120

ATCAGCTGAA TTCTCAATCT CAGAATAATC TCATTACACA ATCCTCATTG TTTTAATATA 180 

TATACGAAAT GCCAACTATA TCATCCCTAG GTATAGTTGG CATTTTTCGT TTAACTCATG 240 

TGTAACAAGC ACTAACTGTT CACCCTAATT TTAGACGCCT TTCATTCAGG TAGGTCTTAT 300

TGAAACTGAA AACTTTGaTG ACCTTTTGcA AaGCCATTAA CTGTATAAAG CATAGATCCT 360 

CCGCCCATTT CTATATCATT GGAACAAATG ATGAGTTGAT TTGTTCCAGG TATAAATTGC 420 

GGATGAGTAG AACGTAACAT ATGCCCTTCA TCTCGGCCTG GTATCAATAT TTGTCCTATT 480

GGATAACCCC TTTTATTAAA AACTAACACT CGACCTTGGA CCATACATTG CTACGGATAA 540

ATTATCGGCA C 551



(2) INFORMATION FOR SEQ ID NO: 1070:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 325 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1070:

TATnATGGTA ACGCATGAnC CTGTTGCAGC ACGCTATGCA AATCGAGTAG TGATGCTAnA 60 

AGATGGTCAA ATTTTCACTG AATTATACCA AGGGGATGAC GATAAACATA CCTTTTTCAA 120 

AGAAATAATA CGTGTACAAA GTGTTTTAGG TGGCGTTAAT TATGACCTTT AACGAGATAA 180

TATTTAAAAA TTTCCGTCAA AATTTATCAC ATTATGCCAT CTATCTTTTT TCGTTAATTA 240 



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CAGAGTCATA TCCAATTATA CnGGC 325 
(2) INFORMATION FOR SEQ ID NO: 1071: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 771 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1071:

TAAATGCCGC AATTGGATAG CTTTAATATA ATTAATAGTC AAGGTATTnC CAACAAnAAT 60
ATATTATCCA CCCGAAGGAA GCGGGGTATn TTTGTTACTG ACGTCCCACA AGAAGCACAn 120
ATATTAGATA GTGCTTACTC TGAAATTAAA AAATTAAGAG ATTATGATGA AAAACTAATT 180
ATCCCAGGAT TTTTTGGCGT ATCTCATGAA GGATATATAG TTACATTTCC ACGTGGCGGA 240
TCAGATATAA CTGGCGCCAT CATATCAAGT GGTGTTAGAG CCACAATTTA TGAAAATTTC 300
ACCGATGTGT CTGGTATTTA TAAAGCTAAT CCAAATATCA TCAAAGATCC TGAATTAATC 360
GAAGAAATCA CTTATCGAGA AATGCGTGAA TTATCTTATG CTGGTTTTAG TGTTTTCCAT 420
GATGAAGCCT TACAACCACT TTATAAAGAT CGCATCCCTG TGGTGATTAA AAATACGAAC 480
CGCCCTCAAG ATAAGGGTAC ATTTATAGTG CATGATAGAG AAATAAATGC GAAAAATGTG 540
ATTAGCGGTA TAAGTTGTGA TAAAGATTTT ACCGTTATTA ATATCAAAAA GTATTTAATG 600
AATAGACAAG TCGGCTTTAC TAGAAAGATA TTAGGTGTCT TAGAGGATAA TAATATTTCA 660
TTTGATCATA TGCCTTCAGG TATTGATACT ATCAGTATCG TTATGCGTTC AAAACAAATT 720
CAAAATAAAG AAACAAAAGT ATTAAATGAA ATTCGTCAAA AATGTGATGT T 771
(2) INFORMATION FOR SEQ ID NO: 1072:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 237 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1072:
CGGTGGATCC GTTTGATCTT GAGTGAGGTG CCAATGAAAG TTATGAGCCA CGTTGTCGCG 60
CGCACCATAT CGTAGCACCT AGTGATAATA ATAAGGAGGA ATTATAAGTG TTTGATCAAT 120
TAGATATTGT AGAAGAAAGA TACGAACAGT TAAATGAACT GTTAATGACC CAGATGTTGn 180
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(2) INFORMATION FOR SEQ ID NO: 1073: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 733 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1073: 
CTTCTTGTGA GGGCGTTACA TCACCGGATG GATGATTATG AACTGCGATG ATTGCATTGG 
CATTTTCTCT CACCGCAATA CTAAAAATTT CACGTGGATG TACAATCGAA CTATTTAATG 
TACCTTTAAA AACACAGGTT TCTTTAATCA CTACATTTTT TGAATTTAAC AATAAAATGA 
CAAAATGTTC TTGTGTTAAA TCTTTCATTG TTGGAATCAT ATAATCAGCA ACATCACTTG 
GTTGCGTTAT TTkTATACGA TTATTTTCAG CTCTTCTCCC CATCCTTTCC CCTAACTCAA 
ATGCTGCTTT TAAAGTAATT GCTTTTTGTA ATCCAATCCC TTTAACTTTT ATCAAATCGT 
TAATTGAAGA TTTTTTCAAT TCATTCAGAT TcGAAGCAGA TTTAAGCAGT TCATTACTAA 
TGTCTATGCT CGAGAATCCT TTTCTTCCGG TGTTAATTAA TATAGCTAAT AATTCTGTAT 
TCGAAAGACT TTTTGCACCA TGGCTTAACA AACGTTCTCT TGGCATTTCT GAAGTTACCA
TTTCTTTAAT TTTCAAAAAT ATACGCCTCC TAAAAATTGA TGGATATCAT TATAAAAAAG
TGAATTGATA AAAAAGGArA TAAATATAAA TGGAACAAGG GGTAATAGTT TAATCGGCTT 
AAATATCATG GTAATTAAAG CAACTAAACC AGCAAT5ACA AATGTAAATA AnATGACATA 
AATAGTGAAT TGG 

(2) INFORMATION FOR SEQ ID NO: 1074: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 768 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1074: 
ATGGTTATTT TTTATGCTCT AAATCGTATG TGTTTATAAG GTTCAATTTC TTGCCATTAT 
TAATTTTGCC GAGCAGTAAT GGGTTGTTAA AGTATCCATC AATTATGCTT AGCAATATTG 
AAAAATTTAA ATGAAAAAGC GGTTCAATTC TTGAAAAATA GTGTATATTT ACATTTTGGT 
AATAACTAAC TATAATTTAC ATAAGGAGGT TGTCATTTGG TAGTATTGTT ATCTTATAGT 
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TATTTACATA GAAGATCGAA ATGTGATTAT TGTAATTCGT CACTCAAAwG GTATGAATTA 
ATGCCGATTA TTAGTTTTTT ATTATTAAAA GGGCGATGTC GAAACTGTCG AAAGCGTATT 
TCCCTAACAC ATTTCTTAGG GGAAACCTTT GCTTTAATAC CTATCGTCTT TATTAAGTAT 
GATTTCACAT ACGTAAATGt ACGCTATTTA TAACTACATA TGTTtTTCTG CTTATTTTTA 
CTATGACCGA TATCACTTCT TTAATGTTAG ATTGTCGCTT AATTATAATT TATTGTATCG 
TTTCTCTCTC GTTAAGTATG ATTTATCCAG TAGCTTTTAT CATTATTAGT ATGACCACGC 
ATATATTCTA CTTTTTATTT CGGGCATATA TTGGTTATGG TGACGTTTTA CTAATATCTG 
CACTTTCTTT GTTTTTCCCC TCTCCAATTC ACTATTTATG TCATTTTA 
(2) INFORMATION FOR SEQ ID NO: 1075: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1075:

TCATATTCAT TTAAAATACC ATTCATCTCA GCTTTTCTCA TTTTCTCTTT ATAAAGTTGT

ATTTCGTGTC TTAATTCTGG CTCAGACATC TCACTCAATT TCTTCTGTTC CATCGGCAAT 

ACCACTTTCT TCTAATTTAG CTTTAATTTT ATCATATTTA TATCCTTTTC TCATAAGGCC 

TTCGATAGTT TTTGAAATTA ATTTCTGTTG CGTGTACTTC TTTCGATTTT TATTATAAAT 

TTTTTCTAAA TCTCGTTGTA ATAA&TCGTC TAAAACCGCT TCATCTTGTG TAAAATCCAT 

TTCATTCAAT ACAGCATGAA TGGTTTCCAT TTCAAACCCT TTTTGAATTA AAGATTGCAT 

TACTTTTGCn TnAACTnTAT TTTGTGGCCC nTTTTTTGTT 

(2) INFORMATION FOR SEQ ID NO: 1076: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 753 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1076: 
TTAGGTCAAA ATATGTTTTA TCCAAAAGGC ATTTTATCTC AATCTGCTGA AGCAAAGAGT 
ACAACATATA ATGCAACTAT AGGTATGGCG ACAAACAAAG ACGGAAAAAT GTTTGCATCA 
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CAAGGCATCG AAGAATTACG TGATTTATGG CAACAAAAAA TGTTGCGTGA CAATCCAGAG 240 

CTATCAATCG ACAACATGTC ACTACCAATT GTTACGAATG CATTAACACA TGGTTTATCT 300 

TTAGTTGGCG ATTTATTTGT AAATCAAGGT GACACTATCT TGTTACCAGA GCATAATTGG 360 

GGTAATTACA AACTTGTTTT CAATACGAGA AATGGTGCAA ACCTTCAAAC ATATCCTATC 420 

TTTGATAAAG ACGGGCATTA TACTACTGAT TCACTTGTAG AAGCTTTACA ATCATACAAT 480

AAAGATAAAG TCATTATGAT TTTAAATTAT CCTAATAaTC CGACAGGTTA CACACCTACG 540

CATAAAGAAG TGACTACCAT CGTCGATGCA ATTAAAGCAT TAGCTGATAA AGGTACAAAA 600

GTTATAGCTG TTGTGGATGA CGCATACTAT GGTTTATTCT ATGAAGATGT GTATACTCAA 660

TCATTATTTA CTGCATTATC TAATTTAAAT TCAAATGCAA TATTACCTGT TCGTTTAGAT 720

GGTGCAACAA AAGAATTTTT CGCATGGGGA TTC 753
(2) INFORMATION FOR SEQ ID NO: 1077:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 405 base pairs

(B) TYPE: nucleic acid
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 






(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1077: 

AGAAGCGATT TGACaATAAA ACAATTGAAA AACTTTTGGA AAGCAAGTGG TGGGAGAAAA 60 

CGCCTGACAA ACTAAAAGGA TTTTCGGTTG AATATTTAAA TAAAAAGGAT ACTTAATGAT 120 

ATGAGAATTT TAAATATTGT ATCGAGTAAT ATTGTTCAAG ACCCAAGGGT ACTTAAACAA 180

ATAGAAACAA TTAAAGGCGT TACGGATGAT TATAAAATTG TTGGAATGAA TAATTCACAA 240 

rCTACTAATA AGCGATTGGA AAATTTAGAT TGTAATTATC GTTTGTTAGG TAGCAAGGTA 300 

GATCCAAAAA ATATTCTTTC TAAATTAATT AAGCGTATAA GATTTGCaAC AGGTGtTATC 360

CGAGAAATTA AAGCTTATAA AcCTGACGTG AtTCATGCaA ATGAT 405 
(2) INFORMATION FOR SEQ ID NO: 1078: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 504 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1078:
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CATATTCACA CCTCAATATT CCATAACTTT TACTTTAATA TCAATCCACT TAAATCCATT 
CACATCATTG AGTATAAAAT AATATGGTGA TATACGTTGA AGTTGATTAT CTTATATATA 
AGTATACTTC ACAGAATTTA TAAAATTGTT TAGTCATCAT CTTCAGTGTT TAATTCAAAA 
ATATAGAATC TACTGTATTG CATmATAaAT TGAAATGCCA CAAATTGTGC TCCAGCGATA 
ACCAACATGC TCACAATAAT TATAAACTTA GATTCTACGA CATTAGGmAC CAtATATATT 
CGATAATTTT TTCAATATAT ATAAATGAGC TCAACGCAGT TAATAATAAA CCGAAATGTG 
TTTTAGTTTT ACCACCCCAA CGTTTTGTTA CTTTAGGTAA TTTTAATAAC GTGAACATTC 
CGCCAATTAC TAATAACAAA TAAC 
(2) INFORMATION FOR SEQ ID NO: 1079: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 545 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1079:
CATTCTTAAT CAACTTCTTC ATAATGGGAA TTTGGCATGG TAnCGAAGTG TATTACATTG 
TTTATGGTTT ATACCATGCA GCATTGTTTA TAGGTTATGG CTATTATGAA CGTTGGCGTA 
AGAmACATCC GCCACGTTGG CAAAATGGTT TCACAACAGC ACTTAGCATT GTGATTACAT 
TCCACTTTGT AACATTTGGC TTTTTAATCT TCTCAGGTAA ACTTATATAA TAAAGGAGAA 
TTTAATTATG GAATTTAGAG AACAAGTATT AAATTTATTA GCAGAAGTAG CAGAAAATGA 
TATTGTAAAA GAAAATCCAG ACGTAGAAAT TTTTGAAGAA GGTATTATTG ATTCTTTCCa 
AACAGTTGGA TTATTATTAG AGATTCAAAA TAAACTTGaT ATCGAAGTAT CTATTATGGa 
CTTTGaTAGA GATGAGTGGG caACACCAAA TAAAATCGTT GAAGCATTAG AAGAGTTACG 
ATGAAATTAA AACCTTTTTT ACCCATTTTA ATTAGTGGAG CGGTATTCAT TGTCTTTCTA 
TTATTACCTG CTAGTTGGTT TACAGGATTA GTAAATGAAA AGACTGTAGA AGATAATAGA 
ACTTCATTGA CAGATCAAGT ACTAAAAGGC ACACTCATTC AAGAT 
(2) INFORMATION FOR SEQ ID NO: 1080: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 267 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1080: 
AGATGCCAAA ATCAAAACCA ATAATTTGGC AACAACATAT AAGTCCACTT TACCGGACCT 
nTAAACTTAT CGTACTGTTA AGAACCAAAC GACTGCCATT ATTTGGTCCT AAAGCCACGA 
TGGTGGTAGC CnTTGTAATA ATGCTGGCGT AGAAATTAAC CGCCACCCCT AACACCGAAC 
TATAACGCGG ATAAACCAAT AAAATTATGA TATAATCATC CGTTAAGTCA ATCCATAGTT
TAACATCCCT TATGTTTAAT TTACTCT 
(2) INFORMATION FOR SEQ ID NO: 1081: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 441 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1081:
ATATTTGTCC ACTATTAGAA GATTGTAGAG AAGGACAAAA ACGTTATAAA GCTAGTTTGA 
AAGAAGCGTG ATAACATGAC AAATATTACA AAAGAGGTAT TTGATAATTT AGAACAAGAG 
ATTGATTTAT TTGCCAAAAA TAAGACATTA GGTTCTAGTG AAGCAAAGCC CTACTTGGAT 
GAATACCATA GTAAAATTAT TGATTATTTT AAGCAGGTTA ATGACATAAC TGGCAATATT
GATTTTGATA AATTAAATCA ATATCCTGTT GTGCCAATGA ATTTTAAAGA AAGATATGAT 
TATATGATTG AACGTAAATA TCATTTTATG GGCTATCGAC AAATGAAGAC CTTTAAAACT 
GAATTGATTA AAATGnATGC TAGTTATCAA ACAAGGTTAA AGAATAAGCA GGTATnGACA 
AGATTGATCA TTTCCTAGTG T 
(2) INFORMATION FOR SEQ ID NO: 1082:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1082: 
AAAATTTCTT AGtTGATGGT TATTGTTGTT GATTTGAATA ATTTGTAGCG AATATTTTGT 
AACAGGGCTA CTAAGAATTT TTATAACAAG AAGTGATTTC AATCGTACTA ATTTTCAGCG 
TTTGATGACA AAAGTGTATG TCCAAACTTA AATAATGGCG ATAAAATTTA AGTCATGGGT 
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GTTGAATATC AATAAAAGAT TAGCAAGTTA GTTCAAAATG GCTATGTATT GTTTTTTGAA 300 

ATACATAGGT TTGAAATCTA TATTAATTAT GCACCAAATA ATTTAATTAG ACTCTATCGA 360

AAATTTCCAA ATTTTGCTTA CTATCTTTCT AATTTTACTT TTATATTGTG 410

(2) INFORMATION FOR SEQ ID NO: 1083: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 586 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 




(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1083: 

CAGATATTCC CTTCTATTTA CAAGTTGGTA ACCCATATTT ATCAGACAGC GTAGATAATC 60 

ATACCGAAAA GTTGTTAGAA CGCTACGAGC AGTTGGTGGA TTTAGTTATG yAAAGTAATG 120

ATATGAATCA CGTTTATGTG TTACCACAAC TTCATACATT ACTTTGGAGC AATAAAAAAG 180 

GTGTATAAAG GATAACTTAT TTTACTTTTT GCAAGTCATA TCTAATGAAT AACAATATAT 240 

TAATTGATAT AATAGTTGAT TGAAGTATAA CGAACAGGAG TCCATCATGA TTGTATATAT 300

ATTAATTAAT ATTGCTATCG TCGTACTTAT AACTGGATTC AATTTATATA GGCATCAAAT 360 

GCAACATTTA TCATTAAGTG CAATGTTGTt ATCTATTACG ATaAACGCCT TTATAAATAC 420 

ATTCATAATT GATAAATATA ACTTTATTAC TCTGTGTACT ATAACGATGT TTATCATTTG 480

GACGATACTA CAGTTTTACA TTGATAAAAA ATTAAAACCT GTATACATTA CAGACCAAAA 540

ATTTATTGCT ATCATATTAA CGATTGTTGT CAGTTTAACG CAACGT 586

(2) INFORMATION FOR SEQ ID NO: 1084:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 638 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1084:

AACACAAGGT AGCGTCATTA GCATTTAGAT TTTATTGCCA AATTATATAA CGATTTATAC 60 

ATCAATAAGA AAACAATAGA TGATGTATCT GCAGTrTCTG AAGTGAAATA TGATATTAAA 120 


TCACAAATGA GTGATGATGA GATTAAGCGT TTAGAGGAAC AAGGACTTCA AGCTATTAAA 180 

GAAGGACAAT TTGCAGTACT TTTGATGGCA GGTGGTCAAG GAACAAGACT TGGTTACAAG 240 



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. TTAAAAACAT TGAATCATCA ATCTGGACAT ACGATTCAAT GGTATATTAT GACAAGTGAT 360 

ATCAATCATG AAGAAACTTT AGCTTATTTC GAAGCACATA GTTATTTTGG ATATGATCAA 420 

GAAGCAATCC ATTTCTTTAA ACAAGATAAT ATTGTGGCGC TTAGTGAAGA AGGGAAGTTA 480

ATTTTAAATC AACAGGGTCG TATTATGGAA ACGCCGAATG GTAATGGTGG CGTGTTTAAA 540

TCTTTAGACA AGGCAGGATA CTTAGAAGAA ATGTCTAATA ATGGCGTTAA GTCATATTTT 600

CTTGAATAAT CATCGACAAT GTTTTCAGTA ACATGTTG 638



(2) INFORMATION FOR SEQ ID NO: 1085: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 618 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1085:
ATTGCAATAT TATCCGCTTG GTTCAACACG TCATCTTTaT tAaTACcATT TATTGgAAaT 60
tCcACCcATT TCCcGGgCTt CTtACTTCCc TTTATAAAAC TGcCCATTTA ATGAATACTG 120
TTTAGCTTCA TCTATTCGTA CTTCCACTAG TTTACCAATC ATTTCTTTAG GCGCTTTGAA 180
ATTAACTAGC TTATTTTTAT CAGTGTAGCC AGCAAGAACC TGATCATCTT TTTTACTACT 240
TCCTTCACAA AGTACTGTTA CAGTTTGTCC TTCGTACTTA CTcATAGCTA TTTGTGAATA 300
ATGACCAACT TTTTTATTCA AACGTTGCAA TCGTTCCTTT TTGACATTTA AAGGTACATT 360
ATCTTTCATT TTAGCAGCAG GCGTACCATC ACGTTGTGAA TACAAGTACG TATATGCATG 420
TTCAAAACCA ACTTCATCAT ACAGAGTTAA AGTTTCTTCA AATTGTTCCT CTGATTCATT 480
TGGATACCCT ACAATAATAT CTGTAGTTAA TGCTACATTA GGAATTCTAT CTTTGATTCG 540
TTTTACTAAA TCCAAATAAC TTTCTCGTGT ATATTTTCTA CCCATTATnT TTAATACTGC 600
ATTATTTCCA GATTGAAC 618
(2) INFORMATION FOR SEQ ID NO: 1086: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 502 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 


(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1086: 
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70 






CAGCAATATC ATGATCAGTT TAATTCATCA AGAAAGTCTA TCTATACTTT TACAGAAACA 120 

GTTCAACCAA ATCGTGTTAT TTATGATTTT TATGATTATG ATCCGTATCA ATTAGCAGCA 180 

AATAATGCGA AAGCATTAAA GGATCATATT GAACAAAACT TTAACTTTAA AGTCCAAAGT 240 

ACCGGTGTTA kTTACTTTAG TGATGGTACA GTTAATATCA TTCAAGGTTC TGAAGAAAGA 300 

GATAAATATG TTGATACAGT TTCAACAAAA TCTTCGTTAC GACGTATTAT TAGCGAACCA 360 

nAGAGCTATC TAAACACCCT TTAAATAAAG AACAGGTTGA nCAAATCACA GCTATATTCA 420 

AGTAAAATCA AAGGCATAAT AATCCATTAC GCTCGATACG CACTATGGTT ATTATGCCTT 480 

TAAATTATTT CTATATATAA nT 502 
(2) INFORMATION FOR SEQ ID NO: 1087: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 429 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1087:

TTGCCTGATG GACGATGGCC ATCACAGACT GAGTTTAGAT TGTCTTTAAT GCAACAACTT 60 

GCTGTAAACC AAATTACGAG TGGTAATGAA AGAATAAGTT CAGTTAATGG GCCACCAGGT 120 

ACAGGTAAGA CTACTTTATT AAAAGATATA TTTGCTCATC TAGTAGTTGA mAGAGGTAAA 180

GAGTTAGCTA AACTAAATAA TCCTAAAGAT GCATTTGTCA ArACAAAAAT TCATGAAACG 240 

GATGATAAAT ACGTATACTT ACTAAAGGAn nTATTGCCAA ATATAAGATG GTAGTCGCAT 300

CTAGTAATAA TGGAGCTGTT GAAAATATAT CTAAAGATTT ACCGAAAATT GAAGAAATTA 360 

TAAGAnATCC CGAAAAATGT AAATTCCCTA AATATGAACA GAATTATGCA AATTTAGCAC 420 

ATGAATTAA 429 



(2) INFORMATION FOR SEQ ID NO: 1088:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 611 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1088:

TAAGACCTAA TATTTTGGnC CAATACTATG GTGCAGGTnA TTTATATATG ACACCAACAG 60 
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ATCCATTATT ACATGAGTTT GGTACGAAAA AGTATCCAGA TGAATATCGA TATGGTTTCT 180 

ATGCTAAGCC AACATTGAAT AGACTTAACG GGGGATTCTT TGGACAAGTC TTTACTGTTT 240 

ACTATAATGA TAAGTATGTA GTTGTACTTG CATTAAATGT AAAAGGAAAC AATGAAGTTC 300 

GAATCAAACA TATTTATAAT GATATTTTAA AACAAAATAA ACCTTACAAT ACGAAGGGTG 360 

TTATTGTTCA ATAATTAATA TAGAAGATAT AACATGTATA TGGCATTAAG GCATCGACCT 420 

TATCTGACCA GTATACGAGT TATATCTTCT TTTTTATAGT GGTAAAAAGT TTAAAGTATA 480

AGGTTGAAGA AGGATGAGTT TAAAAATATG TGTTAACTGA TAAAAGGGGA nATCATTTGG 540

TGAGTTGGCA TCAGnCTAAA nTGAATGAAG ACGAATACGT TGGTCCATGC GTGGTGTGAn 600

TGTTTTCTAA A 611 
(2) INFORMATION FOR SEQ ID NO: 1089: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 644 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1089:

CTGGATAATC GCTTTAATAT GATCAATTTG CTTTTCACTC ATATTAAGCT TATCTAGCCC 60 


TTTCATTTTA TTCATACCTG GAATCATTTT CATAATATCA TCCAGTGGTC CTAGATTTTT 120 

CACCyGwTCA AGTTGTTCTA AAAAATCATC TAAAGTAAAC GATGACTCAC GCATCTTTTT 180 

CTCTAAATCT TTTGCTTTTT CTTGATCCAC ATCTTGTTGC GCTTTTTCAA TTAAACTTAA 240 


CACATCACCC ATACCTAAAA TACGTGATGC CATACGTTCA GGATGGAATA GCTCTAAACC 300 
ATCTAACTTT TCACTCATAC CAACAAATTT AATTGGTTTT TGTGTCACCG AACGAATAGA 360 

TAAAGCTGcA CCACCACGTG tATCACCATC TAATTTAGTT AAGGtAACAC CTGTGACATC 420
AAGTTGATCG TCAAAAGATT CTGCAACATT GACAGCATCT TGACCCGTCA TTGAATCGAC 480

AACTAACATA ATTTCGTTTG GTTTAGCAAT GTCTTTTACT TCTTTTAATT CGTTCATCAA 540

TGCTTCATCG ATGTGTAATC GACCTGCTGG TATCAATGGA TTACAAAGTC TAAATGTTCT 600

TCTTTAGCAn GTTTTAATGC ATTAGTTACA ATTTGGTGTG GGCC 644 
(2) INFORMATION FOR SEQ ID NO: 1090: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 207 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1090: 
GATCCTATCG TTTTAGTGCA TGGTTTCAAT GGGTTTACAG ATGATATTAA TCCTTCAGTG 
TTAACTCATT ATTAGGGCGG TAATAAAATG AACATTCGCC AAGATTTAGA AAAAATGGTT 
ACAAAGCTAT GAAGCAAGTA TAAGTGCTTT TGGAAGTAAC TATGACCGCG CATTGAACTT 
TATTATTATA TCAAAGGCnG TCGnGTA 
(2) INFORMATION FOR SEQ ID NO: 1091: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1091: 
GACTCTAGAG GATCCCCGCT CAAATGCCAT TTGGTGATAT TCGTGAAGCT GCTGGCGAAA 
AAACATTTAA TCCAGTGGAA GATCGTTTCG TAATAAAAAA ATAATCAATA TATTGTTAGG 
AATGGGACAG AAATGATAAA GATTCATTAG TTATTCATTA TGTTGTAGTT CTTACACATT
AGCCGCTGCT AATnGCAACT TAAGAnTAGG TT 
(2) INFORMATION FOR SEQ ID NO: 1092: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1092: 
TGGACCTGCA ATTGGTTGCC AGCAGCAACT TTAGCTTCAT AGCTAGCTTT GTGAGCAAGT 
GGTAAACCTG GAACGATATC ACCAATTGCA TAGATATTGC TGATAGACGT ACGGCTTTGT 
TTATCAACnn CTAATAATCC ACGTCAGCGA ATTTAACACC TAATTCTTCT AGGCCTAATT 
CGTCGTGTTT GGACGACGAC CTACATTA 
(2) INFORMATION FOR SEQ ID NO: 1093: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 902 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1093 : 
CTTAAAAGTT GTnTCAATAA TGATTGTATT AGTTGCATGC TTATTATATA TGAATCATAG 
TATTGATTTA CCTACCCTTA TTATGATTTC AATTTTTTCA TTTGTGATAT TTGATAGTGT 
TGAAAATATT AATAGTGCTG CACACGTACT TGAAATGATA GATATGACGA TAGATGATAT 
TGmAAAGATA AAAAATGCTC CAGAACTGGA TGAGAATGGA AAAAATTTGA CGATTAAAAA 
TGAAAATATC GCTTTTCAAA ACGTGAACTT TTCATATGAT GATAAACAAG TGATAAAGAA 
TGTGAATTTT GAGATACCTA CACAAACATC AACAGCAATA ATTGGACCTT CAGGAAGTGG 
GAAATCTACA TTATGTCACT TACTCTTGCG CTTTTATGAT ATCGATGATG GAAATATTCG 
CATCGATGGT GTTGATATTA AAGATATGAC ATTAAGTACG TTAATGTCGA AAATTAGTGC 
AGTATTTCAA AAGGTGTATT TATTTAATGA TACGATTGAA AATAACATAT TGTTTGGCAA 
TCCAGGTGCA ACGAAAGAAG AAATTATTCG TGCCGCGAAG CAAGCATGTT GCCACGACTT 
CATCATGTCA TTACCTGAGG GATATCAAAC AATGCTAAAT GAAAAAGGTA GTAATTTATC 
TGGCGGAGAA AAGCAAAGGA TTTCTATTGC TAGGGCGATA TTAAAAGATG CACCAATAAT 
TATTTTAGAT GAAGCAACTG cAAGTATTGA CCCTGAAAAT GAACAGCTGA TTCAAACGGC 
AATTAATGAA TTAAGTAAAG GCAAAACAGT AATTACAATT GCACATAAAC TTGAAACTAT 
TAAAnATGCA GATCAGATTA TAGTGCTCAA TGllAGGTGAA ATAATTCAAA AAGGTAGTCC 
TG 

(2) INFORMATION FOR SEQ ID NO: 1094: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 632 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1094:
ACGCAAGGGA TATGATTAAA CACCATAATA TACAATCACC TTTTTTATAC ATGAAAGCAC 
TAATAGAATC GATTCATTTA AACATAAAAC ATGATTTTAA CCAGCAAGAT TTGATTGAAA 
TACCAATTGT GTATGGTTCG AAATATGGTC CGGATTTAGA ATCACTTTTA AAACATTACA 
AAATCAAGCT AGAAACTTTT ATTGAATTAC ATTCTAAGGC GCAATATTTT GTTTCGATGA 
TGGGATATTC ACCTGGGTTt CCTTATTTAA CTGGATTAAA TAAGAAATTG TATATTAATC 
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GCGGTATTGT AACTACGGAT ACAATTAATG ATTGGTTAGT TATTGGTTAT ACACCATTAT 420
CACTTTTTAA TCCGAAAGAA TCAGATTTCG CACGCTTAAA GTTAGGCGAT AATATTAAAT 480
TTAGACCTAT CAATGAAAAT GAATTAGAAG TAGGAGCGTT TAAAGATGTC AATCATAATT 540
GAAAAAAGTG GCTTATTCAG TAGCTTTCAG aCTTTGGCAG AAGGGGATAT GAACATGATG 600
GTGTAATTCC ATGTGGTGCA CTTGATACTT TA 632


(2) INFORMATION FOR SEQ ID NO: 1095:






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 212 base pairs

(B) TYPE: nucleic acid
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1095:








GCAACACGAT TGCTAAAAAA TGATATACTT ATTAATTGTT GCAACACTTG TTTATGTGAT 60
ATACTGATCA TGGCATTGTG GCGGATTGAT GACCATACAC ATGACGTGAC AATGATAAAT 120
CATTGCCACT ACTTGTTAAA ACAGCAATTT CTCAAGTGGA AGCTGGTGCn ATATATTnCG 180
CCAATAATAT GATGGATGGT TTGTTGCTGA AA 212


(2) INFORMATION FOR SEQ ID NO: 1096: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 466 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1096: 






AATGAACTGG GTATTnCTTT TGCCATGCTT GCCCACTGCC ATGCATTTGC TGAAGAATGT 60
TTTTGTAGAA CCGgCTTAAG AATTTGAAGT TTGTAGTAGT ATCTAAAAAG ATTACTTGGT 120
GATACTCAGG ATGAATTCCT TGTTcATTAT TTTCAGCTCC TTTGCCCTGA ACCATCTGGA 180
ACAGAGTTAT TTGTGAGTTT TTACCCAATA CTGTGTAATT ATAAAGGTAA TTAATTCAAA 240
ACGCAAGCCC CTATATAATG TTTATTAAAT TATAGGTCGA CCCGTTTTAG TACTTTCTTC 300
TGCAGACTTT TGTAGCTGCT TGAAGAAATC TtCATTATTC TTAGACCTTT TAAGTTTGCG 360
AATAAATCTT TCAGTAAAGT CAGTTGAGTC AGTGAATAGA TTTCTTAATT GCCATAATGT 420
GTCTAATTCA GATTTACTTA TCAACAATnC TnCTTTACGC GTTGAA 466
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 208 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1097: 
TTTTGATTCA ATAGGCGGTT CCGTGTTATC ACTGACAACT TTAGTTGTAG CTTCATCTTT 
ATGTATTTCT TCGTTAAATC CTTCAAGTTT TTAGTCGTGG GATTTTAACC TCAGGATGTT 
CCATCATGTC TTGACTATCA AGTCCTTnTA CACGTGTCTT TATGTGATGC TTGATTGCGT 
nCCCTTACTT TTTGAATAGG GGTAGTAA 
(2) INFORMATION FOR SEQ ID NO: 1098: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 571 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1098:
CCTTCAACGA AAATAGCATA TTGACCATAA ATATTATGAA TTGTTTGTTC AACATCTTCA 
GTTATTGGAT CTTGACTCAA AAATACATTA AATACTTCAA TACCAAATTT ACGTGACATT 
TCTACAGCTT CATACGTATC AATAATACCA TCTTGACTAT AATTAAATGC AGACGGTTCG
CCGTCTGAAA ATAnGATTAA AAATCGTTGA TGTTGATTTC GACGCATTAA TCGTTCACTT 
GCAACTCTAA TAGCAACACC ATCACGATTA TCATCTTGAG GTTCAAGTGC CATAATACGT 
GGGCCATCTT TTTCAAAGGT TGAGTAATCA TAGTTAATAA TTTCATTAAT GATATTTGGT 
TGTGCATGCT CGTCTGAATC AAAGGCATCC TCACTGAATG ATAAAATTTC ATGTTTAATG 
TTCAATGCTT TTAACGTCTC GTGGAATAAC ACTACACCTT TCTTCGTTTC AGCCATTTTA 
TCATGCATAC TTGCTGgACG CATCAATTAA CAATGTAAAT GTnCGCATCA AATGATTnAC 
TTAAATCTTG TTTTTTGTAA AACAATTTAT A 
(2) INFORMATION FOR SEQ ID NO: 1099: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 214 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1099:

TGCAAGTGAA CAAGTCATTA ACTTGTTTAA TACACGTGGG GCACAAGTCG AAGAAGTTTG 60 

GGTGAAGGGC CATGAAATTA CAGAAACTGG ATTAACGGCT GGTCAACAAA TACTTGGGAA 120 

ATAACAGTTC TATTAAGAAG CGGACAGATG GAAAAGATTT TACCTTTCAT CTGCCGCTTT 180 

TTGATTTGAA GGCCGGACCn AATTTACCAn GGAA 214 
(2) INFORMATION FOR SEQ ID NO: 1100: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1158 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1100:

CACCGTATTG TACAAAGAAT TTAAAGTTTT CGATTTCTTn TCTTAATTTA TCATCAGCTA 60

TCGGTTTGGT TGGGATAATT TTATTATGTT CCATTTTTAC TGGATATTCT TTTTCTTTAC 120

TACGTGAGTA TCCTnTTTTA TCTTCCCATA ATTCCCTCAC AATAAATTCA CCTTTTGTCG 180 

TTCTGGTATT GCGGTCTACA TACAGGACCA ATCCTCTGGA TTCCaTATTc TTTCCATTTG 240 

TTTCTATTGT CATTTTAGAG TGAATTATCC aAGTGCCTTT GTCGCCCTTT TCAAACTCTT 300 

GATCTCGAAA GCCTTCCTTA TCATAAAAGT CTTCAAGATT TTTAGTTGGA TACAGGCTCA 360 

ACGTTTTGTT GAAGTTTTCT TTAATTTGTT TATCTTTACT GCTTTCTTCC TTTTTCATTC 420 

CGCAAGCCCC TATAAAAACA GCTAATAATA CTAAATACAG CCAGAGCCTC AACCTTTTTG 480

AGTGAATCAT CATTTCACAT CCCCATTTTT ATTTTTGATG CAAAGTTAAG TtCAAATTCC 540

AATATTAATA ATATTAGAGG aAAtTTtATG ATTAAaTCCT CGTGGATTtT AATTGTTAGT 600

TGTATTTTTC ACATATCAAC ATCACAAATT ATTTAAAAAG CGCAAATATC TTTATAATTT 660

TTATTGGCCT AACCAACTAA TTAATTAAGA TAAATTGCGC TTATATTTAA AATAATAACA 720 

CTGAAACTCA ATGTATTTAC TTATTAAAAT TGATGTTTAA ATACAACTTT ACTAACATTC 780 

ATTTTTCGGT TTACATTAAT TkGTTAGATA ACGATATATA TCAtCTCTTA CAGCTTTATC 840 

CAGTGCTAAA TCCATCGTTA CTACGTTTGA ACCATTCGGC ATTTTATCTT GCTTTTCTGA 900 

ACCTTCTATA TATCCAGCAG TTCCTAAATA ATAAAAATAT ATACCATCAT CATCTTTTTT 960 

CTGTACAAAA ATATACATTT TAATTCCTTT TGCTCTATGG GACAAAATTT TCTGAACTTC 1020 

TTTTGATTCT AGCGTTCTAT TAGATTTAGT AAACCATTTT AATTCATCTT GACTTAAAAA 1080 

TGGCATTTCT TGGCTTTT

(2) INFORMATION FOR SEQ ID NO: 1101: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 353 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1101: 
CCAAATGCTA TAAATGCAAA AATnGCTAAT GCGATATTTT TATAGAAAAT ATCTGGTCCT 
AAATTTTGTA AATACGTGGT AAACGTTGCG TATGACAATA AGCCAATCAC TACAGCGTAA 
ATATGTGAAA TAATTAAACC GAACCGTCTC GCTCTTTCAA AAACAAGTGT TGCTAAAATA 
ATAATCAGTA ATCCAAATGA CAAGGGTTGT CGCCAACTTA ATGGTAAAAA CTGACCTAGA 
TAGCATCCTA TGCCAAATAT AATCCAATAA TACATAAAAT AAAGCCACAC TTTACTTAtT 
GATTGAGCAT GCTGGTTTTG ATGATAATAT GAATGATTTG TATTTTGCGA CAA 
(2) INFORMATION FOR SEQ ID NO: 1102: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 194 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1102:
CCCACACGTC TTCCACCGGA TAGGTTTCGA GCTCGGGCGA ACCGGGCAGG GCGGTGACCA 
CCCCGTGCGG TGTCTTCTTG GCCAGCTTGT TTGCGAGCTT GGCCAGCGTC TTCGTGGGGC 
CAATGCCCAC GCAACACGGG ATGCCCACCC ATTGGAGGAT CTGGGCACGG ACCTGGCGGG 
CCATCGCTTC GnTG 

(2) INFORMATION FOR SEQ ID NO: 1103:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1103:

TCGCTTTTGC TTTTGCATCA GGTATTGCTG CTATTTCAGC ATACTTTTTG ACTTTCAAAT 120

CTGGTGATCA TGTCATCTTA CCCGATGGAT GTATATGGGC GGTTACTTTT CGTCTCACTG 180

TGGCAAATTT TGGATCGATT TAATATTGAA TTTTACAACC 220

(2) INFORMATION FOR SEQ ID NO: 1104: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 235 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1104: 



GGACATTGTA TCGACATTTC AAAGATAAAA GCGATTTATG CTACTACGTC ATACAAAGAG 60 

ACCTTGATAT TTTTATCACT CATTTTAAGC AAATTAAAGA TGACTATCAT TCTAATTATG 120

AGGTTATGCA AGTGTCGTCA TCGCTTTTAA ATAAGTCATA ATAAAAATCA AATAATTCTT 180 

GATAAAATGC GCTTTGGTAA AAACGTAATT TATTGTTGCC TGCTTCAATA CATTG 235 



(2) INFORMATION FOR SEQ ID NO: 1105: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1105:

GCGGGGGCAG GTAAATCCAC TTTGACACAT CTTATTGCAG GTGTTTATCA GCCAACAATA 60 

GGTACTATAA GTACAAACCA GCGTGATTTT AATATTGGGA TACTTAGTCC AnAGCCCTAT 120 

ATTTCAGGCT CTATAAAAGA GGATATTACG ATGTTTAAAG nTATAGAAAA TAATACTAAT 180

GAAGAAGTGC TAGACGAAGT AGGGTTTATT AGACCAAGTG CCATCCTTC 229 



(2) INFORMATION FOR SEQ ID NO: 1106:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 596 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1106:
rAATTAAAAT CAAAATTTGA TGAAGTAAAT GTAATCGCGT CAACAAACGG TAGAAAGTTT
ATTAATGGTG AAATATTAAA GCAATTTTGC GATAACTATT ATGACGAATT TGaAGACCCT 
TTTTTAAATC ATGTAGATAT AGCAAATAAA CATGACAAGA TTATTATTTT ACCTGCGACT 
TCTAATACGA TTAATAAAAT TGCmAATGGT ATATGTGaCA ATTTATTATT AACTATTTGT 
CATACAGCTT TTGrAAAACT TTCTATATTT CCAAATATGA ATTTACGAAT GTGGGAAAAT 
CCAGTTACTC AAAATAATAT TCGATTATTA AAAGATTATG GTGTATCAAT ATATCCAGCA 
AATATTTCAG AAAGTTATGA ATTAGCGTCA AAAACATTTA AAAAGAATGT TGTCGCACCT 
GGAACCATAT AAAGTTCTGG gAATTCATTT GaGATTAGAA TATGGAAAAT CaTAAAGCGC 
GCkATTGATT AGTTTAATGA ATGATTGGAG TCnTTTGAAG ATnCGATTTA CAATGG 
(2) INFORMATION FOR SEQ ID NO: 1107: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 673 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1107:
ACTAGCTAAA AATAATATTC TATTCTTTTT AACATGGGCA GTTCCAGCGG CAATTAGTGG 
TATTTATATT AAATATATAA ACAAGGCTAC GGTAGAAAGA TTTTTTAAAT TAGTATTTTT 
CATATTTTCT GTTTCATTTA TTTTTGTAAT TTTAATACCA AAACTTACAG GTGAGATACC 
TAGCTATATC AATTTTGGAC TTATGAACTA TCAAAACGCT TCGTACCTTT CAGCATTTAC 
TGCCGGATTA GGCATTTATT TCATTATGAA AGGTTCAGTT AAACATAAGT GGATATATGT 
TCTATTTACA ATAATTGATA TCCCTATTGT GTTTATACCA GGAGGGCGTG GAGGTGCTAT 
TTTATTAATT CTTTACGGCT TATTTGCATT TATACTTATT ACGTTTAAAA GAGGAATACC 
TATCGCAGTA AAAAGCATTA TGTATATTTT TGCATTAAGC ATATCTAGTG TATTGATTTA 
CTTTCTTTTT ACAAAAGGTT CGAATACTAG AACATTTTCA TATCTACAAG GTGGAACACT
TAATTTAGAA GGTACTTCTG GgAAGAGGAC CGATTTATGA AAAAGGTATT TACTTtATTC 
AACAAAGTTC GTTATTAGGC TATGGGCCAT TTAACTATtA TAAACTAATC GGAAATATAC 
CACATAACAG CAT 

(2) INFORMATION FOR SEQ ID NO: 1108: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 506 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1108: 
TCACAGTGGC GGCACAAAAA GTAAGTGGAG ATTATTTTAA TTTAATTGAC CATAACGATG 
GCACAATGAG CTTTGCTGTT GCAGATGTCA TTGGAAAAGG TATACCAGCT GCTTTAGCAA 
TGAGTATGAT AAAGTTTGGC ATGGATTCTT ATGGACACTC ACAATTACCG AGTGATGGTT 
TAAAACGTTT AAATCGTGTT GTTGAAAAGA ATATTAATCA AAATATGTTC GTCACAATGT 
TTTATGGTTT ATATGAAGAA ATGAACCATT TATTGTATTG TAGTTCAGCT GGTCATGAGC
CTGGATATAT TTATCGCGCT GAAAAAGAAG AATTTGAAGA AATTTCAGTT AGAGGTAGAG 
TGTTAGGAAT CAGTTCACAA ACACGATATC AACAACAAGA AATTCCAATA TACCTTGATG 
ATTTAATTAT CATTTTAACG GATGGTGTGA CTGAAGCTAG AAAAGTGAAG GTACCTTTAT 
AGATAAACAA AAACTTTTAG AATATA
(2) INFORMATION FOR SEQ ID NO: 1109:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 552 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1109:
TGATATTGAA AAAGATGTCA TTGTTAAAGC ACAAGAAAGT AAAATTGCTC AAGTTATCAC 
GAATTTGTTA ACGAATGCAA TTAACTATTC TTATGAAGAT GGAGATATTA ACGTTCGTGT 
GTATCGAGAT GACTTTCGTG TCATTTTCGA AGTACAAGAT TTTGGTATAG GTATTAAATT 
AGAAGACCAA CAACGTATTT TTGAAAGATT TTATAGAGTA GATAAAGCAC GAAGTAGnAG 
ATTCTGGTGG GACAGGTCTT GGATTATCAA TTACAAAGCA TATAGTAGAs GCCCACCAAG 
GCAATATTGA AGTGAATAGT CAAGTTGGCA AAGGCTCAAC GTTCAAAGTT ATTCTAAAAG 
ATTATAAAGA ATAAnAATTA AAGTGGTAAC AGCGCGTGTA TTTTTCACGA AGTTGTTGCT 
GCTTTTTTAT TTCGTTCAAA TCTCTATTGG TATTAAATTA GAGTTAATGA AGTGGAATAA
ATTCGGTTGT CAATTTTGTC ATTTGTATAT GTCGATTTGT AAGTTATAAG TAATAGATTG 
TTGTGAATCA nG 

(2) INFORMATION FOR SEQ ID NO: 1110:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 244 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1110: 
AACAGATCAA CAATTGCAGA TCAATTACTC GTCATATTGA TCAATTATCT CTGACAACGA 
GTGATGATTT ATTACGTCAT TGATTGATCA TCACAAGATA AGTCGCTATT GATTTCTCAA 
TTTTACAACG AnATTAGGAA AGCTGnAGCA GATAATTGGC TAAGTTGGAC GATAAAGGTT 
ATCAATCGCC AATCGTTGCC ATTGAAGAAC TTTTGCTCAC TGGCGCACGC TTCAGATGAT 
ATTT 

(2) INFORMATION FOR SEQ ID NO: 1111:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 825 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1111:
AGTATTGAGT GTTTTGATTT AAGAGACTTG GAAAGTGAGT CAATAATCAA AATCTTACTC 
AATACTTTTA TTTTGGTACT CGCAATTTAG ATTTGTTAAC TAATTTTTTC GCTTATTATA 
AAACGAAACA ATCAAGTAAA CGATAAAGCC TACAAAGATA CCCAATAAAA TAGATAGTAC 
TGCCGTCACT ATTAATGGTA ATTTGAAAAA TATTTGTAGG AAAATmCCAA TGATAATTGC 
GATAATTACT GCAAtTaATG TGACTGTATT TTCATTTGAA ATGTTCATtT ATttTCACTC 
CTtAACAATA ACATTATATC ATGCTATAGC TTTCCAAAAT ATTGAAATAT GTAGATATGG 
CTATTGACGA TATTTCTTAA CTTTTATATG ATTAATCGGA ATGAAAAAAG AGAAGTAGGT 
GGCAATATGA AGTCAAATAA ATCGCTTGCT ATGATTGTGG TAGCCATCAT TATTGTAGGT 
GTATTAGCAT TTCAATTTAT GAATCATACG GGTCCTTTCA AAAAGGGGAC GAATCATGAA 
ACTGTACAAG ATTTAAATGG TAAAGATAAA GTACATGTTC AAAGAGTTGT GGATGGTGAT 
ACATTTATTG CAAATCAAAA TGGTAAAGAA ATTAAAGTTA GGCTTATAGG GGTTGATACG 
CCAGAAACGG TGAAACCGAA TACGCCTGTA CAACCATTTG GCAAAGAAGC ATCAAATTAT 
AGTAAGAAGA CATTAACAAA TCAAGATGTT TATTTAGAAT ATGATAAAGA AAACCAAnTC 
GCTATGGTAG ACCATTGGCG TATGTATGGA TAAGTAAAGA TCGTA

(2) INFORMATION FOR SEQ ID NO: 1112:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 621 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1112:

TCCCTATAAA TCCCCACAAA AAAAGGGTAA ATCCGAACGT CAAATTTCAT CTCTTTATGC 60
CAATCAACGA TCGATTTACC TTTTAAAATT TAAAATGTAT CTGTAATACT CTTAGCAATA 120
TTTGTATATG CTGCATCTTC CATAGGTGGA TTGTGTAATC CTGCACGCAT GTCACGATAG 180
TAACGTTGTA ATGGTCGATT CATTTCTAAG CTTTTAGCAC CAACTATTCT CATCGCTAAA 240
TCAATCACTT CAAGCCCTTG ATTCATTACC ATCACTTTAC TCGCACTTGT TGGATTTCTT 300
ATCTGACTGT CATCTTTATA TTGTTGATAC CCTTTTGCTG TACTCCATAA AAATTGTCTC 360
GCACATAACA ATAGCGTTTC CATTTTCCCT AAATTTTGTT GCACAGTTGG CAATGTTGCA 420
ATCGTTCCTT CAATACTATT AGGGCTATGT TGGATTGCAA AGTCTACAGC ATAATTTCTA 480
GCAGCTTGTG CTATACCTAA ATAACAGCTT GGTATATGCA ATATCCAACC ATTTGGCGCT 540
TTACTTTTCT CTGTTTCCAC TAAATGTTTT AATGGTACTT TTACATCATT TAATATTAAA 600
TCATGACTTT CTGTCGCTCG C 621


(2) INFORMATION FOR SEQ ID NO: 1113:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 619 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1113:








GTACCTTGTT TTTCAGTGTT AATTTCCCAA ATATAACCTG GTGTAATACC GTATTGTTTT 60
GAGAAGTACT TGAAGGCACC TTCACTTGTa ATCATGGCAC TGGAATGTCA 120
TTAAATTTGT CTTTACTGTC TTTACTGTCA TTATTTAATT TTTCCAATTG AGCAATGTAT 180
TTGTTACCTT GCTTTTCATA ATCTGCTTTA TGTTTTTTGT CGTTATCGAT AAATGTTTGT 240
TGAATTGTTT TTACGTATTT AATACCGTTA TCTAAACTTA ACCATGCGTG TGGATCTTGT 300
TTATCTTTGT TGCCTTCTTC ACCGTTTAAA TAGATAGGTT TAACATCTTT TGATACTGCG 360
ATAACTlTl-r TATCTTTTAA TGATTTACCA GCCTGTTCTA AGGCTTTTTC AAACCAACCG 420

ATATCyTTAG GTTtAACyTC aTAwTCaTGA GGATCTkGAC CAACAGGTAC AATACnATGA 540

ATATCGACGT TGTCTCCACC AACATTTTTA GCCATATCAG ATAGAATTGA ATTCGTCGnT 600

ACTACTTTnA ATTTGCCAT 619 
(2) INFORMATION FOR SEQ ID NO: 1114: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 549 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1114: 

GAAAACCGCA CGACAATGAA AAAAGATATC CGGTTAAAAT GGTCGATAAT AAAATCATTC 60 

CAACAAAAGA AATTAAAGAT GAAAAAATAA AAAAAGAAAT CGAAAACTTT AAGTTCTTTG 120

TTCAATATGG TGACTTTAAA AATTTGAAAA ATTATAAAGA CGGAGATATT TCATATAATC 180

CAGAGGTGCC GAGTTATTCG GCTAAATATC AATTAACTAA TGATGATTAT AATGTAAAAC 240

AATTACGCAA AAGATATGAT ATACCGACGA GTAAAGCTCC AAAGTTATTG TTAAAAGGTT 300

CAGGGAATTT AAAAGGCTCA TCAGTTGGAT ATAAAGATAT TGAATTTACG TTTGTAGAGA 360

AAAAAGAgGA AAATATATAC TTTAGTGaTA GCTTAGATTA TAAAAAAAGC GGAGATGTAT 420

AATCATGGCT CAATCAGAAT ATGAAATCAA TCCCGGAAAA AGAGAGTGAT GAAATGATAA 480

AACGTGTAAA TAAATTAGTG CTTGGTATTA GTCTTCTGTT TTTAGTCATT AGTATGCACT 540 

GCTGGTTGT 549 
(2) INFORMATION FOR SEQ ID NO: 1115: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 317 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1115:

TGTTATTGCA ACTATTAAAA ATATnAAAAA CAAAGACTTA ATTGATTATT CTTTATATAT 60 

TAAAAAAGGT TTAGTGGAGC CATTTATTGT ATTCTATAAT CAAGAAAATA TATTCTACAT 120 

TAGCAATGCT GTTATAGATA TCATCAGTGT ATCCAAAGAT ACTCTTGATT ATATAAAGTC 180 

AGAATTCAAT GATCACTGTG tTCGCCATAC TGATTTTATA GCACAGAGAA AAGCCGTAGT 240

TTATCATCTT CGATGAA 317
(2) INFORMATION FOR SEQ ID NO: 1116:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1116:






ACTATGACTG CAATGCCAAC AATGAATGTT ATTGCATAAA GCATATGCAT TTAATCAAGC 60
TTATGTTATT TAGCAAGGGC AAGCCACTTC TCGATTCATG GCTTGCCTTA TTTTATTGTA 120
TATTTTTAAT TATCGTTGCT GGGCCCTTGA TTGACAATCG TATTAAAATG GCCTTATTAA 180
GTCAACTTTG TCTATACGGT TTGGAATCTT CTACCCAATG TCTTATAAAA GACAATCCCG 240
CACCTGAAAC ATAACTCATG AAnATAAGAA nATGGTATAC CATTAATnTG AATCCATTTC 300
CAnTTTATAA CGTTGTAAAT GACACATAAT TAGAATCATA AAGTTTTTCG TGGGATATTG 360
AAACTTTACC CATACGAGAC ATCATGGATA AAAGCGAGGC 400
(2) INFORMATION FOR SEQ ID NO: 1117:


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 696 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1117:








TGCATATAAT GTTCAAACGG CAATGATTCC ACTGTATAAA CCTCTTGTCG TGGATGATCT 60
TTAATTTGTT GTTTTAATAA CTGAACTCTT TTTTCATGTT CATAATGAAC ATAAATAAAT 120
GCACTAAGAT ATATCACAGC TAAAACTAGT GATGCACCTT TAATAAAATT AACATTGATT 180
GACTTATATT TCCGAAATTC TTTTAATAAA ATAATTAAAA TGATTACATG TATCGTATAA 240
ACAATCAAAA AATTACCTGG TTCTATTGGA GTAACAATGA CTAGTGTCGA CGCCGAAACA 300
CATATTGCAA TAAGTAAGGA ATATAAAGTG ATTTGTGTTT TACGATCATT AATAGACAAA 360
TAAATACCTA CAAATATCGA AAACGCAAAG TAACCACATA CAATTACGTT CACAAAACCA 420
ACCAATCCAA TATCTGTATT TTTATTTAAT AAAAACTGAT TGTAAAATAG TAAATAATAA 480
AGCGGTAAAG TGATAAATCC TATCATAATG ATACGACGCT TTAATATTGT TAAATGTACA 540
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ATCTGACTGA AAATGACACC ATATGGAAGT GACGTTGATA TCATTTCTGC AAATTTTGAA 
AAGATACCTT GATTATTTGA AACTTGTTGA TATTCT 
(2) INFORMATION FOR SEQ ID NO: 1118: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 223 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1118: 
GCAGTGACAA TCAATAATTT GTAACTAGAA GATAATAAAG AGAACGCTCT ATAGAGACGA 
ATTGAAGGTT TGATTTTAAT GTCTGTTAGT AAGAATCATA TCAATGAGAT GCCTATAGTA 
CTCAGAnTAT ATTAAATTAA AACCGTCATT AATTGTTTTT CCGAAAACCA TTTGTAACCA 
TTTnAATGTA GTTGGACCTA CCTACGTTCT CCAATAATCC ATT 
(2) INFORMATION FOR SEQ ID NO: 1119: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 265 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1119: 
CTGCTTTACC TTTTCCAACT TTAGAAATTG CTACATAGGG GCCTTCTGCT TTACCGCCCC 
AATTCCAATT CCACATTTTC AAGAAATAAG GAGGCAAAAA TGTTCATATA TGAATTGGAA 
TTATTTGTTT TCTTATTAGG CCGAGATGCG CCGCGTGCGG CTGCTGGAGA TGGCGGACGC
GATGGATATG TTCTGCCAAG GGTTGGTTTG CGCATTCACA GTTCTCCGCA AGAATTGATT 
GGCTCCAATT CTTGGAGTGG TGAAT 
(2) INFORMATION FOR SEQ ID NO: 1120: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 329 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1120:

CATGACATTA CATGCTTATT TAGGTAGAGC GGGAACAGGT AAGTCTACGA AAATGTTGAC 120
CGAAATAAAA CAAAAaATGa AAGCAGATCC GCTTGGAGAT CCAATCATTT TAATTGCGCa 180
ACTCAAAGTA CATTTCAATT AGAAcAAGCC TTTGTCaATG ATCCGGAATT AAATGGTAGT 240
TTAAGAACAG AAGTGTTGCA TTTTGAACGA TTAAGTCATC GTATTTTcCA AGAAGTTGGT 300
AGTTATAGCG AACAAAAGTn ATCTAAAGC 329



(2) INFORMATION FOR SEQ ID NO: 1121: 

(i) SEQUENCE CHARACTERISTICS : 

(A) LENGTH: 252 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1121: 

ACACCTCTTT TTTAATATAT GTGAACTATA TATAGTGTAT AGAGAGGTTA CTTGTTACTC 60 

AATATAAACA AAAATCAACT TTGTCAAAAT AAATGTGACA AAATTAAATA AAGTGTCATC 120 

AATGTGACAG TATAGATCAT TTTGCAAAAA GTCAAAACAA AAAAATTGTT TTAGGGATTT 180 

TTCAAAATTT TCATTGTGGA AAATGATTTG nCAAAACAAA ACAACACnTT GTGAGCAATG 240 

AAGCTTCAAT GG 252 
(2) INFORMATION FOR SEQ ID NO: 1122:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1122: 



nAATAATAGG CGCCACCTAA TAAACCAGCT GGAATGCCTA TCATTGGTGT TGTGAATGAG 60
CTTAATACAA TAACAAGTAT TGTTAAAGCA ATGACGTTAT ACCAAGTTAC AGTCAAATTT 120
TTCAAATCCT CATATGATTG TTTTACTAAT TCTCTAAATT TCATGATTCA ATCTCTCCTT 180
TTCTTAAATC AATTAATAGT TATACTGCAT TGCTTAAAAT CAAnATATCT AATGAAGCAT 240
CTTTTAATCG TAAGTGTTAA ATGGTTTTCT TAATCAATTT GTTTGAATGT ATAAnAGCAA 300
TTAAAGCAAA AGTAATGTAA ATGAGGGTGT ATACAACAAT TACTACAATA ACCGGCATGA 360
AGGAAATATT ACCCATCAAT TTCATAAATG CGATTGCGGC 400

(2) INFORMATION FOR SEQ ID NO: 1123:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 270 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1123: 
AATACAACCT TTTATCATAG TGTAAATGTA TTCTACCAGT ATTGAGAAGT TTTCATATTA 
TTCAATACCT GAAATCGCCA TAGTAATATT ACTAAATGCA CACTGCATAT GTTGTTTAAC 
AACACAACTT TAAAAATATA TTCTAACTCT ATCTACGAAT GTACTTAAAT ATCATAACAA 
TCTATTCCAA ATCnAATTAC ATTATTAGCT ACCTTAAAAA CCAAACCGAG GCCTTAGGCC 
TCGGTTTAAA TATATATAAC GnGCGACATG 
(2) INFORMATION FOR SEQ ID NO: 1124: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1124: 
GGTCTTCTGT ATCAAATTAT ATTATGAGAG TTAGACTCAG TATTAATGAA ATCGTTGATA 
AAGATCAAGT GCTTGCATCA CAAATAGGTG GATTACTATG ATGTTTAATC AAATTAATAA 
TAAAAATGAA TTAGAAGAAT CATATGAATC TGAGAAAAAA CGTATAGAGA ATGAACTGCA 
AAATTTAAAT GAACTTAGGC ATAGAACTCG AAAAGAAAAT GAACGTAGTT ATGATGTTTT 
TCAATATTTG AAGCACGAAA TGAATTATAG TGAAGATGCC CAAAGGAAAA TGACGAGAAA 
TATAGAAGCG TATGAGCAAG AAATCAATGA GATAATTAGA AAGCAAGAAT GGAAATTAGG 
AAGAATATAA AGAAGACTTA AAAAAGTCTT ATGAAAAGCA 
(2) INFORMATION FOR SEQ ID NO: 1125: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 377 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1125:

AAATATAyTC GATCATTGAT AATATGTGCT TTACTATTCC AAATAACTTT CAAATTTTGA 120

TAATATTCAT CAATACTCTC AATATAACTA ACCAGTTTAT ACAACCATTC GCCATCGACT 180

TTCACATATT TTTGAATACT ATTTCCAAyT TtAAGACGTG TTGGTTCATT TACAAAATGC 240

CCCAAATTAA TACCACTTAA TAATCCAAAT GGAGTTGGAC GTGTTGACAT TCTAATTAAA 300

TATTTTAATA AACTTTCTTT AGCATCTCTA ACCTTCTTGT TATCGCCATC AAAATTTATT 360

nCnTGTAAAG TACAATA 377

(2) INFORMATION FOR SEQ ID NO: 1126: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 227 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1126:

GTTACATTAC TGTTCATGTC CACAACGTTT GTCAGCGATT ATCATCATTG TGTTATATCT 60

TATTTATAGC AAGTGCATGG TCTGGATAAG TAAAGGTATT CATACTTAAG TAACTTGAAC 120

ATTGGTTTAG GACTATTTTA ATGGTnCTGC TTnAATTGTT GGCCAACTGG TTCTTATTTT 180

AAATATGTAA CTAGCTCTAC GGGTAGTTAC TAAACACATC CTGGTTT 227

(2) INFORMATION FOR SEQ ID NO: 1127: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 717 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1127: 

CCAATGATAC CGTTATAGCA TGTTAATGTT TTTGTACGAA TATAATCTAA AACTTCGAGT 60 

TCATCACTTA ATTTCCACGT TTCTGCTTTG TTTAACCACA TTAAAGGAGT ATGAATGnCA 120 

AAATCTTTGT CCATAGCTAG GCTTAATGTT ACGTTCATTG ATTTTATAAA ACTATCGCGA 180 

CAGTCTGGGT AGCCTGAAAA GTCTGTtTCA CATACGCCTG TAATAATATG CTTAGCCCCA 240 

ATTTGATAAG CTAGAGCGCC TGCAAACGAC AAGAAAAGTA AATTTCTAGC TGGAACAAAT 300 

GTATTAGGTA TACCATCTTC ATTATTAGTA ATTTCCATAT CATGTTGTGT TAATGCGTTT 360 

GGAGTAAGTT GTGATAATAA TGACATATCT AAAACGTGAT GTTTCATTCC TTGATCTTGT 420

AGTTCAACTT CTTTGAAATG TTTTTTTGCA TAAAAGAGAC ATGTTGTACT GTCTTGACCA 540

CCACTAAAGA CAACGATGGC TTTTTCATTA TTTAATACAC TTTCCATTTT GTAATTGCTC 600

CTATCATTAA TAATATTAAT AAAGAGGTTA ATGGCATTGA TAAGCCCGTT TTTAATTTAT 660

AAAATAAAAA AAGCCnATCT CCATAAAAGA TAGACGAnAG AAATGGGTTG CTCCTAT 717




(2) INFORMATION FOR SEQ ID NO: 1128:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 595 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1128:






GATTATTGCA TTGACAACTT ATATCTATCT AACGTTAGTA GCATTTAATT TCATTTCCGG 60
GGTTCTAACT TTACCAGGAT TAGCGGCGTT GGTATTAGGT GTAGGTATGG CTGTAGATGC 120
CAATATTATC ATGTATGAGC GTATTAAGGA TGAACTTCGA ATAGGTAGAA CGATAAAGCA 180
AGCCTTTTCT AAAGCAAACA AAAGTTCATT CCTAACAATT TTTGATTCTA ACTTAACAAC 240
AGTTATCGCC GCAGCAGTAT TATTCTTCTT CGGTGAAAGT TCAGTTAAAG GTTTCGCGAC 300
AATGTTATTA TTAGGTATTC TAATGATCTT TGTTACAGCC GTGTTCTTAT CAAGATTCTT 360
ATTATCATTA CTTGTTTCAT CAAATATATT CAAAAATCAA TTTTGGTTAT nTGGTGTTAA 420
AAAGAATAAA CGACATGATA TTAATGAAGG TGTAGATGTT CATGACCTTA AAACTTCATT 480
CGAGAAATGG AATTTTGTTA AATTAGCTAA GCCATTAATT GGAGTAAGTA TTTTAATTGT 540
AGTGTCGGTT AGTTATTCnT TATATCTCAA GTAAACTTAG GAnTGAnTCT CAAGT 595



(2) INFORMATION FOR SEQ ID NO: 1129: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 371 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1129: 

ATTTTAACTT TCTAAGAACA AGTTGTCAAG ATTTTTTCTT AACATATCAT GTTCTGTTAC 60 

AATATTAATA ATGAATACCG AAAGGACGAA TTGCAAATGA AAAGCGTTGG CTTAATCACA 120 

GAATATAATC CCTTTCATAA TGGGCATCAA TATCATATTA ATCAATCTAA AAAACTTACA 180

ATCTATAATA AGTTACTCGT GCAAAATGGC TTTCCACCGC TGTTTAGTTT CGGACTACCA 300
GCAACTGCAG TTATCACTGG CGnCATTTGC GACTGCCGTT AAGCGCGGTT TTGGTGCGTA 360 
CATGCnTGGn G 371 
(2) INFORMATION FOR SEQ ID NO: 1130: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 200 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1130: 

ATGATGATGA GATTATGACT TTGCGAAAGA TATCGGTGCG CCTTATGAAT TTAAAACAnA 60 

nTAAAGACCA TGGTCGTTAC CGGTAGTTAA CTTTGCAGCT GGTGGCGTTG CGACTCCTCC 120

AAGATGCTGC TTTAATGATG GAATTAGGTG CTGACGGTGT ATCCGTTGGA TCCGGGAATT 180

TTAAAACCAG AAGATCCCGG 200 
(2) INFORMATION FOR SEQ ID NO: 1131: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1131:

ATGAGCTAGT CGATTAAAGT TAATATTTGA AAATGAAGCA GGGCATTAAA TGCAATAAAT 60 

TAAATAAGTT GTCATTAAAG CATTAATAAT AGAAATGATT TTAACAGGAA AAAAGTGATG 120 

AATATTTGGA AAAGATATAT ATCGTGCACT GTCCTGAGAG ATTAGATTTG GGAAACCAAT 180 

TTATCCnTGA ATCGAGATGT nGC 203 
(2) INFORMATION FOR SEQ ID NO: 1132: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1132:



ATCTAACAGT ATTGAAATCT TTGTTAGCGT CAATCGACCA TGTATTTTTA AGTATTGTCT 120 

CTAACAACTT AGCACGACGT CTTAACCCTT TAGTAAACAA TCCTTCTTTA GllTTTAGTAC 180 

GT 182 
(2) INFORMATION FOR SEQ ID NO: 1133: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 485 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1133:
CAAATTTAGC CAGGnATACA AATACATATA ATAAAACTGT TTGTAAGCTT ACGTTGACAA 60
TCTGGCGTAA TTGGAAAACT AATGAATTTT CTCCAAGTAG GTTTTACCCT GTAAACmAAA 120
TAACAATTCm AATAATATGr AATCACAAAg CGACTAGAAA TCCGGTAATA TGACTAATCA 180
TATATTCAAT GTGTAaTAAT TTTAACAGCA ATAAATAGAC AACATAATAA TTTAACGTAT 240

TAATGCCGCC AACAATGATA AATTTTAAAA TTTCAGCATG CGTTTGTGTT AGTTTCATAT 300 
GTGTACTCCT CAACATCAnA ATATATGCAT AACTACGTTC TCGAACATAC TCGAATATGC 360 
GAGCCAATCC GCTTCACTTC AAATATGCTT ATTTCAATCT TTATACCCTT TCACAGCAAT 420 
TTAGTCTCTT TnCCCTCATC CTTATAGGCC nTATATGTAA CCGATTATCG GGTGGACTCA 480 
TTGGC 485 
(2) INFORMATION FOR SEQ ID NO: 1134:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1134: 

CTTGTTTTAT CGACAGATGC TGAACACGCA GTCaAAATAA ATAAATTACA TAATAAAATT 60 

CCAATTAATC TTTTCATAAT ATCATCCTTT TGTTTTTCAA TTGATATTCA TTTTCAATTA 120

TAAAATATTA rAAGAATTAG TCAACGCCTG TGAGTAATAC ACATCAGTAA CATTTCTATT 180 

TTCATTTATG ATATTATCTA ATTATTAATT TAT 213 
(2) INFORMATION FOR SEQ ID NO: 1135:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 526 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1135:

TGTGACTCCA TGTATTCATG TATCATTGTA CATTTGTGAA TTGGTTGAAA ACTGATAGTG 60
CCCGTTTAAT TACAGGTACT GAGGATGCTT TCAGGACATG GATTTGTTCG AAAATATGAC 120
AATATTTCAC TATGGCTCTT CACTTCTCTA GGGAAATTGC TATCTTGATT AATCCAATCA 180
ACCAATTCAC CTAATGGTGT GTCATCACCA AGAAAACTTT GCATAAATTC GTAAAAGCTC 240
AATACCTCAC CTCGATTaAT ATACTTGAGC CATTaATATA ATACCATAAA AGAATGTGTA 300
TTTTaCATAA AACTTTTTaT AAAGCAATTT TATAAATCTA AATTTATTGA ATGTTTAAGA 360
AGTTGTGTAT AATGAAATAA ACGTGTTGAA ATAGTTAGGA TGATGTTAGT GGATATCAAA 420
CATATGAAAT ATTnTATTGA AGTCGTTAAG CAnGGAGGCA TGACTAATGC TTCCAAATCA 480
TTATATATTG CACAACCTAC AATTAGGAAA GCnATTAAAG ATATTG 526



(2) INFORMATION FOR SEQ ID NO: 1136: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 321 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1136: 



AGCAAAAGCA ATGATGACAA AAGAACAAGC TATGTTAGCA GATGGAAGTA TTAAACAAGA 60
TCAATATGAT AAACAACTGT TATCGAnAAT CGGAAAATCA CAATTAGATG AATTGTCTTC 120
TAAAGATTTA CAAGTTTTAG CTATTTTTCC GAGAGATGAA TGCAGGAACA GTTTTAGATC 180
CACAAATGAT AAAAnATGGA AGATGTTCAG TGCAAAAAGA GTATGGCAGC AGTTTTCTCC 240
AGCAACTTTT CCAnATTTAC CCAGGTGTTT AACACGTCCT ATTGGATTTG GGCTTAGGAC 300
AATTATTCCA TATGGGCGGA T 321



(2) INFORMATION FOR SEQ ID NO: 1137: 



(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 330 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1137:

ATAAATCCTT ATAAAATCAT GAGTTGGACG ACTTATCCAC ACAATTATAC TTTTTCAGTT 60

CTCTATATGT CAAATACCTG TGAATATGTT GCTAAAAATA GTATAACTTT GTGTGTAATT 120

TCTAATTATC CACAATTCTG AAAACTATAA ATGTGCATAA GTGGATAmCT TTcCTTCTAT 180

AGAGTATCTG TTAGTGAGTG TATCAAAaCA GTTTGGGAAa TAATTtATAA AGTnTGTATA 240

AGAnCTGTAT AAGGTGTTCG AACATTGTAA ACACTCATGC TTCGGACCAA ACTCATGGTG 300

ATGTTATGAA ATTTGATTGC TCGCATCGnG 330

(2) INFORMATION FOR SEQ ID NO: 1138:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 459 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1138:

CTTGATAAGG GAAGATTTGC CAGCACCATT CGGGCCCATG ATACCAATTA TTTCGCCGCG 60

TACTGGTATC GATAAGGAAA TGTTTTTAAG TACATGCTTA TTACCTAAAA ACAGATTTAA 120

ATCTTTTGTT TCTAACAAAC GTTTATACCT CCTAATTAAA AGTTTAGGCT AACCTAATTA 180

ATTGTATAAT AAACTGAGAA TATTTATCAT GTCAAGTAAA TTCGTGATAT AATATAGACA 240

ATGTATGTGA GGTGAAAGTA TGTTAACTGA AGAAAAAGAG GACTATTTAA AGGCAATCCT 300

TACGAATAAT GGCGATAAAA ACTTTGTGAC AAATAAAATC TTATCTCAAT TTTTAAATAT 360

TAAGCCTCCA TCTGTAAGTG AAATGGTAGG ACGTCTTGAA AAAGCAGGCT aTGTTGAAAC 420

AAAACCATAC AAAGGTGTTA GATTAACAGA GGATGGTTT 459
(2) INFORMATION FOR SEQ ID NO: 1139:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1139:

TAAATACTCT CGCCGGTTAT TTCGCTTCCT CTATACGGAT TGGCGAAAGA TACGGTGCAT 60
CGGTTTCAAC TAGCAAACGC TCCATTGACA CATGCTTAGC AACTTCTTTA GGCTGTTTAG 120

(2) INFORMATION FOR SEQ ID NO: 1140:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 214 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1140: 
ACCTGAATGC CCATCAATGA CAGACTGGAT AAAGAAAATA TGGTACATAT aCACTATGGA 
ATACTATGCA GCCAGAAAAA AATGAGGTCA TATCTTTTGT GGGAcATAGA TGCAGCTGGA 
GGCCATTATA TTTAGTAAAC TAATGCAGGA ACAGAAAAAC AAATACTGCA TGTTCTCACA 
GCAGGAGCTG AATGATGACA ATTCATGAAC ATAA 
(2) INFORMATION FOR SEQ ID NO: 1141: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1141: 
GTTTCTAGTT GTTTATAGTT ATCGCATCTC TTGATGTAAA ATCTAAGATG TTTATGCATT 
TCTAGATGTT AATTCTTCGC TGCTTAAGAA TTATCTTGAT GGTAAATCTA GTTGTGCCAT 
GCATTTTCTC ATGATGAATC TAAGGTGTTA ATTCATTGAT GTGTGCTGTT TCTTGnnGCT 
TCATTATCTT GATGGTGAAT TTCGTTGTCT AATGCACTTT TTCAAATGA 
(2) INFORMATION FOR SEQ ID NO: 1142: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 311 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1142:
TTATGAATCG TAAACACTAT TATGAAATTG CGAGAGTAGT ATGGACAATT AGAATGATGG 
TTTGAGATnA ATGAGTGTTA TTAACCAGTG CGTCATTTTC AAAATTAGAT GCCATTTGCT 
GTTTATAATA AGAAGATTTG ATAATTAACT ATGATAAGAA AAAATAATAA TCCCCTAAAT
GGTAAATGTG TGAATAGGAA TATCGTTaAA CTAGTTGTGT TCATGCTAAT TTTAGTTGTA 300
GCAGTAGCGG G 311
(2) INFORMATION FOR SEQ ID NO: 1143:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 403 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1143:

GATCATGCTG CTTATGAATT ATTAAAAGAG CAATTAGAAG ATGTGCTTGA TACATTAACT 60
GATAGAGrAG AAAATGTATT ACGATTAAGA TTTGGTCTTG ATGACGGCAG AACAAGAACA 120
CTTGAAGAAG TTGGTAAAGT TTTCGGTGTT ACACGTGAAC GTATTCGACA AATTGAAGCA 180
AAAGCACTTA GAAAATTAAG ACATCCAAGT CGTAGTAAAC GTTTGAAAGA CTTTATGGAT 240
TAATTTATAT CAATTTTAAT GATTGATACC AAGACATGAA TAAGCGCTTA TTTATTTTTA 300
GATTAAGATT TAACTTGAGT GAGGTTGGTA CATTGAATAA TGTTCAACCT CTTTTTTCAT 360
AAAGGAGAAT AAAATGATTT CGTTAAATAA CCGATTAACG ACn 403



(2) INFORMATION FOR SEQ ID NO: 1144: 


(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 612 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1144: 

nTTTTTATTG ACGAATnGAT TTGGATGCCA GTGGTTTGGT CCTAAATCTT GAATGGTAAT 60
CGGTATTTTG AAGTTGTCAT GTGCTACACG GTAACCGATA CCnATAAGTA CGAGTGCGAT 120
GACAATGGTT GTTACGAGCA AGATGTATTG TAACCATTGC TTGAACACAA CAAGTTGTAT 180
ATAAGGCTTC ATTGACGATA CCTCCAAACC AATACAGCTA AATTAATTAT CAAAAGTGCG 240
ATGAAsCTAA GATAGAAACT AGGGTGCAGT TCTAAAATGT AGTTGTTTAA AATAATTTCT 300
AACAATTGAT TTGTTACAAC TGCGAACGGT TGAATATTGA AAACACCATT TGCTATATGT 360
TGTAAAAAAA TCGTAGGTAT TGTTAAACCA GATAACACCA GGATGACAAT AGCTAATATG 420
ACTTTACTAA TACTATTCAA CAAGCCTGTT GTTAAAAGTT CGATGAGTAA TAACCACAGT 480
AAGATATTCG GAATACTGAA CACAATCCAA ACTACACCAA CGATACTCCA TAACATAGTA 600
TAAAACCATG Tn 612
(2) INFORMATION FOR SEQ ID NO: 1145:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 223 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1145:

GAATTAATTA CGCAATATCC GCATCTTAAA TCAGTAATCG AAAAATATGC TTTAAAATAA 60
AAGTGATCAA TGAAGTGGTT TGAAGGTTGT nAATAAACCT TTGAGTCACT TCCATTTTTA 120
AAnGTATCCT GGATGGAACC AGAATAGATT TGAAGCTTCA GCGGTTTTTA ATGAAAATAG 180
CCATTAAATG ATTTGGAAAC GGTAAGAGGG GGTATTTAAA ATG 223



(2) INFORMATION FOR SEQ ID NO: 1146: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 598 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1146:

CGCTACGGTT GTCACGTAGA GATTAAAGAC ATTCGTTTTA GCAGTCTTTT TTATTGAAGT 60
AGACAATGCA AGATTAAATG GTAGTGTCAT TTCATTTTAT TCAACTGCTA GTAATACAAT 120
ACTCAGTGTT ATTTCTGAAG ACTGGTTGAT AATGAGACAT GCATATTTAC AAAACTGTGT 180
ATATTGTGTA TATTGTATAT ATACAGAAGT TGATAGGGGG ACGTTGATGA AAATAATTTT 240
AAAAAACAAT AGTGATTTTC CGATTTATGA ACAGATTAAG CAACAAGTAA AACAAAATAT 300
TTTAAAGGGA CATGTTGCTC CTGGAGAGCA TTTGCCGTCA ATGAGAGAAC TTGCCAAAGA 360
TCTTCAAGTA AGTTTGATTA CTACCAAACG TGCTTATGAA GATTTAGAGA AAGACGGTTT 420
TGTTACAACA ATTAGAGGAA AAGGGACCTT TGTTAAGGAG CAAGATAGTT CTATTTTAAA 480
AGAGAAACAA TTTTTTACCA TGAAAATTTG GGTTAAAGGA ATTGGGTTAA TTAAGCnCAA 540
GCCCATCCGA ATTGCCACTT GAGGGAnTTC CAGGATATTT nACCGTCATT TATTGAGG 598
(2) INFORMATION FOR SEQ ID NO: 1147: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1147:
CAAATCCAAG TGCAACAGCT TTATCAACCG CTAAAGCTAA ATCCGTATCA GCTTTTTCAG
CTTGAACTGG TTTGATTTGT AACTGTTCTG TTAGAAGTTG GCGTTCTTCT TTACTGACTG
AATCAAAGTC TCCCACTGAG AAAAAAGGGA TAATTTGATG CTTCAATAAA ATCAAAGCAC
CTCTATCAAC GCCGCCCCAT TTACCTTCAT TACTTTTGGC CCAAATATCT TGCGGCAAGT
GTCGATCAGA ACATAATAAA TTTATATGCA TATACACTCA ACCTTTCAAT GCTTGTGTGA
CTTTTTAAAA TCCCCTGGTT AAAGGAAAAT GAACnGGTTA CCAGCATTGT AGGCACCATT
TCAACACCAA CTTCGGCGGA TCnGGATTAC GGCTCCATCC
(2) INFORMATION FOR SEQ ID NO: 1148: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 352 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1148:
TTTTATGGGT CGTATTTTTT AAAGAACAGT CATCTCAATC AAAAGAATAG AATTATGATT 
AAAATTGATG TrGTGTAATT AAAGTAAGAA ATrrkAAATT TCCTAGATAC TAGATAAATC 
GTTCAATATT TTATTAATAA GGAGGCCAAT GAyTGTGAAT TATCTTAAAC ATAAGTTTTA 
TAACTTGTTA ACTACAATGA TTGTTCTCTT TATTTTTGTA CTTTCTGGTG CGATTTTTTT 
AACATTTCTA GGGTTTGGTT TATATGGATT AAGTAGAATA CTTATTTATT TTAGGTTGGG 
TGACTTTACA TATAATAGAA GTATGTACGA TAACCTATnA TATTATGGCA GT 
(2) INFORMATION FOR SEQ ID NO: 1149: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 916 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1149:
GGCAAGTGTC GATCAGAACA TAATAAATTT ATATGCATAT ACACTCAACC TTTCAATGCT
TGTGTGACTT TTTTATAATC CTCTTGTTTA AAGAAAAATG AACCTGTTAC TAGCATTGTA
GCACCATTTT CAACACAAAC TTTCGCTGTA TCGGTATTTA CGCCTCCATC AACTTCAATA
TCAAAGTTTA ATTGACGTTC CATTTTAATA GCATTAAGAC CCGCTATTTT TTCTACGCAT
TGATCAATAA ATGATTGACC ACCAAACCCT GGGkTAACTG TCATCACTAG TACATAATCA
ACAATGTCTA AAATAGGkTC AATTTGTGAT ATkGGkGTAC CAGGATTAAT TACTACACCA
GCTTTTTTAT CTAAATGTTT AATCATTTGA ATAGCACGAT GAATATGAGG CGTTGATTCG
ACATGAATTG AAATCATATC GGCACCATGT TCTGCAAATG ATGCAATATA CTTTTCTGGA
TTTTCAATCA TCAAATGTAC GTCTATAGGT AATGTTGTGC CTTTTCTTAC TGCATCTAAT
ATTGGTAAAC CAATAGATAT ATTAGGCACA AATTGACCAT CCATAACATC AAAATGAACT
CCGTCGACGC CTGCTTCTTC AAGTCGTTTT AATTCATGTT GTAAATCCAA AAAATCAACA
GATAATAATG ATGGATATAG TTTTGTCATT TAATATCTAA CCTTTCTATT TGAAATTTCA
TTAAATAGTT GTAAATAATG GTCGTATCTA AATTGCGCAA TATTCCCTAT CTCTAATTGA
TGCnTAACCA TTACCATTAG GGTCCTTGGn AAGGnTACCA ATCCCAAACC TTACCATGGT
TCCCCCATAA CGGTTT







(2) INFORMATION FOR SEQ ID NO: 1150: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1150: 
AATTGGTTGT GGCTTCATAA TGTGCAGTTG CTAGACTGAC AACAACATCG TTTTCTAAGC 
CTACGATAAA GACATAATCG TTTTCGGTTT TGTTGTTTTC TAATGAGCGT TTGATGATAC 
GTTTATCTTC AGTTAAAGAT AAATTTAACT TAGTATCGTA AAGTTTAAGT GCTTCGTTGT 
AATGTGGATC TTTGACAGAT TGAATGGTTT TAAATTCCAT AAGAACACCT CCCCAATTTA 
AATAATATTA TAGCATAATC GCCTGCTGTA AAAGACTGTT CATAAACTTT TAAATGGTAT 
AAAAaACTGT ACTATCTTAA ATTAGACAGT ACAGTAATCT CATTTTGAAT TCAGTGTGAT 
AACTAAGCtT TGGGACCTTT AGATGCnTTC ACAAAATGTG 
(2) INFORMATION FOR SEQ ID NO: 1151:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 156 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1151: 
CACCGTAGTT GCTACCGAAT ACCACCATGT CGCCAGGTTG TGCTAAGAAG TCCGGTGTAT 
TTTGGTATAC AGTTAGCTAA TCCGTCGAAG TTGTTAGCGA CGGnATATCT TTTGACCTAA 
ACCTTTTAGA GTAATCCAAA CAAAACTTTC CAACCA 
(2) INFORMATION FOR SEQ ID NO: 1152: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1152: 
TCAATATATT TATTTTCTAA TGTTAGCATT AGACTGTCAA TTAACTTGCT AAGTTGTTCT 
TTTTTAGTAA TTAAATAACG CATATAATCA TCTAAATTTT TCATTTTTTC ATCTAGCATT 
AAAAGGTAGT GCTTGATATT ATTTCTACTG TTGCTCCAAA AACTGnTCAA ATATACGTCA 
CATAGTCATA CTACACCTTT ATATAGTTTA AATACCAATn GCATGACCTC GTG 
(2) INFORMATION FOR SEQ ID NO: 1153: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 339 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1153: 
TATATTAnCT ATCGTTTAGT GAATCGTTTT TTTAAAGTAA AGAGAATTTC ATATATTACT 
GGTACAACTA CAAGTGTTAA TAAAGTTGAT GAAATTmAAC CACCAATAAC TGTAGCAGCT 
AATCCTTTCG AAATAAGAAT CGnGCTATCT TGACCAAATA ACAnAGGAAC TAATGCACCA 
ATTGTTGCAA TCGCCGTCAT TAATATCGGT CTAATTCTAG TACCGCCTGC TTCGATTAAT 
GCTTCTTTCA TCTCCATGCC CTGTTGCTCA TTATTAATAA CACGGTCTAT TAACACAATG 
GCATTTGTTA CTACGATnCC AATTCAACAT TAGCATACC
(2) INFORMATION FOR SEQ ID NO: 1154:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 817 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1154: 



GTCACCCGAG GAGATATnCA ACTATTACCA TCAAGATGnG AAAAATGGTg CCTTGGGCAA 60
TTGGTTTTTT CATTTTAATC TTAATTATTA TATTGTTCTT TTTACTAAGA AACTTCAATT 120
CACCTGAAGC GCAAACTAAA ATATTAGTGA ATGCGATTGA GAATAATGAT AAACAAAAAG 180
TAGCAACATT ATTAAGTACT AAAGATAATA AAGTAGATTC TGAAGAAGCA AAAGTATACA 240
TTAACTATAT CAAAGATGAA GTTGGGCTTA AGCAATTTGT CAGCGACCTT AAAAATACGG 300
TACATAAATT GAATAAGAGT AAGACCAGCG TAgCTTCTTA TATTCAAACC AGATCTGGTC 360
AAAATATATT ACGTGTAAGT AAAAATGGCA CACGTTATAT CTTTTTCGAT AATATGAGCT 420
TTACTGCACC TACCAAGCAA CCAATTGTTA AACCGAAAGA AAAAACAAAA TATGAGTTTA 480
AATCTGGTGG TAAGAAAAAG ATGGTTATAG CTGAAGCAAA TAAAGTGACG CCAATAGGTA 540
ATTTTATACC GGGGACATAT AGAATTCCAG CTATGAAATC AACTGAGAAC GGTGATTTTG 600
CAGGCCATTT AAAATTTGAT TTTAGACAAA GTAATTCTGA AACGGTAGAT GTTACTGAAG 660
ATTTTGAAGA aGCAAATATA TCTGTAACTT TAAAAGGCGA TACAAAATTA AATGaTAGTT 720
CTAAAAAAGT AACTATAAAT GaCCATGAAA TGGCATTTTC AAGTTCCAAA ACGTATGGTC 780
CATATCCACA AAATAAAGAT ATTACCATTT CCAGCTT 817



(2) INFORMATION FOR SEQ ID NO: 1155: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 322 base pairs 

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1155:
AnAGAAAATC TACTCTTTTA TTTCACCGGT CCTTTCnTAT CTCTCATTTG GCATACTCCT 60
TTTTCCATAT TTTCCACTTT CAnTACAGTG AATTTACATG TAATTTTACA AATACAATAG 120
GAGTGCACAA TGGAATATAA CACTTATCTT TATAAAATTA TTTTATATAT TGaCGCACTT 180
AAAACAATCT ACAAATAAAA TTATTATTTT TAGTTTTCAA TGAATAATTC ATTCTTATCT 240
ATTTAAAAAT ACATTTAACT GC 322
(2) INFORMATION FOR SEQ ID NO: 1156:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 777 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1156: 



GTTTCCAAnG GAAnGnGAAA AGCGTCAACC AATGTAACTA TTTAAAGTCA AAGTGTTTGA 60
CCAAATTTGA CTTAATATGT AAAATAATGA GTAACAGTTA TTACAAGGAG GAAATATAGA 120
TGAATTTAAT TCCTACAGTT ATTGAAACAA CAAACCGCGG TGAACGTGCA TATGATATAT 180
ACTCACGTTT ATTAAAAGAC CGTATTATTA TGTTAGGTTC ACAAATTGAT GACAACGTAG 240
CAAATTCAAT CGTATCACAG TTATTATTCT TACAAGCGCA AGACTCAGAG AAAGATATTT 300
ATTTATACAT TAATTCACCA GGTGGAAGTG TAACAGCTGG TTTTGCGATT TATGATACAA 360
TTCAACACAT TAAACCTGAT GTTCAAACAA TTTGTATCGG TATGGCTGCA TCAATGGGAT 420
CATTCTTATT AGCAGCTGGT GCAAAAGGTA AACGTTTCGC GTTACCAAAT GCAGAAGTAA 480
TGATTCACCA ACCATTAGGT GGTGCTCAAG GACAAGCAAC TGAAATCGAA ATTGCTGCAA 540
ATCACATTTT AAAAACACGT GAAAAATTAA ACCGCATTTT ATCAGAGCGT ACTGGTCAAA 600
GTATTGAAAA AATACAAAAA gACACaGATC GTGaTAaCTT CTTAACTGCA GAAGAAGCTA 660
AAGAATATGG CTTAATTGAT GAAGTGATGG TACCTGAAAC AAAATAATTC AAAGTAAAGA 720
GTAGACTAAG CnGTCTGCnC TTTTTGTATG AGTAAACCAA GGTGTCAATA ATTTGTn 777



(2) INFORMATION FOR SEQ ID NO: 1157: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 155 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1157:
TAATGTTGGT AAATGGGTTA AGGCTCAAGG TCGTATTGAA GAAGATACAT TTATTAGAGA 60
TTTAGTTATG ATGATGTCTG ATATTGAAGA GATTAAAAAA GCGACAAAAA AAGATnAGGC 120
TGAAGAAAGC GTGTAGAATT CCACTTGCAT ACTGC 155

(2) INFORMATION FOR SEQ ID NO: 1158:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 400 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1158:
GACTGGTAAA ATTTACTTGC AAGTAAAGCT AAAAGGTCAA ATAGATAAAG AACAACTTGT 
TTTTCAAAAT GACAAAAATG AAGAATTTCC TTTTGTTATA AAAGATGAAA AGGATGACAC 
AATAGTAAGA ATTTTAATTG AACAGCATAT GGATAAAATC AATATGCATG TTAAAACGTT 
GGCTGAAAAG AAAAATCTAG ATAACAAAGA AATGGTGTAT TCTATTCATT TTAAAGAGAA 
AAAAGTACAA CATGATGATG CAAAAGAAGT GCCTTCAAAA CATCAAAATC AAGAAAATAA 
TCAAGATCAG CTTAAAAnAG ATATTGATGA CAAAnAAGAT AGTCAAAAAT CAGATACTAA 
GGGAAAGACG TACTAGCCTT TTTACTGGAA AAAGGGTTTA 
(2) INFORMATION FOR SEQ ID NO: 1159: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 347 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1159: 
GTTGGTACAG TGTTGAGTGG CTTTGAATAT CGAACACAAA AAGAAAAGTA TGACAATTTA 
TATAAATTTT TCAAAGATAA TGAAGAGAAA TACCAATATA CAGGCTTTAC AAAAGAAGCA 
ATAAACAAGA CGCAAAATGT CGsATATAAA AATGAATATT TTTATATCAC TTACTCGTCA 
AGAAGCTTAA AAGAATATCG TAAGTATTAC GAACCATTsA TTCATAAAAA TGATAAAGAA 
TTTAAAGAGG ATGGACCAGC CAGAAAGAGT GATTACGCGC TATCAGTCAG TACACATGTT 
AGTCCAGGAA TTTCTAGCAT CnGTGTGTGA ATGAnGGGTA CTTTATn 
(2) INFORMATION FOR SEQ ID NO: 1160:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1160:

ATAGTTATTA ATTAGTAGAA TGAGGATATT TAAATATAAA GGATAGTGTT GAACTTATGG 60
ACATGAATTT CGATTATACA TGAACGGTGT TGTAGAACAA GCAAGGAATG AAATTGAATC 120
TGCGGGATAT GAGCAATTAA CTACTGCAGA AGATGTTGnA AAGTCCAAGA TGGnCACCTA 180
GTTATGATCA TCTGTATGTG GTG 203
(2) INFORMATION FOR SEQ ID NO: 1161: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 203 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1161: 

AATTTTCCnT TTCCATTGAC ATAACAATAT TTAATGGGGC CCAGTTACAG GAAATATCAG 60 

TTTAATTTCG TCTTCAATAC CATAGTCATA ATATTGAGTT TCTTGTGTTG GAAAATTTCC 120 

TACATAGTTA CTCATTTTAA TTTGTnCAAT GTTTGAATTC GGAGGCACTC TGTTAGCATA 180 

TCTTTAAACA TTAAATATGG ATA 203 
(2) INFORMATION FOR SEQ ID NO: 1162: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 220 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1162:
GCCATTTTGT TGATAAAGGT ATACTAAAGG TTATCGTTTT GAAATTTTTA GTAACTAGGA 60
TATGTTTCGT GTTATAGGAC CGGAATTTGT GGTATACGGT AAAATTTTAA TGCTATTGGA 120
ATTTTTAAAA ATGGAAAAAC ATGGACATTn AAATTGGAAT TTCATAATAT GTCCTAATTG 180
GACTTAACTT GTTGGGAGTT CATTTTACTA TTTnTATGGT 220
(2) INFORMATION FOR SEQ ID NO: 1163: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1163:

TTCACGTGCT TTGCCATCTT TTAAGAAAAA CACAGAAGGC TTTTGCATTT GATTGTAATT 60
GGATGCCATA GAGTAATGAT ATGCACCAGT TGATAATATA GCAAGATAGT CTCCACGTTT 120
GACTGGATGA AGGTAATTTA GCGTCTTTAA TAATGATATC ACCAGACTCA CATAATTTTC 180
CAGCTATAGT TACACTGtCA TCTGCTTCTT CATTTCTATT AACAAGCAAT GCTTGATACT 240
TTGCGTCATA AAGTGCAGTT CTGATATGAT CACTCATACC GCCATCGATT GAAACATATT 300
TATTArTCTC TGGAATTTCT TTAATGGtTC CAACTTCATA TAAAGTAACG CCAGCtTCAC 360
CTACAATTGA TCGTCCCGGT TCAATACCTA TTTCTGGTGC ATCTATACC 409
(2) INFORMATION FOR SEQ ID NO: 1164:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1164: 

CTAAATAATT GTTTAAATTT TTCTTTACTA TTTAGACTCt TTCGTTTGCC ATGACTTTGT 60 

TGACTTGCAA TGATATGAAC TTAATTTTTG CGAAAAAAGG AATCTAGATT CATATATTCG 120 

GTAAATCTAC CTTCTTCATC TTTTTGAACT TGTAATTCTA GCAATTCACG TTGTATCAAA 180 

TTTTGAATAA CCATTGTAAT ATCACGTGGT TGCATAGTTG AGCCCTTCTG AAGCAATTCA 240
ATTGAAGGCG TTTATTTGAG TTTCGAAGCA TAAATCAATT TAAGCAAAAG ACTAAATCTG 300
CTCATCTAAA CCTAAGTCAC GTAAGGTCTA ATAATCnCTC GTATCACTAC GGTCTGCTTT 360

AATGATATTA nCCAGTnCGT GCCCCTTTAA ATTAACCAnA 400 
(2) INFORMATION FOR SEQ ID NO: 1165: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 327 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1165: 

TCGTGATTAT TATTAACATT TTTTAGTAAT TGATAAAAAT CTTTTCTCGC TTCTGTAGGG 60 

CTAGTGATAA TCATATAAAC CCTCCTTTGT TTTCTGTTAC GCTAATTATA GCGTACGCAC 120 

TTGAGTACGT CATATTACCA ACGAGTTCAT CGTATnCTTA ATACCTATAT GCTATCGTTC 180
GGACACACTA CCGTnCCCTA CCGCTTAACC ATTTTCGGGC TTAAATATTT TnAAAATTTT 300
GTGAGGATCC GCTTAAAAAA CCCCATA 327 
(2) INFORMATION FOR SEQ ID NO: 1166: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1166:

TTAGGATTTC AAAAATATGT CAGGTTTTCT AACATAAAGC TTAAAACATT GATATAACTG 60
CGATTTCGAC TTCTAAAACG aCATrAAACA ATCTACAGCT AAAATATTTT TTCAATTTTT 120
AAAAATAATC nGATAAATCT CCGCAAAA 148



(2) INFORMATION FOR SEQ ID NO: 1167: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 243 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1167:

nAAGGTTATC GCTCGTAAAG ATGTTGTTGA AGAAGTATTT GACGACAGAG ATGAAGTGGA 60
TTTAAGTACA GCGCTTGTTA AAACCCTGCA TATGAAATTG GTGTATTACG AGAAGAGTAC 120
ACTAAAGTTT TGGTCGTGTA GGTCTCAGCA GCGAACAGCA GTATGCACGT CTnCGTGTGC 180
TGACGTGAAT TTTATTTGAG GATTATAGAC AAGAGAGACA ACTTACTGGA TTTTGCCCTG 240

TTG 243 



(2) INFORMATION FOR SEQ ID NO: 1168:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 321 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1168:
TTATATACTA CTATAAAGTA ATCCAACTCT CAAGAAGGTA AATGGGCAAT CAAAAGCTAA 60
CTACGATATG ACGCACAAAT GTTTAGGGTA GGTGTTGACA TGTGTACAAA AACCAGCAGA 180

AAATGGTTGT ATTCTTAGGG AAGCATTCAT CCTGTTTGTT TATGGATTCA TGTTACGTTA 240 

CGCAATATGT TTTTATGTAG CACAGCGAAG TTACAAGTTT ATACGCTATA TATCCnTTGA 300 

TnATAAGCAG ATGAAAnCGG C 321 



(2) INFORMATION FOR SEQ ID NO: 1169: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 191 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1169:

TCTACATATA AAATTACTCT AAAAATATGT ATAGTCATAA ATTGTTGGTT GATTTAATTA 60
AAAGTATGGA AATTAAGGGG CTCTTATGTA TATAAAAAAA TGAATTATGA TAAAATGTAA 120
GAnAATATTT AGGGTCGATT GGAGAGATAC CAGTGGTACC AATTAGAAGA CGACAGTTTA 180
ATGTTACCAT A 191



(2) INFORMATION FOR SEQ ID NO: 1170: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 303 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1170:

AATGTGGCAA GGATGTGTGA TATGACTACA CATTTAATAT CATCCTACAT AATAATTTTG 60
CGCGCCAAAT TATCACACAT ACCACACCTA CGATAACAAC CAGCAAATAA GGAGATAGCT 120
ACACGTATTT ATCGTCCAAC GCACTACTAT ATGTTGTTGn ATGGGCACAA GTTACAAGTG 180
CTGCGCGATA nCAAAAGGTA TACCCCTAAA ACGTCCCAAA AnAATCGGTT GCCCTCTAAA 240
TTGTTTGTGT GCCATGATTA GTAAGCTCCT ATGTTAGAAA CAAATTTTTA ATATGTATAT 300

TAG 303 



(2) INFORMATION FOR SEQ ID NO: 1171: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 299 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1171:

AAAAAGGTGA GAAGACAATA ACGnACGCCA ACACTAAAAA ATCCATTAAC TGGAGAAATT 60
ATTAGTAAAG GTGAATCGAA AGAAGAwATC ACAAAAGATC CAGTTAATGA ATTAACAGAA 120
TTCGGTGGCG AGAAAATACC GCAAGtCATA AAGATATCTT TGATCCAAAC TTACCAACAG 180
ATCAAACGGA AAAAGTACCA GGTAAACCAG GAATCAAGAA TCCAGACACA GGAAAAGTGA 240
TCGAaGAGCC AGTGGATGAT GTGATTAAAC ACGGACCAAA AACGGGTACA CCAGAAACA 299

(2) INFORMATION FOR SEQ ID NO: 1172:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 821 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1172: 

CCATAGCACC ATACCCATAA CCAACCCGGA TCCACTAACA AATAAGTAAT AATTnACACG 60
TAAAGAAGTA AATAGTTTGT TTAGTTTTTT ATCAACATTA TCAATGTTTT CCTCGACAGA 120
CACTGGAATT TTAACGATAG CTGTACCATT TGTAATTGAG TAGTTCGTTA TTTCACCCAT 180
ACTACTATTT GGTAAAATTG TTAATTCTCC TGAAATTGTA TTGATTCGTG TTGAACGAAG 240
TCCAATTGAT TTAACTGTTC CCTCTGCTAC AGTTGTACCA CCGTTATTTA TCTTAACATA 300
ATCACCTACA TCAAATTGGC TTTCAAATAT AATAAAGAAT CCTGTAAtTA CGTCTTTAAC 360
TATAGTTTGA GCACCAAAAC CTACTGCTAA GCCTACGACA CCAGCACTGG CAATTACCCC 420
TTCAACACTA ATGCCAAATT TACTTAAAAT CGTCGTAATA ACTATAAACC AAACGATATA 480
CTTCACTACA TTTTGAACAA GAGATATTAA AGTTTTAGAG CGCTTTTTGT TACTCTTTTT 540
ACTTTTATTT TGAATCTTAA ATCCCTGTTC AATCATTTTA TTCAGTATTT TAATAACTAT 600
GAGGGCTACG ATAATATAAA TAACAATCAT AGCGATTTTA GTTGCAATGT TTTCATATGT 660
TTCTATTTTT GTTAATGGCT CAAATAGAGA TGAAATAATA TTCATGACTT GATTCATGAT 720
AAGCGTTACT CCCTTCTATA TGATATATCA ATATTGtATT CATATTTtga TATACTTCCC 780
AACCTACGAA AGTCTCAATA TATCTAGAAT CAAATGGTCA C 821



(2) INFORMATION FOR SEQ ID NO: 1173: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 318 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1173: 

CCCGAAATTA AAATAATAAT GTATGAATTT TTAAATATGA TTTAAACGTT TTCAGTTTTT 60 
ATGAAAACGC ATGCATTTTA CAAATAAAAA TGGTACGATG GCACTGGTTA AAACGTTTTA 120
CTAAAAACAA ATCATGAGGT GTATAACATG GCTGTCTCTT GATCAGATCT TGATCCCCTG 180
CGCCATCAGA TCCTTGGCGG CAAGAAAGCC ATCCAGTTTA CTTTGCAGGG CTTCCCAACC 240
TTCCCAGAGG nCGCCCCAGC TGGnCAATTC CGGTTCGCTT GCTGTCCATA AAACCGCCCA 300
GTCTAGCTAT CGCCATGT 318

(2) INFORMATION FOR SEQ ID NO: 1174:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 206 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1174: 

GTTTTAAAGC GGTTATAAAG GTAAATATGC AATATTATAA ATATCTTCTT CAGAACTAAC 60 

CGAATCTCTT AAACTCATCT GTTTAGATAG CGTTTCAAAT TGATAAGTTT TAATTTTAAC 120 

CGTTACAGTT TTAGCTGACT GCTGTAATTT ATTTGACGTG CAGCTGTTTT CCTGACAATn 180 

CCATACTGTC TTAAAnCTCT GCATCA 206

(2) INFORMATION FOR SEQ ID NO: 1175:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 653 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1175:
TCTGCATCAT GATATTTTAC ATCATCACTT ATAAGAACGT CACCTACATT TAAACTTTCA 60
TCTAAAGCTC CAGCAGAACC TGTATTAATA ATGACGTCCG GTTTAAACTT ATTAATTAAT 120
AATGTCGTAG AAATTGCAGC ATTAACTTTT CCAATGCCAC TTTGGGTAAT CACTACTTCT 180
CTATCTTTTA AAATGCCAGT ATAAAATTTA ACATGTGCAA CTGaAATTTC GCTTAATTGT 240
TTTTGATTCA CCTCTTTTAA AAAATCCTAG CATTTGTTAT TTTATCACAT TTTAATCACT
ACAACGACAA GTATCATGAT TTTCATTACT GTATAAAACT TTTGCTTTAT TTTTATAAAA 
AGCTCTTTCA ATAACTTGGT ATCATAAATA ATTATTTTTG AACTCATAAC AGTGAATTCG 
ATGCTTCTAA TTTGTCTATT TTTCAAAAAC TAATCATATC AATTACATCA ACCAATCGTG 
TCAAGAAACT TTATTAAATA ACAGGAAAAT GATATGTTTA AAATAATAAT GATGAAATAA 
GAGGATTTTA CAATGGCTGA AAATAATCAA AATAGTCTCG TAACAAAGAT AGC 
(2) INFORMATION FOR SEQ ID NO: 1176:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 410 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1176:
ACATGGCCGT TnGnATCAAG ATCGTCAACG ATTGATTGGT ACAATTGAAG TGCCTATGAC 
ATTGGCAACC GTTGGCGGTG GTACAAAAGT ATTACCAATT TGCTAAAGCT TCATTAGAGC 
TACTAAATGT AGAGTCAGCA CAAGAATTAG GTCATGTAGT TGCTGCCGTT GGTTTAGCGC 
AAAACTTTGC AGCATGTCGC GCGCTTGTGT CAGAAGGTAT TCAACAAGGT CATATGAGTT 
TACAATATAA ATCATTAGCT ATCGTTGTAG GGGCAAAAGG TGATGAAATT GCTAAAGtAG 
tGAAGCTTgA AAAAAGAACC CCGTGCAAAT ACACAAGCAG CGGAACATAT TTTACAAGAA 
ATTAGACACA ATAGTAAAgG CGAtAGCGAT TaAAGAAAGT TAAGTCAGTC 
(2) INFORMATION FOR SEQ ID NO: 1177: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1177: 
CATCAAGATA ACTACGGTTA TAATCCAAAC GACCCAACAT CATATAACTA TACTTACACT 
ATTGATGGCA CAAGGTAACT ACCATTACAC ATGGGAAAGG TAACTnGCAT CCAAGTCAAT 
TAAACCAAGA TAATGGCTAC TACAGC 
(2) INFORMATION FOR SEQ ID NO: 1178:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 225 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1178:

TGTTCCTGGT GGAACTTTGG TTTCTTAATG TCATTCCAAA TTGCTGTATA TTCATTCATT 60
GGTATTGAAC TTATAGGTGT ACTGCTGGTG AACGAAAGAT CTGAAAAAAC CTTACCGAAG 120
CAATTAATAA TGTACTATCC GTATTTATAT CTATATCGTG GTnATATAAT ATGTCATCAA 180
CTGGATnTAC GTCAATAGAC CTTCGTAACC TTAATATCGC GACAT 225

(2) INFORMATION FOR SEQ ID NO: 1179:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 319 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1179:

TGTACGCATA TGGTATCACT TGATGACCAA TTCACTTTGT CGAGAATAAC GTTATTTAAA 60
GACATTGATT TAAAGAATTG GTATTnCGTT AATTGTATTA GACTTTTCAC AGAAGTAGCG 120
ACAACAATGA TTTCATCGTA ACACGTGCGC ATATnAAGCA TGGGAAAAGA CATGGnCTAT 180
TGACCTGGCC TTTTGTGCAC TCGTACGATT GGTCAAACGT GGCTATATTG AAAGTTTGAA 240

ACAAGATGCG ACGGACACAC ATGCACGGGT GGGGATAGAT CTTGAGTCTT ATTGAGACGC 300 

TGTGACTGTG GCTGACATC 319 



(2) INFORMATION FOR SEQ ID NO: 1180: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 654 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1180:

TTTTATCTTG GTTTTGAACT AGATATGTGT AGTCAGGTAT AACATAATCA ATCCCTAATA 60 

AGTTATCATC AGGGAATTTG ATACCTGCAT CGATAATGAC AATTTCGTCT TTATACTCAA 120 

CTGCATAAGT ATTTTTACCG ATTTCACCTA GACCTCCAAG TGCATATACA CCTACTTCAT 180
TTCGTAATCT AAATGTGCGC CCTCTAATTT AGTGATAAAT TCGATATTAA AATTACGATC 300
TTTCAAGTAA CGACGTACTT GTTCTTCTGT TTGAGCTTCA ACATAAAGTG ATTGTGTATT 360
TTCACGCACA ATTACCTCGT CTCTGTTATG TTGATAaAAA ACTTTAAATA CTGCCATGTT 420
AAAAATTCCT CCTAAGAATG TTTGTyTAAT TTATTGTTAA CCCTAGtAAA ATCGTATTGG 480
AGTATATATC GATAAATTCA TTCCAATCAT CTCTATAATG TAACTTAATA ACGATTTGGT 540
TAATAACTAG GTTCATCATG TCGTTCATTT TAAAAAATTA GTGAAATAAC ACTAAAATTT 600
CAGTTAAATC TTATTTTACA TGATGAATGA TAATAAATAA AGCAGTTTAT CTCA 654
(2) INFORMATION FOR SEQ ID NO: 1181: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 231 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1181:
CGGGTATATT ATAGACAACG TTTAAGGACT TCTTTTTAAC TAACTTGTTA AATGTTGCTG
TATTTTTGTA TTCAACACTG TAATATAGTT CTTCTGACGC TGATGCATTA CTCATTGTAA
TAGATATTAA AAGACTCAAT ATTATTAaAC TAATAATAGC GCGCTTTATG ATTTTCATAT
TCTAATCTCA AATGAATTCC AGAACTTTAT ATGGTTCAGG TGCGACAACA T

(2) INFORMATION FOR SEQ ID NO: 1182: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 245 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1182:
TAAGTTTATT AAGCTCTTGT CCGTTAGCAA TCACGCTACC GCTAGAAATT TGGTCAATTG 60
AACTTAGTAC ATTTAATAAG GTTGTCTTAC CTGATCCAGA AGGCCCCATA ATCGCACGAA 120
TTCGCCTTTT TGTATGTCAA GTAATATCTT AGGCTGAATG GTGCTTTACG AGTTTGAACA 180
GTGCCGAAAA CGCAAGCCCC TTGATATCTT AAATGTGGAG ACTACAACnC ATCAGTGnCA 240
CGGGA 245

(2) INFORMATION FOR SEQ ID NO: 1183: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 490 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1183:



AGGTAAAGAT GATGGTGTAG AAGTTTATGT GCATTGCGAA GATCATGGCA TTGTATTTAA 60
TGCAAGTCTA CCTTTGTACA AAGATGCCAT CCATCAAAAA GGATCAATGC GTAGTAATGA 120
CAACGGTGAT GATATGAGTA TGATGGTGGG TACAGTGCTG AGTGGCTTTG AATATCGAGC 180
GCAAAAAGAA AAGTATGATA ATTTATATAA ATTTTTAAAA GAAAATGAAA AGCAATATCA 240
ATATACAGGT TTTACTAAAG AAGCAATTAA CAAGACGCAA AACGTCGGGT ATCAAAATGA 300
ATATTTTTAT ATTACATATT TATCAAGAAA TTTAAAAGAA TATAGAAAAT ATTACGAACC 360
ATTGATACAT AArcAATGAT AAAGAGTTtA AAGAAGGTAT GCAACGAGCT AGAAnAGAGC 420
TAAACTATnC TGCTAATACA AATACTGTAG CAACGTTGTT TAGTACGAAT GATGAAAGGA 480
ATAGAAAAGA 490

(2) INFORMATION FOR SEQ ID NO: 1184:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 153 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1184:

CTCGCATTTT AATAAATCAT ACTCTTCTTT TCTTGCATAC GAATTAGAAT AACTCGCGAG 60
ACCTATAAGT CTCTTTCCTC ACTAGATAGT TTATACTTTT GGTCnGTTGA AGTCAATAAT 120
TTTATCTAAA GCTATAAAAA ATCTTTTGAT AGC 153

(2) INFORMATION FOR SEQ ID NO: 1185: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 164 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1185:
TCATTTATAT ACTGTATCAC GCTATCTTTA GTATGTGTGC ATnATCATTC GTTAGTGCGT 60



EP0 786 519 A2 



AATGAACGTT ATATTTTAAT TCATGATTTA CTAAGACCAA CAAT 
(2) INFORMATION FOR SEQ ID NO: 1186: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 143 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1186: 
CACAAGTGAC GTCATATTAT AAAAGTCACT CGGTTTGCGA TACGTTCTGT CTAAGAAATA 
GCGACGTGCA ATTTCATATT TTnTATAAAC ATCCGTTGAA AAAGGACATA AAACCATGCG 
TTGAACCAGG GTCTATACTT TCT 
(2) INFORMATION FOR SEQ ID NO: 1187: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 209 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1187: 
CGTACAAGTT AAAACGTTTA TACGATTACG AAAAACCACC GAAGAGTGAG AAAGATCCAA 
TGGTTGTATA TGCAGTAAGT GCAGGATACA AATGGTTCTT TGCTTATCCA GATGAACATA 
TAGAGACTGT TAATACATTA nCAATCCCTA AAGATGTCTG TTTATTAGCT CAGCATGAnC 
ATACAGTTCG GTCACATAGT GTCAATTCA 
(2) INFORMATION FOR SEQ ID NO: 1188: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1188: 
TTGGAATCGC TGTCGACGGA TAGCAACCTG GATTGTAAAA ATAATGTTGG CTTTAAAAAT 
ACCGTGGCAA TATTAAATGA ATAATCGCTG CTGAATATAC ACCCAATGCA CATTGAACTA 
ACATAAnCCC ATTTGACTCA TCTACGCCCA CTATTGGCTT TATAGTCACT GAACTAAGCT
TAT 243
(2) INFORMATION FOR SEQ ID NO: 1189:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 527 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1189:
CACCAGGCAT TACCATACCA GCACTCATAC CACCAATCAC kCTCGATAAC ATCAATACCG 60
AAAAATTGTG GCCAACTGCA AACATAAATT CTGACACTGA AAACAAAATT AATCCTATAC 120
ATATAATTAA TTTCTTCCCT AATTTGTCAG CTAGCGTACC ACCAAACGGC GATATAATCA 180
TTTGAGATAA CGCAAAAGCA GCAACTAGTA ATCCTAAATC ACTACCAGTT AATCCCAAAT 240
CTTTTAAATA AACAGGCAAG ACTGGTATTA CTAAACCGaT ACCTAAAAAA ATCAAGAAAA 300
TATTAAAATA TAAGACAAAA ATCTGTTTAT TCATATGCTC ACCTCTTTTT TCTCCATGTC 360
ATGCTTAAAG CTGTTATTCT TCTTTTCCAC GACAGATTGC AATTCATGCT AAATATTCGT 420
AAAAAGTTTC TATATTGTAT TACAACATTG CTACATTTGA CAATATTTTt TCTTTGTAAT 480
TATCACTATn TTCCATTTAA TTGTATAATT AATGACATAT TnnAATA 527

(2) INFORMATION FOR SEQ ID NO: 1190:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 869 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1190:
AAAACGTGGC AGTAATTCAG TGTATGTTCA ATATGATGAT ATTATGTTTT TTGAATCATC 60
AACAAAATCT CACAGACTCA TTGCCCATTT AGATAACCGT CAAATTGAAT TTTATGGTAA 120
TTTAAAAGAA CTGAGTCAAT TAGATGATCG TTTCTTTAGA TGTCATAATA GCTTTGTCGT 180
CAATCGCCAT AATATTGAAT CTATAGATTC GAAAGAGCGA ATTGTCTATT TTAAAAATAA 240
AGAACACTGC TATGCATCGG TGaGAAACGT TAAAAAAATA TAATAAGATA ATAAAGTCAG 300
TTAACGGCGT ATTCAATTGT AAATCTTGTT GGATTTTAAC AAGATAACTA GCAAATGCAC 360
TGTATAGCTG GCTTTTTAAT TTTATTAACA AAATTAAATA TGACGCGTGA ATTAAAAAAT 420
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CTCTACAATC CTATGGCGCA ATTTTTAACA AAATATACTA AATtTGAAAT TGACAGTTTC 
CATAAATAAA ATGCTATAAT TTCCTACTCC GTGAATCTTA TTTTGTTTTA AAAAACAGTG 
TAGCACAAGm TTCAACTTGT TTCCTACACT GTTTTATAAA TGAATTATTT AAAATGCTTG 
TTCAACCTCT TGTAATGTTG GCAAACTATT AATAGCACCA TACTTCGTTG TCACTATTGC 
AGCAACGCGA TTGCTAAACG CTAAAATTTC CTCACCTTCA TTTTCAAATA ACTGAGTTAA 
ATTTAACACA TCGGTTGCTA AAATCCTGCT AATAACTGCA CCAATAAATG CATCCCCTGC 
CCCAGTTGTA TCAACAGGTT TTACCTTATA GCCACTATGA TAATGATTA 
(2) INFORMATION FOR SEQ ID NO: 1191: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 254 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1191:
AGTTAAAGGC GTTGAATTAA TGGCCTTCCC ACACATGACA TATAAACAAG CGTGCGCCTA 
TGGTCTGATA ACCAGATACA CGTTTGAATG GATTAATTGC GTTCTCAATT AGGCGTGATA 
TGGCTTTAAA GATTTnAGTC TGTTGAAATG ATGGTGAATT AAAGCATTGC GCTAAAGGTG 
CACTGACAAT ATCTCGTAAA GATATGGTGC TTTAACAGAT TTGTAACATC TATGGTCTAA 
nGATACGGGG TTAA 

(2) INFORMATION FOR SEQ ID NO: 1192: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 189 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1192: 
GGCTCATAAG TTACACCTAA TGCATCTTCA TCTGGAAGCT GTGGTTTATC ATCTTCTAAA 
TCAGCAGTTG GCGTTTTTTC ATATAATTCC TTTGGCGCAC CAAGATACGC TAATAATTGA 
GCTGTGTATn CTAGGCATCC ATACCGTATT CGTTAAACCA GTATATTTCA GCTGAATGAT 
CTGTCCTAC 

(2) INFORMATION FOR SEQ ID NO: 1193: 
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10 






(A) LENGTH: 215 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1193:

TTGATATTGA TCGATGACCG GTGATGTGAA ACTGAAACTG ACCAATGAAT GAAAAAATGG 60 

GCAATGTTTT TGACAACAAT GGTCATTTAC CAAATAAAAA TATTGnCACC ATTAATACCT 120 

GnCAAACGTG TTTGTTCCTA AACCTAAAAT GATAAATCCT ATTGAGATTA TGGACTTGGT 180 

AAGCTGCAAT CTTTTTAATA TCTTTATAAG CAATG 215 

(2) INFORMATION FOR SEQ ID NO: 1194: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 158 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1194:
AGAATTTGCn ACTGGTTTAC CAACGCATCT ATGTTCGGTC GCTATACCGC ATACAGATGT 60
CGAACATATT AACCATAGAA CGTAGGTGTG GTGTTCTAGA AAAAGAAGTG CCGTTATTGA 120
AATGGGACAC TTGATCAACA GCCGAGTGAA ATCGTTTT 158

(2) INFORMATION FOR SEQ ID NO: 1195: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1195: 
ACTGATTGCC GTGTCTGTGG TTGATCATTG ACTAATTGTT GGCCATCATT GGnTTCTTGG 60

TCTAATAGTC CTCTTGCTCC TGCGTCGTAA CATTGATGCC AATCATGCAC CACAGCCTTT 120 
CGGTTCATCA TAACATTTCT GCCAATGTCT CTACATGACA CGGGCTCATA TTTCTTTCTG 180 
(2) INFORMATION FOR SEQ ID NO: 1196: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 274 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1196: 
GTTAAAGCGA ATTATTCACA ATATAAAAAA TCATCTGATC AAGTCGTATT AGAATTAGGT 
ACAAATGGCG ACTTTACTGT CAAACAGCTC GACGATTTAC TTAATCAATT TGGaAAAGCC 
AAGATTTATT TAGTTAATAC ACGTGTTCCA AGAaTTTmTG AGGgCAAATG TAAATCGATT 
ATTaGCTGAC GCGGgCGAAA CGGAAGTCCC AATGGTCACA TTAATTGGAT TGGGTATTAG 
GCGCTTCACA AGGGACCTTG GTGGAATATT TTTG 
(2) INFORMATION FOR SEQ ID NO: 1197: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 154 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1197:
ACGTAACCTA TATTCATCTC TGTTCCCAAT AATAATAACA ATACATAACT GATATTAGCT 
GTCTTTCATC TCTAATTAAT ATGCGTCATG AGCTAATATC GGTTATTTTT TTGTGTCAAA 
nAATTTTTAT TTTATCTCTC CATCGCATTA ATTG 
(2) INFORMATION FOR SEQ ID NO: 1198: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 249 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1198:
AATCCTCACC CGTAATAAAC TCGTTTTGAA TGCGTTGTTC GAACTCACGG TATATTAAAG 
CAATATCCTC TAACTTATTT TTAGTTCGAG TTTGCATATT TTTATCAGCA ATAAAGTGTT 
CTAAATGTTC AGGCGTTACT GCATATTTTT TAAAATCTTG AATTTGTTCT GTTAATTTTT 
CACTAAACCC ATAATATTTT GCTTGTGATT GATAAAGTTT TAAATACTTT TGTTGTTCnT 
GAACAATGn 

(2) INFORMATION FOR SEQ ID NO: 1199: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 165 base pairs 
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10 



(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1199: 
TTGGTAAAAA TCGACATCAA ATACTAAATC TGATCCATCT GAATCCCATG TTTAAAACCG 60 
AACTTGTGCA TTAATTGTAA AGTTCAAACT CTTCATCTTC ATAGATCGCG ATGCGTCTTT 120 
TAATTCTTTA GGTGATACTT TGTAGATCTA TAACAAAATT AnCTA 165 
(2) INFORMATION FOR SEQ ID NO: 1200: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 236 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1200:
TTATTAGCGA TTAAAAATAT GTCCTTTTTT AATAAAGGAG ATATATGTTT AATTGGATGT 60
CAACATCGGA GTCATTGGCG AAAAATAGGG CTGCGGTCGA TGGAGTTCAA AACTATTTTA 120
ATGCATTACA TGTTGTGAGT nAAGAGCAGG CGTACTTTTC TTTTCAAGGA TGTGACATAT 180
CACAGAGCGT ACAATAGAAA CTCAGATACC ATTACATGGA nAGTCCGCGT CCAATG 236

(2) INFORMATION FOR SEQ ID NO: 1201: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 146 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1201:
TAATGCTAAA CAGTATTATG GAGATAATGG CTTAGTTCAA ATTTCTGATG nAAGTCAACA 60
CTTCTAAAAT TTGTAAATGA AGCATTAGAC AATAACGAAC AATCAGTTGA AGATTACAAA 120
AATGGTAAAG GCAAAGCTAT GGGCTT 146
(2) INFORMATION FOR SEQ ID NO: 1202: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 385 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1202: 
ATTGATGATA GTATTGATGG TAATACCATA AACATACAGC ATTATCACAA GTAAAAGTAA 
GCCTAAATGT CCATAAAATC TACTTGTCAC AATATATGTC GGTATTATCG TAACGGGAAG 
TCATTTCGTA CTTGATTAAA CTTTTGTGTA ATTGCTTTAG TACCTTCTAA ATACCTGGTT 
GATGAGACCA CATACTGATA CCACCATAAC CATAAACAAA AGGTACACCA GAATTGGGCA 
TACTTCTTAT TCCATCCAAA CCTCCGTAAC CTAATTGCTA CGGGTAATAT TCCAGCACAC 
TAATAGTATT GATGTAATTA ACTGAACGGC CGCTnGATAA TAAGTCTTAC AGTCTTAAAC 
GTCCATGCGC ATCnTnACCC ATTCA 
(2) INFORMATION FOR SEQ ID NO: 1203: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 580 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1203:
ACATTGAACG AGTACAGAAA ACTTATTGnA AATAAAGGCT GTTAAAGATA AAGAAGAAGA 
TACTTACAGA GGTAAGTATT TTGCGGAAGA AAGAAAAAAC GAAAAATTGG AAAAAGAAAA 
TATAAAACTA AAAAAGAAAA TTTATGAATT ACAAAACGAA GAAGATAACG AGGAGGACGA 
AGAAGACAAG GAGGACGAGA ACGATGTATT ACAAAATTGG TGAGATAAAA AACAAAATTA 
TAAGCTTTAA CGGGTTTGAA TTTAAAGTGT CTGTGATGAA GAGACATGAC GGTATCAGTA 
TACAAATCAA GGATATGAAT AATGTTCCAC TTAAATCGTT TCATGTCATA GATTTAAGCG 
AACTATATAT TGCGACGGAT GCAATGCGTG AyGTTATAAA CGAATGGATT GAAAATAACA 
CAGATGAACA GGACAAACTA ATTAACTTAG TCATGAAATG GTAGGAGGTA TGAnAAGTGA 
ATGATTTACA AGAGAGAGAA TTAGAAACAT TCGAACAAGA CGACCGATTC AAAGTAACTG 
ATCTAGACAG TGCTAACTGG GTTTTTTnAG AAACTnGGAT 
(2) INFORMATION FOR SEQ ID NO: 1204: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 139 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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ATCGTATGGT TGAGGCGAAT TTAACTGAAA AATTAGACAA CGAATCTTGA TGAAAAGAnA 60 

ATGATTGGCT AAAATAGCAG CTAATCAAAT CAACGATAAT GAATGCTTAT TTATCGTGCT 120 

GGTTCATCTA CATTGGAGC 139



(2) INFORMATION FOR SEQ ID NO: 1205: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 229 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1205: 



AGGAGGATAT CAGaCTTACA rACTATTCTT TATCATACTG rAATTCAAAT AGACACCTTA 60 

ATTGaAGaAG GCGTTGACGC GCTACTTTTC rAAACGTATT ACGACCTnGG AAGnGTTAAC 120

AAATGTCATT TCCACGGAAC GGGGAAAGGA AATACGGCGT TGCCAATCCA TTGCTCCAAT 180 

TGAACCGGTT GCAAACACAA ATTGACTTGG TTGAAGGGGC AGGCATGCA 229 



(2) INFORMATION FOR SEQ ID NO: 1206:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 399 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1206:

TCGACAAGTT TTTAACAGTT CGTTATTATA TGAATGTAAG TAAAAATTTC TTAGCTACAA 60
CTTACATATT ATAAATGCAT AAATTAAACA AAAAGGGGCG AAAAAAGTTG ACTCATTTAT 120
CAGATTTAGA TATTGCGAAT CAATCAACAC TACAACCAAT TAAGGATATT GCTGCATCAG 180
TAGGTATTTC AGAGGATGCA TTAGAACCTT ATGGTCATTA CAAAGCTAAA ATCGACATTA 240
ATTAAAATTa CGCCAAGAGa AAACAAAGGG AAgTTGTTTT aGTTAACTGc GATGaGCCCA 300
ACACCAGCTG GTGaAGGtAA tCmACGGTTA CAGTTGGtTT AGCTGaTGcA TTCCmTGaGT 360
TaATgAAAAC GTTATGGTTG ccTTaGGGGG CCTGCCTTT 399



(2) INFORMATION FOR SEQ ID NO: 1207: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 138 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1207: 

ATAAACAATA CGATTTGTAT GCTACTTGTC TGAAGACAAC ATAAAACAAT ATGTTAACTG 60 

ATATTGAGGA TATGGGCTTT CAAAATCCAn TTCAAAAAAG GTTATGTTGA AATATGACGC 120 

GGTTGTTGAA AGAGACCC 138 



(2) INFORMATION FOR SEQ ID NO: 1208: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 655 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1208:

AGTCGCCTAA CTTTTATTAT CAGTCCAATC AGTTTCATCA AGTATAAATG TTATTGCTTA 60
TTTTTGTATT CTTGGCGTAT TTTTTGTTCT TCTGCATAGC GCTCTGGATT TTTCTTATAA 120
AAATCTTGGT GATAGTCTTC GGCTTTGTAA AATTGTGACG CTGGTArTAT TTTTGTTGCA 180
ATTGCCTTAT CAGCATTAAT CGTATTTTTA AGCTGCTCGA TATAAGTCTC AGCGAGTTCT 240
TTTTGATGAT CATTAGTGTA GAAAATAGCT GTTTnATATT GAGGACCACG GTCTTGATAT 300
TGACCACCTG TATCTAATGG GTCAATGACT GAGAAAAATA TTTCTAATAA CTTATTGTAT 360
GAGAATAATG CAACATCATA TTGAATTTCA ACAGTTTCTA AATGACCACT CGTACCTGAT 420
TTTACTTGTT CGTAAGTAGG ATTTTCAATA TGTCCGCCCA TATATCCAGA AGTTACTTTT 480
TCTATGCCGT CAAAGGTGTC AAATGGTTTC GTCATACACC AAAAGCAACC TCCGGCAAAA 540
TAAGCTGTAT TAATATTCAT TTTTGACATC CTTTCCATTA GACCTTAGTA CGATTTATTA 600
AGAAATCACT TGCTTTTTGA ATTGTTTTTA TATAACGTTA ATAnGnGATT ATnAT 655



(2) INFORMATION FOR SEQ ID NO: 1209:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 213 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1209:

CCGnCCCCAA TGGAAATTAC CTTTGGTnCC CCATTTTAAT TGTTAAATTA TTGGAGTGTT 60 
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AATCTCACTA TCTTCAAATA AGCTTGTGTA TAGTTTAACA GCTTCTTCAG CTTGGTTATT 180 
AAACATTAAA AATGTCGTGA TTTTTGGAAT ATC 213 

(2) INFORMATION FOR SEQ ID NO: 1210:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 149 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1210:

TTGATTGCAA CTTTGATAGT GAAAACATTT ACATGATGTT GTCACTGGTG CACGGGCGTC 60
GAGAACAATC ATTGCGTTAT TGACGGTGCT GCCCTGCTGT CTCAGTTTGT TGATGCAAGC 120
ACTACACGAA ACCCAATAGC nTCACCAGG 149

(2) INFORMATION FOR SEQ ID NO: 1211: 






(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 142 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1211:
AATGTTTCTC CTAATTTTAA ATGTTTAGGA ATATTAACTA GGATAATAAA TCCAGCGACA 60 
TCCCAAATGG GATTCATATA AGAATGTTGG ATGATAATnT TGGCCGTTAA TATACATATT 120 
TTCTATTATA AAATTGGGCA AA 142 
(2) INFORMATION FOR SEQ ID NO: 1212: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1212:

TTTAAATTTA GTCACTTCAC CTTTTAAAGC ATGTTCATAA AATGTTTGCA TCATCAATGC 60 

ACGTTCTGAA CCAGAGCCTT CAACACAAAG ATAAATTTGT ACAGCAATAC CGCCTCTAAC 120 

TCTTCGTTGC GATGATGGCA ATACCACTCA ATCCAACGCT CnG 163
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 765 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1213:

GTCACCTTTC ACGTCTTGAA GAAAAAGGCT ATATAAGAAG AGATCCAACG AAACCACGTG 60
CTATAGAAAT TGTAAGTGAT CAAACAAATG ATAATATTAA TATGGAAGAA ACGATTCATG 120
TGCCAGTTAT TGGTAAAGTC ACAGCAGGTG TTCCTATTAC CGCAGTAGAA AATATTGAAG 180
AATATTTTCC ATTACCTGAA CACTTAACAT CGACACACAA TAGCGACATA TTCATATTAA 240
ACGTCGTAGG CGACAGTATG ATTGAGGCTG GTATATTAGA CGGAGACAAA GTAATTGTnC 300
GCAGTCAAAC CATAGCAGAA AATGGAGACA TTATTGTTGC TATGACTGAG GAAGATGAAG 360
CAACTGTCAA ACGCTTCTAT AAAGAAAAAA ATCGTTATCG ATTACAACCT GAAAATAGTA 420
CAATGGAGCC AATTTACCTA GACAATGTTG CTGTAATTGG GaAAGTAATT GGTTTGTACC 480
GCGAAATGTA ATATTTTAAA CCGTTATATA TTATCGTAAT TGTTAAGCCC TCATTTTTAT 540
AAATTTTGGG nCTCTTGAAA AAGTTACGTT TTCAAGAGGT TTTATTTATT CTAATCTAAA 600
TTCAGTTCAA ACAGAAATTG CGAATTGTTA GATAATCTCA TTCTTTArTA TAAATnTTGA 660
ATTACAGATT TCTGCAAAAT GTTACACAAA TTAAAACTCG CCCCCGTAAT AATTTACAGG 720
AGCGAGCCAT TATTATTTTT TTCnTTCTTT TTTATTAAGT ATATC 765
(2) INFORMATION FOR SEQ ID NO: 1214:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 495 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1214:

AGTTGCCATC AAAAATACAG AACCTAATnA TnACCTTTTA TGATTTTTTG TGAATTCAAA 60
GTCACTTTCT TTTTnAAGCT TTAAATTTCT CCCCATTTTT TTAGCCCCCT ATAAGGATTG 120
AATATCAATG CCTTCTTTCA TTAAAATTTC TCTAATTTTC GAAACAAATA ATAATGCATG 180
TTCTCCATCA CCATGCACAC AAATTGTATC TGCTTGTAAC GTTACTTCCT TATTGTTTTT 240
TGAAATAACT TTATyyyCCt TCACCATCTT TAAAACCTGC TTAAGTGCTT CGTCAGTATC 300
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ATCAGCAAAC ACTTCAGAAG CTGTAATTAA TCCGACATTC TTTGCTTCTG AAATTAGATA 
TGAATTTGCT AATCCTACTA ACACTAGTGA TGGATCAAAG TCATAAACAG CTTGTGCTAT 
AACGTTTGCn ATTnC 

(2) INFORMATION FOR SEQ ID NO: 1215:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 243 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1215: 
AnTGTCCAGG TACCCGTGGT CCAAAATTCG ATGTACGTAG ACCATTGnCT CGTATTTGTC 
AGATTGTGTT AATGATTGTC TCTGTATTAG GAAGTTTATT CTCCATTTTA ACAATTAGAA 
AAATAGATCC GTTAAAGGCG ATTGGGTAGG AGGTGTGCAA ATGTTGAAAT TGAAATGTAC 
AAGTCATTAA GTGGATCTAC ATGAGCGTAA GGTCATTGGA ATAGGTTTGC TGTGACTCGC 
CGA

(2) INFORMATION FOR SEQ ID NO: 1216: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 256 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1216: 
GGGCTTCTTT CTAAAATAAC TAAGTGAATT ATCTATATAT GAAGAGGTCT CCAATACTTT 
TTATATAATT CGTGATTACA TTCTATACTT TTAATTGTAT TTTCTGCfGTA GGAAAGTGGT 
TTTATTAATT CGATTAGTTC GAACTACAnT CAAGAAAAGG TAAAATTTGT GCATGAGCAA 
GAGAAACATA CCTGAACCAA AGGCACCCGA ACAGTACCTA GGCCTAAGCG GCTAATTATA 
AATGCTCAAG CGCAnG 

(2) INFORMATION FOR SEQ ID NO: 1217: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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10 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1217: 

TTTATTTTTT CTTCTAAACG AAACATTGCT TCTTTTTGTG ATTGTTTTGT ATTCTTGTTC 60 

AACACAACAC TTACGCGCTT CCATTTATCT GTGTATGGAT CTTTATACTT CTCGTAGTAT 120 

CTGTATTTAG TTTCGTTATT TTTGTTTTTA AATTTTTCAA TCCACATGTT TATACCTCCT 180 

GAGAGAACGT ACGTTCTGTA AATTTGTAAA AAATAATAAG GGTAGGTGGG CTACCCAAAA 240 

TTTAGTACTA GGTACTAAAT ATGTTATAAT AAAATAAAAA GTAGGTGATA AGATGACTCA 300

ATTTCTAGGG GCGCTTCTTC TTACAGGAGT TTTAGGTTAC ATACCATATA AATATCTAAC 360

AATGATAGGT TTAGTTAGTG AAAAAAACAA GATTATCAAT 400

(2) INFORMATION FOR SEQ ID NO: 1218: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 400 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1218:
CCAAGTGTCA ATTTTAGATG ATTTTTTTGT TGACCTATCG CCTTAACTGA AGAAACTGAT 60
AAATCATCCA TTTCAAAAAT AGGTCTAGAA AAATCTGTTC CGAAGGtCTT AAACGATTCA 120
TATCACGAAT ATTTTTAATC GTTATATCAT TTTCTGTTAA TAATACATCT ACTTGCTTTA 180
CGGGATCTAA CGAAGTTGTT TTAGATAATT CTTTCATCCA TTTATTTAAA CCTTCAGCTA 240
ACGATTCTAT ATTTTCAATA TCCATCGTCA TACCTGCAGC CATATGATGG CCGCCAAATT 300
TAGCGATTAA CTCTTGATGT GCTGATAGTA TTTCAAACAT CGACACTTGA TCAATTGATC 360
TGGCGGAGCT TTTGCATGAT TTTGCTCCCT ATCAATATTT 400
(2) INFORMATION FOR SEQ ID NO: 1219: 






(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 458 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1219:
AAGATATTAG ACATAAAATC TAAAnACAGC AGTAAGATGA TTTATGATTA GAAACTATCT 60 
TACTGCTGTT CACTTTTTAT AATACTTCTG AATGTCTTCA CTTATACTTC TAGTCACAGA 120 






GCATTCATTT CTTGTcTAGC AACGTTCTAC TCTAGCGGAA nTAAtTAGCT ACCATCCTCG 240 

CTAAGAACCT TTCTTGACTT GTGACAATCG CTTGCTTCTT TCCTCTCCTT CGGcTCTCGC 300 

TTACTCATTT AGCTCTACTA AACTCGTTGC GCTCTTTTCT CGTTTCGTCA GATTCAAACG 360

TTTTCACTTC GCCAAGCCAT TTTTCTTTGT GTTTACTTTT TATTTTGACG TTTTAGACAT 420 

AAAAAAAGAG ACCTCACGGT CTCAACTTGG CCTGGGCA 458 
(2) INFORMATION FOR SEQ ID NO: 1220: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 494 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1220: 
CAAAATTTAT TTCTTGTAAA GTACAATATA AGTTAAATGT TGTTGTTAGG ATTGATTCTT 60 
TCATCTGAGC ATTTAATTGT AAGTCGCTAT ATTTAATTTG TTCAGTATTT AmAAATAAAT 120 
TGAAAAATTC AACTGATAAT AAAGGTGTTC TAATCATACA AATACTTGAC GGTTCGTAAA 180 
TAGTGTTCAT TTGTAAGCAC CTCACATATT TGAATAGACA AATAGAGCCC TTTCAGGCTC 240
TATTTGCTGT AATCTTAACA GCAGAAGCTA TTAAAACTAC CAGTCTTAGC ACAACCAGGA 300
GTACATAAAC TATGACTTGT AATACGTTCG TCACCTGCTG AATCATTTGA GTnATTGTTT 360
GCTTTAACTT GCACGTCTAA ATCAAGAACT TTTTCCATAG ATAAAACCTC CTATTATTAG 420
TTGAAGTTAA GGCCTACTTC AATTGTCATA TTATCCCTTT TCACACAACA AATCAATAAA 480
TTATATAATT ATAT 494
(2) INFORMATION FOR SEQ ID NO: 1221:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 148 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1221: 

ACTATTATTT TAATAGTGAC GACAGTTTTA ATTTTATTAA GTATTGCTTT AATTACCTTT 60 

ATGCAATACT ATTTCACACA AGAnACCGAA AATGCCATAA GAGAAGATGC TAGACGTATA 120 

AGTTCACTGG TCGAACAATC ACATAATA 148 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 401 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1222: 
TCGCTTTTAC ATCAAATAAA TTATCTTTAT ACAATTTCGT ATGAATTACT TCTTTATAAT 
TATAAAAATG AGGAATTTTT TGATTGTTTA ATTCACTTGC TAATTGATTA GCTTTTTGTT 
GGTCTTTAAT TGTAACGTCG TGTGGTGCAG TATATTTTAT TTCACTTGAT AAGGACGCTC 
TACTTATAGC AGCAAAGCAA AGAACTGAAA CAGTAATCGC TGAAATGATT GCCATGACCG 
TAAGTGAAAA AGCATTTTTC TTAATACGAT ACATAATAGA TGATGAGAAC ATGACATCAT 
TtACACTTAT AACACCTTkT CTAAACtTCT TCACCATTTT AAAAACTAGA GArACAGAGC 
TTTTAAAAAA TAAGnGTGCC CCAATCACAG TTGACAAnAA A 
(2) INFORMATION FOR SEQ ID NO: 1223: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1223: 
CTTGACCAAA TCCATTTTTT AACAAAAGTG GTCAAGTGAA ATCCCATCTT TCCAAATCAA 
AAAAGAGCTA AAGCAAAATG CTCTAGCCCT TGaTATTACT GATTTCCCAT CATTGTnTAG 
CGTATTAATA TTGCTTCATG TACTGATCTC TTTCCCATTC AGAGACTTGA GTTCTG 
(2) INFORMATION FOR SEQ ID NO: 1224: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 283 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1224:
AATACTTTTA CGATAATTAT AAAGGTTCAT TCAAATCTTA GGGCAAAATG TATAATATTT 
TTATGCAAAT TTAAAACAnT AACACTTATT TCAAGGTTCA ATATTTTGAG AATAAGGAGT 
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AATGCCAATC AAACTCGCTC GAAATAGGGG GAAACGAGGC nATCATTTTT GACAATGAGT 240 
GTCAATAGGG AAGAATAACA ATTAGGAATG ACTCATAGGA GGA 283 

(2) INFORMATION FOR SEQ ID NO: 1225:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 217 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1225:

AAAAGAAGAG AAGAAAGTTG AAGAACCTCA AGCACCTAAA GTTGATAACC AACAAGAGGT 60
TAAAACTACG GCTGGTAAAG CTGAAGAAAC AACACAACCA GTTGCACAAC CATTAGTAAA 120
ATTCCACAGG CCATTACAGG AATTTAAGTC GGATACCACG TGAATnACGT CAGGATGTCn 180
GCCGTTCACA TGACACGCTC TACATATTCA CCACTCG 217
(2) INFORMATION FOR SEQ ID NO: 1226:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 176 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 
(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1226: 

TGGGTACTAT TTATTACGTA AAACTTTTCG TTAGCGATGC CGGCATTGCA AGAGCAAGGT 60 

TTACAAAAGC GGACTAGGTT TGCACTTTCT GCTGTTCCAT CGCATATGGA TTAGTAAGGT 120 

CTTTATGGGn ACTGTAAGTG ATCCGAGCAA TGTCGGATAA TCTAGGTCTT GGATAG 176 




(2) INFORMATION FOR SEQ ID NO: 1227: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 400 base pairs 
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1227: 
AATGAAAATA AAGAAGATGA TTTTGTTGCA TATGGGTCTC CAGAACATAA TTATCAATTT 60 
GGTGGAAGTT TAATCAAAAG TAAAAATTTA AGCACGTTAT TAAAACCAGT ACATCAAACC 120
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GCGGCCGTTT TAATTTTGTT TTCAAGTTAA TCATACTTAC ATATAAAAAT GAAAAGTTGA 
ATTTTTAGTA CAATTTCCAC TGTATTTATC TGTTAACCAC TTTATACCCG CAATATTTCG 
TTGATCTTAT ATAAACACTT TTGGCTGCTT TTCCTTGACC TAGATTAAAA TTATAGATTT 
TCCTTGTAAG AGTGTGTTTG TAGTATCATA AATATGTAAA 
(2) INFORMATION FOR SEQ ID NO: 1228: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 150 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1228: 
ATCTAGCATG CTTTTGGAAA ACGATAAGAA TTTGGTGTTT GAATTTCATA ACCAACACCA 
GCAGTTTCAA CAACTACGTG TGTAGGATAT AAATGTGTTA CTTGACCTTT GACATACGCG
TACATTATAG GCACATCCTT ACATACTCAn
(2) INFORMATION FOR SEQ ID NO: 1229:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 138 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1229: 
TCGGTACAGT GCATTTGGTG CGTCATGGTT TACAAGGnAA AAATAAGTGA TCACTATTTA 
TCAGTATGGG AAAAAGCAAC GACGTATCAA ATGTACCATG GCTTAGCATT ATTAATTATA 
GGTGTAATTA GTGGTACA 

(2) INFORMATION FOR SEQ ID NO: 1230: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 182 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1230:
TTGGTATGTA CCTCCAATAG TAATTTCAAT AACTTTGTCT GTTGAACACT AAGAGCAATT 
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CCTTGATTAT nCTAAGAAAA GTAAAAGCAC ACGGAGTATC CTATCAGAAA ACCAGTATAC 
CA 

(2) INFORMATION FOR SEQ ID NO: 1231: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 233 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1231: 

TCTCTTTAAA TTGTACACAA TTCTGGATAA CTATCCCCAT TTCTGTGGAT AAACACCTTG 

ATGTCTTATT ACTTATCCTC GTTTTTATAT CCCATATTTA TCAACGGkTA TCGCTTTTTT 

TCATAAAAAA aCACTACCGT ACCyCTTATT TAAGAAATAC AGTAGTATTA TTTTCAATAT 

ATTTTAGCTA GCnTTTCTAC GTCGTACAGT CGCGATTAAA CCTATACCTG CTA 

(2) INFORMATION FOR SEQ ID NO: 1232: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 385 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1232:

GTTTTTGGTA AAAGTACTAA TATTAaCCAA GTATTGAGTC ATATGTTAAA ACAAGATGCG
AATGATATTG GCTTTGCTAA ATTACTACAA AATGAGAATA ATCGTATGAG TTATAACGAG
TTAATGAGTG AATGGCAATC ACATCAACGT GCATTTTTAG AAGAGTTGAG GCATGTTGAA
ATGTTAAAAG AAGAATCTAT TAGAGCATAT GATGTTTATA AAAATTGTGA GTCTTTCTCT
AAGATTGAAC AGGTTATTAA TAGTGAAAAA ACAAGTATTG AAGAACAGGT ATATCATTTA
GATAATGAAA CGTTACGAGA CAATAAAGAA ATAGAAGTTT GGGATAATCG ATTTAATTAn
ATTGnTAGCC CAATGGnCCC TTTAA







(2) INFORMATION FOR SEQ ID NO: 1233: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 180 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1233: 
CTGAAAAGAT TCAACTAATC TGAGTCAGGT TACAATCTCA TGATAACCGC TCGCACCTCG 
GAGAAAGACG CTCAAACAGT ACAGAGCTCG ACCGCAGGAT ATTTTTTGTG AAAATGAACG 
TATGAGCTGT TCAGAnGGCA CTTTATTGCG CAATAAAACG GTAAAAATCC TGAAGTGAAG 
(2) INFORMATION FOR SEQ ID NO: 1234: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 453 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1234:
ACCATTTTAA TTTnCnTTAT GACCTTGTAT GCCATTATCA aCAGTTATGA TTAATGTTAT 
GCyTTCATCA TGAGCATTTC TAAATGCTAG TTCGTTTGGT CCATATCTTC GGTAAAGCGG 
TTAGGAATAT GCCATCCTAC TTGTGCACCT AAAAGTTGTA ATGTTATCAC TAAAATTGTA 
GTTGCGGTAA CACCGTCGGC ATCGTAATCA CCATAAACTA GGATTTTCTC ATCATTCGCT
ATCGCTCTTT TAATTCTTTC AATAGTCTTA GTCATATCGC TCAATTGnAG TGCATCATGA 
TTGATATCTG TATCTGAAAT GATGGATTCT ATTGCTTGTn CATCAATAAT CGATTTACTT 
nCTAATATTT TTTTTACGAT TGGCGTTAAC TTTAATTTTG ATGTTAATTC ATCACTTATG 
TATTCAGCTG GTTTAGTTAA TTTCCACTTA TAC 
(2) INFORMATION FOR SEQ ID NO: 1235:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 163 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1235: 

CGTGATTAAG TTGGATACTA AAATTTGGGA TGTTAGAGAA TGTGATGAAG ATTTnCAGCA 

ATGTCCTGAA ATTAATGAAA TAAGAGACAT TGTTTTAAAG CGGTGGTTTG ATAGGTTTAC 

CAACTGAAGC AGTTTATGGA CTTGCAGCGA ATGCGGCAGA TGA 

(2) INFORMATION FOR SEQ ID NO: 1236: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 144 base pairs 
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(C) STRAND EDNESS : double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1236: 
GTTAACACAT CGCTTATAAC GTGTAAATAT AAACACTGTT TTATAAAAAC TTTTTATACC 
ATTACCGGCA CCGATAAATG TGTACTATGG AGCACAACAA TAACTGGAAT ACTTnGATTT 
AATCCCGCTA TAACATTTCC TAAT 
(2) INFORMATION FOR SEQ ID NO: 1237: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 204 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1237:
ATAGTTTAAG GTGATATTCA AACAATATTA GGTTTGATTT AACTATCAAG CATCTACTTT 
GTCATGTTTA ATACCAGCAA ATAATAAAAT CCATGTCACG ATAATAAATG GCATGTTAAT 
GCAGGGTAAG CCCAACCGGT CCAGCCATGG TGGTGGTACC CTAnTTGGCC nCCACCTACC 
GGTTAATTAA CCACCCCCAA AAAA 
(2) INFORMATION FOR SEQ ID NO: 1238:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 157 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1238: 
CGGTCATTTT GGTTATTnAA CAACTATACC TGAGGGGTGG TGGGTAATTC GTATTGCCGA 
TTCAGTTTTG TTAATATGCT GACCACCTGC ACCAGAAGCT CTGAATGTAT CAACTGTGAA 
TAGCATCCGG ATTGGTTTCA ATCTGCTATG GCATCAT 
(2) INFORMATION FOR SEQ ID NO: 1239:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 164 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 123 9: 
TTAAAGCATG TTCATAAATG TTTGCATCAT CATGCACGTT CTGGACCGAG CCTTCAnCAC 
AAGGTAATTT GTACAGCATA CCGGCTCTAA CTCTTCGTTG CGTATGCCTG CAATTTCTTA 
CCTCGATACT TAAGTCAATT TTCCGGGCAT AAGGGTGTTC ATTT 
(2) INFORMATION FOR SEQ ID NO: 1240: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 239 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1240: 
TTGAAAAACT GAAAACGGTG AAGTTTTACA AACAACTAAA ACAGATAAAG ATGGTAAATA 
TCAATTTACT GGATTAGAAA ATGGAACTTA TAAAGTTGAA TTGAAACACC ATCAGTTACA 
CACCACACAA GTAGTTCAGA ACTATAAGTT GATCAATGTC TCACACAGTG CATAATnGAA 
CAACATGCCG GTCTCAACAC TCACTAGGCA GAGGATCATA ACGGTCGTnG TAAGCTCGG 
(2) INFORMATION FOR SEQ ID NO: 1241:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 152 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1241: 
TGATAAATAA TAGCGCCATC ATAAGTTGCA GGGTGGTAAT CATAATTTCT ACCATGAGTC 
TTTGCCTTGA AAGACCGTGC ACCACTTTCT TTTAAATCGG TAAACGATAA TGnCAAGTTT 
AAAATAATTT GGCGTGTTGG CATTTGAGTA TC
(2) INFORMATION FOR SEQ ID NO: 1242:

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 148 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1242: 
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ATCTTTACCT TTCACTGTTC ACTTATGTAC CATAATACTT CTGACAGTTA CTAATTAACA 
GCAACTCTAA CTCATTTATA TTTAACTA 
(2) INFORMATION FOR SEQ ID NO: 1243: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 588 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1243: 
GGTACCAATC GATCGTTACA AACGTTTCGT TAAAGACGAT AAAAAAGTAC CAACAGGCAA 
TGAATATCGT GAATTAGTAT TAAAAGCAAT TCACATGATT ATGCTTGGTT TCTTGTATAA 
ATATATTGTT GCTTACTTTA TTAACACATA TGCAATCATG CCGTTACAAT TAGACTTACA 
TGGCTTTGTC AATTTGTGGT TATATATGTA CGCATACAGC TTATATTTAT TCTTTGACTT 
TGCAGGTTAT AGTTTATTTG CGATAGCATT TAGTTATTTA TTCGGTATTA AAACACCACC 
AAACTTCGAT AAACCTTTCA AAGCGAAAAA TATTAAAGAT TTCTGGAATA GATGGCATAT 
GACATTATCA TTCTGGTTCA GAGATTGTAT TTACATGAGA TCTTTATTCT ACATGTCTCG 
TAAAAAATTA TTGAAGAGTC AATTTGCAAT GTCCAACGTG GCATTCTTAA TCAACTTCTT 
CATAATGGGG ATTTGGGCAT GGGAACGAAG TGTAATACCA TTGGTTTAAG GGGTTAAACC 
ATGGCCAGCC TTGGTTAAAG GGTAnGGnCC AATAAGGACG GTGGGCGG 
(2) INFORMATION FOR SEQ ID NO: 1244: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 376 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1244: 
ATTGCTGGAG AAAGGCAAGG CTTTGCAGGC GGATTGAACT CGACATTCAC TAGTATGGGT 
AATTTCATAG GTCCTTTAAT CGCAGGTGCG TTATTTGATG TACACATTGA AGCACCAATT 
TATATGGCTA TAGGTGTTTC ATTAGCAGGT GTTGTTATTG TTTTAATTGA AAAGCAACAT 
AGAGCAAAAT TGAAAGAACA AAATATGTAG CATAAGTATT TTGGTGTATA TTGATATAAA 
GTAAAGCGTA ATATTATGAA TGATTAGCAT CGTTTTTCTT ATGAATTTTA TTAAGAAAAT 
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GTTATATGTn ACAAAA 

(2) INFORMATION FOR SEQ ID NO: 1245: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 288 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1245: 
CAATTTGTTC TAAAGCAAAT TCTTTACATT TCTCACGGAA TTCAGCTGTT GACATTTTCT 
TTCGGTCAAC ACCTTTTTTC GTTAATGGCT TGTTCAATTG GGTAAACCAT GTGGTTATCC
CCCAACCTGG GTACCGTATG GGGTGGCCAT TAGGAAACCC CTTGGCAATA GGTTTTTTAA 
TAAACCGGTA nCAAATTAAA AGGTCCTTTT TTAAAAAATT TTTGGTTTCC AAAGGGGCCA 
ATGGnTCCCC CAATTAATGG TTAAAGGTTT TAACCCCAAT TTTCCGGC 
(2) INFORMATION FOR SEQ ID NO: 1246: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 161 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1246: 
CATCATCAAT AGAACGGTAT TCGCAACGTC GTAGAAATCC CTTAAACGAC GAGAACAAAG
CAAGTCGACA TCAGTAAAAC ATTAAAGATA TTATGGCTGA AGCAGAGAAC TTAATAACTT
TTTATAAATC TATACCnTCC AAAATGTAAA ACTACCAAAA T 
(2) INFORMATION FOR SEQ ID NO: 1247: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 421 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1247: 
TCACTACGTT GTTTAATTGT GGTAATTGCA GTTTTACAGG CTTGTCTAGT GAAGATTTAA 
AACTTAATTC AAGTGTTTTA GTCATGAATA TGTCCTCfCTG ATTAAATTGA TAAAGATTTG 
TTAATTTGGT CGTTTGATGC ATCAGGGTTA ATGTTAGCGA ATCGACGCTT AAATTCTGTT
TGTTTGCCGT TAGCGTCtAC TTTAGTAAAT GaTAATACAA TAGTGATGTG GTTTATTTtA 
CTCATATTTT AAAACCTCCT TTCACACTAT ATATCGAAAC AAAATAATAA AATGGCTAAT
TTTATTTTCT ATGTTTAAAA TCTATAAAAn AGGCAATAGA TATGTGTAAC TAAAATATAn 
G 

(2) INFORMATION FOR SEQ ID NO: 1248:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 409 base pairs
(B) TYPE: nucleic acid

(C) STRANDEDNESS: double

(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1248:
ATAAGCTGCT GTTGATAATC CAAGTGCTAA AGTTAAATAT AATATAGTGC CAAAAATTTT 
ATTCTTCTAT AAAAACACCT CTTCATATTT AATCCATATC TCTTTTTCGT TATTTATACA 
GAATTTTCTG AAAATAGTCA AGGTTAATTT ATTTGCTGCA AATTTCAACC ACTTTATGTT 
AAATGGACAT CAAAGTATAA TTGTATTATT ATTAAATTGT GCAGTGTATA AATAGACGAT 
ATGCAGGAAC ATGTCGCCTA TTGAGCCCGT TAAAAAGACG GTGACTAAAT GAGATTTTCT 
TTAACCATCA TTCGTTGTCA AAGTTTGGAA ATgATGGTTG TTTTTTATTG TTTAAATTAA 
TCATTGCGAC TTCTATATCA ATGAAAGTTA TCTTAATATA TGAACTTCA 
(2) INFORMATION FOR SEQ ID NO: 1249:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 217 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double
(D) TOPOLOGY: linear 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1249:
CTTTCAATTA CGAACAAATC ATTACTAGTT TATTTAATGA TAATGACTGn AATTTAGCAA 
AAAACGTAAC ATGATCATTT AGCTTGAATT GGTAAAACTG TTTATCCAAC ATTTCCAGAC 
CAAATCACAC CGGTAAAACG TCAAAATATA TTGTnTCTAT GCCCGTTTTT ACCAGTTATC
TCATTGCAGG TATATTGATG AATTCACAAG TTGGTAT
(2) INFORMATION FOR SEQ ID NO: 1250: 

(i) SEQUENCE CHARACTERISTICS:

(A) LENGTH: 176 base pairs

(B) TYPE: nucleic acid

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1250:
CAGAACGTGC GGATCGTTTT TATGCAACCT AAACCAGGTA CGGATTTAGC GTGGTTAAGT 60

GGCAGTGCAC TAAGTATATT CATTGATCCT GCATTTACAC GATGAAAGCA TTTATTGCAT 120 
GAGTGGGTAG ATGCATTTTG CTGGAATGAT TCACAAATGC ATTAGAGnAC ATTTAC 176 
(2) INFORMATION FOR SEQ ID NO: 1251: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 212 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double

(D) TOPOLOGY: linear



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1251: 

TTTCAACTTT CTAAATTATA ATATATCTTT TTAAAATAAG CTAGAATTTC TATATAATAA 60 

ATGTTAATAA CGAAAAGGGA TGATGCATAG TGATCGCAAG CACGCCAGAG GCCGATTTGA 120 

TATTGCGAGT TAGTTTATAT GGTTTGGGAT GATATGGATA GAATTGGTAA ACATCTCCTA 180

AAGCATGGTA TAGTGCAATT nAAAAnCTGT GT 212 
(2) INFORMATION FOR SEQ ID NO: 1252: 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 131 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 1252: 
CAGTAAGTCA ATTGAAATGG CTAATTTATT AAATCTATAA AAGCATAGAG CACTGTTGTG 60 
AGTTCATAAT CAAAGATTTA TAAATGTnTC AGACTGCCAA TATAACATTT AGGACCTAGA 120 
ACATTGATTA T 131 
(2) INFORMATION FOR SEQ ID NO: 1253:

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 462 base pairs 

(B) TYPE: nucleic acid 
(C) STRANDEDNESS: double